TY - GEN
T1 - Enhance Learning Performance Predictions with Explainable Machine Learning
AU - Choi, Wan Chong
AU - Lam, Chan Tong
AU - Mendes, António José
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This Research Full Paper focuses on predicting learning performance using machine learning algorithms and interpreting the results using Explainable Machine Learning (EML) techniques. The study compared a comprehensive set of machine learning algorithms, including Logistic Regression, Decision Trees, AdaBoost, XGBoost, SVM, and KNN. The performance of these algorithms in predicting students' final grades in a course was accessed using various evaluation metrics. Our study used feature selection to identify the most relevant predictors to enhance predictive accuracy, implemented the Synthetic Minority Over-sampling Technique (SMOTE) to address class imbalance, and performed hyperparameter optimization to find the most effective model settings. This comprehensive approach improved the predictive accuracy of our models over previous studies. Additionally, the importance of early prediction in identifying at-risk students was explored, with models demonstrating promising accuracy at the first checkpoint of the course. Departing from traditional machine learning research that often focused on model performance, our study integrated the EML technique of Shapley Additive exPlanations (SHAP), which is grounded on the theoretical framework of Game Theory, to facilitate the interpretation of the predictive outcomes. This approach offered an explanatory perspective on the key factors influencing model decisions. By contributing to the predictability and interpretability of student performance, this research enriched the field of Educational Data Mining (EDM) and enhanced the understanding of student learning trajectories.
AB - This Research Full Paper focuses on predicting learning performance using machine learning algorithms and interpreting the results using Explainable Machine Learning (EML) techniques. The study compared a comprehensive set of machine learning algorithms, including Logistic Regression, Decision Trees, AdaBoost, XGBoost, SVM, and KNN. The performance of these algorithms in predicting students' final grades in a course was accessed using various evaluation metrics. Our study used feature selection to identify the most relevant predictors to enhance predictive accuracy, implemented the Synthetic Minority Over-sampling Technique (SMOTE) to address class imbalance, and performed hyperparameter optimization to find the most effective model settings. This comprehensive approach improved the predictive accuracy of our models over previous studies. Additionally, the importance of early prediction in identifying at-risk students was explored, with models demonstrating promising accuracy at the first checkpoint of the course. Departing from traditional machine learning research that often focused on model performance, our study integrated the EML technique of Shapley Additive exPlanations (SHAP), which is grounded on the theoretical framework of Game Theory, to facilitate the interpretation of the predictive outcomes. This approach offered an explanatory perspective on the key factors influencing model decisions. By contributing to the predictability and interpretability of student performance, this research enriched the field of Educational Data Mining (EDM) and enhanced the understanding of student learning trajectories.
KW - EDM
KW - Educational data mining
KW - Explainable machine learning
KW - Learning performance prediction
KW - SHapley Additive exPlanations
UR - http://www.scopus.com/inward/record.url?scp=105000625910&partnerID=8YFLogxK
U2 - 10.1109/FIE61694.2024.10893036
DO - 10.1109/FIE61694.2024.10893036
M3 - Conference contribution
AN - SCOPUS:105000625910
T3 - Proceedings - Frontiers in Education Conference, FIE
BT - 2024 IEEE Frontiers in Education Conference, FIE 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 54th IEEE Frontiers in Education Conference, FIE 2024
Y2 - 13 October 2024 through 16 October 2024
ER -