Analyzing the Interpretability of Machine Learning Prediction on Student Performance Using SHapley Additive exPlanations

Wan Chong Choi, Chan Tong Lam, Antonio Jose Mendes

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study compared several machine learning algorithms to predict student programming learning performance in an online learning environment. It used Explainable Machine Learning (EML) techniques to enhance interpretability. A range of algorithms, including Random Forest, Extra Trees, CatBoost, XGBoost, Naive Bayes, and KNearest Neighbors (KNN), were compared, with Extra Trees delivering the best results. Distinct from other EDM research mainly focused on predictive efficiency, we contributed by using the EML technique of SHapley Additive Explanations (SHAP), rooted in the Game Theory framework, to enhance model interpretability at both global and individual levels. At the global level, summary plots showed overall feature impacts, bar plots quantified the average effect of each feature, and dependence plots highlighted specific relationships. At the individual level, force plots identified critical features for individual predictions, decision plots traced the cumulative impact of features from the base value to the final output, and waterfall plots provided a breakdown of predictions. This study contributes to EDM by offering accurate predictive models and detailed interpretability, helping educational stakeholders make data-informed decisions to improve student outcomes.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Teaching, Assessment and Learning for Engineering, TALE 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350376234
DOIs
Publication statusPublished - 2024
Event13th IEEE International Conference on Teaching, Assessment and Learning for Engineering, TALE 2024 - Bengaluru, India
Duration: 9 Dec 202412 Dec 2024

Publication series

Name2024 IEEE International Conference on Teaching, Assessment and Learning for Engineering, TALE 2024 - Proceedings

Conference

Conference13th IEEE International Conference on Teaching, Assessment and Learning for Engineering, TALE 2024
Country/TerritoryIndia
CityBengaluru
Period9/12/2412/12/24

Keywords

  • Educational data mining
  • Explainable machine learning
  • Learning performance prediction
  • SHapley Additive exPlanations

Fingerprint

Dive into the research topics of 'Analyzing the Interpretability of Machine Learning Prediction on Student Performance Using SHapley Additive exPlanations'. Together they form a unique fingerprint.

Cite this