TY - GEN
T1 - A Systematic Literature Review on Performance Prediction in Learning Programming Using Educational Data Mining
AU - Choi, Wan Chong
AU - Lam, Chan Tong
AU - Mendes, António José
N1 - Publisher Copyright: © 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Programming has become an essential skill for the digital generation, but learning it presents a unique set of challenges that can be difficult for beginners. Educational data mining (EDM) has been increasingly utilized in programming education to enhance learning outcomes and understand students' learning behavior. By collecting and analyzing data from various sources, such as students' learning activities, interactions with learning resources, and assessment results, EDM can provide valuable insights into students' learning performance and potential areas for improvement. This paper presents a systematic review of recent literature (last five years) and reports on the state of the art and trends in using EDM for student performance prediction in programming courses. It provides a comprehensive analysis of the input data used in previous work, exploring the different types of datasets used and the features that affect student performance. It also addresses the predictive objectives and target variables for performance prediction in programming courses, and explores the most common prediction approaches, data pre-processing procedures, cross-validation methods, and evaluation metrics used to describe the performance of prediction algorithms. Finally, it discusses the limitations and challenges of various prediction approaches and provides insights and directions for future research.
KW - Computer science
KW - Educational data mining
KW - Performance prediction
KW - Programming Education
KW - Systematic review
UR - http://www.scopus.com/inward/record.url?scp=85183052794&partnerID=8YFLogxK
U2 - 10.1109/FIE58773.2023.10343346
DO - 10.1109/FIE58773.2023.10343346
M3 - Conference contribution
AN - SCOPUS:85183052794
T3 - Proceedings - Frontiers in Education Conference, FIE
BT - 2023 IEEE Frontiers in Education Conference, FIE 2023 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 53rd IEEE ASEE Frontiers in Education International Conference, FIE 2023
Y2 - 18 October 2023 through 21 October 2023
ER -