TY - GEN
T1 - Predicting European top 5 league football match results based on EA series football video game data and betting odds
AU - Su, Jiasheng
AU - Wong, Dennis
AU - Wang, Linjun
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2025/5/9
Y1 - 2025/5/9
N2 - This paper introduces a new dataset that utilizes EA football video game data to predict actual game outcomes. The dataset includes real match records from five seasons across Europe’s five biggest leagues, as well as player ratings in seven categories (attack, skill, movement, power, mentality, defense, goalkeeping) and 34 skill scores. In addition, actual betting odds from various bookmakers are included for match predictions. By utilizing this dataset, the XGBoost, Random Forest, CNN, LSTM, and SVM models employed in our study outperformed the baseline model, Which composed of odds and achieving accuracy rates of 52.6% for match outcome prediction and 59.6% for over/under total 2.5 goals scored predictions. Among these models, Random Forest demonstrated the best performance, with accuracy rates of 55.9% and 62.7% for match outcome and over/under total 2.5 goals scored predictions, respectively, representing improvements of 6.3% and 5.2% over the baseline models.
AB - This paper introduces a new dataset that utilizes EA football video game data to predict actual game outcomes. The dataset includes real match records from five seasons across Europe’s five biggest leagues, as well as player ratings in seven categories (attack, skill, movement, power, mentality, defense, goalkeeping) and 34 skill scores. In addition, actual betting odds from various bookmakers are included for match predictions. By utilizing this dataset, the XGBoost, Random Forest, CNN, LSTM, and SVM models employed in our study outperformed the baseline model, Which composed of odds and achieving accuracy rates of 52.6% for match outcome prediction and 59.6% for over/under total 2.5 goals scored predictions. Among these models, Random Forest demonstrated the best performance, with accuracy rates of 55.9% and 62.7% for match outcome and over/under total 2.5 goals scored predictions, respectively, representing improvements of 6.3% and 5.2% over the baseline models.
KW - Dataset
KW - Football Prediction
KW - Machine learning
UR - https://www.scopus.com/pages/publications/105010764199
U2 - 10.1145/3723936.3723980
DO - 10.1145/3723936.3723980
M3 - Conference contribution
AN - SCOPUS:105010764199
T3 - Proceedings of 2024 International Conference on Sports Technology and Performance Analysis, ICSTPA 2024
SP - 285
EP - 291
BT - Proceedings of 2024 International Conference on Sports Technology and Performance Analysis, ICSTPA 2024
PB - Association for Computing Machinery, Inc
T2 - 2024 International Conference on Sports Technology and Performance Analysis, ICSTPA 2024
Y2 - 13 December 2024 through 15 December 2024
ER -