TY - JOUR
T1 - Machine learning-driven credit risk
T2 - a systemic review
AU - Shi, Si
AU - Tse, Rita
AU - Luo, Wuman
AU - D’Addona, Stefano
AU - Pau, Giovanni
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2022/9
Y1 - 2022/9
N2 - Credit risk assessment is at the core of modern economies. Traditionally, it is measured by statistical methods and manual auditing. Recent advances in financial artificial intelligence stemmed from a new wave of machine learning (ML)-driven credit risk models that gained tremendous attention from both industry and academia. In this paper, we systematically review a series of major research contributions (76 papers) over the past eight years using statistical, machine learning and deep learning techniques to address the problems of credit risk. Specifically, we propose a novel classification methodology for ML-driven credit risk algorithms and their performance ranking using public datasets. We further discuss the challenges including data imbalance, dataset inconsistency, model transparency, and inadequate utilization of deep learning models. The results of our review show that: 1) most deep learning models outperform classic machine learning and statistical algorithms in credit risk estimation, and 2) ensemble methods provide higher accuracy compared with single models. Finally, we present summary tables in terms of datasets and proposed models.
AB - Credit risk assessment is at the core of modern economies. Traditionally, it is measured by statistical methods and manual auditing. Recent advances in financial artificial intelligence stemmed from a new wave of machine learning (ML)-driven credit risk models that gained tremendous attention from both industry and academia. In this paper, we systematically review a series of major research contributions (76 papers) over the past eight years using statistical, machine learning and deep learning techniques to address the problems of credit risk. Specifically, we propose a novel classification methodology for ML-driven credit risk algorithms and their performance ranking using public datasets. We further discuss the challenges including data imbalance, dataset inconsistency, model transparency, and inadequate utilization of deep learning models. The results of our review show that: 1) most deep learning models outperform classic machine learning and statistical algorithms in credit risk estimation, and 2) ensemble methods provide higher accuracy compared with single models. Finally, we present summary tables in terms of datasets and proposed models.
KW - Credit risk
KW - Deep learning
KW - Machine learning
KW - Statistical learning
UR - http://www.scopus.com/inward/record.url?scp=85134550089&partnerID=8YFLogxK
U2 - 10.1007/s00521-022-07472-2
DO - 10.1007/s00521-022-07472-2
M3 - Review article
AN - SCOPUS:85134550089
SN - 0941-0643
VL - 34
SP - 14327
EP - 14339
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 17
ER -