TY - GEN
T1 - SHAP-based Interpretable Models for Credit Default Assessment Using Machine Learning
AU - Xu, Qingyang
AU - Liao, Yunlong
AU - Li, Qiutong
AU - Zhang, Jiaqi
AU - Song, Zhilan
AU - Wang, Linjun
AU - Yuan, Xiaochen
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In recent years, the issue of credit fraud risk has garnered increased attention from the banking and financial sectors. However, prevailing credit assessment models predominantly focus on predictive outcomes, often overlooking the imperative of model interpretability. Understanding the contributions of model features and their interactions is paramount for elucidating model behavior and furnishing vital insights for model enhancement and optimization. To address this gap, this paper proposes a machine learning model leveraging SHAP for explicating credit assessment. Utilizing publicly available datasets from Lending Club, this study validates the proposed model against four industry-standard machine learning approaches: SVM, MLP, XGBoost, and LightGBM. Experimental findings unveil feature importance rankings and elucidate relationships between features and target variables. Notably, the study identifies the predominant roles of loan interest rates and credit policies in credit fraud assessment within the dataset and endeavors to uncover interactions within individual key features. The SHAP framework, as demonstrated, holds promise for informing the design and construction of future credit risk assessment models, thereby bolstering support for financial decision-making and risk management endeavors.
AB - In recent years, the issue of credit fraud risk has garnered increased attention from the banking and financial sectors. However, prevailing credit assessment models predominantly focus on predictive outcomes, often overlooking the imperative of model interpretability. Understanding the contributions of model features and their interactions is paramount for elucidating model behavior and furnishing vital insights for model enhancement and optimization. To address this gap, this paper proposes a machine learning model leveraging SHAP for explicating credit assessment. Utilizing publicly available datasets from Lending Club, this study validates the proposed model against four industry-standard machine learning approaches: SVM, MLP, XGBoost, and LightGBM. Experimental findings unveil feature importance rankings and elucidate relationships between features and target variables. Notably, the study identifies the predominant roles of loan interest rates and credit policies in credit fraud assessment within the dataset and endeavors to uncover interactions within individual key features. The SHAP framework, as demonstrated, holds promise for informing the design and construction of future credit risk assessment models, thereby bolstering support for financial decision-making and risk management endeavors.
KW - credit assessment model
KW - interpretability
KW - machine learning
KW - SHAP
UR - http://www.scopus.com/inward/record.url?scp=85217783334&partnerID=8YFLogxK
U2 - 10.1109/ICSTE63875.2024.00044
DO - 10.1109/ICSTE63875.2024.00044
M3 - Conference contribution
AN - SCOPUS:85217783334
T3 - Proceedings - 2024 14th International Conference on Software Technology and Engineering, ICSTE 2024
SP - 213
EP - 217
BT - Proceedings - 2024 14th International Conference on Software Technology and Engineering, ICSTE 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 14th International Conference on Software Technology and Engineering, ICSTE 2024
Y2 - 16 August 2024 through 18 August 2024
ER -