Using Small Business Banking Data for Explainable Credit Risk Scoring
Abstract
Machine learning applied to financial transaction records can predict how likely a small business is to repay a loan. For this purpose we compared a traditional scorecard credit risk model against various machine learning models and found that XGBoost with monotonic constraints outperformed scorecard model by 7% in K-S statistic. To deploy such a machine learning model in production for loan application risk scoring it must comply with lending industry regulations that require lenders to provide understandable and specific reasons for credit decisions. Thus we also developed a loan decision explanation technique based on the ideas of WoE and SHAP. Our research was carried out using a historical dataset of tens of thousands of loans and millions of associated financial transactions. The credit risk scoring model based on XGBoost with monotonic constraints and SHAP explanations described in this paper have been deployed by QuickBooks Capital to assess incoming loan applications since July 2019.
Cite
Text
Wang et al. "Using Small Business Banking Data for Explainable Credit Risk Scoring." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I08.7055Markdown
[Wang et al. "Using Small Business Banking Data for Explainable Credit Risk Scoring." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/wang2020aaai-using/) doi:10.1609/AAAI.V34I08.7055BibTeX
@inproceedings{wang2020aaai-using,
title = {{Using Small Business Banking Data for Explainable Credit Risk Scoring}},
author = {Wang, Wei and Lesner, Christopher and Ran, Alexander and Rukonic, Marko and Xue, Jason and Shiu, Eric},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2020},
pages = {13396-13401},
doi = {10.1609/AAAI.V34I08.7055},
url = {https://mlanthology.org/aaai/2020/wang2020aaai-using/}
}