Abstract
Default risk prediction presents a significant challenge for micro and small enterprises due to the unavailability of comprehensive information databases. This paper develops a default risk management tool based on user portrait theory, utilizing common and objective indicators of micro and small enterprises, such as basic information about entrepreneurs, enterprises, and loans. The Shapley Additive exPlanations (SHAP) method is employed to analyze the contribution of each indicator to default prediction. Empirical results show that household income and personal income are the two most important variables in general, with higher household income associated with a lower probability of default. However, a higher personal income is associated with a higher probability of default. Moreover, the importance of variables and the direction of their relationship with default prediction vary across samples. These findings provide significant insights for developing an accurate default prediction warning system for financial institution managers and policymakers, using the proposed methodology and technical framework.




Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The data that support the findings of this study are available on request from the corresponding author, [Yang Cai, 1558325442@qq.com],upon reasonable request.
References
Aas K, Jullum M, Løland A (2021) Explaining individual predictions when features are dependent: more accurate approximations to Shapley values. Artif Intell 298:103502
Abdullah F, Ward R (2016) Developing a General Extended Technology Acceptance Model for E-Learning (GETAMEL) by analysing commonly used external factors. Comput Hum Behav 56:238–256
Abedin MZ, Hajek P, Sharif T, Satu MS, Khan MI (2023) Modelling bank customer behaviour using feature engineering and classification techniques. Res Int Bus Financ 65:101913
Alfaro R, Gallardo N (2012) The determinants of household debt default. Revista De Análisis Económico-Economic Analysis Review 27(1):27–54
Altman EI (1968) Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. J Financ 23(4):589–609
Baklouti I (2014) A psychological approach to microfinance credit scoring via a classification and regression tree. Intell Syst Account Finance Manag 21(4):193–208
Bartels LM (2002) Beyond the running tally: Partisan bias in political perceptions. Polit Behav 24(2):117–150
Beaver W H (1966) Financial ratios as predictors of failure. J Account Res: 71–111.
Beck AT (2008) The evolution of the cognitive model of depression and its neurobiological correlates. Am J Psychiatry 165(8):969–977
Berger AN, Udell GF (1998) The economics of small business finance: the roles of private equity and debt markets in the financial growth cycle. J Bank Finance 22(6–8):613–673
Bussmann N, Giudici P, Marinelli D (2021) Explainable machine learning in credit risk management. Comput Econ 57(1):203–216
Çallı BA, Coşkun E (2021) A longitudinal systematic review of credit risk assessment and credit default predictors. SAGE Open 11(4):21582440211061332
Carrington AM, Fieguth PW, Qazi H (2020) A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms. BMC Med Inform Decis Mak 20:1–12
Carrington AM, Manuel DG, Fieguth PW, Ramsay T, Osmani V, Wernly B, Holzinger A (2022) Deep ROC analysis and AUC as balanced average accuracy, for improved classifier selection, audit and explanation. IEEE Trans Pattern Anal Mach Intell 45(1): 329–341.
Cheung ACK, Slavin RE (2013) The effectiveness of educational technology applications for enhancing mathematics achievement in K-12 classrooms: a meta-analysis. Educ Res Rev 9:88–113
Ciampi F, Gordini N (2013) Small enterprise default prediction modeling through artificial neural networks: an empirical analysis of i talian small enterprises. J Small Bus Manage 51(1):23–45
Croxson K, Frost J, Gambacorta L, et al. (2021) Platform-based business models and financial inclusion. BIS Paper, forthcoming.
Divino JA, Rocha LCS (2013) Probability of default in collateralized credit operations. North Am J Econ Finance 25:276–292
Dumitrescu E, Hué S, Hurlin C (2022) Machine learning for credit scoring: improving logistic regression with non-linear decision-tree effects. Eur J Oper Res 297(3):1178–1192
Durand D (1941) Risk elements in consumer installment financing. National Bureau of Economic Research, New York
Eid N, Maltby J, Talavera O (2016) Income Rounding and Loan Performance in the Peer-to-Peer Market. Mpra Paper: 1–38.
Fernandes GB, Artes R (2016) Spatial dependence in credit risk and its improvement in credit scoring. Eur J Oper Res 249(2):517–524
Ganzach Y, Amar M (2017) Intelligence and the repayment of high-and low-consequences debt. Personality Individ Differ 110:102–108
Giudici P, Raffinetti E (2021) Shapley-Lorenz eXplainable artificial intelligence. Expert Syst Appl 167:114104
Hermanto B, Gunawidjaja S (2010) Default Prediction Model for SME’s: Evidence from Indonesian Market Using Financial Ratios. Universitas Indonesia, Graduate School of Management Research Paper, (13–04).
Holzinger A, Saranti A, Molnar C (2020) Explainable AI methods-a brief overview[C] AI-Beyond Explainable AI: International Workshop, Held in Conjunction with ICML, July 18, Vienna, Austria. Springer International Publishing, Revised and Extended Papers. Cham, pp 13–38
Hyytinen A, Pajarinen M (2008) Opacity of young businesses: evidence from rating disagreements. J Bank Finance 32(7):1234–1241
Jordan MI, Mitchell TM (2015) Machine learning: Trends, perspectives, and prospects. Science 349(6245):255–260
Joseph, A. (2019). Shapley regressions: A framework for statistical inference on machine learning models.
Ke G, Meng Q, Finley T (2017) Lightgbm: a highly efficient gradient boosting decision tree. Adv Neural Inf Process Syst 30:3146–3154
Sihem Khemakhem, Younés Boujelbène (2015) Credit risk prediction: a comparative study between discriminant analysis and the neural network Approach.
Leonard K (1995) The development of a rule based expert system model for fraud alert in consumer credit. Eur J Oper Res 80(2):350–356
Liang D, Lu CC, Tsai CF (2016) Financial ratios and corporate governance indicators in bankruptcy prediction: a comprehensive study[J]. Eur J Oper Res 252(2):561–572
Lin M, Prabhala NR, Viswanathan S (2013) Judging borrowers by the company they keep: friendship networks and information asymmetry in online peer-to-peer lending. Manage Sci 59(1):17–35
Lin X, Li X, Zheng Z (2017) Evaluating borrower’s default risk in peer-to-peer lending: evidence from a lending platform in China. Appl Econ 49(35):3538–3545
Lundberg SM, Nair B, Vavilala MS (2018) Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng 2(10):749–760
Lundberg SM, Erion G, Chen H (2020) From local explanations to global understanding with explainable AI for trees. Nat Mach Intell 2(1):56–67
Lundberg S M, Lee S I (2017) A unified approach to interpreting model predictions[C]//Proceedings of the 31st international conference on neural information processing systems: 4768–4777.
Martinez-Sola C, Garcia-Teruel PJ, Martinez-Solano P (2017) SMEs access to finance and the value of supplier financing. Spanish J Finance Account/revista Espanola De Financiacion y Contabilidad 46(4):455–483
Mester LJ, Nakamura LI, Renault M (2007) Transactions accounts and loan monitoring. Rev Financial Stud 20(3):529–556
Molnar C (2020) Interpretable machine learning. Lulu. Com.
Moscatelli M, Parlapiano F, Narizzano S (2020) Corporate default forecasting with machine learning. Expert Syst Appl 161:113567
Moya-Carvajal J, Pérez-Galarce F, Taramasco C (2023) ML models for severity classification and length-of-stay forecasting in emergency units. Expert Systems with Applications: 119864.
Ochoa J G D, Maier L, Csiszar O (2021) Bayesian logical neural networks for human-centered applications in medicine. medRxiv.
Ohlson J A (1980) Financial ratios and the probabilistic prediction of bankruptcy. J Account Res: 109–131.
Oni OA, Oladele OI, Oyewole IK (2006) Analysis of factors influencing loan default among poultry farmers in ogun state nigeria. J Central Euro Agricult 6(4):619–624
Petropoulos A, Chatzis SP, Xanthopoulos S (2016) A novel corporate credit rating system based on Student’st hidden Markov models. Expert Syst Appl 53:87–105
Puri M, Rocholl J, Steffen S (2017) What do a million observations have to say about loan defaults? Opening the black box of relationships. J Financial Intermediation 31:1–15
Shapley LS (1953) Stochastic games. Proc Natl Acad Sci 39(10):1095–1100
Sohn SY, Kim DH, Yoon JH (2016) Technology credit scoring model with fuzzy logistic regression. Appl Soft Comput 43:150–158
Strumbelj E, Kononenko I (2010) An efficient explanation of individual classifications using game theory. J Mach Learn Res 11:1–18
Su M, Cheng D, Xu Y, Weng F (2023) An improved BERT method for the evolution of network public opinion of major infectious diseases: case study of COVID-19. Expert Syst Appl 233:120938
Torres-Tello J, Ko S B (2021) Interpretability of artificial intelligence models that use data fusion to predict yield in aeroponics J Ambient Intell Human Comput: 1–12.
Wahyudi I (2014) Default risk analysis in micro, small and medium enterprises: Does debt overhang theory occur? Asian Acad Manage J Account Finance 10(1):95–131
Weng F, Zhang H, Yang C (2021) Volatility forecasting of crude oil futures based on a genetic algorithm regularization online extreme learning machine with a forgetting factor: the role of news during the COVID-19 pandemic. Resour Policy 73:102148
Weng F, Zhu J, Yang C (2022) Analysis of financial pressure impacts on the health care industry with an explainable machine learning method: China versus the USA. Expert Syst Appl 210:118482
Xiao Y, Zhai H, Friederici AD (2016) The development of the intrinsic functional connectivity of default network subsystems from age 3 to 5. Brain Imaging Behav 10(1):50–59
Yang C, Abedin MZ, Zhang H, Weng F, Hajek P (2023) An interpretable system for predicting the impact of COVID-19 government interventions on stock market sectors. Ann Oper Res, 1–28.
Yoshino N, Taghizadeh-Hesary F (2014) Monetary policy and oil price fluctuations following the subprime mortgage crisis. Int J Monetary Econ Finance 7(3):157–174
Zhang L, Wang J, Liu Z (2023) What should lenders be more concerned about? Developing a profit-driven loan default prediction model. Expert Syst Appl 213:118938
Acknowledgements
We are grateful to the reviewers and the editor for their helpful comments and suggestions. This work was supported by the National Office for Philosophy and Social Sciences of China under Grant 23BTJ044.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zheng, C., Weng, F., Luo, Y. et al. Micro and small enterprises default risk portrait: evidence from explainable machine learning method. J Ambient Intell Human Comput 15, 661–671 (2024). https://doi.org/10.1007/s12652-023-04722-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-023-04722-6