Skip to main content
Log in

Micro and small enterprises default risk portrait: evidence from explainable machine learning method

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

Default risk prediction presents a significant challenge for micro and small enterprises due to the unavailability of comprehensive information databases. This paper develops a default risk management tool based on user portrait theory, utilizing common and objective indicators of micro and small enterprises, such as basic information about entrepreneurs, enterprises, and loans. The Shapley Additive exPlanations (SHAP) method is employed to analyze the contribution of each indicator to default prediction. Empirical results show that household income and personal income are the two most important variables in general, with higher household income associated with a lower probability of default. However, a higher personal income is associated with a higher probability of default. Moreover, the importance of variables and the direction of their relationship with default prediction vary across samples. These findings provide significant insights for developing an accurate default prediction warning system for financial institution managers and policymakers, using the proposed methodology and technical framework.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The data that support the findings of this study are available on request from the corresponding author, [Yang Cai, 1558325442@qq.com],upon reasonable request.

References

  • Aas K, Jullum M, Løland A (2021) Explaining individual predictions when features are dependent: more accurate approximations to Shapley values. Artif Intell 298:103502

    Article  MathSciNet  Google Scholar 

  • Abdullah F, Ward R (2016) Developing a General Extended Technology Acceptance Model for E-Learning (GETAMEL) by analysing commonly used external factors. Comput Hum Behav 56:238–256

    Article  Google Scholar 

  • Abedin MZ, Hajek P, Sharif T, Satu MS, Khan MI (2023) Modelling bank customer behaviour using feature engineering and classification techniques. Res Int Bus Financ 65:101913

    Article  Google Scholar 

  • Alfaro R, Gallardo N (2012) The determinants of household debt default. Revista De Análisis Económico-Economic Analysis Review 27(1):27–54

    Google Scholar 

  • Altman EI (1968) Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. J Financ 23(4):589–609

    Article  Google Scholar 

  • Baklouti I (2014) A psychological approach to microfinance credit scoring via a classification and regression tree. Intell Syst Account Finance Manag 21(4):193–208

    Article  Google Scholar 

  • Bartels LM (2002) Beyond the running tally: Partisan bias in political perceptions. Polit Behav 24(2):117–150

    Article  Google Scholar 

  • Beaver W H (1966) Financial ratios as predictors of failure. J Account Res: 71–111.

  • Beck AT (2008) The evolution of the cognitive model of depression and its neurobiological correlates. Am J Psychiatry 165(8):969–977

    Article  PubMed  Google Scholar 

  • Berger AN, Udell GF (1998) The economics of small business finance: the roles of private equity and debt markets in the financial growth cycle. J Bank Finance 22(6–8):613–673

    Article  Google Scholar 

  • Bussmann N, Giudici P, Marinelli D (2021) Explainable machine learning in credit risk management. Comput Econ 57(1):203–216

    Article  Google Scholar 

  • Çallı BA, Coşkun E (2021) A longitudinal systematic review of credit risk assessment and credit default predictors. SAGE Open 11(4):21582440211061332

    Article  Google Scholar 

  • Carrington AM, Fieguth PW, Qazi H (2020) A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms. BMC Med Inform Decis Mak 20:1–12

    Article  Google Scholar 

  • Carrington AM, Manuel DG, Fieguth PW, Ramsay T, Osmani V, Wernly B, Holzinger A (2022) Deep ROC analysis and AUC as balanced average accuracy, for improved classifier selection, audit and explanation. IEEE Trans Pattern Anal Mach Intell 45(1): 329–341.

  • Cheung ACK, Slavin RE (2013) The effectiveness of educational technology applications for enhancing mathematics achievement in K-12 classrooms: a meta-analysis. Educ Res Rev 9:88–113

    Article  Google Scholar 

  • Ciampi F, Gordini N (2013) Small enterprise default prediction modeling through artificial neural networks: an empirical analysis of i talian small enterprises. J Small Bus Manage 51(1):23–45

    Article  Google Scholar 

  • Croxson K, Frost J, Gambacorta L, et al. (2021) Platform-based business models and financial inclusion. BIS Paper, forthcoming.

  • Divino JA, Rocha LCS (2013) Probability of default in collateralized credit operations. North Am J Econ Finance 25:276–292

    Article  Google Scholar 

  • Dumitrescu E, Hué S, Hurlin C (2022) Machine learning for credit scoring: improving logistic regression with non-linear decision-tree effects. Eur J Oper Res 297(3):1178–1192

    Article  MathSciNet  Google Scholar 

  • Durand D (1941) Risk elements in consumer installment financing. National Bureau of Economic Research, New York

    Google Scholar 

  • Eid N, Maltby J, Talavera O (2016) Income Rounding and Loan Performance in the Peer-to-Peer Market. Mpra Paper: 1–38.

  • Fernandes GB, Artes R (2016) Spatial dependence in credit risk and its improvement in credit scoring. Eur J Oper Res 249(2):517–524

    Article  MathSciNet  Google Scholar 

  • Ganzach Y, Amar M (2017) Intelligence and the repayment of high-and low-consequences debt. Personality Individ Differ 110:102–108

    Article  Google Scholar 

  • Giudici P, Raffinetti E (2021) Shapley-Lorenz eXplainable artificial intelligence. Expert Syst Appl 167:114104

    Article  Google Scholar 

  • Hermanto B, Gunawidjaja S (2010) Default Prediction Model for SME’s: Evidence from Indonesian Market Using Financial Ratios. Universitas Indonesia, Graduate School of Management Research Paper, (13–04).

  • Holzinger A, Saranti A, Molnar C (2020) Explainable AI methods-a brief overview[C] AI-Beyond Explainable AI: International Workshop, Held in Conjunction with ICML, July 18, Vienna, Austria. Springer International Publishing, Revised and Extended Papers. Cham, pp 13–38

    Google Scholar 

  • Hyytinen A, Pajarinen M (2008) Opacity of young businesses: evidence from rating disagreements. J Bank Finance 32(7):1234–1241

    Article  Google Scholar 

  • Jordan MI, Mitchell TM (2015) Machine learning: Trends, perspectives, and prospects. Science 349(6245):255–260

    Article  MathSciNet  CAS  PubMed  ADS  Google Scholar 

  • Joseph, A. (2019). Shapley regressions: A framework for statistical inference on machine learning models.

  • Ke G, Meng Q, Finley T (2017) Lightgbm: a highly efficient gradient boosting decision tree. Adv Neural Inf Process Syst 30:3146–3154

    Google Scholar 

  • Sihem Khemakhem, Younés Boujelbène (2015) Credit risk prediction: a comparative study between discriminant analysis and the neural network Approach.

  • Leonard K (1995) The development of a rule based expert system model for fraud alert in consumer credit. Eur J Oper Res 80(2):350–356

    Article  Google Scholar 

  • Liang D, Lu CC, Tsai CF (2016) Financial ratios and corporate governance indicators in bankruptcy prediction: a comprehensive study[J]. Eur J Oper Res 252(2):561–572

    Article  Google Scholar 

  • Lin M, Prabhala NR, Viswanathan S (2013) Judging borrowers by the company they keep: friendship networks and information asymmetry in online peer-to-peer lending. Manage Sci 59(1):17–35

    Article  Google Scholar 

  • Lin X, Li X, Zheng Z (2017) Evaluating borrower’s default risk in peer-to-peer lending: evidence from a lending platform in China. Appl Econ 49(35):3538–3545

    Article  ADS  Google Scholar 

  • Lundberg SM, Nair B, Vavilala MS (2018) Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng 2(10):749–760

    Article  PubMed  PubMed Central  Google Scholar 

  • Lundberg SM, Erion G, Chen H (2020) From local explanations to global understanding with explainable AI for trees. Nat Mach Intell 2(1):56–67

    Article  PubMed  PubMed Central  Google Scholar 

  • Lundberg S M, Lee S I (2017) A unified approach to interpreting model predictions[C]//Proceedings of the 31st international conference on neural information processing systems: 4768–4777.

  • Martinez-Sola C, Garcia-Teruel PJ, Martinez-Solano P (2017) SMEs access to finance and the value of supplier financing. Spanish J Finance Account/revista Espanola De Financiacion y Contabilidad 46(4):455–483

    Google Scholar 

  • Mester LJ, Nakamura LI, Renault M (2007) Transactions accounts and loan monitoring. Rev Financial Stud 20(3):529–556

    Article  Google Scholar 

  • Molnar C (2020) Interpretable machine learning. Lulu. Com.

  • Moscatelli M, Parlapiano F, Narizzano S (2020) Corporate default forecasting with machine learning. Expert Syst Appl 161:113567

    Article  Google Scholar 

  • Moya-Carvajal J, Pérez-Galarce F, Taramasco C (2023) ML models for severity classification and length-of-stay forecasting in emergency units. Expert Systems with Applications: 119864.

  • Ochoa J G D, Maier L, Csiszar O (2021) Bayesian logical neural networks for human-centered applications in medicine. medRxiv.

  • Ohlson J A (1980) Financial ratios and the probabilistic prediction of bankruptcy. J Account Res: 109–131.

  • Oni OA, Oladele OI, Oyewole IK (2006) Analysis of factors influencing loan default among poultry farmers in ogun state nigeria. J Central Euro Agricult 6(4):619–624

    Google Scholar 

  • Petropoulos A, Chatzis SP, Xanthopoulos S (2016) A novel corporate credit rating system based on Student’st hidden Markov models. Expert Syst Appl 53:87–105

    Article  Google Scholar 

  • Puri M, Rocholl J, Steffen S (2017) What do a million observations have to say about loan defaults? Opening the black box of relationships. J Financial Intermediation 31:1–15

    Article  Google Scholar 

  • Shapley LS (1953) Stochastic games. Proc Natl Acad Sci 39(10):1095–1100

    Article  MathSciNet  CAS  PubMed  PubMed Central  ADS  Google Scholar 

  • Sohn SY, Kim DH, Yoon JH (2016) Technology credit scoring model with fuzzy logistic regression. Appl Soft Comput 43:150–158

    Article  Google Scholar 

  • Strumbelj E, Kononenko I (2010) An efficient explanation of individual classifications using game theory. J Mach Learn Res 11:1–18

    MathSciNet  Google Scholar 

  • Su M, Cheng D, Xu Y, Weng F (2023) An improved BERT method for the evolution of network public opinion of major infectious diseases: case study of COVID-19. Expert Syst Appl 233:120938

    Article  Google Scholar 

  • Torres-Tello J, Ko S B (2021) Interpretability of artificial intelligence models that use data fusion to predict yield in aeroponics J Ambient Intell Human Comput: 1–12.

  • Wahyudi I (2014) Default risk analysis in micro, small and medium enterprises: Does debt overhang theory occur? Asian Acad Manage J Account Finance 10(1):95–131

    MathSciNet  Google Scholar 

  • Weng F, Zhang H, Yang C (2021) Volatility forecasting of crude oil futures based on a genetic algorithm regularization online extreme learning machine with a forgetting factor: the role of news during the COVID-19 pandemic. Resour Policy 73:102148

    Article  PubMed  PubMed Central  Google Scholar 

  • Weng F, Zhu J, Yang C (2022) Analysis of financial pressure impacts on the health care industry with an explainable machine learning method: China versus the USA. Expert Syst Appl 210:118482

    Article  Google Scholar 

  • Xiao Y, Zhai H, Friederici AD (2016) The development of the intrinsic functional connectivity of default network subsystems from age 3 to 5. Brain Imaging Behav 10(1):50–59

    Article  PubMed  Google Scholar 

  • Yang C, Abedin MZ, Zhang H, Weng F, Hajek P (2023) An interpretable system for predicting the impact of COVID-19 government interventions on stock market sectors. Ann Oper Res, 1–28.

  • Yoshino N, Taghizadeh-Hesary F (2014) Monetary policy and oil price fluctuations following the subprime mortgage crisis. Int J Monetary Econ Finance 7(3):157–174

    Article  Google Scholar 

  • Zhang L, Wang J, Liu Z (2023) What should lenders be more concerned about? Developing a profit-driven loan default prediction model. Expert Syst Appl 213:118938

    Article  Google Scholar 

Download references

Acknowledgements

We are grateful to the reviewers and the editor for their helpful comments and suggestions. This work was supported by the National Office for Philosophy and Social Sciences of China under Grant 23BTJ044.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cai Yang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zheng, C., Weng, F., Luo, Y. et al. Micro and small enterprises default risk portrait: evidence from explainable machine learning method. J Ambient Intell Human Comput 15, 661–671 (2024). https://doi.org/10.1007/s12652-023-04722-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-023-04722-6

Keywords

Navigation