×

Identifying hidden patterns in credit risk survival data using generalised additive models. (English) Zbl 1431.91411

Summary: Modelling patterns in credit risk using survival analysis techniques have received considerable and increasing attention over the past decade. In these models, the predictor of the hazard of default is often expressed as a simple linear combination of the risk factors. In this work, we discuss how these models can be enhanced using generalised additive models (GAMs). In the GAMs framework, the predictor is formulated as a combination of flexible univariate functions of the risk factors. In this paper, we parametrise GAMs for credit risk data in terms of penalised splines, outline the implementation via frequentist and Bayesian MCMC methods, apply them to a large portfolio of credit card accounts, and show how GAMs can be used to improve not only the application, behavioural and macro-economic components of survival models for credit risk data at individual account level, but also the accuracy of predictions. From a practitioner point of view, this work highlights that some accounts may actually become more (less) attractive to the lender if flexible smooth functions are used whereas the same applicant may be denied (accepted) a loan if the linearity assumption is forced.

MSC:

91G40 Credit risk
91G70 Statistical methods; risk measures
62P05 Applications of statistics to actuarial sciences and financial mathematics

Software:

mgcv; WinBUGS; gamair; BayesX

References:

[1] Allison, P. D., Survival analysis using SAS: A practical guide (2010), SAS Institute Inc.: SAS Institute Inc. NC
[2] Andreeva, G., European generic scoring models using survival analysis, The Journal of Operational Research Society, 57, 1180-1187 (2006) · Zbl 1121.90072
[3] Bellotti, T.; Crook, J., Forecasting and stress testing credit card default using dynamic models, International Journal of Forecasting, 29, 563-574 (2013)
[4] Berg, D., Bankruptcy prediction by generalized additive models, Applied Stochastic Models in Business and Industry, 23, 129-143 (2006) · Zbl 1164.62078
[5] Brezger, A.; Kneib, T.; Lang, S., Bayesx: analyzing Bayesian structural additive regression models, Journal of Statistical Software, 14, 1-22 (2005)
[6] Brezger, A.; Lang, S., Generalized structured additive regression based on Bayesian p-splines, Computational Statistics and Data Analysis, 50, 947-991 (2006) · Zbl 1431.62308
[7] Crainiceanu, C.; Ruppert, D.; Wand, M. P., Bayesian analysis for penalized spline regression using winBUGS, Journal of Statistical Software, 14 (2005)
[8] Dakovica, R.; Czadoa, C.; Bergb, D., Bankruptcy prediction in norway: A comparison study, Applied Economics Letters, 17, 1739-1746 (2010)
[9] De Boor, C., A practical guide to splines (1978), Springer · Zbl 0406.41003
[10] Djeundje, V. A.B., Systematic deviation in smooth mixed models for multilevel longitudinal data, Statistical Methodology, 32, 203-217 (2016) · Zbl 1487.62021
[11] Djeundje, V. A.B.; Crook, J., Dynamic survival models with varying coefficients for credit risks, European Jo of Operational Research, 275, 319-333 (2019) · Zbl 1431.91410
[12] Drexler, M.; Ainsworth, C. H., Generalized additive models used to predict species abundance in the gulf of mexico: An ecosystem modeling tool, PLoS ONE, 8 (2013)
[13] Eilers, P. H.C.; Marx, B. D., Flexible smoothing with b-splines and penalties, Statistical Science, 11, 89-121 (1996) · Zbl 0955.62562
[14] Eilers, P. H.C.; Marx, B. D., Splines, knots, and penalties, Computational Statistics, 2, 637-653 (2010)
[15] Fahrmeir, L.; Lang, L., Bayesian inference for generalized additive mixed models based on Markov random field priors, Journal of the Royal Statistical Society. Series C, 50, 201-220 (2001)
[16] Friedman, J. H., Multivariate adaptive regression splines, Annals of Statistics, 19, 1-67 (1991) · Zbl 0765.62064
[17] Gamerman, D., Sampling from the posterior distribution in generalized linear mixed models, Statistics and Computing, 7, 57-68 (1997)
[18] Green, P. J.; Silverman, B. W., Nonparametric regression and generalized linear models (1995), Chapman and Hall
[19] Hastie, T. J.; Tibshirani, R. J., Generalized additive models, Statistical Science, 1, 297-318 (1986) · Zbl 0645.62068
[20] Hastie, T. J.; Tibshirani, R. J., Generalized additive models (1990), Chapman & Hall/CRC · Zbl 0747.62061
[21] Kaplan, E. L.; Meier, P., Nonparametric estimation from incomplete observations, Journal of American Statistical Association, 53, 457-481 (1958) · Zbl 0089.14801
[22] Lang, S.; Brezger, A., Bayesian p-splines, Journal of Computational and Graphical Statistics, 13, 183-212 (2004)
[23] Liu, W.; Chase, J. P.M.; Vu, C.; Acxiom; Cela, J., Generalizations of generalized additive model (GAM): A case of credit risk modeling (2009), SAS Global
[24] Sapra, A. K., Generalized additive models in business and economics, International Journal of Advanced Statistics and Probability, 1, 64-81 (2013)
[25] Stepanova, M.; Thomas, L. C., Survival analysis for personal loan data, The Journal of the Operational Research Society, 50, 277-289 (2002) · Zbl 1163.91521
[26] Therneau, T.; Grambsch, P., Modeling survival data: Extending the cox model (2000), Springer Verlag: Springer Verlag New York · Zbl 0958.62094
[27] Wood, S. N., Modelling and smoothing parameter estimation with multiple quadratic penalties, Journal of the Royal Statistical Society. Series B, 62, 413-428 (2000)
[28] Wood, S. N., Generalized additive models: An introduction with R (2006), Chapman & Hall/CRC · Zbl 1087.62082
[29] Wood, S. N., Fast stable direct fitting and smoothness selection for generalized additive models, Journal of the Royal Statistical Society. Series B, 70, 495-518 (2008) · Zbl 05563356
[30] Wood, S. N., Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models, Journal of the Royal Statistical Society. Series B, 73, 3-36 (2011) · Zbl 1411.62089
[31] Wood, S. N., Package mgcv (2016)
[32] Wood, S. N.; Goude, Y.; Shaw, S., Generalized additive models for large datasets, Journal of the Royal Statistical Society. Series C, 64, 139-155 (2015)
[33] Wood, S. N.; Li, Z.; Shaddick, G.; Augustin, N. H., Generalized additive models for gigadata: modelling the UK black smoke network daily data, Journal of the American Statistical Association, 64, 139-155 (2016)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.