×

Selecting bivariate copula models using image recognition. (English) Zbl 1524.62238

Summary: The choice of a copula model from limited data is a hard but important task. Motivated by the visual patterns that different copula models produce in smoothed density heatmaps, we consider copula model selection as an image recognition problem. We extract image features from heatmaps using the pre-trained AlexNet and present workflows for model selection that combine image features with statistical information. We employ dimension reduction via Principal Component and Linear Discriminant Analyses and use a Support Vector Machine classifier. Simulation studies show that the use of image data improves the accuracy of the copula model selection task, particularly in scenarios where sample sizes and correlations are low. This finding indicates that transfer learning can support statistical procedures of model selection. We demonstrate application of the proposed approach to the joint modelling of weekly returns of the MSCI and RISX indices.

MSC:

62H05 Characterization and structure theory for multivariate probability distributions; copulas

References:

[1] Abdallah, A., Boucher, J.-P. and Cossette, H. (2015) Modeling dependence between loss triangles with hierarchical archimedean copulas. ASTIN Bulletin, 45(3), 577-599. · Zbl 1390.91154
[2] Androschuck, T., Gibbs, S., Katrakis, N., Lau, J., Oram, S., Raddall, P., Semchyshyn, L., Stevenson, D. and Waters, J. (2017) Simulation-based capital models: Testing, justifying and communicating choices. A report from the life aggregation and simulation techniques working party. British Actuarial Journal, 22(2), 257-335.
[3] Bengio, Y., Courville, A. and Vincent, P. (2013). Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(8), 1798-1828.
[4] Czado, C., Kastenmeier, R., Brechmann, E.C. and Min, A. (2012) A mixed copula model for insurance claims and claim sizes. Scandinavian Actuarial Journal, 2012(4), 278-305. · Zbl 1277.62249
[5] Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K. and Fei-Fei, L. (2009) ImageNet: A large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248-255. IEEE.
[6] Denuit, M., Dhaene, J., Goovaerts, M. and Kaas, R. (2006) Actuarial Theory for Dependent Risks: Measures, Orders and Models. New Jersey, US: John Wiley & Sons.
[7] Denuit, M. and Lambert, P. (2005) Constraints on concordance measures in bivariate discrete data. Journal of Multivariate Analysis, 93(1), 40-57. · Zbl 1095.62065
[8] Embrechts, P., Puccetti, G., Rüschendorf, L., Wang, R. and Beleraj, A. (2014) An academic response to Basel 3.5. Risks, 2(1), 25-48.
[9] Frees, E.W. and Valdez, E.A. (1998) Understanding relationships using copulas. North American Actuarial Journal, 2(1), 1-25. · Zbl 1081.62564
[10] Frees, E.W. and Wang, P. (2005) Credibility using copulas. North American Actuarial Journal, 9(2), 31-48. · Zbl 1085.62121
[11] Fukui, K. and Maki, A. (2015) Difference subspace and its generalization for subspace-based methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(11), 2164-2177.
[12] Genest, C. and Favre, A.-C. (2007) Everything you always wanted to know about copula modeling but were afraid to ask. Journal of Hydrologic Engineering, 12(4), 347-368.
[13] Genest, C., Nešlehová, J.G., Rémillard, B. and Murphy, O.A. (2019) Testing for independence in arbitrary distributions. Biometrika, 106(1), 47-68. · Zbl 1506.62309
[14] Genest, C., Rémillard, B. and Beaudoin, D. (2009) Goodness-of-fit tests for copulas: A review and a power study. Insurance: Mathematics and Economics, 44(2), 199-213. · Zbl 1161.91416
[15] Genest, C. and Scherer, M. (2020) Insurance applications of dependence modeling. Dependence Modeling, 8(1), 93-106.
[16] Grønneberg, S. and Hjort, N.L. (2014) The copula information criteria. Scandinavian Journal of Statistics, 41(2), 436-459. · Zbl 1416.62249
[17] Hastie, T., Tibshirani, R. and Friedman, J. (2009) The Elements of Statistical Learning: Data Mining, Inference and Prediction. New York: Springer Series in Statistics. · Zbl 1273.62005
[18] Hofert, M., Kojadinovic, I., Maechler, M. and Yan, J. (2020) copula: Multivariate Dependence with Copulas. R package version 1.0-1.
[19] Hofert, M., Prasad, A. and Zhu, M. (2021) Quasi-random sampling for multivariate distributions via generative neural networks. Journal of Computational and Graphical Statistics, 30(3), 647-670. · Zbl 07499908
[20] Hu, S., Murphy, T.B. and O’Hagan, A. (2021) mvClaim: An R package for multivariate general insurance claims severity modelling. Annals of Actuarial Science, 15(2), 441-457.
[21] Huard, D., Evin, G. and Favre, A.-C. (2006) Bayesian copula selection. Computational Statistics & Data Analysis, 51(2), 809-822. · Zbl 1157.62359
[22] (2022) RISX: ICMR (Re)Insurance Specialty Index. https://risxindex.com/. Accessed: 2022-01-19.
[23] Islam, M., Dinh, A., Wahid, K. and Bhowmik, P. (2017) Detection of potato diseases using image segmentation and multiclass support vector machine. 2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE), pp. 1-4. IEEE.
[24] Jordanger, L.A. and Tjøstheim, D. (2014) Model selection of copulas: AIC versus a cross validation copula information criterion. Statistics & Probability Letters, 92, 249-255. · Zbl 1396.62099
[25] Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pp. 1097-1105.
[26] Kularatne, T.D., Li, J. and Pitt, D. (2021) On the use of Archimedean copulas for insurance modelling. Annals of Actuarial Science, 15(1), 57-81.
[27] Mcneil, A.J., Frey, R. and Embrechts, P. (2015) Quantitative Risk Management: Concepts, Techniques and Tools – revised edition. New Jersey, US: Princeton University Press. · Zbl 1337.91003
[28] (2022). MSCI developed markets indexes. https://www.msci.com/our-solutions/indexes/de-veloped-markets. Accessed: 2022-01-19.
[29] Nelsen, R.B. (2007) An Introduction to Copulas. New York: Springer. · Zbl 1152.62030
[30] Pan, S.J. and Yang, Q. (2009) A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22(10), 1345-1359.
[31] Pesenti, S.M., Millossovich, P. and Tsanakas, A. (2019) Reverse sensitivity testing: What does it take to break the model?European Journal of Operational Research, 274(2), 654-670. · Zbl 1406.91203
[32] Rosco, J. and Joe, H. (2013) Measures of tail asymmetry for bivariate copulas. Statistical Papers, 54(3), 709-726. · Zbl 1307.62157
[33] Rüschendorf, L. and De Valk, V. (1993) On regression representations of stochastic processes. Stochastic Processes and their Applications, 46(2), 183-198. · Zbl 0779.60058
[34] Schepsmeier, U., Stoeber, J., Brechmann, E. C., Graeler, B., Nagler, T., Erhardt, T., Almeida, C., Min, A., Czado, C., Hofmann, M., Killiches, M., Joe, H. and Vatter, T. (2021) Package ‘vinecopula’. R package version, 2.4.2.
[35] Shaw, R., Smith, A.D. and Spivak, G. (2010). Calibration and communication of dependencies with a case study based on market returns. https://www.actuaries.org.uk/system/files/documents/pdf/b3.pdf. Presented to the Institute and Faculty of Actuaries, November 2010.
[36] Sheykhmousa, M., Mahdianpari, M., Ghanbari, H., Mohammadimanesh, F., Ghamisi, P. and Homayouni, S. (2020) Support vector machine vs. random forest for remote sensing image classification: A meta-analysis and systematic review. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 13, 6308-6325.
[37] Shi, P. (2014) A copula regression for modeling multivariate loss triangles and quantifying reserving variability. ASTIN Bulletin, 44(1), 85-102. · Zbl 1284.62644
[38] Shi, P. and Frees, E.W. (2011) Dependent loss reserving using copulas. ASTIN Bulletin, 41(2), 449-486.
[39] Shi, P. and Valdez, E.A. (2014) Multivariate negative binomial models for insurance claim counts. Insurance: Mathematics and Economics, 55, 18-29. · Zbl 1296.91169
[40] Tzotsos, A. and Argialas, D. (2008) Support vector machine classification for object-based image analysis. In Object-Based Image Analysis, pp. 663-677. Springer.
[41] Tzougas, G. and Pignatelli Di Cerchiara, A. (2021) The multivariate mixed negative binomial regression model with an application to insurance a posteriori ratemaking. Insurance: Mathematics and Economics, 101, 602-625. · Zbl 1475.91319
[42] Venables, W.N. and Ripley, B.D. (2002) Modern Applied Statistics with S, 4th ed. New York: Springer. ISBN 0-387-95457-0. · Zbl 1006.62003
[43] Wold, S., Esbensen, K. and Geladi, P. (1987) Principal component analysis. Chemometrics and Intelligent Laboratory Systems, 2(1-3), 37-52.
[44] Yan, J. (2007). Enjoy the joy of copulas: With a package copula. Journal of Statistical Software, 21(4), 1-21.
[45] Yang, L. and Jin, R. (2006) Distance metric learning: A comprehensive survey. Michigan State University, 2(2), 4.
[46] Zhuang, F., Qi, Z., Duan, K., Xi, D., Zhu, Y., Zhu, H., Xiong, H. and He, Q. (2020) A comprehensive survey on transfer learning. Proceedings of the IEEE, 109(1), 43-76.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.