×

Recurrent neural network for complex survival problems. (English) Zbl 07739785

Summary: Survival analysis has become one of the paramount procedures in the modeling of time-to-event data. When we encounter complex survival problems, the traditional approach remains limited in accounting for the complex correlational structure between the covariates and the outcome due to the strong assumptions that limit the inference and prediction ability of the resulting models. Several studies exist on the deep learning approach to survival modeling; moreover, the application for the case of complex survival problems still needs to be improved. In addition, the existing models need to address the data structure’s complexity fully and are subject to noise and redundant information. In this study, we design a deep learning technique (CmpXRnnSurv\(_{-}\)AE) that obliterates the limitations imposed by traditional approaches and addresses the above issues to jointly predict the risk-specific probabilities and survival function for recurrent events with competing risks. We introduce the component termed Risks Information Weights (RIW) as an attention mechanism to compute the weighted cumulative incidence function (WCIF) and an external auto-encoder (ExternalAE) as a feature selector to extract complex characteristics among the set of covariates responsible for the cause-specific events. We train our model using synthetic and real data sets and employ the appropriate metrics for complex survival models for evaluation. As benchmarks, we selected both traditional, and machine learning models and our model demonstrates better performance across all datasets.

MSC:

62-XX Statistics
62N01 Censored data models
62-07 Data analysis (statistics) (MSC2010)
Full Text: DOI

References:

[1] Cox, DR., Partial likelihood, Biometrika, 62, 2, 269-276 (1975) · Zbl 0312.62002
[2] Kaplan, EL; Meier, P., Nonparametric estimation from incomplete observations, J Am Stat Assoc, 53, 457-481 (1958) · Zbl 0089.14801
[3] Luck, M, Sylvain, T, Cardinal, H, et al. Deep learning for patient-specific kidney graft survival analysis. arXiv preprint arXiv:1705.10245; 2017.
[4] Lee, C.; Yoon, J.; Schaar, M., Dynamic-DeepHit: a deep learning approach for dynamic survival analysis with competing risks based on longitudinal data, IEEE Trans Biomed Eng, 67, 1, 122-133 (2020)
[5] Lee, C, Zame, WR, Yoon, J, et al. DeepHit: a deep learning approach to survival analysis with competing risks. In 32nd Association for the Advancement of Artificial Intelligence (AAAI) Conference; 2018.
[6] Wang, P.; Li, Y.; Reddy, CK., Machine learning for survival analysis: A survey, ACM Comput: Surv, 51, 6, 36 (2019)
[7] Gupta, G.; Sunder, V.; Prasad, R.; Yang, Q.; Zhou, ZH; Gong, Z.; Zhang, ML; Huang, SJ, Advances in knowledge discovery and data mining. PAKDD 2019. lecture notes in computer science; vol 11440, CRESA: a deep learning approach to competing risks, recurrent event survival analysis (2019), Cham: Springer, Cham · doi:10.1007/978-3-030-16145-3-9
[8] Yao, J.; Zhu, X.; Zhu, F.; Descoteaux, M.; Maier-Hein, L.; Franz, A.; Jannin, P.; Collins, D.; Duchesne, S., Medical image computing and computer-assisted intervention− MICCAI 2017. MICCAI 2017. Lecture notes in computer science; vol 10434, Deep correlational learning for survival prediction from multi-modality data (2017), Cham: Springer, Cham · doi:10.1007/978-3-319-66185-8-46
[9] Du, S.; Li, T.; Yang, Y., Multivariate time series forecasting via attention-based encoder-decoder framework, Neurocomputing, 388, 269-279 (2020)
[10] Coccia, M., Deep learning technology for improving cancer care in society: new directions in cancer imaging driven by artificial intelligence, Technol Soc, 60, 1-11 (2020)
[11] Rietschel, C, Jinsung, Y, van der Schaar, M. Feature selection for survival analysis with competing risks using deep learning. arXiv preprint arXiv:1811.09317; 2018.
[12] Falcon, W. Pytorch lightning. GitHub. Web page: https://github.com/PyTorchLightning/pytorch-lightning. Last accessed: 10, 2022.
[13] Wang, P.; Shi, T.; Reddy, CK., A novel tensor-based temporal multi-task survival analysis model, IEEE Trans Knowl Data Eng, 33, 9, 3311-3322 (2021)
[14] Li, Y, Wang, J, Ye, J, et al. A multi-task learning formulation for survival analysis. Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’16. ACM, 2016: 1715-1724. doi:10.11452939672.2939857.
[15] Giunchiglia, E.; Nemchenko, A.; van der Schaar, M.; Kůrková, V.; Manolopoulos, Y.; Hammer, B.; Iliadis, L.; Maglogiannis, I., Artificial neural networks and machine learning - ICANN 2018. ICANN 2018. lecture notes in computer science, vol 11141, RNN-SURV: a deep recurrent model for survival analysis (2018), Cham: Springer, Cham · doi:10.1007/978-3-030-01424-7-3
[16] Alaa, AM, van der Schaar, M. Deep multi-task Gaussian processes for survival analysis with competing risks. Proceedings of the 31th Conference on Neural Information Processing Systems (NIPS 2017), 2017; 2326-2334.
[17] Zhu, X, Yao, J, Huang, J. Deep convolutional neural network for survival analysis with pathological images. 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Shenzhen, 2016; 544-547. doi:.
[18] Liao, L.; Ahn, H., Combining deep learning and survival analysis for asset health management, Int J Prognostics Health Manage, 7, 1-7 (2016)
[19] Johnson, A.; Pollard, T.; Shen, L., MIMIC-III, a freely accessible critical care database, Sci Data, 3, 160035 (2016) · doi:10.1038/sdata.2016.35
[20] Fine, JP; Gray, RJ., A proportional hazards model for the sub-distribution of a competing risk, J Am Stat Assoc, 94, 446, 496-509 (1999) · Zbl 0999.62077
[21] Manaswi, NK., Deep Learning with Applications Using Python: Chatbots and Face, Object, and Speech Recognition With TensorFlow and Keras, RNN and LSTM, 115-126 (2018), Berkeley, CA: Apress, Berkeley, CA
[22] Nielsen, MA., Neural networks and deep learning. Vol. 25 (2015), San Francisco (CA: Determination press, San Francisco (CA
[23] Rumelhart, DE; Hinton, GE; Williams, RJ., Learning representations by back-propagating errors, Nature, 323, 6088, 533-536 (1986) · Zbl 1369.68284
[24] Bengio, Y.; Goodfellow, I.; Courville, A., Deep learning (Vol. 1) (2017), Cambridge (MA: MIT press, Cambridge (MA
[25] Ravanelli, M.; Brakel, P.; Omologo, M., Light gated recurrent units for speech recognition, IEEE Trans Emerg Top Comput Intell, 2, 2, 92-102 (2018) · doi:10.1109/TETCI.2017.2762739
[26] Li, J, Hu, X, Wu, L, et al. Robust unsupervised feature selection on networked data. Proceedings of the 2016 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics; 2016.
[27] Bank, D, Koenigstein, N, Giryes, R. Autoencoders. arXiv preprint arXiv:2003.05991, 2020.
[28] Wang, S, Zhengming, D, Fu, Y. Feature selection guided auto-encoder. Proceedings of the AAAI Conference on Artificial Intelligence. 2017; 31(1).
[29] Bahdanau, D, Cho, K. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv: 1409.0473, 2014.
[30] Chorowski, JK, Bahdanau, D, Serdyuk, D, et al. Attention-Based Models for Speech Recognition. ArXiv, abs/1506.07503, 2015.
[31] Sinha, A.; Dolz, J., Multi-scale self-guided attention for medical image segmentation, IEEE J Biomed Health Inform, 25, 1, 121-130 (2020)
[32] Huang, S, Wang, D, Wu, X, et al. DSANet: Dual self-attention network for multivariate time series forecasting. Proceedings of the 28th ACM international conference on information and knowledge management, 2019:2129-2132.
[33] Graves, A, Wayne, G, Danihelka, I. Neural turing machines. arXiv preprint arXiv:1410.5401, 2014.
[34] Vaswani, A.; Shazeer, N.; Parmar, N., Attention is all you need, Adv Neural Inf Process Syst, 30 (2017)
[35] Luong, MT, Pham, H, Manning, CD. Effective approaches to attention-based neural machine translation. arXiv preprint arXiv:1508.04025, 2015.
[36] Yessou, H. Analysis of deep learning loss functions for multi-label remote sensing image classification. Master’s thesis, The Polytechnic University of Milan, 2020.
[37] Longato, E.; Vettoretti, M.; Di Camillo, B., A practical perspective on the concordance index for the evaluation and selection of prognostic time-to-event models, J Biomed Inform, 108, 103496 (2020)
[38] Harrell, FE; Califf, RM; Pryor, DB, Evaluating the yield of medical tests, Jama, 247, 18, 2543-2546 (1982)
[39] UNOS-KIDPAN dataset. Organ Procurement and Transplantation Network (OPTN), 2022.
[40] Pénichoux, J, Moreau, T. Latouche A. Simulating recurrent events that mimic actual data: a review of the literature with emphasis on event-dependence. arXiv preprint arXiv:1503.05798, 2015.
[41] Ata Tutkun, N.; Marthin, P., A comparative study with bootstrap resampling technique to uncover behavior of unconditional hazards and survival functions for gamma and inverse Gaussian frailty models, Math Sci, 15, 99-109 (2021) · Zbl 1475.62246 · doi:10.1007/s40096-020-00366-1
[42] PyTorch: ReduceLROnPlateau — PyTorch 1.9.0 documentation, https://pytorch.org/docs/stable/generated/torch.optim.lr_scheduler.ReduceLROnPlateau.html#torch.optim.lr_scheduler.ReduceLROnPlateau.
[43] Bergstra, J.; Bengio, Y., Random search for hyper-parameter optimization, J Mach Learn Res, 13, 10, 281-305 (2012) · Zbl 1283.68282
[44] Ishwaran, H.; Gerds, TA; Kogalur, UB, Random survival forests for competing risks, Biostatistics, 15, 4, 757-773 (2014)
[45] Gray, B.; Gray, MB., Package ‘cmprsk’. subdistribution analysis of competing risks, R Package Version, 2, 2-7 (2014)
[46] Ishwaran, H.; Kogalur, UB; Kogalur, MUB., Package ‘randomForestSRC’, Breast, 6, 1 (2023)
[47] Hu, Y.; Yang, T.; Zhang, J., Dynamic prediction of mechanical thrombectomy outcome for acute ischemic stroke patients using machine learning, Brain Sci, 12, 7, 938 (202218)
[48] Lin, H.; Yan, J.; Yuan, L., Impact of diabetes mellitus developing after kidney transplantation on patient mortality and graft survival: a meta-analysis of adjusted data, Diabetol Metab Syndr, 13, 1, 126 (202130) · doi:10.1186/s13098-021-00742-4
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.