×

Multiple imputation of masked competing risks data using machine learning algorithms. (English) Zbl 07632290

Summary: The analysis of masked cause of failure data is an important area in the reliability analysis. Prior researches mostly included masking probability as a part of likelihood function to handle masked competing risks analysis which were much time-consuming, high costly and complicated computationally. To optimize time and cost and also overcome complexity of calculation, in this paper, a new two-step approach is presented which is based on multiple imputation of masked causes of failure via some machine learning algorithms. Then, in the second step, the filled-in competing risks data are analysed using standard maximum likelihood approach. For competing risks data, the superiority of the proposed method comparing with the prior ones is evaluated in ML Estimations (MLE) of Life-time parameters via several simulation studies and applying on real data. Also, sensitivity analysis for biasness versus different sample sizes is exemplified.

MSC:

62-XX Statistics

Software:

ISLR
Full Text: DOI

References:

[1] Misaei, H.; Eftekhari Mahabadi, S.; Haghighi, F., Analysis of masked competing risks data with cause and time-dependent masking mechanism, J Stat Comput Simul (2020) · Zbl 07480172 · doi:10.1080/00949655.2020.1773465
[2] Shi, X.; Shi, Y.; Song, Q., Inference for four-unit hybrid system with masked data under partially acceleration life test, Syst Sci Control Eng, 6, 1, 195-206 (2018) · doi:10.1080/21642583.2018.1474503
[3] Cai, J.; Shi, Y.; Liu, B., Inference for a series system with dependent masked data under progressive interval censoring, J Appl Stat, 44, 1, 3-15 (2017) · Zbl 1516.62174 · doi:10.1080/02664763.2016.1156658
[4] Sen, A, Banerjee, M, Basu, S, et al. Analysis of masked failure data under competing risks. Handbook of statistics. Vol. 20, Amsterdam, The Netherlands: North-Holland; 2001. p. 523-540.
[5] Yousif, Y.; Elfaki, F.; Hrairi, M., A Bayesian approach to competing risks model with masked causes of failure and incomplete failure times, Math Probl Eng, 2020 (2020) · Zbl 07345990 · doi:10.1155/2020/8248640
[6] Xu, A.; Basu, S.; Tang, Y., A full Bayesian approach for masked data in step-stress accelerated life testing, IEEE Trans Reliab, 63, 3, 798-806 (2014) · doi:10.1109/TR.2014.2315940
[7] Liu, B.; Shi, Y.; Cai, J., Nonparametric Bayesian analysis for masked data from hybrid systems in accelerated lifetime tests, IEEE Trans Reliab, 66, 3, 662-676 (2017) · doi:10.1109/TR.2017.2704582
[8] Cai, J.; Shi, Y.; Liu, B., Bayesian analysis for dependent competing risks model with masked causes of failure in step-stress accelerated life test under progressive hybrid censoring, Commun Stat - Simul Comput, 49, 9, 2302-2320 (2020) · Zbl 07552798 · doi:10.1080/03610918.2018.1517212
[9] Basu, SSen; Banerjee, M., Bayesian analysis of competing risks with partially masked cause of failure, Appl Statist, 52, 77-93 (2003) · Zbl 1111.62326
[10] Lin, DKJ; Guess, FM., System life data analysis with dependent partial knowledge on the exact cause of system failure, Micro Electron Rel, 34, 535-544 (1994)
[11] Guttman, I.; Lin, DKJ; Reiser, B., Dependent masking and system life data analysis: Bayesian inference for two-component systems, Lifetime Data Anal, 1, 87-100 (1995) · Zbl 0821.62063
[12] Kuo, L.; Yang, TE., Bayesian reliability modeling for masked system lifetime data, Statist Probab Lett, 47, 229-241 (2000) · Zbl 0968.62072
[13] Mukhopadhyay, C, Basu, APMasking without the symmetry as-sumption: a Bayesian approach. Proc. Abstract Book 2nd Int. Conf. Math. Methods Rel.; Vol. 2, Bordeaux, France: Universite Victor Segalen; 2000. p. 784-787, .
[14] Craiu, RV, Duchesne, T, Gelmanand, A, et al. Using EM and DA for the competing risk model. Applied Bayesian modeling and causal inference from an incomplete-data perspective. New York (NY): Wiley; 2004. p. 234-245.
[15] Mukhopadhyay, C., Maximum likelihood analysis of masked series system lifetime data, J Statist Plann Inference, 136, 803-838 (2006) · Zbl 1077.62095
[16] Xu, A.; Tang, Y., Bayesian analysis of Pareto reliability with dependent masked data, IEEE Trans Rel, 58, 4, 583-588 (2009)
[17] Xu, A.; Tang, Y., Non-parametric Bayesian analysis of competing risks problem with masked data, Commun Statist-Theor Meth, 40, 2326-2336 (2011) · Zbl 1318.62119
[18] Cai, J.; Shi, Y.; Zhang, Y., Robust Bayesian analysis for parallel system with masked data under inverse Weibull lifetime distribution, Commun Stat-Theor Meth, 49, 1-13 (2019)
[19] Misaii, H, Haghighi, F, Eftekhari Mahabadi, S. Bayesian analysis of masked data with non-ignorable missing mechanism. 5nd Seminar on Reliability Theory and its Applications; 2019. · Zbl 07632290
[20] Han, S.; Andrei, A.; Tsui, K., Multiple imputation for competing risks survival data via pseudo-observations, Commun Stat Appl Meth, 25, 385-396 (2018) · doi:10.29220/CSAM.2018.25.4.385
[21] Lu, K.; Tsiatis, AA., Multiple imputation methods for estimating regression coefficients in the competing risks model with missing cause of failure, Biometrics, 57, 1191-1197 (2001) · Zbl 1209.62223
[22] Fine, JP; Gray, RJ., A proportional hazards model for the sub-distribution of a competing risk, J Amer Statist Assoc, 88, 400-409 (1999)
[23] Bakoyannis, G.; Siannis, F.; Touloumi, G., Modelling competing risks data with missing cause of failure, Statist Med, 29, 3172-3185 (2010) · doi:10.1002/sim.4133
[24] Hosmer JR, DW; Lemeshow, S., Applied logistic regression (2004), New York: John Wiley and Sons, New York
[25] James, G.; Witten, D.; Hastie, T., An introduction to statistical learning: with applications in R (2013), New York: Springer-Verlag, New York · Zbl 1281.62147
[26] Little, RJA; Rubin, DB., Statistical analysis with missing data (2002), Hoboken (NJ): Wiley, Hoboken (NJ) · Zbl 1011.62004
[27] Rubin, DB., Multiple imputation for nonresponse in surveys (1987), New York: Wiley, New York · Zbl 1070.62007
[28] Breiman, L.; Friedman, JH; Olshen, RA, Classification and regression trees (1984), New York: Routledge, New York · Zbl 0541.62042 · doi:10.1201/9781315139470
[29] Levuliene, R., Semiparametric estimation and goodness-of-fit tests for tire wear and failure time data, Nonlinear Anal Model Control, 7, 61-95 (2002) · Zbl 1036.62104
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.