×

A general model-checking procedure for semiparametric accelerated failure time models. (English) Zbl 1541.62008

Summary: We propose a set of goodness-of-fit tests for the semiparametric accelerated failure time (AFT) model, including an omnibus test, a link function test, and a functional form test. This set of tests is derived from a multi-parameter cumulative sum process shown to follow asymptotically a zero-mean Gaussian process. Its evaluation is based on the asymptotically equivalent perturbed version, which enables both graphical and numerical evaluations of the assumed AFT model. Empirical \(p\)-values are obtained using the Kolmogorov-type supremum test, which provides a reliable approach for estimating the significance of both proposed un-standardized and standardized test statistics. The proposed procedure is illustrated using the rank-based estimator but is general in the sense that it is directly applicable to some other popular estimators such as induced smoothed rank-based estimator or least-squares estimator that satisfies certain properties. Our proposed methods are rigorously evaluated using extensive simulation experiments that demonstrate their effectiveness in maintaining a Type I error rate and detecting departures from the assumed AFT model in practical sample sizes and censoring rates. Furthermore, the proposed approach is applied to the analysis of the Primary Biliary Cirrhosis data, a widely studied dataset in survival analysis, providing further evidence of the practical usefulness of the proposed methods in real-world scenarios. To make the proposed methods more accessible to researchers, we have implemented them in the R package afttest, which is publicly available on the Comprehensive R Archive Network.

MSC:

62-08 Computational methods for problems pertaining to statistics
62N05 Reliability and life testing
62P10 Applications of statistics to biology and medical sciences; meta analysis

References:

[1] Bae, W., Choi, D., Yan, J., Kang, S.: afttest: model diagnostics for accelerated failure time models. R package version 4.3.2.2. https://cran.r-project.org/web/packages/afttest/index.html (2022)
[2] Bagdonavičius, VB; Levuliene, RJ; Nikulin, MS, Chi-squared goodness-of-fit tests for parametric accelerated failure time models, Commun. Stat.-Theory Methods, 42, 2768-2785, 2013 · Zbl 1280.62053 · doi:10.1080/03610926.2011.617483
[3] Balakrishnan, N.; Chimitova, E.; Galanova, N.; Vedernikova, M., Testing goodness of fit of parametric aft and ph models with residuals, Commun. Stat.-Simul. Comput., 42, 1352-1367, 2013 · Zbl 1347.62212 · doi:10.1080/03610918.2012.659824
[4] Barlow, WE; Prentice, RL, Residuals for relative risk regression, Biometrika, 75, 65-74, 1988 · Zbl 0632.62102 · doi:10.1093/biomet/75.1.65
[5] Brown, BM; Wang, Y., Induced smoothing for rank regression with censored survival times, Stat. Med., 26, 828-836, 2006 · doi:10.1002/sim.2576
[6] Buckley, J.; James, I., Linear regression with censored data, Biometrika, 66, 429-436, 1979 · Zbl 0425.62051 · doi:10.1093/biomet/66.3.429
[7] Cain, SR, Distinguishing between lognormal and weibull distributions [time-to-failure data], IEEE Trans. Reliab., 51, 32-38, 2002 · doi:10.1109/24.994903
[8] Cavallo, A., Rosenthal, B., Wang, X., Yan, J.: Treatment of the data collection threshold in operational risk: a case study using the lognormal distribution. J. Operat. Risk 7(1), 3-38 (2012). doi:10.21314/jop.2012.101
[9] Chiou, S.; Kang, S.; Yan, J., Rank-based estimating equations with general weight for accelerated failure time models: an induced smoothing approach, Stat. Med., 34, 1495-1510, 2015 · doi:10.1002/sim.6415
[10] Chiou, SH; Kang, S.; Kim, J.; Yan, J., Marginal semiparametric multivariate accelerated failure time model with generalized estimating equations, Lifetime Data Anal., 20, 599-618, 2014 · Zbl 1357.62296 · doi:10.1007/s10985-014-9292-x
[11] Chiou, SH; Kang, S.; Yan, J., Fitting accelerated failure time models in routine survival analysis with r package aftgee, J. Stat. Softw., 61, 1-23, 2014 · doi:10.18637/jss.v061.i11
[12] Chiou, SH; Kang, S.; Yan, J., Semiparametric accelerated failure time modeling for clustered failure times from stratified sampling, J. Am. Stat. Assoc., 110, 621-629, 2015 · Zbl 1373.62492 · doi:10.1080/01621459.2014.917978
[13] Cockeran, M.; Meintanis, SG; Santana, L.; Allison, JS, Goodness-of-fit testing of survival models in the presence of type-ii right censoring, Comput. Statistics, 36, 977-1010, 2021 · Zbl 1505.62108 · doi:10.1007/s00180-020-01050-7
[14] Cox, DR, Regression models and life-tables, J. Roy. Stat. Soc.: Ser. B (Methodol.), 34, 187-202, 1972 · doi:10.1111/j.2517-6161.1972.tb00899.x
[15] Diehl, S.; Stute, W., Kernel density and hazard function estimation in the presence of censoring, J. Multivar. Anal., 25, 299-310, 1988 · Zbl 0661.62028 · doi:10.1016/0047-259X(88)90053-X
[16] Ding, Y.; Nan, B., Estimating mean survival time: when is it possible?, Scand. J. Stat., 42, 397-413, 2015 · Zbl 1364.62257 · doi:10.1111/sjos.12112
[17] Fleming, TR; Harrington, DP, Counting processes and survival analysis, 2013, John Wiley & Sons
[18] Hardy, G., Littlewood, J., Pólya, G.: Inequalities. Cambridge Mathematical Library, Cambridge University Press https://books.google.com/books?id=t1RCSP8YKt8C (1952) · Zbl 0047.05302
[19] Huang, C-Y; Luo, X.; Follmann, DA, A model checking method for the proportional hazards model with recurrent gap time data, Biostatistics, 12, 535-547, 2011 · Zbl 1218.62112 · doi:10.1093/biostatistics/kxq071
[20] Jin, Z.; Lin, D.; Wei, L.; Ying, Z., Rank-based inference for the accelerated failure time model, Biometrika, 90, 341-353, 2003 · Zbl 1034.62103 · doi:10.1093/biomet/90.2.341
[21] Jin, Z.; Lin, DY; Ying, Z., On least-squares regression with censored data, Biometrika, 93, 147-161, 2006 · Zbl 1152.62068 · doi:10.1093/biomet/93.1.147
[22] Johnson, LM; Strawderman, RL, Induced smoothing for the semiparametric accelerated failure time model: asymptotics and extensions to clustered data, Biometrika, 96, 577-590, 2009 · Zbl 1170.62069 · doi:10.1093/biomet/asp025
[23] Lee, C.H., Ning, J., Shen, Y.: Model diagnostics for the proportional hazards model with length-biased data. Lifetime Data Anal. 25, 79-96 (2019) · Zbl 1429.62562
[24] Li, J.; Scheike, TH; Zhang, M-J, Checking fine and gray subdistribution hazards model with cumulative sums of residuals, Lifetime Data Anal., 21, 197-217, 2015 · Zbl 1322.62028 · doi:10.1007/s10985-014-9313-9
[25] Lin, D.; Wei, L.; Ying, Z., Accelerated failure time models for counting processes, Biometrika, 85, 605-618, 1998 · Zbl 0947.62069 · doi:10.1093/biomet/85.3.605
[26] Lin, D., Ying, Z.: Semiparametric inference for the accelerated life model with time-dependent covariates. J. Stat. Plan. Inference 44, 47-63 (1995) · Zbl 0812.62096
[27] Lin, DY; Spiekerman, CF, Model checking techniques for parametric regression with censored data, Scand. J. Stat., 23, 157-177, 1996 · Zbl 0854.62081
[28] Lin, DY; Wei, L-J; Ying, Z., Checking the cox model with cumulative sums of martingale-based residuals, Biometrika, 80, 557-572, 1993 · Zbl 0788.62094 · doi:10.1093/biomet/80.3.557
[29] Lu, W.; Liu, M.; Chen, Y-H, Testing goodness-of-fit for the proportional hazards model based on nested case-control data, Biometrics, 70, 845-851, 2014 · Zbl 1393.62081 · doi:10.1111/biom.12239
[30] Novák, P., Goodness-of-fit test for the accelerated failure time model based on martingale residuals, Kybernetika, 49, 40-59, 2013 · Zbl 1297.62212
[31] Pollard, D., Empirical processes: theory and applications, NSF-CBMS Reg. Conf. Series Probab. Stat., 2, 1-86, 1990 · Zbl 0741.60001
[32] Prentice, RL, Linear rank tests with right censored data, Biometrika, 65, 167-179, 1978 · Zbl 0377.62024 · doi:10.1093/biomet/65.1.167
[33] Sfumato, P.; Filleron, T.; Giorgi, R.; Cook, RJ; Boher, J-M, Goftte: A r package for assessing goodness-of-fit in proportional (sub) distributions hazards regression models, Comput. Methods Programs Biomed., 177, 269-275, 2019 · doi:10.1016/j.cmpb.2019.05.029
[34] Shorack, GR; Wellner, JA, Empirical processes with applications to statistics, Soc. Ind. Appl. Math., 2009 · Zbl 1171.62057 · doi:10.1137/1.9780898719017
[35] Silverman, B., Density estimation for statistics and data analysis, 2018, CRC Press · doi:10.1201/9781315140919
[36] Spiekerman, C.; Lin, D., Checking the marginal cox model for correlated failure time data, Biometrika, 83, 143-156, 1996 · Zbl 0865.62080 · doi:10.1093/biomet/83.1.143
[37] Team, R. C.: R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria https://www.R-project.org/ (2023)
[38] Therneau, T. M.: A package for survival analysis in R. R package version 3.5-5. https://cran.r-project.org/package=survival (2023)
[39] Tsiatis, AA, Estimating regression parameters using linear rank tests for censored data, Ann. Stat., 18, 354-372, 1990 · Zbl 0701.62051 · doi:10.1214/aos/1176347504
[40] Wei, L-J, The accelerated failure time model: a useful alternative to the cox regression model in survival analysis, Stat. Med., 11, 1871-1879, 1992 · doi:10.1002/sim.4780111409
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.