×

Accelerated failure time modeling via nonparametric mixtures. (English) Zbl 1522.62225

Summary: An accelerated failure time (AFT) model assuming a log-linear relationship between failure time and a set of covariates can be either parametric or semiparametric, depending on the distributional assumption for the error term. Both classes of AFT models have been popular in the analysis of censored failure time data. The semiparametric AFT model is more flexible and robust to departures from the distributional assumption than its parametric counterpart. However, the semiparametric AFT model is subject to producing biased results for estimating any quantities involving an intercept. Estimating an intercept requires a separate procedure. Moreover, a consistent estimation of the intercept requires stringent conditions. Thus, essential quantities such as mean failure times might not be reliably estimated using semiparametric AFT models, which can be naturally done in the framework of parametric AFT models. Meanwhile, parametric AFT models can be severely impaired by misspecifications. To overcome this, we propose a new type of the AFT model using a nonparametric Gaussian-scale mixture distribution. We also provide feasible algorithms to estimate the parameters and mixing distribution. The finite sample properties of the proposed estimators are investigated via an extensive stimulation study. The proposed estimators are illustrated using a real dataset.
{© 2021 The International Biometric Society.}

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis

Software:

R; survival; aftgee
Full Text: DOI

References:

[1] Bickel, P.J. (1982) On adaptive estimation. Annals of Statistics, 10, 647-671. · Zbl 0489.62033
[2] Böhning, D. (1985) Numerical estimation of a probability measure. Journal of Statistical Planning and Inference, 11, 57-69. · Zbl 0574.62046
[3] Böhning, D. (1986) A vertex‐exchange‐method in D‐optimal design theory. Metrika, 33, 337-347. · Zbl 0601.62091
[4] Brown, B.M.& Wang, Y.‐G. (2005) Standard errors and covariance matrices for smoothed rank estimators. Biometrika, 92, 149-158. · Zbl 1068.62037
[5] Brown, B.M.& Wang, Y.‐G. (2007) Induced smoothing for rank regression with censored survival times. Statistics in Medicine, 26, 828-836.
[6] Buckley, J.& James, I. (1979) Linear regression with censored data. Biometrika, 66, 429-436. · Zbl 0425.62051
[7] Chiou, S.H., Kang, S.& Yan, J. (2014) Fitting accelerated failure time models in routine survival analysis with R package aftgee. Journal of Statistical Software, 61, 1-23.
[8] Chiou, S.H., Kang, S.& Yan, J. (2015) Semiparametric accelerated failure time modeling for clustered failure times from stratified sampling. Journal of the American Statistical Association, 110, 621-629. · Zbl 1373.62492
[9] Ding, Y. & Nan, B. (2015) Estimating mean survival time: When is it possible?Scandinavian Journal of Statistics, 42, 397-413. · Zbl 1364.62257
[10] Efron, B. & Olshen, R.A. (1978) How broad is the class of normal scale mixtures?The Annals of Statistics, 6, 1159-1164. · Zbl 0385.62013
[11] Fleming, T.R. & Harrington, D.P. (2011) Counting Processes and Survival Analysis. Hoboken, NJ: John Wiley & Sons.
[12] Fygenson, M.& Ritov, Y. (1994) Monotone estimating equations for censored data. The Annals of Statistics, 22, 732-746. · Zbl 0807.62032
[13] Gehan, E.A. (1965) A generalized Wilcoxon test for comparing arbitrarily singly‐censored samples. Biometrika, 52, 203-223. · Zbl 0133.41901
[14] Graf, E., Schmoor, C., Sauerbrei, W.& Schumacher, M. (1999) Assessment and comparison of prognostic classification schemes for survival data. Statistics in Medicine, 18, 2529-2545.
[15] Jin, Z., Lin, D.Y., Wei, L.J.& Ying, Z. (2003) Rank‐based inference for the accelerated failure time model. Biometrika, 90, 341-353. · Zbl 1034.62103
[16] Jin, Z., Lin, D.Y.& Ying, Z. (2006) On least‐squares regression with censored data. Biometrika, 93, 147-161. · Zbl 1152.62068
[17] Johnson, L.M.& Strawderman, R.L. (2009) Induced smoothing for the semiparametric accelerated failure time model: Asymptotics and extensions to clustered data. Biometrika, 96, 577-590. · Zbl 1170.62069
[18] Karlsson, M.& Laitila, T. (2014) Finite mixture modeling of censored regression models. Statistical Papers, 55, 627-642. · Zbl 1416.62215
[19] Kiefer, J.& Wolfowitz, J. (1956) Consistency of the maximum likelihood estimator in the presence of infinitely incidental parameters. The Annals of Mathematical Statistics, 27, 886-906. · Zbl 0073.14701
[20] Klein, J.P. & Moeschberger, M.L. (2006) Survival Analysis: Techniques for Censored and Truncated Data. Berlin, Germany. Springer Science & Business Media.
[21] Komárek, A., Lesaffre, E.& Hilton, J.F. (2005) Accelerated failure time model for arbitrarily censored data with smoothed error distribution. Journal of Computational and Graphical Statistics, 14, 726-745.
[22] Lai, T.L.& Ying, Z. (1991) Large sample theory of a modified Buckley‐James estimator for regression analysis with censored data. The Annals of Statistics, 19, 1370-1402. · Zbl 0742.62043
[23] Lesperance, M.L.& Kalbfleisch, J.D. (1992) An algorithm for computing the nonparametric MLE of a mixing distribution. Journal of the American Statistical Association, 87, 120-126. · Zbl 0850.62336
[24] Lindsay, B.G. (1983) The geometry of mixture likelihoods : A general theory. The Annals of Statistics, 11, 86-94. · Zbl 0512.62005
[25] Lindsay, B.G.& Lesperance, M.L. (1995) A review of semiparametric mixture models. Journal of Statistical Planning and Inference, 47, 29-39. · Zbl 0832.62027
[26] Murphy, S.A.& van derVaart, A.W. (2000) On profile likelihood. Journal of the American Statistical Association, 95, 449-465. · Zbl 0995.62033
[27] Prentice, R.L. (1978) Linear rank tests with right censored data (Corr: V70 p304). Biometrika, 65, 167-180. · Zbl 0377.62024
[28] R Core Team (2020) R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.
[29] Ritov, Y. (1990) Estimation in a linear regression model with censored data. The Annals of Statistics, 18, 303-328. · Zbl 0713.62045
[30] Seo, B.& Lee, T. (2015) A new algorithm for maximum likelihood estimation in normal scale‐mixture generalized autoregressive conditional heteroskedastic models. Journal of Statistical Computation and Simulation, 85, 202-215. · Zbl 1457.62279
[31] Seo, B., Noh, J., Lee, T.& Yoon, Y. (2017) Adaptive robust regression with continuous Gaussian scale mixture errors. Journal of the Korean Statistical Society, 46, 113-125. · Zbl 1357.62246
[32] Stute, W.& Wang, J.‐L. (1993) The strong law under random censorship. The Annals of Statistics, 21, 1591-1607. · Zbl 0785.60020
[33] Susarla, V.& Van Ryzin, J. (1980) Large sample theory for an estimator of the mean survival time from censored samples. The Annals of Statistics, 8, 1002-1016. · Zbl 0455.62030
[34] Therneau, T.M. (2021) A Package for Survival Analysis in R. R package version 3.2‐11, https://CRAN.R-project.org/package=survival.
[35] Therneau, T.M. & Grambsch, P.M. (2000) Modeling Survival Data: Extending the Cox Model. New York: Springer. · Zbl 0958.62094
[36] Tsiatis, A.A. (1990) Estimating regression parameters using linear rank tests for censored data. The Annals of Statistics, 18, 354-372. · Zbl 0701.62051
[37] van derVaart, A.W. (1996) Efficient maximum likelihood estimation in semiparametric mixture models. The Annals of Statistics, 24, 862-878. · Zbl 0860.62029
[38] Wang, Y. (2007) On fast computation of the non‐parametric maximum likelihood estimate of a mixing distribution. Journal of the Royal Statistical Society, Series B, Methodological, 69, 185-198. · Zbl 1120.62022
[39] Xiang, S., Yao, W.& Seo, B. (2016) Semiparametric mixture: Continuous scale mixture approach. Computational Statistics and Data Analysis, 103, 413-425. · Zbl 1466.62218
[40] Ying, Z. (1993) A large sample study of rank estimation for censored regression data. The Annals of Statistics, 21, 76-99. · Zbl 0773.62048
[41] Zeller, C.B., Cabral, C. R.B., Lachos, V.H.& Benites, L. (2019) Finite mixture of regression models for censored data based on scale mixtures of normal distributions. Advances in Data Analysis and Classification, 13, 89-116. · Zbl 1474.62259
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.