×

Performance of parametric survival models under non-random interval censoring: a simulation study. (English) Zbl 1468.62153

Summary: In many medical studies, individuals are seen periodically, at a set of pre-scheduled clinical visits. In such cases, when the outcome of interest is the occurrence of an event, the corresponding times are only known to fall within an interval, formed by the times of two consecutive visits. Such data are called interval censored. Most methods for the analysis of interval-censored event times are based on a simplified likelihood function which relies on the assumption that the only information provided by the censoring intervals is that they contain the actual event time (i.e. non-informative censoring). In this simulation study, the performance of parametric models for interval-censored data when individuals miss some of the pre-scheduled visits completely at random (MCAR), at random (MAR) or not at random (MNAR) was assessed comparing also with a simpler approach that is often used in practice. A sample of HIV-RNA measurements and baseline covariates of HIV-1 infected individuals from the CASCADE study is used for illustration in an analysis of the time between the initiation of antiretroviral treatment and viral load suppression to undetectable levels. Results suggest that parametric models based on flexible distributions (e.g. generalised Gamma) can fit such data reasonably well and are robust to irregular visit times caused by an MCAR or MAR mechanism. Violating the non-informative censoring assumption though, leads to biased estimators with the direction and the magnitude of the bias depending on the direction and the strength of the association between the probability of missing visits and the actual time-to-event. Finally, simplifying the data in order to use standard survival analysis techniques, can yield misleading results even when the censoring intervals depend only on a baseline covariate.

MSC:

62-08 Computational methods for problems pertaining to statistics
62P10 Applications of statistics to biology and medical sciences; meta analysis
62N01 Censored data models

Software:

R; Intcox; Stata; STPM; Icens; INTCENS
Full Text: DOI

References:

[1] Aalen, O.; Gjessing, H., Understanding the shape of the hazard rate: a process point of view, Statistical Science, 16, 1-14, (2001) · Zbl 1059.62613
[2] Althoff, K. N.; Justice, A. C.; Gange, S. J.; Deeks, S. G.; Saag, M. S.; Silverberg, M. J.; Gill, M. J.; Lau, B.; Napravnik, S.; Tedaldi, E.; Klein, M. B.; Gebo, K. A., Virologic and immunologic response to HAART, by age and regimen class, JAIDS Journal of Acquired Immune Deficiency Syndromes, 24, 2469-2479, (2010)
[3] Betensky, R. A.; Finkelstein, D. M., Testing for dependence between failure time and visit compliance with interval-censored data, Biometrics, 58, 58-63, (2002) · Zbl 1209.62228
[4] Dorey, F.; Little, R.; Schenker, N., Multiple imputation for threshold-crossing data with interval censoring, Statistics in Medicine, 12, 1589-1603, (1993)
[5] Finkelstein, D. M.; Goggins, W. B.; Schoenfeld, D. A., Analysis of failure time data with dependent interval censoring, Biometrics, 58, 298-304, (2002) · Zbl 1209.62285
[6] Gentleman, R., Vandal, A., 2010. Icens: NPMLE for censored and truncated data. R package version 1.22.0. http://CRAN.R-project.org/package=Icens.
[7] Geretti, A. M.; Harrison, L.; Green, H.; Sabin, C.; Hill, T.; Fearnhill, E.; Pillay, D.; Dunn, D.; Anderson, J.; Asboe, D.; Pozniak, A.; Burns, S.; Cameron, S.; Cane, P.; Churchill, D.; Chrystie, I.; Clark, D.; Delpech, V.; Pillay, D.; Lazarus, L.; Dunn, D.; Fearnhill, E.; Green, H.; Porter, K.; Easterbrook, P.; Zuckerman, M.; Geretti, A. M.; Kellam, P.; Pillay, D.; Phillips, A.; Sabin, C.; Goldberg, D.; Gompels, M.; Hale, A.; Kaye, S.; Konov, S.; Mackie, N.; Orkin, C.; Smit, E.; Tilston, P.; Williams, I.; Zhang, H., Effect of HIV-1 subtype on virologic and immunologic response to starting highly active antiretroviral therapy, Clinical Infectious Diseases, 48, 1296-1305, (2009)
[8] Gomez, G.; Calle, M.; Oller, R., Frequentist and Bayesian approaches for interval-censored data, Statistical Papers, 45, 139-173, (2004) · Zbl 1052.62096
[9] Gomez, G.; Luz Calle, M.; Oller, R.; Langohr, K., Tutorial on methods for interval-censored data and their implementation in R, Statistical Modelling, 9, 259-297, (2009) · Zbl 07257705
[10] Griffin, J., 2005. Intcens: stata module to perform interval-censored survival analysis.
[11] Groeneboom, P.; Wellner, J. A., (Information Bounds and Nonparametric Maximum Likelihood Estimation, DMV Seminar, vol. 19, (1992), Birkhäuser Verlag Basel) · Zbl 0757.62017
[12] Henschel, V., Heiss, C., Mansmann, U., 2009. Intcox: iterated convex minorant algorithm for interval censored event data. R package version 0.9.2. http://CRAN.R-project.org/package=intcox.
[13] Law, C.; Brookmeyer, R., Effects of midpoint imputation on the analysis of doubly censored-data, Statistics in Medicine, 11, 1569-1578, (1992)
[14] Lesaffre, E.; Komarek, A.; Declerck, D., An overview of methods for interval-censored data with an emphasis on applications in dentistry, Statistical Methods in Medical Research, 14, 539-552, (2005) · Zbl 1099.62130
[15] Lindsey, J., A study of interval censoring in parametric regression models, Lifetime Data Analysis, 4, 329-354, (1998) · Zbl 0917.62085
[16] Odell, P.; Anderson, K.; Dagostino, R., Maximum-likelihood-estimation for interval-censored data using a Weibull-based accelerated failure time model, Biometrics, 48, 951-959, (1992)
[17] Pan, W., Extending the iterative convex minorant algorithm to the Cox model for interval-censored data, Journal of Computational and Graphical Statistics, 8, 109-120, (1999)
[18] Pence, B. W.; Miller, W. C.; Gaynes, B. N.; Eron, J. J., Psychiatric illness and virologic response in patients initiating highly active antiretroviral therapy, JAIDS-Journal of Acquired Immune Deficiency Syndromes, 44, 159-166, (2007)
[19] R Development Core Team, 2010. R: a language and environment for statistical computing. ISBN: 3-900051-07-0. http://www.R-project.org.
[20] Royston, P., 2007. Stpm: stata module to fit flexible parametric models for survival-time data.
[21] Self, S.; Grossman, E., Linear rank-tests for interval-censored data with application to PCB levels in adipose-tissue of transformer repair workers, Biometrics, 42, 521-530, (1986)
[22] StataCorp, 2011. Stata statistical software. College Station, TX: Statacorp lp.
[23] Turnbull, B., The empirical distribution function with arbitrarily grouped, censored and truncated data, Journal of the Royal Statistical Society: Series B, 38, 290-295, (1976) · Zbl 0343.62033
[24] van der Laan, M. J.; Robins, J. M., Locally efficient estimation with current status data and time-dependent covariates, Journal of the American Statistical Association, 93, 693-701, (1998) · Zbl 0953.62118
[25] Wang, L.; Sun, J.; Tong, X., Regression analysis of case II interval-censored failure time data with the additive hazards model, Statistica Sinica, 20, 1709-1723, (2010) · Zbl 1410.62172
[26] Wellner, J. A.; Zhan, Y., A hybrid algorithm for computation of the nonparametric maximum likelihood estimator from censored data, Journal of the American Statistical Association, 92, 945-959, (1997) · Zbl 0889.62026
[27] Williams, J. S.; Lagakos, S. W., Models for censored survival analysis—constant-sum and variable-sum models, Biometrika, 64, 215-224, (1977) · Zbl 0366.92009
[28] Zhang, Z. G.; Sun, J. G.; Sun, L. Q., Statistical analysis of current status data with informative observation times, Statistics in Medicine, 24, 1399-1407, (2005)
[29] Zhang, Z.; Sun, L.; Sun, J.; Finkelstein, D. M., Regression analysis of failure time data with informative interval censoring, Statistics in Medicine, 26, 2533-2546, (2007)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.