×

A new multilevel modeling approach for clustered survival data. (English) Zbl 1447.62076

Summary: In multilevel modeling of clustered survival data, to account for the differences among different clusters, a commonly used approach is to introduce cluster effects, either random or fixed, into the model. Modeling with random effects may lead to difficulties in the implementation of the estimation procedure for the unknown parameters of interest because the numerical computation of multiple integrals may become unavoidable when the cluster effects are not scalars. On the other hand, if fixed effects are used, there is a danger of having estimators with large variances because there are too many nuisance parameters involved in the model. In this article, using the idea of the homogeneity pursuit, we propose a new multilevel modeling approach for clustered survival data. The proposed modeling approach does not have the potential computational problem as modeling with random effects, and it also involves far fewer unknown parameters than modeling with fixed effects. We also establish asymptotic properties to show the advantages of the proposed model and conduct intensive simulation studies to demonstrate the performance of the proposed method. Finally, the proposed method is applied to analyze a dataset on the second-birth interval in Bangladesh. The most interesting finding is the impact of some important factors on the length of the second-birth interval variation over clusters and its homogeneous structure.

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62N05 Reliability and life testing

Software:

AS 136
Full Text: DOI

References:

[1] Andersen, P.K. & Gill, R.D. (1982) Cox’s regression model for counting processes: A large sample study.Annals of Statistics10, 1100-1120. · Zbl 0526.62026
[2] Ando, T. & Bai, J. (2017) Clustering huge number of financial time series: A panel data approach with high-dimensional predictors and factor structures.Journal of the American Statistical Association112, 1182-1198.
[3] Azuma, K. (1967) Weighted sums of certain dependent random variables.Tohoku Mathematical Journal, Second Series19, 357-367. · Zbl 0178.21103
[4] Bai, J. (1997) Estimating multiple breaks one at a time.Econometric Theory13, 315-352.
[5] Bonhomme, S. & Manresa, E. (2015) Grouped patterns of heterogeneity in panel data.Econometrica83, 1147-1184. · Zbl 1410.62100
[6] Bradic, J., Fan, J., & Jiang, J. (2011) Regularization for Cox’s proportional hazards model with np-dimensionality.Annals of Statistics39, 3092-3210. · Zbl 1246.62202
[7] Breslow, N. (1972) Comment on “regression and life tables” by DR Cox. Journal of the Royal Statistical Society, Series B34, 216-217.
[8] David, C.R. (1972) Regression models and life tables (with discussion).Journal of the Royal Statistical Society, Series B34, 187-220.
[9] Fleming, T.R. & Harrington, D.P. (2011) Counting Processes and Survival Analysis.Wiley. · Zbl 0727.62096
[10] Hartigan, J.A. & Wong, M.A. (1979) Algorithm as 136: A k-means clustering algorithm.Journal of the Royal Statistical Society, Series C28, 100-108. · Zbl 0447.62062
[11] Harvey, G. (2003) Multilevel Statistical Models. 3rd edition. Arnold, London, UK. · Zbl 1014.62126
[12] Hoeffding, W. (1963) Probability Inequalities for Sums of Bounded Random Variables. Vol. 58. Taylor & Francis, pp. 13-30. · Zbl 0127.10602
[13] Ke, Y., Li, J., & Zhang, W. (2016) Structure identification in panel data analysis.Annals of Statistics44, 1193-1233. · Zbl 1341.62214
[14] Ke, Z.T., Fan, J., & Wu, Y. (2015) Homogeneity pursuit.Journal of the American Statistical Association110, 175-194. · Zbl 1373.62345
[15] Mitra, S., Al-Sabir, A., Cross, A.R., & Jamil, K. (1997) Bangladesh Demographic and Health Survey 1996-1997. National Institute of Population Research and Training (NIPORT), Mitra and Associates, and Macro International Inc., Dhakaand Calverton, MD.
[16] Ortega, J.M. & Rheinboldt, W.C. (1970) Iterative Solution of Nonlinear Equations in Several Variables. Academic Press, San Diego, CA. · Zbl 0241.65046
[17] Su, L. & Ju, G. (2018) Identifying latent grouped patterns in panel data models with interactive fixed effects.Journal of Econometrics206, 554-573. · Zbl 1452.62960
[18] Su, L., Shi, Z., & Phillips, P.C. (2016) Identifying latent structures in panel data.Econometrica84, 2215-2264. · Zbl 1410.62110
[19] Su, L., Liangjun, W.X., & Jin, S. (2019) Sieve estimation of time-varying panel data models with latent structures.Journal of Business & Economic Statistics37, 334-349.
[20] Venkatraman, E.S. (1993) Consistency results in multiple change-point problems. Ph.D. thesis, Stanford Univ., ProQuest LLC, Ann Arbor, MI.
[21] Volinsky, C.T. & Raftery, A.E. (2000) Bayesian information criterion for censored survival models.Biometrics56, 256-262. · Zbl 1060.62557
[22] Vostrikova, L. (1981) Detection of the disorder in multidimensional random-processes.Doklady Akademii Nauk SSSR259, 270-274.
[23] Wang, W., Phillips, P.C., & Su, L. (2018) Homogeneity pursuit in panel data models: Theory and application.Journal of Applied Econometrics33, 797-815.
[24] Wang, W., & Su, L. (2019) Identifying latent group structures in nonlinear panels.Journal of Econometrics (in press).
[25] Zhang, W., & Steele, F. (2004) A semiparametric multilevel survival model.Journal of the Royal Statistical Society: Series C (Applied Statistics)53, 387-404. · Zbl 1111.62361
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.