×

Variable selection in Cox regression models with varying coefficients. (English) Zbl 1432.62338

Summary: We deal with Cox regression models with varying coefficients. We concentrate on time-varying coefficient models and give a brief comment on another kind of varying coefficient models. When we have \(p\)-dimensional covariates and \(p\) increases with the sample size, it is often the case that only a small part of the covariates are relevant. Therefore we consider variable selection and estimation of the coefficient functions by using the group SCAD-type estimator and the adaptive group Lasso estimator. We examine the theoretical properties of the estimators, especially the \(L_2\) convergence rate, the sparsity, and the oracle property. Simulation studies and a real data analysis show the performance of these procedures.

MSC:

62N02 Estimation in survival analysis and censored data
62G08 Nonparametric regression and quantile regression
62J07 Ridge regression; shrinkage estimators (Lasso)
62G20 Asymptotic properties of nonparametric inference
62-08 Computational methods for problems pertaining to statistics
Full Text: DOI

References:

[1] Antoniadis, A.; Fryzlewicz, P.; Letué, F., The Dantzig selector in Cox’s proportional hazards model, Scand. J. Stat., 37, 531-552 (2010) · Zbl 1349.62473
[2] Bender, R.; Augustin, T.; Blettner, M., Generating survival times to simulate Cox proportional hazards models, Stat. Med., 24, 1713-1723 (2005)
[3] Bradic, J.; Fan, J.; Jiang, J., Regularization for Cox’s proportional hazards model with NP-dimensionality, Ann. Stat., 39, 3092-3120 (2011) · Zbl 1246.62202
[5] Bühlmann, P.; van de Geer, S., Statistics for High-Dimensional DataMethods Theory and Applications (2011), Springer: Springer New York, Dordrecht, Heidelberg, London · Zbl 1273.62015
[6] Cai, J.; Fan, J.; Jiang, J.; Zhou, H., Partially linear hazard regression for multivariate survival data, J. Am. Stat. Assoc., 102, 538-551 (2007) · Zbl 1172.62328
[7] Cai, J.; Fan, J.; Jiang, J.; Zhou, H., Partially linear hazard regression with varying coefficients for multivariate survival data, J. R. Stat. Soc. B, 70, 141-158 (2008) · Zbl 1400.62217
[8] Cai, J.; Fan, J.; Li, R.; Zhou, H., Variable selection for multivariate failure time data, Biometrika, 92, 303-316 (2005) · Zbl 1094.62123
[9] Cai, J.; Fan, J.; Zhou, H.; Zhou, Y., Hazard models with varying coefficients for multivariate failure time data, Ann. Stat., 35, 324-354 (2007) · Zbl 1114.62104
[10] Cai, Z.; Sun, Y., Local linear estimation for time-dependent coefficients in Cox’s regression models, Scand. J. Stat., 30, 93-111 (2003) · Zbl 1034.62096
[11] Chen, K.; Lin, H.; Zhou, Y., Efficient estimation for the Cox model with varying coefficients, Biometrika, 99, 379-392 (2012) · Zbl 1318.62291
[12] DeVore, R. A.; Lorentz, G. G., Constructive Approximation (1993), Springer: Springer Berlin · Zbl 0797.41016
[13] Du, P.; Ma, S.; Liang, H., Penalized variable selection procedure for Cox models with semiparametric relative risk, Ann. Stat., 38, 2092-2117 (2010) · Zbl 1202.62132
[14] Fan, J.; Lin, H.; Zhou, Y., Local partial-likelihood estimation for lifetime data, Ann. Stat., 34, 290-325 (2006) · Zbl 1091.62099
[15] Fan, J.; Li, R., Variable selection via nonconcave penalized likelihood and its oracle properties, J. Am. Stat. Assoc., 96, 1348-1360 (2001) · Zbl 1073.62547
[16] Fan, J.; Lv, J., Nonconcave penalized likelihood with NP-dimensionality, IEEE Trans. Inf. Theory, 57, 5467-5484 (2011) · Zbl 1365.62277
[17] Fan, J.; Peng, H., Nonconcave penalized likelihood with a diverging number of parameters, Ann. Stat., 32, 928-961 (2004) · Zbl 1092.62031
[18] Fan, J.; Zhang, W., Statistical methods with varying coefficient models, Stat. Interface, 1, 179-195 (2008) · Zbl 1230.62031
[19] Fleming, T. R.; Harrington, D. P., Counting Processes and Survival Analysis (1991), Wiley: Wiley Hoboken, NJ · Zbl 0727.62096
[20] Hastie, T.; Tibshirani, R., Varying-coefficient models, J. R. Stat. Soc. B, 55, 757-796 (1993) · Zbl 0796.62060
[22] Hu, Y.; Lian, H., Variable selection in partially linear proportional hazards model with a diverging dimensionality, Stat. Probab. Lett., 83, 61-69 (2012) · Zbl 1489.62307
[23] Huang, J. Z., Projection estimation in multiple regression with application to functional ANOVA models, Ann. Stat., 26, 242-272 (1998) · Zbl 0930.62042
[24] Huang, J. Z.; Kooperberg, C.; Stone, C. J.; Truong, Y. K., Functional ANOVA modeling for proportional hazards regression, Ann. Stat., 28, 961-999 (2000) · Zbl 1105.62378
[25] Huang, J. Z.; Stone, C. J., The \(L_2\) rate of convergence for event history regression with time-dependent covariates, Scand. J. Stat., 25, 603-620 (1998) · Zbl 0927.62087
[26] Huang, Z. S.; Lin, B.; Feng, F.; Pang, Z., Efficient penalized estimating method in the partially varying-coefficient single-index model, J. Multivar. Anal., 114, 189-200 (2012) · Zbl 1255.62056
[27] Kalbfleisch, J. D.; Prentice, R. L., The Statistical Analysis of Failure Time Data (2002), Wiley: Wiley Hoboken, NJ · Zbl 1012.62104
[29] Leng, C.; Zhang, H. H., The \(L_2\) rate of convergence for event history regression with time-dependent covariates, J. Nonparam. Stat., 18, 417-429 (2006) · Zbl 1116.62106
[30] Lian, H., Variable selection for high-dimensional generalized varying-coefficient models, Stat. Sinica, 22, 1563-1588 (2012) · Zbl 1253.62027
[31] Lian, H.; Li, J.; Hu, Y., Shrinkage variable selection and estimation in proportional hazards models with additive structure and high dimensionality, Comp. Stat. Data Anal., 63, 99-112 (2013) · Zbl 1468.62119
[32] Meier, L.; van de Geer, S.; Bühlmann, P., The group lasso for logistic regression, J. R. Stat. Soc. B, 70, 53-71 (2008) · Zbl 1400.62276
[33] Noh, H. S.; Park, B. U., Sparse variable coefficient models for longitudinal data, Stat. Sinica, 20, 1183-1202 (2010) · Zbl 1507.62241
[34] Schumaker, L. L., Spline FunctionsBasic Theory (2007), Cambridge University Press: Cambridge University Press Cambridge · Zbl 1123.41008
[35] Tian, L.; Zucker, D.; Wei, L. J., On the Cox model with time-varying regression coefficients, J. Am. Stat. Assoc., 100, 172-183 (2005) · Zbl 1117.62435
[36] Tibshirani, R. J., Regression shrinkage and selection via the Lasso, J. R. Stat. Soc. B, 58, 267-288 (1996) · Zbl 0850.62538
[37] Wang, L.; Li, H.; Huang, J. Z., Variable selection in nonparametric varying-coefficient models for analysis of repeated measurements, J. Am. Stat. Assoc., 103, 1556-1569 (2008) · Zbl 1286.62034
[38] Wang, L.; Liu, X.; Liang, H.; Carroll, R. J., Estimation and variable selection for generalized additive partial linear models, Ann. Stat., 39, 1827-1851 (2011) · Zbl 1227.62053
[39] Wang, S.; Nan, B.; Zhou, N.; Zhu, J., Hierarchically penalized Cox regression with grouped variables, Biometrika, 96, 307-322 (2009) · Zbl 1163.62089
[40] Wei, F.; Huang, J.; Li, H., Variable selection and estimation in high-dimensional varying-coefficient models, Stat. Sinica, 21, 1515-1540 (2011) · Zbl 1225.62056
[41] Yan, J.; Huang, J., Model selection for Cox models with time-varying coefficients, Biometrics, 68, 419-428 (2012) · Zbl 1251.62052
[42] Yuan, M.; Lin, Y., Model selection and estimation in regression with grouped variables, J. R. Stat. Soc. B, 68, 49-67 (2006) · Zbl 1141.62030
[43] Zhang, H. H.; Cheng, G.; Liu, Y., Linear or nonlinear? Automatic structure discovery for partially linear models, J. Am. Stat. Assoc., 106, 1099-1112 (2011) · Zbl 1229.62051
[44] Zhang, H. H.; Lu, W., Adaptive Lasso for Cox’s proportional hazards model, Boimetrika, 94, 691-703 (2007) · Zbl 1135.62083
[45] Zhang, S.; Wang, L.; Lian, H., Estimation by polynomial splines with variable selection in additive Cox models, Statistics, 48, 67-80 (2014) · Zbl 1367.62132
[46] Zou, H., The adaptive Lasso and its oracle properties, J. Am. Stat. Assoc., 101, 1418-1429 (2006) · Zbl 1171.62326
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.