×

Variable selection for semiparametric varying-coefficient partially linear models with missing response at random. (English) Zbl 1225.62061

Summary: We present a variable selection procedure by combining basis function approximations with penalized estimating equations for semiparametric varying-coefficient partially linear models with missing response at random. The proposed procedure simultaneously selects significant variables in parametric components and nonparametric components. With appropriate selection of the tuning parameters, we establish the consistency of the variable selection procedure and the convergence rate of the regularized estimators. A simulation study is undertaken to assess the finite sample performance of the proposed variable selection procedure.

MSC:

62G08 Nonparametric regression and quantile regression
62G20 Asymptotic properties of nonparametric inference
65C20 Probabilistic models, generic numerical methods in probability and statistics

Keywords:

SCAD; missing data
Full Text: DOI

References:

[1] Li, Q., Huang, C. J., Li, D., et al.: Semiparametric smooth coefficient models. Journal of Business & Economic Statistics, 20, 412–422 (2002) · doi:10.1198/073500102288618531
[2] Zhang, W., Lee, S. Y., Song, X.: Local polynomial fitting in semivarying coefficient models. Journal of Multivariate Analysis, 82, 166–188 (2002) · Zbl 0995.62038 · doi:10.1006/jmva.2001.2012
[3] Fan, J. Q., Huang, T.: Profile likelihood inference on semiparametric varying-coefficient partially linear models. Bernoulli, 11, 1031–1057 (2005) · Zbl 1098.62077 · doi:10.3150/bj/1137421639
[4] You, J. H., Zhou, Y.: Empirical likelihood for semiparametric varying-coefficient partially linear regression models. Statistics & Probability Letters, 76, 412–422 (2006) · Zbl 1086.62057 · doi:10.1016/j.spl.2005.08.029
[5] Fan, J. Q., Li, R.: Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association, 96, 1348–1360 (2001) · Zbl 1073.62547 · doi:10.1198/016214501753382273
[6] Wang, L., Chen, G., Li, H.: Group SCAD regression analysis for microarray time course gene expression data. Bioinformatics, 23, 1486–1494 (2007) · doi:10.1093/bioinformatics/btm125
[7] Wang, L., Li, H., Huang, J. Z.: Variable selection in nonparametric varying-coefficient models for analysis of repeated measurements. Journal of the American Statistical Association, 103, 1556–1569 (2008) · Zbl 1286.62034 · doi:10.1198/016214508000000788
[8] Li, R., Liang, H.: Variable selection in semiparametric regression modeling. The Annals of Statistics, 36, 261–286 (2008) · Zbl 1132.62027 · doi:10.1214/009053607000000604
[9] Wang, Q. H., Sun, Z. H.: Estimation in partially linear models with missing responses at random. Journal of Multivariate Analysis, 98, 1470–1493 (2007) · Zbl 1116.62042 · doi:10.1016/j.jmva.2006.10.003
[10] Sun, Z. H., Wang, Q. H., Dai, P. J.: Model checking for partially linear models with missing responses at random. Journal of Multivariate Analysis, 100, 636–651 (2009) · Zbl 1163.62032 · doi:10.1016/j.jmva.2008.07.002
[11] Zhou, Y., Wan, A. T. K., Wang, X. J.: Estimating equations inference with missing data. Journal of the American Statistical Association, 103, 1187–1199 (2008) · Zbl 1205.62037 · doi:10.1198/016214508000000535
[12] Tong, X. W., Zhu, L., Sun, J. G.: Variable selection for recurrent event data via nonconcave penalized estimating function. Lifetime Data Analysis, 15, 197–215 (2009) · Zbl 1282.62083 · doi:10.1007/s10985-008-9104-2
[13] Tong, X. W., He, X., Sun, L. Q., et al.: Variable selection for panel count data via nonconcave penalized estimating function. Scandinavian Journal of Statistics, 36, 620–635 (2009) · Zbl 1224.62007 · doi:10.1111/j.1467-9469.2009.00658.x
[14] Tang, Q. G., Cheng, L. S.: M-estimation and B-spline approximation for varying coefficient models with longitudinal data. Journal of Nonparametric Statistics, 20, 611–625 (2008) · Zbl 1147.62028 · doi:10.1080/10485250802375950
[15] Zou, H.: The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101, 1418–1429 (2006) · Zbl 1171.62326 · doi:10.1198/016214506000000735
[16] Xue, L. G., Zhu, L. X.: Empirical likelihood for a varying coefficient model with longitudinal data. Journal of the American Statistical Association, 102, 642–654 (2007) · Zbl 1172.62306 · doi:10.1198/016214507000000293
[17] Xue, L. G., Zhu, L. X.: Empirical likelihood semiparametric regression analysis for longitudinal data. Biometrika, 94, 921–937 (2007) · Zbl 1156.62324 · doi:10.1093/biomet/asm066
[18] Li, G. R., Tian, P., Xue, L. G.: Generalized empirical likelihood inference in semiparametric regression model for longitudinal data. Acta Mathematica Sinica, English Series, 24, 2029–2040 (2008) · Zbl 1151.62316 · doi:10.1007/s10114-008-6434-7
[19] Schumaker, L. L.: Spline Functions, Wiley, New York, 1981
[20] Johnson, B. A.: Variable selection in semiparametric linear regression with censored data. Journal of the Royal Statistical Society: Series B, 70, 351–370 (2008) · Zbl 1148.62052 · doi:10.1111/j.1467-9868.2008.00639.x
[21] Johnson, B. A., Lin, D. Y., Zeng, D. L.: Penalized estimating functions and variable selection in semiparametric regression models. Journal of the American Statistical Association, 103, 672–680 (2008) · Zbl 1471.62330 · doi:10.1198/016214508000000184
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.