×

Jackknife empirical likelihood test for high-dimensional regression coefficients. (English) Zbl 1468.62226

Summary: A novel way to test coefficients in high-dimensional linear regression model is presented. Under the ‘large \(p\) small \(n\)’ situation, the traditional methods, like \(F\)-test and \(t\)-test, are unsuitable or undefined. The proposed jackknife empirical likelihood test has an asymptotic chi-square distribution and the conditions are much weaker than those in the existing methods. Moreover, an extension of the proposed method can test part of the regression coefficients, which is practical in considering the significance for a subset of covariates. Simulations show that the proposed test has a good control of the type-I error, and is more powerful than P.-S. Zhong and S. X. Chen’s [J. Am. Stat. Assoc. 106, No. 493, 260–274 (2011; Zbl 1396.62110)] method in most cases. The proposed test is employed to analyze a rheumatoid arthritis data to find the association between rheumatoid arthritis and the SNPs on the chromosomes 6.

MSC:

62-08 Computational methods for problems pertaining to statistics
62H15 Hypothesis testing in multivariate analysis
62J05 Linear regression; mixed models

Citations:

Zbl 1396.62110
Full Text: DOI

References:

[1] Amos, C.; Chen, W.; Seldin, M.; Remmers, E.; Taylor, K.; Criswell, L.; Lee, A.; Plenge, R.; Kastner, D.; Gregersen, P., Data for genetic analysis workshop 16 problem 1, association analysis of rheumatoid arthritis data, 3, (2009), BioMed Central Ltd, S2
[2] Bai, Z. D.; Saranadasa, H., Effect of high dimension: by an example of a two sample problem, Statist. Sinica, 6, 311-329, (1996) · Zbl 0848.62030
[3] Bravo, F., Second-order power comparisons for a class of nonparametric likelihood-based tests, Biometrika, 90, 881-890, (2003) · Zbl 1436.62160
[4] Chen, S. X.; Peng, L.; Qin, Y. L., Effects of data dimension on empirical likelihood, Biometrika, 96, 711-722, (2009) · Zbl 1170.62023
[5] Chen, S. X.; Qin, Y. L., A two-sample test for high-dimensional data with applications to gene-set testing, Ann. Statist., 38, 808-835, (2010) · Zbl 1183.62095
[6] Chen, S. X.; Van Keilegom, I., A review on empirical likelihood methods for regression, Test, 18, 415-447, (2009) · Zbl 1203.62035
[7] Chen, S. X.; Zhang, L. X.; Zhong, P. S., Tests for high-dimensional covariance matrices, J. Amer. Statist. Assoc., 105, (2010) · Zbl 1321.62086
[8] Diao, G.; Hanlon, B.; Vidyashankar, A. N., Multiple testing for high-dimensional data, Perspect. Big Data Anal. Methodol. Appl., 622, 95, (2014) · Zbl 1320.62129
[9] Donoho, D.L., et al. 2000. High-dimensional data analysis: The curses and blessings of dimensionality. In: American Mathematical Society Conf. Math Chanllenges of the 21st Century.
[10] El Karoui, N.; Bean, D.; Bickel, P.; Lim, C.; Yu, B., On robust regression with high-dimensional predictors, Proc. Natl. Acad. Sci. USA, 110, 14557-14562, (2013) · Zbl 1359.62184
[11] Ellinghaus, E.; Stuart, P. E.; Ellinghaus, D.; Nair, R. P.; Debrus, S.; Raelson, J. V.; Belouchi, M.; Tejasvi, T.; Li, Y.; Tsoi, L. C., Genome-wide meta-analysis of psoriatic arthritis identifies susceptibility locus at REL, J. Invest. Dermatol., 132, 1133-1140, (2012)
[12] Fan, J.; Li, R., Variable selection via nonconcave penalized likelihood and its oracle properties, J. Amer. Statist. Assoc., 96, 1348-1360, (2001) · Zbl 1073.62547
[13] Fan, J.; Lv, J., Sure independence screening for ultrahigh dimensional feature space, J. R. Stat. Soc. Ser. B Stat. Methodol., 70, 849-911, (2008) · Zbl 1411.62187
[14] Goeman, J. J.; Van De Geer, S. A.; Van Houwelingen, H. C., Testing against a high dimensional alternative, J. R. Stat. Soc. Ser. B Stat. Methodol., 68, 477-493, (2006) · Zbl 1110.62002
[15] Goeman, J. J.; Van Houwelingen, H. C.; Finos, L., Testing against a high-dimensional alternative in the generalized linear model: asymptotic type I error control, Biometrika, 98, 381-390, (2011) · Zbl 1215.62068
[16] Gong, Y.; Peng, L.; Qi, Y., Smoothed jackknife empirical likelihood method for ROC curve, J. Multivariate Anal., 101, 1520-1531, (2010) · Zbl 1186.62053
[17] Huang, Jian; Ma, Shuange; Zhang, Cun-Hui, The iterated lasso for high-dimensional logistic regression, the university of iowa department of statistical and actuarial science technical report, 392, (2008)
[18] Huber, P., Robust regression: asymptotics, conjectures and Monte Carlo, Ann. Statist., 799-821, (1973) · Zbl 0289.62033
[19] Jing, B. Y.; Yuan, J.; Zhou, W., Jackknife empirical likelihood, J. Amer. Statist. Assoc., 104, 1224-1332, (2009) · Zbl 1388.62136
[20] Li, Q.; Hu, J.; Ding, J.; Zheng, G., Fisher’s method of combining dependent statistics using generalizations of the gamma distribution with applications to genetic pleiotropic associations, Biostatistics, 15, 284-295, (2014)
[21] Owen, A., Empirical likelihood, (2001), Chapman and Hall/CRC · Zbl 0989.62019
[22] Peng, L.; Qi, Y.; Wang, R., Empirical likelihood test for high dimensional linear models, Statist. Probab. Lett., 86, 74-79, (2014)
[23] Peng, L.; Qi, Y.; Wang, R.; Yang, J., Jackknife empirical likelihood method for some risk measures and related quantities, Insurance Math. Econom., 51, 142-150, (2012) · Zbl 1284.62205
[24] Portnoy, S., Asymptotic behavior of M-estimators of p regression parameters when \(p^2 / n\) is large. I. consistency, Ann. Statist., 12, 1298-1309, (1984) · Zbl 0584.62050
[25] Wang, R.; Peng, L., Jackknife empirical likelihood intervals for spearmans rho, N. Am. Actuar. J., 15, 475-486, (2011) · Zbl 1291.62117
[26] Wang, R.; Peng, L.; Qi, Y., Jackknife empirical likelihood test for equality of two high dimensional means, Statist. Sinica, 23, 667-690, (2013) · Zbl 1379.62041
[27] Zhang, R.; Peng, L.; Wang, R., Tests for covariance matrix with fixed or divergent dimension, Ann. Statist., 41, 2075-2096, (2013) · Zbl 1277.62151
[28] Zheng, G.; Wu, C. O.; Kwak, M.; Jiang, W.; Joo, J.; Lima, J. A., Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling, Genet. Epidemiol., 36, 263-273, (2012)
[29] Zhong, P. S.; Chen, S. X., Tests for high-dimensional regression coefficients with factorial designs, J. Amer. Statist. Assoc., 106, 260-274, (2011) · Zbl 1396.62110
[30] Zou, H., The adaptive lasso and its oracle properties, J. Amer. Statist. Assoc., 101, 1418-1429, (2006) · Zbl 1171.62326
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.