×

An explicit mean-covariance parameterization for multivariate response linear regression. (English) Zbl 07499905

Summary: We develop a new method to fit the multivariate response linear regression model that exploits a parametric link between the regression coefficient matrix and the error covariance matrix. Specifically, we assume that the correlations between entries in the multivariate error random vector are proportional to the cosines of the angles between their corresponding regression coefficient matrix columns, so as the angle between two regression coefficient matrix columns decreases, the correlation between the corresponding errors increases. We highlight two models under which this parameterization arises: a latent variable reduced-rank regression model and the errors-in-variables regression model. We propose a novel nonconvex weighted residual sum of squares criterion which exploits this parameterization and admits a new class of penalized estimators. The optimization is solved with an accelerated proximal gradient descent algorithm. Our method is used to study the association between microRNA expression and cancer drug activity measured on the NCI-60 cell lines. An R package implementing our method, MCMVR, is available online.

MSC:

62-XX Statistics

Software:

FRCC; MCMVR

References:

[1] Breiman, L.; Friedman, J. H., “Predicting Multivariate Responses in Multiple Linear Regression” (with discussion and a reply by the authors), Journal of the Royal Statistical Society, Series B, 59, 3-54 (1997) · Zbl 0897.62068 · doi:10.1111/1467-9868.00054
[2] Chen, L.; Huang, J. Z., “Sparse Reduced-Rank Regression for Simultaneous Dimension Reduction and Variable Selection, Journal of the American Statistical Association, 107, 1533-1545 (2012) · Zbl 1258.62075 · doi:10.1080/01621459.2012.734178
[3] Chen, T.-H.; Sun, W., “Prediction of Cancer Drug Sensitivity Using High-Dimensional Omic Features, Biostatistics, 18, 1-14 (2017) · doi:10.1093/biostatistics/kxw022
[4] Cook, R. D.; Zhang, X., “Foundations for Envelope Models and Methods, Journal of the American Statistical Association, 110, 599-611 (2015) · Zbl 1390.62131 · doi:10.1080/01621459.2014.983235
[5] Cruz-Cano, R.; Lee, M.-L. T., “Fast Regularized Canonical Correlation Analysis,”, Computational Statistics & Data Analysis, 70, 88-100 (2014) · Zbl 1471.62048
[6] Datta, A.; Zou, H., “CoCoLasso for High-Dimensional Error-in-Variables Regression, The Annals of Statistics, 45, 2400-2426 (2017) · Zbl 1486.62210 · doi:10.1214/16-AOS1527
[7] Gleser, L. J.; Watson, G. S., “Estimation of a Linear Transformation, Biometrika, 60, 525-534 (1973) · Zbl 0271.62074 · doi:10.1093/biomet/60.3.525
[8] Huber, P. J., “Robust Estimation of a Location Parameter, The Annals of Mathematical Statistics, 35, 73-101 (1964) · Zbl 0136.39805 · doi:10.1214/aoms/1177703732
[9] Izenman, A. J., “Reduced-Rank Regression for the Multivariate Linear Model, Journal of Multivariate Analysis, 5, 248-264 (1975) · Zbl 0313.62042 · doi:10.1016/0047-259X(75)90042-1
[10] Lange, K., MM Optimization Algorithms (2016), Philadelphia, PA: Society for Industrial and Applied Mathematics, Philadelphia, PA · Zbl 1357.90002
[11] Lee, W.; Liu, Y., “Simultaneous Multiple Response Regression and Inverse Covariance Matrix Estimation via Penalized Gaussian Maximum Likelihood, Journal of Multivariate Analysis, 111, 241-255 (2012) · Zbl 1259.62043 · doi:10.1016/j.jmva.2012.03.013
[12] Li, H.; Lin, Z.; Cortes, C.; Lawrence, N. D.; Lee, D. D.; Sugiyama, M.; Garnett, R., Advances in Neural Information Processing Systems, 28), Accelerated Proximal Gradient Methods for Nonconvex Programming, 379-387 (2015), Curran Associates, Inc
[13] Liu, H.; D’Andrade, P.; Fulmer-Smentek, S.; Lorenzi, P.; Kohn, K. W.; Weinstein, J. N.; Pommier, Y.; Reinhold, W. C., “mRNA and microRNA Expression Profiles of the NCI-60 Integrated With Drug Activities, Molecular Cancer Therapeutics, 9, 1080-1091 (2010) · doi:10.1158/1535-7163.MCT-09-0965
[14] Molstad, A. J.; Rothman, A. J., “Indirect Multivariate Response Linear Regression, Biometrika, 103, 595-607 (2016) · Zbl 1506.62338 · doi:10.1093/biomet/asw034
[15] Obozinski, G.; Wainwright, M. J.; Jordan, M. I., “Support Union Recovery in High-Dimensional Multivariate Regression, The Annals of Statistics, 39, 1-47 (2011) · Zbl 1373.62372 · doi:10.1214/09-AOS776
[16] Parikh, N.; Boyd, S., “Proximal Algorithms, Foundations and Trends in Optimization, 1, 127-239 (2014) · doi:10.1561/2400000003
[17] Peng, J.; Zhu, J.; Bergamaschi, A.; Han, W.; Noh, D.-Y.; Pollack, J. R.; Wang, P., “Regularized Multivariate Regression for Identifying Master Predictors With Application to Integrative Genomics Study of Breast Cancer, The Annals of Applied Statistics, 4, 53-77 (2010) · Zbl 1189.62174 · doi:10.1214/09-AOAS271
[18] Peng, Y.; Croce, C. M., “The Role of MicroRNAs in Human Cancer, Signal Transduction and Targeted Therapy, 1, 15004 (2016) · doi:10.1038/sigtrans.2015.4
[19] Perrot-Dockès, M.; Lévy-Leduc, C.; Sansonnet, L.; Chiquet, J., “Variable Selection in Multivariate Linear Models With High-Dimensional Covariance Matrix Estimation, Journal of Multivariate Analysis, 166, 78-97 (2018) · Zbl 1499.62180 · doi:10.1016/j.jmva.2018.02.006
[20] Pourahmadi, M., “Joint Mean-Covariance Models With Applications to Longitudinal Data: Unconstrained Parameterisation, Biometrika, 86, 677-690 (1999) · Zbl 0949.62066 · doi:10.1093/biomet/86.3.677
[21] Pourahmadi, M., High-Dimensional Covariance Estimation: With High-Dimensional Data (2013), Hoboken, NJ: Wiley, Hoboken, NJ · Zbl 1276.62031
[22] Rothman, A. J.; Levina, E.; Zhu, J., “Sparse Multivariate Regression With Covariance Estimation, Journal of Computational and Graphical Statistics, 19, 947-962 (2010) · doi:10.1198/jcgs.2010.09188
[23] Shoemaker, R. H., “The NCI-60 Human Tumour Cell Line Anticancer Drug Screen, Nature Reviews Cancer, 6, 813 (2006) · doi:10.1038/nrc1951
[24] Velu, R.; Reinsel, G. C., Multivariate Reduced-Rank Regression: Theory and Applications, 136 (2013), New York: Springer-Verlag, New York
[25] Yin, J.; Li, H., “A Sparse Conditional Gaussian Graphical Model for Analysis of Genetical Genomics Data, The Annals of Applied Statistics, 5, 2630-2650 (2011) · Zbl 1234.62151 · doi:10.1214/11-AOAS494
[26] Yuan, M.; Ekici, A.; Lu, Z.; Monteiro, R., “Dimension Reduction and Coefficient Estimation in Multivariate Linear Regression, Journal of the Royal Statistical Society, Series B, 69, 329-346 (2007) · Zbl 07555355 · doi:10.1111/j.1467-9868.2007.00591.x
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.