Abstract
We consider Gaussian mixture models in high dimensions, focusing on the twin tasks of detection and feature selection. Under sparsity assumptions on the difference in means, we derive minimax rates for the problems of testing and of variable selection. We find these rates to depend crucially on the knowledge of the covariance matrices and on whether the mixture is symmetric or not. We establish the performance of various procedures, including the top sparse eigenvalue of the sample covariance matrix (popular in the context of Sparse PCA), as well as new tests inspired by the normality tests of Malkovich and Afifi [J. Amer. Statist. Assoc. 68 (1973) 176–179].
Citation
Nicolas Verzelen. Ery Arias-Castro. "Detection and feature selection in sparse mixture models." Ann. Statist. 45 (5) 1920 - 1950, October 2017. https://doi.org/10.1214/16-AOS1513
Information