Detection and feature selection in sparse mixture models

October 2017 Detection and feature selection in sparse mixture models

Nicolas Verzelen, Ery Arias-Castro

Ann. Statist. 45(5): 1920-1950 (October 2017). DOI: 10.1214/16-AOS1513

Abstract

We consider Gaussian mixture models in high dimensions, focusing on the twin tasks of detection and feature selection. Under sparsity assumptions on the difference in means, we derive minimax rates for the problems of testing and of variable selection. We find these rates to depend crucially on the knowledge of the covariance matrices and on whether the mixture is symmetric or not. We establish the performance of various procedures, including the top sparse eigenvalue of the sample covariance matrix (popular in the context of Sparse PCA), as well as new tests inspired by the normality tests of Malkovich and Afifi [J. Amer. Statist. Assoc. 68 (1973) 176–179].

Citation

Download Citation

Nicolas Verzelen. Ery Arias-Castro. "Detection and feature selection in sparse mixture models." Ann. Statist. 45 (5) 1920 - 1950, October 2017. https://doi.org/10.1214/16-AOS1513

Information

Received: 1 May 2014; Revised: 1 December 2015; Published: October 2017

First available in Project Euclid: 31 October 2017

zbMATH: 06821114

MathSciNet: MR3718157

Digital Object Identifier: 10.1214/16-AOS1513

Subjects:

Primary: 62H15 , 62H30

Keywords: detection of mixtures , feature selection for mixtures , Gaussian mixture models , projection tests based on moments , sparse mixture models , the sparse eigenvalue problem

JOURNAL ARTICLE
31 PAGES

DOWNLOAD PDF + SAVE TO MY LIBRARY