Document Zbl 1229.62085

Adaptive confidence intervals for the test error in classification. (English) Zbl 1229.62085

J. Am. Stat. Assoc. 106, No. 495, 904-913 (2011).

Summary: The estimated test error of a learned classifier is the most commonly reported measure of classifier performance. However, constructing a high-quality point estimator of the test error has proved to be very difficult. Furthermore, common interval estimators (e.g., confidence intervals) are based on the point estimator of the test error and thus inherit all the difficulties associated with the point estimation problem. As a result, these confidence intervals do not reliably deliver nominal coverage. In contrast, we directly construct the confidence interval by using smooth data-dependent upper and lower bounds on the test error. We prove that, for linear classifiers, the proposed confidence interval automatically adapts to the nonsmoothness of the test error, is consistent under fixed and local alternatives, and does not require that the Bayes classifier be linear. Moreover, the method provides nominal coverage on a suite of test problems using a range of classification algorithms and sample sizes.

Cited in 1 Review

Cited in 19 Documents

MSC:

62H30	Classification and discrimination; cluster analysis (statistical aspects)
62F25	Parametric tolerance and confidence regions
65C60	Computational problems in statistics (MSC2010)

Keywords:

nonregular asymptotics; pretesting

