Density estimation under qualitative assumptions in higher dimensions. (English) Zbl 0847.62027
Summary: We study the problem of estimating a real valued function in \(\mathbb{R}^d\) under qualitative assumptions like monotonicity, symmetry, or modality. We consider density estimation in an i.i.d. setup. Assumptions on the underlying density \(f\) are formulated as shape restrictions. We assume that all density contour clusters \(\Gamma (\lambda) = \Gamma_f(\lambda) = \{x \in \mathbb{R}^d : f(x) \geq \lambda\}\), \(\lambda > 0\), lie in a given class \(\mathbb{C}\) of measurable subsets of \(\mathbb{R}^d\). Under this assumption the sets \(\Gamma(\lambda)\) are estimated by using the so-called excess mass approach. This leads to certain minumum volume sets (at random levels) as estimators for \(\Gamma (\lambda)\). A minimum volume set in \(\mathbb{C}\) at level \(\alpha\), by definition, has minimal Lebesgue measure among all sets in \(\mathbb{C}\) containing empirical mass not less than \(\alpha\). Using these estimators of \(\Gamma (\lambda)\) a density estimator called silhouette is constructed. The goal of this paper is to study the (asymptotic) behaviour of the silhouette.
The paper is organized as follows: In Section 2 estimators of the density contour clusters are given and the silhouette is defined. The connections of the silhouette to the Grenander estimator [U. Grenander, Skand. Aktuarietidskr. 1956, 125-153 (1957; Zbl 0077.33715)] and to the estimator of T. W. Sager [Lect. Notes Stat., Springer-Verlag 37, 69-90 (1986; Zbl 0609.62055)] are given in Section 2 also. Section 3 contains asymptotic results for the silhouette. Consistency results and rates of convergence in terms of \(L_1\)-distance are derived by means of empirical process theory. Section 4 contains some concluding remarks. Among others it is indicated there that the presented approach to density estimation can in principle also be applied to other (non-i.i.d.) situations such as nonparametric regression or spectral density estimation in time series analysis. All the proofs are given in Section 5.
The paper is organized as follows: In Section 2 estimators of the density contour clusters are given and the silhouette is defined. The connections of the silhouette to the Grenander estimator [U. Grenander, Skand. Aktuarietidskr. 1956, 125-153 (1957; Zbl 0077.33715)] and to the estimator of T. W. Sager [Lect. Notes Stat., Springer-Verlag 37, 69-90 (1986; Zbl 0609.62055)] are given in Section 2 also. Section 3 contains asymptotic results for the silhouette. Consistency results and rates of convergence in terms of \(L_1\)-distance are derived by means of empirical process theory. Section 4 contains some concluding remarks. Among others it is indicated there that the presented approach to density estimation can in principle also be applied to other (non-i.i.d.) situations such as nonparametric regression or spectral density estimation in time series analysis. All the proofs are given in Section 5.
MSC:
62G07 | Density estimation |
62G20 | Asymptotic properties of nonparametric inference |
62H12 | Estimation in multivariate analysis |
62G30 | Order statistics; empirical distribution functions |