×

Choosing the most relevant level sets for depicting a sample of densities. (English) Zbl 1417.62062

Summary: When exploring a sample composed with a set of bivariate density functions, the question of the visualisation of the data has to front with the choice of the relevant level set(s). The approach proposed in this paper consists in defining the optimal level set(s) as being the one(s) allowing for the best reconstitution of the whole density. A fully data-driven procedure is developed in order to estimate the link between the level set(s) and their corresponding density, to construct optimal level set(s) and to choose automatically the number of relevant level set(s). The method is based on recent advances in functional data analysis when both response and predictors are functional. After a wide description of the methodology, finite sample studies are presented (including both real and simulated data) while theoretical studies are reported to a final appendix.

MSC:

62G07 Density estimation
62G08 Nonparametric regression and quantile regression
62G20 Asymptotic properties of nonparametric inference

Software:

fda (R)

References:

[1] Aneiros-Pérez G, Cao R, Vilar-Fernàndez J (2011) Functional methods for time series prediction: a nonparametric approach. J Forecast 30(4):377-392 · Zbl 1217.91138 · doi:10.1002/for.1169
[2] Baíllo A (2003) Total error in a plug-in estimator of level sets. Stat Probab Lett 65(4):411-417 · Zbl 1116.62338 · doi:10.1016/j.spl.2003.08.007
[3] Baíllo A, Cuesta-Albertos J, Cuevas A (2001) Convergence rates in nonparametric estimation of level sets. Stat Probab Lett 53(1):27-35 · Zbl 0980.62022 · doi:10.1016/S0167-7152(01)00006-2
[4] Cadre B (2006) Kernel estimation of density level sets. J Multivar Anal 99(4):999-1023 · Zbl 1085.62039 · doi:10.1016/j.jmva.2005.05.004
[5] Choi S, Park J (2014) Nonparametric additive model with grouped lasso and maximizing area under the ROC curve. Comput Stat Data Anal 77:313-325 · Zbl 1506.62044 · doi:10.1016/j.csda.2014.03.010
[6] Delicado P, Vieu P (2014) Optimal level sets for bivariate density representation. J Multivar Anal 140:1-18 · Zbl 1325.62038 · doi:10.1016/j.jmva.2015.04.005
[7] Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348-1360 · Zbl 1073.62547 · doi:10.1198/016214501753382273
[8] Fan J, Peng H (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann Stat 32:928-961 · Zbl 1092.62031 · doi:10.1214/009053604000000256
[9] Ferraty F, Vieu P (2006) Nonparametric functional data analysis. Theory and practice. Springer, New York · Zbl 1119.62046
[10] Ferraty F, Vieu P (2009) Additive prediction and boosting for functional data. Comput Stat Data Anal 53(4):1400-1413 · Zbl 1452.62989 · doi:10.1016/j.csda.2008.11.023
[11] Ferraty F, Hall P, Vieu P (2010) Most-predictive design points for functional data predictors. Biometrika 97(4):807-824 · Zbl 1204.62064 · doi:10.1093/biomet/asq058
[12] Ferraty F, Laksaci A, Tadj A, Vieu P (2011) Kernel regression with functional response. Electron J Stat 5:159-171 · Zbl 1274.62281 · doi:10.1214/11-EJS600
[13] Ferraty F, Van Keilegom I, Vieu P (2012) Regression when both response and predictor are functions. J Multivar Anal 109:10-28 · Zbl 1241.62054 · doi:10.1016/j.jmva.2012.02.008
[14] Goia A (2012) A functional linear model for time series prediction with exogenous variables. Stat Probab Lett 82(5):1005-1011 · Zbl 1241.62124 · doi:10.1016/j.spl.2012.02.009
[15] Härdle W, Marron JS (1985) Optimal bandwidth selection in nonparametric regression function estimation. Ann Stat 13(4):1465-1481 · Zbl 0594.62043 · doi:10.1214/aos/1176349748
[16] Horváth L, Kokoszka P (2012) Inference for functional data with applications. Springer, New York · Zbl 1279.62017 · doi:10.1007/978-1-4614-3655-3
[17] Kudraszow N, Vieu P (2013) Uniform consistency of \[k\] kNN regressors for functional variables. Stat Probab Lett 83(8):1863-1870 · Zbl 1277.62113 · doi:10.1016/j.spl.2013.04.017
[18] Mason D, Polonik W (2009) Asymptotic normality of plug-in level sets estimates. Ann Appl Probab 19(3):1108-1142 · Zbl 1180.62048 · doi:10.1214/08-AAP569
[19] Rachdi M, Vieu P (2007) Nonparametric regression for functional data: automatic smoothing parameter selection. J Stat Plan Inference 137(9):2784-2801 · Zbl 1331.62240 · doi:10.1016/j.jspi.2006.10.001
[20] Ramsay JO, Silverman BW (2002) Applied functional data analysis. Springer, New York · Zbl 1011.62002
[21] Ramsay JO, Silverman BW (2005) Functional data analysis, 2nd edn. Springer, New York · Zbl 1079.62006
[22] Roberts S, Nowak G (2014) Stabilizing the lasso against cross-validation variability. Comput Stat Data Anal 70:198-211 · Zbl 1471.62171 · doi:10.1016/j.csda.2013.09.008
[23] Stone CJ (1985) Additive regression and their nonparametric models. Ann Stat 13(2):689-705 · Zbl 0605.62065 · doi:10.1214/aos/1176349548
[24] Vieu P (1995) Order choice in nonlinear autoregressive models. Statistics 26(4):307-328 · Zbl 0836.62067 · doi:10.1080/02331889508802499
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.