×

Distance-based depths for directional data. (English. French summary) Zbl 1492.62095

Summary: Directional data are constrained to lie on the unit sphere of \(\mathbb{R}^{q}\) for some \(q \geq 2\). To address the lack of a natural ordering for such data, depth functions have been defined on spheres. However, the depths available either lack flexibility or are so computationally expensive that they can only be used for very small dimensions \(q\). In this work, we improve on this by introducing a class of distance-based depths for directional data. Irrespective of the distance adopted, these depths can easily be computed in high dimensions too. We derive the main structural properties of the proposed depths and study how they depend on the distance used. We discuss the asymptotic and robustness properties of the corresponding deepest points. We show the practical relevance of the proposed depths in two applications, related to (i) spherical location estimation and (ii) supervised classification. For both problems, we show through simulation studies that distance-based depths have strong advantages over their competitors.

MSC:

62H11 Directional data; spatial statistics
62G20 Asymptotic properties of nonparametric inference

References:

[1] Agostinelli, C. & Romanazzi, M. (2013a). Localdepth. R package version 0.5‐7.
[2] Agostinelli, C. & Romanazzi, M. (2013b). Nonparametric analysis of directional data based on data depth. Environmental and Ecological Statistics, 20, 253-270.
[3] Banerjee, A., Dhillon, I., Ghosh, J., & Sra, S. (2003). Generative model‐based clustering of directional data. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, pp. 19-28.
[4] Banerjee, A., Dhillon, I., Ghosh, J., & Sra, S. (2005). Clustering on the unit hypersphere using von Mises-Fisher distributions. Journal of Machine Learning Research, 6, 1345-1382. · Zbl 1190.62116
[5] Bogashev, V. I. (2007). Measure Theory, Vol. I. Springer, Berlin, Heidelberg. · Zbl 1120.28001
[6] Dryden, I. L. (2005). Statistical analysis on high‐dimensional spheres and shape spaces. The Annals of Statistics, 33, 1643-1665. · Zbl 1078.62058
[7] Ferguson, T. (1996). A Course in Large Sample Theory. Chapman and Hall/CRC, Boca Raton. · Zbl 0871.62002
[8] Fisher, N. I. (1985). Spherical medians. Journal of the Royal Statistical Society, Series B, 47, 342-348. · Zbl 0605.62055
[9] Genest, M., Masse, J., & Plante, J. (2012). Depth. R package version 2.0‐0.
[10] Ghosh, A. K. & Chaudhuri, P. (2005). On maximum depth and related classifiers. Scandinavian Journal of Statistics, 32, 327-350. · Zbl 1089.62075
[11] Gill, J. & Hangartner, D. (2010). Circular data in political science and how to handle it. Political Analysis, 18, 316-336.
[12] Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J., & Stahel, W. A. (1986). Robust Statistics: The Approach Based on Influence Functions. Wiley, New York. · Zbl 0593.62027
[13] Hornik, K., Feinerer, I., Kober, M., & Buchta, C. (2012). Spherical k‐means clustering. Journal of Statistical Software, 50, 1-22.
[14] Ko, D. & Chang, T. (1993). Robust m‐estimators on spheres. Journal of Multivariate Analysis, 45, 104-136. · Zbl 0777.62056
[15] Koshevoy, G. & Mosler, K. (1997). Zonoid trimming for multivariate distributions. The Annals of Statistics, 25, 1998-2017. · Zbl 0881.62059
[16] Kosorok, M. R. (2008). Introduction to Empirical Processes and Semiparametric Inference. Springer, New York. · Zbl 1180.62137
[17] Ley, C., Sabbah, C., & Verdebout, T. (2014). A new concept of quantiles for directional data. Electronic Journal of Statistics, 8, 795-816. · Zbl 1349.62197
[18] Li, J., Cuesta‐Albertos, J., & Liu, R. Y. (2012). Dd‐classifier: Nonparametric classification procedures based on dd‐plots. Journal of the American Statistical Association, 107, 737-753. · Zbl 1261.62058
[19] Liu, R. Y. (1990). On a notion of data depth based on random simplices. Annals of Statistics, 18(1), 405-414. · Zbl 0701.62063
[20] Liu, R. Y. & Singh, K. (1992). Ordering directional data: Concepts of data depth on circles and spheres. Annals of Statistics, 20, 1468-1484. · Zbl 0766.62027
[21] Lund, U. & Agostinelli, C. (2013). Circular. R package version 0.4‐7.
[22] Mardia, K. V. & Jupp, P. E. (2000). Directional Statistics. John Wiley & Sons, New York. · Zbl 0935.62065
[23] Paindaveine, D. & Van Bever, G. (2013). From depth to local depth: A focus on centrality. Journal of the American Statistical Association, 105, 1105-1119. · Zbl 06224990
[24] Paindaveine, D. & Van Bever, G. (2015). Nonparametrically consistent depth‐based classifiers. Bernoulli, 21, 62-82. · Zbl 1359.62258
[25] Paindaveine, D. & Verdebout, T. (2017). Inference on the mode of weak directional signals: A le cam perspective on hypothesis testing near singularities. The Annals of Statistics, 45, 800-832. · Zbl 1371.62043
[26] Rousseeuw, P. J. & Struyf, A. (2004). Characterizing angular symmetry and regression symmetry. Journal of Statistical Planning and Inference, 122, 161-173. · Zbl 1040.62041
[27] Small, C. (1987). Measures of centrality for multivariate and directional distributions. The Canadian Journal of Statistics, 39, 31-39. · Zbl 0622.62054
[28] Strasser, H. (1985). Mathematical Theory of Statistics. Statistical Experiments and Asymptotic Decision Theory. De Gruyter, Berlin, New York. · Zbl 0594.62017
[29] Tukey, J. W. (1975). Mathematics and the picturing of data. In Proceedings of the International Congress of Mathematicians, Vol. 2. Canadian Mathemathical Congress, Montreal, Canada, pp. 523-531. · Zbl 0347.62002
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.