×

Domain selection and familywise error rate for functional data: a unified framework. (English) Zbl 1522.62097

Summary: Functional data are smooth, often continuous, random curves, which can be seen as an extreme case of multivariate data with infinite dimensionality. Just as componentwise inference for multivariate data naturally performs feature selection, subsetwise inference for functional data performs domain selection. In this paper, we present a unified testing framework for domain selection on populations of functional data. In detail, \(p\)-values of hypothesis tests performed on pointwise evaluations of functional data are suitably adjusted for providing control of the familywise error rate (FWER) over a family of subsets of the domain. We show that several state-of-the-art domain selection methods fit within this framework and differ from each other by the choice of the family over which the control of the FWER is provided. In the existing literature, these families are always defined a priori. In this work, we also propose a novel approach, coined thresholdwise testing, in which the family of subsets is instead built in a data-driven fashion. The method seamlessly generalizes to multidimensional domains in contrast to methods based on a priori defined families. We provide theoretical results with respect to consistency and control of the FWER for the methods within the unified framework. We illustrate the performance of the methods within the unified framework on simulated and real data examples and compare their performance with other existing methods.
{© 2022 The Authors. Biometrics published by Wiley Periodicals LLC on behalf of International Biometric Society.}

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis

References:

[1] Abramowicz, K., Häger, C.K., Pini, A., Schelin, L., Sjöstedt de Luna, S. and Vantini, S. (2018) Nonparametric inference for functional‐on‐scalar linear models applied to knee kinematic hop data after injury of the anterior cruciate ligament. Scandinavian Journal of Statistics, 45, 1036-1061. · Zbl 1408.62084
[2] Assaf, Y. and Pasternak, O. (2008) Diffusion tensor imaging (DTI)‐based white matter mapping in brain research: a review. Journal of Molecular Neuroscience, 34, 51-61.
[3] Basser, P.J., Mattiello, J. and LeBihan, D. (1994) MR diffusion tensor spectroscopy and imaging. Biophysical Journal, 66, 259-67.
[4] Cardot, H., Goia, A. and Sarda, P. (2004) Testing for no effect in functional linear regression models, some computational approaches. Communications in Statistics—Simulation and Computation, 33, 179-199. · Zbl 1058.62037
[5] Corain, L., Melas, V.B., Pepelyshev, A. and Salmaso, L. (2014) New insights on permutation approach for hypothesis testing on functional data. Advances in Data Analysis and Classification, 8, 339-356. · Zbl 1414.62142
[6] Cox, D.D. and Lee, J.S. (2008) Pointwise testing with functional data using the Westfall-Young randomization method. Biometrika, 95, 621-634. · Zbl 1437.62430
[7] Crainiceanu, C.M., Staicu, A.‐M., Ray, S. and Punjabi, N. (2012) Bootstrap‐based inference on the difference in the means of two correlated functional processes. Statistics in Medicine, 31, 3223-3240.
[8] Degras, D. (2017) Simultaneous confidence bands for the mean of functional data. Wiley Interdisciplinary Reviews: Computational Statistics, 9, e1397. · Zbl 07914924
[9] Fan, J. and Zhang, J.T. (2000) Two‐step estimation of functional linear models with applications to longitudinal data. Journal of the Royal Statistical Society Series B, 62, 303-322.
[10] Holm, S. (1979) A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics, 6, 65-70. · Zbl 0402.62058
[11] Holmes, A.P., Blair, R.C., Watson, J.D.G. and Ford, I. (1996) Nonparametric analysis of statistic images from functional mapping experiments. Journal of Cerebral Blood Flow & Metabolism, 16, 7-22.
[12] Horsfield, M. and Jones, D. (2002) Applications of diffusion‐weighted and diffusion tensor MRI to white matter diseases— a review. NMR in Biomedicine, 15, 570-577.
[13] Horváth, L. and Kokoszka, P. (2012) Inference for Functional Data with Applications. Springer Series in Statistics, volume 200. New York: Springer. · Zbl 1279.62017
[14] Liebl, D. and Reimherr, M. (2020) Simultaneous inference for function‐valued parameters: a fast and fair approach. In Aneiros, G. (ed.), Horová, I. (ed.), Hušková, M. (ed.) and Vieu, P. (ed.) (Eds.) Functional and High‐Dimensional Statistics and Related Fields. Cham: Springer International Publishing, pp. 153-159. · Zbl 1444.62160
[15] Logan, B.R. and Rowe, D.B. (2004) An evaluation of thresholding techniques in fMRI analysis. NeuroImage, 22, 95-108.
[16] Marcus, R., Peritz, E. and Gabriel, K.R. (1976) On closed testing procedures with special reference to ordered analysis of variance. Biometrika, 63, 655-660. · Zbl 0353.62037
[17] Mrkvička, T., Myllymäki, M., Kuronen, M. and Narisetty, N.N. (2022) New methods for multiple testing in permutation inference for the general linear model. Statistics in Medicine, 41, 276-297.
[18] Naouma, H. and Pataky, T.C. (2019) A comparison of random‐field‐theory and false‐discovery‐rate inference results in the analysis of registered one‐dimensional biomechanical datasets. PeerJ, 7, e8189.
[19] Olsen, N.L., Pini, A. and Vantini, S. (2021) False discovery rate for functional data. TEST, 30, 784-809. · Zbl 1474.62461
[20] Panagiotaki, E., Schneider, T., Siow, B., Hall, M., Lythgoe, M. and Alexander, D. (2012) Compartment models of the diffusion MR signal in brain white matter: a taxonomy and comparison. Neuroimage, 59, 2241-2254.
[21] Park, S.Y., Staicu, A.‐M., Xiao, L. and Crainiceanu, C.M. (2017) Simple fixed‐effects inference for complex functional models. Biostatistics, 19, 137-152.
[22] Pataky, T.C., Abramowicz, K., Liebl, D., Pini, A., deLuna, S.S. and Schelin, L. (2021) Simultaneous inference for functional data in sports biomechanics. AStA Advances in Statistical Analysis . Published online. https://doi.org/10.1007/s10182‐021‐00418‐4 · doi:10.1007/s10182‐021‐00418‐4
[23] Perone Pacifico, M., Genovese, C., Verdinelli, I. and Wasserman, L. (2004) False discovery control for random fields. Journal of the American Statistical Association, 99, 1002-1014. · Zbl 1055.62105
[24] Pesarin, F. and Salmaso, L. (2010) Permutation Tests for Complex Data: Theory, Applications and Software. Wiley.
[25] Pini, A. and Vantini, S. (2017) Interval‐wise testing for functional data. Journal of Nonparametric Statistics, 29, 407-424. · Zbl 1369.62096
[26] Rathnayake, L.N. and Choudhary, P.K. (2016) Tolerance bands for functional data. Biometrics, 72, 503-512. · Zbl 1419.62429
[27] Reiss, P.T., Huang, L. and Mennes, M. (2010) Fast function‐on‐scalar regression with penalized basis expansions. International Journal of Biostatistics, 6, 28.
[28] Staicu, A.‐M., Li, Y., Crainiceanu, C.M. and Ruppert, D. (2014) Likelihood ratio tests for dependent data with applications to longitudinal and functional data analysis. Scandinavian Journal of Statistics, 41, 932-949. · Zbl 1305.62182
[29] Telschow, F.J. and Schwartzman, A. (2022) Simultaneous confidence bands for functional data using the Gaussian kinematic formula. Journal of Statistical Planning and Inference, 216, 70-94. · Zbl 1477.62398
[30] Van Essen, D.C., Smith, S.M., Barch, D.M., Behrens, T.E.J., Yacoub, E., Ugurbil, K. and WU‐Minn HCP Consortium, (2013) The WU‐Minn Human Connectome Project: an overview. NeuroImage, 80, 62-79.
[31] Vsevolozhskaya, O., Greenwood, M. and Holodov, D. (2014) Pairwise comparison of treatment levels in functional analysis of variance with application to erythrocyte hemolysis. The Annals of Applied Statistics, 8, 905-925. · Zbl 1454.62225
[32] Winkler, A.M., Ridgway, G.R., Webster, M.A., Smith, S.M. and Nichols, T.E. (2014) Permutation inference for the general linear model. NeuroImage, 92, 381-397.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.