×

An exact distribution-free test comparing two multivariate distributions based on adjacency. (English) Zbl 1095.62053

Summary: A new test is proposed comparing two multivariate distributions by using distances between observations. Unlike earlier tests using interpoint distances, the new test statistic has a known exact distribution and is exactly distribution free. The interpoint distances are used to construct an optimal non-bipartite matching, i.e., a matching of the observations into disjoint pairs to minimize the total distance within pairs. The cross-match statistic is the number of pairs containing one observation from the first distribution and one from the second. Distributions that are very different will exhibit few cross-matches. When comparing two discrete distributions with finite support, the test is consistent against all alternatives.
The test is applied to a study of brain activation measured by functional magnetic resonance imaging during two linguistic tasks, comparing brains that are impaired by arteriovenous abnormalities with normal controls. A second exact distribution-free test is also discussed: it ranks the pairs and sums the ranks of the cross-matched pairs.

MSC:

62G10 Nonparametric hypothesis testing
62P10 Applications of statistics to biology and medical sciences; meta analysis
62A09 Graphical methods in statistics
65C60 Computational problems in statistics (MSC2010)
62H15 Hypothesis testing in multivariate analysis
Full Text: DOI

References:

[1] Anderson T. W., Multivariate Analysis pp 5– (1966)
[2] Daykin D. E., Am. Math. Mnthly 76 pp 98– (1969)
[3] Derigs U., Ann. Oper. Res. 13 pp 225– (1988)
[4] Drion E. F., Ann. Math. Statist. 23 pp 563– (1952)
[5] Durbin R., Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids (1999) · Zbl 0929.92010
[6] Friedman J. H., Ann. Statist. 7 pp 697– (1979)
[7] DOI: 10.1145/6462.6502 · Zbl 0606.68064 · doi:10.1145/6462.6502
[8] DOI: 10.1093/biostatistics/5.2.263 · Zbl 1096.62078 · doi:10.1093/biostatistics/5.2.263
[9] Hajek J., Theory of Rank Tests (1999)
[10] Hardy G., Inequalities (1952)
[11] Henze N., Ann. Statist. 16 pp 772– (1988)
[12] DOI: 10.1214/aos/1018031112 · Zbl 0944.62057 · doi:10.1214/aos/1018031112
[13] Hollander M., Nonparametric Statistical Methods (1999) · Zbl 0997.62511
[14] Holst L., Ann. Statist. 7 pp 551– (1979)
[15] Johnson N. L., Discrete Multivariate Distributions (1997) · Zbl 0868.62048
[16] Lehericy S., Radiology 223 pp 672– (2002)
[17] Lehmann E. L., Nonparametrics (1998)
[18] DOI: 10.1198/1061860043470 · doi:10.1198/1061860043470
[19] DOI: 10.1198/016214501753381896 · Zbl 1051.62113 · doi:10.1198/016214501753381896
[20] DOI: 10.1214/aos/1032526956 · Zbl 0862.62047 · doi:10.1214/aos/1032526956
[21] Papadimitriou C. H., Combinatorial Optimization: Algorithms and Complexity (1982) · Zbl 0503.90060
[22] Rao C. R., Linear Statistical Inference and Its Applications (1973) · Zbl 0256.62002 · doi:10.1002/9780470316436
[23] Rosenbaum P. R., Ann. Statist. 19 pp 1091– (1991)
[24] Schilling M. F., J. Am. Statist. Ass. 81 pp 799– (1986)
[25] Weiss L., Ann. Math. Statist. 31 pp 159– (1959)
[26] Wilks S., Mathematical Statistics (1962) · Zbl 0173.45805
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.