×

Czekanowski’s index of overlap, its \(L_p\)-type extension, and bias reduction. (English) Zbl 1188.62147

Summary: J. Czekanowski’s proportional similarity index [Korrespondenzbl. Dtsche. Gesellschaft Anthropol. 40, 44–47 (1909); Anthrop. Anzeiger 9, 227–249 (1932); Current Anthropol. 3, 481–494 (1962)] which is the area of overlap of two density functions, has played a prominent role in anthropology, ecology, econometrics, engineering, health and other sciences. It has been shown that the corresponding nonparametric kernel-density estimator is consistent. Based on simulation studies several authors have also noted that the estimator exhibits a larger than anticipated bias. We examine the situation in detail and learn that the bias is an integral part of the estimator. We then suggest an \(L_p\)-extension of Czekanowski’s index; the latter being an \(L_1\)-type quantity. We show in particular that if \(p > 4/3\), then under an appropriate choice of the bandwidth the kernel-density estimator of the \(L_p\)-extension loses its bias.

MSC:

62G07 Density estimation
62G20 Asymptotic properties of nonparametric inference
65C60 Computational problems in statistics (MSC2010)
62P99 Applications of statistics
Full Text: DOI

References:

[1] Anderson G., Journal of Econometrics 122 pp 1– (2004) · Zbl 1282.62244 · doi:10.1016/j.jeconom.2003.10.017
[2] Anderson G., Distributional overlap: Simple, multivariate, parametric and non-parametric tests for alienation, convergence and general distributional difference issues. (2005) · Zbl 1187.91148
[3] Bloom S.A., Marine Ecology – Progress Series 5 pp 125– (1981) · doi:10.3354/meps005125
[4] Bradley E.L., Encyclopedia of Statistical Sciences 6 pp 546– (1985)
[5] Bray J.R., Ecological Monographs 27 pp 325– (1957) · doi:10.2307/1942268
[6] Clemons T.E., The Overlapping Coefficient for Two Normal Probability Functions with Unequal Variances (1996)
[7] Clemons T.E., Computational Statistics & Data Analysis 34 pp 51– (2000) · Zbl 1052.62514 · doi:10.1016/S0167-9473(99)00074-2
[8] Csörgö M., Probability Theory and Related Fields 80 pp 269– (1988) · Zbl 0657.60026 · doi:10.1007/BF00356106
[9] Czekanowski J., Korrespondenzblatt der Deutschen Gesellschaft fur Anthropologie 40 pp 44– (1909)
[10] Czekanowski J., Anthropologischer Anzeiger 9 pp 227– (1932)
[11] Czekanowski J., Current Anthropology 3 pp 481– (1962) · doi:10.1086/200319
[12] Devroye L., Nonparametric Density Estimation. The L1 View (1985)
[13] Feinsinger P., Ecology 62 pp 27– (1981) · doi:10.2307/1936664
[14] Field J.G., Zoologica Africana 3 pp 119– (1968)
[15] Gerrard R., IMA Journal of Mathematics Applied in Medicine and Biology 3 pp 115– (1986) · doi:10.1093/imammb/3.2.115
[16] Gastwirth J.L., American Statistician 29 pp 32– (1975)
[17] Gower J.C., Encyclopedia of Statistical Sciences 5 pp 397– (1985)
[18] Horváth L., Annals of Statistics 19 pp 1933– (1991) · Zbl 0765.62045 · doi:10.1214/aos/1176348379
[19] Ichikawa M., Reliability Engineering & System, Safety 41 pp 203– (1993) · doi:10.1016/0951-8320(93)90033-U
[20] Inman H.F., Migration in the Cotton South: The Geographic Mobility of Alabama Farmers, 1850–1860 (1981)
[21] Inman H.F., Communications in Statistics – Theory and Methods 18 pp 3851– (1989) · Zbl 0696.62131 · doi:10.1080/03610928908830127
[22] Inman H.F., Environmetrics 5 pp 167– (1994) · doi:10.1002/env.3170050207
[23] Karian Z.A., Modem Statistical, Systems, and GPSS Simulation (1999)
[24] Kistler S., Vegetatio 40 pp 185– (1979)
[25] Lu R.-P., Theoretical Population Biology 35 pp 1– (1989) · Zbl 0665.92021 · doi:10.1016/0040-5809(89)90007-5
[26] Mishra S.N., Communications in Statistics – Theory and Methods 15 pp 123– (1986) · Zbl 0602.62020 · doi:10.1080/03610928608829110
[27] Mizuno S., Clinical Trials 4 pp 174– (2005) · doi:10.1191/1740774505cn077oa
[28] Mulekar M.S., Estimating overlap of two exponential populations (2001)
[29] Mulekar M.S., American Journal of Mathematical and Management Sciences 28 pp 61– (2008) · Zbl 1155.62072 · doi:10.1080/01966324.2008.10737717
[30] Mulekar M.S., Journal of the Japan Statistical Society 24 pp 169– (1994)
[31] Mulekar M.S., Computational Statistics and Data Analysis 34 pp 121– (2000) · Zbl 1054.62502 · doi:10.1016/S0167-9473(99)00096-1
[32] Mueller L.D., Ecology 66 pp 1204– (1985) · doi:10.2307/1939173
[33] Petrov V.V., Limit Theorems of Probability Theory. Sequences of Independent Random Variables (1995) · Zbl 0826.60001
[34] Reiser B., The Statistician 48 pp 413– (1999)
[35] Renkonen O., Annales Societatis Zoologicoe - Botanicoe Fennicoe ”Vanamo” 6 pp 1– (1938)
[36] Ricklefs R.E., Ecology 61 pp 1019– (1980) · doi:10.2307/1936817
[37] Rom D.M., Statistics in Medicine 15 pp 1489– (1996) · doi:10.1002/(SICI)1097-0258(19960730)15:14<1489::AID-SIM293>3.0.CO;2-S
[38] Rosenthal H.P., Israel Journal of Mathematics 8 pp 273– (1970) · Zbl 0213.19303 · doi:10.1007/BF02771562
[39] Al-Saidy O., ESAIM: Probability and Statistics 9 pp 206– (2005) · Zbl 1136.62378 · doi:10.1051/ps:2005010
[40] Sale P.F., Coral Reefs 11 pp 147– (1992) · doi:10.1007/BF00255469
[41] Schatzmann E., Mathematical Medicine and Biology 3 pp 99– (1986) · doi:10.1093/imammb/3.2.99
[42] Serfling R., Approximation Theorems of Mathematical Statistics (1980) · Zbl 0538.62002 · doi:10.1002/9780470316481
[43] Slobodchikoff C.N., Ecology 61 pp 1051– (1980) · doi:10.2307/1936823
[44] Smith E.P., Ecology 63 pp 1675– (1982) · doi:10.2307/1940109
[45] Sneath P.H.A., Mathematical Geology 9 pp 123– (1977) · doi:10.1007/BF02312508
[46] Sneath P.H.A., Numerical Taxonomy: The Principles and Practice of Numerical Classification (1973) · Zbl 0285.92001
[47] Stine R.A., Statistics in Medicine 20 pp 215– (2001) · doi:10.1002/1097-0258(20010130)20:2<215::AID-SIM642>3.0.CO;2-X
[48] Weitzman M.S., Measures of Overlap of Income Distributions of White and Negro Families in the United States (1970)
[49] Wolda H., Oecologia 50 pp 296– (1981) · doi:10.1007/BF00344966
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.