×

Relative difference in diversity between populations. (English) Zbl 0786.62004

Let \(\prec\) denote the relation of majorization of two discrete distributions. The distribution \(p\) is called more diverse than \(p'\) if \(p\prec p'\). In this sense the uniform distribution is most diverse and a degenerate distribution is least diverse. A numerical function which is monotone increasing w.r.t. the partial order \(\prec\) is called Schur- concave. Both the Gini-Simpson index and Shannon’s entropy have this property and may be considered as a suitable choice for an index of diversity.
The aim of the paper is to introduce a functional which is directly based on the partial order \(\prec\). Starting with the fact that \(p\prec p'\) iff \(p\) is a randomization of \(p'\) by a doubly stochastic matrix \(Q\) the author introduced \[ \rho(p,p')=\inf_ Q\| p-Qp'\|\text{ and } \tilde\rho(p,p')=\rho(p,p')+\rho(p',p). \] The main results of the paper concern isotonic and convexity properties of \(\tilde\rho\). The authors ask for solutions \(H\) of the equation \[ \tilde\rho(p,p')=\lim_{\lambda\to 0}\lambda^{-1}[H(\lambda p+(1- \lambda)p')-\lambda H(p)-(1-\lambda)H(p')] \] to introduce a new measure \(H\) of diversity with the help of the measure of dissimilarity \(\tilde\rho\). A series representation of \(H\) is given. As an application of \(\tilde\rho\), the problem of selecting the most diverse population is considered, where “most diverse” is defined in terms of \(\rho(p_ i,p_ j)\). The selection rule proposed is based on \(\rho(\hat p_ i,\hat p_ j)\). Numerical results for the probability of correct selection are given and compared with those corresponding to procedures based on other diversity indices.
Reviewer: F.Liese (Rostock)

MSC:

62B10 Statistical aspects of information-theoretic topics
62F07 Statistical ranking and selection procedures
Full Text: DOI

References:

[1] Agresti, A. (1990).Categorical Data Analysis, Wiley Series in Probability and Mathematical Statistics, Wiley, New York. · Zbl 0716.62001
[2] Alam, K., Mitra, A., Rizvi, M. H. and Saxena, K. M. Lal (1986). Selection of the most diverse multinomial population,Amer. J. Math. Management Sci., Special Volume6, 65-86. · Zbl 0618.62034
[3] Dennis, B., Patil, G. P., Rossi, O. and Taillie, C. (1979). A bibliography of literature on ecological diversity and related methodology,Ecological Diversity in Theory and Practice, 319-354, International Co-operative Publishing House, Jerusalem.
[4] Dudewicz, E. J. and Van der Meulen, E. C. (1981). Selection procedures for the best binomial population with generalized entropy goodness,Tamkang J. Math.,12, 206-208. · Zbl 0487.62022
[5] Gini, C. (1912). Variabilitá e Mutabilitá, Studi Economico-Giuridici della facoltá di Giurisprodenza dell, Universitá di Cagliari, Anno 3, Part 2, p. 80.
[6] Gower, J. C. (1985). Measures of similarity, dissimilarity, and distance,Encyclopedia of Statistical Sciences (eds. S. Kotz and N. L. Johnson),5, 397-405, Wiley-Interscience, New York.
[7] Grassle, J. F. and Smith, W. K. (1976). A similarity measure sensitive to the contribution of rare species and its use in investigation of variation in marine benthic communities,Oecologia,25, 13-25. · doi:10.1007/BF00345030
[8] Gupta, S. S. and Huang, D. Y. (1976). On subset selection procedures for the entropy function associated with the binomial populations,Sankhy? Ser. A,38, 153-173. · Zbl 0403.62011
[9] Gupta, S. S. and Wong, W. Y. (1975). Subset selection procedures for finite schemes in information theory,Colloq. Math. Soc. János Bolyai,16, 279-291.
[10] Haberman, S. J. (1982). Analysis of dispersion of multinomial responses,J. Amer. Statist. Assoc.,77, 568-580. · Zbl 0495.62063 · doi:10.2307/2287713
[11] Kostreva, M. M. (1989). Generalization of Murty’s direct algorithm to linear and convex quadratic programming,J. Optim. Theory Appl.,62, 63-76. · Zbl 0651.90087 · doi:10.1007/BF00939630
[12] Light, R. J. and Margolin, B. H. (1971). An analysis of variance for categorical data,J. Amer. Statist. Assoc.,66, 534-544. · Zbl 0222.62035 · doi:10.2307/2283520
[13] Lorenz, M. O. (1905). Methods of measuring concentration of wealth,J. Amer. Statist. Assoc.,9, 209-212.
[14] Marshall, A. V. and Olkin, I. (1979).Inequalities: Theory of Majorization and Its Applications, Academic Press, San Diego. · Zbl 0437.26007
[15] Patil, G. P. and Taillie, C. (1982). Diversity as a concept and its measurement,J. Amer. Statist. Assoc.,77, 548-561. · Zbl 0511.62113 · doi:10.2307/2287709
[16] Rao, C. R. (1982a). Diversity and dissimilarity coefficients, a unified approach,Theoret. Population Biol.,21, 24-43. · Zbl 0516.92021 · doi:10.1016/0040-5809(82)90004-1
[17] Rao, C. R. (1982b). Diversity: its measurement, decomposition, appartionment and analysis,Sankhyã Ser. A,44, 1-22. · Zbl 0584.62114
[18] Rao, C. R. (1984). Convexity properties of entropy functions and analysis of diversity,Inequalities in Statistics and Probability, IMS Lecture Notes - Monograph Series, Vol. 5, 68-77, Hayward, California.
[19] Rao, C. R. and Nayak, T. K. (1985). Cross entropy, dissimilarity measures, and characterizations of quadratic entropy,IEEE Trans. Inform. Theory,31 (5), 589-593. · Zbl 0596.94004 · doi:10.1109/TIT.1985.1057082
[20] Rizvi, M. H., Alam, K. and Saxena, K. M. Lal (1987). Selection procedure for multinomial populations with respect to diversity indices,Contribution to the Theory and Application of Statistics (ed. A. E. Gelfard), Academic Press, New York.
[21] Schmidt, P. and Strauss, R. P. (1975). The prediction of occupation using multiple logit models,Internat. Econom. Rev.,16, 471-486. · doi:10.2307/2525826
[22] Schur, I. (1923). Uber eine Klasse von Mittelbildungen mit Anwendungen die Determinanten,Theorie Sitzungsber. Berlin Math. Gesellschaft,22, 9-20 (Issai Collected Works (eds. A. Brauer and H. Rohrbach), Vol. II, 416-427, Springer, Berlin, 1973). · JFM 49.0054.01
[23] Shannon, C. E. (1948). A mathematical theory of communication,Bell Syst. Tech. J.,27, 379-423 and 626-656. · Zbl 1154.94303
[24] Simpson, E. H. (1949). Measurement of diversity,Nature,163, 688. · Zbl 0032.03902 · doi:10.1038/163688a0
[25] Smith, W. (1989). ANOVA-like similarity analysis using expected species shared,Biometrics,45, 873-881. · Zbl 0715.62197 · doi:10.2307/2531688
[26] Smith, W., Grassle, J. F. and Kravitz, D. (1979). Measures of diversity with unbiased estimators,Ecological Diversity in Theory and Practice, 177-191, International Co-operative Publishing House, Jerusalem.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.