×

Unordered and ordered sample from Dirichlet distribution. (English) Zbl 1095.62013

Summary: Consider the random Dirichlet partition of an interval into \(n\) fragments with parameter \(\theta> 0\). Explicit results on the statistical structure of its size-biased permutation are recalled, leading to (unordered) Ewens and (ordered) Donnelly-Tavaré-Griffiths sampling formulae from finite Dirichlet partitions. We use these preliminary statistical results on frequencies distributions to address the following sampling problem: what are the intervals between new sampled categories when sampling is from Dirichlet populations? The results obtained are in accordance with the ones found in sampling theory from random proportions with GEM\((\gamma)\) distributions. These can be obtained from a Dirichlet model when considering the Kingman limit \(n\uparrow\infty\), \(\theta\downarrow 0\) while \(n\theta= \gamma> 0\).

MSC:

62D05 Sampling theory, sample surveys
62E15 Exact distribution theory in statistics
60C05 Combinatorial probability
Full Text: DOI

References:

[1] Barrera, J., Huillet, T. and Paroissin, C. (2005). Size-biased permutation of Dirichlet partitions and search-cost distribution,Probability in the Engineering & Informational Sciences,19(1), 83–97. · Zbl 1077.60005
[2] Donnelly, P. (1986). Partition structures, Pòlya urns, the Ewens sampling formula and the age of alleles,Theoretical Population Biology,30, 271–288. · Zbl 0608.92005 · doi:10.1016/0040-5809(86)90037-7
[3] Donnelly, P. (1991). The heaps process, libraries and size-biased permutation,Journal of Applied Probability,28, 321–335. · Zbl 0727.60078 · doi:10.2307/3214869
[4] Donnelly, P. and Tavaré, S. (1986). The age of alleles and a coalescent,Advances in Applied Probability,18, 1–19. · Zbl 0588.92013 · doi:10.2307/1427237
[5] Ewens, W. J. (1972). The sampling theory of selectively neutral alleles,Theoretical Population Biology,3, 87–112. · Zbl 0245.92009 · doi:10.1016/0040-5809(72)90035-4
[6] Ewens, W. J. (1990). Population genetics theory–the past and the future,Mathematical and Statistical Developments of Evolutionary Theory (ed. S. Lessard), Kluwer, Dordrecht. · Zbl 0718.92010
[7] Ewens, W. J. (1996). Some remarks on the law of succession,Athens Conference on Applied Probability and Time Series Analysis (1995), Vol. I, Lecture Notes in Statistics,114, 229–244, Springer, New York. · Zbl 0877.62092
[8] Huillet, T. (2003). Sampling problems for randomly broken sticks,Journal of Physics A,36(14), 3947–3960. · Zbl 1049.82036 · doi:10.1088/0305-4470/36/14/302
[9] Huillet, T. (2005). Sampling formulae arising from random Dirichlet populations,Communications in Statistics: Theory and Methods (to appear). · Zbl 1065.60050
[10] Huillet, T. and Martinez, S. (2003). Sampling from finite random partitions,Methodology and Computing in Applied Probability,5(4), 467–492. · Zbl 1038.60007 · doi:10.1023/A:1026289530652
[11] Kingman, J. F. C. (1975). Random discrete distributions,Journal of the Royal Statistical Society. Series B,37, 1–22. · Zbl 0331.62019
[12] Kingman, J. F. C. (1993).Poisson Processes, Clarendon Press, Oxford. · Zbl 0771.60001
[13] Pitman, J. (1996). Random discrete distributions invariant under size-biased permutation,Advances in Applied Probability,28, 525–539. · Zbl 0853.62018 · doi:10.2307/1428070
[14] Pitman, J. (1999). Coalescents with multiple collisions,Annals of Probability,27(4), 1870–1902. · Zbl 0963.60079 · doi:10.1214/aop/1022677552
[15] Pitman, J. (2002). Poisson-Dirichlet and GEM invariant distributions for split-and-merge transformation of an interval partition,Combinatorics, Probability and Computing,11(5), 501–514. · Zbl 1011.60051 · doi:10.1017/S0963548302005163
[16] Pitman, J. and Yor, M. (1997). The two parameter Poisson-Dirichlet distribution derived from a stable subordinator,Annals of Probability,25, 855–900. · Zbl 0880.60076 · doi:10.1214/aop/1024404422
[17] Sibuya, M. and Yamato, H. (1995). Ordered and unordered random partitions of an integer and the GEM distribution,Statistics & Probability Letters,25(2), 177–183. · Zbl 0843.60011 · doi:10.1016/0167-7152(94)00220-3
[18] Tavaré, S. and Ewens, W. J. (1997). Multivariate Ewens distribution,Discrete Multivariate Distributions (eds. N. L. Johnson, S. Kotz and N. Balakrishnan),41, 232–246, Wiley, New York.
[19] Yamato, H. (1997). On the Donnely-Tavaré-Griffiths formula associated with the coalescent,Communications in Statistics: Theory and Methods,26(3), 589–599. · Zbl 1030.62511 · doi:10.1080/03610929708831936
[20] Yamato, H., Sibuya, M. and Nomachi, T. (2001). Ordered sample from two-parameter GEM distribution,Statistics & Probability Letters,55(1), 19–27. · Zbl 1003.62008 · doi:10.1016/S0167-7152(01)00119-5
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.