Generalized species sampling priors with latent beta reinforcements. (English) Zbl 1368.62125

Summary: Many popular Bayesian nonparametric priors can be characterized in terms of exchangeable species sampling sequences. However, in some applications, exchangeability may not be appropriate. We introduce a novel and probabilistically coherent family of nonexchangeable species sampling sequences characterized by a tractable predictive probability function with weights driven by a sequence of independent Beta random variables. We compare their theoretical clustering properties with those of the Dirichlet process and the two-parameter Poisson-Dirichlet process. Unlike existing work, the proposed construction provides a complete characterization of the joint process. We then propose the use of such a process as a prior distribution in a hierarchical Bayes modeling framework, and we describe a Markov chain Monte Carlo sampler for posterior inference. We evaluate the performance of the prior and the robustness of the resulting inference in a simulation study, providing a comparison with popular Dirichlet process mixtures and hidden Markov models. Finally, we develop an application to the detection of chromosomal aberrations in breast cancer by leveraging array comparative genomic hybridization (CGH) data.
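To make the idea of a predictive probability function with Beta-driven weights concrete, the following is a minimal sketch and not the authors' implementation. The summary does not state the weights explicitly. The sketch assumes one natural form: given independent latent variables W_i ~ Beta(alpha, beta), the next observation copies the j-th past value with probability (1 - W_j) * prod_{i>j} W_i and is a fresh draw from a base measure with probability prod_i W_i. These weights telescope to one. The function names, the common (alpha, beta) parameters, and the standard normal base measure are all hypothetical choices made for illustration.

```python
import numpy as np

def predictive_weights(w):
    """Return (p_old, r_new) for the next draw, given latent values w in (0, 1).

    ASSUMED form (not stated in the summary):
      p_old[j] = (1 - w[j]) * prod_{i > j} w[i]   # copy the j-th past value
      r_new    = prod_i w[i]                      # draw a new value
    """
    w = np.asarray(w, dtype=float)
    n = w.size
    # tail[j] holds the product of w[j+1:], with an empty product equal to 1.
    tail = np.ones(n)
    for j in range(n - 2, -1, -1):
        tail[j] = tail[j + 1] * w[j + 1]
    p_old = (1.0 - w) * tail
    r_new = float(np.prod(w))
    return p_old, r_new

def sample_sequence(n, alpha, beta, base_sampler, rng):
    """Simulate n values from the assumed Beta-driven predictive rule."""
    w = rng.beta(alpha, beta, size=n)  # independent latent reinforcements
    x = [base_sampler(rng)]            # the first value always comes from the base measure
    for m in range(1, n):
        p_old, r_new = predictive_weights(w[:m])
        probs = np.append(p_old, r_new)
        probs = probs / probs.sum()    # guard against floating-point drift
        k = rng.choice(m + 1, p=probs)
        x.append(base_sampler(rng) if k == m else x[k])
    return np.array(x), w

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x, w = sample_sequence(30, 2.0, 1.0, lambda r: r.standard_normal(), rng)
    print("number of distinct values:", np.unique(x).size)
```

Because each weight depends on the position j of a past observation rather than only on cluster sizes, the order of the observations matters. This is one way a sequence of this kind can fail to be exchangeable.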

MSC:

62G99 Nonparametric inference
62F15 Bayesian inference
60G09 Exchangeability for stochastic processes
62P10 Applications of statistics to biology and medical sciences; meta analysis
