×

A mixture of experts model for rank data with applications in election studies. (English) Zbl 1454.62498

Summary: A voting bloc is defined to be a group of voters who have similar voting preferences. The cleavage of the Irish electorate into voting blocs is of interest. Irish elections employ a “single transferable vote” electoral system; under this system voters rank some or all of the electoral candidates in order of preference. These rank votes provide a rich source of preference information from which inferences about the composition of the electorate may be drawn. Additionally, the influence of social factors or covariates on the electorate composition is of interest.
A mixture of experts model is a mixture model in which the model parameters are functions of covariates. A mixture of experts model for rank data is developed to provide a model-based method to cluster Irish voters into voting blocs, to examine the influence of social factors on this clustering and to examine the characteristic preferences of the voting blocs. The Benter model for rank data is employed as the family of component densities within the mixture of experts model; generalized linear model theory is employed to model the influence of covariates on the mixing proportions. Model fitting is achieved via a hybrid of the EM and MM algorithms. An example of the methodology is illustrated by examining an Irish presidential election. The existence of voting blocs in the electorate is established and it is determined that age and government satisfaction levels are important factors in influencing voting in this election.

MSC:

62P25 Applications of statistics to social sciences

Software:

PRMLT; STRUCTURE

References:

[1] Airoldi, E. M., Blei, D. M., Fienberg, S. E. and Xing, E. P. (2008). Mixed membership stochastic blockmodels. J. Machine Learning Research 9 1981-2014. · Zbl 1225.68143
[2] Benter, W. (1994). Computer-based horse race handicapping and wagering systems: A report. In Efficiency of Racetrack Betting Markets (W. T. Ziemba, V. S. Lo and D. B. Haush, eds.) 183-198. Academic Press, San Diego.
[3] Bishop, C. M. (2006). Pattern Recognition and Machine Learning . Springer, New York. · Zbl 1107.68072
[4] Bishop, C. M. and Svensén, M. (2003). Bayesian hierarchical mixture of experts. In Proceedings Nineteenth Conference on Uncertainty in Artificial Intelligence 57-64.
[5] Böhning, D., Dietz, E., Schaub, R., Schlattmann, P. and Lindsay, B. (1994). The distribution of the likelihood ratio for mixtures of densities from the one-parameter exponential family. Ann. Instit. Statist. Math. 46 373-388. · Zbl 0802.62017 · doi:10.1007/BF01720593
[6] Breiman, L., Friedman, J., Olshen, R. and Stone, C. (2006). Classification and Regression Trees , 3rd ed. Routledge PSAI Press, London. · Zbl 0541.62042
[7] Busse, L. M., Orbanz, P. and Buhmann, J. M. (2007). Cluster analysis of heterogeneous rank data. In ICML 2007: Proceedings of the 24th International Conference on Machine Learning 113-120. ACM, New York.
[8] Coakley, J. and Gallagher, M. (1999). Politics in the Republic of Ireland , 3rd ed. Routledge PSAI Press, London.
[9] Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). J. Roy. Statist. Soc. Ser. B 39 1-38. · Zbl 0364.62022
[10] Dobson, A. J. (2002). An Introduction to Generalized linear Models , 2nd ed. Chapman and Hall, London. · Zbl 1008.62067
[11] Erosheva, E. A., Fienberg, S. E. and Joutard, C. (2007). Describing disability through individual-level mixture models for multivariate binary data. Ann. Appl. Statist. 1 502-537. · Zbl 1126.62101 · doi:10.1214/07-AOAS126
[12] Fligner, M. A. and Verducci, J. S. (1986). Distance based ranking models. J. Roy. Statist. Soc. Ser. B 48 359-369. · Zbl 0658.62031
[13] Fligner, M. A. and Verducci, J. S. (1988). Multistage ranking models. J. Amer. Statist. Assoc. 83 892-901. · Zbl 0719.62036 · doi:10.2307/2289322
[14] Fraley, C. and Raftery, A. E. (1998). How many clusters? Which clustering methods? Answers via model-based cluster analysis. Computer J. 41 578-588. · Zbl 0920.68038 · doi:10.1093/comjnl/41.8.578
[15] Friedman, J. (1991). Multivariate adaptive regression splines. Ann. Statist. 19 1-141. · Zbl 0765.62064 · doi:10.1214/aos/1176347963
[16] Gelman, A. and Rubin, D. B. (1995). Avoiding model selection in Bayesian social research. Soc. Methodol. 25 165-173.
[17] Gordon, A. D. (1979). A measure of the agreement between rankings. Biometrika 66 7-15. · Zbl 0397.62040 · doi:10.1093/biomet/66.1.7
[18] Gormley, I. C. (2006). Statistical models for rank data. Ph.D. thesis, Univ. Dublin, Trinity College.
[19] Gormley, I. C. and Murphy, T. B. (2006). Analysis of Irish third-level college applications data. J. Roy. Statist. Soc. Ser. A 169 361-379. · doi:10.1111/j.1467-985X.2006.00412.x
[20] Gormley, I. C. and Murphy, T. B. (2007). A latent space model for rank data. Statistical Network Analysis: Models, Issues and New Directions. Lecture Notes in Comput. Sci. 4503 90-107. Springer, Berlin.
[21] Gormley, I. C. and Murphy, T. B. (2008a). Exploring voting blocs within the Irish electorate: A mixture modeling approach. J. Amer. Statist. Assoc. 103 1014-1027. · Zbl 1205.62198 · doi:10.1198/016214507000001049
[22] Gormley, I. C. and Murphy, T. B. (2008b). A grade of membership model for rank data. Technical report, Univ. College Dublin. Available at http://www.ucd.ie/statdept/staffpages/cgormley/gom.pdf. · Zbl 1454.62498
[23] Gormley, I. C. and Murphy, T. B. (2008c). Supplement to “A mixture of experts model for rank data with applications in election studies.” DOI: 10.1214/08-AOAS178SUPP. · Zbl 1454.62498
[24] Hill, J. L. (2001). Accommodating missing data in mixture models for classification by opinion-changing behavior. J. Educational and Behavioral Statistics 26 233-268.
[25] Holloway, S. (1990). Forty years of united Nations General Assembly voting. Canad. J. Political Sci. / Revue Canad. Sci. Politique 23 279-296.
[26] Hunter, D. R. (2004). MM algorithms for generalized Bradley-Terry models. Ann. Statist. 32 384-406. · Zbl 1105.62359 · doi:10.1214/aos/1079120141
[27] Hunter, D. R. and Lange, K. (2004). A tutorial on MM algorithms. Amer. Statist. 58 30-37. · doi:10.1198/0003130042836
[28] Jacobs, R. A., Jordan, M. I., Nowlan, S. J. and Hinton, G. E. (1991). Adaptive mixture of local experts. Neural Computation 3 79-87.
[29] Jakulin, A. and Buntine, W. (2004). Analyzing the US Senate in 2003: Similarities, Networks, Clusters and Blocs. Preprint. Available at http://kt.ijs.si/aleks/Politics/us_senate.pdf.
[30] Jordan, M. I. and Jacobs, R. A. (1994). Hierarchical mixture of experts and the EM algorithm. Neural Computation 6 181-214. · Zbl 1058.68097
[31] Kass, R. E. and Raftery, A. E. (1995). Bayes factors. J. Amer. Statist. Assoc. 90 773-795. · Zbl 0846.62028 · doi:10.2307/2291091
[32] Keribin, C. (1998). Estimation consistante de l’ordre de modèles de mélange. C. R. Acad. Sci. Paris Sér. I Math. 326 243-248. · Zbl 0954.62023 · doi:10.1016/S0764-4442(97)89479-7
[33] Keribin, C. (2000). Consistent estimation of the order of mixture models. Sankhyā Ser. A 62 49-66. · Zbl 1081.62516
[34] Lange, K., Hunter, D. R. and Yang, I. (2000). Optimization transfer using surrogate objective functions (with discussion). J. Comput. Graph. Statist. 9 1-59.
[35] Leroux, B. G. (1992). Consistent estimation of a mixing distribution. Ann. Statist. 20 1350-1360. · Zbl 0763.62015 · doi:10.1214/aos/1176348772
[36] Mallows, C. L. (1957). Nonnull ranking models. Biometrika 44 114-130. · Zbl 0087.34001 · doi:10.1093/biomet/44.1-2.114
[37] Marden, J. I. (1995). Analyzing and Modeling Rank Data . Chapman and Hall, London. · Zbl 0853.62006
[38] Marsh, M. (1999). The making of the eight president. In How Ireland Voted 1997 (M. Marsh and P. Mitchell, eds.) 215-242. Westview Press, Boulder, CO.
[39] McCullagh, P. and Nelder, J. A. (1983). Generalized Linear Models . Chapman and Hall, London. · Zbl 0588.62104
[40] McFadden, D. G. (1978). Modelling the choice of residential location. Spatial Interaction Theory and Planning Models 75-96.
[41] McLachlan, G. J. and Krishnan, T. (1997). The EM Algorithm and Extensions . Wiley, New York. · Zbl 0882.62012
[42] McLachlan, G. J. and Peel, D. (2000). Finite Mixture Models . Wiley, New York. · Zbl 0963.62061
[43] Meng, X.-L. and Rubin, D. B. (1993). Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika 80 267-278. · Zbl 0778.62022 · doi:10.1093/biomet/80.2.267
[44] Murphy, T. B. and Martin, D. (2003). Mixtures of distance-based models for ranking data. Comput. Statist. Data Anal. 41 645-655. · Zbl 1429.62258
[45] Peng, F., Jacobs, R. A. and Tanner, M. A. (1996). Bayesian inference in mixtures-of-experts and hierarchical mixtures-of-experts models with application to speech recognition. J. Amer. Statist. Assoc. 91 953-960. · Zbl 0882.62022 · doi:10.2307/2291714
[46] Plackett, R. L. (1975). The analysis of permutations. Appl. Statist. 24 193-202. · doi:10.2307/2346567
[47] Pritchard, J. K., Stephens, M. and Peter, D. (2000). Inference of population structure using multilocus genotype data. Genetics 155 945-959. · Zbl 1083.62537
[48] Raftery, A. E. (1995). Bayesian model selection in social research. Soc. Methodol. 25 111-163.
[49] Schwartz, G. (1978). Estimating the dimension of a model. Ann. Statist. 6 461-464. · Zbl 0379.62005 · doi:10.1214/aos/1176344136
[50] Sinnott, R. (1995). Irish Voters Decide: Voting Behaviour in Elections and Referendums Since 1918 . Manchester Univ. Press.
[51] Sinnott, R. (1999). The electoral system, In Politics in the Republic of Ireland (J. Coakley and M. Gallagher, eds.) 99-126. Routledge & PSAI Press, London.
[52] Tam, W. K. (1995). Asians-A monolithic voting bloc? Political Behaviour 17 223-249.
[53] Tanner, M. A. (1996). Tools for Statistical Inference , 3rd ed. Springer, New York. · Zbl 0846.62001
[54] Train, K. E. (2003). Discrete Choice Methods with Simulation . Cambridge Univ. Press. · Zbl 1047.62098
[55] van der Brug, W., van der Eijk, C. and Marsh, M. (2000). Exploring uncharted territory: The Irish presidential election 1997. British J. Political Science 30 631-650.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.