×

Multilevel models with multivariate mixed response types. (English) Zbl 07257700

Summary: We build upon the existing literature to formulate a class of models for multivariate mixtures of Gaussian, ordered or unordered categorical responses and continuous distributions that are not Gaussian, each of which can be defined at any level of a multilevel data hierarchy. We describe a Markov chain Monte Carlo algorithm for fitting such models. We show how this unifies a number of disparate problems, including partially observed data and missing data in generalized linear modelling. The two-level model is considered in detail with worked examples of applications to a prediction problem and to multiple imputation for missing data. We conclude with a discussion outlining possible extensions and connections in the literature. Software for estimating the models is freely available.

MSC:

62-XX Statistics
Full Text: DOI

References:

[1] Aitchison J and Bennett JA (1970) Polychotomous quantal response by maximum indicant . Biometrika, 57, 253-62 . · Zbl 0197.16401 · doi:10.1093/biomet/57.2.253
[2] Asparouhov T and Muthen B (2007) Multilevel mixture models. In Hancock GR and Samuelson KM (eds). Advances in latent mixture models Charlotte, NC: Information Age Publishing, Inc ., 27-51.
[3] Box GEP and Cox DR (1964) An analysis of transformations (with discussion) . Journal of the Royal Statistical Society, B, 26, 211-52 . · Zbl 0156.40104
[4] Browne WJ (2009) MCMC estimation in MLwiN. Bristol: University of Bristol .
[5] Browne WJ and Draper D (2006) A comparison of Bayesian and likelihood based methods for fitting multilevel models . Bayesian Analysis, 1, 473-514 . · Zbl 1331.62125 · doi:10.1214/06-BA117
[6] Carpenter J and Goldstein H (2004) Multiple imputation using MLwiN . Multilevel Modelling Newsletter, 16, 9-18 .
[7] Carstairs V and Morris R (1991) Deprivation and health in Scotland. Aberdeen, Scotland: Aberdeen University Press .
[8] Currie C , Levin K , and Todd J (2008) Health behaviour in school-aged children: findings from the 2006 HBSC survey in Scotland. Child and Adolescent Health Research Unit, University of Edinburgh .
[9] Dunson DB (2000) Bayesian latent variable models for clustered mixed outcomes . Journal of the Royal Statistical Society, Series B, 62, 355-66 . · doi:10.1111/1467-9868.00236
[10] Geweke J (1991) Efficient simulation from the multivariate normal and student-t distributions subject to linear constraints . Computing science and statistics: Proceedings of the 23rd symposium on the interface. Fairfa Station, VA: Interface foundation of North America.
[11] Goldstein H (1989) Models for multilevel response variables with an application to Growth Curves. In Bock RD (ed). Multilevel analysis of educational data. New York: Academic Press , 107-25. · doi:10.1016/B978-0-12-108840-8.50011-1
[12] Goldstein H (2003) Multilevel statistical models. Third edition. London: Edward Arnold . · Zbl 1014.62126
[13] Goldstein H (2009) The analysis of survival and event history data using a latent normal model. (in press).
[14] Goldstein H , Bonnet G , and Rocher T (2007) Multilevel structural equation models for the analysis of comparative data on educational performance . Journal of Educational and behavioural Statistics, 32, 252-86 .
[15] Goldstein H and Browne W (2005) Multilevel factor analysis models for continuous and discrete data. In Olivares A and McArdle JJ (eds). Contemporary psychometrics. A Festschrift to Roderick P. McDonald. Mahwah, NJ: Lawrence Erlbaum .
[16] Goldstein H and Kounali D (2009) Multivariate multilevel modelling of childhood growth, members of growth measurements and adult characteristics . Journal of the Royal Statistical Society, A, 172, 599-613 . · doi:10.1111/j.1467-985X.2008.00576.x
[17] Heitjan DF and Rubin DB (1991) Ignorability and coarse data . Annals of Statistics, 19, 2244-53 . · Zbl 0745.62004 · doi:10.1214/aos/1176348396
[18] Imai K and van Dyk DA (2005) A Bayesian analysis of the multinomial probit model using marginal data augmentation . Journal of Econometrics, 124, 311-34 . · Zbl 1335.62049 · doi:10.1016/j.jeconom.2004.02.002
[19] Kenward M and Carpenter J (2007) Multiple imputation: current perspectives . Statistical Methods in Medical Research, 16, 199-218 . · Zbl 1122.62358
[20] Mathworks (2004) Matlab. Available at http://www.mathworks.co.uk.
[21] Muthen LK and Muthen BO (2004) MPLUS users guide version 5. Los Angeles: University of California, Graduate School of Education .
[22] Pitt M , Chan D , and Kohn R (2006) Efficient Bayesian inference for Guassian copula regression models . Biometrika, 93, 537-54 . · Zbl 1108.62027 · doi:10.1093/biomet/93.3.537
[23] Qin C , Dietz PM , England LJ , Martin JA , et al. (2007) Effects of different data-editing methods on trends in race-specific delivery rates, United States, 1990-2002 . Pediatric and Perinatal Epidemiology, 21, 41-49 . · doi:10.1111/j.1365-3016.2007.00860.x
[24] Rabe-Hesketh S , Pickles A , and Skrondal A (2001) GLLAMM: a general class of multilevel models and a STATA program . Multilevel Modelling Newsletter, 13, 17-23 .
[25] Rabe-Hesketh S , Skrondal A , and Pickles A (2005) Maximum likelihood estimation of limited and discrete dependent variable models with nested random effects . Journal of Econometrics, 128, 301-23 . · Zbl 1336.62079 · doi:10.1016/j.jeconom.2004.08.017
[26] Rasbash J , Steele F , Browne W and Goldstein H (2009). A user’s guide to MLwiN version 2.10. Bristol, Centre for Multilevel Modelling, University of Bristol .
[27] Rubin DB (1987) Multiple imputation for non response in surveys. Chichester: Wiley . · Zbl 1070.62007 · doi:10.1002/9780470316696
[28] Schafer JL (1997) Analysis of incomplete multivariate data. London: Chapman & Hall . · Zbl 0997.62510 · doi:10.1201/9781439821862
[29] Scheuren F and Winkler WE (1993) Regression analysis of data files that are computer matched . Survey Methodology, 19, 35-38 .
[30] Scottish Executive (2003) Hungry for success: a whole school approach to school meals in Scotland. Edinburgh: The Stationary Office .
[31] Scottish Health Promoting Unit (2004) Being well-doing well: a framework for health promoting schools in Scotland 2004. Dundee: SHPSU .
[32] Spiegelhalter D , Best N , Carlin BP , and Van der Linde A (2002) Bayesian measures of model complexity and fit (with discussion) . Journal of the Royal Statistical Society, B, 64, 583-640 . · Zbl 1067.62010 · doi:10.1111/1467-9868.00353
[33] Spiegelhalter DJ , Thomas A , and Best NG (1999) WINBUGS version 1.2, user manual. Cambridge: MRC Biostatistics Unit .
[34] Van Buuren S (2007) Multiple imputation of discrete and continuous data by fully conditional specification . Statistical Methods in Medical Research, 16, 219-42 . · Zbl 1122.62382
[35] Van Dyk D and Meng X (2001) The art of data augmentation . Journal of Computational and Graphical Statistics, 10, 1-30 . · doi:10.1198/10618600152418584
[36] Yucel R (2008) Multiple imputation for multilevel continuous data. Philosophical transactions of the Royal Society, A , 2, 2389-403 .
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.