×

Bayesian latent multi-state modeling for nonequidistant longitudinal electronic health records. (English) Zbl 1520.62279

Summary: Large amounts of longitudinal health records are now available for dynamic monitoring of the underlying processes governing the observations. However, the health status progression across time is not typically observed directly: records are observed only when a subject interacts with the system, yielding irregular and often sparse observations. This suggests that the observed trajectories should be modeled via a latent continuous-time process potentially as a function of time-varying covariates. We develop a continuous-time hidden Markov model to analyze longitudinal data accounting for irregular visits and different types of observations. By employing a specific missing data likelihood formulation, we can construct an efficient computational algorithm. We focus on Bayesian inference for the model: this is facilitated by an expectation-maximization algorithm and Markov chain Monte Carlo methods. Simulation studies demonstrate that these approaches can be implemented efficiently for large data sets in a fully Bayesian setting. We apply this model to a real cohort where patients suffer from chronic obstructive pulmonary disease with the outcome being the number of drugs taken, using health care utilization indicators and patient characteristics as covariates.
{© 2020 The International Biometric Society}

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis
Full Text: DOI

References:

[1] Alaa, A.M. and van derSchaar, M. (2018) A h idden absorbing semi‐Markov model for informatively censored temporal data: learning and inference. Journal of Machine Learning Research, 19, 1-62. · Zbl 1445.62211
[2] Altman, R.M. (2007) Mixed hidden Markov models: an extension of the hidden Markov model to the longitudinal data setting. Journal of the American Statistical Association, 102, 201-210. · Zbl 1284.62803
[3] Bartolucci, F. and Farcomeni, A. (2019) A shared‐parameter continuous‐time hidden Markov and survival model for longitudinal data with informative dropout. Statistics in Medicine, 38, 1056-1073.
[4] Bartolucci, F., Farcomeni, A. and Pennoni, F. (2012) Latent Markov Models for Longitudinal Data. Boca Raton, FL: Chapman and Hall/CRC. · Zbl 1305.62300
[5] Bartolucci, F., Pennoni, F. and Francis, B. (2007) A latent Markov model for detecting patterns of criminal activity. Journal of the Royal Statistical Society: Series A (Statistics in Society), 170, 115-132.
[6] Baum, L. and Petrie, T. (1966) Statistical inference for probabilistic functions of finite state Markov chains. The Annals of Mathematical Statistics, 37, 1554-1563. · Zbl 0144.40902
[7] Betancourt, M., Roberts, K., Bennett, T., Driscoll, E., Jayaraman, G. and Pelletier, L. (2014) Monitoring chronic diseases in Canada: the chronic disease indicator framework. Maladies Chroniques et Blessures au Canada, 34, 1-30.
[8] Bladt, M. and Sørensen, M. (2005) Statistical inference for discretely observed Markov jump processes. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67, 395-410. · Zbl 1069.62061
[9] Broemeling, A.‐M., Watson, D.E. and Prebtani, F. (2007) Population patterns of chronic health conditions, co‐morbidity and healthcare use in Canada: implications for policy and practice. Healthcare Quarterly, 11, 70-76.
[10] Centre for Chronic Disease Prevention and Control (2001) Editorial Board for Respiratory Disease in Canada. Ottawa: Health Canada.
[11] Frühwirth‐Schnatter, S. (2006) Finite Mixture and Markov Switching Models. New York, NY: Springer. · Zbl 1108.62002
[12] Gold, M., Beitsch, L., Essien, J., Fleming, D., Getzen, T., Gostin, L., Isham, G., Kaplan, R., Lopez, W. and Mays, G. (2011) For the Public’s Health: The Role of Measurement in Action and Accountability. Washington, DC: National Academies Press.
[13] GOLD Executive Committee (2017) Pocket guide to COPD diagnosis, management and prevention: a guide for health care professionals 2017 report.
[14] Hobolth, A. and Stone, E.A. (2009) Simulation from endpoint‐conditioned, continuous‐time Markov chains on a finite state space with applications to molecular evolution. The Annals of Applied Statistics, 3, 1204-1231. · Zbl 1196.62139
[15] Jackson, C.H. and Sharples, L.D. (2002) Hidden Markov models for the onset and progression of bronchiolitis obliterans syndrome in lung transplant recipients. Statistics in Medicine, 21, 113-128.
[16] Jackson, C.H., Sharples, L.D., Thompson, S.G., Duffy, S.W. and Couto, E. (2003) Multistate Markov models for disease progression with classification error. Journal of the Royal Statistical Society: Series D (The Statistician), 52, 193-209.
[17] Jasra, A., Holmes, C.C. and Stephens, D.A. (2005) Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling. Statistical Science, 20, 50-67. · Zbl 1100.62032
[18] Kalbfleisch, J. and Lawless, J.F. (1985) The analysis of panel data under a Markov assumption. Journal of the American Statistical Association, 80, 863-871. · Zbl 0586.62136
[19] Lange, J., Hubbard, R., Inoue, L. and Minin, V. (2015) A joint model for multi‐state disease processes and random informative observation times, with applications to electronic medical records data. Biometrics, 71, 90-101. · Zbl 1419.62384
[20] Raffa, J. and Dubin, J. (2015) Multivariate longitudinal data analysis with mixed effects hidden Markov models. Biometrics, 71, 821-831. · Zbl 1419.62428
[21] Robert, C.P., Ryden, T. and Titterington, D.M. (2000) Bayesian inference in hidden Markov models through the reversible jump Markov chain Monte Carlo method. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 62, 57-75. · Zbl 0941.62090
[22] Rydén, T. (1996) An EM algorithm for estimation in Markov‐modulated Poisson processes. Computational Statistics & Data Analysis, 21, 431-447. · Zbl 0875.62405
[23] Shaban‐Nejad, A., Lavigne, M., Okhmatovskaia, A. and Buckeridge, D.L. (2017) PopHR: a knowledge‐based platform to support integration, analysis, and visualization of population health data. Annals of the New York Academy of Sciences, 1387, 44-53.
[24] Spiegelhalter, D., Best, N., Carlin, B. and Van Der Linde, A. (2002) Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64, 583-639. · Zbl 1067.62010
[25] Verma, A. (2015) Hospital readmissions: prediction and inference. PhD thesis, McGill University.
[26] Wang, X., Sontag, D. and Wang, F. (2014) Unsupervised learning of disease progression models. Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 24-27 August 2014. New York, NY: Association for Computing Machinery, pp. 85-94.
[27] Watanabe, S. (2010) Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. Journal of Machine Learning Research, 11, 3571-3594. · Zbl 1242.62024
[28] West, M. and Harrison, J. (1997) Bayesian Forecasting and Dynamic Models, 2nd edition. New York, NY: Springer. · Zbl 0871.62026
[29] Williams, J.P., Storlie, C.B., Therneau, T.M., JackJr, C.R. and Hannig, J. (2019) A Bayesian approach to multistate hidden Markov models: application to dementia progression. Journal of the American Statistical Association, 1-21.
[30] Wulfsohn, M.S. and Tsiatis, A.A. (1997) A joint model for survival and longitudinal data measured with error. Biometrics, 53, 330-339. · Zbl 0874.62140
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.