×

Likelihood-based inference for linear mixed-effects models using the generalized hyperbolic distribution. (English) Zbl 07858726

Summary: In this paper, we develop statistical methodology for the analysis of data under nonnormal distributions, in the context of mixed effects models. Although the multivariate normal distribution is useful in many cases, it is not appropriate, for instance, when the data come from skewed and/or heavy-tailed distributions. To analyse data with these characteristics, in this paper, we extend the standard linear mixed effects model, considering the family of generalized hyperbolic distributions. We propose methods for statistical inference based on the likelihood function, and due to its complexity, the EM algorithm is used to find the maximum likelihood estimates with the standard errors and the exact likelihood value as a by-product. We use simulations to investigate the asymptotic properties of the expectation-maximization algorithm (EM) estimates and prediction accuracy. A real example is analysed, illustrating the usefulness of the proposed methods.
© 2023 John Wiley & Sons Ltd.

MSC:

62-XX Statistics

Software:

R; QRM
Full Text: DOI

References:

[1] Arellano‐Valle, R. B., Bolfarine, H., & Lachos, V. H. (2005). Skew‐normal linear mixed models. Journal of Data Science, 3, 415-438.
[2] Arellano‐Valle, R. B., Ferreira, C. S., & Genton, M. G. (2018). Scale and shape mixtures of multivariate skew‐normal distributions. Journal of Multivariate Analysis, 166, 98-110. · Zbl 1499.62161
[3] Azzalini, A., & Dalla‐Valle, A. (1996). The multivariate skew‐normal distribution. Biometrika, 83(4), 715-726. · Zbl 0885.62062
[4] Barndorff‐Nielsen, O. E. (1997). Normal inverse Gaussian distributions and stochasctic volatility modelling. Scandinavian Journal of Statistics, 24, 1-14. · Zbl 0934.62109
[5] Branco, M. D., & Dey, D. K. (2001). A general class of multivariate skew‐elliptical distributions. Journal of Multivariate Analysis, 79, 99-113. · Zbl 0992.62047
[6] Browne, R. P., & McNicholas, P. D. (2015). A mixture of generalized hyperbolic distributions. Canadian Journal of Statistics, 43(2), 176-198. · Zbl 1320.62144
[7] Demidenko, E. (2013). Mixed models: Theory and applications with r: John Wiley & Sons. · Zbl 1276.62049
[8] Dempster, A., Laird, N., & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39, 1-38. · Zbl 0364.62022
[9] Deng, X., & Yao, J. (2018). On the property of multivariate generalized hyperbolic distribution and the Stein‐type inequality. Communications in Statistics‐Theory and Methods, 47(21), 5346-5356. · Zbl 1508.62145
[10] Diggle, P., Diggle, P. J., Heagerty, P., Heagerty, P. J., Liang, K.‐Y., & Zeger, S. (2002). Analysis of longitudinal data: Oxford University Press.
[11] Fang, K. T., Kotz, S., & Ng, K. W. (1990). Symmetric multivariate and related distributions. London: Chapman & Hall. · Zbl 0699.62048
[12] Hellmich, M., & Kassberger, S. (2011). Efficient and robust portfolio optimization in the multivariate generalized hyperbolic framework. Quantitative Finance, 11(10), 1503-1516. · Zbl 1258.91197
[13] Ho, H. J., & Lin, T.‐I. (2010). Robust linear mixed models using the skew t distribution with application to schizophrenia data. Biometrical Journal, 52(4), 449-469. · Zbl 1197.62055
[14] Jorgensen, B. (1982). Statistical properties of the generalized inverse Gaussian distribution. Heidelberg: Springer Verlag. · Zbl 0486.62022
[15] Kotz, S., Kozubowski, T., & Podgorski, K. (2012). The laplace distribution and generalizations: A revisit with applications to communications, economics, engineering, and finance: Springer Science & Business Media.
[16] Lachos, V. H., Bolfarine, H., Arellano‐Valle, R. B., & Montenegro, L. C. (2007). Likelihood based inference for multivariate skew‐normal regression models. Communications in Statistics-Theory and Methods, 36, 1769-1786. · Zbl 1124.62037
[17] Lachos, V. H., Ghosh, P., & Arellano‐Valle, R. B. (2010). Likelihood based inference for skew normal independent linear mixed models. Statistica Sinica, 20, 303-322. · Zbl 1186.62071
[18] Laird, N. M., & Ware, J. H. (1982). Random‐effects models for longitudinal data. Biometrics, 38(4), 963-974. · Zbl 0512.62107
[19] Lange, K. L., Little, R., & Taylor, J. (1989). Robust statistical modeling using \(t\) distribution. Journal of the American Statistical Association, 84, 881-896.
[20] Lange, K. L., & Sinsheimer, J. S. (1993). Normal/independent distributions and their applications in robust regression. Journal of Computational and Graphical Statistics, 2, 175-198.
[21] Lin, T. I., & Lee, J. C. (2008). Estimation and prediction in linear mixed models with skew‐normal random effects for longitudinal data. Statistics in Medicine, 27, 1490-1507.
[22] Lin, T.‐I., & Wang, W.‐L. (2013). Multivariate skew‐normal at linear mixed models for multi‐outcome longitudinal data. Statistical Modelling, 13(3), 199-221. · Zbl 07257455
[23] Maleki, M., Wraith, D., & Arellano‐Valle, R. B. (2019a). A flexible class of parametric distributions for Bayesian linear mixed models. Test, 28(2), 543-564. · Zbl 1420.62288
[24] Maleki, M., Wraith, D., & Arellano‐Valle, R. B. (2019b). Robust finite mixture modeling of multivariate unrestricted skew‐normal generalized hyperbolic distributions. Statistics and Computing, 29(3), 415-428. · Zbl 1430.62105
[25] McLachlan, G. J., & Krishnan, T. (2008). The EM algorithm and extensions(2nd ed.).: Wiley. · Zbl 1165.62019
[26] McNeil, A. J., Frey, R., & Embrechts, P. (2015). Quantitative risk management: Concepts, techniques and tools‐revised edition: Princeton University Press. · Zbl 1337.91003
[27] Mirfarah, E., Naderi, M., Lin, T.‐I., & Wang, W.‐L. (2022). Multivariate measurement error models with normal mean-variance mixture distributions. Stat, 11(1), 1-17. · Zbl 07853639
[28] O’Hagan, A., Murphy, T. B., Gormley, I. C., McNicholas, P. D., & Karlis, D. (2016). Clustering with the multivariate normal inverse Gaussian distribution. Computational Statistics & Data Analysis, 93, 18-30. · Zbl 1468.62151
[29] Øigård, T. A., Hanssen, A., Hansen, R. E., & Godtliebsen, F. (2005). EM‐estimation and modeling of heavy‐tailed processes with the multivariate normal inverse Gaussian distribution. Signal Processing, 85(8), 1655-1673. · Zbl 1160.94355
[30] Pinheiro, J. C., Liu, C. H., & Wu, Y. N. (2001). Efficient algorithms for robust estimation in linear mixed‐effects models using a multivariate \(t\)‐distribution. Journal of Computational and Graphical Statistics, 10, 249-276.
[31] R Core Team (2019). R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/
[32] Schumacher, F. L., Lachos, V. H., & Matos, L. A. (2021). Scale mixture of skew‐normal linear mixed models with within‐subject serial dependence. Statistics in Medicine, 40(7), 1790-1810.
[33] Verbeke, G., & Molenberghs, G. (2001). Linear mixed models for longitudinal data. New York: Springer. · Zbl 1106.62069
[34] Vilca, F., Balakrishnan, N., & Zeller, C. B. (2014). Multivariate skew‐normal generalized hyperbolic distribution and its properties. Journal of Multivariate Analysis, 128, 73-85. · Zbl 1352.62080
[35] Wang, W. L., & Lin, T. I. (2020). Multivariate-t linear mixed models with censored responses, intermittent missing values and heavy tails. Statistical Methods in Medical Research, 29(5), 1288-1304.
[36] Wang, W. L., & Lin, T. I. (2022). Multivariate linear mixed models with censored and nonignorable missing outcomes, with application to aids studies. Biometrical Journal, 64(7), 1325-1339. · Zbl 1523.62153
[37] Wang, W. L., & Lin, T. I. (2023). Flexible modeling of multiple nonlinear longitudinal trajectories with censored and non‐ignorable missing outcomes. Statistical Methods in Medical Research, 32(3), 593-608.
[38] Wang, W.‐L., Lin, T.‐I., & Lachos, V. H. (2018). Extending multivariate‐t linear mixed models for multiple longitudinal data with censored responses and heavy tails. Statistical Methods in Medical Research, 27(1), 48-64.
[39] Watanabe, M., & Yamaguchi, K. (2003). The EM algorithm and related statistical models: CRC Press.
[40] Wei, Y., Tang, Y., & McNicholas, P. D. (2019). Mixtures of generalized hyperbolic distributions and mixtures of skew‐t distributions for model‐based clustering with incomplete data. Computational Statistics and Data Analysis, 130, 18-41. · Zbl 1469.62162
[41] Zhang, D., & Davidian, M. (2001). Linear mixed models with flexible distributions of random effects for longitudinal data. Biometrics, 57, 795-802. · Zbl 1209.62087
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.