×

Outlier detection method in GEEs. (English) Zbl 1309.62126

Summary: The generalized estimating equations (GEEs) method has become quite useful in modeling correlated data. However, diagnostic tools to check that the selected final model fits the data as accurately as possible have not been explored intensively. In this paper, an outlier detection technique is developed based on the use of the “working” score test statistic to test an appropriate mean-shift model in the context of longitudinal studies based on GEEs. Through a simulation study it has been shown that this method correctly singled out the outlier when the data set had a known outlier. The method is applied to a set of data to illustrate the outlier detection procedure in GEEs.

MSC:

62J12 Generalized linear models (logistic models)
65C60 Computational problems in statistics (MSC2010)
62P10 Applications of statistics to biology and medical sciences; meta analysis
Full Text: DOI

References:

[1] Barnett, V. and Lewis, T. (1994). Outliers in Statistical Data, 3rd edition. Wiley, Chichester, UK. · Zbl 0801.62001
[2] Breslow, N. and Clayton, D. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Association88, 9-24. · Zbl 0775.62195
[3] Chang, Y. C. (2000). Residuals analysis of the generalized linear models for longitudinal data. Statistics in Medicine19, 1277-1293.
[4] Cook, R. D. and Weisberg, S. (1982). Residuals and Influence in Regression. Chapman and Hall, New York, NY. · Zbl 0564.62054
[5] Diggle, P., Heagerty, P., Liang, K. and Zeger, S. (2003). Analysis of Longitudinal Data, 2nd edition. Oxford University Press, Oxford, UK. · Zbl 1031.62002
[6] Jung, K. M. (2008). Local influence in Generalized estimating equations. Scandinavian Journal of Statistics35, 286-294. · Zbl 1164.62029
[7] Leppik, I. E., Dreifuss, E., Bowman‐Cloyd, T. and Santilli, T. (1985). A double‐blind crossover evaluation of progabide in partial seizures. Neurology35, 285.
[8] Liang, K. Y. and Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika73, 13-22. · Zbl 0595.62110
[9] Macke, J. H., Berens, P., Ecker, A. S., Tolias, A. S. and Bethge, M. (2009). Generating Spike Trains with Specified Correlation Coefficients. Neural Computation21, 397-423. · Zbl 1155.92011
[10] Miller, R. (1966). Simultaneous Inference. McGraw Hill, New York, NY. · Zbl 0192.25702
[11] Oh, S., Carriere, K. C. and Park, T. (2008). Model diagnostic plots for repeated measures data using the generalized estimating equations approach. Computational Statistics and Data Analysis53, 222-232. · Zbl 1452.62104
[12] Pardo, M. C. and Alonso, R. (2012a). Influence measures based on the volume of confidence ellipsoids for GEE. Biometrical Journal54, 552-567. · Zbl 1316.62160
[13] Pardo, M. C. and Alonso, R. (2012b). A generalized Q-Q plot for longitudinal data. Journal of Applied Statistics39, 2349-2362. · Zbl 1514.62259
[14] Park, C. G. and Shin, D. W. (1998). An algorithm for generating correlated random variables in a class of infinitely divisible distributions. Journal of Statistical Computation and Simulation61, 127-139. · Zbl 0945.65003
[15] Pregibon, D. (1981). Logistic Regression Diagnostics. The Annals of Statistics9, 705-724. · Zbl 0478.62053
[16] Preisser, J. S. and Qaqish, B. F. (1996). Deletion diagnostics for generalized estimating equations. Biometrika83, 551-562. · Zbl 0866.62041
[17] Rotnitzky, A. and Jewell, N. (1990). Hypothesis testing of regression parameters in semiparametric generalized linear models for cluster correlated data. Biometrika77, 485-497. · Zbl 0734.62075
[18] Tan, M., Qu, Y. and Kutner, M. H. (1997). Model diagnostics for marginal regression analysis of correlated binary data. Communications in Statistics ‐ Simulation and Computation26, 539-558. · Zbl 0900.62392
[19] Thall, P. F. and Vail, C. S. (1990). Some covariance models for longitudinal count data. Biometrical Journal46, 441-452.
[20] Venezuela, M. K., Botter, D. A. and Sandoval, M. C. (2007). Diagnostic techniques in generalized estimating equations. Journal of Statistical Computation and Simulation77, 879-888. · Zbl 1127.62064
[21] Wei, W. H. and Fung, W. K. (1999). The mean‐shift outlier model in general weighted regression and its applications. Computational Statistics and Data Analysis30, 429-441. · Zbl 1043.62523
[22] Williams, D. A. (1987). Generalized linear model diagnostics using the deviance and single case deletions. Applied Statistics36, 181-191. · Zbl 0646.62062
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.