×

Non-parametric small area estimation using penalized spline regression. (English) Zbl 05563354

Summary: The paper proposes a small area estimation approach that combines small area random effects with a smooth, non-parametrically specified trend. By using penalized splines as the representation for the non-parametric trend, it is possible to express the non-parametric small area estimation problem as a mixed effect model regression. The resulting model is readily fitted by using existing model fitting approaches such as restricted maximum likelihood. We present theoretical results on the prediction mean-squared error of the estimator proposed and on likelihood ratio tests for random effects, and we propose a simple non-parametric bootstrap approach for model inference and estimation of the small area prediction mean-squared error. The applicability of the method is demonstrated on a survey of lakes in north-eastern USA.

MSC:

62-XX Statistics

Software:

SemiPar

References:

[1] Battese, An error-components model for prediction of county crop areas using survey and satellite data, J. Am. Statist. Ass. 83 pp 28– (1988)
[2] Butar, On measures of uncertainty of empirical bayes small-area estimators, J. Statist. Planng Inf. 112 pp 63– (2003) · Zbl 1033.62007
[3] Chernoff, On the distribution of the likelihood ratio, Ann. Math. Statist. 25 pp 573– (1954) · Zbl 0056.37102
[4] Claeskens, Restricted likelihood ratio lack-of-fit tests using mixed spline models, J. R. Statist. Soc. B 66 pp 909– (2004) · Zbl 1059.62041
[5] Clayton, Empirical Bayes estimates of age-standardized relative risks for use in disease mapping, Biometrics 43 pp 671– (1987)
[6] Coull, Simple incorporation of interactions into additive models, Biometrics 57 pp 539– (2001a) · Zbl 1209.62352
[7] Coull, Respiratory health and air pollution: additive mixed model analyses, Biostatistics 2 pp 337– (2001b) · Zbl 1154.62388
[8] Crainiceanu, Likelihood ratio tests in linear mixed models with one variance component, J. R. Statist. Soc. B 66 pp 165– (2004) · Zbl 1061.62027
[9] Crainiceanu, Exact likelihood ratio tests for penalised splines, Biometrika 92 pp 91– (2005) · Zbl 1068.62021
[10] Das, Mean squared error of empirical predictor, Ann. Statist. 32 pp 818– (2004) · Zbl 1092.62063
[11] Datta, A unified measure of uncertainty of estimated best linear unbiased predictors in small area estimation problems, Statist. Sin. 10 pp 613– (2000) · Zbl 1054.62566
[12] Eilers, Flexible smoothing with B-splines and penalties, Statist. Sci. 11 pp 89– (1996) · Zbl 0955.62562
[13] Fay, Estimates of income for small places: an application of James-Stein procedures to census data, J. Am. Statist. Ass. 74 pp 269– (1979)
[14] Ghosh, Generalized linear models for small-area esitmation, J. Am. Statist. Ass. 93 pp 273– (1998)
[15] Ghosh, Small area estimation: an appraisal, Statist. Sci. 9 pp 55– (1994) · Zbl 0955.62538
[16] Hall, On parametric bootstrap methods for small area prediction, J. R. Statist. Soc. B 68 pp 221– (2006) · Zbl 1100.62039
[17] Jiang, Mixed model prediction and small area estimation, Test 15 pp 1– (2006)
[18] Kackar, Approximations for standard errors of estimators of fixed and random effects in mixed linear models, J. Am. Statist. Ass. 79 pp 853– (1984) · Zbl 0557.62066
[19] Lahiri, On the impact of bootstrap in survey sampling and small-area estimation, Statist. Sci. 18 pp 199– (2003) · Zbl 1331.62076
[20] Larsen, Designs for evaluating local and regional scale trends, Bioscience 51 pp 1049– (2001)
[21] McCulloch, Generalized, Linear and Mixed Models (2001)
[22] Messer, An EPA program for monitoring ecological status and trends, Environ. Monit. Assessmnt 17 pp 67– (1991)
[23] Nychka, Case Studies in Environmental Statistics pp 159– (1998) · doi:10.1007/978-1-4612-2226-2
[24] Parise, Incorporation of historical controls using semiparametric mixed models, Appl. Statist. 50 pp 31– (2001) · Zbl 1021.62095
[25] Patterson, Recovery of inter-block information when block sizes are unequal, Biometrika 58 pp 545– (1971) · Zbl 0228.62046
[26] Pfeffermann, Mean square error approximation in small area estimation by use of parametric and nonparametric bootstrap, Proc. Surv. Res. Meth. Sect. Am. Statist. Ass. pp 4167– (2004)
[27] Prasad, The estimation of the mean squared error of small-area estimators, J. Am. Statist. Ass. 85 pp 163– (1990) · Zbl 0719.62064
[28] Rao, Small Area Estimation (2003) · Zbl 1403.62022
[29] Ruppert, Selecting the number of knots for penalized splines, J. Computnl Graph. Statist. 11 pp 735– (2002)
[30] Ruppert, Semiparametric Regression (2003)
[31] Self, Asymptotic properties of maximum likelihood and likelihood ratio tests under nonstandard conditions, J. Am. Statist. Ass. 82 pp 605– (1987) · Zbl 0639.62020
[32] Sen, An appraisal of some aspects of statistical inference under inequality constraints, J. Statist. Planng Inf. 107 pp 3– (2002) · Zbl 1030.62023
[33] Stram, Variance component testing in the longitudinal mixed effects model, Biometrics 50 pp 1171– (1994) · Zbl 0826.62054
[34] Vu, Generalization of likelihood ratio tests under nonstandard conditions, Ann. Statist. 25 pp 897– (1997) · Zbl 0873.62022
[35] Wand, Smoothing and mixed models, Computnl Statist. 18 pp 223– (2003) · Zbl 1050.62049 · doi:10.1007/s001800300142
[36] Wetzel, Limnology (1975)
[37] Zheng, Penalized spline nonparametric mixed models for inference about a finite population mean from two-stage samples, Surv. Methodol. 30 pp 209– (2004)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.