×

Nonparametric regression estimation with missing data. (English) Zbl 0897.62038

Summary: For nonparametric regression, there might be a part of the design points on which the observations are missing. A fundamental issue of interest is to study the impact of the missing observations on the performance of kernel estimators. Utilizing the estimation idea of P. E. Cheng and L. J. Wei [Int. Stat. Symp. Vol. 1, Taipei, ROC, 97-112 (1986)], the effect of missing is precisely quantified through the asymptotic mean square error (AMSE) for the local linear smoother (LLS) of J. Fan [Ann. Stat. 21, No. 1, 196-216 (1993; Zbl 0773.62029)].
An imputed LLS which adjusts for the effect of missing by substituting the missing observations with the respective kernel estimates is also investigated. The imputed LLS is analyzed by its AMSE. This AMSE shows clearly how the kernel function and the value of bandwidth used in constructing the substitutes effect the performance of the imputed LLS. Simulations demonstrate that the derived asymptotic results hold for reasonable sample sizes.

MSC:

62G07 Density estimation
62G20 Asymptotic properties of nonparametric inference

Citations:

Zbl 0773.62029
Full Text: DOI

References:

[1] Afifi, A. A.; Elashoff, R. M., Missing observations in multivariate statistics III: large sample analysis of simple linear regression, J. Amer. Statist. Assoc., 64, 337-358 (1969) · Zbl 0175.17103
[2] Cheng, P. E., Applications of kernel regression estimation: a survey, Commun. Statist. Theory Meth., 19, 4103-4134 (1990)
[3] Cheng, P. E., Nonparametric estimation of mean functionals with data missing at random, J. Amer. Statist. Assoc., 89, 81-87 (1994) · Zbl 0800.62213
[4] Cheng, P. E.; Chu, C. K., Kernel estimation of distribution function and quantile with missing data (1992), Institute of Statistical Science, Academia Sinica: Institute of Statistical Science, Academia Sinica Taipei, Taiwan, Unpublished paper · Zbl 0839.62038
[5] Cheng, P. E.; Wei, L. J., Nonparametric inference under ignorable missing data process and treatment assignment, (1986 Internat. Statistical Symposium, Vol. 1 (1986)), 97-112, Taipei, ROC
[6] Cochran, W. G., The effectiveness of adjustment by subclassification in removing bias in observational studies, Biometrics, 24, 295-313 (1968)
[7] Fan, J., Design-adaptive nonparametric regression, J. Amer. Statist. Assoc., 87, 998-1004 (1992) · Zbl 0850.62354
[8] Fan, J., Local linear regression smoothers and their minimax efficiencies, Ann. Statist., 21, 196-216 (1993) · Zbl 0773.62029
[9] Fan, J.; Gijbels, I., Variable bandwidth and local linear regression smoothers, Ann. Statist., 20, 2008-2036 (1992) · Zbl 0765.62040
[10] Little, R. J.A.; Rubin, D. B., Statistical Analysis with Missing Data (1987), Wiley: Wiley New York · Zbl 0665.62004
[11] Mueller, H. G., Nonparametric Analysis of Longitudinal Data, (Lecture Notes in Statistics (1988), Springer: Springer Berlin), No. 46 · Zbl 0664.62031
[12] Neyman, J., Contribution to the theory of sampling human populations, J. Amer. Statist. Assoc., 33, 101-116 (1938) · Zbl 0018.22603
[13] Titterington, D. M.; Mill, G. M., Kernel-based density estimates from incomplete data, J. Roy. Statist. Sssoc. Ser., B 45, 258-266 (1983) · Zbl 0541.62026
[14] Yates, F., The analysis of replicated experiments when the field results are incomplete, Emporium J. Exp. Agriculture, 1, 129-142 (1993)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.