×

Informative statistical analyses using smooth goodness of fit tests. (English) Zbl 1211.62062

Summary: We propose a methodology for informative goodness of fit testing that combines the merits of both hypothesis testing and nonparametric density estimation. In particular, we construct a data-driven smooth test that selects the model using a weighted integrated squared error (WISE) loss function. When the null hypothesis is rejected, we suggest plotting the estimate of the selected model. This estimate is optimal in the sense that it minimises the WISE loss function. This procedure may be particularly helpful when the components of the smooth test are not diagnostic for detecting moment deviations. Although this approach relies mostly on existing theory of (generalised) smooth tests and nonparametric density estimation, there are a few issues that need to be resolved so as to make the procedure applicable to a large class of distributions. In particular, we will need an estimator of the variance of the smooth test components that is consistent in a large class of distributions for which the nuisance parameters are estimated by the method of moments. This estimator may also be used to construct diagnostic component tests. The properties of the new variance estimator, the new diagnostic components and the proposed informative testing procedure are evaluated in several simulation studies. We demonstrate the new methods on testing for the logistic and extreme value distributions.

MSC:

62G07 Density estimation
62G10 Nonparametric hypothesis testing
65C60 Computational problems in statistics (MSC2010)

References:

[1] Akaike, H.; Petrov, B. (ed.); Csàki, F. (ed.), Information theory and an extension of the maximum likelihood principle. In, 267-281 (1973), Budapest · Zbl 0283.62006
[2] Akaike, H., A new look at statistical model identification, I.E.E.E. Trans. Auto. Control, 19, 716-723 (1974) · Zbl 0314.62039 · doi:10.1109/TAC.1974.1100705
[3] Anderson, G.; Figueiredo, R., An adaptive orthogonal-series estimator for probability density functions, Annals of Statistics, 8, 347-376 (1980) · Zbl 0426.62023 · doi:10.1214/aos/1176344958
[4] Bain, L., Easterman, J., Engelhardt, M., 1973. A study of life-testing models and statistical analyses for the logistic distribution. Technical Report ARL-73-0009, Aerospace Research Laboratories, Wright Patterson AFB. · Zbl 0272.62019
[5] Baringhaus, L.; Henze, N., Limit distributions for Mardia measure of multivariate skewness, Annals of Statistics, 20, 1889-1902 (1992) · Zbl 0767.62041 · doi:10.1214/aos/1176348894
[6] Barton, D., On Neyman’s smooth test of goodness of fit and its power with respect to a particular system of alternatives, Skandinavisk Aktuarietidskrift, 36, 24-63 (1953) · Zbl 0053.10203
[7] Bickel, P.; Ritov, Y.; Stoker, T., Tailor-made tests of goodness of fit to semiparametric hypotheses, Annals of Statistics, 34, 721-741 (2006) · Zbl 1092.62050 · doi:10.1214/009053606000000137
[8] Boos, D., On generalized score tests, The American Statistician, 46, 327-333 (1992)
[9] Buckland, S., Fitting density functions with polynomials, Applied Statistics, 41, 63-76 (1992) · Zbl 0825.62433 · doi:10.2307/2347618
[10] Cencov, N., Evaluation of an unknown distribution density from observations, Soviet. Math., 3, 1559-1562 (1962) · Zbl 0133.11801
[11] Claeskens, G.; Hjort, N., Goodness of fit via non-parametric likelihood ratios, Scandinavian Journal of Statistics, 31, 487-513 (2004) · Zbl 1065.62056 · doi:10.1111/j.1467-9469.2004.00403.x
[12] Clutton-Brock, M., Density estimation using exponentials of orthogonal series, Journal of the American Statistical Association, 85, 760-764 (1990) · doi:10.1080/01621459.1990.10474937
[13] Diggle, P.; Hall, P., The selection of terms in an orthogonal series density estimator, Journal of the American Statistical Association, 81, 230-233 (1986) · Zbl 0589.62024 · doi:10.1080/01621459.1986.10478265
[14] Efron, B.; Tibshirani, R., Using specially designed exponential families for density estimation, Annals of Statistics, 24, 2431-2461 (1996) · Zbl 0878.62028 · doi:10.1214/aos/1032181161
[15] Emerson, P., Numerical construction of orthogonal polynomials from a general recurrence formula, Biometrics, 24, 695-701 (1968) · doi:10.2307/2528328
[16] Engelhardt, M., Simple linear estimation of the parameters of the logistic distribution from a complete or censored sample, Journal of the American Statistical Association, 70, 899-902 (1975) · Zbl 0319.62021 · doi:10.1080/01621459.1975.10480320
[17] Eubank, R.; LaRiccia, V.; Rosenstein, R., Test statistics derived as components of Pearson’s phi-squared distance measure, Journal of the American Statistical Association, 82, 816-825 (1987) · Zbl 0664.62044 · doi:10.1080/01621459.1987.10478503
[18] Gajek, G., On improving density estimators which are not bona fide functions, Annals of Statistics, 14, 1612-1618 (1986) · Zbl 0623.62034 · doi:10.1214/aos/1176350182
[19] Glad, I.; Hjort, N.; Ushakov, N., Correction of density estimators that are not densities, Scandinavian Journal of Statistics, 30, 415-427 (2003) · Zbl 1051.60037 · doi:10.1111/1467-9469.00339
[20] Hall, W.; Mathiason, D., On large-sample estimation and testing in parametric models, International Statistical Review, 58, 77-97 (1990) · Zbl 0715.62058 · doi:10.2307/1403475
[21] Henze, N., Do components of smooth tests of fit have diagnostic properties, Metrika, 45, 121-130 (1997) · Zbl 1030.62505 · doi:10.1007/BF02717098
[22] Henze, N.; Klar, B., Properly rescaled components of smooth tests of fit are diagnostic, Australian Journal of Statistics, 38, 61-74 (1996) · Zbl 0861.62019 · doi:10.1111/j.1467-842X.1996.tb00364.x
[23] Hjort, N.; Glad, I., Nonparametric density estimation with a parametric start, Annals of Statistics, 23, 882-904 (1995) · Zbl 0838.62027 · doi:10.1214/aos/1176324627
[24] Kallenberg, W.; Ledwina, T., Consistency and Monte Carlo simulation of a data driven version of smooth goodness-of-fit tests, Annals of Statistics, 23, 1594-1608 (1995) · Zbl 0847.62035 · doi:10.1214/aos/1176324315
[25] Kallenberg, W.; Ledwina, T., Data-driven smooth tests when the hypothesis is composite, Journal of the American Statistical Association, 92, 1094-1104 (1997) · Zbl 1067.62534 · doi:10.1080/01621459.1997.10474065
[26] Kallenberg, W.; Ledwina, T.; Rafajlowicz, E., Testing bivariate independence and normality (1997) · Zbl 0895.62059
[27] Klar, B., Diagnostic smooth tests of fit, Metrika, 52, 237-252 (2000) · Zbl 1093.62518 · doi:10.1007/PL00003984
[28] Ledwina, T., Data-driven version of Neyman’s smooth test of fit, Journal of the American Statistical Association, 89, 1000-1005 (1994) · Zbl 0805.62022 · doi:10.1080/01621459.1994.10476834
[29] Lehmann, E., 1999. Elements of Large-Sample Theory. Springer, New York. · Zbl 0914.62001 · doi:10.1007/b98855
[30] Mardia, K.; Kent, J., Rao score tests for goodness-of-fit and independence, Biometrika, 78, 355-363 (1991) · Zbl 0735.62056 · doi:10.1093/biomet/78.2.355
[31] Rayner J., Best D., 1989. Smooth Tests of Goodness-of-Fit. Oxford University Press, New York. · Zbl 0731.62064
[32] Rayner, J.; Best, D.; Mathews, K., Interpreting the skewness coefficient, Communications in Statistics — Theory and Methods, 24, 593-600 (1995) · Zbl 0825.62298 · doi:10.1080/03610929508831509
[33] Rayner, J.; Best, D.; Thas, O., Generalised smooth tests of goodness of fit, Journal of Statistical Theory and Practice, 3, 665-679 (2009) · Zbl 1211.62077 · doi:10.1080/15598608.2009.10411953
[34] Rayner, J., Thas, O., Best, D., 2009b. Smooth Tests of Goodness of Fit. Wiley, New York, USA. · Zbl 1171.62015 · doi:10.1002/9780470824443
[35] Rayner, J.; Thas, O.; Boeck, B., A generalised Emerson recurrence relation, Australian and New Zealand Journal of Statistics, 50, 235-240 (2008) · Zbl 1337.60014 · doi:10.1111/j.1467-842X.2008.00514.x
[36] Schwarz, G., Estimating the dimension of a model, Annals of Statistics, 6, 461-464 (1978) · Zbl 0379.62005 · doi:10.1214/aos/1176344136
[37] Stuart, A., Ord, J., 1994. Kendall’s Advanced Theory of Statistics. Arnold / Halsted, London. · Zbl 0727.62002
[38] Tarter, M., An introduction to the implementation and theory of nonparametric density estimation, The American Statistician, 30, 105-112 (1976) · Zbl 0336.62031
[39] van der Vaart, A., 1998. Asymptotic Statistics. Cambridge University Press, Cambridge. · Zbl 0943.62002 · doi:10.1017/CBO9780511802256
[40] Wasserman, L., 2005. All of Nonparametric Statistics. Springer. · Zbl 1099.62029
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.