×

A Bayesian chi-squared goodness-of-fit test for censored data models. (English) Zbl 1192.62074

Summary: We propose a Bayesian chi-squared model diagnostic for analysis of data subject to censoring. The test statistic has the form of Pearson’s chi-squared test statistic and is easy to calculate from standard output of Markov chain Monte Carlo algorithms. The key innovation of this diagnostic is that it is based only on observed failure times. Because it does not rely on the imputation of failure times for observations that have been censored, we show that under heavy censoring it can have higher power for detecting model departures than a comparable test based on the complete data.
In a simulation study, we show that tests based on this diagnostic exhibit comparable power and better nominal Type I error rates than a commonly used alternative test proposed by M. G. Akritas [J. Am. Stat. Assoc. 83, No. 401, 222–230 (1988; Zbl 0653.62036)]. An important advantage of the proposed diagnostic is that it can be applied to a broad class of censored data models, including generalized linear models and other models with nonidentically distributed and nonadditive error structures. We illustrate the proposed model diagnostic for testing the adequacy of two parametric survival models for Space Shuttle main engine failures.

MSC:

62F15 Bayesian inference
62N01 Censored data models
62G10 Nonparametric hypothesis testing
62N03 Testing in survival analysis and censored data
65C40 Numerical analysis or methods applied to Markov chains
62N05 Reliability and life testing
65C60 Computational problems in statistics (MSC2010)

Citations:

Zbl 0653.62036
Full Text: DOI

References:

[1] Akritas, Pearson-type goodness-of-fit tests: The univariate case, Journal of the American Statistical Association 83 pp 222– (1988) · Zbl 0653.62036 · doi:10.2307/2288944
[2] Akritas, Pearson-type goodness-of-fit tests for regression, Canadian Journal of Statistics 25 pp 359– (1997) · Zbl 0911.62037 · doi:10.2307/3315784
[3] Bayarri, P values for composite null models, Journal of the American Statistical Association 95 pp 1127– (2000) · Zbl 1004.62022 · doi:10.2307/2669749
[4] Caraux, Bounds on the distribution function of order statistics for dependent variates, Statistics and Probability Letters 14 pp 103– (1992) · Zbl 0761.62012 · doi:10.1016/0167-7152(92)90071-C
[5] de Uña-Álvarez, Chi-squared goodness-of-fit theory under proportional censorship, Journal of Statistical Planning and Inference 109 pp 101– (2003) · Zbl 1038.62046 · doi:10.1016/S0378-3758(02)00306-3
[6] Gelfand, Bayesian analysis of proportional hazards model built from monotone functions, Biometrics 51 pp 843– (1995) · Zbl 0868.62028 · doi:10.2307/2532986
[7] Gelfand, Model determination using predictive distributions with implementation via sampling-based methods (with discussion), Bayesian Statistics 4 pp 147– (1992)
[8] Gelman, Posterior predictive assessment of model fitness via realized discrepancies (with discussion), Statistica Sinica 6 pp 733– (1996) · Zbl 0859.62028
[9] Guttman, The use of the concept of a future observation in goodness-of-fit problems, Journal of the Royal Statistical Society, Series B 29 pp 83– (1967) · Zbl 0158.37305
[10] Hjort, Goodness of fit tests in models for life history data based on cumulative hazard rates, Annals of Statistics 18 pp 1259– (1990) · Zbl 0714.62037 · doi:10.1214/aos/1176347749
[11] Hollander, A chi-squared goodness-of-fit test for randomly censored data, Journal of the American Statistical Association 87 pp 458– (1992) · Zbl 0763.62027 · doi:10.2307/2290277
[12] Johnson, A Bayesian chi-squared test for goodness-of-fit, Annals of Statistics 32 pp 2361– (2004) · Zbl 1068.62028 · doi:10.1214/009053604000000616
[13] Johnson, Bayesian model assessment using pivotal quantities, Bayesian Analysis 2 pp 1– (2007) · Zbl 1331.62147 · doi:10.1214/07-BA229
[14] Kass, Bayes factors, Journal of the American Statistical Association 90 pp 773– (1995) · Zbl 0846.62028 · doi:10.2307/2291091
[15] Koehler, Chi-squared goodness-of-fit tests: Cell selection and power, Communications in Statistics-Simulation and Computation 19 pp 1265– (1990) · Zbl 0850.62382 · doi:10.1080/03610919008812915
[16] Li, Generalized Pearson-Fisher chi-square goodness-of-fit tests, with application to models with life history data, Annals of Statistics 21 pp 772– (1993) · Zbl 0788.62020 · doi:10.1214/aos/1176349151
[17] Rubin, Bayesianly justifiable and relevant frequency calculations for applied statistician, Annals of Statistics 12 pp 1151– (1984) · Zbl 0555.62010 · doi:10.1214/aos/1176346785
[18] Robins, Asymptotic distribution of P values in composite null models, Journal of the American Statistical Association 95 pp 1143– (2000) · Zbl 1072.62522 · doi:10.2307/2669750
[19] Rychlik, Stochastically extremal distributions of order statistics for dependent samples, Statistics and Probability Letters 13 pp 337– (1992) · Zbl 0743.62018 · doi:10.1016/0167-7152(92)90105-E
[20] Sahu, A Weibull regression model with gamma frailties for multivariate survival data, Lifetime Data Analysis 3 pp 123– (1997) · Zbl 0896.62125 · doi:10.1023/A:1009605117713
[21] Shih, Assessing gamma frailty models for clustered failure time data, Lifetime Data Analysis 1 pp 205– (1995) · Zbl 0836.62097 · doi:10.1007/BF00985771
[22] Sinha, Semiparametric Bayesian analysis of survival data, Journal of the American Statistical Association 92 pp 1195– (1997) · Zbl 1067.62520 · doi:10.2307/2965586
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.