×

Bayesian propensity scores for high-dimensional causal inference: a comparison of drug-eluting to bare-metal coronary stents. (English) Zbl 1400.62288

Summary: High-dimensional data provide many potential confounders that may bolster the plausibility of the ignorability assumption in causal inference problems. Propensity score methods are powerful causal inference tools, which are popular in health care research and are particularly useful for high-dimensional data. Recent interest has surrounded a Bayesian treatment of propensity scores in order to flexibly model the treatment assignment mechanism and summarize posterior quantities while incorporating variance from the treatment model. We discuss methods for Bayesian propensity score analysis of binary treatments, focusing on modern methods for high-dimensional Bayesian regression and the propagation of uncertainty. We introduce a novel and simple estimator for the average treatment effect that capitalizes on conjugacy of the beta and binomial distributions. Through simulations, we show the utility of horseshoe priors and Bayesian additive regression trees paired with our new estimator, while demonstrating the importance of including variance from the treatment regression model. An application to cardiac stent data with almost 500 confounders and 9000 patients illustrates approaches and facilitates comparison with existing alternatives. As measured by a falsifiability endpoint, we improved confounder adjustment compared with past observational research of the same problem.

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis
62F15 Bayesian inference

References:

[1] Austin, P. C. (2010). The performance of different propensity‐score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies. Statistics in Medicine, 29, 2137-2148.
[2] Austin, P. C., & Stuart, E. A. (2015). Moving toward best practice when using inverse probability of treatment weighting (IPW) using the propensity score to estimate causal treatment effects in observational studies. Statistics in Medicine, 34, 3661-3679.
[3] Bonaa, K. H., Mannsverk, J., Wiseth, R., Aaberge, L., Myreng, Y., Nygard, O., … NORSTENT Investigators. (2016). Drug‐eluting or bare‐metal stents for coronary artery disease. New England Journal of Medicine, 375, 1242-1252.
[4] Carvahlo, C. M., Polson, N. G., & Scott, J. G. (2010). The horseshoe estimator for sparse signals. Biometrika, 97, 465-480. · Zbl 1406.62021
[5] Cefalu, M., Dominici, F., Arvold, N., & Parmigiani, G. (2016). Model averaged double robust estimation. Biometrics, 73, 410-421. · Zbl 1372.62058
[6] Chipman, H. A., George, E. I., & McCulloch, R. E. (2010). BART: Bayesian additive regression trees. Annals of Applied Statistics, 4, 266-298. · Zbl 1189.62066
[7] Farrell, M. H. (2015). Robust inference for average treatment effects with possibly more covariates than observations. Journal of Econometrics, 189, 1-23. · Zbl 1337.62113
[8] Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2014). Bayesian data analysis (3rd ed.). Boca Raton, FL: CRC Press. · Zbl 1279.62004
[9] Ghosh, D. (2011). Propensity score modeling in observational studies using dimension reduction methods. Statistics and Probability Letters, 81, 813-820. · Zbl 1218.62047
[10] Ghosh, D., Zhu, Y., & Coffman, D. L. (2015). Penalized regression procedures for variable selection in the potential outcomes framework. Statistics in Medicine, 34, 1645-1658.
[11] Graham, D. J., McCoy, E. J., & Stephens D. A. (2016). Approximate Bayesian Inference for doubly robust estimation. Bayesian Analysis, 11, 47-69. · Zbl 1357.62186
[12] Hahn, P. R., Carvalho, C. M., He, J., & Puelz, D. (2016). Regularization and confounding in linear regression for treatment effect estimation. Bayesian Analysis. https://doi.org/10.1214/16‐BA1044 · Zbl 06873722 · doi:10.1214/16‐BA1044
[13] Hahn, P. R., Murray, J. S., & Carvalho, C. M. (2017). Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects. Retrieved from https://arxiv.org/abs/1706.09523
[14] Hill, J. L. (2011). Bayesian nonparametric modeling for causal inference. Journal of Computational and Graphical Statistics, 20, 217-240.
[15] Kaplan, D., & Chen, J. (2012). A two‐step Bayesian approach for propensity score analysis: Simulations and case study. Pyschometrika, 77, 581-609. · Zbl 1272.62124
[16] Kass, R. E., & Steffey, D. (1989). Approximate Bayesian inference in conditionally independent hierarchical models (parametric empirical Bayes models). Journal of the American Statistical Association, 84, 717-726.
[17] Lunceford, J. K., & Davidian, M. (2004). Stratification and weighting via the propensity score in estimation of causal treatment effects: A comparative study. Statistics in Medicine, 23, 2937-2960.
[18] Mauri, L., Silbaugh, T. S., Wolf, R. E., Zelevinsky, K., Lovett, A., Zhou, Z., … Normand, S. L. T. (2008). Long‐term clinical outcomes after drug‐eluting and bare‐metal stenting in Massachusetts. Circulation, 118, 1817-1827.
[19] McCaffrey, D. F., Ridgeway, G., & Morral, A. R. (2004). Propensity score estimation with boosted regression for evaluating causal effects in observational studies. Psychological Methods, 9, 403-425.
[20] Petersen, M. L., Porter, K. E., Gruber, S., Wang, Y., & van der Laan, M. J. (2010). Diagnosing and responding to violations in the positivity assumption. Statistical Methods in Medical Research, 21, 31-54.
[21] Robins, J. M., Rotnitzky, A., & Zhao, L. P. (1994). Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association, 89, 846-866. · Zbl 0815.62043
[22] Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika, 70, 41-55. · Zbl 0522.62091
[23] Rubin, D. B. (1985). The use of propensity scores in applied Bayesian inference. Bayesian Statistics, 2, 463-472.
[24] Rubin, D. B. (2008). For objective causal inference, design trumps analysis. Annals of Applied Statistics, 2, 808-840. · Zbl 1149.62089
[25] Saarela, O., Belzile, L. R., & Stephens, D. A. (2016). A Bayesian view of doubly robust causal inference. Biometrika, 103, 667-681. · Zbl 1506.62253
[26] Saarela, O., Stephens, D. A., Moodie, E. E. M., & Klein, M. B. (2015). On Bayesian estimation of marginal structural models. Biometrics, 71, 279-301. · Zbl 1390.62305
[27] Schneeweiss, S., Rassen, J. A., Glynn, R. J., Avorn, J., Mogun, H., & Brookhart, A. M. (2009). High‐dimensional propensity in studies of treatment effects using health care claims data. Epidemiology, 103, 667-681.
[28] Stan Development Team. (2016). Stan modeling language users guide and reference manual, version 2.14.0. Retrieved from http://mc‐stan.org
[29] U.S. Food and Drug Administration. (2013). Best practices for conducting and reporting pharmacoepidemiologic safety studies using electronic health records. Retrieved from http://www.fda.gov/downloads/drugs/guidances/ucm243537.pdf
[30] van der Laan, M. J., & Rose, S. (2011). Targeted learning: Causal inference for observational and experimental data. Berlin, DE: Springer.
[31] Venkitachalam, L., Lei, Y., Magnuson, E., Chan, P. S., Stolker, J. M., Kennedy, K. F., … Cohen, D. J. (2011). Survival benefit with drug‐eluting stents in observational studies: fact or artifact?Circulation, 4, 587-594.
[32] Wang, C., Parmigiani, G., & Dominici, F. (2012). Bayesian effect estimation accounting for adjustment uncertainty. Biometrics, 68, 661-686. · Zbl 1274.62895
[33] Zigler, C. M. (2016). The central role of Bayes theorem for joint estimation of causal effects and propensity scores. American Statistician, 70, 47-54. · Zbl 07665851
[34] Zigler, C. M., & Dominici, F. (2014). Uncertainty in propensity score estimation: Bayesian methods for variable selection and model‐averaged causal effects. Journal of the American Statistical Association, 109, 95-107.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.