×

Reconnecting \(p\)-value and posterior probability under one- and two-sided tests. (English) Zbl 07632864


MSC:

62-XX Statistics

References:

[1] Bayarri, M. J.; Berger, J. O., “The Interplay of Bayesian and Frequentist Analysis, Statistical Science, 19, 58-80 (2004) · Zbl 1062.62001 · doi:10.1214/088342304000000116
[2] Benjamin, D. J.; Berger, J. O., “Three Recommendations for Improving the Use of p-Values, The American Statistician, 73, 186-191 (2019) · Zbl 07588201 · doi:10.1080/00031305.2018.1543135
[3] Berger, J. O., “Could Fisher, Jeffreys and Neyman Have Agreed on Testing?” (with discussion, Statistical Science, 18, 1-32 (2003) · Zbl 1048.62006 · doi:10.1214/ss/1056397485
[4] Berger, J. O.; Delampady, M., “Testing Precise Hypotheses, Statistical Science, 2, 317-335 (1987) · Zbl 0955.62545 · doi:10.1214/ss/1177013238
[5] Berger, J. O.; Sellke, T., “Testing a Point Null Hypothesis: The Irreconcilability of p Values and Evidence, Journal of the American Statistical Association, 82, 112-122 (1987) · Zbl 0612.62022 · doi:10.2307/2289131
[6] Betensky, R. A., “The p-Value Requires Context, Not a Threshold, The American Statistician, 73, 115-117 (2019) · Zbl 07588192 · doi:10.1080/00031305.2018.1529624
[7] Briggs, W. M., “The Substitute for p-Values, Journal of the American Statistical Association, 112, 897-898 (2017) · doi:10.1080/01621459.2017.1311264
[8] Billheimer, D., “Predictive Inference and Scientific Reproducibility, The American Statistician, 73, 291-295 (2019) · Zbl 07588211 · doi:10.1080/00031305.2018.1518270
[9] Casella, G.; Berger, R. L., “Reconciling Bayesian and Frequentist Evidence in the One-Sided Testing Problem” (with discussion), Journal of the American Statistical Association, 82, 106-111 (1987) · Zbl 0612.62021 · doi:10.1080/01621459.1987.10478396
[10] Colquhoun, D., “An Investigation of the False Discovery Rate and the Misinterpretation of p-Values, Royal Society of Open Science, 1, 140-216 (2014)
[11] Concato, J.; Hartigan, J. A., “P Values: From Suggestion to Superstition, Journal of Investigative Medicine, 64, 1166-1171 (2016) · doi:10.1136/jim-2016-000206
[12] Cumming, G., “The New Statistics: Why and How, Psychological Science, 25, 7-29 (2014) · doi:10.1177/0956797613504966
[13] Donahue, R. M. J., “A Note on Information Seldom Reported via the P Value, The American Statistician, 53, 303-306 (1999) · doi:10.2307/2686048
[14] Dudley, R. M.; Haughton, D., “Asymptotic Normality With Small Relative Errors of Posterior Probabilities of Half-Spaces, The Annals of Statistics, 30, 1311-1344 (2002) · Zbl 1014.62031 · doi:10.1214/aos/1035844978
[15] Fidler, F.; Thomason, N.; Cumming, G.; Finch, S.; Leeman, J., “Editors Can Lead Researchers to Confidence Intervals, But Can’t Make Them Think: Statistical Reform Lessons From Medicine, Psychological Science, 15, 119-126 (2004) · doi:10.1111/j.0963-7214.2004.01502008.x
[16] Gill, J., “Comments From the New Editor, Political Analysis, 26, 1-2 (2018) · doi:10.1017/pan.2017.41
[17] Goodman, S. N., “Toward Evidence-Based Medical Statistics. 1: The p Value Fallacy, Annals of Internal Medicine, 130, 995-1004 (1999) · doi:10.7326/0003-4819-130-12-199906150-00008
[18] Hubbard, R.; Lindsay, R. M., “Why P Values Are Not a Useful Measure of Evidence in Statistical Significance Testing, Theory & Psychology, 18, 69-88 (2008) · doi:10.1177/0959354307086923
[19] Hung, H. J.; O’Neill, R. T.; Bauer, P.; Kohne, K., “The Behavior of the p-Value When the Alternative Hypothesis Is True, Biometrics, 53, 11-22 (1997) · Zbl 0876.62015 · doi:10.2307/2533093
[20] Ioannidis, J. P., “Why Most Published Research Findings Are False, PLoS Medicine, 2, 124 (2005) · doi:10.1371/journal.pmed.0020124
[21] Jager, L. R.; Leek, J. T., “An Estimate of the Science-Wise False Discovery Rate and Application to the Top Medical Literature, Biostatistics, 15, 1-12 (2014) · doi:10.1093/biostatistics/kxt007
[22] Johnson, V. E., “Revised Standards for Statistical Evidence, Proceedings of the National Academy of Sciences of the United States of America, 110, 19313-19317 (2013) · Zbl 1357.62025 · doi:10.1073/pnas.1313476110
[23] Leek, J.; McShane, B. B.; Gelman, A.; Colquhoun, D.; Nuijten, M. B.; Goodman, S. N., “Five Ways to Fix Statistics, Nature, 551, 557-559 (2017) · doi:10.1038/d41586-017-07522-z
[24] Lehmann, E. L.; Romano, J. P., Testing Statistical Hypotheses (2005), New York: Springer, New York · Zbl 1076.62018
[25] Lindley, D. V., “A Statistical Paradox, Biometrika, 44, 187-192 (1957) · Zbl 0080.12801 · doi:10.1093/biomet/44.1-2.187
[26] Manski, C. F., “Treatment Choice With Trial Data: Statistical Decision Theory Should Supplant Hypothesis Testing, The American Statistician, 73, 296-304 (2019) · Zbl 07588212 · doi:10.1080/00031305.2018.1513377
[27] Matthews, R. A. J., “Moving Towards the Post p < 0.05 Era via the Analysis of Credibility, The American Statistician, 73, 202-212 (2019) · Zbl 07588203
[28] McShane, B. B.; Gal, D.; Gelman, A.; Robert, C.; Tackett, J. L., “Abandon Statistical Significance, The American Statistician, 73, 235-245 (2019) · Zbl 07588206 · doi:10.1080/00031305.2018.1527253
[29] Murtaugh, P. A., “In Defense of P Values, Ecology, 95, 611-617 (2014) · doi:10.1890/13-0590.1
[30] Nuzzo, R., “Statistical Errors: P Values, the ‘Gold Standard’ of Statistical Validity, Are Not as Reliable as Many Scientists Assume, Nature, 506, 150-152 (2014)
[31] Pratt, J. W., “Bayesian Interpretation of Standard Inference Statements” (with discussion), Journal of the Royal Statistical Society, Series B, 27, 169-203 (1965) · Zbl 0142.15203 · doi:10.1111/j.2517-6161.1965.tb01486.x
[32] Ranstam, J., “Why the p-Value Culture Is Bad and Confidence Intervals a Better Alternative, Osteoarthritis Cartilage, 20, 805-808 (2012) · doi:10.1016/j.joca.2012.04.001
[33] Rosenthal, R.; Rubin, D. B., “Ensemble-Adjusted p Values, Psychological Bulletin, 94, 540-541 (1983) · doi:10.1037/0033-2909.94.3.540
[34] Royall, R. M., “The Effect of Sample Size on the Meaning of Significance Tests, The American Statistician, 40, 313-315 (1986) · Zbl 0613.62026 · doi:10.2307/2684616
[35] Rubin, D. B., “Bayesianly Justifiable and Relevant Frequency Calculations for the Applies Statistician, The Annals of Statistics, 12, 1151-1172 (1984) · Zbl 0555.62010 · doi:10.1214/aos/1176346785
[36] Rubin, D. B., “More Powerful Randomization-Based p-Values in Double-Blind Trials With Non-Compliance, Statistics in Medicine, 17, 371-385 (1998)
[37] Sackrowitz, H.; Samuel-Cahn, E., “P Values as Random Variable-Expected P Values, The American Statistician, 53, 326-331 (1999) · doi:10.2307/2686051
[38] Savalei, V.; Dunn, E., “Is the Call to Abandon p-Values the Red Herring of the Replicability Crisis?, Frontiers in Psychology, 6, 245 (2015) · doi:10.3389/fpsyg.2015.00245
[39] Schervish, M. J., “P Values: What They Are and What They Are Not, The American Statistician, 50, 203-206 (1996) · doi:10.2307/2684655
[40] Sellke, T.; Bayarri, M. J.; Berger, J. O., “Calibration of p-Values for Testing Precise Null Hypotheses, The American Statistician, 55, 62-71 (2001) · Zbl 1182.62053 · doi:10.1198/000313001300339950
[41] Simmons, J. P.; Nelson, L. D.; Simonsohn, U., “False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant, Psychological Science, 22, 1359-1366 (2011) · doi:10.1177/0956797611417632
[42] Trafimow, D.; Amrhein, V.; Areshenkoff, C. N.; Barrera-Causil, C. J.; Beh, E. J.; Bilgiç, Y. K.; Bono, R.; Bradley, M. T.; Briggs, W. M.; Cepeda-Freyre, H. A.; Chaigneau, S. E., “Manipulating the Alpha Level Cannot Cure Significance Testing, Frontiers in Psychology, 9, 699 (2018) · doi:10.3389/fpsyg.2018.00699
[43] Trafimow, D.; Marks, M., “Editorial, Basic and Applied Social Psychology, 37, 1-2 (2015) · doi:10.1080/01973533.2015.1012991
[44] Wagenmakers, E. J., “A Practical Solution to the Pervasive Problems of p Values, Psychonomic Bulletin & Review, 14, 779-804 (2007) · doi:10.3758/BF03194105
[45] Wasserstein, R. L.; Lazar, N. A., “The ASA’s Statement on p-Values: Context, Process, and Purpose, The American Statistician, 70, 129-133 (2016) · Zbl 07665862 · doi:10.1080/00031305.2016.1154108
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.