×

Invidious comparisons: ranking and selection as compound decisions. (English) Zbl 1541.62360

Summary: There is an innate human tendency, one might call it the “league table mentality,” to construct rankings. Schools, hospitals, sports teams, movies, and myriad other objects are ranked even though their inherent multi-dimensionality would suggest that – at best – only partial orderings were possible. We consider a large class of elementary ranking problems in which we observe noisy, scalar measurements of merit for \(n\) objects of potentially heterogeneous precision and are asked to select a group of the objects that are “most meritorious.” The problem is naturally formulated in the compound decision framework of H. Robbins’s [in: Proc. 3rd Berkeley Sympos. Math. Statist. Probability 1, 157–163 (1956; Zbl 0074.35302)] empirical Bayes theory, but it also exhibits close connections to the recent literature on multiple testing. The nonparametric maximum likelihood estimator for mixture models [J. Kiefer and J. Wolfowitz, Ann. Math. Stat. 27, 887–906 (1956; Zbl 0073.14701)] is employed to construct optimal ranking and selection rules. Performance of the rules is evaluated in simulations and an application to ranking U.S. kidney dialysis centers.
{© 2023 The Authors. Econometrica published by John Wiley & Sons Ltd on behalf of The Econometric Society}

MSC:

62P20 Applications of statistics to economics
62C12 Empirical decision procedures; empirical Bayes procedures
62F07 Statistical ranking and selection procedures
62J15 Paired and multiple comparisons; multiple testing

Software:

REBayes

References:

[1] Andrews, Isaiah, ToruKitagawa, and AdamMcCloskey (2020): “Inference on Winners,” Available at https://www.cemmap.ac.uk/publication/inference-on-winners-4.
[2] Armstrong, Timothy B., MichalKolesár, and MikkelPlagborg‐Møller (2020): “Robust Empirical Bayes Confidence Intervals,” Available at https://arxiv.org/abs/2004.03448.
[3] Athey, Susan, GuidoImbens, JonasMetzger, and EvanMunro (2019): “Using Wasserstein Generative Adversarial Networks for the Design of Monte Carlo Simulations,” Available at http://arxiv.org/pdf/1909.02210.
[4] Bahadur, Raghu Raj (1950): “On a Problem in the Theory of k Populations,” Annals of Mathematical Statistics, 21, 362-375. · Zbl 0038.29702
[5] Bahadur, Raghu Raj, and HerbertRobbins (1950): “The Problem of the Greater Mean,” The Annals of Mathematical Statistics, 21, 469-487. · Zbl 0040.07501
[6] Basu, Pallavi, T. TonyCai, KiranmoyDas, and WenguangSun (2018): “Weighted False Discovery Rate Control in Large‐Scale Multiple Testing,” Journal of the American Statistical Association, 113, 1172-1183. · Zbl 1402.62050
[7] Bechhofer, Robert E. (1954): “A Single‐Sample Multiple Decision Procedure for Ranking Means of Normal Populations With Known Variances,” The Annals of Mathematical Statistics, 25, 16-39. · Zbl 0055.13003
[8] Bechhofer, Robert E., JackKiefer, and MiltonSobel (1968): Sequential Identification and Ranking Procedures. University of Chicago Press. · Zbl 0208.44601
[9] Benjamini, Yoav, and YosefHochberg (1995): “Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing,” Journal of Royal Statistical Society, Series B, 57, 289-300. · Zbl 0809.62014
[10] Berger, James O., and JohnDeely (1988): “A Bayesian Approach to Ranking and Selection of Related Means With Alternatives to Analysis‐of‐Variance Methodology,” Journal of the American Statistical Association, 83, 364-373. · Zbl 0672.62041
[11] Boyd, Stephen, CorinnaCortes, MehryarMohri, and AnaRadovanovic (2012): “Accuracy at the Top,” in Advances in Neural Information Processing Systems 25, ed. by FernandoPereira (ed.), Christopher J. C.Burges (ed.), LéonBottou (ed.), and Kilian Q.Weinberger (ed.). Curran Associates, Inc., 953-961.
[12] Cao, Hongyuan, WenguangSun, and Michael R.Kosorok (2013): “The Optimal Power Puzzle: Scrutiny of the Monotone Likelihood Ratio Assumption in Multiple Testing,” Biometrika, 100, 495-502. · Zbl 1284.62470
[13] Chetty, Raj, and NathanielHendren (2018): “The Impacts of Neighborhoods on Intergenerational Mobility II: County‐Level Estimates,” The Quarterly Journal of Economics, 133, 1163-1228. · Zbl 1405.91171
[14] Chetty, Raj, John N.Friedman, and Jonah E.Rockoff (2014a): “Measuring the Impacts of Teachers i: Evaluating Bias in Teacher Value‐Added Estimates,” American Economic Review, 104, 2593-2632.
[15] Chetty, Raj, John N.Friedman, and Jonah E.Rockoff (2014b): “Measuring the Impacts of Teachers II: Teacher Value‐Added and Student Outcomes in Adulthood,” American Economic Review, 104, 2633-2679.
[16] Efron, Bradley (2011): “Tweedie”s Formula and Selection Bias,” Journal of the American Statistical Association, 106, 1602-1614. · Zbl 1234.62007
[17] Efron, Bradley (2016): “Empirical Bayes Deconvolution Estimates,” Biometrika, 103, 1-20. · Zbl 1452.62220
[18] Efron, Bradley (2019): “Bayes, Oracle Bayes and Empirical Bayes,” Statistical Science, 34, 177-201. · Zbl 1420.62023
[19] Efron, Bradley, and CarlMorris (1973): “Stein”s Estimation Rule and Its Competitiors—an Empirical Bayes Approach,” Journal of the American Statistical Association, 68, 117-130. · Zbl 0275.62005
[20] Efron, Bradley, RobertTibshirani, JohnStorey, and VirginiaTusher (2001): “Empirical Bayes Analysis of Microarray Experiments,” J. American Statistical Association, 96, 1151-1160. · Zbl 1073.62511
[21] Gelman, Andrew, and Phillip N.Price (1999): “All Maps of Parameter Estimates Are Misleading,” Statistics in Medicine, 18, 3221-3234.
[22] Genovese, Christopher, and LarryWasserman (2002): “Operating Characteristic and Extensions of the False Discovery Rate Procedure,” Journal of the Royal Statistical Society, Series B, 64, 499-517. · Zbl 1090.62072
[23] Gilraine, Michael, JiayingGu, and RobertMcMillan (2020): “A New Method for Estimating Teacher Value‐Added,” NBER Working Paper Series Number 27094.
[24] Goel, Prem K., and HermanRubin (1977): “On Selecting a Subset Containing the Best Population—a Bayesian Approach,” The Annals of Statistics, 5, 969-983. · Zbl 0368.62013
[25] Goldstein, Harvey, and David J.Spiegelhalter (1996): “League Tables and Their Limitations: Statistical Issues in Comparisons of Institutional Performance, (With Discussion),” Journal of the Royal Statistical Society: Series A, 159, 385-443.
[26] Gu, Jiaying, and RogerKoenker (2017): “Empirical Bayesball Remixed: Empirical Bayes Methods for Longitudinal Data,” Journal of Applied Econometrics, 32, 575-599.
[27] Gu, Jiaying, and RogerKoenker (2016a): “Unobserved Heterogeneity in Income Dynamics: An Empirical Bayes Perspective,” Journal of Economic and Business Statistics (forthcoming).
[28] Gu, Jiaying, and RogerKoenker (2016b): “On a Problem of Robbins,” International Statistical Review, 84, 224-244. · Zbl 07763492
[29] Gu, Jiaying, and RogerKoenker (2023): “Supplement to ‘Invidious Comparisons: Ranking and Selection as Compound Decisions’,” Econometrica Supplemental Material, 91, https://doi.org/10.3982/ECTA19304. · Zbl 1541.62360 · doi:10.3982/ECTA19304
[30] Guo, Xinzhou, and XumingHe (2020): “Inference on Selected Subgroups in Clinical Trials,” Journal of the American Statistical Association, 1-19.
[31] Gupta, Shanti (1956): “On a Decision Rule for a Problem in Ranking Means,” Mimeograph Series No. 150, Institute of Statistics, University of North Carolina, Chapel Hill. · Zbl 0073.35901
[32] Gupta, Shanti S., and SubramanianPanchapakesan (1979): Multiple Decision Procedures: Theory and Methodology of Selecting and Ranking Populations. Wiley. · Zbl 0516.62030
[33] Hanushek, Eric A. (2011): “The Economic Value of Higher Teacher Quality,” Economics of Education review, 30, 466-479.
[34] Heckman, James, and BartSinger (1984): “A Method for Minimizing the Impact of Distributional Assumptions in Econometric Models for Duration Data,” Econometrica, 52, 63-132.
[35] Henderson, Nicholas C., and Michael A.Newton (2016): “Making the Cut: Improved Ranking and Selection for Large‐Scale Inference,” Journal of the Royal Statistical Society, Series B, 78 (4), 781-804. · Zbl 1414.62064
[36] Jiang, Wenhua (2020): “On General Maximum Likelihood Empirical Bayes Estimation of Heteroscedastic iid Normal Means,” Electronic Journal of Statistics, 14, 2272-2297. · Zbl 1442.62068
[37] Kidney Epidemiology and Cost Center (2018): “Technical Notes on the Dialysis Facility Compare Quality of Patient Care Star Rating Methodology for the October 2018 Release,” University of Michigan, School of Public Health.
[38] Kiefer, Jack, and JackWolfowitz (1956): “Consistency of the Maximum Likelihood Estimator in the Presence of Infinitely Many Incidental Parameters,” The Annals of Mathematical Statistics, 27, 887-906. · Zbl 0073.14701
[39] Kline, Patrick, and ChristopherWalters (2021): “Reasonable Doubt: Experimental Detection of Job‐Level Employment Discrimination,” Econometrica, 89, 565-592. · Zbl 1475.91143
[40] Koenker, Roger, and JiayingGu (2015-2021): “REBayes: An R Package for Empirical Bayes Methods,” Available at https://cran.r-project.org/package=REBayes.
[41] Koenker, Roger (2020): “Empirical Bayes Confidence Intervals: An R Vinaigrette,” Available at http://www.econ.uiuc.edu/ roger/research/ebayes/cieb.pdf.
[42] Koenker, Roger, and JiayingGu (2017): “REBayes: An R Package for Empirical Bayes Mixture Methods,” Journal of Statistical Software, 82, 1-26.
[43] Koenker, Roger, and JiayingGu (2019): “Comment: Minimalist G‐Modeling,” Statistical Science, 34, 209-213. · Zbl 1420.62027
[44] Koenker, Roger, and IvanMizera (2014): “Convex Optimization, Shape Constraints, Compound Decisions and Empirical Bayes Rules,” J. of Am. Stat. Assoc., 109, 674-685. · Zbl 1367.62020
[45] Laird, Nan M., and Thomas A.Louis (1989): “Empirical Bayes Ranking Methods,” Journal of Educational Statistics, 14, 29-46.
[46] Laird, Nan M., and Thomas A.Louis (1991): “Smoothing the Non‐Parametric Estimate of a Prior Distribution by Roughening,” Computational Statistics & Data Analysis, 12, 27-37. · Zbl 0825.62435
[47] Lin, Rongheng, Thomas A.Louis, Susan M.Paddock, and Susan M.Ridgeway (2009): “Ranking USRDS Provider Specific SMRs From 1998-2001,” Health Service Outcomes Research Methodology, 9, 22-38.
[48] Lin, Rongheng, Thomas A.Louis, Susan M.Paddock, and Susan M.Ridgeway (2006): “Loss Function Based Ranking in Two‐Stage, Hierarchical Models,” Bayesian Analysis, 1, 915-946. · Zbl 1331.62063
[49] Lindsay, Bruce G. (1995): “Mixture Models: Theory, Geometry and Applications,” in NSF‐CBMS Regional Conference Series in Probability and Statistics. · Zbl 1163.62326
[50] Mogstad, Magne, JosephRomano, AzeemShaikh, and DanielWilhelm (2020): “Inferences for Ranks With Applications to Mobility Across Neighborhoods and Academic Achievement Across Countries,” Preprint.
[51] Polyanskiy, Yury, and YihongWu (2020): “Self‐Regularizing Property of Nonparametric Maximum Likelihood Estimator in Mixture Models,” Preprint.
[52] Portnoy, Stephen (1982): “Maximizing the Probability of Correctly Ordering Random Variables Using Linear Predictors,” Journal of Multivariate Analysis, 12, 256-269. · Zbl 0497.62063
[53] Robbins, Herbert (1950): “A Generalization of the Method of Maximum Likelihood: Estimating a Mixing Distribution (Abstract),” The Annals of Mathematical Statistics, 21, 314-315.
[54] Robbins, Herbert (1951): “Asymptotically Subminimax Solutions of Compound Statistical Decision Problems,” in Proceedings of the Berkeley Symposium on Mathematical Statistics and Probability, Vol. I. University of California Press: Berkeley, 131-149. · Zbl 0044.14803
[55] Robbins, Herbert (1956): “An Empirical Bayes Approach to Statistics,” in Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Vol. I. University of California Press: Berkeley, 157-163. · Zbl 0074.35302
[56] Saha, Sujayam, and AdityanandGuntuboyina (2020): “On the Nonparametric Maximum Likelihood Estimator for Gaussian Location Mixture Densities With Application to Gaussian Denoising,” Annals of Statistics, 48, 738-762. · Zbl 1454.62120
[57] Storey, John D. (2002): “A Direct Approach to False Discovery Rates,” Journal of the Royal Statistical B, 64, 479-498. · Zbl 1090.62073
[58] Sun, Wenguang, and Alexander C.McLain (2012): “Multiple Testing of Composite Null Hypotheses in Heteroscedastic Models,” Journal of the American Statistical Association, 107, 673-687. · Zbl 1261.62016
[59] Sun, Wenguang, and T. TonyCai (2007): “Oracle and Adaptive Compound Decision Rules for False Discovery Rate Control,” Journal American Statistical Association, 102, 901-912. · Zbl 1469.62318
[60] University of Michigan Kidney Epidemiology and Cost Center (2009-2019)): “Dialysis Facility Reports,” Available at https://data.cms.gov/dialysis-facility-reports.
[61] van deGeer, Sara (1993): “Hellinger‐Consistency of Certain Nonparametric Maximum Likelihood Estimators,” The Annals of Statistics, 14-44. · Zbl 0779.62033
[62] Wald, Abraham (1950): Statistical Decision Functions. Wiley. · Zbl 0040.36402
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.