Skip to main content

Showing 1–50 of 123 results for author: Chernozhukov, V

  1. arXiv:2407.06387  [pdf, other

    econ.EM stat.ME

    Conditional Rank-Rank Regression

    Authors: Victor Chernozhukov, Iván Fernández-Val, Jonas Meier, Aico van Vuuren, Francis Vella

    Abstract: Rank-rank regressions are widely used in economic research to evaluate phenomena such as intergenerational income persistence or mobility. However, when covariates are incorporated to capture between-group persistence, the resulting coefficients can be difficult to interpret as such. We propose the conditional rank-rank regression, which uses conditional ranks instead of unconditional ranks, to me… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 40 pages, 3 figures, 8 tables

    MSC Class: 62P20

  2. arXiv:2403.05850  [pdf, other

    econ.EM stat.ME

    Estimating Causal Effects of Discrete and Continuous Treatments with Binary Instruments

    Authors: Victor Chernozhukov, Iván Fernández-Val, Sukjin Han, Kaspar Wüthrich

    Abstract: We propose an instrumental variable framework for identifying and estimating average and quantile effects of discrete and continuous treatments with binary instruments. The basis of our approach is a local copula representation of the joint distribution of the potential outcomes and unobservables determining treatment assignment. This representation allows us to introduce an identifying assumption… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  3. arXiv:2403.02467  [pdf

    econ.EM cs.LG stat.ME stat.ML

    Applied Causal Inference Powered by ML and AI

    Authors: Victor Chernozhukov, Christian Hansen, Nathan Kallus, Martin Spindler, Vasilis Syrgkanis

    Abstract: An introduction to the emerging fusion of machine learning and causal inference. The book presents ideas from classical structural equation models (SEMs) and their modern AI equivalent, directed acyclical graphs (DAGs) and structural causal models (SCMs), and covers Double/Debiased Machine Learning methods to do inference in such models using modern predictive tools.

    Submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2402.04674  [pdf, other

    econ.EM stat.ML

    Hyperparameter Tuning for Causal Inference with Double Machine Learning: A Simulation Study

    Authors: Philipp Bach, Oliver Schacht, Victor Chernozhukov, Sven Klaassen, Martin Spindler

    Abstract: Proper hyperparameter tuning is essential for achieving optimal performance of modern machine learning (ML) methods in predictive tasks. While there is an extensive literature on tuning ML learners for prediction, there is only little guidance available on tuning ML learners for causal machine learning and how to select among different ML learners. In this paper, we empirically assess the relation… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  5. arXiv:2402.01785  [pdf, other

    cs.LG cs.AI econ.EM stat.ME stat.ML

    DoubleMLDeep: Estimation of Causal Effects with Multimodal Data

    Authors: Sven Klaassen, Jan Teichert-Kluge, Philipp Bach, Victor Chernozhukov, Martin Spindler, Suhas Vijaykumar

    Abstract: This paper explores the use of unstructured, multimodal data, namely text and images, in causal inference and treatment effect estimation. We propose a neural network architecture that is adapted to the double machine learning (DML) framework, specifically the partially linear model. An additional contribution of our paper is a new method to generate a semi-synthetic dataset which can be used to e… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    MSC Class: 62; 91 ACM Class: I.2.0

  6. arXiv:2402.00584  [pdf, ps, other

    econ.EM

    Arellano-Bond LASSO Estimator for Dynamic Linear Panel Models

    Authors: Victor Chernozhukov, Iván Fernández-Val, Chen Huang, Weining Wang

    Abstract: The Arellano-Bond estimator is a fundamental method for dynamic panel data models, widely used in practice. However, the estimator is severely biased when the data's time series dimension $T$ is long due to the large degree of overidentification. We show that weak dependence along the panel's time series dimension naturally implies approximate sparsity of the most informative moment conditions, mo… ▽ More

    Submitted 16 October, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  7. arXiv:2307.04527  [pdf, other

    stat.ME

    Automatic Debiased Machine Learning for Covariate Shifts

    Authors: Victor Chernozhukov, Michael Newey, Whitney K Newey, Rahul Singh, Vasilis Srygkanis

    Abstract: In this paper we address the problem of bias in machine learning of parameters following covariate shifts. Covariate shift occurs when the distribution of input features change between the training and deployment stages. Regularization and model selection associated with machine learning biases many parameter estimates. In this paper, we propose an automatic debiased machine learning approach to c… ▽ More

    Submitted 19 April, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

  8. arXiv:2305.00044  [pdf, other

    econ.GN cs.LG

    Hedonic Prices and Quality Adjusted Price Indices Powered by AI

    Authors: Patrick Bajari, Zhihao Cen, Victor Chernozhukov, Manoj Manukonda, Suhas Vijaykumar, Jin Wang, Ramon Huerta, Junbo Li, Ling Leng, George Monokroussos, Shan Wan

    Abstract: Accurate, real-time measurements of price index changes using electronic records are essential for tracking inflation and productivity in today's economic environment. We develop empirical hedonic models that can process large amounts of unstructured product data (text, images, prices, quantities) and output accurate hedonic price estimates and derived indices. To accomplish this, we generate abst… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: Revised CEMMAP Working Paper (CWP08/23)

  9. arXiv:2301.07782  [pdf, other

    econ.EM stat.CO stat.ME

    An MCMC Approach to Classical Estimation

    Authors: Victor Chernozhukov, Han Hong

    Abstract: This paper studies computationally and theoretically attractive estimators called the Laplace type estimators (LTE), which include means and quantiles of Quasi-posterior distributions defined as transformations of general (non-likelihood-based) statistical criterion functions, such as those in GMM, nonlinear IV, empirical likelihood, and minimum distance methods. The approach generates an alternat… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: This is an archival version of the article "An MCMC approach to classical estimation", Journal of econometrics 115 (2), August 2003, pages 293-346. This version does not reflect the corrections made to the article during the publication process; it contains additional two remarks added, as indicated in the text. 62 pages, 7 figures

    Journal ref: Journal of econometrics 115 (2), August 2003, pages 293-346

  10. arXiv:2207.13081  [pdf, other

    cs.LG stat.ML

    Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

    Authors: Masatoshi Uehara, Haruka Kiyohara, Andrew Bennett, Victor Chernozhukov, Nan Jiang, Nathan Kallus, Chengchun Shi, Wen Sun

    Abstract: We study off-policy evaluation (OPE) for partially observable MDPs (POMDPs) with general function approximation. Existing methods such as sequential importance sampling estimators and fitted-Q evaluation suffer from the curse of horizon in POMDPs. To circumvent this problem, we develop a novel model-free OPE method by introducing future-dependent value functions that take future proxies as inputs.… ▽ More

    Submitted 14 November, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: This paper was accepted in NeurIPS 2023

  11. arXiv:2205.09691  [pdf, other

    math.ST econ.EM

    High-dimensional Data Bootstrap

    Authors: Victor Chernozhukov, Denis Chetverikov, Kengo Kato, Yuta Koike

    Abstract: This article reviews recent progress in high-dimensional bootstrap. We first review high-dimensional central limit theorems for distributions of sample mean vectors over the rectangles, bootstrap consistency results in high dimensions, and key techniques used to establish those results. We then review selected applications of high-dimensional bootstrap: construction of simultaneous confidence sets… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 27 pages; review article

  12. arXiv:2203.13887  [pdf, other

    econ.EM cs.LG math.ST stat.ML

    Automatic Debiased Machine Learning for Dynamic Treatment Effects and General Nested Functionals

    Authors: Victor Chernozhukov, Whitney Newey, Rahul Singh, Vasilis Syrgkanis

    Abstract: We extend the idea of automated debiased machine learning to the dynamic treatment regime and more generally to nested functionals. We show that the multiply robust formula for the dynamic treatment regime with discrete treatments can be re-stated in terms of a recursive Riesz representer characterization of nested mean regressions. We then apply a recursive Riesz representer estimation learning a… ▽ More

    Submitted 20 June, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

  13. arXiv:2112.13398  [pdf, other

    econ.EM cs.LG stat.ME stat.ML

    Long Story Short: Omitted Variable Bias in Causal Machine Learning

    Authors: Victor Chernozhukov, Carlos Cinelli, Whitney Newey, Amit Sharma, Vasilis Syrgkanis

    Abstract: We develop a general theory of omitted variable bias for a wide range of common causal parameters, including (but not limited to) averages of potential outcomes, average treatment effects, average causal derivatives, and policy effects from covariate shifts. Our theory applies to nonparametric models, while naturally allowing for (semi-)parametric restrictions (such as partial linearity) when such… ▽ More

    Submitted 26 May, 2024; v1 submitted 26 December, 2021; originally announced December 2021.

    Comments: This is an extended version of the paper was prepared for the NeurIPS-2021 Workshop "Causal Inference & Machine Learning: Why now?"; 55 pages; 10 figures

    MSC Class: 62G

  14. arXiv:2110.06136  [pdf, other

    stat.AP econ.GN

    A Response to Philippe Lemoine's Critique on our Paper "Causal Impact of Masks, Policies, Behavior on Early Covid-19 Pandemic in the U.S."

    Authors: Victor Chernozhukov, Hiroyuki Kasahara, Paul Schrimpf

    Abstract: Recently, Phillippe Lemoine posted a critique of our paper "Causal Impact of Masks, Policies, Behavior on Early Covid-19 Pandemic in the U.S." [arXiv:2005.14168] at his post titled "Lockdowns, econometrics and the art of putting lipstick on a pig." Although Lemoine's critique appears ideologically driven and overly emotional, some of his points are worth addressing. In particular, the sensitivity… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

  15. arXiv:2110.03031  [pdf, other

    cs.LG econ.EM stat.ML

    RieszNet and ForestRiesz: Automatic Debiased Machine Learning with Neural Nets and Random Forests

    Authors: Victor Chernozhukov, Whitney K. Newey, Victor Quintas-Martinez, Vasilis Syrgkanis

    Abstract: Many causal and policy effects of interest are defined by linear functionals of high-dimensional or non-parametric regression functions. $\sqrt{n}$-consistent and asymptotically normal estimation of the object of interest requires debiasing to reduce the effects of regularization and/or model selection on the object of interest. Debiasing is typically achieved by adding a correction term to the pl… ▽ More

    Submitted 15 June, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted for a long presentation at the ICML. Code available at https://github.com/victor5as/RieszLearning

  16. arXiv:2107.02602  [pdf, ps, other

    math.ST econ.EM stat.ME

    Inference for Low-Rank Models

    Authors: Victor Chernozhukov, Christian Hansen, Yuan Liao, Yinchu Zhu

    Abstract: This paper studies inference in linear models with a high-dimensional parameter matrix that can be well-approximated by a ``spiked low-rank matrix.'' A spiked low-rank matrix has rank that grows slowly compared to its dimensions and nonzero singular values that diverge to infinity. We show that this framework covers a broad class of models of latent-variables which can accommodate matrix completio… ▽ More

    Submitted 2 January, 2023; v1 submitted 6 July, 2021; originally announced July 2021.

  17. arXiv:2106.09762  [pdf, other

    stat.ME cs.AI cs.LG stat.ML

    Causal Bias Quantification for Continuous Treatments

    Authors: Gianluca Detommaso, Michael Brückner, Philip Schulz, Victor Chernozhukov

    Abstract: We extend the definition of the marginal causal effect to the continuous treatment setting and develop a novel characterization of causal bias in the framework of structural causal models. We prove that our derived bias expression is zero if, and only if, the causal effect is identifiable via covariate adjustment. We show that under some restrictions on the structural equations, the causal bias ca… ▽ More

    Submitted 30 January, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

  18. arXiv:2105.15197  [pdf, ps, other

    stat.ML cs.LG econ.EM math.ST

    A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees

    Authors: Victor Chernozhukov, Whitney K. Newey, Rahul Singh

    Abstract: Debiased machine learning is a meta algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals, i.e. scalar summaries, of machine learning algorithms. For example, an analyst may desire the confidence interval for a treatment effect estimated with a neural network. We provide a nonasymptotic debiased machine learning theorem that encompasses any globa… ▽ More

    Submitted 21 October, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Biometrika 2022

  19. arXiv:2105.07424  [pdf, other

    econ.EM stat.ME

    Uniform Inference on High-dimensional Spatial Panel Networks

    Authors: Victor Chernozhukov, Chen Huang, Weining Wang

    Abstract: We propose employing a debiased-regularized, high-dimensional generalized method of moments (GMM) framework to perform inference on large-scale spatial panel networks. In particular, network structure with a flexible sparse deviation, which can be regarded either as latent or as misspecified from a predetermined adjacency matrix, is estimated using debiased machine learning approach. The theoretic… ▽ More

    Submitted 7 September, 2023; v1 submitted 16 May, 2021; originally announced May 2021.

  20. arXiv:2105.04646  [pdf, other

    stat.ML cs.AI cs.LG

    Deeply-Debiased Off-Policy Interval Estimation

    Authors: Chengchun Shi, Runzhe Wan, Victor Chernozhukov, Rui Song

    Abstract: Off-policy evaluation learns a target policy's value with a historical dataset generated by a different behavior policy. In addition to a point estimate, many applications would benefit significantly from having a confidence interval (CI) that quantifies the uncertainty of the point estimate. In this paper, we propose a novel deeply-debiasing procedure to construct an efficient, robust, and flexib… ▽ More

    Submitted 7 June, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

  21. arXiv:2104.14737  [pdf, other

    math.ST econ.EM

    Automatic Debiased Machine Learning via Riesz Regression

    Authors: Victor Chernozhukov, Whitney K. Newey, Victor Quintas-Martinez, Vasilis Syrgkanis

    Abstract: A variety of interesting parameters may depend on high dimensional regressions. Machine learning can be used to estimate such parameters. However estimators based on machine learners can be severely biased by regularization and/or model selection. Debiased machine learning uses Neyman orthogonal estimating equations to reduce such biases. Debiased machine learning generally requires estimation of… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:1809.05224

    MSC Class: 62D20; 62P20 (Primary); 62G20; 62J02 (Secondary)

  22. arXiv:2104.03220  [pdf, other

    stat.ML cs.LG econ.EM

    DoubleML -- An Object-Oriented Implementation of Double Machine Learning in Python

    Authors: Philipp Bach, Victor Chernozhukov, Malte S. Kurz, Martin Spindler

    Abstract: DoubleML is an open-source Python library implementing the double machine learning framework of Chernozhukov et al. (2018) for a variety of causal models. It contains functionalities for valid statistical inference on causal parameters when the estimation of nuisance parameters is based on machine learning methods. The object-oriented implementation of DoubleML provides a high flexibility in terms… ▽ More

    Submitted 20 December, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: 6 pages, 2 figures

    MSC Class: 62-04

    Journal ref: Journal of Machine Learning Research 23 (53), 2022, 1-6

  23. arXiv:2103.09603  [pdf, other

    stat.ML cs.LG econ.EM

    DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R

    Authors: Philipp Bach, Victor Chernozhukov, Malte S. Kurz, Martin Spindler, Sven Klaassen

    Abstract: The R package DoubleML implements the double/debiased machine learning framework of Chernozhukov et al. (2018). It provides functionalities to estimate parameters in causal models based on machine learning methods. The double machine learning framework consist of three key ingredients: Neyman orthogonality, high-quality machine learning estimation and sample splitting. Estimation of nuisance compo… ▽ More

    Submitted 5 June, 2024; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: 56 pages, 8 Figures, 1 Table; Updated version for DoubleML 1.0.0; Updated version due to changes in R package paradox (for parameter tuning with mlr3)

    MSC Class: 62-04

    Journal ref: Journal of Statistical Software 2024

  24. Vector quantile regression and optimal transport, from theory to numerics

    Authors: Guillaume Carlier, Victor Chernozhukov, Gwendoline De Bie, Alfred Galichon

    Abstract: In this paper, we first revisit the Koenker and Bassett variational approach to (univariate) quantile regression, emphasizing its link with latent factor representations and correlation maximization problems. We then review the multivariate extension due to Carlier et al. (2016, 2017) which relates vector quantile regression to an optimal transport problem with mean independence constraints. We in… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 35 pages, 19 figures, 4 tables. arXiv admin note: text overlap with arXiv:1610.06833

    Journal ref: Empirical Economics (2020)

  25. The Association of Opening K-12 Schools with the Spread of COVID-19 in the United States: County-Level Panel Data Analysis

    Authors: Victor Chernozhukov, Hiroyuki Kasahara, Paul Schrimpf

    Abstract: This paper empirically examines how the opening of K-12 schools and colleges is associated with the spread of COVID-19 using county-level panel data in the United States. Using data on foot traffic and K-12 school opening plans, we analyze how an increase in visits to schools and opening schools with different teaching methods (in-person, hybrid, and remote) is related to the 2-weeks forward growt… ▽ More

    Submitted 15 June, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

  26. arXiv:2101.00009  [pdf, other

    econ.EM cs.LG stat.ML

    Adversarial Estimation of Riesz Representers

    Authors: Victor Chernozhukov, Whitney Newey, Rahul Singh, Vasilis Syrgkanis

    Abstract: Many causal parameters are linear functionals of an underlying regression. The Riesz representer is a key component in the asymptotic variance of a semiparametrically estimated linear functional. We propose an adversarial framework to estimate the Riesz representer using general function spaces. We prove a nonasymptotic mean square rate in terms of an abstract quantity called the critical radius,… ▽ More

    Submitted 26 April, 2024; v1 submitted 30 December, 2020; originally announced January 2021.

  27. arXiv:2012.09513  [pdf, ps, other

    math.PR math.ST

    Nearly optimal central limit theorem and bootstrap approximations in high dimensions

    Authors: Victor Chernozhukov, Denis Chetverikov, Yuta Koike

    Abstract: In this paper, we derive new, nearly optimal bounds for the Gaussian approximation to scaled averages of $n$ independent high-dimensional centered random vectors $X_1,\dots,X_n$ over the class of rectangles in the case when the covariance matrix of the scaled average is non-degenerate. In the case of bounded $X_i$'s, the implied bound for the Kolmogorov distance between the distribution of the sca… ▽ More

    Submitted 12 May, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: 60 pages. We corrected a mistake in v1. Lemmas 6.1-6.3 are reformulated for general rectangles

    MSC Class: 60F05; 62E17

  28. arXiv:2011.01092  [pdf, other

    econ.GN physics.soc-ph stat.AP

    Insights from Optimal Pandemic Shielding in a Multi-Group SEIR Framework

    Authors: Philipp Bach, Victor Chernozhukov, Martin Spindler

    Abstract: The COVID-19 pandemic constitutes one of the largest threats in recent decades to the health and economic welfare of populations globally. In this paper, we analyze different types of policy measures designed to fight the spread of the virus and minimize economic losses. Our analysis builds on a multi-group SEIR model, which extends the multi-group SIR model introduced by Acemoglu et al.~(2020). W… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 39 pages, 23 figures

  29. arXiv:2009.00436  [pdf, ps, other

    econ.EM

    Instrumental Variable Quantile Regression

    Authors: Victor Chernozhukov, Christian Hansen, Kaspar Wuthrich

    Abstract: This chapter reviews the instrumental variable quantile regression model of Chernozhukov and Hansen (2005). We discuss the key conditions used for identification of structural quantile effects within this model which include the availability of instruments and a restriction on the ranks of structural disturbances. We outline several approaches to obtaining point estimates and performing statistica… ▽ More

    Submitted 28 August, 2020; originally announced September 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1303.7050

    Journal ref: Chapter 9 in: Chernozhukov, V., He, X., Koenker, R., Peng, L. (Eds.), Handbook of Quantile Regression. CRC Chapman-Hall, 2017

  30. Causal Impact of Masks, Policies, Behavior on Early Covid-19 Pandemic in the U.S

    Authors: Victor Chernozhukov, Hiroyuki Kasaha, Paul Schrimpf

    Abstract: This paper evaluates the dynamic impact of various policies adopted by US states on the growth rates of confirmed Covid-19 cases and deaths as well as social distancing behavior measured by Google Mobility Reports, where we take into consideration people's voluntarily behavioral response to new information of transmission risks. Our analysis finds that both policies and information on transmission… ▽ More

    Submitted 19 October, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Journal ref: Journal of Econometrics (2020)

  31. arXiv:1912.12213  [pdf, other

    math.ST econ.EM stat.ML

    Minimax Semiparametric Learning With Approximate Sparsity

    Authors: Jelena Bradic, Victor Chernozhukov, Whitney K. Newey, Yinchu Zhu

    Abstract: This paper is about the feasibility and means of root-n consistently estimating linear, mean-square continuous functionals of a high dimensional, approximately sparse regression. Such objects include a wide variety of interesting parameters such as regression coefficients, average derivatives, and the average treatment effect. We give lower bounds on the convergence rate of estimators of a regress… ▽ More

    Submitted 8 August, 2022; v1 submitted 27 December, 2019; originally announced December 2019.

  32. arXiv:1912.10529  [pdf, ps, other

    math.ST econ.EM

    Improved Central Limit Theorem and bootstrap approximations in high dimensions

    Authors: Victor Chernozhukov, Denis Chetverikov, Kengo Kato, Yuta Koike

    Abstract: This paper deals with the Gaussian and bootstrap approximations to the distribution of the max statistic in high dimensions. This statistic takes the form of the maximum over components of the sum of independent random vectors and its distribution plays a key role in many high-dimensional econometric problems. Using a novel iterative randomized Lindeberg method, the paper derives new bounds for th… ▽ More

    Submitted 29 May, 2022; v1 submitted 22 December, 2019; originally announced December 2019.

    Comments: 63 pages

  33. arXiv:1909.07889  [pdf, other

    econ.EM stat.ME

    Distributional conformal prediction

    Authors: Victor Chernozhukov, Kaspar Wüthrich, Yinchu Zhu

    Abstract: We propose a robust method for constructing conditionally valid prediction intervals based on models for conditional distributions such as quantile and distribution regression. Our approach can be applied to important prediction problems including cross-sectional prediction, k-step-ahead forecasts, synthetic controls and counterfactual prediction, and individual treatment effects prediction. Our m… ▽ More

    Submitted 21 August, 2021; v1 submitted 17 September, 2019; originally announced September 2019.

    Journal ref: PNAS November 30, 2021 118 (48) e2107794118

  34. arXiv:1909.05782  [pdf, ps, other

    econ.EM stat.CO stat.ME

    Fast Algorithms for the Quantile Regression Process

    Authors: Victor Chernozhukov, Iván Fernández-Val, Blaise Melly

    Abstract: The widespread use of quantile regression methods depends crucially on the existence of fast algorithms. Despite numerous algorithmic improvements, the computation time is still non-negligible because researchers often estimate many quantile regressions and use the bootstrap for inference. We suggest two new fast algorithms for the estimation of a sequence of quantile regressions at many quantile… ▽ More

    Submitted 6 April, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: 29 pages, 3 figures, 4 tables; for associated Stata package, see https://sites.google.com/site/blaisemelly/home/computer-programs/fast

  35. arXiv:1909.00836  [pdf, other

    econ.EM stat.CO

    SortedEffects: Sorted Causal Effects in R

    Authors: Shuowen Chen, Victor Chernozhukov, Iván Fernández-Val, Ye Luo

    Abstract: Chernozhukov et al. (2018) proposed the sorted effect method for nonlinear regression models. This method consists of reporting percentiles of the partial effects in addition to the average commonly used to summarize the heterogeneity in the partial effects. They also proposed to use the sorted effects to carry out classification analysis where the observational units are classified as most and le… ▽ More

    Submitted 6 November, 2019; v1 submitted 2 September, 2019; originally announced September 2019.

    Comments: 15 pages, 6 figures, 8 tables

    MSC Class: 62-07; 62E20

  36. arXiv:1908.09173  [pdf, ps, other

    stat.ML cs.LG econ.EM

    Welfare Analysis in Dynamic Models

    Authors: Victor Chernozhukov, Whitney Newey, Vira Semenova

    Abstract: This paper provides welfare metrics for dynamic choice. We give estimation and inference methods for functions of the expected value of dynamic choice. These parameters include average value by group, average derivatives with respect to endowments, and structural decompositions. The example of dynamic discrete choice is considered. We give dual and doubly robust representations of these parameters… ▽ More

    Submitted 14 October, 2024; v1 submitted 24 August, 2019; originally announced August 2019.

  37. arXiv:1905.10116  [pdf, other

    econ.EM cs.LG math.ST stat.ML

    Semi-Parametric Efficient Policy Learning with Continuous Actions

    Authors: Mert Demirer, Vasilis Syrgkanis, Greg Lewis, Victor Chernozhukov

    Abstract: We consider off-policy evaluation and optimization with continuous action spaces. We focus on observational data where the data collection policy is unknown and needs to be estimated. We take a semi-parametric approach where the value function takes a known parametric form in the treatment, but we are agnostic on how it depends on the observed contexts. We propose a doubly robust off-policy estima… ▽ More

    Submitted 20 July, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

  38. arXiv:1901.03821  [pdf, ps, other

    econ.EM stat.ME

    Mastering Panel 'Metrics: Causal Impact of Democracy on Growth

    Authors: Shuowen Chen, Victor Chernozhukov, Iván Fernández-Val

    Abstract: The relationship between democracy and economic growth is of long-standing interest. We revisit the panel data analysis of this relationship by Acemoglu, Naidu, Restrepo and Robinson (forthcoming) using state of the art econometric methods. We argue that this and lots of other panel data settings in economics are in fact high-dimensional, resulting in principal estimators -- the fixed effects (FE)… ▽ More

    Submitted 12 January, 2019; originally announced January 2019.

    Comments: 8 pages, 2 tables, includes supplementary appendix

    MSC Class: 62P20

  39. arXiv:1812.10820  [pdf, other

    econ.EM

    A $t$-test for synthetic controls

    Authors: Victor Chernozhukov, Kaspar Wuthrich, Yinchu Zhu

    Abstract: We propose a practical and robust method for making inferences on average treatment effects estimated by synthetic controls. We develop a $K$-fold cross-fitting procedure for bias correction. To avoid the difficult estimation of the long-run variance, inference is based on a self-normalized $t$-statistic, which has an asymptotically pivotal $t$-distribution. Our $t$-test is easy to implement, prov… ▽ More

    Submitted 17 January, 2024; v1 submitted 27 December, 2018; originally announced December 2018.

  40. arXiv:1812.08089  [pdf, ps, other

    math.ST

    Inference for Heterogeneous Effects using Low-Rank Estimation of Factor Slopes

    Authors: Victor Chernozhukov, Christian Hansen, Yuan Liao, Yinchu Zhu

    Abstract: We study a panel data model with general heterogeneous effects where slopes are allowed to vary across both individuals and over time. The key dimension reduction assumption we employ is that the heterogeneous slopes can be expressed as having a factor structure so that the high-dimensional slope matrix is low-rank and can thus be estimated using low-rank regularized regression. We provide a simpl… ▽ More

    Submitted 4 September, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

  41. arXiv:1812.04345  [pdf, other

    econ.EM stat.AP stat.ML

    Closing the U.S. gender wage gap requires understanding its heterogeneity

    Authors: Philipp Bach, Victor Chernozhukov, Martin Spindler

    Abstract: In 2016, the majority of full-time employed women in the U.S. earned significantly less than comparable men. The extent to which women were affected by gender inequality in earnings, however, depended greatly on socio-economic characteristics, such as marital status or educational attainment. In this paper, we analyzed data from the 2016 American Community Survey using a high-dimensional wage regr… ▽ More

    Submitted 7 June, 2021; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: Main text: 8 pages, 3 figures; Supplementary Material available online

  42. arXiv:1811.11603  [pdf, other

    econ.EM stat.ME

    Distribution Regression with Sample Selection, with an Application to Wage Decompositions in the UK

    Authors: Victor Chernozhukov, Iván Fernández-Val, Siyi Luo

    Abstract: We develop a distribution regression model under endogenous sample selection. This model is a semi-parametric generalization of the Heckman selection model. It accommodates much richer effects of the covariates on outcome distribution and patterns of heterogeneity in the selection process, and allows for drastic departures from the Gaussian error structure, while maintaining the same level tractab… ▽ More

    Submitted 18 December, 2023; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: 86 pages, 4 tables, 40 figures, includes supplement

    MSC Class: 62P20; 91B40

  43. arXiv:1809.05224  [pdf, ps, other

    math.ST econ.EM

    Automatic Debiased Machine Learning of Causal and Structural Effects

    Authors: Victor Chernozhukov, Whitney K Newey, Rahul Singh

    Abstract: Many causal and structural effects depend on regressions. Examples include policy effects, average derivatives, regression decompositions, average treatment effects, causal mediation, and parameters of economic structural models. The regressions may be high dimensional, making machine learning useful. Plugging machine learners into identifying equations can lead to poor inference due to bias from… ▽ More

    Submitted 21 October, 2022; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: Econometrica 2022

  44. arXiv:1809.04951  [pdf, other

    econ.EM stat.ML

    Valid Simultaneous Inference in High-Dimensional Settings (with the hdm package for R)

    Authors: Philipp Bach, Victor Chernozhukov, Martin Spindler

    Abstract: Due to the increasing availability of high-dimensional empirical applications in many research disciplines, valid simultaneous inference becomes more and more important. For instance, high-dimensional settings might arise in economic studies due to very rich data sets with many potential covariates or in the analysis of treatment heterogeneities. Also the evaluation of potentially more complicated… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: 25 pages, 2 figures, 4 tables

  45. arXiv:1809.01038  [pdf, other

    econ.EM stat.ME

    Shape-Enforcing Operators for Point and Interval Estimators

    Authors: Xi Chen, Victor Chernozhukov, Iván Fernández-Val, Scott Kostyshak, Ye Luo

    Abstract: A common problem in econometrics, statistics, and machine learning is to estimate and make inference on functions that satisfy shape restrictions. For example, distribution functions are nondecreasing and range between zero and one, height growth charts are nondecreasing in age, and production functions are nondecreasing and quasi-concave in input quantities. We propose a method to enforce these r… ▽ More

    Submitted 12 February, 2021; v1 submitted 4 September, 2018; originally announced September 2018.

    Comments: 42 pages, 5 figures, 3 tables, v5 includes changes in the main text

    MSC Class: 62F10; 62F25; 62G05; 62G15

  46. arXiv:1808.10532  [pdf, other

    stat.ME cs.LG econ.EM stat.ML

    Uniform Inference in High-Dimensional Gaussian Graphical Models

    Authors: Sven Klaassen, Jannis Kück, Martin Spindler, Victor Chernozhukov

    Abstract: Graphical models have become a very popular tool for representing dependencies within a large set of variables and are key for representing causal structures. We provide results for uniform inference on high-dimensional graphical models with the number of target parameters $d$ being possible much larger than sample size. This is in particular important when certain features or structures of a caus… ▽ More

    Submitted 3 December, 2018; v1 submitted 30 August, 2018; originally announced August 2018.

    Comments: 59 pages, 2 figures, 6 tables

    MSC Class: 62H15; 62J07;

  47. arXiv:1806.11466  [pdf, ps, other

    math.ST econ.EM

    Subvector Inference in Partially Identified Models with Many Moment Inequalities

    Authors: Alexandre Belloni, Federico Bugni, Victor Chernozhukov

    Abstract: This paper considers inference for a function of a parameter vector in a partially identified model with many moment inequalities. This framework allows the number of moment conditions to grow with the sample size, possibly at exponential rates. Our main motivating application is subvector inference, i.e., inference on a single component of the partially identified parameter vector associated with… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

  48. arXiv:1806.05081  [pdf, other

    econ.EM stat.ME

    LASSO-Driven Inference in Time and Space

    Authors: Victor Chernozhukov, Wolfgang K. Härdle, Chen Huang, Weining Wang

    Abstract: We consider the estimation and inference in a system of high-dimensional regression equations allowing for temporal and cross-sectional dependency in covariates and error processes, covering rather general forms of weak temporal dependence. A sequence of regressions with many regressors using LASSO (Least Absolute Shrinkage and Selection Operator) is applied for variable selection purpose, and an… ▽ More

    Submitted 15 May, 2020; v1 submitted 13 June, 2018; originally announced June 2018.

  49. arXiv:1806.01888  [pdf, other

    math.ST econ.EM

    High-Dimensional Econometrics and Regularized GMM

    Authors: Alexandre Belloni, Victor Chernozhukov, Denis Chetverikov, Christian Hansen, Kengo Kato

    Abstract: This chapter presents key concepts and theoretical results for analyzing estimation and inference in high-dimensional models. High-dimensional models are characterized by having a number of unknown parameters that is not vanishingly small relative to the sample size. We first present results in a framework where estimators of parameters of interest may be represented directly as approximate means.… ▽ More

    Submitted 10 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: 104 pages, 4 figures

  50. arXiv:1803.08154  [pdf, other

    econ.EM stat.ME

    Network and Panel Quantile Effects Via Distribution Regression

    Authors: Victor Chernozhukov, Iván Fernández-Val, Martin Weidner

    Abstract: This paper provides a method to construct simultaneous confidence bands for quantile functions and quantile effects in nonlinear network and panel models with unobserved two-way effects, strictly exogenous covariates, and possibly discrete outcome variables. The method is based upon projection of simultaneous confidence bands for distribution functions constructed from fixed effects distribution r… ▽ More

    Submitted 8 June, 2020; v1 submitted 21 March, 2018; originally announced March 2018.

    Comments: 71 pages, 8 figures, 3 tables, includes supplementary appendix