×

Binary outcomes, OLS, 2SLS and IV probit. (English) Zbl 07584732

Summary: For a binary outcome \(Y\), generated by a simple threshold crossing model with a single exogenous normally distributed explanatory variable \(X\), the OLS estimator of the coefficient on \(X\) in a linear probability model is a consistent estimator of the average partial effect of \(X\). Even in this very simple setting, we show that when allowing for \(X\) to be endogenously determined, the 2SLS estimator, using a normally distributed instrumental variable \(Z\), does not identify the same causal parameter. It instead estimates the average partial effect of \(Z\), scaled by the coefficient on \(Z\) in the linear first-stage model for \(X\), denoted \(\gamma_1\), or equivalently, it estimates the average partial effect of the population predicted value of \(X\), \(Z_{\gamma_1}\). These causal parameters can differ substantially as we show for the normal Probit model, which implies that care has to be taken when interpreting 2SLS estimation results in a linear probability model. Under joint normality of the error terms, IV Probit maximum likelihood estimation does identify the average partial effect of \(X\). The two-step control function procedure of D. Rivers and Q. H. Vuong [J. Econom. 39, No. 3, 347–366 (1988; Zbl 0668.62081)] can also estimate this causal parameter consistently, but a double averaging is needed, one over the distribution of the first-stage error \(V\) and one over the distribution of \(X\). If instead a single averaging is performed over the joint distribution of \(X\) and \(V\), then the same causal parameter is estimated as the one estimated by the 2SLS estimator in the linear probability model. The 2SLS estimator is a consistent estimator when the average partial effect is equal to 0, and the standard Wald test for this hypothesis has correct size under strong instrument asymptotics. We show that, in general, the standard weak instrument first-stage F-test interpretations do not apply in this setting.

MSC:

62P20 Applications of statistics to economics

Citations:

Zbl 0668.62081

References:

[1] Andrews, I., Valid two-step identification-robust confidence sets for GMM, The Review of Economics and Statistics, 100, 2, 337-348 (2018) · doi:10.1162/REST_a_00682
[2] Arellano, M. (2008). Binary models with endogenous explanatory variables. Technical Report, CEMFI.
[3] Frazier, D.; Renault, E.; Zhang, L.; Zhao, X., Weak instrument test in discrete choice models (2019)
[4] Magnusson, L., Inference in limited dependent variable models robust to weak identification, The Econometrics Journal, 13, 3, S56-S79 (2010) · Zbl 07708398 · doi:10.1111/j.1368-423X.2009.00309.x
[5] Newey, W. K.; Ruud, P. A.; Andrews, D. W. K.; Stock, J. H., Identification and inference for econometric models, Density weighted linear least squares, 554-573 (2005), Cambridge: Cambridge University Press, Cambridge · Zbl 1126.62023
[6] Nunn, N.; Qian, N., US food aid and civil conflict, American Economic Review, 104, 6, 1630-1666 (2014) · doi:10.1257/aer.104.6.1630
[7] Rivers, D.; Vuong, Q. H., Limited information estimators and exogeneity tests for simultaneous probit models, Journal of Econometrics, 39, 3, 347-366 (1988) · Zbl 0668.62081 · doi:10.1016/0304-4076(88)90063-2
[8] Ruud, P. A., Sufficient conditions for the consistency of maximum likelihood estimation despite misspecification of distribution in multinomial discrete choice models, Econometrica, 51, 1, 225 (1983) · Zbl 0513.62071 · doi:10.2307/1912257
[9] Ruud, P. A., Consistent estimation of limited dependent variable models despite misspecification of distribution, Journal of Econometrics, 32, 1, 157-187 (1986) · Zbl 0626.62109 · doi:10.1016/0304-4076(86)90017-5
[10] Stock, J. H.; Yogo, M.; Andrews, D. W. K.; Stock, J. H., Identification and inference for econometric models, Testing for weak instruments in linear IV regression, 80-108 (2005), Cambridge: Cambridge University Press, Cambridge · Zbl 1121.62066
[11] Stoker, T. M., Consistent estimation of scaled coefficients, Econometrica, 54, 6, 1461 (1986) · Zbl 0628.62105 · doi:10.2307/1914309
[12] Windmeijer, F., Testing underidentification in linear models, with applications to dynamic panel and asset pricing models, Journal of Econometrics (2021) · Zbl 07822335 · doi:10.1016/j.jeconom.2021.03.007
[13] Wooldridge, J. M., Econometric analysis of cross section and panel data (2010), Cambridge: The MIT Press, Cambridge · Zbl 1327.62009
[14] Zhang, L.; Frazier, D. T.; Poskitt, D. S.; Zhao, X., Decomposing identification gains and evaluating instrument identification power for partially identified average treatment effects (2020)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.