×

Regression analysis of multivariate fractional data. (English) Zbl 1491.62248

Summary: The present article discusses alternative regression models and estimation methods for dealing with multivariate fractional response variables. Both conditional mean models, estimable by quasi-maximum likelihood, and fully parametric models (Dirichlet and Dirichlet-multinomial), estimable by maximum likelihood, are considered. A new parameterization is proposed for the parametric models, which accommodates the most common specifications for the conditional mean (e.g., multinomial logit, nested logit, random parameters logit, dogit). The text also discusses at some length the specification analysis of fractional regression models, proposing several tests that can be performed through artificial regressions. Finally, an extensive Monte Carlo study evaluates the finite sample properties of most of the estimators and tests considered.

MSC:

62P20 Applications of statistics to economics
62J05 Linear regression; mixed models
62H15 Hypothesis testing in multivariate analysis

Software:

Fahrmeir
Full Text: DOI

References:

[1] Aitchison, J., The statistical analysis of compositional data (with discussion), Journal of the Royal Statistical Society, Series B (Statistical Methodology), 44, 2, 139-177 (1982) · Zbl 0491.62017
[2] Aitchison, J.; Egozcue, J., Compositional data analysis: Where are we and where should we be heading?, Mathematical Geology, 37, 7, 829-850 (2005) · Zbl 1177.86017
[3] Alkhamisia, M.; Khalaf, G.; Shukur, G., The effect of fat-tailed error terms on the properties of system-wise RESET test, Journal of Applied Statistics, 35, 1, 101-113 (2008) · Zbl 1206.62115
[4] Andrews, D. W. K., Testing when a parameter is on the boundary of the maintained hypothesis, Econometrica, 69, 683-784 (2001) · Zbl 0999.62010
[5] Ben-Akiva, M., Choice Models with Simple Choice Set Generating Processes (1977)
[6] Chesher, A.; Santos Silva, J., Taste variation in Discrete choice models, The Review of Economic Studies, 69, 1, 147-168 (2002) · Zbl 1008.91030
[7] Chotikapanich, D.; Griffiths, W. E., Estimating Lorenz curves using a Dirichlet distribution, Journal of Business & Economic Statistics, 20, 2, 290-295 (2002)
[8] Considine, T. J.; Mount, T. D., The use of linear logit models for dynamic input demand systems, Review of Economics and Statistics, 66, 434-443 (1984)
[9] Dubin, J., Valuing intangible assets with a nested logit market share model, Journal of Econometrics, 139, 285-302 (2007) · Zbl 1418.62446
[10] Fahrmeir, L.; Tutz, G., Multivariate Statistical Modelling Based on Generalized Linear Models (2001), New York: Springer, New York · Zbl 0980.62052
[11] Ferrari, S.; Cribari-Neto, F., Beta regression for modelling rates and proportions, Journal of Applied Statistics, 31, 7, 799-815 (2004) · Zbl 1121.62367
[12] Fry, J. M.; Fry, T. R. L.; McLaren, K. M., The stochastic specification of demand share equations: Restricting budget shares to the unit simplex, Journal of Econometrics, 73, 377-385 (1996) · Zbl 0861.62081
[13] Gaudry, M. J.; Dagenais, M. G., The dogit model, Transportation Research, 13B, 2, 105-111 (1979)
[14] Giles, D.; Keil, A., Applying the RESET test in allocation models: A cautionary note, Applied Economics Letters, 4, 359-363 (1997)
[15] Gouriéroux, C.; Monfort, A.; Trognon, A., Pseudo maximum likelihood methods: Theory, Econometrica, 52, 681-700 (1984) · Zbl 0575.62031
[16] Guimarães, P.; Lindrooth, R. C., Controlling for overdispersion in grouped conditional logit models: A computationally simple application of Dirichlet-Multinomial regression, Econometrics Journal, 10, 439-452 (2007) · Zbl 1122.62104
[17] Heckman, J. J.; Willis, R. J., A beta-logistic model for the analysis of sequential labor force participation by married women, Journal of Political Economy, 85, 27-58 (1977)
[18] Heien, D.; Wessells, C. R., Demand systems estimation with microdata: A censored regression approach, Journal of Business & Economic Statistics, 8, 3, 365-371 (1990)
[19] Hermalin, B. E.; Wallace, N. E., The determinants of efficiency and solvency in savings and loans, Rand Journal of Economics, 25, 3, 361-381 (1994)
[20] Johnson, N.; Kemp, A.; Kotz, S., Univariate Discrete Distributions (2005), New York: Wiley, New York · Zbl 1092.62010
[21] Johnson, N.; Kotz, S.; Balakrishnan, N., Discrete Multivariate Distributions (1997), New York: Wiley, New York · Zbl 0868.62048
[22] Katz, J. N.; King, G., A statistical model for multiparty electoral data, Political Science, 93, 1, 15-32 (1999)
[23] Klawitter, M., The effects of sexual orientation and marital status on how couples hold their money, Review of Economics of the Household, 6, 4, 423-446 (2008)
[24] Kotz, S.; Balakrishnan, N.; Johnson, N., Continuous Multivariate Distributions, 1 (2000), New York: Wiley, New York · Zbl 0946.62001
[25] Lee, L.-F.; Pitt, M., Microeconometric demand systems with binding nonnegativity constraints: The dual approach, Econometrica, 54, 5, 1237-1242 (1986) · Zbl 0603.90036
[26] McCullagh, P.; Nelder, J. A., Generalized Linear Models (1989), London: Chapman and Hall, London · Zbl 0744.62098
[27] Mosimann, J., On the compound multinomial distribution, the multivariate beta-distribution, and correlation among proportions, Biometrika, 49, 65-82 (1962) · Zbl 0105.12502
[28] Mullahy, J., Multivariate Fractional Regression Estimation of Econometric Share Models. 2nd International Health Econometrics Workshop (2010), Rome, Italy · Zbl 1345.62096
[29] Mullahy, J.; Robert, S., No time to lose: Time constraints and physical activity in the production of health, Revue of the Economics of the Household, 8, 4, 409-432 (2010)
[30] Newey, W., Maximum likelihood specification testing and conditional moment tests, Econometrica, 53, 1047-1070 (1985) · Zbl 0629.62107
[31] Pagan, A.; Vella, F., Diagnostic tests for models based on individual data: A survey, Journal of Applied Econometrics, 4, S29-S59 (1989)
[32] Paolino, P., Maximum likelihood estimation of models with beta-distributed dependent variables, Political Analysis, 9, 4, 325-346 (2001)
[33] Papke, L. E.; Wooldridge, J. M., Econometric methods for fractional response variables with an application to 401(k) plan participation rates, Journal of Applied Econometrics, 11, 6, 619-632 (1996)
[34] Poterba, J.; Samwick, A., Taxation and household portfolio composition: US evidence from the 1980s and 1990s, Journal of Public Economics, 87, 5-38 (2002)
[35] Pregibon, D., Goodness of link tests for generalized linear models, Applied Statistics, 29, 1, 15-24 (1980) · Zbl 0434.62048
[36] Pu, C.; Lan, V.; Chou, Y.; Lan, C., The crowding-out effects of tobacco and alcohol where expenditure shares are low: Analyzing expenditure data for Taiwan, Social Science & Medicine, 66, 9, 1979-1989 (2008)
[37] Ramalho, E.; Ramalho, J.; Murteira, J., Alternative estimating and testing empirical strategies for fractional regression models, Journal of Economic Surveys, 25, 1, 19-68 (2011)
[38] Ramsey, J. B., Tests for specification errors in classical linear least-squares regression analysis, Journal of the Royal Statistical Society B, 31, 350-371 (1969) · Zbl 0179.48902
[39] Santos Silva, J. M. C.; Murteira, J., Estimation of default probabilities using incomplete contracts data, Journal of Empirical Finance, 16, 3, 457-465 (2009)
[40] Shukur, G.; Edgerton, D., The small sample properties of the reset test as applied to systems of equations, Journal of Statistical Computation and Simulation, 72, 12, 909-924 (2002) · Zbl 1014.62080
[41] Sivakumar, A.; Bhat, C., Fractional split-distribution model for statewide commodity-flow analysis, Transportation Research Record, 1790, 80-88 (2002)
[42] Tauchen, G., Diagnostic testing and evaluation of maximum likelihood models, Journal of Econometrics, 30, 415-443 (1985) · Zbl 0591.62094
[43] Train, K. E., Discrete Choice Methods with Simulation (2009), Cambridge, UK: Cambridge University Press, Cambridge, UK · Zbl 1269.62073
[44] Tse, Y. K., A diagnostic test for the multinomial logit model, Journal of Business & Economics Statistics, 16, 283-286 (1987)
[45] Wales, T. J.; Woodland, A. D., Estimation of consumer demand systems with binding non-negativity constraints, Journal of Econometrics, 21, 263-285 (1983) · Zbl 0512.62119
[46] Wang, H.; Zhang, L.; Hsiao, W., Ill health and its potential influence on household consumptions in rural China, Health Policy, 78, 2-3, 167-177 (2006)
[47] White, H., Maximum likelihood estimation of misspecified models, Econometrica, 50, 1, 1-25 (1982) · Zbl 0478.62088
[48] Woodland, A. D., Stochastic specification and the estimation of share equations, Journal of Econometrics, 10, 361-383 (1979) · Zbl 0414.62083
[49] Wooldridge, J. M., Specification testing and quasi-maximum likelihood estimation, Journal of Econometrics, 48, 29-55 (1991)
[50] Ye, X.; Pendyala, R. M.; Mahmassani, H. S., Transportation and Traffic Theory: Flow, Dynamics, and Human Interaction, A model of daily time use allocation using fractional logit methodology, 507-524 (2005), Elsevier Science Ltd.
[51] Yin, R. S.; Xiang, Q.; Xu, J. T.; Deng, X. Z., Modeling the driving forces of the land use and land cover changes along the upper Yangtze river of China, Environmental Management, 45, 3, 454-465 (2010)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.