×

Multivariate analysis with linearizable regressions. (English) Zbl 0718.62139

Summary: We study the class of multivariate distributions in which all bivariate regressions can be linearized by separate transformation of each of the variables. This class seems more realistic than the multivariate normal or the elliptical distributions, and at the same time its study allows us to combine the results from multivariate analysis with optimal scaling and classical multivariate analysis. In particular, a two-stage procedure which first scales the variables optimally, and then fits a simultaneous equations model, is studied in detail and is shown to have some desirable properties.

MSC:

62H25 Factor analysis and principal components; correspondence analysis
62-07 Data analysis (statistics) (MSC2010)
Full Text: DOI

References:

[1] Anderson, T. W. (1958).An introduction to multivariate statistical analysis. New York, Wiley. · Zbl 0083.14601
[2] Bakker, B. F. M., Dronkers, J., & Ganzeboom, H. B. G. (1984).Social stratification and mobility in The Netherlands. Amsterdam: SISWO.
[3] Bekker, P. & de Leeuw, J. (1988). Relations between various forms of nonlinear principal component analysis. In J. van Rijckevorsel & J. de Leeuw,Progress in component and correspondence analysis. New York: Wiley.
[4] Bentler, P. M. (1983). Some contributions to efficient statistics in structural models: specification and estimation of moment structures.Psychometrika, 48, 493–518. · Zbl 0533.62091 · doi:10.1007/BF02293875
[5] Besse, P., & Ramsay, J. O. (1986). Principal components analysis of sampled functions.Psychometrika, 51, 285–311. · Zbl 0623.62048 · doi:10.1007/BF02293986
[6] Breiman, L., & Friedman, J. H. (1985). Estimating optimal transformations for multiple regression and correlation.Journal of the American Statistical Association, 80, 580–598. · Zbl 0594.62044 · doi:10.2307/2288473
[7] de Leeuw, J. (1968).Canonical discriminant analysis of relational data (Report RN 007-68). Leiden: University of Leiden, Department of Data Theory.
[8] de Leeuw, J. (1982). Nonlinear principal component analysis. In H. Caussinus (Ed.),COMPSTAT 1982 (pp. 77–86). Wien: Physika Verlag.
[9] de Leeuw, J. (1983a). On the prehistory of correspondence analysis.Statistica Neerlandica, 37, 161–164. · Zbl 0546.62034 · doi:10.1111/j.1467-9574.1983.tb00810.x
[10] de Leeuw, J. (1983b). Models and methods for the analysis of correlation coefficients.Journal of Econometrics, 22, 113–137. · doi:10.1016/0304-4076(83)90096-9
[11] de Leeuw, J. (1984a). Models of data.Kwantitatieve Methoden, 5, 17–30.
[12] de Leeuw, J. (1984b). The Gift-system of nonlinear multivariate analysis. In E. Diday (Ed.),Data analysis and informatics II (pp. 415–424). Amsterdam: North Holland.
[13] de Leeuw, J. (1984c). Discrete normal linear regression models. In T. K. Dijkstra (Ed.),Misspecification analysis (pp. 56–71). Berlin: Spring Verlag.
[14] de Leeuw, J. (1986). Regression with optimal scaling of the dependent variable. In O. Bunke (Ed.),Proceedings of the 7th International Summer School on Problems of Model Choice and Parameter Estimation in Regression Analysis (Report No. 84, pp. 99–111). Berlin, GDR: Humboldt University, Department of Mathematics.
[15] de Leeuw, J. (1988a). Model selection in multinomial experiments. In T. K. Dijkstra (Ed.),On model uncertainty and its statistical implications (pp. 118–138). Berlin: Springer Verlag.
[16] de Leeuw, J. (1988b). Models and techniques.Statistica Neerlandica, 42, 91–98. · doi:10.1111/j.1467-9574.1988.tb01222.x
[17] de Leeuw, J. (in press). Multivariate analysis with optimal scaling. In S. Das Gupta (Ed.),Progress in multivariate analysis. Calcutta: Indian Statistical Institute.
[18] de Leeuw, J. & van der Heijden, P. G. M. (1988). Correspondence analysis of incomplete contingency tables,Psychometrika, 53, 223–233. · Zbl 0718.62116 · doi:10.1007/BF02294134
[19] de Leeuw, J. & van der Heijden, P. G. M. (in press). The analysis of time budgets with a latent time budget model. In E. Diday (Eds.),Data analysis and informatics V. Amsterdam: North Holland.
[20] de Leeuw, J. &. van Rijckevorsel, J. L. A. (1988). Beyond homogeneity analysis. In J. van Rijckevorsel & J. de Leeuw,Progress in component and correspondence analysis (pp. 55–81). New York: Wiley.
[21] Dijkstra, T. (1983). Some comments on maximum likelihood and partial least squares methods.Journal of Econometrics, 22, 67–90. · Zbl 0521.62098 · doi:10.1016/0304-4076(83)90094-5
[22] Freedman, D. A. (1987). As others see us: A case study in path analysis.Journal of Educational Statistics, 12, 101–129. · doi:10.2307/1164888
[23] Gifi, A. (1980). Data analyse en statistiek [Data analysis and statistics].Bulletin VVS, 13(5), 10–16.
[24] Gifi, A. (1988).Nonlinear multivariate analysis. Leiden, DSWO-Press. · Zbl 0697.62048
[25] Gilula, Z., & Haberman, S. J. (1986). Canonical analysis of contingency tables by maximum likelihood.Journal of the American Statistical Association, 81, 780–788. · Zbl 0623.62047 · doi:10.2307/2289010
[26] Goodman, L. A. (1986). Some useful extensions of the usual correspondence analysis approach and the usual loglinear models approach in the analysis of contingency tables.International Statistical Review, 54, 243–309. · Zbl 0611.62060 · doi:10.2307/1403053
[27] Greenacre, M. J. (1984).Theory and applications of correspondence analysis. New York: Academic Press. · Zbl 0555.62005
[28] Guttman, L. (1955). The determinacy of factor score matrices with implications for five other basic problems of common-factor theory.British Journal of Statistical Psychology, 8, 65–82.
[29] Guttman, L. (1959). Metricizing rank-ordered or unordered data for a linear factor analysis.Sankhya, 21, 257–268. · Zbl 0090.11501
[30] Guttman, L. (1971). Measurement as structural theory.Psychometrika, 36, 329–347. · doi:10.1007/BF02291362
[31] Hirschfeld, H. O. (1935). A connection between correlation and contingency.Proceedings of the Cambridge Philosophical Society, 31, 520–524. · JFM 61.1304.01 · doi:10.1017/S0305004100013517
[32] Isserlis, L. (1916). On certain probable errors and correlation coefficients of multiple frequency distributions with skew regression.Biometrika, 11, 185–190. · JFM 46.1495.10 · doi:10.1093/biomet/11.3.185
[33] Kiiveri, H. T. (1987). An incomplete data approach to the analysis of covariance structures.Psychometrika, 52, 539–554. · Zbl 0718.62102 · doi:10.1007/BF02294818
[34] Koyak, R. A. (1987). On measuring internal dependence in a set of random variables.Annals of Statistics, 15, 1215–1229. · Zbl 0631.62069 · doi:10.1214/aos/1176350501
[35] Lebart, L., Morineau, A., & Warwick, K. M. (1984).Multivariate descriptive statistical analysis. New York: Wiley. · Zbl 0658.62069
[36] Little, R. J. A., & Rubin, D. B. (1983). On jointly estimating parameters and missing data by maximizing the complete-data likelihood.American Statistician, 37, 218–220. · doi:10.2307/2683374
[37] Little, R. J. A., & Rubin, D. B. (1987).Statistical analysis with missing data. New York: Wiley. · Zbl 0665.62004
[38] Molenaar, I. (1988). Formal statistics and informal data analysis, or why laziness should be discouraged.Statistica Neerlandica, 83–90.
[39] Mooijaart, A., Meijerink, F., & de Leeuw, J. (1988). Nonlinear path models. Unpublished manuscript.
[40] Muthén, B. (1984). A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators.Psychometrika, 49, 115–132. · doi:10.1007/BF02294210
[41] Pearson, K. (1906). On certain points connected with scale order in the case of a correlation of two characters which for some arrangement give a linear regression line.Biometrika, 5, 176–178.
[42] Peschar, J. L. (1973).School, milieu, beroep [School, environment, profession]. Groningen, The Netherlands: Wolters.
[43] Steiger, J. H., & Browne, M. W. (1984). The comparison of independent correlations between optimal linear composites.Psychometrika, 49, 11–24. · Zbl 0559.62045 · doi:10.1007/BF02294202
[44] Takane, Y., Young, F. W., & de Leeuw, J. (1979). Nonmetric common factor analysis: An alternating least squares method with optimal scaling features.Behaviormetrika, 6, 45–56. · doi:10.2333/bhmk.6.45
[45] Tenenhaus, M., & Young, F. W. (1985). An analysis and synthesis of multiple correspondence analysis, optimal scaling, dual scaling, homogeneity analysis and other methods for quantifying categorical multivariate data.Psychometrika, 50, 91–119. · Zbl 0585.62104 · doi:10.1007/BF02294151
[46] van der Burg, E., de Leeuw, J., & Verdegaal, R. (1988). Homogeneity analysis withk sets of variables,Psychometrika, 53, 177–197. · Zbl 0718.62143 · doi:10.1007/BF02294131
[47] van der Heijden, P. G. M., & de Leeuw, J. (1985). Correspondence analysis used complementary to loglinear analysis.Psychometrika, 50, 429–447. · Zbl 0616.62082 · doi:10.1007/BF02296262
[48] van Praag, B. M. S., de Leeuw, J., & Kloek, T. (1986). The population sample decomposition approach to multivariate estimation methods.Applied Stochastic Models and Data Analysis, 2, 99–120. · Zbl 0628.62052 · doi:10.1002/asm.3150020302
[49] van Rijckevorsel, J. L. A. (1987).The application of horseshoes and fuzzy coding in multiple correspondence analysis. Leiden: DSWO-Press.
[50] van Rijckevorsel, J. L. A., & de Leeuw, J. (Eds.). (1988).Progress in component and correspondence analysis. New York: Wiley.
[51] Winsberg, S., & Ramsay, J. O. (1980). Monotonic transformations to additivity using splines.Biometrika, 67, 669–674. · Zbl 0453.62053 · doi:10.1093/biomet/67.3.669
[52] Young, F. W. (1981). Quantitative analysis of qualitative data.Psychometrika, 46, 357–388. · Zbl 0479.62003 · doi:10.1007/BF02293796
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.