×

Simultaneous classification and multidimensional scaling with external information. (English) Zbl 1306.62449

Summary: For the exploratory analysis of a matrix of proximities or (dis)similarities between objects, one often uses cluster analysis (CA) or multidimensional scaling (MDS). Solutions resulting from such analyses are sometimes interpreted using external information on the objects. Usually the procedures of CA, MDS and using external information are carried out independently and sequentially, although combinations of two of the three procedures (CA and MDS, or multidimensional scaling and using external information) have been proposed in the literature. The present paper offers a procedure that combines all three procedures in one analysis, using a model that describes a partition of objects with cluster centroids represented in a low-dimensional space, which in turn is related to the information in the external variables. A simulation study is carried out to demonstrate that the method works satisfactorily for data with a known underlying structure. Also, to illustrate the method, it is applied to two empirical data sets.

MSC:

62P15 Applications of statistics to psychology
62H12 Estimation in multivariate analysis
62J05 Linear regression; mixed models
Full Text: DOI

References:

[1] Abelson R.P., Sermat V. (1962) Multidimensional scaling of facial expressions. Journal of Experimental Psychology 63:546–554 · doi:10.1037/h0042280
[2] Bock H.H. (1987) On the interface between cluster analysis, principal component analysis, and multidimensional scaling. In: Bozdogan H., Gupta A.K. (eds) Multivariate Statistical Modeling and Data Analysis. Reidel, New York, pp 17–34
[3] Borg I., Groenen P. (1997) Modern Multidimensional Scaling: Theory and Applications. Springer, Berlin Heidelberg New York · Zbl 0862.62052
[4] Carroll J.D., Chang J.-J. (1970) Analysis of individual differences in multidimensional scaling via an N-way generalization of ”Eckart-Young” decomposition. Psychometrika 35:283–319 · Zbl 0202.19101 · doi:10.1007/BF02310791
[5] Cox T.F., Cox M.A.A. (1994) Multidimensional Scaling. Chapman & Hall, London
[6] De Leeuw J. (1977) Applications of convex analysis to multidimensional scaling. In: Barra J.R., Brodeau F., Romier G., van Cutsem B. (eds) Recent Developments in Statistics. North Holland, Amsterdam, pp 133–145
[7] De Leeuw J., Heiser W. (1977) Convergence of correction matrix algorithms for multidimensional scaling, In: Lingoes J.C. (ed) Geometric Representations of Relational Data. Mathesis Press, Ann Arbor Michigan, pp 735–752
[8] De Leeuw J., Heiser W. (1982) Theory of multidimensional scaling. In: Krishnaiah P.R., Kanai L.N. (eds) Handbook of Statistics vol. 2. North Holland, Amsterdam · Zbl 0511.62076
[9] Diederich G., Messick S.J., Tucker L.R. (1957) A general least squares solution for successive intervals. Psychometrika 22:159–173 · Zbl 0080.13405 · doi:10.1007/BF02289051
[10] Ekman G. (1954) Dimensions of color vision. Journal of Psychology 38:467–474 · doi:10.1080/00223980.1954.9712953
[11] Engen T., Levy N., Schlosberg H. (1958) The dimensional analysis of a new series of facial expressions. Journal of Experimental Psychology 55:454–458 · doi:10.1037/h0047240
[12] Escoufier Y. (1973) Le traitement des variables vectorielles [The treatment of vectorial variables]. Biometrics 29:751–760 · doi:10.2307/2529140
[13] Gordon A.D. (1990) Constructing dissimilarity measures. Journal of Classification 7:257–269 · doi:10.1007/BF01908719
[14] Gower J.C. (1966) Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika 53:325–338 · Zbl 0192.26003 · doi:10.1093/biomet/53.3-4.325
[15] Green P.E., Carmone F., Smith S.M. (1989) Multidimensional Scaling: Concept and Applications. Allyn & Bacon, Boston
[16] Groenen P.J.F. (1993) The Majorization Approach to Multidimensional Scaling: Some Problems and Extensions. DSWO Press, Leiden · Zbl 0898.62080
[17] Gutman L., Levy S. (1991) Two structural laws for intelligence tests. Intelligence 15:79–103 · doi:10.1016/0160-2896(91)90023-7
[18] Hair J.F., Anderson R.E., Tatham R.L., Black W.C. (1998) Multivariate Data Analysis. Prentice Hall Inc, New Jersey
[19] Heiser W.J. (1993) Clustering in low-dimensional space. In: Opitz O., Lausen B., Klar R. (eds) Information and Classification: Concepts, Methods and Applications. Springer, Berlin Heidelberg NewYork, pp 162–173
[20] Heiser W.J., Groenen P.J.F. (1997) Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima. Psychometrika 62:63–83 · Zbl 0889.92037 · doi:10.1007/BF02294781
[21] Heiser W.J., Meulman J.J. (1983) Constrained multidimensional scaling including confirmation. Applied Psychological Measurement 7:381–404 · doi:10.1177/014662168300700402
[22] Hubert L., Arabie P. (1985) Comparing partitions. Journal of Classification 2:193–218 · Zbl 0587.62128 · doi:10.1007/BF01908075
[23] Keller J.B. (1962) Factorization of matrices by least squares. Biometrika 49:239–242 · Zbl 0106.34402 · doi:10.1093/biomet/49.1-2.239
[24] Kruskal J.B. (1964) Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika 29:1–27 · Zbl 0123.36803 · doi:10.1007/BF02289565
[25] MacQueen J.B. (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Symposumon Mathemaical Statistics and Probability Vol 1. University of California, Berkeley · Zbl 0214.46201
[26] Mathar R. (1985) The best Euclidean fit to a given distance matrix in prescribed dimensions. Linear Algebra and Its Applications 67:1–6 · Zbl 0569.15015 · doi:10.1016/0024-3795(85)90181-8
[27] Milligan W.M., Cooper M.C. (1985) An examination of procedures for determining the number of clusters in a data set. Psychometrika 50:159–179 · doi:10.1007/BF02294245
[28] Penrose R. (1956) On best approximate solutions of linear matrix equations. Proceedings of the Cambridge Philosophical Society 52:17–19 · Zbl 0070.12501 · doi:10.1017/S0305004100030929
[29] Shepard R.N. (1962) The analysis of proximities: multidimensional scaling with an unknown distance function. Psychometrika 27:125–139, 219–246 · Zbl 0129.12103 · doi:10.1007/BF02289630
[30] Torgerson W.S. (1958) Theory and Methods of Scaling. Wiley, New York
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.