×

Two-mode multi-partitioning. (English) Zbl 1452.62463

Summary: New methodologies for two-mode (objects and variables) multi-partitioning of two way data are presented. In particular, by reanalyzing the double \(k\)-means, that identifies a unique partition for each mode of the data, a relevant extension is discussed which allows to specify more partitions of one mode, conditionally to the partition of the other one. The performance of such generalized double \(k\)-means has been tested by both a simulation study and an application to gene microarray data.

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62P10 Applications of statistics to biology and medical sciences; meta analysis
62-08 Computational methods for problems pertaining to statistics

Software:

clusfind
Full Text: DOI

References:

[1] Alon, U.; Bakai, N.; Notterman, D. A.; Gish, K.; Ybarra, S.; Mack, D.; Lavine, A. J., Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays, Proc. Natl. Acad. Sci. USA, 96, 6745-6750 (1999)
[2] Ahuja, R. K.; Magnanti, T. L.; Orlin, J. B., Network Flows—Theory, Algorithms, and Applications (1993), Prentice Hall: Prentice Hall Englewood Cliffs NJ · Zbl 1201.90001
[3] Ben-Dor, A.; Shamir, R.; Yakhini, Z., Clustering gene expression patterns, J. Comput. Biol., 6, 281-297 (1999)
[4] Bittner, M.; Meltzer, P.; Chen, Y.; Jiang, Y.; Seftor, E.; Hendrix, M.; Radmacher, M.; Simon, R.; Yakhini, Z.; Ben-Dor, A.; Sampas, N.; Dougherty, E.; Wang, E.; Marincola, F.; Gooden, C.; Lueders, J.; Glatfelter, A.; Pollock, P.; Carpten, J.; Gillanders, E.; Leja, D.; Dietrich, K.; Beaudry, C.; Berens, M.; Alberts, D.; Sondak, V.; Hayward, N.; Trent, J., Molecular classification of cutaneous malignant melanoma by gene expression profiling, Nature, 406, 536-540 (2000)
[5] Blatt, M.; Wiseman, S.; Domany, E., Supermagnetic clustering of data, Phys. Rev. Lett., 76, 18, 3251-3254 (1996)
[6] Bock, H. H., Simultaneous clustering of objects and variables, (Tomassone, R., Analyse des données et informatique (1979), Le Chesnay: Le Chesnay France INRIA) · Zbl 0454.62055
[7] Bock, H. H., Two-way clustering for contingency tables maximizing a dependence measure, (Schader, M.; Gaul, W.; Vichi, M., Between Data Science and Applied Data Analysis (2003), Springer: Springer Heidelberg), 143-155 · Zbl 05280168
[8] Bock, H. H.; Vichi, M., Statistical learning methods including dimensionality reduction, Comput. Statist. Data Anal., 52, 1, 370-613 (2007) · Zbl 1452.00018
[9] Calinski, T.; Harabasz, J., A dendride method for cluster analysis, Commun. Statist., 3, 1-27 (1974) · Zbl 0273.62010
[10] Celeux, G.; Govaert, G., A classification EM algorithm for clustering and two stochastic versions, Comput. Statist. Data Anal., 14, 315-332 (1992) · Zbl 0937.62605
[11] DeSarbo, W. S., GENNCLUS: new models for general non-hierarchical clustering analysis, Psychometrika, 47, 449-475 (1982) · Zbl 0566.62057
[12] Eisen, M.; Spellman, P.; Brown, P.; Botstein, D., Proc. Natl. Acad. Sci. USA, 95, 14863-14868 (1998)
[13] Getz, G.; Levine, E.; Domany, E., Coupled two-way clustering of gene microarray data, Proc. Natl. Acad. Sci. USA, 97, 22, 12079-12084 (2000)
[14] Goldstein, D. R.; Ghosh, D.; Conlon, E. M., Satistical issues in the clustering of gene expression data, Statist. Sinica, 12, 219-240 (2002) · Zbl 0997.62088
[15] Gordon, A.D., 1999. Classification, second ed. Chapman and Hall, London.; Gordon, A.D., 1999. Classification, second ed. Chapman and Hall, London. · Zbl 0929.62068
[16] Govaert, G., Simultaneous clustering of rows and columns. Control Cybernet., 24, 437-458 (1995) · Zbl 0852.62055
[17] Govaert, G.; Nadif, M., Clustering with block mixture models, Pattern Recognition, 36, 463-473 (2003)
[18] Hartigan, J. A., Direct clustering of a data matrix, J. Amer. Statist. Assoc., 67, 123-129 (1972)
[19] Hartigan, J. A., Clustering Algorithms (1975), Wiley: Wiley New York · Zbl 0372.62040
[20] Hubert, L.; Arabie, P., Comparing partitions, J. Classification, 2, 193-218 (1985)
[21] Kaufman, L.; Rousseeuw, P., Finding Groups in Data: An Introduction to Cluster Analysis (1990), Wiley: Wiley New York · Zbl 1345.62009
[22] MacQueen, J., 1967. Some methods for classification and analysis of multivariate observations. In: Le Cam, L.M., and Neyman, J. (Eds.), Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1 Statistics. University of California Press, Berkeley, pp. 281-297.; MacQueen, J., 1967. Some methods for classification and analysis of multivariate observations. In: Le Cam, L.M., and Neyman, J. (Eds.), Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1 Statistics. University of California Press, Berkeley, pp. 281-297. · Zbl 0214.46201
[23] Milligan, G. W.; Cooper, M., An Estimation of Procedures for Determining the Number of Clusters in a Data Set, Psychometrika., 50, 159-179 (1985)
[24] Pollard, K. S.; van der Laan, M. J., Statistical Inference for simultaneous clustering of gene expression data, Math. Biosci., 176, 99-121 (2002) · Zbl 0997.62090
[25] Van Mechelen, I.; Bock, H. H.; De Boeck, P., Two-mode clustering methods: a structured overview, Statist. Methods Medical Res., 13, 5, 363-394 (2004) · Zbl 1053.62078
[26] Vichi, M., 2000. Double \(k\); Vichi, M., 2000. Double \(k\)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.