×

Learning robust affinity graph representation for multi-view clustering. (English) Zbl 1475.68276

Summary: Recently, an increasingly pervasive trend in real-word applications is that the data are collected from multiple sources or represented by multiple views. Owing to the powerful ability of affinity graph in capturing the structural relationships among samples, constructing a robust and meaningful affinity graph has been extensively studied, especially in spectral clustering tasks. However, conventional spectral clustering extended to multi-view scenarios cannot obtain the satisfactory performance due to the presence of noise and the heterogeneity among different views. In this paper, we propose a robust affinity graph learning framework to deal with this issue. First, we employ an improved feature selection algorithm that integrates the advantages of hypergraph embedding and sparse regression to select significant features such that more robust graph Laplacian matrices for various views on this basis can be constructed. Second, we model hypergraph Laplacians as points on a Grassmann manifold and propose a Consistent Affinity Graph Learning (CAGL) algorithm to fuse all views. CAGL aims to learn a latent common affinity matrix shared by all Laplacian matrices by taking both the clustering quality evaluation criterion and the view consistency loss into account. We also develop an alternating descent algorithm to optimize the objective function of CAGL. Experiments on five publicly available datasets demonstrate that our proposed method obtains promising results compared with state-of-the-art methods.

MSC:

68T05 Learning and adaptive systems in artificial intelligence
05C50 Graphs and linear algebra (matrices, eigenvalues, etc.)
05C65 Hypergraphs
62H30 Classification and discrimination; cluster analysis (statistical aspects)
62R30 Statistics on manifolds

Software:

COIL-20
Full Text: DOI

References:

[1] Bai, L.; Cheng, X.; Liang, J.; Guo, Y., Fast graph clustering with a new description model for community detection, Information Sciences (2017)
[2] T. Bühler, M. Hein, Spectral clustering based on the graph p-laplacian, in: Proceedings of International Conference on Machine Learning, 2009, pp. 81-88.
[3] X. Cai, F. Nie, H. Huang, Multi-view k-means clustering on big data, in: Proceedings of International Joint Conference on Artificial Intelligence, pp. 2598-2604.
[4] Cai, X.; Nie, F.; Huang, H.; Kamangar, F., Heterogeneous image feature integration via multi-modal spectral clustering, (Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2011)), 1977-1984
[5] Chung, F. R., Spectral Graph Theory, vol. 92, American Mathematical Soc. (1997) · Zbl 0867.05046
[6] Dong, X.; Frossard, P.; Vandergheynst, P.; Nefedov, N., Clustering on multi-layer graphs via subspace analysis on grassmann manifolds, IEEE Transactions on Signal Processing, 62, 4, 905-918 (2014) · Zbl 1394.94973
[7] Elhamifar, E.; Vidal, R., Sparse subspace clustering, (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2009)), 2790-2797
[8] Fischler, M. A.; Bolles, R. C., Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Communications of the ACM, 24, 6, 381-395 (1981)
[9] Z. Fu, H.H. Ip, H. Lu, Z. Lu, Multi-modal constraint propagation for heterogeneous image clustering, in: Proceedings of ACM international Conference on Multimedia, 2011, pp. 143-152.
[10] Gross, R.; Shi, J., The cmu motion of body (mobo) database, Citeseer (2001)
[11] J. Hamm, D.D. Lee, Grassmann discriminant analysis: a unifying view on subspace-based learning, in: Proceedings of International Conference on Machine learning, 2008, pp. 376-383.
[12] Harandi, M.; Salzmann, M.; Porikli, F., Bregman divergences for infinite dimensional covariance matrices, (Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2014)), 1003-1010
[13] Harandi, M.; Sanderson, C.; Shen, C.; Lovell, B. C., Dictionary learning and sparse coding on grassmann manifolds: an extrinsic solution, (Proceedings of the IEEE International Conference on Computer Vision (2013)), 3120-3127
[14] Harandi, M. T.; Sanderson, C.; Shirazi, S.; Lovell, B. C., Graph embedding discriminant analysis on grassmannian manifolds for improved image set matching, (Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2011)), 2705-2712
[15] He, R.; Zhang, M.; Wang, L.; Ji, Y.; Yin, Q., Cross-modal subspace learning via pairwise constraints, IEEE Transactions on Image Processing, 24, 12, 5543-5556 (2015) · Zbl 1408.94238
[16] Hong, C.; Yu, J.; Wan, J.; Tao, D.; Wang, M., Multimodal deep autoencoder for human pose recovery, IEEE Transactions on Image Processing, 24, 12, 5659-5670 (2015) · Zbl 1408.94248
[17] Hou, C.; Nie, F.; Li, X.; Yi, D.; Wu, Y., Joint embedding learning and sparse regression: a framework for unsupervised feature selection, IEEE Transactions on Cybernetics, 44, 6, 793-804 (2014)
[18] Y. Hu, A.S. Mian, R. Owens, Sparse approximated nearest points for image set classification, in: Proceddings of IEEE Conference on Computer Vision and Pattern Recognition, 2011, pp. 121-128.
[19] Huang, L.; Lu, J.; Tan, Y.-P., Co-learned multi-view spectral clustering for face recognition based on image sets, IEEE Signal Processing Letters, 21, 7, 875-879 (2014)
[20] Z. Huang, R. Wang, S. Shan, X. Chen, Projection metric learning on grassmann manifold with application to video based face recognition, in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 140-149.
[21] M.J. Huiskes, B. Thomee, M.S. Lew, New trends and ideas in visual concept detection: the mir flickr retrieval evaluation initiative, in: Proceedings of International Conference on Multimedia Information Retrieval, 2010, pp. 527-536.
[22] Kang, Z.; Peng, C.; Cheng, Q., Robust subspace clustering via smoothed rank approximation, IEEE Signal Processing Letters, 22, 11, 2088-2092 (2015)
[23] A. Kumar, H. Daumé, A co-training approach for multi-view spectral clustering, in: Proceedings of International Conference on Machine Learning, 2011, pp. 393-400.
[24] A. Kumar, P. Rai, H. Daume, Co-regularized multi-view spectral clustering, in: Proceedings of Advances in Neural Information Processing Systems, 2011, pp. 1413-1421.
[25] K.-C. Lee, J. Ho, M.-H. Yang, D. Kriegman, Video-based face recognition using probabilistic appearance manifolds, in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, 2003, pp. I-I.
[26] B. Leibe, B. Schiele, Analyzing appearance and contour based methods for object categorization, in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, 2003, pp. II-409.
[27] Li, Z.; Nie, F.; Chang, X.; Nie, L.; Zhang, H.; Yang, Y., Rank-constrained spectral clustering with flexible embedding, IEEE Transactions on Neural Networks and Learning Systems, 29, 12, 6073-6082 (2018)
[28] Liu, G.; Lin, Z.; Yan, S.; Sun, J.; Yu, Y.; Ma, Y., Robust recovery of subspace structures by low-rank representation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 1, 171-184 (2013)
[29] Mazzeo, G. M.; Masciari, E.; Zaniolo, C., A fast and accurate algorithm for unsupervised clustering around centroids, Information Sciences, 400, 63-90 (2017)
[30] S.A. Nene, S.K. Nayar, H. Murase, et al., Columbia object image library (coil-20), 1996.
[31] Nie, F.; Huang, H.; Cai, X.; Ding, C. H., Efficient and robust feature selection via joint l2,1-norms minimization, (Proceedings of Advances in Neural Information Processing Systems (2010)), 1813-1821
[32] Nie, F.; Xu, D.; Tsang, I. W.; Zhang, C., Spectral embedded clustering, (Proceedings of International Joint Conference on Artificial Intelligence (2009)), 1181-1186
[33] Nie, F.; Zeng, Z.; Tsang, I. W.; Xu, D.; Zhang, C., Spectral embedded clustering: a framework for in-sample and out-of-sample spectral clustering, IEEE Transactions on Neural Networks, 22, 11, 1796-1808 (2011)
[34] Schwarz, S.; Rupp, M., Predictive quantization on the stiefel manifold, IEEE Signal Processing Letters, 22, 2, 234-238 (2015)
[35] Shi, X.; Guo, Z.; Lai, Z.; Yang, Y.; Bao, Z.; Zhang, D., A framework of joint graph embedding and sparse regression for dimensionality reduction, IEEE Transactions on Image Processing, 24, 4, 1341-1355 (2015) · Zbl 1408.94598
[36] Tang, C.; Bian, M.; Liu, X.; Li, M.; Zhou, H.; Wang, P.; Yin, H., Unsupervised feature selection via latent representation learning and manifold regularization, Neural Networks, 117, 163-178 (2019)
[37] Tang, C.; Zhu, X.; Liu, X.; Wang, L., Cross-view local structure preserved diversity and consensus learning for multi-view unsupervised feature selection, Proceedings of the AAAI Conference on Artificial Intelligence, 33, 5101-5108 (2019)
[38] Trigeorgis, G.; Bousmalis, K.; Zafeiriou, S.; Schuller, B. W., A deep matrix factorization method for learning attribute representations, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 3, 417-429 (2016)
[39] Turaga, P.; Veeraraghavan, A.; Srivastava, A.; Chellappa, R., Statistical computations on grassmann and stiefel manifolds for image and video-based recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 11, 2273-2286 (2011)
[40] Von Luxburg, U., A tutorial on spectral clustering, Statistics and Computing, 17, 4, 395-416 (2007)
[41] Wang, B.; Hu, Y.; Gao, J.; Sun, Y.; Yin, B., Low rank representation on grassmann manifolds, (Asian Conference on Computer Vision (2014)), 81-96
[42] Wang, H.; Weng, C.; Yuan, J., Multi-feature spectral clustering with minimax optimization, (Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2014)), 4106-4113
[43] Wang, H.; Yuan, J., Collaborative multifeature fusion for transductive spectral learning, IEEE Transactions on Cybernetics, 45, 3, 451-461 (2015)
[44] Wang, X.; Bian, W.; Tao, D., Grassmannian regularized structured multi-view embedding for image classification, IEEE Transactions on Image Processing, 22, 7, 2646-2660 (2013) · Zbl 1373.94435
[45] Yang, Y.; Xu, D.; Nie, F.; Yan, S.; Zhuang, Y., Image clustering using local discriminant models and global integration, IEEE Transactions on Image Processing, 19, 10, 2761-2773 (2010) · Zbl 1371.94434
[46] Yu, J.; Rui, Y.; Chen, B., Exploiting click constraints and multi-view features for image re-ranking, IEEE Transactions on Multimedia, 16, 1, 159-168 (2014)
[47] Zhang, C.; Liu, Y.; Fu, H., Ae2-nets: Autoencoder in autoencoder networks, (Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2019)), 2577-2585
[48] Zheng, M.; Bu, J.; Chen, C.; Wang, C.; Zhang, L.; Qiu, G.; Cai, D., Graph regularized sparse coding for image representation, IEEE Transactions on Image Processing, 20, 5, 1327-1336 (2011) · Zbl 1372.94314
[49] D. Zhou, J. Huang, B. Schölkopf, Learning with hypergraphs: clustering, classification, and embedding, in: Proceedings of Advances in Neural Information Processing Systems, 2006, pp. 1601-1608.
[50] Zhu, X.; Change Loy, C.; Gong, S., Constructing robust affinity graphs for spectral clustering, (Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2014)), 1450-1457
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.