Skip to main content

2017 | OriginalPaper | Buchkapitel

Multi-View Matrix Completion for Clustering with Side Information

verfasst von : Peng Zhao, Yuan Jiang, Zhi-Hua Zhou

Erschienen in: Advances in Knowledge Discovery and Data Mining

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In many clustering applications, real world data are often collected from multiple sources or with features from multiple channels. Thus, multi-view clustering has attracted much attention during the past few years. It is noteworthy that in many situations, in addition to the data samples, there are some side information describing the relation between instances, such as must-links and cannot-links. Though side information has been well exploited in single-view clustering, they have rarely been studied in multi-view scenario. Considering that matrix completion has sound theoretical properties and demonstrates an excellent performance in single-view clustering, in this paper, we propose the first matrix completion based approach for multi-view clustering with side information. Instead of concatenating multiple views into a single one, we enforce the consistency of clustering results on different views as constraints for alternative optimization, and the global optimal solution is obtained since the objective function is jointly convex. The proposed Multi-View Matrix Completion (MVMC) approach exhibits impressive performance in experiments.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bach, F.R., Lanckriet, G.R., Jordan, M.I.: Multiple kernel learning, conic duality, and the SMO algorithm. In: ICML (2004) Bach, F.R., Lanckriet, G.R., Jordan, M.I.: Multiple kernel learning, conic duality, and the SMO algorithm. In: ICML (2004)
2.
Zurück zum Zitat Bisson, G., Grimal, C.: Co-clustering of multi-view datasets: a parallelizable approach. In: ICDM, pp. 828–833 (2012) Bisson, G., Grimal, C.: Co-clustering of multi-view datasets: a parallelizable approach. In: ICDM, pp. 828–833 (2012)
3.
Zurück zum Zitat Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: COLT, pp. 92–100 (1998) Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: COLT, pp. 92–100 (1998)
4.
Zurück zum Zitat Bruno, E., Marchand-Maillet, S.: Multiview clustering: a late fusion approach using latent models. In: SIGIR, pp. 736–737 (2009) Bruno, E., Marchand-Maillet, S.: Multiview clustering: a late fusion approach using latent models. In: SIGIR, pp. 736–737 (2009)
5.
Zurück zum Zitat Candès, E.J., Recht, B.: Exact matrix completion via convex optimization. Commun. ACM 55(6), 111–119 (2012)CrossRefMATH Candès, E.J., Recht, B.: Exact matrix completion via convex optimization. Commun. ACM 55(6), 111–119 (2012)CrossRefMATH
6.
Zurück zum Zitat Chaudhuri, K., Kakade, S.M., Livescu, K., Sridharan, K.: Multi-view clustering via canonical correlation analysis. In: ICML, pp. 129–136 (2009) Chaudhuri, K., Kakade, S.M., Livescu, K., Sridharan, K.: Multi-view clustering via canonical correlation analysis. In: ICML, pp. 129–136 (2009)
7.
Zurück zum Zitat Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: ICML, pp. 209–216 (2007) Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: ICML, pp. 209–216 (2007)
8.
Zurück zum Zitat Dougherty, E.R., Barrera, J., Brun, M., Kim, S., Cesar, R.M., Chen, Y., Bittner, M., Trent, J.M.: Inference from clustering with application to gene-expression microarrays. Comput. Biol. 9(1), 105–126 (2002)CrossRef Dougherty, E.R., Barrera, J., Brun, M., Kim, S., Cesar, R.M., Chen, Y., Bittner, M., Trent, J.M.: Inference from clustering with application to gene-expression microarrays. Comput. Biol. 9(1), 105–126 (2002)CrossRef
9.
Zurück zum Zitat Ganti, V., Gehrke, J., Ramakrishnan, R.: Cactus—clustering categorical data using summaries. In: KDD, pp. 73–83 (1999) Ganti, V., Gehrke, J., Ramakrishnan, R.: Cactus—clustering categorical data using summaries. In: KDD, pp. 73–83 (1999)
10.
Zurück zum Zitat Goldberg, D., Nichols, D., Oki, B.M., Terry, D.: Using collaborative filtering to weave an information tapestry. Commun. ACM 35(12), 61–70 (1992)CrossRef Goldberg, D., Nichols, D., Oki, B.M., Terry, D.: Using collaborative filtering to weave an information tapestry. Commun. ACM 35(12), 61–70 (1992)CrossRef
11.
Zurück zum Zitat Jalali, A., Chen, Y., Sanghavi, S., Xu, H.: Clustering partially observed graphs via convex optimization. In: ICML, pp. 1001–1008 (2011) Jalali, A., Chen, Y., Sanghavi, S., Xu, H.: Clustering partially observed graphs via convex optimization. In: ICML, pp. 1001–1008 (2011)
12.
Zurück zum Zitat Kim, Y.M., Amini, M.R., Goutte, C., Gallinari, P.: Multi-view clustering of multilingual documents. In: SIGIR, pp. 821–822 (2010) Kim, Y.M., Amini, M.R., Goutte, C., Gallinari, P.: Multi-view clustering of multilingual documents. In: SIGIR, pp. 821–822 (2010)
13.
Zurück zum Zitat Kulis, B., Basu, S., Dhillon, I.S., Mooney, R.J.: Semi-supervised graph clustering: a kernel approach. In: ICML, pp. 457–464 (2005) Kulis, B., Basu, S., Dhillon, I.S., Mooney, R.J.: Semi-supervised graph clustering: a kernel approach. In: ICML, pp. 457–464 (2005)
14.
Zurück zum Zitat Kumar, A., Rai, P., Daume, H.: Co-regularized multi-view spectral clustering. In: NIPS, vol. 24, pp. 1413–1421 (2011) Kumar, A., Rai, P., Daume, H.: Co-regularized multi-view spectral clustering. In: NIPS, vol. 24, pp. 1413–1421 (2011)
15.
Zurück zum Zitat Lanckriet, G.R., Cristianini, N., Bartlett, P., Ghaoui, L.E., Jordan, M.I.: Learning the kernel matrix with semidefinite programming. JMLR 5, 27–72 (2004)MathSciNetMATH Lanckriet, G.R., Cristianini, N., Bartlett, P., Ghaoui, L.E., Jordan, M.I.: Learning the kernel matrix with semidefinite programming. JMLR 5, 27–72 (2004)MathSciNetMATH
16.
Zurück zum Zitat Li, S.Y., Jiang, Y., Zhou, Z.H.: Partial multi-view clustering. In: AAAI (2014) Li, S.Y., Jiang, Y., Zhou, Z.H.: Partial multi-view clustering. In: AAAI (2014)
17.
Zurück zum Zitat Li, S., Shao, M., Fu, Y.: Multi-view low-rank analysis for outlier detection. In: SDM, pp. 748–756 (2015) Li, S., Shao, M., Fu, Y.: Multi-view low-rank analysis for outlier detection. In: SDM, pp. 748–756 (2015)
18.
Zurück zum Zitat Li, Y., Nie, F., Huang, H., Huang, J.: Large-scale multi-view spectral clustering via bipartite graph. In: AAAI, pp. 2750–2756 (2015) Li, Y., Nie, F., Huang, H., Huang, J.: Large-scale multi-view spectral clustering via bipartite graph. In: AAAI, pp. 2750–2756 (2015)
19.
Zurück zum Zitat Liu, M., Luo, Y., Tao, D., Xu, C., Wen, Y.: Low-rank multi-view learning in matrix completion for multi-label image classification. In: AAAI, pp. 2778–2784 (2015) Liu, M., Luo, Y., Tao, D., Xu, C., Wen, Y.: Low-rank multi-view learning in matrix completion for multi-label image classification. In: AAAI, pp. 2778–2784 (2015)
20.
Zurück zum Zitat Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)CrossRefMATH Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)CrossRefMATH
21.
Zurück zum Zitat Nesterov, Y.: Introductory Lectures on Convex Optimization: A Basic Course. Springer, Heidelberg (2013)MATH Nesterov, Y.: Introductory Lectures on Convex Optimization: A Basic Course. Springer, Heidelberg (2013)MATH
22.
Zurück zum Zitat Ng, A.Y., Jordan, M.I., Weiss, Y., et al.: On spectral clustering: analysis and an algorithm. In: NIPS, vol. 15, pp. 849–856 (2002) Ng, A.Y., Jordan, M.I., Weiss, Y., et al.: On spectral clustering: analysis and an algorithm. In: NIPS, vol. 15, pp. 849–856 (2002)
23.
Zurück zum Zitat de Sa, V.R.: Spectral clustering with two views. In: ICML Workshop on Learning with Multiple Views, pp. 20–27 (2005) de Sa, V.R.: Spectral clustering with two views. In: ICML Workshop on Learning with Multiple Views, pp. 20–27 (2005)
24.
Zurück zum Zitat Steinbach, M., Karypis, G., Kumar, V., et al.: A comparison of document clustering techniques. In: KDD Workshop on Text Mining, vol. 400, pp. 525–526 (2000) Steinbach, M., Karypis, G., Kumar, V., et al.: A comparison of document clustering techniques. In: KDD Workshop on Text Mining, vol. 400, pp. 525–526 (2000)
25.
Zurück zum Zitat Strehl, A., Ghosh, J.: Cluster ensembles - a knowledge reuse framework for combining multiple partitions. JMLR 3, 583–617 (2002)MathSciNetMATH Strehl, A., Ghosh, J.: Cluster ensembles - a knowledge reuse framework for combining multiple partitions. JMLR 3, 583–617 (2002)MathSciNetMATH
26.
Zurück zum Zitat Tzortzis, G., Likas, A.: Kernel-based weighted multi-view clustering. In: ICDM, pp. 675–684 (2012) Tzortzis, G., Likas, A.: Kernel-based weighted multi-view clustering. In: ICDM, pp. 675–684 (2012)
28.
Zurück zum Zitat Wang, W., Zhou, Z.-H.: Analyzing co-training style algorithms. In: Kok, J.N., Koronacki, J., Mantaras, R.L., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 454–465. Springer, Heidelberg (2007). doi:10.1007/978-3-540-74958-5_42 CrossRef Wang, W., Zhou, Z.-H.: Analyzing co-training style algorithms. In: Kok, J.N., Koronacki, J., Mantaras, R.L., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 454–465. Springer, Heidelberg (2007). doi:10.​1007/​978-3-540-74958-5_​42 CrossRef
29.
Zurück zum Zitat Wang, W., Zhou, Z.H.: Multi-view active learning in the non-realizable case. In: NIPS 23, pp. 2388–2396 (2010) Wang, W., Zhou, Z.H.: Multi-view active learning in the non-realizable case. In: NIPS 23, pp. 2388–2396 (2010)
30.
Zurück zum Zitat Wang, Y., Zhang, W., Wu, L., Lin, X., Fang, M., Pan, S.: Iterative views agreement: an iterative low-rank based structured optimization method to multi-view spectral clustering. In: IJCAI, pp. 2153–2159 (2016) Wang, Y., Zhang, W., Wu, L., Lin, X., Fang, M., Pan, S.: Iterative views agreement: an iterative low-rank based structured optimization method to multi-view spectral clustering. In: IJCAI, pp. 2153–2159 (2016)
31.
Zurück zum Zitat Xia, R., Pan, Y., Du, L., Yin, J.: Robust multi-view spectral clustering via low-rank and sparse decomposition. In: AAAI, pp. 2149–2155 (2014) Xia, R., Pan, Y., Du, L., Yin, J.: Robust multi-view spectral clustering via low-rank and sparse decomposition. In: AAAI, pp. 2149–2155 (2014)
32.
Zurück zum Zitat Xing, E.P., Ng, A.Y., Jordan, M.I., Russell, S.: Distance metric learning with application to clustering with side information. In: NIPS, vol. 15, pp. 505–512 (2003) Xing, E.P., Ng, A.Y., Jordan, M.I., Russell, S.: Distance metric learning with application to clustering with side information. In: NIPS, vol. 15, pp. 505–512 (2003)
34.
Zurück zum Zitat Xu, M., Jin, R., Zhou, Z.H.: Speedup matrix completion with side information: application to multi-label learning. In: NIPS, vol. 27, pp. 2301–2309 (2013) Xu, M., Jin, R., Zhou, Z.H.: Speedup matrix completion with side information: application to multi-label learning. In: NIPS, vol. 27, pp. 2301–2309 (2013)
35.
Zurück zum Zitat Ye, H., Zhan, D., Miao, Y., Jiang, Y., Zhou, Z.H.: Rank consistency based multi-view learning: a privacy-preserving approach. In: CIKM, pp. 991–1000 (2015) Ye, H., Zhan, D., Miao, Y., Jiang, Y., Zhou, Z.H.: Rank consistency based multi-view learning: a privacy-preserving approach. In: CIKM, pp. 991–1000 (2015)
36.
Zurück zum Zitat Yi, J., Jin, R., Jain, A.K., Jain, S., Yang, T.: Semi-crowdsourced clustering: generalizing crowd labeling by robust distance metric learning. In: NIPS, vol. 25, pp. 1772–1780 (2012) Yi, J., Jin, R., Jain, A.K., Jain, S., Yang, T.: Semi-crowdsourced clustering: generalizing crowd labeling by robust distance metric learning. In: NIPS, vol. 25, pp. 1772–1780 (2012)
37.
Zurück zum Zitat Yi, J., Zhang, L., Jin, R., Qian, Q., Jain, A.K.: Semi-supervised clustering by input pattern assisted pairwise similarity matrix completion. In: ICML, pp. 1400–1408 (2013) Yi, J., Zhang, L., Jin, R., Qian, Q., Jain, A.K.: Semi-supervised clustering by input pattern assisted pairwise similarity matrix completion. In: ICML, pp. 1400–1408 (2013)
38.
Zurück zum Zitat Zeng, H., Cheung, Y.: Semi-supervised maximum margin clustering with pairwise constraints. TKDE 24(5), 926–939 (2012) Zeng, H., Cheung, Y.: Semi-supervised maximum margin clustering with pairwise constraints. TKDE 24(5), 926–939 (2012)
39.
Zurück zum Zitat Zhang, X., Zong, L., Liu, X., Yu, H.: Constrained NMF-based multi-view clustering on unmapped data. In: AAAI, pp. 3174–3180 (2015) Zhang, X., Zong, L., Liu, X., Yu, H.: Constrained NMF-based multi-view clustering on unmapped data. In: AAAI, pp. 3174–3180 (2015)
Metadaten
Titel
Multi-View Matrix Completion for Clustering with Side Information
verfasst von
Peng Zhao
Yuan Jiang
Zhi-Hua Zhou
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-57529-2_32