Skip to main content
Erschienen in: Neural Computing and Applications 10/2020

17.05.2019 | Advances in Parallel and Distributed Computing for Neural Computing

Parallel multi-view concept clustering in distributed computing

verfasst von: Hao Wang, Yan Yang, Xiaobo Zhang, Bo Peng

Erschienen in: Neural Computing and Applications | Ausgabe 10/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Multi-view clustering (MvC) is an emerging task in data mining. It aims at partitioning the data sampled from multiple views. Although a great deal of research has been done, this task remains to be very challenging. We found an important problem in performing the MvC task. MvC needs large amounts of computation. To address this problem, we propose a parallel MvC method in a distributed computing environment. The proposed method builds upon concept factorization with local manifold learning, denoted by parallel multi-view concept clustering (PMCC). Concept factorization learns a compressed representation for the data. Local manifold learning preserves the locally intrinsic geometrical structure in the data. The weight of each view is learned automatically and a cooperative normalized approach is proposed to better guide the learning of a consensus representation for all views. For the proposed PMCC architecture, the calculation of each part is independent. It is clear that our PMCC can be performed in a distributed computing environment. Experimental results using real-world datasets demonstrate the effectiveness of the proposed method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Appice A, Malerba D (2016) A co-training strategy for multiple view clustering in process mining. IEEE Trans Serv Comput 9(6):832–845CrossRef Appice A, Malerba D (2016) A co-training strategy for multiple view clustering in process mining. IEEE Trans Serv Comput 9(6):832–845CrossRef
2.
Zurück zum Zitat Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434MathSciNetMATH Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434MathSciNetMATH
3.
Zurück zum Zitat Cai D, He X, Han J (2011a) Locally consistent concept factorization for document clustering. IEEE Trans Knowl Data Eng 23(6):902–913CrossRef Cai D, He X, Han J (2011a) Locally consistent concept factorization for document clustering. IEEE Trans Knowl Data Eng 23(6):902–913CrossRef
4.
Zurück zum Zitat Cai D, He X, Han J, Huang TS (2011b) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560CrossRef Cai D, He X, Han J, Huang TS (2011b) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560CrossRef
5.
Zurück zum Zitat Cai X, Nie F, Huang H (2013) Multi-view K-means clustering on big data. In: Proceedings of the international joint conferences on artificial intelligence, pp 2598–2604 Cai X, Nie F, Huang H (2013) Multi-view K-means clustering on big data. In: Proceedings of the international joint conferences on artificial intelligence, pp 2598–2604
7.
Zurück zum Zitat Chen J, Li K, Tang Z, Bilal K, Yu S, Weng C, Li K (2017) A parallel random forest algorithm for big data in a spark cloud computing environment. IEEE Trans Parallel Distrib Syst 28(4):919–933CrossRef Chen J, Li K, Tang Z, Bilal K, Yu S, Weng C, Li K (2017) A parallel random forest algorithm for big data in a spark cloud computing environment. IEEE Trans Parallel Distrib Syst 28(4):919–933CrossRef
10.
Zurück zum Zitat Ding C, Li T, Jordan MI (2010) Convex and semi-nonnegative matrix factorizations. IEEE Trans Pattern Anal Mach Intell 32(1):45–55CrossRef Ding C, Li T, Jordan MI (2010) Convex and semi-nonnegative matrix factorizations. IEEE Trans Pattern Anal Mach Intell 32(1):45–55CrossRef
11.
Zurück zum Zitat Gao H, Nie F, Li X, Huang H (2015) Multi-view subspace clustering. In: Proceedings of the IEEE international conference on computer vision, pp 4238–4246 Gao H, Nie F, Li X, Huang H (2015) Multi-view subspace clustering. In: Proceedings of the IEEE international conference on computer vision, pp 4238–4246
12.
Zurück zum Zitat Hou C, Nie F, Tao H, Yi D (2017) Multi-view unsupervised feature selection with adaptive similarity and view weight. IEEE Trans Knowl Data Eng 29(9):1998–2011CrossRef Hou C, Nie F, Tao H, Yi D (2017) Multi-view unsupervised feature selection with adaptive similarity and view weight. IEEE Trans Knowl Data Eng 29(9):1998–2011CrossRef
13.
Zurück zum Zitat Huang S, Kang Z, Xu Z (2018a) Self-weighted multi-view clustering with soft capped norm. Knowl Based Syst 158:1–8CrossRef Huang S, Kang Z, Xu Z (2018a) Self-weighted multi-view clustering with soft capped norm. Knowl Based Syst 158:1–8CrossRef
14.
Zurück zum Zitat Huang S, Ren Y, Xu Z (2018b) Robust multi-view data clustering with multi-view capped-norm k-means. Neurocomputing 311:197–208CrossRef Huang S, Ren Y, Xu Z (2018b) Robust multi-view data clustering with multi-view capped-norm k-means. Neurocomputing 311:197–208CrossRef
15.
Zurück zum Zitat Huang S, Kang Z, Tsang IW, Xu Z (2019) Auto-weighted multi-view clustering via kernelized graph learning. Pattern Recognit 88:174–184CrossRef Huang S, Kang Z, Tsang IW, Xu Z (2019) Auto-weighted multi-view clustering via kernelized graph learning. Pattern Recognit 88:174–184CrossRef
16.
Zurück zum Zitat Hussain SF, Bashir S (2016) Co-clustering of multi-view datasets. Knowl Inf Syst 47(3):1–26CrossRef Hussain SF, Bashir S (2016) Co-clustering of multi-view datasets. Knowl Inf Syst 47(3):1–26CrossRef
17.
Zurück zum Zitat Kumar A, Rai P, III HD (2011) Co-regularized multi-view spectral clustering. In: Proceedings of the advances in neural information processing systems, pp 1413–1421 Kumar A, Rai P, III HD (2011) Co-regularized multi-view spectral clustering. In: Proceedings of the advances in neural information processing systems, pp 1413–1421
18.
Zurück zum Zitat Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788MATHCrossRef Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788MATHCrossRef
19.
Zurück zum Zitat Lee DD, Seung HS (2001) Algorithms for non-negative matrix factorization. In: Proceedings of the advances in neural information processing systemsSyst, pp 556–562 Lee DD, Seung HS (2001) Algorithms for non-negative matrix factorization. In: Proceedings of the advances in neural information processing systemsSyst, pp 556–562
20.
Zurück zum Zitat Li K, Tang X, Veeravalli B, Li K (2015a) Scheduling precedence constrained stochastic tasks on heterogeneous cluster systems. IEEE Trans Comput 64(1):191–204MathSciNetMATHCrossRef Li K, Tang X, Veeravalli B, Li K (2015a) Scheduling precedence constrained stochastic tasks on heterogeneous cluster systems. IEEE Trans Comput 64(1):191–204MathSciNetMATHCrossRef
21.
Zurück zum Zitat Li K, Yang W, Li K (2015b) Performance analysis and optimization for spmv on gpu using probabilistic modeling. IEEE Trans Parallel Distrib Syst 26(1):196–205MathSciNet Li K, Yang W, Li K (2015b) Performance analysis and optimization for spmv on gpu using probabilistic modeling. IEEE Trans Parallel Distrib Syst 26(1):196–205MathSciNet
22.
Zurück zum Zitat Liu J, Wang C, Gao J, Han J (2013a) Multi-view clustering via joint nonnegative matrix factorization. In: Proceedings of the SIAM international conferences on data mining, pp 252–260 Liu J, Wang C, Gao J, Han J (2013a) Multi-view clustering via joint nonnegative matrix factorization. In: Proceedings of the SIAM international conferences on data mining, pp 252–260
23.
Zurück zum Zitat Liu X, Wang L, Yin J, Zhu E, Zhang J (2013b) An efficient approach to integrating radius information into multiple kernel learning. IEEE Trans Cybern 43(2):557–569CrossRef Liu X, Wang L, Yin J, Zhu E, Zhang J (2013b) An efficient approach to integrating radius information into multiple kernel learning. IEEE Trans Cybern 43(2):557–569CrossRef
25.
Zurück zum Zitat Lu C, Yan S, Lin Z (2016) Convex sparse spectral clustering: single-view to multi-view. IEEE Trans Image Process 25(6):2833–2843MathSciNetMATHCrossRef Lu C, Yan S, Lin Z (2016) Convex sparse spectral clustering: single-view to multi-view. IEEE Trans Image Process 25(6):2833–2843MathSciNetMATHCrossRef
26.
Zurück zum Zitat Nie F, Li J, Li X (2017) Self-weighted multiview clustering with multiple graphs. In: Proceedings of the international joint conferences on artificial intelligence, pp 2564–2570 Nie F, Li J, Li X (2017) Self-weighted multiview clustering with multiple graphs. In: Proceedings of the international joint conferences on artificial intelligence, pp 2564–2570
27.
Zurück zum Zitat Nie F, Cai G, Li J, Li X (2018) Auto-weighted multi-view learning for image clustering and semi-supervised classification. IEEE Trans Image Process 27(3):1501–1511MathSciNetMATHCrossRef Nie F, Cai G, Li J, Li X (2018) Auto-weighted multi-view learning for image clustering and semi-supervised classification. IEEE Trans Image Process 27(3):1501–1511MathSciNetMATHCrossRef
28.
Zurück zum Zitat Sun J, Lu J, Xu T, Bi J (2015) Multi-view sparse co-clustering via proximal alternating linearized minimization. In: Proceedings of the international conference on machine learning, pp 757–766 Sun J, Lu J, Xu T, Bi J (2015) Multi-view sparse co-clustering via proximal alternating linearized minimization. In: Proceedings of the international conference on machine learning, pp 757–766
29.
Zurück zum Zitat Tao H, Hou C, Liu X, Liu T, Yi D, Zhu J (2018) Reliable multi-view clustering. In: Proceedings of the AAAI conference on artificial intelligence, pp 4123–4130 Tao H, Hou C, Liu X, Liu T, Yi D, Zhu J (2018) Reliable multi-view clustering. In: Proceedings of the AAAI conference on artificial intelligence, pp 4123–4130
31.
Zurück zum Zitat Tzortzis G, Likas A (2012) Kernel-based weighted multi-view clustering. In: Proceedings of the international conferences on data mining, pp 675–684 Tzortzis G, Likas A (2012) Kernel-based weighted multi-view clustering. In: Proceedings of the international conferences on data mining, pp 675–684
32.
Zurück zum Zitat Wang H, Yang Y, Li T (2016) Multi-view clustering via concept factorization with local manifold regularization. In: Proceedings of the international conferences on data mining, pp 1245–1250 Wang H, Yang Y, Li T (2016) Multi-view clustering via concept factorization with local manifold regularization. In: Proceedings of the international conferences on data mining, pp 1245–1250
34.
Zurück zum Zitat Wang H, Yang Y, Liu B, Fujita H (2019b) A study of graph-based system for multi-view clustering. Knowl Based Syst 163:1009–1019CrossRef Wang H, Yang Y, Liu B, Fujita H (2019b) A study of graph-based system for multi-view clustering. Knowl Based Syst 163:1009–1019CrossRef
35.
Zurück zum Zitat Wang Y, Lin X, Wu L, Zhang W, Zhang Q, Huang X (2015a) Robust subspace clustering for multi-view data by exploiting correlation consensus. IEEE Trans Image Process 24(11):3939–3949MathSciNetMATHCrossRef Wang Y, Lin X, Wu L, Zhang W, Zhang Q, Huang X (2015a) Robust subspace clustering for multi-view data by exploiting correlation consensus. IEEE Trans Image Process 24(11):3939–3949MathSciNetMATHCrossRef
36.
Zurück zum Zitat Wang Y, Liu X, Dou Y, Li R (2017) Multiple kernel clustering framework with improved kernels. In: Proceedings of the international joint conferences on artificial intelligence, pp 2999–3005 Wang Y, Liu X, Dou Y, Li R (2017) Multiple kernel clustering framework with improved kernels. In: Proceedings of the international joint conferences on artificial intelligence, pp 2999–3005
37.
Zurück zum Zitat Wang Z, Kong X, Fu H, Li M, Zhang Y (2015b) Feature extraction via multi-view non-negative matrix factorization with local graph regularization. In: IEEE international conference image processing, pp 3500–3504 Wang Z, Kong X, Fu H, Li M, Zhang Y (2015b) Feature extraction via multi-view non-negative matrix factorization with local graph regularization. In: IEEE international conference image processing, pp 3500–3504
38.
Zurück zum Zitat Xia R, Pan Y, Du L, Yin J (2014) Robust multi-view spectral clustering via low-rank and sparse decomposition. In: Proceedings of the AAAI conference on artificial intelligence, pp 2149–2155 Xia R, Pan Y, Du L, Yin J (2014) Robust multi-view spectral clustering via low-rank and sparse decomposition. In: Proceedings of the AAAI conference on artificial intelligence, pp 2149–2155
40.
Zurück zum Zitat Xu W, Gong Y (2004) Document clustering by concept factorization. In: ACM SIGIR conference on research and development in information retrieval, pp 202–209 Xu W, Gong Y (2004) Document clustering by concept factorization. In: ACM SIGIR conference on research and development in information retrieval, pp 202–209
41.
Zurück zum Zitat Yang S, Hou C, Zhang C, Wu Y (2013) Robust non-negative matrix factorization via joint sparse and graph regularization for transfer learning. Neural Comput Appl 23(2):541–559CrossRef Yang S, Hou C, Zhang C, Wu Y (2013) Robust non-negative matrix factorization via joint sparse and graph regularization for transfer learning. Neural Comput Appl 23(2):541–559CrossRef
42.
Zurück zum Zitat Yang Y, Wang H (2018) Multi-view clustering: a survey. Big Data Min Anal 1(2):83–107CrossRef Yang Y, Wang H (2018) Multi-view clustering: a survey. Big Data Min Anal 1(2):83–107CrossRef
43.
Zurück zum Zitat Yang Y, Teng F, Li T, Wang H, Wang H, Zhang Q (2018) Parallel semi-supervised multi-ant colonies clustering ensemble based on mapreduce methodology. IEEE Trans Cloud Comput 6(3):857–867CrossRef Yang Y, Teng F, Li T, Wang H, Wang H, Zhang Q (2018) Parallel semi-supervised multi-ant colonies clustering ensemble based on mapreduce methodology. IEEE Trans Cloud Comput 6(3):857–867CrossRef
44.
Zurück zum Zitat Zhan K, Chang X, Guan J, Chen L, Ma Z, Yang Y (2018) Adaptive structure discovery for multimedia analysis using multiple features. IEEE Trans Cybern 49(5):1826–1834CrossRef Zhan K, Chang X, Guan J, Chen L, Ma Z, Yang Y (2018) Adaptive structure discovery for multimedia analysis using multiple features. IEEE Trans Cybern 49(5):1826–1834CrossRef
45.
Zurück zum Zitat Zong L, Zhang X, Zhao L, Yu H, Zhao Q (2017) Multi-view clustering via multi-manifold regularized non-negative matrix factorization. Neural Netw 88:74–89CrossRef Zong L, Zhang X, Zhao L, Yu H, Zhao Q (2017) Multi-view clustering via multi-manifold regularized non-negative matrix factorization. Neural Netw 88:74–89CrossRef
Metadaten
Titel
Parallel multi-view concept clustering in distributed computing
verfasst von
Hao Wang
Yan Yang
Xiaobo Zhang
Bo Peng
Publikationsdatum
17.05.2019
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 10/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-019-04243-4

Weitere Artikel der Ausgabe 10/2020

Neural Computing and Applications 10/2020 Zur Ausgabe

Advances in Parallel and Distributed Computing for Neural Computing

Multi-task cascade deep convolutional neural networks for large-scale commodity recognition