Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 1/2013

01.02.2013 | Original Article

Spectral co-clustering documents and words using fuzzy K-harmonic means

verfasst von: Na Liu, Fei Chen, Mingyu Lu

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 1/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper analyzes the main steps of spectral co-clustering documents and words, finds out its cause of sensitivity to input order, and presents a modified method of spectral co-clustering documents and words based on fuzzy K-harmonic means. This method consists of two steps. The first step constructs Laplacian matrix which is insensitive to input order. The second step exploits fuzzy K-harmonic means algorithm instead of K-means algorithm to obtain clustering results. Fuzzy K-harmonic means algorithm uses fuzzy weight distance while calculating the distance between each data points and cluster centers. The experiments show that the proposed method not only is insensitive to input order, but also can improve the accuracy and robustness of clustering results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
2.
Zurück zum Zitat Ng AY, Jordan MI, Weiss Y(2002) On spectral clustering: analysis and an algorithm. In: Proceedings of the conference on advances in neural information processing systems. Mas-sachusetts, pp 849–856 Ng AY, Jordan MI, Weiss Y(2002) On spectral clustering: analysis and an algorithm. In: Proceedings of the conference on advances in neural information processing systems. Mas-sachusetts, pp 849–856
3.
5.
Zurück zum Zitat Fiedler M (1973) Algebraic connectivity of graphs. Czechoslovak Math J 23(2):298–305MathSciNet Fiedler M (1973) Algebraic connectivity of graphs. Czechoslovak Math J 23(2):298–305MathSciNet
6.
Zurück zum Zitat Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905CrossRef Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905CrossRef
7.
Zurück zum Zitat Ng AY, Jordan ML, Weiss Y (2001) On spectral clustering: analysis and an algorithm. In: Advances in neural information processing systems, vol 14, pp 849–856 Ng AY, Jordan ML, Weiss Y (2001) On spectral clustering: analysis and an algorithm. In: Advances in neural information processing systems, vol 14, pp 849–856
8.
Zurück zum Zitat Prieto R, Jiang J, Choi CH (2003) A new spectral clustering algorithm for large training sets. In: International conference on machine learning and cybernetics, China, pp 147–152 Prieto R, Jiang J, Choi CH (2003) A new spectral clustering algorithm for large training sets. In: International conference on machine learning and cybernetics, China, pp 147–152
9.
Zurück zum Zitat Fern XZ, Brodley CE (2004) Solving cluster ensemble problems by bipartite graph partitioning. In: Proceedings of international conference on machine learning, vol 21, New York, pp 36–44 Fern XZ, Brodley CE (2004) Solving cluster ensemble problems by bipartite graph partitioning. In: Proceedings of international conference on machine learning, vol 21, New York, pp 36–44
10.
Zurück zum Zitat Sanguinetti G, Laidler J, Lawrence N (2005) Automatic Determination of the number of clusters using spectral algorithms. In: Proceedings of IEEE machine learning for signal processing, USA, pp 28–30 Sanguinetti G, Laidler J, Lawrence N (2005) Automatic Determination of the number of clusters using spectral algorithms. In: Proceedings of IEEE machine learning for signal processing, USA, pp 28–30
11.
Zurück zum Zitat Fowlkes C, Belongie S, Chung F (2007) Spectral grouping using the Nystrom method. IEEE Trans Pattern Anal Mach Intell 26(2):217–225 Fowlkes C, Belongie S, Chung F (2007) Spectral grouping using the Nystrom method. IEEE Trans Pattern Anal Mach Intell 26(2):217–225
12.
Zurück zum Zitat Xu S, Lu ZM, Gu GC (2009) Two spectral algorithms for ensembling document clusters. Acta Autom Sin 35(7):997–1002 Xu S, Lu ZM, Gu GC (2009) Two spectral algorithms for ensembling document clusters. Acta Autom Sin 35(7):997–1002
13.
Zurück zum Zitat Yeung DS, Wang X (2002) Improving performance of similarity-based clustering by feature weight learning. IEEE Trans Pattern Anal Mach Intell 24(4):556–561 Yeung DS, Wang X (2002) Improving performance of similarity-based clustering by feature weight learning. IEEE Trans Pattern Anal Mach Intell 24(4):556–561
14.
Zurück zum Zitat Wang XZ, Wang YD, Wang LJ (2004) Improving fuzzy c-means clustering based on feature-weight learning. Pattern Recognit Lett 25(10):1123–1132 Wang XZ, Wang YD, Wang LJ (2004) Improving fuzzy c-means clustering based on feature-weight learning. Pattern Recognit Lett 25(10):1123–1132
15.
Zurück zum Zitat Wang XZ, Dong CR, Fan TG (2007) Training T-S Norm neural networks to refine weights for fuzzy if-then rules. Neurocomputing 70(13–15):2581–2587 Wang XZ, Dong CR, Fan TG (2007) Training T-S Norm neural networks to refine weights for fuzzy if-then rules. Neurocomputing 70(13–15):2581–2587
16.
Zurück zum Zitat Xing HJ, Hu BG (2008) An adaptive fuzzy c-means clustering-based mixtures of experts model for unlabeled data classification. Neurocomputing 71(4–6):1008–1021 Xing HJ, Hu BG (2008) An adaptive fuzzy c-means clustering-based mixtures of experts model for unlabeled data classification. Neurocomputing 71(4–6):1008–1021
17.
Zurück zum Zitat Wang XZ, Dong CR (2009) Improving generalization of fuzzy if-then rules by maximizing fuzzy entropy. IEEE Trans Fuzzy Syst 17(3):556–567 Wang XZ, Dong CR (2009) Improving generalization of fuzzy if-then rules by maximizing fuzzy entropy. IEEE Trans Fuzzy Syst 17(3):556–567
19.
Zurück zum Zitat Graaff AJ, Engelbrecht AP Clustering data in stationary environments with a local network neighborhood artificial immune system. Int J Mach Learn Cyber. doi:10.1007/s13042-011-0041-0 Graaff AJ, Engelbrecht AP Clustering data in stationary environments with a local network neighborhood artificial immune system. Int J Mach Learn Cyber. doi:10.​1007/​s13042-011-0041-0
21.
Zurück zum Zitat Wang XZ, He YL, Dong LC, Zhao HY (2011) Particle swarm optimization for determining fuzzy measures from data. Inf Sci 181(19):4230–4252 Wang XZ, He YL, Dong LC, Zhao HY (2011) Particle swarm optimization for determining fuzzy measures from data. Inf Sci 181(19):4230–4252
22.
Zurück zum Zitat Kluger Y, Basri R, Chang JT, Gerstein M (2003) Spectral biclustering of microarray data: coclustering genes and conditions. Genome Res 13(4):703–716 Kluger Y, Basri R, Chang JT, Gerstein M (2003) Spectral biclustering of microarray data: coclustering genes and conditions. Genome Res 13(4):703–716
23.
Zurück zum Zitat Guan J, Qiu G (2005) Spectral images and features co-clustering with application to content-based image retrieval. In: 7th IEEE workshop on multimedia signal processing, Shanghai, pp 1–4 Guan J, Qiu G (2005) Spectral images and features co-clustering with application to content-based image retrieval. In: 7th IEEE workshop on multimedia signal processing, Shanghai, pp 1–4
24.
Zurück zum Zitat Wieling M, Nerbonne J (2009) Bipartite spectral graph partitioning to co-cluster varieties and sound correspondences in dialectology. In: Proceedings of the 2009 workshop on graph-based methods for natural language processing, Singapore, pp 14–22 Wieling M, Nerbonne J (2009) Bipartite spectral graph partitioning to co-cluster varieties and sound correspondences in dialectology. In: Proceedings of the 2009 workshop on graph-based methods for natural language processing, Singapore, pp 14–22
25.
Zurück zum Zitat Xu G, Zong Y, Dolog P, Zhang Y (2010) Co-clustering analysis of weblogs using bipartite spectral projection approach. In: Proceedings of 14th KES, Cardiff, pp 398–407 Xu G, Zong Y, Dolog P, Zhang Y (2010) Co-clustering analysis of weblogs using bipartite spectral projection approach. In: Proceedings of 14th KES, Cardiff, pp 398–407
26.
Zurück zum Zitat Green NS (2010) Evolutionary spectral co-clustering. Dissertation, Rochester Institute of Technology Green NS (2010) Evolutionary spectral co-clustering. Dissertation, Rochester Institute of Technology
27.
Zurück zum Zitat Dhillon I (2001) Co-clustering documents and words using bipartite spectral graph partitioning. In: Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, pp 269–274 Dhillon I (2001) Co-clustering documents and words using bipartite spectral graph partitioning. In: Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, pp 269–274
30.
Zurück zum Zitat Strehl A, Ghosh J (2002) Cluster ensembles-a knowledge reuse framework for combining partitionings. J Mach Learn Res 3:583–617MathSciNet Strehl A, Ghosh J (2002) Cluster ensembles-a knowledge reuse framework for combining partitionings. J Mach Learn Res 3:583–617MathSciNet
Metadaten
Titel
Spectral co-clustering documents and words using fuzzy K-harmonic means
verfasst von
Na Liu
Fei Chen
Mingyu Lu
Publikationsdatum
01.02.2013
Verlag
Springer-Verlag
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 1/2013
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-012-0077-9

Weitere Artikel der Ausgabe 1/2013

International Journal of Machine Learning and Cybernetics 1/2013 Zur Ausgabe

Neuer Inhalt