Published in: Soft Computing 3/2013

01.03.2013 | Original Paper

A novel approach for distance-based semi-supervised clustering using functional link neural network

Written by: B. Chandra, Manish Gupta

Abstract

Semi-supervised clustering is gaining importance because neither supervised nor unsupervised learning methods, used on their own, provide satisfactory results. Existing semi-supervised clustering techniques are mostly based on pair-wise constraints, which can be misleading. These algorithms also fail to handle attributes that carry different weights. In most real-life applications, attributes are not equally important, so the same weight cannot be assigned to every attribute. This paper proposes a novel distance-based semi-supervised clustering algorithm that uses a functional link neural network (FLNN) to find attribute weights from a small amount of labeled data; these weights are then used in the parametric Minkowski model for clustering. In the FLNN, nonlinearity is captured by enhancing the input using orthonormal basis functions. The effectiveness of the approach is illustrated on a number of datasets taken from the UCI machine learning repository. A comparative performance evaluation demonstrates that the proposed approach outperforms existing semi-supervised clustering algorithms. The proposed approach has also been used successfully to cluster crime locations and to find crime hot spots in India, using data provided by the National Crime Records Bureau (NCRB).
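
To make the two ingredients named in the abstract more concrete, the following is a minimal Python sketch and not the authors' implementation. It assumes the common weighted form of the Minkowski distance, d(x, y) = (sum_i w_i |x_i - y_i|^p)^(1/p), and a trigonometric functional expansion, which is one frequently used FLNN choice; the abstract does not state which basis functions or which weight-learning procedure the paper uses. The function names, the toy points, the centers, and the weights are all hypothetical, and the weights are supplied by hand here rather than learned by an FLNN.

```python
import math


def trig_expand(x, order=1):
    """Functional-link style input enhancement (assumed trigonometric basis).

    Each attribute x_i is kept and followed by sin(k*pi*x_i) and
    cos(k*pi*x_i) for k = 1..order.
    """
    out = []
    for xi in x:
        out.append(xi)
        for k in range(1, order + 1):
            out.append(math.sin(k * math.pi * xi))
            out.append(math.cos(k * math.pi * xi))
    return out


def weighted_minkowski(x, y, w, p=2.0):
    """Weighted Minkowski distance (sum_i w_i * |x_i - y_i|**p) ** (1/p)."""
    if not (len(x) == len(y) == len(w)):
        raise ValueError("x, y and w must have the same length")
    total = sum(wi * abs(xi - yi) ** p for xi, yi, wi in zip(x, y, w))
    return total ** (1.0 / p)


def assign(points, centers, w, p=2.0):
    """Return, for each point, the index of its nearest center."""
    return [
        min(range(len(centers)),
            key=lambda c: weighted_minkowski(pt, centers[c], w, p))
        for pt in points
    ]


# Hypothetical toy data: the second attribute dominates the unweighted distance.
points = [(0.0, 10.0), (0.2, 0.0), (1.0, 0.0), (0.9, 10.0)]
centers = [(0.0, 0.0), (1.0, 10.0)]

equal = assign(points, centers, w=[1.0, 1.0])       # [1, 0, 0, 1]
first_only = assign(points, centers, w=[1.0, 0.0])  # [0, 0, 1, 1]
expanded = trig_expand([0.5], order=1)              # 3 features from 1 attribute
```

With equal weights the grouping follows the second attribute; setting its weight to zero lets the first attribute decide the assignment. This is the kind of effect that attribute weights are meant to control, whatever procedure is used to obtain them.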

Metadata
Title
A novel approach for distance-based semi-supervised clustering using functional link neural network
Written by
B. Chandra
Manish Gupta
Publication date
01.03.2013
Publisher
Springer-Verlag
Published in
Soft Computing / Issue 3/2013
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-012-0912-7
