Skip to main content
Top
Published in: Soft Computing 3/2016

03-01-2015 | Methodologies and Application

Chaotic gradient artificial bee colony for text clustering

Authors: Kusum Kumari Bharti, Pramod Kumar Singh

Published in: Soft Computing | Issue 3/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Text clustering is widely used to create clusters of the digital documents. Selection of cluster centers plays an important role in the clustering. In this paper, we use artificial bee colony algorithm (ABC) to select appropriate cluster centers for creating clusters of the text documents. The ABC is a population-based nature-inspired algorithm, which simulates intelligent foraging behavior of the real honey bees and has been shown effective in solving many search and optimization problems. However, a major drawback of the algorithm is that it provides a good exploration of the search space at the cost of exploitation. In this paper, we improve search equation of the ABC and embed two local search paradigms namely chaotic local search and gradient search in the basic ABC to improve its exploitation capability. The proposed algorithm is named as chaotic gradient artificial bee colony. The effectiveness of the proposed algorithm is tested on three different benchmark text datasets namely Reuters-21,578, Classic4, and WebKB. The obtained results are compared with the ABC, a recent variant of the ABC namely gbest-guided ABC, a variant of the proposed methodology namely chaotic artificial bee colony, memetic ABC, and conventional clustering algorithm K-means. The empirical evaluation reveals very encouraging results in terms of the quality of solution and convergence speed.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Bansal JC, Sharma H, Arya K, Nagar A (2013) Memetic search in artificial bee colony algorithm. Soft Comput 17(10):1911–1928CrossRef Bansal JC, Sharma H, Arya K, Nagar A (2013) Memetic search in artificial bee colony algorithm. Soft Comput 17(10):1911–1928CrossRef
go back to reference Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Kluwer Academic Publishers, NorwellCrossRefMATH Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Kluwer Academic Publishers, NorwellCrossRefMATH
go back to reference Bharti KK, Singh PK (2014a) A three-stage unsupervised dimension reduction method for text clustering. J Comput Sci 5(2):156–169 Bharti KK, Singh PK (2014a) A three-stage unsupervised dimension reduction method for text clustering. J Comput Sci 5(2):156–169
go back to reference Bharti KK, Singh PK (2014b) Chaotic artificial bee colony for text clustering. In: Fourth international conference on emerging applications of information technology (EAIT-2014), ISI. IEEE Kolkata Bharti KK, Singh PK (2014b) Chaotic artificial bee colony for text clustering. In: Fourth international conference on emerging applications of information technology (EAIT-2014), ISI. IEEE Kolkata
go back to reference Buckley C, Singhal A, Mitra M, Salton G (1995) New retrieval approaches using smart: TREC 4. In: Proceedings of the fourth text retrieval conference (TREC-4), pp 25–48 Buckley C, Singhal A, Mitra M, Salton G (1995) New retrieval approaches using smart: TREC 4. In: Proceedings of the fourth text retrieval conference (TREC-4), pp 25–48
go back to reference Chuang LY, Tsai SW, Yang CH (2011) Improved binary particle swarm optimization using catfish effect for feature selection. Expert Syst Appl 38(10):12699–12707CrossRef Chuang LY, Tsai SW, Yang CH (2011) Improved binary particle swarm optimization using catfish effect for feature selection. Expert Syst Appl 38(10):12699–12707CrossRef
go back to reference Cui X, Potok TE, Palathingal P (2005) Document clustering using particle swarm optimization. In: Proceedings of IEEE swarm intelligence symposium (SIS-2005). IEEE, pp 185–191 Cui X, Potok TE, Palathingal P (2005) Document clustering using particle swarm optimization. In: Proceedings of IEEE swarm intelligence symposium (SIS-2005). IEEE, pp 185–191
go back to reference Cura T (2012) A particle swarm optimization approach to clustering. Expert Syst Appl 39(1):1582–1588CrossRef Cura T (2012) A particle swarm optimization approach to clustering. Expert Syst Appl 39(1):1582–1588CrossRef
go back to reference Derrac J, García S, Molina D, Herrera F (2011) A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms. Swarm Evolut Comput 1(1):3–18CrossRef Derrac J, García S, Molina D, Herrera F (2011) A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms. Swarm Evolut Comput 1(1):3–18CrossRef
go back to reference Eberhart RC, Kennedy J (1995) A new optimizer using particle swarm theory. In: Proceedings of the 6th international symposium on micro machine and human science (MHS-1995), vol 1, New York, pp 39–43 Eberhart RC, Kennedy J (1995) A new optimizer using particle swarm theory. In: Proceedings of the 6th international symposium on micro machine and human science (MHS-1995), vol 1, New York, pp 39–43
go back to reference Fei K, Junjie L, Haojin L, Zhenyue M, Qing X (2010) Improved artificial bee colony algorithm. IEEE, 2nd international workshop on intelligent systems and applications (ISA-2010), pp 1–4 Fei K, Junjie L, Haojin L, Zhenyue M, Qing X (2010) Improved artificial bee colony algorithm. IEEE, 2nd international workshop on intelligent systems and applications (ISA-2010), pp 1–4
go back to reference Figueiredo F, Rocha L, Couto T, Salles T, Gonçalves MA, Meira W Jr (2011) Word co-occurrence features for text classification. Inf Syst 36(5):843–858CrossRef Figueiredo F, Rocha L, Couto T, Salles T, Gonçalves MA, Meira W Jr (2011) Word co-occurrence features for text classification. Inf Syst 36(5):843–858CrossRef
go back to reference Gao W, Liu S, Huang L (2012) A global best artificial bee colony algorithm for global optimization. J Comput Appl Math 236(11):2741–2753CrossRefMathSciNetMATH Gao W, Liu S, Huang L (2012) A global best artificial bee colony algorithm for global optimization. J Comput Appl Math 236(11):2741–2753CrossRefMathSciNetMATH
go back to reference W Gao, S Liu, L Huang (2013) A novel artificial bee colony algorithm with powell’s method. Appl Soft Comput 13(9):3763–3775CrossRef W Gao, S Liu, L Huang (2013) A novel artificial bee colony algorithm with powell’s method. Appl Soft Comput 13(9):3763–3775CrossRef
go back to reference Guo JQ, Zhou HF, Meng LQ (2009) Chaos particle swarm optimization algorithm for estimating solute transport parameters of streams from tracer experiment data. In: Fourth international conference on innovative computing, information and control (ICICIC-2009). IEEE, pp 872–875 Guo JQ, Zhou HF, Meng LQ (2009) Chaos particle swarm optimization algorithm for estimating solute transport parameters of streams from tracer experiment data. In: Fourth international conference on innovative computing, information and control (ICICIC-2009). IEEE, pp 872–875
go back to reference Han J, Kamber M (2006) Data mining. Concepts and techniques, Southeast Asia edn. Morgan kaufmann, WalthamMATH Han J, Kamber M (2006) Data mining. Concepts and techniques, Southeast Asia edn. Morgan kaufmann, WalthamMATH
go back to reference Handl J, Meyer B (2007) Ant-based and swarm-based clustering. Swarm Intell 1(2):95–113 Handl J, Meyer B (2007) Ant-based and swarm-based clustering. Swarm Intell 1(2):95–113
go back to reference He D, He C, Jiang LG, Zhu HW, Hu GR (2001) Chaotic characteristics of a one-dimensional iterative map with infinite collapses. IEEE Trans Circuits Syst I: Fundam Theory Appl 48(7):900–906 He D, He C, Jiang LG, Zhu HW, Hu GR (2001) Chaotic characteristics of a one-dimensional iterative map with infinite collapses. IEEE Trans Circuits Syst I: Fundam Theory Appl 48(7):900–906
go back to reference Jadhav H, Roy R (2013) Gbest guided artificial bee colony algorithm for environmental/economic dispatch considering wind power. Expert Syst Appl 40(16):6385–6399CrossRef Jadhav H, Roy R (2013) Gbest guided artificial bee colony algorithm for environmental/economic dispatch considering wind power. Expert Syst Appl 40(16):6385–6399CrossRef
go back to reference Jolliffe I (2005) Principal component analysis. Wiley Online Library Jolliffe I (2005) Principal component analysis. Wiley Online Library
go back to reference Karaboga D (2005) An idea based on honey bee swarm for numerical optimization. Technical report TR06, Engineering faculty, Computer Engineering Department, Erciyes University Press, Erciyes Karaboga D (2005) An idea based on honey bee swarm for numerical optimization. Technical report TR06, Engineering faculty, Computer Engineering Department, Erciyes University Press, Erciyes
go back to reference Karaboga D, Ozturk C (2011) A novel clustering approach: artificial bee colony (ABC) algorithm. Appl Soft Comput 11(1):652–657CrossRef Karaboga D, Ozturk C (2011) A novel clustering approach: artificial bee colony (ABC) algorithm. Appl Soft Comput 11(1):652–657CrossRef
go back to reference Kaufman L, Rousseeuw P (1987) Clustering by means of medoids. North-Holland, Amsterdam Kaufman L, Rousseeuw P (1987) Clustering by means of medoids. North-Holland, Amsterdam
go back to reference Li C, Zhou J, Kou P, Xiao J (2012) A novel chaotic particle swarm optimization based fuzzy clustering algorithm. Neurocomputing 83:98–109CrossRef Li C, Zhou J, Kou P, Xiao J (2012) A novel chaotic particle swarm optimization based fuzzy clustering algorithm. Neurocomputing 83:98–109CrossRef
go back to reference Liang Z (2010) Genetic enhancing chaotic particle swarm optimization algorithm. In: Proceedings of the 29th Chinese control conference (CCC-2010). IEEE, pp 5182–5187 Liang Z (2010) Genetic enhancing chaotic particle swarm optimization algorithm. In: Proceedings of the 29th Chinese control conference (CCC-2010). IEEE, pp 5182–5187
go back to reference MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley symposium on mathematical statistics and probability, California, vol 1, no 14, pp 281–297 MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley symposium on mathematical statistics and probability, California, vol 1, no 14, pp 281–297
go back to reference Maulik U, Bandyopadhyay S (2000) Genetic algorithm-based clustering technique. Pattern Recognit 33(9):1455–1465CrossRef Maulik U, Bandyopadhyay S (2000) Genetic algorithm-based clustering technique. Pattern Recognit 33(9):1455–1465CrossRef
go back to reference Pantel P, Lin D (2002) Document clustering with committees. In: Proceedings of the 25th annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 199–206 Pantel P, Lin D (2002) Document clustering with committees. In: Proceedings of the 25th annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 199–206
go back to reference Powell MJD (1977) Restart procedures for the conjugate gradient method. Math Program 12(1):241–254CrossRefMATH Powell MJD (1977) Restart procedures for the conjugate gradient method. Math Program 12(1):241–254CrossRefMATH
go back to reference Reed JW, Jiao Y, Potok TE, Klump BA, Elmore MT, Hurson AR (2006) Tf-icf: a new term weighting scheme for clustering dynamic data streams. In: 5th International conference on machine learning and applications (ICMLA-2006). IEEE, pp 258–263 Reed JW, Jiao Y, Potok TE, Klump BA, Elmore MT, Hurson AR (2006) Tf-icf: a new term weighting scheme for clustering dynamic data streams. In: 5th International conference on machine learning and applications (ICMLA-2006). IEEE, pp 258–263
go back to reference Robertson SE, Walker S (1999) Okapi/keenbow at trec-8. In: Text retrieval conference (TREC), vol 8, pp 151–162 Robertson SE, Walker S (1999) Okapi/keenbow at trec-8. In: Text retrieval conference (TREC), vol 8, pp 151–162
go back to reference Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manage 24(5):513–523CrossRef Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manage 24(5):513–523CrossRef
go back to reference Sharma H, Bansal JC, Arya K (2013) Opposition based lévy flight artificial bee colony. Memet Comput 5(3):213–227CrossRef Sharma H, Bansal JC, Arya K (2013) Opposition based lévy flight artificial bee colony. Memet Comput 5(3):213–227CrossRef
go back to reference Sharma TK, Pant M, Singh VP (2012) Improved local search in artificial bee colony using golden section search. J Eng 1(1):14–19 Sharma TK, Pant M, Singh VP (2012) Improved local search in artificial bee colony using golden section search. J Eng 1(1):14–19
go back to reference Tan PN, Steinbach M, Kumar V (2005) Introduction to Data Mining. Addison Wesley, Upper Saddle River Tan PN, Steinbach M, Kumar V (2005) Introduction to Data Mining. Addison Wesley, Upper Saddle River
go back to reference Uğuz H (2011) A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm. Knowl Based Syst 24(7):1024–1032CrossRef Uğuz H (2011) A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm. Knowl Based Syst 24(7):1024–1032CrossRef
go back to reference Umeno K, Kitayama K (1999) Spreading sequences using periodic orbits of chaos for CDMA. Electron Lett 35(7):545–546CrossRef Umeno K, Kitayama K (1999) Spreading sequences using periodic orbits of chaos for CDMA. Electron Lett 35(7):545–546CrossRef
go back to reference Zhu G, Kwong S (2010) Gbest-guided artificial bee colony algorithm for numerical function optimization. Appl Math Comput 217(7):3166–3173CrossRefMathSciNetMATH Zhu G, Kwong S (2010) Gbest-guided artificial bee colony algorithm for numerical function optimization. Appl Math Comput 217(7):3166–3173CrossRefMathSciNetMATH
Metadata
Title
Chaotic gradient artificial bee colony for text clustering
Authors
Kusum Kumari Bharti
Pramod Kumar Singh
Publication date
03-01-2015
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 3/2016
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-014-1571-7

Other articles of this Issue 3/2016

Soft Computing 3/2016 Go to the issue

Premium Partner