2015 | OriginalPaper | Chapter

Web Document Clustering by Using PSO-Based Cuckoo Search Clustering Algorithm

Abstract

With the growing volume of web information, web document clustering plays an important role in information retrieval. This paper presents a PSO-based Cuckoo Search Clustering Algorithm that combines the strengths of Cuckoo Search and Particle Swarm Optimization (PSO). New cuckoo solutions are generated from the PSO solutions. Among these solutions, the algorithm replaces eggs with poor fitness with more successful solutions until an optimal solution emerges. The proposed hybrid algorithm is tested on a web document benchmark dataset, and the results show that it performs well for web document clustering.
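
The hybrid idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no formulas, so everything specific here is an assumption made for illustration. That includes the function names `fitness` and `pso_cuckoo_cluster`, the use of mean squared Euclidean distance as the fitness measure (document clustering often uses cosine similarity on TF-IDF vectors instead), the parameter values `w`, `c1`, `c2`, `pa` and `noise`, and the choice to replace abandoned nests with noisy copies of the best solution found so far.

```python
import random


def fitness(centroids, docs):
    """Mean squared Euclidean distance from each document to its nearest
    centroid (lower is better). The distance measure is an assumption."""
    total = 0.0
    for doc in docs:
        total += min(
            sum((x - c) ** 2 for x, c in zip(doc, centroid))
            for centroid in centroids
        )
    return total / len(docs)


def copy_nest(nest):
    """Deep-copy one candidate solution (a list of centroid vectors)."""
    return [centroid[:] for centroid in nest]


def pso_cuckoo_cluster(docs, k, n_nests=10, iters=30, pa=0.25,
                       w=0.7, c1=1.5, c2=1.5, noise=0.1, seed=0):
    """Hypothetical hybrid: PSO moves generate new cuckoo solutions, then a
    fraction pa of the worst nests is abandoned and replaced."""
    rng = random.Random(seed)
    dim = len(docs[0])

    # Each nest (egg) is one candidate solution: k centroids drawn from docs.
    nests = [[list(rng.choice(docs)) for _ in range(k)] for _ in range(n_nests)]
    velocity = [[[0.0] * dim for _ in range(k)] for _ in range(n_nests)]
    pbest = [copy_nest(n) for n in nests]
    pbest_fit = [fitness(n, docs) for n in nests]
    g = min(range(n_nests), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = copy_nest(pbest[g]), pbest_fit[g]

    for _ in range(iters):
        current_fit = []
        for i in range(n_nests):
            # Standard PSO velocity and position update yields the new solution.
            for j in range(k):
                for t in range(dim):
                    r1, r2 = rng.random(), rng.random()
                    velocity[i][j][t] = (
                        w * velocity[i][j][t]
                        + c1 * r1 * (pbest[i][j][t] - nests[i][j][t])
                        + c2 * r2 * (gbest[j][t] - nests[i][j][t])
                    )
                    nests[i][j][t] += velocity[i][j][t]
            f = fitness(nests[i], docs)
            current_fit.append(f)
            if f < pbest_fit[i]:
                pbest[i], pbest_fit[i] = copy_nest(nests[i]), f
            if f < gbest_fit:
                gbest, gbest_fit = copy_nest(nests[i]), f

        # Cuckoo-style abandonment: the worst nests are replaced by noisy
        # copies of the best solution so far (an assumed replacement rule).
        worst = sorted(range(n_nests), key=lambda i: current_fit[i], reverse=True)
        for i in worst[:int(pa * n_nests)]:
            nests[i] = [[x + rng.gauss(0.0, noise) for x in c] for c in gbest]
            velocity[i] = [[0.0] * dim for _ in range(k)]

    return gbest, gbest_fit


if __name__ == "__main__":
    # Toy 2-D "documents"; real input would be term-weight vectors.
    toy_docs = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.2],
                [10.0, 10.1], [10.2, 10.0], [10.1, 9.9]]
    centroids, score = pso_cuckoo_cluster(toy_docs, k=2)
    print(centroids, score)
```

In this sketch the best-so-far solution never gets worse, because it is only overwritten when a strictly better nest appears. Replacing weak nests with perturbed copies of the best one is just one reading of "replace some eggs with successful solutions"; standard Cuckoo Search would instead draw new nests using Lévy flights.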

Metadata
Title
Web Document Clustering by Using PSO-Based Cuckoo Search Clustering Algorithm
Authors
Moe Moe Zaw
Ei Ei Mon
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-13826-8_14