Published in: The Journal of Supercomputing 6/2020

17-10-2017

Swarm-based clustering algorithm for efficient web blog and data classification

Authors: E. A. Neeba, S. Koteeswaran, N. Malarvizhi

Abstract

Weblogs have become one of the most common ways for people to express themselves, which motivates weblog classification alongside general data classification. Data classification is the problem of dividing a feature set into several feature subsets, which are then clustered into classes on the basis of binary or multiclass classification. Many problems in science and technology, industry and commercial business, and medicine and health care can be treated as classification problems. In recent years, many methods have been developed to build classification models based on statistical concepts and optimization methods. One major issue with statistical models is that they provide good accuracy only when their underlying assumptions are correct. A classification decision judged on accuracy alone justifies only the performance of the particular model. Before a model is applied to a particular application, a good understanding of the data used is required. This article aims to provide an effective learning algorithm that handles this complexity in the data, minimizes output errors, and improves the efficiency of the model. In this work, a novel algorithm named ‘swarm-based cluster algorithm’ is proposed to perform the feature selection task and produce optimized feature-based clusters for effective data and weblog classification.

Metadata
Title
Swarm-based clustering algorithm for efficient web blog and data classification
Authors
E. A. Neeba
S. Koteeswaran
N. Malarvizhi
Publication date
17-10-2017
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 6/2020
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-017-2162-z
