Skip to main content

2017 | OriginalPaper | Buchkapitel

Clustering Algorithms: Experiment and Improvements

verfasst von : Anand Khandare, A. S. Alvi

Erschienen in: Computing and Network Sustainability

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Clustering is data mining method to divide the data objects into n number of clusters. Clustering algorithms can be used in domains such as e-commerce, bio-informatics, image segmentation, speech recognition, financial analysis, and fraud detection. There is abandon knowledge in the clustering research and applications and also various improvements are done on various clustering algorithms. This paper includes the study and survey of various concepts and clustering algorithms by experimenting on it on some data sets and then analyzed gaps and scope for enhancement and scalability of algorithms. Then improved k-means is proposed to minimize these gaps. This improved algorithm automatically finds value of number of clusters and calculates initial centroids in better way rather random selection. From the experimentation, it is found that numbers of iterations are reduced; clusters quality increased and also minimized empty clusters in proposed algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dunham MH (2006) Data mining-introduction and advanced concepts. Pearson Education Dunham MH (2006) Data mining-introduction and advanced concepts. Pearson Education
2.
Zurück zum Zitat Aggarwal CC, Zhai C (2012) Survey on text clustering algorithms in mining text data. Springer, USA, pp 77–128 Aggarwal CC, Zhai C (2012) Survey on text clustering algorithms in mining text data. Springer, USA, pp 77–128
3.
Zurück zum Zitat Mahmood A, Leckie C, Udaya P (2007) An efficient clustering scheme to exploit hierarchical data in NW traffic analysis. IEEE Tran. Knowl Data Eng 20(6):752–767 Mahmood A, Leckie C, Udaya P (2007) An efficient clustering scheme to exploit hierarchical data in NW traffic analysis. IEEE Tran. Knowl Data Eng 20(6):752–767
4.
Zurück zum Zitat Kanungo T, Mount DM, Netanyahu NS, Wu AY, Piatko CD, Silverman R (2002) An efficient k-means clustering algorithm-analysis and implementation. IEEE Trans Pattern Anal Mach Intell 24(7) Kanungo T, Mount DM, Netanyahu NS, Wu AY, Piatko CD, Silverman R (2002) An efficient k-means clustering algorithm-analysis and implementation. IEEE Trans Pattern Anal Mach Intell 24(7)
5.
Zurück zum Zitat Pham DT, Dimov SS, Nguyen CD (2005) Selection of value of k in k-means clustering. Proc Mech Mech Eng Sci 219 Pham DT, Dimov SS, Nguyen CD (2005) Selection of value of k in k-means clustering. Proc Mech Mech Eng Sci 219
6.
Zurück zum Zitat Fong S (2013) Opportunities and challenges of integrating bio inspired optimization and data mining algorithms. In: Swarm intelligence and bio inspired computation. Elsevier, pp 385–401 Fong S (2013) Opportunities and challenges of integrating bio inspired optimization and data mining algorithms. In: Swarm intelligence and bio inspired computation. Elsevier, pp 385–401
7.
Zurück zum Zitat Abbasi AA, Younis M (2007) A survey of clustering algorithms for wireless sensor networks. Comput Commun 30(14), 15, 2822841 Abbasi AA, Younis M (2007) A survey of clustering algorithms for wireless sensor networks. Comput Commun 30(14), 15, 2822841
8.
Zurück zum Zitat Bishnu PS, Bhattacherjee V (2012) Software fault predictions using quad tree based k-means clustering algorithm. IEEE Trans Knowl Data Eng 24(6) Bishnu PS, Bhattacherjee V (2012) Software fault predictions using quad tree based k-means clustering algorithm. IEEE Trans Knowl Data Eng 24(6)
9.
Zurück zum Zitat Siddiqui FU, Isa NAM (2011) Enhanced moving k-means algorithm for image segmentation. IEEE Tran Consum Electron 57(2) Siddiqui FU, Isa NAM (2011) Enhanced moving k-means algorithm for image segmentation. IEEE Tran Consum Electron 57(2)
10.
Zurück zum Zitat Khandare AD (2015) A modified k-means algorithm for emotional intelligence mining, ICCCI-15, Coimbatore, India, pp 1–3 Khandare AD (2015) A modified k-means algorithm for emotional intelligence mining, ICCCI-15, Coimbatore, India, pp 1–3
11.
Zurück zum Zitat Harrison R, Zhong W, Altun G, Tai PC, Pan Y (2005) Improved k-means clustering algorithm for exploring local protein sequence motifs representing common structural property. IEEE Trans Nanobiosci 4(3) Harrison R, Zhong W, Altun G, Tai PC, Pan Y (2005) Improved k-means clustering algorithm for exploring local protein sequence motifs representing common structural property. IEEE Trans Nanobiosci 4(3)
12.
Zurück zum Zitat Jaber H, Marle F, Jankovic M (2015) Improving the collaborative decision making in the new products development project using clustering algorithm. IEEE Trans Eng Manag 62(4) Jaber H, Marle F, Jankovic M (2015) Improving the collaborative decision making in the new products development project using clustering algorithm. IEEE Trans Eng Manag 62(4)
13.
Zurück zum Zitat Li T-HS, Kao M-C, Kuo P-H (2016) Recognitions system for the home service related sign languages using entropy based kmeans algorithm and the ABC based HMM. IEEE Trans Syst Man Cybern Syst 46(1) Li T-HS, Kao M-C, Kuo P-H (2016) Recognitions system for the home service related sign languages using entropy based kmeans algorithm and the ABC based HMM. IEEE Trans Syst Man Cybern Syst 46(1)
14.
Zurück zum Zitat Wu X, Zhu X, Wu G-Q, Ding W (2014) Data mining on big data. IEEE Trans Knowl Data Eng 26(1) Wu X, Zhu X, Wu G-Q, Ding W (2014) Data mining on big data. IEEE Trans Knowl Data Eng 26(1)
15.
Zurück zum Zitat Traganitis PA, Slavakis K, Giannakis GB (2015) Sketch and validate big data clustering. IEEE J Sel Top Signal Process 9(4) Traganitis PA, Slavakis K, Giannakis GB (2015) Sketch and validate big data clustering. IEEE J Sel Top Signal Process 9(4)
16.
Zurück zum Zitat Khandare A, Alvi AS (2016) Survey of improved k-means clustering algorithms-an improvements, shortcoming and scope for further enhancement and scalability, INDIA-2016, vol 434. AISC Springer, pp 495–503 Khandare A, Alvi AS (2016) Survey of improved k-means clustering algorithms-an improvements, shortcoming and scope for further enhancement and scalability, INDIA-2016, vol 434. AISC Springer, pp 495–503
17.
Zurück zum Zitat Xu R, Wunsch D II (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3) Xu R, Wunsch D II (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3)
18.
Zurück zum Zitat AM Fahim, AM Salem, FATorkey, M.A. Ramadan (2006) An efficient enhance kmeans clustering algorithm. J Zhejiang Univ Sci 7(10):1626–1633 AM Fahim, AM Salem, FATorkey, M.A. Ramadan (2006) An efficient enhance kmeans clustering algorithm. J Zhejiang Univ Sci 7(10):1626–1633
19.
Zurück zum Zitat Verma NK, Roy A (2014) Self optimal clustering techniques using optimized threshold function. IEEE Syst J 8(4) Verma NK, Roy A (2014) Self optimal clustering techniques using optimized threshold function. IEEE Syst J 8(4)
20.
Zurück zum Zitat Harb H, Makhoul A, Couturier R (2015) Enhanced k-means, ANOVA based clustering approach for similarity aggregation in underwater wireless sensor networks. IEEE Sens J 15(10) Harb H, Makhoul A, Couturier R (2015) Enhanced k-means, ANOVA based clustering approach for similarity aggregation in underwater wireless sensor networks. IEEE Sens J 15(10)
21.
Zurück zum Zitat Liang H-W, Chung W-H, Kuo S-Y (2016) Coding aided k-means clustering blind transceiver for space shift keying mimo system. IEEE Trans Wirel Commun 15(1) Liang H-W, Chung W-H, Kuo S-Y (2016) Coding aided k-means clustering blind transceiver for space shift keying mimo system. IEEE Trans Wirel Commun 15(1)
22.
Zurück zum Zitat Kumar R, Dwivedi R (2016) Quaternion domain kmeans clustering for the improved real time classification of E-Nose data. IEEE Sens J 16(1) Kumar R, Dwivedi R (2016) Quaternion domain kmeans clustering for the improved real time classification of E-Nose data. IEEE Sens J 16(1)
23.
Zurück zum Zitat Antonenko PD, Toy S, Niederhauser DS (2012) Using cluster analysis for the data mining in educational technology research R&D Antonenko PD, Toy S, Niederhauser DS (2012) Using cluster analysis for the data mining in educational technology research R&D
24.
Zurück zum Zitat Kwak J, Lee T, Kim CO (2015) Incremental clustering algorithm based fault detection algorithm for class imbalanced process data. IEEE Trans Semicond Manuf 28(3) (Yonsei University, Seoul, Korea) Kwak J, Lee T, Kim CO (2015) Incremental clustering algorithm based fault detection algorithm for class imbalanced process data. IEEE Trans Semicond Manuf 28(3) (Yonsei University, Seoul, Korea)
25.
Zurück zum Zitat Sulaiman SN, Isa NAM (2010) Adaptive fuzzy k-means clustering algorithm for image segmentation. IEEE Trans Consum Electron 56(4) Sulaiman SN, Isa NAM (2010) Adaptive fuzzy k-means clustering algorithm for image segmentation. IEEE Trans Consum Electron 56(4)
26.
Zurück zum Zitat Huang X, Ye Y, Zhang H (2014) Extensions of k-means type algorithms: a new clustering framework by integrating intra cluster compactness and inter cluster separation, IEEE Trans Neural Netw Learn Syst 25(8) Huang X, Ye Y, Zhang H (2014) Extensions of k-means type algorithms: a new clustering framework by integrating intra cluster compactness and inter cluster separation, IEEE Trans Neural Netw Learn Syst 25(8)
27.
Zurück zum Zitat Xie M, Cui H, Cai Y, Huang X, Liu Y (2014) Cluster validity index for adaptive clustering algorithms. IET Commun 8(13) Xie M, Cui H, Cai Y, Huang X, Liu Y (2014) Cluster validity index for adaptive clustering algorithms. IET Commun 8(13)
28.
Zurück zum Zitat Bandyopadhyay S, Coyle E (2003) An energy efficient hierarchical clustering algorithm for wireless sensor networks. In: Proceedings of the 22 annual joint conference, IEEE computer and communication societies, San Francisco, California Bandyopadhyay S, Coyle E (2003) An energy efficient hierarchical clustering algorithm for wireless sensor networks. In: Proceedings of the 22 annual joint conference, IEEE computer and communication societies, San Francisco, California
29.
Zurück zum Zitat An F, Mattausch HJ (2013) k-means clustering algorithm for multimedia application with flexible hardware and software co-design. J Syst Archit 59(3) (Elsevier) An F, Mattausch HJ (2013) k-means clustering algorithm for multimedia application with flexible hardware and software co-design. J Syst Archit 59(3) (Elsevier)
Metadaten
Titel
Clustering Algorithms: Experiment and Improvements
verfasst von
Anand Khandare
A. S. Alvi
Copyright-Jahr
2017
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-3935-5_27

Neuer Inhalt