Skip to main content

2019 | OriginalPaper | Buchkapitel

Enhanced Manhattan-Based Clustering Using Fuzzy C-Means Algorithm

verfasst von : Joven A. Tolentino, Bobby D. Gerardo, Ruji P. Medina

Erschienen in: Recent Advances in Information and Communication Technology 2018

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Fuzzy C-Means is a clustering algorithm known to suffer from slow processing time. One factor affecting this algorithm is on the selection of appropriate distance measure. While this drawback was addressed with the use of the Manhattan distance measure, this sacrifice its accuracy over processing time. In this study, a new approach to distance measurement is explored to answer both the speed and accuracy issues of Fuzzy C-Means incorporating trigonometric functions to Manhattan distance calculation. Upon application of the new approach for clustering of the Iris dataset, processing time was reduced by three iterations over the use of Euclidean distance. Improvement in accuracy was also observed with 50% and 78% improvement over the use of Euclidean and Manhattan distances respectively. The results provide clear proof that the new distance measurement approach was able to address both the slow processing time and accuracy problems associated with Fuzzy C-Means clustering algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Chen, H., Storey, V.C.: Business intelligence and analytics: from big data to big impact. MIS Q. 36(4), 1165–1188 (2012) Chen, H., Storey, V.C.: Business intelligence and analytics: from big data to big impact. MIS Q. 36(4), 1165–1188 (2012)
2.
Zurück zum Zitat Jung, Y.G., Kang, M.S., Heo, J.: Clustering performance comparison using K-Means and expectation maximization algorithms. Biotechnol. Biotechnol. Equip. 28(sub1), 44–48 (2014)CrossRef Jung, Y.G., Kang, M.S., Heo, J.: Clustering performance comparison using K-Means and expectation maximization algorithms. Biotechnol. Biotechnol. Equip. 28(sub1), 44–48 (2014)CrossRef
3.
Zurück zum Zitat Raval, U.R., Jani, C.: Implementing and improvisation of K-Means clustering. Int. J. Comput. Sci. Mob. Comput. 4(11), 72–76 (2015) Raval, U.R., Jani, C.: Implementing and improvisation of K-Means clustering. Int. J. Comput. Sci. Mob. Comput. 4(11), 72–76 (2015)
4.
Zurück zum Zitat Han, D., Ji, J., Dai, Y., Li, G., Fan, W., Chen, H.: Improved Fuzzy C-Means algorithm and its application to classification of remote sensing image in Chengdu City. In: 2016 International Conference on Progress in Informatics and Computing (PIC), pp. 437–443. IEEE, Shanghai (2016) Han, D., Ji, J., Dai, Y., Li, G., Fan, W., Chen, H.: Improved Fuzzy C-Means algorithm and its application to classification of remote sensing image in Chengdu City. In: 2016 International Conference on Progress in Informatics and Computing (PIC), pp. 437–443. IEEE, Shanghai (2016)
5.
Zurück zum Zitat Cebeci, Z., Yildiz, F.: Comparison of K-Means and Fuzzy C-Means algorithms on different cluster structures. J. Agric. Inf. 6(3), 13–23 (2015) Cebeci, Z., Yildiz, F.: Comparison of K-Means and Fuzzy C-Means algorithms on different cluster structures. J. Agric. Inf. 6(3), 13–23 (2015)
6.
Zurück zum Zitat Winkler, R., Klawonn, F., Kruse, R.: Problems of Fuzzy C-Means clustering and similar algorithms with high dimensional data sets. In: Gaul, W., Geyer-Schulz, A., Schmidt-Thieme, L., Kunze, J. (eds.) Challenges at the Interface of Data Analysis, Computer Science, and Optimization. Studies in Classification, Data Analysis, and Knowledge Organization, pp. 1–8. Springer, Heidelberg (2012)CrossRef Winkler, R., Klawonn, F., Kruse, R.: Problems of Fuzzy C-Means clustering and similar algorithms with high dimensional data sets. In: Gaul, W., Geyer-Schulz, A., Schmidt-Thieme, L., Kunze, J. (eds.) Challenges at the Interface of Data Analysis, Computer Science, and Optimization. Studies in Classification, Data Analysis, and Knowledge Organization, pp. 1–8. Springer, Heidelberg (2012)CrossRef
7.
Zurück zum Zitat Fahad, A., et al.: A survey of clustering algorithms for big data : taxonomy & empirical analysis. IEEE Trans. Emerg. Topics Comput. 2(3), 267–279 (2014)CrossRef Fahad, A., et al.: A survey of clustering algorithms for big data : taxonomy & empirical analysis. IEEE Trans. Emerg. Topics Comput. 2(3), 267–279 (2014)CrossRef
8.
Zurück zum Zitat Sharma, S.K., Kumar, S.: Comparative analysis of Manhattan and Euclidean distance metrics using A* algorithm. J. Res. Eng. Appl. Sci. 1(4), 196–198 (2016) Sharma, S.K., Kumar, S.: Comparative analysis of Manhattan and Euclidean distance metrics using A* algorithm. J. Res. Eng. Appl. Sci. 1(4), 196–198 (2016)
9.
Zurück zum Zitat Kalaivani, E.R., Suganya, G., Kiruba, J.: Review on K-Means and fuzzy C means clustering algorithm. Imp. J. Interdiscip. Res. 3(2), 1822–1824 (2017) Kalaivani, E.R., Suganya, G., Kiruba, J.: Review on K-Means and fuzzy C means clustering algorithm. Imp. J. Interdiscip. Res. 3(2), 1822–1824 (2017)
10.
Zurück zum Zitat Kouser, K.: A comparative study of K means algorithm by different distance measures. Int. J. Innov. Res. Comput. Commun. Eng. 1(9), 2443–2447 (2013) Kouser, K.: A comparative study of K means algorithm by different distance measures. Int. J. Innov. Res. Comput. Commun. Eng. 1(9), 2443–2447 (2013)
11.
Zurück zum Zitat Kapoor, A.: A comparative study of K-Means, K-Means ++ and Fuzzy C-Means clustering algorithms. In: 3rd International Conference on Computational Intelligence & Communication Technology (CICT), pp. 1–6. IEEE (2017) Kapoor, A.: A comparative study of K-Means, K-Means ++ and Fuzzy C-Means clustering algorithms. In: 3rd International Conference on Computational Intelligence & Communication Technology (CICT), pp. 1–6. IEEE (2017)
12.
Zurück zum Zitat Son, L.H.: Generalized picture distance measure and applications to picture fuzzy clustering. Appl. Soft Comput. J. 46(C), 284–295 (2016)CrossRef Son, L.H.: Generalized picture distance measure and applications to picture fuzzy clustering. Appl. Soft Comput. J. 46(C), 284–295 (2016)CrossRef
14.
Zurück zum Zitat Rathod, A.B.: A comparative study on distance measuring approaches for permutation representations. In: 2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT), pp. 251–255 (2016) Rathod, A.B.: A comparative study on distance measuring approaches for permutation representations. In: 2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT), pp. 251–255 (2016)
15.
Zurück zum Zitat Wang, X., Yu, F., Pedrycz, W.: An area-based shape distance measure of time series. Appl. Soft Comput. J. 46(C), 650–659 (2016)CrossRef Wang, X., Yu, F., Pedrycz, W.: An area-based shape distance measure of time series. Appl. Soft Comput. J. 46(C), 650–659 (2016)CrossRef
16.
Zurück zum Zitat Wang, Z., Zhao, N., Wang, W., Tang, R., Li, S.: A fault diagnosis approach for gas turbine exhaust gas temperature based on Fuzzy C-Means clustering and support vector machine. Math. Prob. Eng. 2015, 1–11 (2015) Wang, Z., Zhao, N., Wang, W., Tang, R., Li, S.: A fault diagnosis approach for gas turbine exhaust gas temperature based on Fuzzy C-Means clustering and support vector machine. Math. Prob. Eng. 2015, 1–11 (2015)
17.
Zurück zum Zitat Grover, N.: A study of various fuzzy clustering algorithms. Int. J. Eng. Res. 5013(3), 177–181 (2014)CrossRef Grover, N.: A study of various fuzzy clustering algorithms. Int. J. Eng. Res. 5013(3), 177–181 (2014)CrossRef
18.
Zurück zum Zitat Leddo, J.: Comparing the relative effects of homework on high aptitude VS average aptitude students 2(9), 59–62 (2016) Leddo, J.: Comparing the relative effects of homework on high aptitude VS average aptitude students 2(9), 59–62 (2016)
19.
Zurück zum Zitat Dom, B.E., Jose, S.: An information-theoretic external cluster-validity measure. In: Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI 2002), pp. 137–145, Alberta, Canada (2002) Dom, B.E., Jose, S.: An information-theoretic external cluster-validity measure. In: Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI 2002), pp. 137–145, Alberta, Canada (2002)
20.
Zurück zum Zitat Färber, I., et al.: On using class-labels in evaluation of clusterings. In: Proceedings of 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) Held in Conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2010), p. 9. Washington, D.C., USA (2010) Färber, I., et al.: On using class-labels in evaluation of clusterings. In: Proceedings of 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) Held in Conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2010), p. 9. Washington, D.C., USA (2010)
Metadaten
Titel
Enhanced Manhattan-Based Clustering Using Fuzzy C-Means Algorithm
verfasst von
Joven A. Tolentino
Bobby D. Gerardo
Ruji P. Medina
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-319-93692-5_13