Skip to main content
Top

2017 | OriginalPaper | Chapter

Clustering Based on Classification Quality (CCQ)

Authors : Iwan Tri Riyadi Yanto, Rd Rohmat Saedudin, Dedy Hartama, Tutut Herawan

Published in: Recent Advances on Soft Computing and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Clustering a set of objects into homogeneous classes is a fundamental operation in data mining. Categorical data clustering based on rough set theory has been an active research area in the field of machine learning. However, pure rough set theory is not well suited for analyzing noisy information systems. In this paper, an alternative technique for categorical data clustering using Variable Precision Rough Set model is proposed. It is based on the classification quality of Variable Precision Rough theory. The technique is implemented in MATLAB. Experimental results on three benchmark UCI datasets indicate that the technique can be successfully used to analyze grouped categorical data because it produces better clustering results.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Huang, Z.: Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Min. Knowl. Disc. 2(3), 283–304 (1998)CrossRef Huang, Z.: Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Min. Knowl. Disc. 2(3), 283–304 (1998)CrossRef
2.
go back to reference Johnson, R., Wichern, W.: Applied Multivariate Statistical Analysis. Prentice Hall, New York (2002)MATH Johnson, R., Wichern, W.: Applied Multivariate Statistical Analysis. Prentice Hall, New York (2002)MATH
3.
go back to reference Park, I.-K., Choi, G.-S.: Rough set approach for clustering categorical data using information-theoretic dependency measure. Inf. Syst. 48, 289–295 (2015). ISSN 0306-4379CrossRef Park, I.-K., Choi, G.-S.: Rough set approach for clustering categorical data using information-theoretic dependency measure. Inf. Syst. 48, 289–295 (2015). ISSN 0306-4379CrossRef
4.
go back to reference Li, M., Deng, S., Wang, L., Feng, S., Fan, J.: Hierarchical clustering algorithm for categorical data using a probabilistic rough set model. Knowl. Based Syst. 65, 60–71 (2014). ISSN 0950-7051CrossRef Li, M., Deng, S., Wang, L., Feng, S., Fan, J.: Hierarchical clustering algorithm for categorical data using a probabilistic rough set model. Knowl. Based Syst. 65, 60–71 (2014). ISSN 0950-7051CrossRef
6.
go back to reference Pawlak, Z.: Rough Sets: A Theoretical Aspect of Reasoning About Data. Kluwer Academic Publisher, Dordrecht (1991)CrossRefMATH Pawlak, Z.: Rough Sets: A Theoretical Aspect of Reasoning About Data. Kluwer Academic Publisher, Dordrecht (1991)CrossRefMATH
8.
go back to reference Mazlack, L.J., He, A., Zhu, Y., Coppock, S.: A rough set approach in choosing partitioning attributes. In: Proceedings of the ISCA 13th International Conference, CAINE-2000, pp. 1–6 (2000) Mazlack, L.J., He, A., Zhu, Y., Coppock, S.: A rough set approach in choosing partitioning attributes. In: Proceedings of the ISCA 13th International Conference, CAINE-2000, pp. 1–6 (2000)
9.
go back to reference Parmar, D., Wu, T., Blackhurst, J.: MMR: an algorithm for clustering categorical data using rough set theory. Data Knowl. Eng. 63, 879–893 (2007)CrossRef Parmar, D., Wu, T., Blackhurst, J.: MMR: an algorithm for clustering categorical data using rough set theory. Data Knowl. Eng. 63, 879–893 (2007)CrossRef
10.
go back to reference Gong, Z.T., Shi, Z.H., Yao, H.Y.: Variable precision rough set model for incomplete information systems and its Β-reducts. Comput. Inf. 31(2012), 1385–1399 (2012)MathSciNetMATH Gong, Z.T., Shi, Z.H., Yao, H.Y.: Variable precision rough set model for incomplete information systems and its Β-reducts. Comput. Inf. 31(2012), 1385–1399 (2012)MathSciNetMATH
12.
Metadata
Title
Clustering Based on Classification Quality (CCQ)
Authors
Iwan Tri Riyadi Yanto
Rd Rohmat Saedudin
Dedy Hartama
Tutut Herawan
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-51281-5_33

Premium Partner