Skip to main content
Top

2016 | OriginalPaper | Chapter

Experiments on Data Classification Using Relative Entropy

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Data classification is one of the basic tasks in data mining. In this paper, we propose a new classifier based on relative entropy, where data to particular class assignment is made by the majority good guess criteria. The presented approach is intended to be used when relations between datasets and assignment classes are rather complex, nonlinear, or with logical inconsistencies; because such datasets can be too complex to be classified by ordinary methods of decision trees or by the tools of logical analysis. The relative entropy evaluation of associative rules can be simple to interpret and offers better comprehensibility in comparison to decision trees and artificial neural networks.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Kotsiantis, S.B.: Supervised machine learning: a review of classification techniques. Informatica 31, 249–268 (2007)MathSciNetMATH Kotsiantis, S.B.: Supervised machine learning: a review of classification techniques. Informatica 31, 249–268 (2007)MathSciNetMATH
2.
go back to reference Fürnkranz, J., Flach, P.A.: ROC ‘n’ rule learning—towards a better understanding of covering rules. Mach. Learn. 58, 39–77 (2005)CrossRefMATH Fürnkranz, J., Flach, P.A.: ROC ‘n’ rule learning—towards a better understanding of covering rules. Mach. Learn. 58, 39–77 (2005)CrossRefMATH
3.
go back to reference Shannon, C.E.: A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423, 623–656 (1948) Shannon, C.E.: A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423, 623–656 (1948)
5.
go back to reference Quinlan, J.R.: Learning efficient classification procedures and their application to chess endgames. Machine Learning: An Artificial Inteligence Approach, pp. 463–482. Palo Alto, Tioga (1983)CrossRef Quinlan, J.R.: Learning efficient classification procedures and their application to chess endgames. Machine Learning: An Artificial Inteligence Approach, pp. 463–482. Palo Alto, Tioga (1983)CrossRef
6.
go back to reference Cendrowska, J.: PRISM: an algorithm for inducing modular rules. Int. J. Man-Mach. Stud. 27, 349–370 (1987)CrossRefMATH Cendrowska, J.: PRISM: an algorithm for inducing modular rules. Int. J. Man-Mach. Stud. 27, 349–370 (1987)CrossRefMATH
7.
go back to reference Thabtah, F.A., Cowling, P.I.: A greedy classification algorithm based on association rule. Appl. Soft Comput. 7, 1102–1111 (2007)CrossRef Thabtah, F.A., Cowling, P.I.: A greedy classification algorithm based on association rule. Appl. Soft Comput. 7, 1102–1111 (2007)CrossRef
8.
go back to reference Li, J., Wong, L.: Using rules to analyse bio-medical data: a comparison between C4.5 and PCL. Adv. Web-Age Inf. Manag. 4, 254–265 (2003)CrossRef Li, J., Wong, L.: Using rules to analyse bio-medical data: a comparison between C4.5 and PCL. Adv. Web-Age Inf. Manag. 4, 254–265 (2003)CrossRef
9.
go back to reference Fano, R.M.: Transmission of Information. A Statistical Theory of Communications. M.I.T. Press, New York (1961)MATH Fano, R.M.: Transmission of Information. A Statistical Theory of Communications. M.I.T. Press, New York (1961)MATH
11.
go back to reference Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., San Francisco (1993) Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., San Francisco (1993)
12.
go back to reference Cohen, W.: Fast effective rule induction. In: Proceedings of ICML-95, pp. 115–123 (1995) Cohen, W.: Fast effective rule induction. In: Proceedings of ICML-95, pp. 115–123 (1995)
13.
go back to reference Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Proceedings of the KDD, pp. 80–86. New York (1998) Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Proceedings of the KDD, pp. 80–86. New York (1998)
Metadata
Title
Experiments on Data Classification Using Relative Entropy
Authors
Michal Vašinek
Jan Platoš
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-26227-7_22

Premium Partner