Skip to main content

2016 | OriginalPaper | Buchkapitel

Towards Expressive Modular Rule Induction for Numerical Attributes

verfasst von : Manal Almutairi, Frederic Stahl, Mathew Jennings, Thien Le, Max Bramer

Erschienen in: Research and Development in Intelligent Systems XXXIII

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The Prism family is an alternative set of predictive data mining algorithms to the more established decision tree data mining algorithms. Prism classifiers are more expressive and user friendly compared with decision trees and achieve a similar accuracy compared with that of decision trees and even outperform decision trees in some cases. This is especially the case where there is noise and clashes in the training data. However, Prism algorithms still tend to overfit on noisy data; this has led to the development of pruning methods which have allowed the Prism algorithms to generalise better over the dataset. The work presented in this paper aims to address the problem of overfitting at rule induction stage for numerical attributes by proposing a new numerical rule term structure based on the Gauss Probability Density Distribution. This new rule term structure is not only expected to lead to a more robust classifier, but also lowers the computational requirements as it needs to induce fewer rule terms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bramer, M.: Principles of Data Mining. Undergraduate Topics in Computer Science. Springer International Publishing (2013) Bramer, M.: Principles of Data Mining. Undergraduate Topics in Computer Science. Springer International Publishing (2013)
2.
Zurück zum Zitat Bramer, M.A.: An information-theoretic approach to the pre-pruning of classification rules. In: Neumann, B., Musen, M., Studer, R. (eds) Intelligent Information Processing, pp. 201–212. Kluwer (2002) Bramer, M.A.: An information-theoretic approach to the pre-pruning of classification rules. In: Neumann, B., Musen, M., Studer, R. (eds) Intelligent Information Processing, pp. 201–212. Kluwer (2002)
3.
Zurück zum Zitat Cendrowska, J.: PRISM: an algorithm for inducing modular rules (1987) Cendrowska, J.: PRISM: an algorithm for inducing modular rules (1987)
4.
Zurück zum Zitat Le, T., Stahl, F., Gomes, J., Gaber, M.M. Di Fatta, G.: Computationally efficient rule-based classification for continuous streaming data. In: Research and Development in Intelligent Systems XXXI, pp. 21–34. Springer (2014) Le, T., Stahl, F., Gomes, J., Gaber, M.M. Di Fatta, G.: Computationally efficient rule-based classification for continuous streaming data. In: Research and Development in Intelligent Systems XXXI, pp. 21–34. Springer (2014)
5.
Zurück zum Zitat Lichman, M.: UCI machine learning repository (2013) Lichman, M.: UCI machine learning repository (2013)
6.
Zurück zum Zitat Ross, J.: Quinlan induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)MathSciNet Ross, J.: Quinlan induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)MathSciNet
7.
Zurück zum Zitat Stahl, F., Bramer, M.: Computationally efficient induction of classification rules with the PMCRI and j-pmcri frameworks. Knowl.-Based Syst. 35, 49–63 (2012)CrossRef Stahl, F., Bramer, M.: Computationally efficient induction of classification rules with the PMCRI and j-pmcri frameworks. Knowl.-Based Syst. 35, 49–63 (2012)CrossRef
8.
Zurück zum Zitat Stahl, F., Bramer, M.: Jmax-pruning: a facility for the information theoretic pruning of modular classification rules. Knowl.-Based Syst. 29, 12–19 (2012)CrossRef Stahl, F., Bramer, M.: Jmax-pruning: a facility for the information theoretic pruning of modular classification rules. Knowl.-Based Syst. 29, 12–19 (2012)CrossRef
9.
Zurück zum Zitat Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques: Practical Machine Learning Tools and Techniques. Elsevier Science, The Morgan Kaufmann Series in Data Management Systems (2011)MATH Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques: Practical Machine Learning Tools and Techniques. Elsevier Science, The Morgan Kaufmann Series in Data Management Systems (2011)MATH
Metadaten
Titel
Towards Expressive Modular Rule Induction for Numerical Attributes
verfasst von
Manal Almutairi
Frederic Stahl
Mathew Jennings
Thien Le
Max Bramer
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-47175-4_16