Skip to main content

2019 | OriginalPaper | Buchkapitel

Simple and Accurate Classification Method Based on Class Association Rules Performs Well on Well-Known Datasets

verfasst von : Jamolbek Mattiev, Branko Kavšek

Erschienen in: Machine Learning, Optimization, and Data Science

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Existing classification rule learning algorithms use mainly greedy heuristic search to find regularities in datasets for classification. In recent years, extensive research on association rule mining was performed in the machine learning community on learning rules by using exhaustive search. The main objective is to find all rules in data that satisfy the user-specified minimum support and minimum confidence constraints. Although the whole set of rules may not be used directly for accurate classification, effective and efficient classifiers have been built using these, so called, classification association rules.
In this paper, we compare “classical” classification rule learning algorithms that use greedy heuristic search to produce the final classifier with a class association rule learner that uses constrained exhaustive search to find classification rules on “well known” datasets. We propose a simple method to extract class association rules by simple pruning to form an accurate classifier. This is a preliminary study that aims to show that an adequate choice of the “right” class association rules by considering the dependent (class) attribute distribution of values can produce a compact, understandable and relatively accurate classifier. We have performed experiments on 12 datasets from UCI Machine Learning Database Repository and compared the results with well-known rule-based and tree-based classification algorithms. Experimental results show that our method was consistent and comparative with other well-known classification algorithms. Although not achieving the best results in terms of classification accuracy, our method is relatively simple and produces compact and understandable classifiers by exhaustively searching the entire example space.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: VLDB 1994 Proceedings of the 20th International Conference on Very Large Data Bases, Chile, pp. 487–499 (1994) Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: VLDB 1994 Proceedings of the 20th International Conference on Very Large Data Bases, Chile, pp. 487–499 (1994)
2.
Zurück zum Zitat Ali, K., Manganaris, S., Srikant, R.: Partial classification using association rules. In: Proceedings of KDD-1997, U.S.A., pp. 115–118 (1997) Ali, K., Manganaris, S., Srikant, R.: Partial classification using association rules. In: Proceedings of KDD-1997, U.S.A., pp. 115–118 (1997)
3.
Zurück zum Zitat Baralis, E., Cagliero, L., Garza, P.: A novel pattern-based Bayesian classifier. IEEE Trans. Knowl. Data Eng. 25(12), 2780–2795 (2013)CrossRef Baralis, E., Cagliero, L., Garza, P.: A novel pattern-based Bayesian classifier. IEEE Trans. Knowl. Data Eng. 25(12), 2780–2795 (2013)CrossRef
4.
Zurück zum Zitat Bayardo, R.J.: Brute-force mining of high-confidence classification rules. In: Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, U.S.A., pp. 123–126 (1997) Bayardo, R.J.: Brute-force mining of high-confidence classification rules. In: Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, U.S.A., pp. 123–126 (1997)
5.
6.
Zurück zum Zitat Cendrowska, J.: PRISM: an algorithm for inducing modular rules. Int. J. Man-Mach. Stud. 27(4), 349–370 (1987)CrossRef Cendrowska, J.: PRISM: an algorithm for inducing modular rules. Int. J. Man-Mach. Stud. 27(4), 349–370 (1987)CrossRef
7.
Zurück zum Zitat Chen, G., Liu, H., Yu, L., Wei, Q., Zhang, X.: A new approach to classification based on association rule mining. Decis. Support Syst. 42(2), 674–689 (2006)CrossRef Chen, G., Liu, H., Yu, L., Wei, Q., Zhang, X.: A new approach to classification based on association rule mining. Decis. Support Syst. 42(2), 674–689 (2006)CrossRef
8.
Zurück zum Zitat Clark, P., Niblett, T.: The CN2 induction algorithm. Mach. Learn. 3(4), 261–283 (1989) Clark, P., Niblett, T.: The CN2 induction algorithm. Mach. Learn. 3(4), 261–283 (1989)
9.
Zurück zum Zitat Cohen, W.W.: Fast effective rule induction. In: ICML 1995 Proceedings of the Twelfth International Conference on Machine Learning, Tahoe City, California, pp. 115–123 (1995) Cohen, W.W.: Fast effective rule induction. In: ICML 1995 Proceedings of the Twelfth International Conference on Machine Learning, Tahoe City, California, pp. 115–123 (1995)
10.
Zurück zum Zitat Dua, D., Graff, C.: UCI Machine Learning Repository. University of California, Irvine (2019) Dua, D., Graff, C.: UCI Machine Learning Repository. University of California, Irvine (2019)
11.
Zurück zum Zitat Frank, E., Witten, I.: Generating accurate rule sets without global optimization. In: Fifteenth International Conference on Machine Learning, USA, pp. 144–151 (1998) Frank, E., Witten, I.: Generating accurate rule sets without global optimization. In: Fifteenth International Conference on Machine Learning, USA, pp. 144–151 (1998)
12.
Zurück zum Zitat Holte, R.: Very simple classification rules perform well on most commonly used datasets. Mach. Learn. 11(1), 63–91 (1993)CrossRef Holte, R.: Very simple classification rules perform well on most commonly used datasets. Mach. Learn. 11(1), 63–91 (1993)CrossRef
13.
Zurück zum Zitat Kohavi, R.: The power of decision tables. In: 8th European Conference on Machine Learning, Heraclion, Crete, Greece, pp. 174–189 (1995) Kohavi, R.: The power of decision tables. In: 8th European Conference on Machine Learning, Heraclion, Crete, Greece, pp. 174–189 (1995)
14.
Zurück zum Zitat Lent, B., Swami, A., Widom, J.: Clustering association rules. In: ICDE 1997 Proceedings of the Thirteenth International Conference on Data Engineering, England, pp. 220–231 (1997) Lent, B., Swami, A., Widom, J.: Clustering association rules. In: ICDE 1997 Proceedings of the Thirteenth International Conference on Data Engineering, England, pp. 220–231 (1997)
15.
Zurück zum Zitat Li, W., Han, J., Pei, J.: CMAR: accurate and efficient classification based on multiple class-association rules. In: Proceedings of the 1st IEEE International Conference on Data Mining (ICDM 2001), San Jose, California, USA, pp. 369–376 (2001) Li, W., Han, J., Pei, J.: CMAR: accurate and efficient classification based on multiple class-association rules. In: Proceedings of the 1st IEEE International Conference on Data Mining (ICDM 2001), San Jose, California, USA, pp. 369–376 (2001)
16.
Zurück zum Zitat Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining (KDD 1998), New York, USA, pp. 80–86 (1998) Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining (KDD 1998), New York, USA, pp. 80–86 (1998)
17.
Zurück zum Zitat Quinlan, J.: C4.5: Programs for machine learning. Mach. Learn. 16(3), 235–240 (1993) Quinlan, J.: C4.5: Programs for machine learning. Mach. Learn. 16(3), 235–240 (1993)
18.
Zurück zum Zitat Xiaoxin, Y., Jiawei, H. CPAR: classification based on predictive association rules. In: Proceedings of the SIAM International Conference on Data Mining, San Francisco, U.S.A., pp. 331–335 (2003) Xiaoxin, Y., Jiawei, H. CPAR: classification based on predictive association rules. In: Proceedings of the SIAM International Conference on Data Mining, San Francisco, U.S.A., pp. 331–335 (2003)
19.
Zurück zum Zitat Zhang, M., Zhou Z.: A k-nearest neighbor based algorithm for multi-label classification. In: Proceedings of the 1st IEEE International Conference on Granular Computing (GrC 2005), Beijing, China, vol. 2, pp. 718–721 (2005) Zhang, M., Zhou Z.: A k-nearest neighbor based algorithm for multi-label classification. In: Proceedings of the 1st IEEE International Conference on Granular Computing (GrC 2005), Beijing, China, vol. 2, pp. 718–721 (2005)
20.
Zurück zum Zitat Zhou, Z., Liu, X.: Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Trans. Knowl. Data Eng. 18(1), 63–77 (2006)MathSciNetCrossRef Zhou, Z., Liu, X.: Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Trans. Knowl. Data Eng. 18(1), 63–77 (2006)MathSciNetCrossRef
Metadaten
Titel
Simple and Accurate Classification Method Based on Class Association Rules Performs Well on Well-Known Datasets
verfasst von
Jamolbek Mattiev
Branko Kavšek
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-37599-7_17