Skip to main content
Erschienen in: Soft Computing 7/2020

01.08.2019 | Methodologies and Application

A novel switching function approach for data mining classification problems

verfasst von: Mohammed Hussein Ibrahim, Mehmet Hacibeyoglu

Erschienen in: Soft Computing | Ausgabe 7/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Rule induction (RI) is one of the known classification approaches in data mining. RI extracts hidden patterns from instances in terms of rules. This paper proposes a logic-based rule induction (LBRI) classifier based on a switching function approach. LBRI generates binary rules by using a novel minimization function, which depends on simple and powerful bitwise operations. Initially, LBRI generates instance codes by encoding the dataset with standard binary code and then generates prime cubes (PC) for all classes from the instance codes by the proposed reduced offset method. Finally, LBRI selects the most effective PC of the current classes and adds them into the binary rule set that belongs to the current class. Each binary rule represents an IfThen rule for the rule induction classifiers. The proposed LBRI classifier is based on basic logic functions. It is a simple and effective method, and it can be used by intelligent systems to solve real-life classification/prediction problems in areas such as health care, online/financial banking, image/voice recognition, and bioinformatics. The performance of the proposed algorithm is compared to six rule induction algorithms; decision table, Ripper, C4.5, REPTree, OneR, and ICRM by using nineteen different datasets. The experimental results show that the proposed algorithm yields better classification accuracy than the other rule induction algorithms on ten out of nineteen datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abdelhamid N, Ayesh A, Thabtah F (2014) Phishing detection based associative classification data mining. Expert Syst Appl 41(13):5948–5959CrossRef Abdelhamid N, Ayesh A, Thabtah F (2014) Phishing detection based associative classification data mining. Expert Syst Appl 41(13):5948–5959CrossRef
Zurück zum Zitat Alcalá-Fdez J, Fernández A, Luengo J, Derrac J, García S, Sánchez L, Herrera AF (2011) KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J Mult-Valued Logic Soft Comput 17:255–287 Alcalá-Fdez J, Fernández A, Luengo J, Derrac J, García S, Sánchez L, Herrera AF (2011) KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J Mult-Valued Logic Soft Comput 17:255–287
Zurück zum Zitat An LP, Tong LY (2010) Binary relations as a basis for rule induction in presence of quantitative attributes. JCP 5(3):440–447 An LP, Tong LY (2010) Binary relations as a basis for rule induction in presence of quantitative attributes. JCP 5(3):440–447
Zurück zum Zitat Bazan JG (1998) A comparison of dynamic and non-dynamic rough set methods for extracting laws from decision tables. Rough Sets Knowl Discov 1:321–365MATH Bazan JG (1998) A comparison of dynamic and non-dynamic rough set methods for extracting laws from decision tables. Rough Sets Knowl Discov 1:321–365MATH
Zurück zum Zitat Bertelsen R, Martinez TR (1994) Extending ID3 through discretization of continuous inputs. In: Proceedings of the 7th florida artificial intelligence research symposium, pp 122–125 Bertelsen R, Martinez TR (1994) Extending ID3 through discretization of continuous inputs. In: Proceedings of the 7th florida artificial intelligence research symposium, pp 122–125
Zurück zum Zitat Bieganowski J, Karatkevich A (2005) Heuristics for Thelen’s prime implicant method. Schedae Informaticae 14:125 Bieganowski J, Karatkevich A (2005) Heuristics for Thelen’s prime implicant method. Schedae Informaticae 14:125
Zurück zum Zitat Brayton RK, Hachtel GD, McMullen C, Sangiovanni-Vincentelli A (1984) Logic minimization algorithms for VLSI synthesis. Springer, BerlinCrossRef Brayton RK, Hachtel GD, McMullen C, Sangiovanni-Vincentelli A (1984) Logic minimization algorithms for VLSI synthesis. Springer, BerlinCrossRef
Zurück zum Zitat Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. The Wadsworth statisticsprobability series. Wadsworth International Group, Belmont, CA Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. The Wadsworth statisticsprobability series. Wadsworth International Group, Belmont, CA
Zurück zum Zitat Cai J (2006) Decision tree pruning using expert knowledge. University of Akron, Akron Cai J (2006) Decision tree pruning using expert knowledge. University of Akron, Akron
Zurück zum Zitat Carneiro N, Figueira G, Costa M (2017) A data mining based system for credit-card fraud detection in e-tail. Decis Support Syst 95:91–101CrossRef Carneiro N, Figueira G, Costa M (2017) A data mining based system for credit-card fraud detection in e-tail. Decis Support Syst 95:91–101CrossRef
Zurück zum Zitat Chen C (2015) Handbook of pattern recognition and computer vision. World Scientific, Singapore Chen C (2015) Handbook of pattern recognition and computer vision. World Scientific, Singapore
Zurück zum Zitat Clark P, Niblett T (1989) The CN2 induction algorithm. Mach Learn 3(4):261–283 Clark P, Niblett T (1989) The CN2 induction algorithm. Mach Learn 3(4):261–283
Zurück zum Zitat Cohen WW (1995) Fast effective rule induction. In: Proceedings of the twelfth international conference on machine learning, pp 115–123 Cohen WW (1995) Fast effective rule induction. In: Proceedings of the twelfth international conference on machine learning, pp 115–123
Zurück zum Zitat Grzymala-Busse JW, Stefanowski J (2001) Three discretization methods for rule induction. Int J Intell Syst 16(1):29–38CrossRef Grzymala-Busse JW, Stefanowski J (2001) Three discretization methods for rule induction. Int J Intell Syst 16(1):29–38CrossRef
Zurück zum Zitat Hall M, Frank E, Holmes G, Pfahringer B (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18CrossRef Hall M, Frank E, Holmes G, Pfahringer B (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18CrossRef
Zurück zum Zitat Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, AmsterdamMATH Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, AmsterdamMATH
Zurück zum Zitat Han J, Pei J, Kamber M (2012) Statistical comparisons of classifiers over multiple data sets, vol 7. Elsevier, Amsterdam Han J, Pei J, Kamber M (2012) Statistical comparisons of classifiers over multiple data sets, vol 7. Elsevier, Amsterdam
Zurück zum Zitat Hong S (1997) R-MINI: an iterative approach for generating minimal rules from examples. IEEE Trans Knowl Data Eng 9(5):709–717CrossRef Hong S (1997) R-MINI: an iterative approach for generating minimal rules from examples. IEEE Trans Knowl Data Eng 9(5):709–717CrossRef
Zurück zum Zitat Iman S, Pedram M (1998) Logic synthesis for low power VLSI designs. Springer Science & Business Media Iman S, Pedram M (1998) Logic synthesis for low power VLSI designs. Springer Science & Business Media
Zurück zum Zitat Jakubczyc J (2005) The ant colony algorithms for rule induction. In: Proceedings of AIML, pp 112–117 Jakubczyc J (2005) The ant colony algorithms for rule induction. In: Proceedings of AIML, pp 112–117
Zurück zum Zitat Kahramanli S (2015) A novel approach to logic-based sequential cover strategy. In: International technology management conference (ITMC2015), pp 48–53 Kahramanli S (2015) A novel approach to logic-based sequential cover strategy. In: International technology management conference (ITMC2015), pp 48–53
Zurück zum Zitat Kusunoki Y, Inuiguchi M, Stefanowski J (2008) Rule induction via clustering decision classes. Int J Innov Comput Inf Control 4(10):2663–2677 Kusunoki Y, Inuiguchi M, Stefanowski J (2008) Rule induction via clustering decision classes. Int J Innov Comput Inf Control 4(10):2663–2677
Zurück zum Zitat Michalski RS, Carbonell JG, Mitchell TM (1983) Machine learning: an artificial intelligence approach. Springer, BerlinCrossRef Michalski RS, Carbonell JG, Mitchell TM (1983) Machine learning: an artificial intelligence approach. Springer, BerlinCrossRef
Zurück zum Zitat Micheli G (1994) Synthesis and optimization of digital circuits. McGraw-Hill Higher Education, New York Micheli G (1994) Synthesis and optimization of digital circuits. McGraw-Hill Higher Education, New York
Zurück zum Zitat Miller R (1979) Switching theory. Krieger, Malabar Miller R (1979) Switching theory. Krieger, Malabar
Zurück zum Zitat Muresan S, Tzoukermann E, Klavans J (2001) Combining linguistic and machine learning techniques for email summarization. In: Proceedings of the 2001 workshop on computational natural language learning, vol 7 Muresan S, Tzoukermann E, Klavans J (2001) Combining linguistic and machine learning techniques for email summarization. In: Proceedings of the 2001 workshop on computational natural language learning, vol 7
Zurück zum Zitat Pal S, Skowron A (1999) Rough-fuzzy hybridization: a new trend in decision making. Springer, New YorkMATH Pal S, Skowron A (1999) Rough-fuzzy hybridization: a new trend in decision making. Springer, New YorkMATH
Zurück zum Zitat Shiva SG (1998) Introduction to logic design, 2nd edn, CRC Press Shiva SG (1998) Introduction to logic design, 2nd edn, CRC Press
Zurück zum Zitat Thelen B (1981) Investigations of algorithms for computer-aided logic design of digital circuits. PhD thesis, ITIV, Univ. of Karlsruhe Thelen B (1981) Investigations of algorithms for computer-aided logic design of digital circuits. PhD thesis, ITIV, Univ. of Karlsruhe
Zurück zum Zitat Zhao X (2011) A classification rule acquisition algorithm based on constrained concept lattice. Artif Intell Comput Intell 7002:356–363CrossRef Zhao X (2011) A classification rule acquisition algorithm based on constrained concept lattice. Artif Intell Comput Intell 7002:356–363CrossRef
Metadaten
Titel
A novel switching function approach for data mining classification problems
verfasst von
Mohammed Hussein Ibrahim
Mehmet Hacibeyoglu
Publikationsdatum
01.08.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 7/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04246-2

Weitere Artikel der Ausgabe 7/2020

Soft Computing 7/2020 Zur Ausgabe

Premium Partner