Skip to main content
Erschienen in: Soft Computing 10/2017

11.12.2015 | Methodologies and Application

Searching for the most significant rules: an evolutionary approach for subgroup discovery

verfasst von: Victoria Pachón, Jacinto Mata, Juan Luis Domínguez

Erschienen in: Soft Computing | Ausgabe 10/2017

Einloggen

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, a new genetic algorithm (GAR-SD\(^{+})\) for subgroup discovery tasks is described. The main feature of this new method is that it can work with both discrete and continuous attributes without previous discretization. The ranges of numeric attributes are obtained in the rules induction process itself. In this way, we ensure that these intervals are the most suitable for maximizing the quality measures. An experimental study was carried out to verify the performance of the method. GAR-SD\(^{+}\) was compared with other subgroup discovery methods by evaluating certain measures (such as number of rules, number of attributes, significance, unusualness, support and confidence). For subgroup discovery tasks, GAR-SD\(^{+}\) obtained good results compared with existing algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Alcalá-Fdez J, Sánchez L, García S, del Jesus MJ, Ventura S, Garrell JM, Otero J, Romero C, Bacardit J, Rivas VM, Fernández JC, Herrera F (2009) KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft Comput 13(13):307–318 Alcalá-Fdez J, Sánchez L, García S, del Jesus MJ, Ventura S, Garrell JM, Otero J, Romero C, Bacardit J, Rivas VM, Fernández JC, Herrera F (2009) KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft Comput 13(13):307–318
Zurück zum Zitat Atzmüller M, Puppe F (2006) SD-Map a fast algorithm for exhaustive subgroup discovery. In: Proceedings of the 10th European conference on principles and practice of knowledge discovery in databases (PKDD-06), pp 6–17 Atzmüller M, Puppe F (2006) SD-Map a fast algorithm for exhaustive subgroup discovery. In: Proceedings of the 10th European conference on principles and practice of knowledge discovery in databases (PKDD-06), pp 6–17
Zurück zum Zitat Bay SD, Pazzani MJ (2001) Detecting group differences. Mining contrast sets. Data Min Knowl Discov 5(3):213–246CrossRefMATH Bay SD, Pazzani MJ (2001) Detecting group differences. Mining contrast sets. Data Min Knowl Discov 5(3):213–246CrossRefMATH
Zurück zum Zitat Berlanga F, del Jesus MJ, González P, Herrera F, Mesonero M (2006) Multiobjective evolutionary induction of subgroup discovery fuzzy rules: a case study in marketing. In: Perner P (ed) ICDM 2006. LNCS, vol 4065. Springer, pp 337–349 (2006) Berlanga F, del Jesus MJ, González P, Herrera F, Mesonero M (2006) Multiobjective evolutionary induction of subgroup discovery fuzzy rules: a case study in marketing. In: Perner P (ed) ICDM 2006. LNCS, vol 4065. Springer, pp 337–349 (2006)
Zurück zum Zitat Carmona CJ, González P, del Jesús MJ, Herrera F (2010) NMEEF-SD: non-dominated multi-objective evolutionary algorithm for extracting fuzzy rules in subgroup discovery. IEEE Trans Fuzzy Syst 18(5):958–970CrossRef Carmona CJ, González P, del Jesús MJ, Herrera F (2010) NMEEF-SD: non-dominated multi-objective evolutionary algorithm for extracting fuzzy rules in subgroup discovery. IEEE Trans Fuzzy Syst 18(5):958–970CrossRef
Zurück zum Zitat Chen MY (2014) A high-order fuzzy time series forecasting model for Internet stock trading. Future Gen Comput Syst—Int J Grid Comput eSci 37:461–467CrossRef Chen MY (2014) A high-order fuzzy time series forecasting model for Internet stock trading. Future Gen Comput Syst—Int J Grid Comput eSci 37:461–467CrossRef
Zurück zum Zitat Chen MY (2013) A hybrid ANFIS model for business failure prediction–utilization of particle swarm optimization and subtractive clustering. Inf Sci 220:180–195CrossRef Chen MY (2013) A hybrid ANFIS model for business failure prediction–utilization of particle swarm optimization and subtractive clustering. Inf Sci 220:180–195CrossRef
Zurück zum Zitat Chen MY, Fan MH, Chen YL, Wei HM (2013) Design of experiments on neural network’s parameters optimization for time series forecasting in stock markets. Neural Netw World 23(4):369–393CrossRef Chen MY, Fan MH, Chen YL, Wei HM (2013) Design of experiments on neural network’s parameters optimization for time series forecasting in stock markets. Neural Netw World 23(4):369–393CrossRef
Zurück zum Zitat del Jesús MJ, González P, Herrera F, Mesonero M (2007) Evolutionary fuzzy rule induction process for subgroup discovery. A case study in marketing. IEEE Trans Fuzzy Syst 15(4):578–592CrossRef del Jesús MJ, González P, Herrera F, Mesonero M (2007) Evolutionary fuzzy rule induction process for subgroup discovery. A case study in marketing. IEEE Trans Fuzzy Syst 15(4):578–592CrossRef
Zurück zum Zitat Dong G , Li J (1999) Efficient mining of emerging patterns. Discovering trends and differences. In: Proccedings of the 5th ACM SIGKDD international conference on knowledge discovery and data mining, pp 43–52 Dong G , Li J (1999) Efficient mining of emerging patterns. Discovering trends and differences. In: Proccedings of the 5th ACM SIGKDD international conference on knowledge discovery and data mining, pp 43–52
Zurück zum Zitat Fayyad U, Irani KB (1990) Multi-interval discretization of continuous-valued attributes for classification learning. In: 13th international joint conference on artificial intelligence, pp 1022–1029 Fayyad U, Irani KB (1990) Multi-interval discretization of continuous-valued attributes for classification learning. In: 13th international joint conference on artificial intelligence, pp 1022–1029
Zurück zum Zitat Guan Y-Y, Wang H-K, Wang Y, Yang F (2009) Attribute reduction and optimal decision rules acquisition for continuous valued information systems. Inf Sci 179:2974–2984 (8/5)MathSciNetCrossRefMATH Guan Y-Y, Wang H-K, Wang Y, Yang F (2009) Attribute reduction and optimal decision rules acquisition for continuous valued information systems. Inf Sci 179:2974–2984 (8/5)MathSciNetCrossRefMATH
Zurück zum Zitat Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. Wesley Longman, ReadingMATH Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. Wesley Longman, ReadingMATH
Zurück zum Zitat Grosskreutz H, Rüping S (2009) On subgroup discovery in numerical domains. Data Min Knowl Discov 19:210–226MathSciNetCrossRef Grosskreutz H, Rüping S (2009) On subgroup discovery in numerical domains. Data Min Knowl Discov 19:210–226MathSciNetCrossRef
Zurück zum Zitat Grosskreutz H, Rüping S, Wrobel S (2008) Tight optimistic estimates for fast subgroup discovery. In: Proceedings of the ECML/PKDD. Lecture notes in artificial intelligence, vol 5211. Springer, pp 440–456 Grosskreutz H, Rüping S, Wrobel S (2008) Tight optimistic estimates for fast subgroup discovery. In: Proceedings of the ECML/PKDD. Lecture notes in artificial intelligence, vol 5211. Springer, pp 440–456
Zurück zum Zitat Herrera F (2008) Genetic fuzzy systems: taxonomy, current research trends and propects. Evolut Intell 1:27–46CrossRef Herrera F (2008) Genetic fuzzy systems: taxonomy, current research trends and propects. Evolut Intell 1:27–46CrossRef
Zurück zum Zitat Kavsek B, Lavrač N (2006) APRIORI-SD: adapting association rule learning to subgroup discovery. Appl Artif Intell 20(7):543–583CrossRef Kavsek B, Lavrač N (2006) APRIORI-SD: adapting association rule learning to subgroup discovery. Appl Artif Intell 20(7):543–583CrossRef
Zurück zum Zitat Klösgen W, May M (2002) Spatial subgroup mining integrated in an object-relational spatial database. In Proccedings of the 6th European conference on principles and practice of KDD, pp 275–286 Klösgen W, May M (2002) Spatial subgroup mining integrated in an object-relational spatial database. In Proccedings of the 6th European conference on principles and practice of KDD, pp 275–286
Zurück zum Zitat Klösgen W (1996) Explora: a multipattern and multistrategy discovery assistant. In: Advances in knowledge discovery and data mining, pp 249–271 Klösgen W (1996) Explora: a multipattern and multistrategy discovery assistant. In: Advances in knowledge discovery and data mining, pp 249–271
Zurück zum Zitat Lavrač N, Flach P, Zupan B (1999) Rule evaluation measures: a unifying view. In: Proceedings of the 9th international workshop on inductive logic programming (ILP-99). LNCS, vol 1634. Springer, pp 174–183 Lavrač N, Flach P, Zupan B (1999) Rule evaluation measures: a unifying view. In: Proceedings of the 9th international workshop on inductive logic programming (ILP-99). LNCS, vol 1634. Springer, pp 174–183
Zurück zum Zitat Lavrač N, Kavsek B, Flach P, Todorovski L (2004) Subgroup discovery with CN2-SD. J Mach Learn Res 5:153–188MathSciNet Lavrač N, Kavsek B, Flach P, Todorovski L (2004) Subgroup discovery with CN2-SD. J Mach Learn Res 5:153–188MathSciNet
Zurück zum Zitat Lemmerich F, Puppe F (2011) Local models for expectation-driven subgroup discovery. In: Proceedings of the IEEE international conference on data mining (ICDM). IEEE, Washington, DC, pp 360–369 Lemmerich F, Puppe F (2011) Local models for expectation-driven subgroup discovery. In: Proceedings of the IEEE international conference on data mining (ICDM). IEEE, Washington, DC, pp 360–369
Zurück zum Zitat Lemmerich F, Rohlfs M, Atzmueller M (2010) Fast discovery of relevant subgroup patterns. In: Proceedings of the 23rd international FLAIRS conference. AAAI Press, pp 428–433 Lemmerich F, Rohlfs M, Atzmueller M (2010) Fast discovery of relevant subgroup patterns. In: Proceedings of the 23rd international FLAIRS conference. AAAI Press, pp 428–433
Zurück zum Zitat Mata J, Alvarez JL, Riquelme JC (2002) Discovering numeric association rules via evolutionary algorithm. In: Proccedings of the of PAKDD 2002. Springer, pp 40–51 Mata J, Alvarez JL, Riquelme JC (2002) Discovering numeric association rules via evolutionary algorithm. In: Proccedings of the of PAKDD 2002. Springer, pp 40–51
Zurück zum Zitat Novak PK, Lavrač N, Webb GI (2009) Supervised descriptive rule discovery: a unifying survey of contrast set, emerging pattern and subgroup mining. J Mach Learn Res 10:377–403 Novak PK, Lavrač N, Webb GI (2009) Supervised descriptive rule discovery: a unifying survey of contrast set, emerging pattern and subgroup mining. J Mach Learn Res 10:377–403
Zurück zum Zitat Pachón V, Mata J (2012) An evolutionary algorithm to discover quantitative association rules from huge databases without the need for an a priori discretization. Expert Syst Appl 39(1):585–593 Pachón V, Mata J (2012) An evolutionary algorithm to discover quantitative association rules from huge databases without the need for an a priori discretization. Expert Syst Appl 39(1):585–593
Zurück zum Zitat Pachón V, Mata J, Domínguez JL, Maña MJ (2011) A multi-objective evolutionary approach for subgroup discovery. In: Corchado E, Kurzynski M, Wozniak M (eds) Proceedings of the 6th international conference on hybrid artificial intelligent systems–volume part II (HAIS’11). Springer, Berlin, Heidelberg, pp 271–279 Pachón V, Mata J, Domínguez JL, Maña MJ (2011) A multi-objective evolutionary approach for subgroup discovery. In: Corchado E, Kurzynski M, Wozniak M (eds) Proceedings of the 6th international conference on hybrid artificial intelligent systems–volume part II (HAIS’11). Springer, Berlin, Heidelberg, pp 271–279
Zurück zum Zitat Rodríguez D, Ruiz R, Riquelme JC, Aguilar-Ruiz JS (2012) Searching for rules to detect defective modules: a subgroup discovery approach. Inf Sci 191(15):14–30CrossRef Rodríguez D, Ruiz R, Riquelme JC, Aguilar-Ruiz JS (2012) Searching for rules to detect defective modules: a subgroup discovery approach. Inf Sci 191(15):14–30CrossRef
Zurück zum Zitat Terlecki P, Walczak K (2007) On the relation between rough set reducts and jumping emerging patterns. Inf Sci 177:74–83 (1/1)MathSciNetCrossRefMATH Terlecki P, Walczak K (2007) On the relation between rough set reducts and jumping emerging patterns. Inf Sci 177:74–83 (1/1)MathSciNetCrossRefMATH
Zurück zum Zitat Tsai C-J, Lee C-I, Yang W-P (2008) A discretization algorithm based on class-attribute contingency coefficient. Inf Sci 178:714–731 (2/1)CrossRef Tsai C-J, Lee C-I, Yang W-P (2008) A discretization algorithm based on class-attribute contingency coefficient. Inf Sci 178:714–731 (2/1)CrossRef
Zurück zum Zitat Wrobel S (1997) An algorithm for multi-relational discovery of subgroups. In: Proccedings of the 1st European conference on principles of data mining and knowledge discovery (PKDD-97), pp 78–87 Wrobel S (1997) An algorithm for multi-relational discovery of subgroups. In: Proccedings of the 1st European conference on principles of data mining and knowledge discovery (PKDD-97), pp 78–87
Zurück zum Zitat Zelezny F, Lavrač N (2006) Propositionalization-based relational subgroup discovery with RSD. Mach Learn 62:33–63CrossRef Zelezny F, Lavrač N (2006) Propositionalization-based relational subgroup discovery with RSD. Mach Learn 62:33–63CrossRef
Metadaten
Titel
Searching for the most significant rules: an evolutionary approach for subgroup discovery
verfasst von
Victoria Pachón
Jacinto Mata
Juan Luis Domínguez
Publikationsdatum
11.12.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 10/2017
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-015-1961-5

Weitere Artikel der Ausgabe 10/2017

Soft Computing 10/2017 Zur Ausgabe