Skip to main content
Erschienen in: Evolutionary Intelligence 2/2012

01.06.2012 | Special Issue

Analysing BioHEL using challenging boolean functions

verfasst von: María A. Franco, Natalio Krasnogor, Jaume Bacardit

Erschienen in: Evolutionary Intelligence | Ausgabe 2/2012

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work we present an extensive empirical analysis of the BioHEL genetics-based machine learning system using the k-Disjunctive Normal Form (k-DNF) family of boolean functions. These functions present a broad set of possible challenges for most machine learning techniques, such as different degrees of specificity, class imbalance and niche overlap. Moreover, as the ideal solutions are known, it is possible to assess if a learning system is able to find them, and how fast. Specifically, we study two aspects of BioHEL: its sensitivity to the coverage breakpoint parameter (that determines the degree of generality pressure applied by the fitness function) and the impact of the default rule policy. The results show that BioHEL is highly sensitive to the choice of coverage breakpoint and that using a default class suitable for the problem allows the system to learn faster than using other default class policies (e.g. the majority class policy). Moreover, the experiments indicate that BioHEL’s scalability depends directly on both k (the specificity of the k-DNF terms) and the number of terms in the problem. In the last part of the paper we discuss alternative policies to adjust the coverage breakpoint parameter.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The number of examples needed to learn a classification problem.
 
Literatur
1.
Zurück zum Zitat Bacardit J (2004) Pittsburgh Genetics-Based machine learning in the data mining era: representations, generalization, and run-time. phdthesis. Ramon Llull University, Barcelona, Spain Bacardit J (2004) Pittsburgh Genetics-Based machine learning in the data mining era: representations, generalization, and run-time. phdthesis. Ramon Llull University, Barcelona, Spain
3.
Zurück zum Zitat Bacardit J, Garrell JM (2003) Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system. In: Proceedings of the 6th International Workshop on Learning Classifier Systems Bacardit J, Garrell JM (2003) Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system. In: Proceedings of the 6th International Workshop on Learning Classifier Systems
4.
Zurück zum Zitat Bacardit J, Goldberg DE, Butz MV (2007) Improving the performance of a pittsburgh learning classifier system using a default rule. In: Learning classifier systems, revised selected papers of the international workshop on learning classifier systems 2003–2005. Springer, LNCS 4399, pp. 291–307 Bacardit J, Goldberg DE, Butz MV (2007) Improving the performance of a pittsburgh learning classifier system using a default rule. In: Learning classifier systems, revised selected papers of the international workshop on learning classifier systems 2003–2005. Springer, LNCS 4399, pp. 291–307
5.
Zurück zum Zitat Bacardit J, Goldberg DE, Butz MV, Llorá X, Garrell JM (2004) Speeding-Up pittsburgh learning classifier systems: modeling time and accuracy. In: Parallel problem solving from nature—PPSN VIII, Lecture Notes in Computer Science, vol. 3242, chap. 103. Springer, Berlin, Heidelberg, pp 1021–1031. http://www.springerlink.com/content/66w8u56a61wntqa6 Bacardit J, Goldberg DE, Butz MV, Llorá X, Garrell JM (2004) Speeding-Up pittsburgh learning classifier systems: modeling time and accuracy. In: Parallel problem solving from nature—PPSN VIII, Lecture Notes in Computer Science, vol. 3242, chap. 103. Springer, Berlin, Heidelberg, pp 1021–1031. http://​www.​springerlink.​com/​content/​66w8u56a61wntqa6​
6.
Zurück zum Zitat Bacardit J, Hirst JD, Stout M, Blazewicz J, Krasnogor N (2006) Coordination number prediction using learning classifier systems: performance and interpretability. In: In GECCO ’06: Proceedings of the 8th annual conference on Genetic and evolutionary computation. ACM Press, New York, NY, pp 247–254 Bacardit J, Hirst JD, Stout M, Blazewicz J, Krasnogor N (2006) Coordination number prediction using learning classifier systems: performance and interpretability. In: In GECCO ’06: Proceedings of the 8th annual conference on Genetic and evolutionary computation. ACM Press, New York, NY, pp 247–254
7.
Zurück zum Zitat Bacardit J, Krasnogor N (2009) A mixed discrete-continuous attribute list representation for large scale classification domains. In: GECCO ’09: Proceedings of the 11th annual conference on genetic and evolutionary computation, pp 1155–1162. ACM Press, New York, NY. doi:10.1145/1569901.1570057 Bacardit J, Krasnogor N (2009) A mixed discrete-continuous attribute list representation for large scale classification domains. In: GECCO ’09: Proceedings of the 11th annual conference on genetic and evolutionary computation, pp 1155–1162. ACM Press, New York, NY. doi:10.​1145/​1569901.​1570057
8.
Zurück zum Zitat Bacardit J, Stout M, Hirst JD, Sastry K, Llorà X, Krasnogor N (2007) Automated alphabet reduction method with evolutionary algorithms for protein structure prediction. In: GECCO ’07: Proceedings of the 9th annual conference on genetic and evolutionary computation. ACM, New York, NY, pp 346–353. doi:10.1145/1276958.1277033 Bacardit J, Stout M, Hirst JD, Sastry K, Llorà X, Krasnogor N (2007) Automated alphabet reduction method with evolutionary algorithms for protein structure prediction. In: GECCO ’07: Proceedings of the 9th annual conference on genetic and evolutionary computation. ACM, New York, NY, pp 346–353. doi:10.​1145/​1276958.​1277033
9.
Zurück zum Zitat Bacardit J, Stout M, Hirst JD, Valencia A, Smith R, Krasnogor N (2009) Automated alphabet reduction for protein datasets. BMC Bioinformatics 10(1):6. doi:10.1186/1471-2105-10-6 Bacardit J, Stout M, Hirst JD, Valencia A, Smith R, Krasnogor N (2009) Automated alphabet reduction for protein datasets. BMC Bioinformatics 10(1):6. doi:10.​1186/​1471-2105-10-6
10.
Zurück zum Zitat Bassel GW, Glaab E, Marquez J, Holdsworth MJ, Bacardit J (2011) Functional network construction in arabidopsis using rule-based machine learning on large-scale data sets. Plant Cell Online 23(9):3101–3116. doi:10.1105/tpc.111.088153 CrossRef Bassel GW, Glaab E, Marquez J, Holdsworth MJ, Bacardit J (2011) Functional network construction in arabidopsis using rule-based machine learning on large-scale data sets. Plant Cell Online 23(9):3101–3116. doi:10.​1105/​tpc.​111.​088153 CrossRef
11.
Zurück zum Zitat Butz MV (2006) Rule-based evolutionary online learning systems: a principled approach to LCS analysis and design, studies in fuzziness and soft computing. vol 109, Springer, Berlin Butz MV (2006) Rule-based evolutionary online learning systems: a principled approach to LCS analysis and design, studies in fuzziness and soft computing. vol 109, Springer, Berlin
12.
Zurück zum Zitat Butz MV, Pelikan M (2006) Studying XCS/BOA learning in boolean functions: structure encoding and random boolean functions. In: GECCO ’06: Proceedings of the 8th annual conference on Genetic and evolutionary computation. ACM, New York, NY, pp 1449–456. doi:10.1145/1143997.1144236 Butz MV, Pelikan M (2006) Studying XCS/BOA learning in boolean functions: structure encoding and random boolean functions. In: GECCO ’06: Proceedings of the 8th annual conference on Genetic and evolutionary computation. ACM, New York, NY, pp 1449–456. doi:10.​1145/​1143997.​1144236
13.
Zurück zum Zitat Ehrenfeucht A, Haussler D, Kearns MJ, Valiant L (1988) A general lower bound on the number of examples needed for learning. In: Proceedings of the first annual workshop on Computational learning theory. Morgan Kaufmann Publishers Inc., MIT, Cambridge, MA, pp 139–154. http://portal.acm.org/citation.cfm?id=93068 Ehrenfeucht A, Haussler D, Kearns MJ, Valiant L (1988) A general lower bound on the number of examples needed for learning. In: Proceedings of the first annual workshop on Computational learning theory. Morgan Kaufmann Publishers Inc., MIT, Cambridge, MA, pp 139–154. http://​portal.​acm.​org/​citation.​cfm?​id=​93068
14.
Zurück zum Zitat Franco MA, Krasnogor N, Bacardit J (2010) Speeding up the evaluation of evolutionary learning systems using GPGPUs. In: GECCO ’10: Proceedings of the 12th annual conference on genetic and evolutionary computation. ACM, New York, NY, pp. 1039–1046. doi:10.1145/1830483.1830672 Franco MA, Krasnogor N, Bacardit J (2010) Speeding up the evaluation of evolutionary learning systems using GPGPUs. In: GECCO ’10: Proceedings of the 12th annual conference on genetic and evolutionary computation. ACM, New York, NY, pp. 1039–1046. doi:10.​1145/​1830483.​1830672
15.
Zurück zum Zitat Franco MA, Krasnogor N, Bacardit J (2011) Modelling the initialisation stage of the alkr representation for discrete domains and gabil encoding. In: Proceedings of the 13th annual conference on Genetic and evolutionary computation, GECCO ’11. ACM, New York, NY, pp 1291–1298. doi:10.1145/2001576.2001750 Franco MA, Krasnogor N, Bacardit J (2011) Modelling the initialisation stage of the alkr representation for discrete domains and gabil encoding. In: Proceedings of the 13th annual conference on Genetic and evolutionary computation, GECCO ’11. ACM, New York, NY, pp 1291–1298. doi:10.​1145/​2001576.​2001750
16.
Zurück zum Zitat Hernández-Aguirre A, Buckles BP, Coello CAC (2001) On learning kDNF n s s boolean formulas. In: Evolvable hardware, NASA/DoD conference on, vol 0. IEEE Computer Society, Los Alamitos, CA, p 0240.doi:10.1109/EH.2001.937967 Hernández-Aguirre A, Buckles BP, Coello CAC (2001) On learning kDNF n s s boolean formulas. In: Evolvable hardware, NASA/DoD conference on, vol 0. IEEE Computer Society, Los Alamitos, CA, p 0240.doi:10.​1109/​EH.​2001.​937967
17.
Zurück zum Zitat Hirschberg DS, Pazzani MJ, Ali KM (1994) Average case analysis of k-CNF and k-DNF learning algorithms. In: Proceedings of the workshop on computational learning theory and natural learning systems (vol 2): intersections between theory and experiment. MIT Press, Cambridge, MA, pp 15–28 Hirschberg DS, Pazzani MJ, Ali KM (1994) Average case analysis of k-CNF and k-DNF learning algorithms. In: Proceedings of the workshop on computational learning theory and natural learning systems (vol 2): intersections between theory and experiment. MIT Press, Cambridge, MA, pp 15–28
18.
Zurück zum Zitat Ioannides C, Barrett G, Eder K (2011) Xcs cannot learn all boolean functions. In: Proceedings of the 13th annual conference on Genetic and evolutionary computation, GECCO ’11, pp. 1283–1290. ACM, New York, NY. doi:10.1145/2001576.2001749 Ioannides C, Barrett G, Eder K (2011) Xcs cannot learn all boolean functions. In: Proceedings of the 13th annual conference on Genetic and evolutionary computation, GECCO ’11, pp. 1283–1290. ACM, New York, NY. doi:10.​1145/​2001576.​2001749
19.
20.
Zurück zum Zitat Kearns MJ (1990) The computational complexity of machine learning. MIT Press, Cambridge, MA Kearns MJ (1990) The computational complexity of machine learning. MIT Press, Cambridge, MA
23.
26.
Zurück zum Zitat Venturini G (1993) SIA: a supervised inductive algorithm with genetic search for learning attributes based concepts. In: Brazdil PB (eds), Machine learning: ECML-93—Proceedings of the European Conference on Machine Learning. Springer, New York, pp 280–296 Venturini G (1993) SIA: a supervised inductive algorithm with genetic search for learning attributes based concepts. In: Brazdil PB (eds), Machine learning: ECML-93—Proceedings of the European Conference on Machine Learning. Springer, New York, pp 280–296
28.
Zurück zum Zitat Wilson SW (2001) Mining oblique data with XCS. In: Luca Lanzi P, Stolzmann W, Wilson S (eds), Advances in learning classifier systems, lecture notes in computer science, vol 1996. Springer, Berlin/Heidelberg, pp 283–290. doi:10.1007/3-540-44640-0_11 Wilson SW (2001) Mining oblique data with XCS. In: Luca Lanzi P, Stolzmann W, Wilson S (eds), Advances in learning classifier systems, lecture notes in computer science, vol 1996. Springer, Berlin/Heidelberg, pp 283–290. doi:10.​1007/​3-540-44640-0_​11
29.
Zurück zum Zitat Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques. Morgan Kaufmann, Waltham, MAMATH Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques. Morgan Kaufmann, Waltham, MAMATH
Metadaten
Titel
Analysing BioHEL using challenging boolean functions
verfasst von
María A. Franco
Natalio Krasnogor
Jaume Bacardit
Publikationsdatum
01.06.2012
Verlag
Springer-Verlag
Erschienen in
Evolutionary Intelligence / Ausgabe 2/2012
Print ISSN: 1864-5909
Elektronische ISSN: 1864-5917
DOI
https://doi.org/10.1007/s12065-012-0080-9

Weitere Artikel der Ausgabe 2/2012

Evolutionary Intelligence 2/2012 Zur Ausgabe

Premium Partner