Skip to main content

2019 | OriginalPaper | Buchkapitel

A Genetic Programming Approach Applied to Feature Selection from Medical Data

verfasst von : José A. Castellanos-Garzón, Juan Ramos, Yeray Mezquita Martín, Juan F. de Paz, Ernesto Costa

Erschienen in: Practical Applications of Computational Biology and Bioinformatics, 12th International Conference

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Genetic programming represents a flexible and powerful evolutionary technique in machine learning. The use of genetic programming for rule induction has generated interesting results in classification problems. This paper proposes an evolutionary approach for logical rule induction, which is applied to clinical data. Since logical rules disclose knowledge from the analyzed data, we use such a knowledge to filter features from the target dataset. The results reached by the used dataset have been very promising when used in classification tasks and compared with other methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Bonelli, P., Parodi, A.: An efficient classifier system and its experimental comparison with two representative learning methods on three medical domains. In: Proceedings of the 4th International Conference on Genetic Algorithms (ICGA), pp. 288–295 (1991) Bonelli, P., Parodi, A.: An efficient classifier system and its experimental comparison with two representative learning methods on three medical domains. In: Proceedings of the 4th International Conference on Genetic Algorithms (ICGA), pp. 288–295 (1991)
3.
Zurück zum Zitat Hong, J.H., Cho, S.B.: The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming. Artif. Intell. Med. 36, 43–58 (2006)CrossRef Hong, J.H., Cho, S.B.: The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming. Artif. Intell. Med. 36, 43–58 (2006)CrossRef
4.
Zurück zum Zitat Kumar, T.P., Iba, H.: Prediction of cancer class with majority voting genetic programming classifier using gene expression data. IEEE/ACM Trans. Comput. Biol. Bioinf. 6, 353–367 (2009)CrossRef Kumar, T.P., Iba, H.: Prediction of cancer class with majority voting genetic programming classifier using gene expression data. IEEE/ACM Trans. Comput. Biol. Bioinf. 6, 353–367 (2009)CrossRef
5.
Zurück zum Zitat Kumar, R., Verma, R.: Classification rule discovery for diabetes patients by using genetic programming. Int. J. Soft Comput. Eng. (IJSCE) 2, 183–185 (2012) Kumar, R., Verma, R.: Classification rule discovery for diabetes patients by using genetic programming. Int. J. Soft Comput. Eng. (IJSCE) 2, 183–185 (2012)
6.
Zurück zum Zitat Larraaga, P., et al.: Machine learning in bioinformatics. Briefings Bioinf. 7, 86–112 (2006)CrossRef Larraaga, P., et al.: Machine learning in bioinformatics. Briefings Bioinf. 7, 86–112 (2006)CrossRef
7.
Zurück zum Zitat Liu, K.H., Xu, C.G.: A genetic programming-based approach to the classification of multiclass microarray datasets. Bioinformatics 25, 331–337 (2009)CrossRef Liu, K.H., Xu, C.G.: A genetic programming-based approach to the classification of multiclass microarray datasets. Bioinformatics 25, 331–337 (2009)CrossRef
9.
Zurück zum Zitat Pea-Reyes, C.A., Sipper, M.: Evolutionary computation in medicine: an overview. Artif. Intell. Med. 19, 1–23 (2000)CrossRef Pea-Reyes, C.A., Sipper, M.: Evolutionary computation in medicine: an overview. Artif. Intell. Med. 19, 1–23 (2000)CrossRef
10.
Zurück zum Zitat Podgorelec, V., Kokol, P., Stiglic, M.M., Hericko, M., Rozrnan, I.: Knowledge discovery with classification rules in a cardiovascular dataset. Comput. Methods Program. Biomed. 1, 539–549 (2005) Podgorelec, V., Kokol, P., Stiglic, M.M., Hericko, M., Rozrnan, I.: Knowledge discovery with classification rules in a cardiovascular dataset. Comput. Methods Program. Biomed. 1, 539–549 (2005)
11.
Zurück zum Zitat Soni, J., Ansari, U., Sharma, D., Soni, S.: Intelligent and effective heart disease prediction system using weighted associative classifiers. Int. J. Comput. Sci. Eng. (IJCSE) 3, 2385–2392 (2011) Soni, J., Ansari, U., Sharma, D., Soni, S.: Intelligent and effective heart disease prediction system using weighted associative classifiers. Int. J. Comput. Sci. Eng. (IJCSE) 3, 2385–2392 (2011)
12.
Zurück zum Zitat Tsakonas, A., Dounias, G., Jantzen, J., Axer, H., Bjerregaard, B., von Keyserlingk, D.G.: Evolving rule-based systems in two medical domains using genetic programming. Artif. Intell. Med. 32, 195–216 (2004)CrossRef Tsakonas, A., Dounias, G., Jantzen, J., Axer, H., Bjerregaard, B., von Keyserlingk, D.G.: Evolving rule-based systems in two medical domains using genetic programming. Artif. Intell. Med. 32, 195–216 (2004)CrossRef
13.
Zurück zum Zitat Vargas, C.M.B., Chidambaram, C., Hembecker, F., Silvério, H.L.: A comparative study of machine learning and evolutionary computation approaches for protein secondary structure classification. In: Computational Biology and Applied Bioinformatics, pp. 239–258. InTech (2011) Vargas, C.M.B., Chidambaram, C., Hembecker, F., Silvério, H.L.: A comparative study of machine learning and evolutionary computation approaches for protein secondary structure classification. In: Computational Biology and Applied Bioinformatics, pp. 239–258. InTech (2011)
14.
Zurück zum Zitat Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc. Natl. Acad. Sci. USA 87, 9193–9196 (1990)CrossRef Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc. Natl. Acad. Sci. USA 87, 9193–9196 (1990)CrossRef
16.
Zurück zum Zitat Lucas, P.: Prognostic methods in medicine. Artif. Intell. 15, 105–119 (1999)CrossRef Lucas, P.: Prognostic methods in medicine. Artif. Intell. 15, 105–119 (1999)CrossRef
17.
Zurück zum Zitat Ramos, J., Castellanos-Garzón, J.A., González-Briones, A., de Paz, J.F., Corchado, J.M.: An agent-based clustering approach for gene selection in gene expression microarray. Interdiscip. Sci. Comput. Life Sci. 9, 1–13 (2017)CrossRef Ramos, J., Castellanos-Garzón, J.A., González-Briones, A., de Paz, J.F., Corchado, J.M.: An agent-based clustering approach for gene selection in gene expression microarray. Interdiscip. Sci. Comput. Life Sci. 9, 1–13 (2017)CrossRef
18.
Zurück zum Zitat Castellanos-Garzón, J.A., Ramos, J., González-Briones, A., de Paz, J.F.: A clustering-based method for gene selection to classify tissue samples in lung cancer. In: Saberi Mohamad, M., Rocha, M., Fdez-Riverola, F., Domínguez Mayo, F., De Paz, J. (eds.) PACBB 2016. AISC, vol. 477, pp. 99–107. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-40126-3_11CrossRef Castellanos-Garzón, J.A., Ramos, J., González-Briones, A., de Paz, J.F.: A clustering-based method for gene selection to classify tissue samples in lung cancer. In: Saberi Mohamad, M., Rocha, M., Fdez-Riverola, F., Domínguez Mayo, F., De Paz, J. (eds.) PACBB 2016. AISC, vol. 477, pp. 99–107. Springer, Cham (2016). https://​doi.​org/​10.​1007/​978-3-319-40126-3_​11CrossRef
19.
Zurück zum Zitat Castellanos-Garzón, J.A., Ramos, J.: A gene selection approach based on clustering for classification tasks in colon cancer. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(3), 1–10 (2015) Castellanos-Garzón, J.A., Ramos, J.: A gene selection approach based on clustering for classification tasks in colon cancer. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(3), 1–10 (2015)
20.
Zurück zum Zitat González-Briones, A., Ramos, J., De Paz, J.F.: A drug identification system for intoxicated drivers based on a systematic review. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(4), 83–101 (2015) González-Briones, A., Ramos, J., De Paz, J.F.: A drug identification system for intoxicated drivers based on a systematic review. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(4), 83–101 (2015)
21.
Zurück zum Zitat Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 40(2), 121–144 (2010)CrossRef Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 40(2), 121–144 (2010)CrossRef
22.
Zurück zum Zitat Pappa, G.L., Freitas, A.A.: Evolving rule induction algorithms with multi-objective grammar-based genetic programming. Knowl. Inf. Syst. 19(3), 283–309 (2009)CrossRef Pappa, G.L., Freitas, A.A.: Evolving rule induction algorithms with multi-objective grammar-based genetic programming. Knowl. Inf. Syst. 19(3), 283–309 (2009)CrossRef
23.
Zurück zum Zitat Alcalá-Fdez, J., et al.: KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft. Comput. 13, 307–318 (2009)CrossRef Alcalá-Fdez, J., et al.: KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft. Comput. 13, 307–318 (2009)CrossRef
24.
Zurück zum Zitat Fernández, A., García, S., Luengo, J., Bernadó-Mansilla, E., Herrera, F.: Genetics-based machine learning for rule induction: state of the art, taxonomy, and comparative study. IEEE Trans. Evol. Comput. 14(6), 913–941 (2010)CrossRef Fernández, A., García, S., Luengo, J., Bernadó-Mansilla, E., Herrera, F.: Genetics-based machine learning for rule induction: state of the art, taxonomy, and comparative study. IEEE Trans. Evol. Comput. 14(6), 913–941 (2010)CrossRef
25.
Zurück zum Zitat Oyebode, O.K., Adeyemo, J.A.: Genetic programming: principles, applications and opportunities for hydrological modelling. World Acad. Sci. Eng. Technol. Int. J. Environ. Ecol. Geol. Min. Eng. 8, 310–316 (2014) Oyebode, O.K., Adeyemo, J.A.: Genetic programming: principles, applications and opportunities for hydrological modelling. World Acad. Sci. Eng. Technol. Int. J. Environ. Ecol. Geol. Min. Eng. 8, 310–316 (2014)
26.
Zurück zum Zitat Freitas, A.A.: A survey of evolutionary algorithms for data mining and knowledge discovery. In: Ghosh, A., Tsutsui, S. (eds.) Advances in Evolutionary Computation, pp. 819–845. Springer, Heidelberg (2002) Freitas, A.A.: A survey of evolutionary algorithms for data mining and knowledge discovery. In: Ghosh, A., Tsutsui, S. (eds.) Advances in Evolutionary Computation, pp. 819–845. Springer, Heidelberg (2002)
28.
Zurück zum Zitat Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)MATH Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)MATH
29.
Zurück zum Zitat Flach, P.: Machine Learning: The Art and Science of Algorithms that Make Sense of Data. Cambridge University Press, Cambridge (2012)CrossRef Flach, P.: Machine Learning: The Art and Science of Algorithms that Make Sense of Data. Cambridge University Press, Cambridge (2012)CrossRef
31.
Zurück zum Zitat Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning, Tools and Techniques, 3rd edn. Elsevier Inc., Waltham (2011) Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning, Tools and Techniques, 3rd edn. Elsevier Inc., Waltham (2011)
32.
Zurück zum Zitat Bacardit, J., Goldberg, D.E., Butz, M.V.: Improving the performance of a pittsburgh learning classifier system using a default rule. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003-2005. LNCS (LNAI), vol. 4399, pp. 291–307. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-71231-2_20CrossRef Bacardit, J., Goldberg, D.E., Butz, M.V.: Improving the performance of a pittsburgh learning classifier system using a default rule. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003-2005. LNCS (LNAI), vol. 4399, pp. 291–307. Springer, Heidelberg (2007). https://​doi.​org/​10.​1007/​978-3-540-71231-2_​20CrossRef
33.
Zurück zum Zitat Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, Reading (1989)MATH Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, Reading (1989)MATH
34.
Zurück zum Zitat Blake, C., Merz, C.: Repository of machine learning databases (UCI). Center for Machine Learning and Intelligent Systems (1998) Blake, C., Merz, C.: Repository of machine learning databases (UCI). Center for Machine Learning and Intelligent Systems (1998)
35.
Zurück zum Zitat Kononenko, I., Simec, E., Robnik-Sikonja, M.: Overcoming the myopia of inductive learning algorithms with RELIEFF. Appl. Intell. 7(1), 39–55 (1997)CrossRef Kononenko, I., Simec, E., Robnik-Sikonja, M.: Overcoming the myopia of inductive learning algorithms with RELIEFF. Appl. Intell. 7(1), 39–55 (1997)CrossRef
37.
Zurück zum Zitat Zhou, Z.H., Jiang, Y.: NeC4.5: neural ensemble based C4.5. IEEE Trans. Knowl. Data Eng. 16(6), 770–773 (2004)CrossRef Zhou, Z.H., Jiang, Y.: NeC4.5: neural ensemble based C4.5. IEEE Trans. Knowl. Data Eng. 16(6), 770–773 (2004)CrossRef
38.
Zurück zum Zitat Smirnov, E., Sprinkhuizen-Kuyper, I.G., Nalbantis, I.: Unanimous voting using support vector machines. Technical report, ERIM and Universiteit Rotterdam, IKAT, Universiteit Maastricht (2004) Smirnov, E., Sprinkhuizen-Kuyper, I.G., Nalbantis, I.: Unanimous voting using support vector machines. Technical report, ERIM and Universiteit Rotterdam, IKAT, Universiteit Maastricht (2004)
Metadaten
Titel
A Genetic Programming Approach Applied to Feature Selection from Medical Data
verfasst von
José A. Castellanos-Garzón
Juan Ramos
Yeray Mezquita Martín
Juan F. de Paz
Ernesto Costa
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-319-98702-6_24