Skip to main content
Top
Published in:
Cover of the book

2013 | OriginalPaper | Chapter

1. Evolving SQL Queries from Examples with Developmental Genetic Programming

Authors : Thomas Helmuth, Lee Spector

Published in: Genetic Programming Theory and Practice X

Publisher: Springer New York

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Large databases are becoming ever more ubiquitous, as are the opportunities for discovering useful knowledge within them. Evolutionary computation methods such as genetic programming have previously been applied to several aspects of the problem of discovering knowledge in databases. The more specific task of producing human-comprehensible SQL queries has several potential applications but has thus far been explored only to a limited extent. In this chapter we show howdevelopmental genetic programming can automatically generate SQL queries from sets of positive and negative examples. We show that a developmental genetic programming system can produce queries that are reasonably accurate while excelling in human comprehensibility relative to the well-known C5.0 decision tree generation system.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Acar AC, Motro A (2005) Intensional encapsulations of database subsets by genetic programming. Tech. Rep. ISE-TR-05-01, Information and Software Engineering Department, The Volgenau School of Information Technology and Engineering, George Mason University, URL http://ise.gmu.edu/techrep/2005/05-01.pdf Acar AC, Motro A (2005) Intensional encapsulations of database subsets by genetic programming. Tech. Rep. ISE-TR-05-01, Information and Software Engineering Department, The Volgenau School of Information Technology and Engineering, George Mason University, URL http://​ise.​gmu.​edu/​techrep/​2005/​05-01.​pdf
go back to reference Doucette JA, McIntyre AR, Lichodzijewski P, Heywood MI (2012) Symbiotic coevolutionary genetic programming: a benchmarking study under large attribute spaces. Genetic Programming and Evolvable Machines 13(1):71–101, DOI doi:10.1007/s10710-011-9151-4, special Section on Evolutionary Algorithms for Data Mining Doucette JA, McIntyre AR, Lichodzijewski P, Heywood MI (2012) Symbiotic coevolutionary genetic programming: a benchmarking study under large attribute spaces. Genetic Programming and Evolvable Machines 13(1):71–101, DOI doi:10.1007/s10710-011-9151-4, special Section on Evolutionary Algorithms for Data Mining
go back to reference Freitas AA (1997) A genetic programming framework for two data mining tasks: Classification and generalized rule induction. In: Koza JR, Deb K, Dorigo M, Fogel DB, Garzon M, Iba H, Riolo RL (eds) Genetic Programming 1997: Proceedings of the Second Annual Conference, Morgan Kaufmann, Stanford University, CA, USA, pp 96–101, URL http://citeseer.nj.nec.com/43454.html Freitas AA (1997) A genetic programming framework for two data mining tasks: Classification and generalized rule induction. In: Koza JR, Deb K, Dorigo M, Fogel DB, Garzon M, Iba H, Riolo RL (eds) Genetic Programming 1997: Proceedings of the Second Annual Conference, Morgan Kaufmann, Stanford University, CA, USA, pp 96–101, URL http://​citeseer.​nj.​nec.​com/​43454.​html
go back to reference Ishida CY, Pozo ATR (2002) GP SQL miner: SQL-grammar genetic programming in data mining. In: Fogel DB, El-Sharkawi MA, Yao X, Greenwood G, Iba H, Marrow P, Shackleton M (eds) Proceedings of the 2002 Congress on Evolutionary Computation CEC2002, IEEE Press, pp 1226–1231 Ishida CY, Pozo ATR (2002) GP SQL miner: SQL-grammar genetic programming in data mining. In: Fogel DB, El-Sharkawi MA, Yao X, Greenwood G, Iba H, Marrow P, Shackleton M (eds) Proceedings of the 2002 Congress on Evolutionary Computation CEC2002, IEEE Press, pp 1226–1231
go back to reference Klein J, Spector L (2007) Unwitting distributed genetic programming via asynchronous JavaScript and XML. In: Thierens D, Beyer HG, Bongard J, Branke J, Clark JA, Cliff D, Congdon CB, Deb K, Doerr B, Kovacs T, Kumar S, Miller JF, Moore J, Neumann F, Pelikan M, Poli R, Sastry K, Stanley KO, Stutzle T, Watson RA, Wegener I (eds) GECCO ’07: Proceedings of the 9th annual conference on Genetic and evolutionary computation, ACM Press, London, vol 2, pp 1628–1635, DOI doi:10.1145/1276958.1277282, URL http://www.cs.bham.ac.uk/~wbl/biblio/gecco2007/docs/p1628.pdf Klein J, Spector L (2007) Unwitting distributed genetic programming via asynchronous JavaScript and XML. In: Thierens D, Beyer HG, Bongard J, Branke J, Clark JA, Cliff D, Congdon CB, Deb K, Doerr B, Kovacs T, Kumar S, Miller JF, Moore J, Neumann F, Pelikan M, Poli R, Sastry K, Stanley KO, Stutzle T, Watson RA, Wegener I (eds) GECCO ’07: Proceedings of the 9th annual conference on Genetic and evolutionary computation, ACM Press, London, vol 2, pp 1628–1635, DOI doi:10.1145/1276958.1277282, URL http://​www.​cs.​bham.​ac.​uk/​~wbl/​biblio/​gecco2007/​docs/​p1628.​pdf
go back to reference Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
go back to reference da Silva BC, Thomas P (2010) Automatic query generation, “Unpublished manuscript” da Silva BC, Thomas P (2010) Automatic query generation, “Unpublished manuscript”
go back to reference Spector L (2001) Autoconstructive evolution: Push, pushGP, and pushpop. In: Spector L, Goodman ED, Wu A, Langdon WB, Voigt HM, Gen M, Sen S, Dorigo M, Pezeshk S, Garzon MH, Burke E (eds) Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), Morgan Kaufmann, San Francisco, California, USA, pp 137–146, URL http://hampshire.edu/lspector/pubs/ace.pdf Spector L (2001) Autoconstructive evolution: Push, pushGP, and pushpop. In: Spector L, Goodman ED, Wu A, Langdon WB, Voigt HM, Gen M, Sen S, Dorigo M, Pezeshk S, Garzon MH, Burke E (eds) Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), Morgan Kaufmann, San Francisco, California, USA, pp 137–146, URL http://​hampshire.​edu/​lspector/​pubs/​ace.​pdf
go back to reference Spector L, Klein J, Keijzer M (2005) The push3 execution stack and the evolution of control. In: Beyer HG, O’Reilly UM, Arnold DV, Banzhaf W, Blum C, Bonabeau EW, Cantu-Paz E, Dasgupta D, Deb K, Foster JA, de Jong ED, Lipson H, Llora X, Mancoridis S, Pelikan M, Raidl GR, Soule T, Tyrrell AM, Watson JP, Zitzler E (eds) GECCO 2005: Proceedings of the 2005 conference on Genetic and evolutionary computation, ACM Press, Washington DC, USA, vol 2, pp 1689–1696, DOI doi:10.1145/1068009.1068292, URL http://www.cs.bham.ac.uk/~wbl/biblio/gecco2005/docs/p1689.pdf Spector L, Klein J, Keijzer M (2005) The push3 execution stack and the evolution of control. In: Beyer HG, O’Reilly UM, Arnold DV, Banzhaf W, Blum C, Bonabeau EW, Cantu-Paz E, Dasgupta D, Deb K, Foster JA, de Jong ED, Lipson H, Llora X, Mancoridis S, Pelikan M, Raidl GR, Soule T, Tyrrell AM, Watson JP, Zitzler E (eds) GECCO 2005: Proceedings of the 2005 conference on Genetic and evolutionary computation, ACM Press, Washington DC, USA, vol 2, pp 1689–1696, DOI doi:10.1145/1068009.1068292, URL http://​www.​cs.​bham.​ac.​uk/​~wbl/​biblio/​gecco2005/​docs/​p1689.​pdf
go back to reference Van Rijsbergen C (1979) Information retrieval. Butterworths, London Van Rijsbergen C (1979) Information retrieval. Butterworths, London
go back to reference Veeramachaneni K, Vladislavleva E, O’Reilly UM (2012) Knowledge mining sensory evaluation data: genetic programming, statistical techniques, and swarm optimization. Genetic Programming and Evolvable Machines 13(1):103–133, DOI doi:10.1007/s10710-011-9153-2, special Section on Evolutionary Algorithms for Data Mining Veeramachaneni K, Vladislavleva E, O’Reilly UM (2012) Knowledge mining sensory evaluation data: genetic programming, statistical techniques, and swarm optimization. Genetic Programming and Evolvable Machines 13(1):103–133, DOI doi:10.1007/s10710-011-9153-2, special Section on Evolutionary Algorithms for Data Mining
Metadata
Title
Evolving SQL Queries from Examples with Developmental Genetic Programming
Authors
Thomas Helmuth
Lee Spector
Copyright Year
2013
Publisher
Springer New York
DOI
https://doi.org/10.1007/978-1-4614-6846-2_1

Premium Partner