Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 3/2011

01.09.2011 | Original Article

A new fast fuzzy Cocke–Younger–Kasami algorithm for DNA strings analysis

verfasst von: Herón Molina-Lozano

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 3/2011

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper we present a variation of the Cocke–Younger–Kasami algorithm (CYK algorithm) for the analysis of fuzzy free context languages applied to DNA strings. We propose a variation of the original CYK algorithm where we prove that the computational order of the new CYK algorithm is O(n). We prove that the new algorithm only uses O(2n) memory locations. The fuzzy context-free grammar (FCFG) is obtained from the DNA. The algorithm can be used to find regulatory motifs among other applications. In order to demonstrate the applications of the proposed algorithm, we present two examples. In the first example, we prove that it is possible to define a fuzzy grammar for a prototype DNA sequence and then find the membership grade of any arbitrary sequence against this specific pattern. As a second example, we construct a fuzzy grammar from the alignment of promoters obtained by a logo sequence algorithm for the Escherichia coli K12 DNA string, and then show how the proposed method can be used for discovery of the regulatory motifs.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
2.
Zurück zum Zitat Brendel V, Busse H (1984) Genome structure described by formal languages. Nucleic Acids Res 12:2561–2568CrossRef Brendel V, Busse H (1984) Genome structure described by formal languages. Nucleic Acids Res 12:2561–2568CrossRef
3.
Zurück zum Zitat Collado-Vides J (1989) A transformational grammar approach to the study of the regulation of gene expression. J Theory Biol 136:403–425CrossRef Collado-Vides J (1989) A transformational grammar approach to the study of the regulation of gene expression. J Theory Biol 136:403–425CrossRef
4.
Zurück zum Zitat Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York
5.
Zurück zum Zitat Hawkins J, Boden M (2005) The applicability of recurrent neural networks for biological sequence analysis. IEEE/ACM Trans Comput Biol Bioinform 2:243–253CrossRef Hawkins J, Boden M (2005) The applicability of recurrent neural networks for biological sequence analysis. IEEE/ACM Trans Comput Biol Bioinform 2:243–253CrossRef
6.
Zurück zum Zitat Head T (1987) Formal languages theory and DNA. Bull Math Biol 49 Head T (1987) Formal languages theory and DNA. Bull Math Biol 49
7.
Zurück zum Zitat Hopcroft JE, Rajeev Motwai R, Ullman JD (2002) Introduction to automata theory, languages and computation. Addison-Wesley, Reading Hopcroft JE, Rajeev Motwai R, Ullman JD (2002) Introduction to automata theory, languages and computation. Addison-Wesley, Reading
8.
Zurück zum Zitat Jang J-S, Sun CT, Mitzutani E (1997) Neuro-fuzzy and soft computing: a computational approach to learning and machine intelligence. Prentice Hall, Englewood Cliffs Jang J-S, Sun CT, Mitzutani E (1997) Neuro-fuzzy and soft computing: a computational approach to learning and machine intelligence. Prentice Hall, Englewood Cliffs
9.
Zurück zum Zitat Jones NC, Pevzner PA (2004) An introduction to bioinformatics algorithms. MIT Press, Cambridge Jones NC, Pevzner PA (2004) An introduction to bioinformatics algorithms. MIT Press, Cambridge
10.
Zurück zum Zitat Koski T (2001) Hidden Markov models for bioinformatics. Kluwer Academic Publishers, Dordrecht Koski T (2001) Hidden Markov models for bioinformatics. Kluwer Academic Publishers, Dordrecht
12.
Zurück zum Zitat Linz P (2006) Formal languages and automata, 4th edn. Jones and Bartlett Publishers, Sudbury Linz P (2006) Formal languages and automata, 4th edn. Jones and Bartlett Publishers, Sudbury
13.
Zurück zum Zitat Molina-Lozano H (2010) A fast fuzzy Cocke–Younger–Kasami algorithm for DNA and RNA string analysis. In: Mexican International Conference on Artificial Intelligence, MICAI Molina-Lozano H (2010) A fast fuzzy Cocke–Younger–Kasami algorithm for DNA and RNA string analysis. In: Mexican International Conference on Artificial Intelligence, MICAI
14.
Zurück zum Zitat Molina-Lozano H, Vallejo-Clemente E, Morett-Sánchez J (2008) DNA sequence analysis using fuzzy grammars. In: IEEE World Congress on Computational Intelligence Molina-Lozano H, Vallejo-Clemente E, Morett-Sánchez J (2008) DNA sequence analysis using fuzzy grammars. In: IEEE World Congress on Computational Intelligence
15.
Zurück zum Zitat Mordeson JN, Malik DS (2002) Fuzzy automata and languages: theory and applications. Chapman and Hall/CRC, Boca Raton Mordeson JN, Malik DS (2002) Fuzzy automata and languages: theory and applications. Chapman and Hall/CRC, Boca Raton
16.
Zurück zum Zitat Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequences of two proteins. J Mol Biol 44:443–453CrossRef Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequences of two proteins. J Mol Biol 44:443–453CrossRef
17.
Zurück zum Zitat Searls D (1992) The linguistics of DNA. Am Sci 80:579–591 Searls D (1992) The linguistics of DNA. Am Sci 80:579–591
18.
Zurück zum Zitat Searls D (1993) Artificial intelligence and molecular biology. In: Hunter L (eds). AAAI Press, pp 47–120 Searls D (1993) Artificial intelligence and molecular biology. In: Hunter L (eds). AAAI Press, pp 47–120
19.
Zurück zum Zitat Searls DB (2002) The languages of genes. Nature 420:211–217 Searls DB (2002) The languages of genes. Nature 420:211–217
20.
Zurück zum Zitat Smith TF, Waterman MS (1981) Identification of common molecular subsequences. J Mol Biol 147:195–197CrossRef Smith TF, Waterman MS (1981) Identification of common molecular subsequences. J Mol Biol 147:195–197CrossRef
22.
Zurück zum Zitat Harrison MA (1978) Introduction to formal language theory. Addison-Wesley, Reading Harrison MA (1978) Introduction to formal language theory. Addison-Wesley, Reading
23.
Zurück zum Zitat Schneider TD (1996) New approaches in mathematical biology: information theory and molecular machines. In: Chela-Flores J, Raulin F (eds) Chemical evolution: physics of the origin and evolution of life. Kluwer Academic Publishers, Dordrecht, pp 313–321 Schneider TD (1996) New approaches in mathematical biology: information theory and molecular machines. In: Chela-Flores J, Raulin F (eds) Chemical evolution: physics of the origin and evolution of life. Kluwer Academic Publishers, Dordrecht, pp 313–321
Metadaten
Titel
A new fast fuzzy Cocke–Younger–Kasami algorithm for DNA strings analysis
verfasst von
Herón Molina-Lozano
Publikationsdatum
01.09.2011
Verlag
Springer-Verlag
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 3/2011
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-011-0042-z

Weitere Artikel der Ausgabe 3/2011

International Journal of Machine Learning and Cybernetics 3/2011 Zur Ausgabe

Editorial

Editorial

Neuer Inhalt