
2018 | Original Paper | Book Chapter

5. Finding New Overlapping Genes and Their Theory (FOG Theory)

Written by: Siegfried Scherer, Klaus Neuhaus, Martin Bossert, Katharina Mir, Daniel Keim, Svenja Simon

Published in: Information- and Communication Theory in Molecular Biology

Publisher: Springer International Publishing


Abstract

The general goal of the project is to find and verify new overlapping protein-coding DNA sequences in prokaryotes and to understand the underlying mechanisms with the help of models from information and communication theory. Reaching these goals requires the cooperation of three groups: a group performing in vivo and in vitro molecular biology experiments, an informatics group that can handle the huge amount of widely distributed gene sequence data, and a group working in information and communication theory. Methods from information theory, especially from error-correcting codes, will be used to study how proteins are encoded by embedded genes, using new distance measures. Further, the powerful concept of random coding will be used to obtain bounds. Embedded genes will be analyzed using a coding-theoretic approach. Communication theory provides models and mechanisms for transmitting information reliably over channels that introduce errors. Evolution, as well as the process of coding proteins by overlapping genes, can be viewed as such a communication system. Both will be described and analyzed with the theory of communication systems, including synchronization mechanisms. The parameters of the models need to be verified and/or determined; therefore, aspects of bioinformatics and molecular biology are essential. Algorithms will be developed that efficiently search databases at a large scale for new protein-coding DNA sequences in prokaryotes, embedded in annotated genes in overlapping alternative reading frames. Based on these results, selected candidate genes will be evaluated experimentally with molecular biology tools to determine their function.
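
To make the phrase "embedded in annotated genes in overlapping alternative reading frames" concrete, the following minimal Python sketch scans one annotated gene for open reading frames (ORFs) in its five alternative frames (two on the same strand, three on the opposite strand). It is an illustration only and not the algorithm developed in the chapter. The function names, the start codon set (ATG, GTG, TTG), and the `min_codons` threshold are assumptions made for this example.

```python
# Illustrative sketch only; not the chapter's search algorithm.
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed set of common bacterial starts

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def orfs_in_frame(seq, offset, min_codons):
    """Yield (start, end) for each ORF read in steps of three from `offset`.

    `end` is exclusive and includes the stop codon. An ORF is reported only
    if it has at least `min_codons` codons before the stop codon.
    """
    start = None
    for i in range(offset, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if start is None and codon in START_CODONS:
            start = i  # first start codon after the previous stop
        elif start is not None and codon in STOP_CODONS:
            if (i - start) // 3 >= min_codons:
                yield start, i + 3
            start = None


def embedded_orfs(gene_seq, min_codons=30):
    """List ORFs in the five alternative frames of an annotated gene.

    `gene_seq` is assumed to be the annotated gene, read in frame 0 of the
    '+' strand. Each hit is (strand, offset, start, end) in coordinates of
    `gene_seq`; for '-' hits the offset counts from the 3' end of the gene.
    """
    n = len(gene_seq)
    hits = []
    for offset in (1, 2):  # frame 0 on '+' is the annotated gene itself
        for s, e in orfs_in_frame(gene_seq, offset, min_codons):
            hits.append(("+", offset, s, e))
    rc = reverse_complement(gene_seq)
    for offset in (0, 1, 2):
        for s, e in orfs_in_frame(rc, offset, min_codons):
            hits.append(("-", offset, n - e, n - s))  # map back to '+' coords
    return hits
```

A real search would also need to handle ORFs that extend beyond the boundaries of the annotated gene and would rank candidates by further evidence; this sketch only enumerates fully embedded candidates.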

Metadata
Title
Finding New Overlapping Genes and Their Theory (FOG Theory)
Written by
Siegfried Scherer
Klaus Neuhaus
Martin Bossert
Katharina Mir
Daniel Keim
Svenja Simon
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-54729-9_5
