
2019 | OriginalPaper | Book chapter

Reads in NGS Are Distributed over a Sequence Very Inhomogeneously

Written by: Michael Sadovsky, Victory Kobets, Georgy Khodos, Dmitry Kuzmin, Vadim Sharov

Published in: Bioinformatics and Biomedical Engineering

Publisher: Springer International Publishing


Abstract

The distribution of read starts over a sequenced genetic entity is studied. The key question was whether the starts are distributed uniformly and homogeneously along a sequence, or whether there are spots of increased local density of starts. To answer this question, 15 bacterial genomes were studied. Some genomes exhibit a distribution pattern extremely far from homogeneity, while others show a lower level of inhomogeneity. The inhomogeneity level was determined through the Kullback-Leibler distance between the real string distribution and the one bearing the most probable continuations of the shorter strings.
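
The abstract does not spell out the formula, so the following is only a minimal sketch of how such a measure could be computed. It assumes that the expected frequency of a string of length q is reconstructed from the frequencies of its two overlapping substrings of length q-1 (their product divided by the frequency of the shared middle part), which is an assumption about the method rather than a statement of it. The function names `ring_frequencies` and `kl_to_expected` are hypothetical, and the sequence is closed into a ring so that every position starts a string.

```python
from collections import Counter
from math import log


def ring_frequencies(seq, q):
    """Frequencies of all length-q strings in seq closed into a ring."""
    n = len(seq)
    # Append the first q-1 symbols so that every position starts a string.
    extended = seq + seq[:q - 1]
    counts = Counter(extended[i:i + q] for i in range(n))
    return {word: count / n for word, count in counts.items()}


def kl_to_expected(seq, q):
    """Kullback-Leibler divergence between observed length-q frequencies
    and those reconstructed from length-(q-1) strings.

    Assumed reconstruction (not given in the abstract):
        expected(w) = f(w[:-1]) * f(w[1:]) / f(w[1:-1])
    with the denominator taken as 1 when q == 2.
    """
    if q < 2 or len(seq) < q:
        raise ValueError("need q >= 2 and a sequence of at least q symbols")
    f_q = ring_frequencies(seq, q)
    f_prev = ring_frequencies(seq, q - 1)
    f_mid = ring_frequencies(seq, q - 2) if q > 2 else None
    total = 0.0
    for word, freq in f_q.items():
        middle = f_mid[word[1:-1]] if f_mid is not None else 1.0
        expected = f_prev[word[:-1]] * f_prev[word[1:]] / middle
        total += freq * log(freq / expected)
    return total


if __name__ == "__main__":
    toy = "ACGT" * 10  # illustrative toy string, not data from the study
    print(kl_to_expected(toy, 2), kl_to_expected(toy, 3))
```

In this sketch a value near zero means the longer strings are fully predicted by the shorter ones, while larger values indicate a distribution further from that expectation. The ring closure keeps the frequencies of shorter strings consistent whether they are obtained by dropping the first or the last symbol of a longer string.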

Footnotes
1
The equality of these two sums stands behind the connection of a sequence into a ring.
 
Metadata
Title
Reads in NGS Are Distributed over a Sequence Very Inhomogeneously
Written by
Michael Sadovsky
Victory Kobets
Georgy Khodos
Dmitry Kuzmin
Vadim Sharov
Copyright year
2019
DOI
https://doi.org/10.1007/978-3-030-17938-0_25