2017 | OriginalPaper | Chapter

Classification of Vector-Borne Virus Through Totally Ordered Set of Dinucleotide Interval Patterns

Authors: Uddalak Mitra, Balaram Bhattacharyya

Published in: Pattern Recognition and Machine Intelligence

Publisher: Springer International Publishing


Abstract

In genome analysis, a common approach across word-based methods is to use longer words to improve the precision of biological findings. However, arbitrarily increasing word length is not always fruitful and raises space and time complexity. We observe that, rather than merely increasing length, integrating word intervals with the order and frequency of their occurrence greatly improves the extraction of sequence information at much smaller word lengths. We devise a method, Dinucleotide Interval Patterns (DIP), for entropy retrieval from ordered sets of dinucleotide intervals. Experiments on natural Flaviviridae virus sequences of length 9 to 12 kbp establish that a word size of only 2 bp is capable of deriving a precise taxonomic classification of the virus. This is in sharp contrast to standard word-based methods, which require a word size of at least 6 bp to achieve nearly 30% Topological Similarity, compared with the 60% score achieved by DIP with only 2 bp.
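The abstract does not spell out how DIP is computed, so the following is only a rough illustration of the general idea of collecting ordered dinucleotide intervals and summarizing them with an entropy, not the authors' algorithm. It assumes that an "interval" is the distance between successive start positions of the same dinucleotide, and that Shannon entropy over the gap lengths is a reasonable stand-in for the paper's entropy measure. The function names (dinucleotide_intervals, interval_entropy, interval_entropy_profile) are hypothetical.

```python
from collections import Counter, defaultdict
from math import log2


def dinucleotide_intervals(seq):
    """Map each dinucleotide to the ordered list of gaps between its
    successive start positions (assumed definition of an 'interval')."""
    seq = seq.upper()
    last_seen = {}
    intervals = defaultdict(list)
    for i in range(len(seq) - 1):
        word = seq[i:i + 2]
        if word in last_seen:
            intervals[word].append(i - last_seen[word])
        last_seen[word] = i
    return dict(intervals)


def interval_entropy(gaps):
    """Shannon entropy (in bits) of the empirical distribution of gap lengths.
    This is an assumed summary statistic, not the paper's stated formula."""
    if not gaps:
        return 0.0
    counts = Counter(gaps)
    total = len(gaps)
    return -sum((c / total) * log2(c / total) for c in counts.values())


def interval_entropy_profile(seq):
    """16-element feature vector, one entropy per dinucleotide, in the
    fixed order AA, AC, AG, AT, CA, ..., TT."""
    intervals = dinucleotide_intervals(seq)
    words = [a + b for a in "ACGT" for b in "ACGT"]
    return [interval_entropy(intervals.get(w, [])) for w in words]


if __name__ == "__main__":
    # Toy sequence for illustration only; real inputs would be 9 to 12 kbp genomes.
    print(dinucleotide_intervals("ACACGACA"))
    print(interval_entropy_profile("ACACGACA"))
```

Under these assumptions, two genomes could be compared by a distance between their 16-element profiles. How the paper actually orders the interval sets and builds its classification is not recoverable from the abstract.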


Metadata
Title
Classification of Vector-Borne Virus Through Totally Ordered Set of Dinucleotide Interval Patterns
Authors
Uddalak Mitra
Balaram Bhattacharyya
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-69900-4_51
