Skip to main content
Top

2020 | OriginalPaper | Chapter

Highly Parallel Convolution Method to Compare DNA Sequences with Enforced In/Del and Mutation Tolerance

Authors : Anna Molyavko, Vladimir Shaidurov, Eugenia Karepova, Michael Sadovsky

Published in: Bioinformatics and Biomedical Engineering

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

New error tolerant method for the comparison and analysis of symbol sequences is proposed. The method is based on convolution function calculation, where the function is defined over the binary numeric sequences obtained by the specific transformation of original symbol sequence. The method allows highly parallel implementation and is of great value for insertion/delition mutations search. To calculate the convolution function, fast Fourier transform is used in the method implementation.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Actually, there is no difference between insertion and deletion: changing a reference sequence, one always is able to convert the situation to a single mutation, e. g. to insertion.
 
Literature
2.
go back to reference Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215(3), 403–410 (1990)CrossRef Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215(3), 403–410 (1990)CrossRef
3.
go back to reference Freschi, V., Bogliolo, A.: A faster algorithm for the computation of string convolutions using lz78 parsing. Inform. Process. Lett. 110(14), 609–613 (2010)CrossRef Freschi, V., Bogliolo, A.: A faster algorithm for the computation of string convolutions using lz78 parsing. Inform. Process. Lett. 110(14), 609–613 (2010)CrossRef
4.
go back to reference Freschi, V., Bogliolo, A.: Longest common subsequence between run-length-encoded strings: a new algorithm with improved parallelism. Inform. Process. Lett. 90(4), 167–173 (2004)CrossRef Freschi, V., Bogliolo, A.: Longest common subsequence between run-length-encoded strings: a new algorithm with improved parallelism. Inform. Process. Lett. 90(4), 167–173 (2004)CrossRef
5.
go back to reference Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Dimensionality reduction for fast similarity search in large time series databases. Knowl. Inf. Syst. 3(3), 263–286 (2001)CrossRef Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Dimensionality reduction for fast similarity search in large time series databases. Knowl. Inf. Syst. 3(3), 263–286 (2001)CrossRef
6.
go back to reference Katoh, K., Misawa, K., Kuma, K.I., Miyata, T.: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30(14), 3059–3066 (2002)CrossRef Katoh, K., Misawa, K., Kuma, K.I., Miyata, T.: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30(14), 3059–3066 (2002)CrossRef
7.
go back to reference Janacek, G.J., Bagnall, A.J., Powell, M.: A likelihood ratio distance measure for the similarity between the Fourier transform of time series. In: Ho, T.B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 737–743. Springer, Heidelberg (2005). https://doi.org/10.1007/11430919_85CrossRef Janacek, G.J., Bagnall, A.J., Powell, M.: A likelihood ratio distance measure for the similarity between the Fourier transform of time series. In: Ho, T.B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 737–743. Springer, Heidelberg (2005). https://​doi.​org/​10.​1007/​11430919_​85CrossRef
8.
go back to reference Hetland, M.L.: A survey of recent methods for efficient retrieval of similar time sequences. In: Data Mining in Time Series Databases, pp. 23–42. World Scientific (2004) Hetland, M.L.: A survey of recent methods for efficient retrieval of similar time sequences. In: Data Mining in Time Series Databases, pp. 23–42. World Scientific (2004)
9.
go back to reference Benson, D.C.: Fourier methods for biosequence analysis. Nucleic Acids Res. 18(21), 6305–6310 (1990)CrossRef Benson, D.C.: Fourier methods for biosequence analysis. Nucleic Acids Res. 18(21), 6305–6310 (1990)CrossRef
10.
go back to reference Aho, A.V., Hopcroft, J.E.: The Design and Analysis of Computer Algorithms. Pearson Education India, Bengaluru (1974) Aho, A.V., Hopcroft, J.E.: The Design and Analysis of Computer Algorithms. Pearson Education India, Bengaluru (1974)
11.
go back to reference Baase, S.: Computer Algorithms: Introduction to Design and Analysis. Pearson Education India, Bengaluru (2009) Baase, S.: Computer Algorithms: Introduction to Design and Analysis. Pearson Education India, Bengaluru (2009)
12.
go back to reference Kozen, D.C.: The Design and Analysis of Algorithms. Springer, Heidleberg (2012) Kozen, D.C.: The Design and Analysis of Algorithms. Springer, Heidleberg (2012)
13.
go back to reference Levenshtein, V.I.: Bounds for deletion/insertion correcting codes. In: Proceedings IEEE International Symposium on Information Theory, p. 370. IEEE (2002) Levenshtein, V.I.: Bounds for deletion/insertion correcting codes. In: Proceedings IEEE International Symposium on Information Theory, p. 370. IEEE (2002)
14.
go back to reference Merhi, S., Zhang, R., Iwen, M.A., Christlieb, A.: A new class of fully discrete sparse fourier transforms: faster stable implementations with guarantees. J. Fourier Anal. Appl. 25(3), 751–784 (2019)CrossRef Merhi, S., Zhang, R., Iwen, M.A., Christlieb, A.: A new class of fully discrete sparse fourier transforms: faster stable implementations with guarantees. J. Fourier Anal. Appl. 25(3), 751–784 (2019)CrossRef
15.
go back to reference Karam, C., Sugimoto, K., Hirakawa, K.: Fast convolutional distance transform. IEEE Signal Process. Lett. 26(6), 853–857 (2019)CrossRef Karam, C., Sugimoto, K., Hirakawa, K.: Fast convolutional distance transform. IEEE Signal Process. Lett. 26(6), 853–857 (2019)CrossRef
Metadata
Title
Highly Parallel Convolution Method to Compare DNA Sequences with Enforced In/Del and Mutation Tolerance
Authors
Anna Molyavko
Vladimir Shaidurov
Eugenia Karepova
Michael Sadovsky
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-45385-5_42

Premium Partner