Skip to main content
Top
Published in: Microsystem Technologies 9/2017

02-03-2016 | Technical Paper

Polyphase filtering with variable mapping rule in protein coding region prediction

Authors: Saikat Singha Roy, Soma Barman

Published in: Microsystem Technologies | Issue 9/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Genomic researches are concerned with the study of genomes of organisms. It has become a challenge to the researchers to identify the segments within the DNA sequence that involved in protein synthesis and called coding region of gene. The methods are generally used to identify the segment that relies on period-3 property of genes. This period-3 property easily can be identified by digital signal processing with great accuracy. Prior to DSP application in gene prediction a conversion rule is required which converts symbolic DNA (ATCGTC…) sequence into numerical representations. Accuracy of gene prediction depends on mapping rule. The effectiveness of mapping rule depends on the application area of genomics. Some mapping rule works well in gene prediction may not performed good in genetic disease prediction. Most of the available conversion rules are fixed mapping technique. In this paper a new conversion rule is proposed prior to DSP application and a polyphase filter is used to suppress the noise in the DNA spectrum. The performance of the proposed mapping is compared with existing mapping and also the performance of the polyphase filtering method is compared with existing filtering methods in terms of signal to noise ratio (SNR) and location accuracy.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Abo-Zahhad M, Ahmed SM, Abd-Elrahman SA (2012) Genomic analysis and classification of exon and intron sequences using DNA numerical mapping techniques. Int J Inf Technol Comput Sci 4(8):22–36 Abo-Zahhad M, Ahmed SM, Abd-Elrahman SA (2012) Genomic analysis and classification of exon and intron sequences using DNA numerical mapping techniques. Int J Inf Technol Comput Sci 4(8):22–36
go back to reference Akhtar M, Epps J and Ambikairajah E (2007) On DNA numerical representations for period-3 based exon prediction. In: Proceedings of IEEE workshop on genomic signal processing and statistics (GENSIPS), pp. 1–4 Akhtar M, Epps J and Ambikairajah E (2007) On DNA numerical representations for period-3 based exon prediction. In: Proceedings of IEEE workshop on genomic signal processing and statistics (GENSIPS), pp. 1–4
go back to reference Akhtar M, Epps J, Ambikairajah E (2008) Signal processing in sequence analysis: advances in eukaryotic gene prediction. IEEE J Sel Topics Signal Process 2(3):310–321CrossRef Akhtar M, Epps J, Ambikairajah E (2008) Signal processing in sequence analysis: advances in eukaryotic gene prediction. IEEE J Sel Topics Signal Process 2(3):310–321CrossRef
go back to reference Alberts B, Bray D, Johnson A, Lewis J, Raff M, Roberts K, Walter P (1998) Essential cell biology. Garland Publishing Inc., New York Alberts B, Bray D, Johnson A, Lewis J, Raff M, Roberts K, Walter P (1998) Essential cell biology. Garland Publishing Inc., New York
go back to reference Anastassiou D (2000) Frequency–domain analysis of bimolecular sequences. Bioinformatics 16:1073–1081CrossRef Anastassiou D (2000) Frequency–domain analysis of bimolecular sequences. Bioinformatics 16:1073–1081CrossRef
go back to reference Anastassiou D (2001a) DSP in genomics: Processing and frequency domain analysis of character strings. IEEE-7803-7041-2001 Anastassiou D (2001a) DSP in genomics: Processing and frequency domain analysis of character strings. IEEE-7803-7041-2001
go back to reference Anastassiou D (2001b) Genomic signal processing. IEEE Signal Process Mag 18:8–20CrossRef Anastassiou D (2001b) Genomic signal processing. IEEE Signal Process Mag 18:8–20CrossRef
go back to reference Barman (Mandal) S, Biswas S, Das S and Roy M (2012) Performance analysis and Simulation of IIR anti-notch filter with various structures for gene predication application. 5th International Conference on Computer and Devices for Communication (CODEC) Barman (Mandal) S, Biswas S, Das S and Roy M (2012) Performance analysis and Simulation of IIR anti-notch filter with various structures for gene predication application. 5th International Conference on Computer and Devices for Communication (CODEC)
go back to reference Bellanger M, Bonnerot G, Coudreuse M (1976) Digital filtering by polyphase network: application to sample rate alteration and filter banks. IEEE Trans. Acoustic Speech Signal Proc. 24:109–114CrossRef Bellanger M, Bonnerot G, Coudreuse M (1976) Digital filtering by polyphase network: application to sample rate alteration and filter banks. IEEE Trans. Acoustic Speech Signal Proc. 24:109–114CrossRef
go back to reference Chakravarthy N, Spanias A, Iasemidis LD, Tsakalis K (2004) Autoregressive modeling and feature analysis of DNA sequences. EURASIP J Appl Sig Process 1:13–28CrossRefMATH Chakravarthy N, Spanias A, Iasemidis LD, Tsakalis K (2004) Autoregressive modeling and feature analysis of DNA sequences. EURASIP J Appl Sig Process 1:13–28CrossRefMATH
go back to reference Crick FH, Watson JD (1953) Molecular structure of nucleic acids. Nature 171(4356):737–738CrossRef Crick FH, Watson JD (1953) Molecular structure of nucleic acids. Nature 171(4356):737–738CrossRef
go back to reference Cristea P D (2002a) Genetic signal representation and analysis. In: Proceedings of SPIEConference. International Biomedical Optics Symposium (BIOS’02), vol. 4623, pp 77–84 Cristea P D (2002a) Genetic signal representation and analysis. In: Proceedings of SPIEConference. International Biomedical Optics Symposium (BIOS’02), vol. 4623, pp 77–84
go back to reference CristeaP D (2002) Conversion of nucleotides sequences into genomic signals. J Cell Mol Med 6:279–303CrossRef CristeaP D (2002) Conversion of nucleotides sequences into genomic signals. J Cell Mol Med 6:279–303CrossRef
go back to reference Epps J, Ambikairajah E and Akhtar M (2008) An integer period DFT for biological sequence processing. In: Proceedings of the IEEE International Workshop on Genomic Signal Processing and Statistics GENSIPS, pp 1–4 Epps J, Ambikairajah E and Akhtar M (2008) An integer period DFT for biological sequence processing. In: Proceedings of the IEEE International Workshop on Genomic Signal Processing and Statistics GENSIPS, pp 1–4
go back to reference Ficket JW, Tung CS (1982) Recognition of protein coding regions in DNA sequences. Nucleic Acids Res 10(17):5303–5318CrossRef Ficket JW, Tung CS (1982) Recognition of protein coding regions in DNA sequences. Nucleic Acids Res 10(17):5303–5318CrossRef
go back to reference Grandhi D G and Vijaykumar C (2007) 2-Simplex Mapping for Identifying the Protein Coding Regions in DNA [C]. TENCON-2007, Taiwan, 530 Grandhi D G and Vijaykumar C (2007) 2-Simplex Mapping for Identifying the Protein Coding Regions in DNA [C]. TENCON-2007, Taiwan, 530
go back to reference Holden T, Subramaniam R, Sullivan R, Cheng E, Sneider C, Tremberger G, Flamholz JA, Leiberman DH, Cheung TD (1992) A TCG nucleotide fluctuation of Deinococcusradiodurans radiation genes. Proceedings of Society of Photo-Optical Nature, San Diego 168 Holden T, Subramaniam R, Sullivan R, Cheng E, Sneider C, Tremberger G, Flamholz JA, Leiberman DH, Cheung TD (1992) A TCG nucleotide fluctuation of Deinococcusradiodurans radiation genes. Proceedings of Society of Photo-Optical Nature, San Diego 168
go back to reference Inbamalar T M and Sivakumar R (2015) Improved Algorithm for Analysis of DNA Sequences Using Multiresolution Transformation. Scientific World J Inbamalar T M and Sivakumar R (2015) Improved Algorithm for Analysis of DNA Sequences Using Multiresolution Transformation. Scientific World J
go back to reference Kakumani R and Devabhaktuni V (2008) Prediction of Protein Coding Regions in DNA Sequence using a model based approach. IEEE explore. doi:978-1-4244-1684-4/08, pp 1918–1920 Kakumani R and Devabhaktuni V (2008) Prediction of Protein Coding Regions in DNA Sequence using a model based approach. IEEE explore. doi:978-1-4244-1684-4/08, pp 1918–1920
go back to reference Liu G and Luan Y (2014) Identification of protein coding regions in the eukaryotic DNA sequences based on Marple algorithm and wavelet packets transform. In Abstract and Applied Analysis (Vol. 2014). Hindawi Publishing Corporation Liu G and Luan Y (2014) Identification of protein coding regions in the eukaryotic DNA sequences based on Marple algorithm and wavelet packets transform. In Abstract and Applied Analysis (Vol. 2014). Hindawi Publishing Corporation
go back to reference Ning J, Moore C N and Nelson J C (2003) Preliminary wavelet analysis of genomic sequences. Proc. IEEE Bioinformatics Conf. (CSB), pp 509–510 Ning J, Moore C N and Nelson J C (2003) Preliminary wavelet analysis of genomic sequences. Proc. IEEE Bioinformatics Conf. (CSB), pp 509–510
go back to reference Rao N, Shepherd SJ (2004) Detection of 3-periodicity for small genomic sequences based on AR technique. Proc Int Conf Commun Circuits Syst ICCCAS 2:1032–1036 Rao N, Shepherd SJ (2004) Detection of 3-periodicity for small genomic sequences based on AR technique. Proc Int Conf Commun Circuits Syst ICCCAS 2:1032–1036
go back to reference Roy M, Biswas S and Barman (Mandal) S (2009) Identification and analysis of coding and non-coding regions of a DNA sequence by Positional Frequency Distribution of Nucleotides (PFDN) algorithm. International Conference on Computers and Devices for Communication (CODEC) Roy M, Biswas S and Barman (Mandal) S (2009) Identification and analysis of coding and non-coding regions of a DNA sequence by Positional Frequency Distribution of Nucleotides (PFDN) algorithm. International Conference on Computers and Devices for Communication (CODEC)
go back to reference Sahu, S S and Panda G (2010) An efficient signal processing approach in eukaryotic gene prediction. In: Proceeding of 8th Asia Pacific Bioinformatic Conference (APBC), Bangalore, pp 1–12 Sahu, S S and Panda G (2010) An efficient signal processing approach in eukaryotic gene prediction. In: Proceeding of 8th Asia Pacific Bioinformatic Conference (APBC), Bangalore, pp 1–12
go back to reference Silverman BD, Linker R (1986) A measure of DNA periodicity [J]. Theor Biol 118:295–300CrossRef Silverman BD, Linker R (1986) A measure of DNA periodicity [J]. Theor Biol 118:295–300CrossRef
go back to reference Singha Roy S, Barman S (2014) Identification of protein coding region of DNA sequence using multirate filter. Computational Advan Commun Circuits Syst. doi:10.1007/978-81-322-2274-3_16 (Lecture Notes in Electrical Engineering) Singha Roy S, Barman S (2014) Identification of protein coding region of DNA sequence using multirate filter. Computational Advan Commun Circuits Syst. doi:10.​1007/​978-81-322-2274-3_​16 (Lecture Notes in Electrical Engineering)
go back to reference Tiwari S, Ramachandran S, Bhattacharya A, Bhattacharya S, Ramaswamy R (1997) Prediction of probable genes by Fourier analysis of genomic sequences. CABIOS 3(3):263–270 Tiwari S, Ramachandran S, Bhattacharya A, Bhattacharya S, Ramaswamy R (1997) Prediction of probable genes by Fourier analysis of genomic sequences. CABIOS 3(3):263–270
go back to reference Vaidyanathan PP (1990) Multirate digital filters filter banks, polyphase networks, and applications: a tutorial. Proc IEEE 78(1):56–93CrossRef Vaidyanathan PP (1990) Multirate digital filters filter banks, polyphase networks, and applications: a tutorial. Proc IEEE 78(1):56–93CrossRef
go back to reference Vaidyanathan PP (2004) Genomics and proteomics: a signal processor’s tour. Circuits Syst Mag IEEE 4(4):6–29CrossRef Vaidyanathan PP (2004) Genomics and proteomics: a signal processor’s tour. Circuits Syst Mag IEEE 4(4):6–29CrossRef
go back to reference Vaidyanathan P P and Yoon B J (2004) The role of signal-processing concepts in genomics and proteomics. J Franklin Inst (Special issue on Genomics) Vaidyanathan P P and Yoon B J (2004) The role of signal-processing concepts in genomics and proteomics. J Franklin Inst (Special issue on Genomics)
go back to reference Voss RF (1992) Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. Phys Rev Lett 68(25):3805–3808CrossRef Voss RF (1992) Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. Phys Rev Lett 68(25):3805–3808CrossRef
go back to reference Yin C, Stephen S, Yau T (2007) Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence. J Theor Biol 247:687–694MathSciNetCrossRef Yin C, Stephen S, Yau T (2007) Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence. J Theor Biol 247:687–694MathSciNetCrossRef
go back to reference Zhang R, Zhang CT (1004) Z curves, An Intuitive Tool, for Visualizing and Analyzing the DNA sequences. J Biomol Struct Dyn 11:767–782CrossRef Zhang R, Zhang CT (1004) Z curves, An Intuitive Tool, for Visualizing and Analyzing the DNA sequences. J Biomol Struct Dyn 11:767–782CrossRef
Metadata
Title
Polyphase filtering with variable mapping rule in protein coding region prediction
Authors
Saikat Singha Roy
Soma Barman
Publication date
02-03-2016
Publisher
Springer Berlin Heidelberg
Published in
Microsystem Technologies / Issue 9/2017
Print ISSN: 0946-7076
Electronic ISSN: 1432-1858
DOI
https://doi.org/10.1007/s00542-016-2884-5

Other articles of this Issue 9/2017

Microsystem Technologies 9/2017 Go to the issue