2016 | OriginalPaper | Chapter

The Prediction of Human Genes in DNA Based on a Generalized Hidden Markov Model

Authors: Rui Guo, Ke Yan, Wei He, Jian Zhang

Published in: Biometric Recognition

Publisher: Springer International Publishing

Abstract

The Generalized Hidden Markov Model (GHMM) has proven to be a highly general probabilistic model of gene structure in human genomic sequences. It can simultaneously incorporate signal descriptions, such as splice sites, and content descriptions, such as the compositional features of exons and introns. Building on its flexibility and sound probabilistic underpinnings, we integrate several modified submodels and implement a program for predicting human genes in DNA. The program can predict multiple genes in a sequence, handle partial as well as complete genes, and predict consistent sets of genes occurring on either or both DNA strands. More importantly, it also performs well on longer sequences containing an unknown number of genes. Experimental results show that the proposed method achieves better prediction accuracy than some existing methods, and over 70% of exons are identified exactly.
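
The abstract does not give the model's equations or submodels. As a rough illustration of how a GHMM parse can be decoded in general, the following Python sketch runs a segment-based Viterbi search over a toy two-state model. It is not the authors' program: the function name `ghmm_viterbi`, the states `N` (non-coding) and `E` (exon-like), and every probability below are hypothetical values chosen only for the example. Each state emits a whole segment whose length follows an explicit duration distribution, which is what distinguishes a GHMM from a standard HMM.

```python
import math

def ghmm_viterbi(seq, states, init, trans, duration, emit):
    """Segment-based Viterbi decoding for a toy GHMM.

    Returns (log_score, segments); segments is a list of
    (state, start, end) tuples with half-open [start, end) coordinates.
    Assumes every emission probability used is strictly positive.
    """
    n = len(seq)
    neg_inf = float("-inf")
    # best[t][s]: best log score of a parse of seq[:t] whose last segment is in state s
    best = [{s: neg_inf for s in states} for _ in range(n + 1)]
    back = [{s: None for s in states} for _ in range(n + 1)]

    for t in range(1, n + 1):
        for s in states:
            max_len = min(t, max(duration[s]))
            emit_score = 0.0
            for d in range(1, max_len + 1):
                # Grow the candidate segment seq[t-d:t] one base to the left.
                emit_score += math.log(emit[s][seq[t - d]])
                p_dur = duration[s].get(d, 0.0)
                if p_dur <= 0.0:
                    continue
                seg_score = math.log(p_dur) + emit_score
                start = t - d
                if start == 0:
                    # First segment of the parse: use the initial distribution.
                    if init.get(s, 0.0) > 0.0:
                        score = math.log(init[s]) + seg_score
                        if score > best[t][s]:
                            best[t][s] = score
                            back[t][s] = (start, None)
                    continue
                for p in states:
                    p_tr = trans[p].get(s, 0.0)
                    if p_tr <= 0.0 or best[start][p] == neg_inf:
                        continue
                    score = best[start][p] + math.log(p_tr) + seg_score
                    if score > best[t][s]:
                        best[t][s] = score
                        back[t][s] = (start, p)

    state = max(states, key=lambda s: best[n][s])
    score = best[n][state]
    if score == neg_inf:
        return score, []  # no valid parse under this model

    # Trace back from the end of the sequence to recover the segments.
    segments = []
    t = n
    while state is not None:
        start, prev = back[t][state]
        segments.append((state, start, t))
        t, state = start, prev
    segments.reverse()
    return score, segments

# Hypothetical toy model: all numbers are illustrative assumptions.
states = ["N", "E"]
init = {"N": 0.5, "E": 0.5}
trans = {"N": {"E": 1.0}, "E": {"N": 1.0}}  # no self-transitions; lengths come from `duration`
duration = {
    "N": {d: 0.1 for d in range(1, 11)},  # uniform over lengths 1..10
    "E": {3: 0.4, 6: 0.4, 9: 0.2},        # lengths restricted to multiples of 3
}
emit = {
    "N": {"A": 0.4, "T": 0.4, "C": 0.1, "G": 0.1},  # A/T-rich content model
    "E": {"A": 0.1, "T": 0.1, "C": 0.4, "G": 0.4},  # G/C-rich content model
}

score, segments = ghmm_viterbi("ATATATGCGCGCTATA", states, init, trans, duration, emit)
print(segments)
```

In a realistic gene finder the per-base content model would typically be replaced by higher-order Markov models, and signal submodels (for example, splice-site scores) would be added at segment boundaries; the sketch keeps only the duration-aware dynamic programming step.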

Metadata
Title: The Prediction of Human Genes in DNA Based on a Generalized Hidden Markov Model
Authors: Rui Guo, Ke Yan, Wei He, Jian Zhang
Copyright Year: 2016
DOI: https://doi.org/10.1007/978-3-319-46654-5_82
