Skip to main content

2016 | OriginalPaper | Buchkapitel

11. Enhanced Prediction of DNA-Binding Proteins and Classes

verfasst von : Huda A. Maghawry, Mostafa G. M. Mostafa, Mohamed H. Abdul-Aziz, Tarek F. Gharib

Erschienen in: Applications of Intelligent Optimization in Biology and Medicine

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Predicting DNA-binding proteins computationally based on proteins features is a very challenging process. This is due to the diversity of DNA-binding patterns and classes. Therefore, the accurate prediction of DNA-binding proteins and their classes is essential. This chapter proposes efficient protein feature representations for the prediction of DNA-binding proteins and their classes. The prediction results achieved are comparable or superior to previously published results using different benchmark datasets. A protein representation of sequence, psychochemical and structural features achieved accuracy improvement of about 7 % on average for the prediction of DNA-binding proteins. Moreover, a newly proposed structure-based protein representation that takes distance and angle patterns into accounts was evaluated for DNA-binding proteins prediction. It achieved when combined with other feature representations improvement in accuracy over previously published results about 7 and 12 % on average for the prediction of DNA-binding proteins and DNA-binding protein classes, respectively. All results were evaluated using two classifiers, Random Forest and SVM.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990) Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)
2.
Zurück zum Zitat Ahmad, S., Sarai, A.: Moment-based prediction of dna-binding proteins. J. Mol. Biol. 341, 65–71 (2004) Ahmad, S., Sarai, A.: Moment-based prediction of dna-binding proteins. J. Mol. Biol. 341, 65–71 (2004)
3.
Zurück zum Zitat Berman, J., Westbrook, H.M., Feng, Z., et al.: The protein data bank. Nucleic Acids Res. 28, 235–242 (2000) Berman, J., Westbrook, H.M., Feng, Z., et al.: The protein data bank. Nucleic Acids Res. 28, 235–242 (2000)
4.
Zurück zum Zitat Bhardwaj, N., Langlois, R.E., Zhao, G., Lu, H.: Kernel-based machine learning protocol for predicting dna-binding proteins. Nucleic Acids Res. 33, 6486–6493 (2005) Bhardwaj, N., Langlois, R.E., Zhao, G., Lu, H.: Kernel-based machine learning protocol for predicting dna-binding proteins. Nucleic Acids Res. 33, 6486–6493 (2005)
5.
Zurück zum Zitat Burkowski, F.J.: Structural Bioinformatics an Algorithmic Approach, Mathematical and Computational Biology Series. Chapman & Hall/CRC, Boca Raton (2009) Burkowski, F.J.: Structural Bioinformatics an Algorithmic Approach, Mathematical and Computational Biology Series. Chapman & Hall/CRC, Boca Raton (2009)
6.
Zurück zum Zitat Duda, R.O., Hart, P.E., Daivd, G.S.: Pattern Classification. Wiley, New York (2000) Duda, R.O., Hart, P.E., Daivd, G.S.: Pattern Classification. Wiley, New York (2000)
7.
Zurück zum Zitat Hall, M.A.: Correlation-based Feature Subset Selection for Machine Learning. Hamilton, New Zealand (1998) Hall, M.A.: Correlation-based Feature Subset Selection for Machine Learning. Hamilton, New Zealand (1998)
8.
Zurück zum Zitat Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, San Francisco (2008) Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, San Francisco (2008)
9.
Zurück zum Zitat Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105–132 (1982) Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105–132 (1982)
10.
Zurück zum Zitat Kawashima, S., Pokarowski, P., Pokarowska, M., Kolinski, A., Katayama, T., Kanehisa, M.: Aaindex: amino acid index database, progress report. Nucleic Acids Res. 36 (2008) Kawashima, S., Pokarowski, P., Pokarowska, M., Kolinski, A., Katayama, T., Kanehisa, M.: Aaindex: amino acid index database, progress report. Nucleic Acids Res. 36 (2008)
11.
Zurück zum Zitat Luscombe, N.M., Austin, S.E., Berman, H.M., Thornton, J.M.: An overview of the structures of proteindna complexes. Int. J. Comput. Sci. Manage. Res. 1 (2000) Luscombe, N.M., Austin, S.E., Berman, H.M., Thornton, J.M.: An overview of the structures of proteindna complexes. Int. J. Comput. Sci. Manage. Res. 1 (2000)
12.
Zurück zum Zitat Langlois, R.E., Lu, H.: Boosting the prediction and understanding of dna-binding domains from sequence. Nucleic Acids Res. 38, 3149–3158 (2010) Langlois, R.E., Lu, H.: Boosting the prediction and understanding of dna-binding domains from sequence. Nucleic Acids Res. 38, 3149–3158 (2010)
13.
Zurück zum Zitat Maghawry, H.A., Mostafa, M.G., Gharib, T.F.: A new protein structure representation for efficient protein. J. Comput. Biol. 21, 936–946 (2014) Maghawry, H.A., Mostafa, M.G., Gharib, T.F.: A new protein structure representation for efficient protein. J. Comput. Biol. 21, 936–946 (2014)
14.
Zurück zum Zitat Pires, D.E., Melo-Minardi, R.C., Santos, M.A., Silveira, C.H., Santoro, M.M., Meira, W.: Cutoff scanning matrix (csm): structural classification and function prediction by protein inter-residue distance patterns. BMC Genomics 12, S12 (2011) Pires, D.E., Melo-Minardi, R.C., Santos, M.A., Silveira, C.H., Santoro, M.M., Meira, W.: Cutoff scanning matrix (csm): structural classification and function prediction by protein inter-residue distance patterns. BMC Genomics 12, S12 (2011)
15.
Zurück zum Zitat Shanahan, H.P., Garcia, M.A., Jones, S., Thornton, J.M.: Identifying dna-binding proteins using structural motifs and the electrostatic potential. Nucleic Acids Res. 32 (2004) Shanahan, H.P., Garcia, M.A., Jones, S., Thornton, J.M.: Identifying dna-binding proteins using structural motifs and the electrostatic potential. Nucleic Acids Res. 32 (2004)
16.
Zurück zum Zitat Stawiski, E.W., Gregoret, L.M., Mandel-Gutfreund, Y.: Annotating nucleic acid-binding function based on protein structure. J. Mol. Biol. 326, 1065–1079 (2003) Stawiski, E.W., Gregoret, L.M., Mandel-Gutfreund, Y.: Annotating nucleic acid-binding function based on protein structure. J. Mol. Biol. 326, 1065–1079 (2003)
17.
Zurück zum Zitat Szaboova, A., Kuzelka1, O., Zelezny, F., Tolar, J.: Prediction of dna-binding proteins from relational features. Proteome Sci. 10 (2012) Szaboova, A., Kuzelka1, O., Zelezny, F., Tolar, J.: Prediction of dna-binding proteins from relational features. Proteome Sci. 10 (2012)
18.
Zurück zum Zitat Szilagyi, A., Skolnick, J.: Efficient prediction of nucleic acid binding function from low-resolution protein structures. J. Mol. Biol. 358, 922–933 (2006) Szilagyi, A., Skolnick, J.: Efficient prediction of nucleic acid binding function from low-resolution protein structures. J. Mol. Biol. 358, 922–933 (2006)
19.
Zurück zum Zitat Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann (2005) Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann (2005)
Metadaten
Titel
Enhanced Prediction of DNA-Binding Proteins and Classes
verfasst von
Huda A. Maghawry
Mostafa G. M. Mostafa
Mohamed H. Abdul-Aziz
Tarek F. Gharib
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-21212-8_11

Premium Partner