Skip to main content
Top

2016 | OriginalPaper | Chapter

11. Enhanced Prediction of DNA-Binding Proteins and Classes

Authors : Huda A. Maghawry, Mostafa G. M. Mostafa, Mohamed H. Abdul-Aziz, Tarek F. Gharib

Published in: Applications of Intelligent Optimization in Biology and Medicine

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Predicting DNA-binding proteins computationally based on proteins features is a very challenging process. This is due to the diversity of DNA-binding patterns and classes. Therefore, the accurate prediction of DNA-binding proteins and their classes is essential. This chapter proposes efficient protein feature representations for the prediction of DNA-binding proteins and their classes. The prediction results achieved are comparable or superior to previously published results using different benchmark datasets. A protein representation of sequence, psychochemical and structural features achieved accuracy improvement of about 7 % on average for the prediction of DNA-binding proteins. Moreover, a newly proposed structure-based protein representation that takes distance and angle patterns into accounts was evaluated for DNA-binding proteins prediction. It achieved when combined with other feature representations improvement in accuracy over previously published results about 7 and 12 % on average for the prediction of DNA-binding proteins and DNA-binding protein classes, respectively. All results were evaluated using two classifiers, Random Forest and SVM.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990) Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)
2.
go back to reference Ahmad, S., Sarai, A.: Moment-based prediction of dna-binding proteins. J. Mol. Biol. 341, 65–71 (2004) Ahmad, S., Sarai, A.: Moment-based prediction of dna-binding proteins. J. Mol. Biol. 341, 65–71 (2004)
3.
go back to reference Berman, J., Westbrook, H.M., Feng, Z., et al.: The protein data bank. Nucleic Acids Res. 28, 235–242 (2000) Berman, J., Westbrook, H.M., Feng, Z., et al.: The protein data bank. Nucleic Acids Res. 28, 235–242 (2000)
4.
go back to reference Bhardwaj, N., Langlois, R.E., Zhao, G., Lu, H.: Kernel-based machine learning protocol for predicting dna-binding proteins. Nucleic Acids Res. 33, 6486–6493 (2005) Bhardwaj, N., Langlois, R.E., Zhao, G., Lu, H.: Kernel-based machine learning protocol for predicting dna-binding proteins. Nucleic Acids Res. 33, 6486–6493 (2005)
5.
go back to reference Burkowski, F.J.: Structural Bioinformatics an Algorithmic Approach, Mathematical and Computational Biology Series. Chapman & Hall/CRC, Boca Raton (2009) Burkowski, F.J.: Structural Bioinformatics an Algorithmic Approach, Mathematical and Computational Biology Series. Chapman & Hall/CRC, Boca Raton (2009)
6.
go back to reference Duda, R.O., Hart, P.E., Daivd, G.S.: Pattern Classification. Wiley, New York (2000) Duda, R.O., Hart, P.E., Daivd, G.S.: Pattern Classification. Wiley, New York (2000)
7.
go back to reference Hall, M.A.: Correlation-based Feature Subset Selection for Machine Learning. Hamilton, New Zealand (1998) Hall, M.A.: Correlation-based Feature Subset Selection for Machine Learning. Hamilton, New Zealand (1998)
8.
go back to reference Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, San Francisco (2008) Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, San Francisco (2008)
9.
go back to reference Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105–132 (1982) Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105–132 (1982)
10.
go back to reference Kawashima, S., Pokarowski, P., Pokarowska, M., Kolinski, A., Katayama, T., Kanehisa, M.: Aaindex: amino acid index database, progress report. Nucleic Acids Res. 36 (2008) Kawashima, S., Pokarowski, P., Pokarowska, M., Kolinski, A., Katayama, T., Kanehisa, M.: Aaindex: amino acid index database, progress report. Nucleic Acids Res. 36 (2008)
11.
go back to reference Luscombe, N.M., Austin, S.E., Berman, H.M., Thornton, J.M.: An overview of the structures of proteindna complexes. Int. J. Comput. Sci. Manage. Res. 1 (2000) Luscombe, N.M., Austin, S.E., Berman, H.M., Thornton, J.M.: An overview of the structures of proteindna complexes. Int. J. Comput. Sci. Manage. Res. 1 (2000)
12.
go back to reference Langlois, R.E., Lu, H.: Boosting the prediction and understanding of dna-binding domains from sequence. Nucleic Acids Res. 38, 3149–3158 (2010) Langlois, R.E., Lu, H.: Boosting the prediction and understanding of dna-binding domains from sequence. Nucleic Acids Res. 38, 3149–3158 (2010)
13.
go back to reference Maghawry, H.A., Mostafa, M.G., Gharib, T.F.: A new protein structure representation for efficient protein. J. Comput. Biol. 21, 936–946 (2014) Maghawry, H.A., Mostafa, M.G., Gharib, T.F.: A new protein structure representation for efficient protein. J. Comput. Biol. 21, 936–946 (2014)
14.
go back to reference Pires, D.E., Melo-Minardi, R.C., Santos, M.A., Silveira, C.H., Santoro, M.M., Meira, W.: Cutoff scanning matrix (csm): structural classification and function prediction by protein inter-residue distance patterns. BMC Genomics 12, S12 (2011) Pires, D.E., Melo-Minardi, R.C., Santos, M.A., Silveira, C.H., Santoro, M.M., Meira, W.: Cutoff scanning matrix (csm): structural classification and function prediction by protein inter-residue distance patterns. BMC Genomics 12, S12 (2011)
15.
go back to reference Shanahan, H.P., Garcia, M.A., Jones, S., Thornton, J.M.: Identifying dna-binding proteins using structural motifs and the electrostatic potential. Nucleic Acids Res. 32 (2004) Shanahan, H.P., Garcia, M.A., Jones, S., Thornton, J.M.: Identifying dna-binding proteins using structural motifs and the electrostatic potential. Nucleic Acids Res. 32 (2004)
16.
go back to reference Stawiski, E.W., Gregoret, L.M., Mandel-Gutfreund, Y.: Annotating nucleic acid-binding function based on protein structure. J. Mol. Biol. 326, 1065–1079 (2003) Stawiski, E.W., Gregoret, L.M., Mandel-Gutfreund, Y.: Annotating nucleic acid-binding function based on protein structure. J. Mol. Biol. 326, 1065–1079 (2003)
17.
go back to reference Szaboova, A., Kuzelka1, O., Zelezny, F., Tolar, J.: Prediction of dna-binding proteins from relational features. Proteome Sci. 10 (2012) Szaboova, A., Kuzelka1, O., Zelezny, F., Tolar, J.: Prediction of dna-binding proteins from relational features. Proteome Sci. 10 (2012)
18.
go back to reference Szilagyi, A., Skolnick, J.: Efficient prediction of nucleic acid binding function from low-resolution protein structures. J. Mol. Biol. 358, 922–933 (2006) Szilagyi, A., Skolnick, J.: Efficient prediction of nucleic acid binding function from low-resolution protein structures. J. Mol. Biol. 358, 922–933 (2006)
19.
go back to reference Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann (2005) Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann (2005)
Metadata
Title
Enhanced Prediction of DNA-Binding Proteins and Classes
Authors
Huda A. Maghawry
Mostafa G. M. Mostafa
Mohamed H. Abdul-Aziz
Tarek F. Gharib
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-21212-8_11

Premium Partner