Skip to main content

2016 | OriginalPaper | Buchkapitel

A Hybrid Recurrent Neural Network/Dynamic Probabilistic Graphical Model Predictor of the Disulfide Bonding State of Cysteines from the Primary Structure of Proteins

verfasst von : Marco Bongini, Vincenzo Laveglia, Edmondo Trentin

Erschienen in: Artificial Neural Networks in Pattern Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Cysteines in a protein have a tendency to form mutual disulfide bonds. This affects the secondary and tertiary structure of the protein. Therefore, automatic prediction of the bonding state of cysteines from the primary structure of proteins has long been a relevant task in bioinformatics. The paper investigates the feasibility of a predictor based on a hybrid approach that combines the dynamic encoding capabilities of a recurrent autoencoder with the short-term/long-term dependencies modeling capabilities of a dynamic probabilistic graphical model (a dynamic extension of the hybrid random field). Results obtained using 1797 proteins from the May 2010 version of the Protein Data Bank show an average accuracy of \(85\,\%\) by relying only on the sub-sequences of the residue chains with no additional attributes (like global descriptors, or evolutionary information provided by multiple alignment).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Bearing in mind this sequential dynamics of the model, we will occasionally refer to the RNN as operating “over time”, such that it is fed with t-th residue at “time t”.
 
Literatur
1.
Zurück zum Zitat Baldi, P., Brunak, S.: Bioinformatics: The Machine Learning Approach. MIT Press, Cambridge (1998)MATH Baldi, P., Brunak, S.: Bioinformatics: The Machine Learning Approach. MIT Press, Cambridge (1998)MATH
2.
Zurück zum Zitat Bengio, Y.: Neural Networks for Speech and Sequence Recognition. International Thomson Computer Press, London (1996) Bengio, Y.: Neural Networks for Speech and Sequence Recognition. International Thomson Computer Press, London (1996)
3.
Zurück zum Zitat Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The protein data bank. Nucleic Acids Res. 28(1), 235–242 (2000)CrossRef Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The protein data bank. Nucleic Acids Res. 28(1), 235–242 (2000)CrossRef
4.
Zurück zum Zitat Trentin, E., Bongini, M.: Towards a novel probabilistic graphical model of sequential data: fundamental notions and a solution to the problem of parameter learning. In: Mana, N., Schwenker, F., Trentin, E. (eds.) ANNPR 2012. LNCS, vol. 7477, pp. 72–81. Springer, Heidelberg (2012)CrossRef Trentin, E., Bongini, M.: Towards a novel probabilistic graphical model of sequential data: fundamental notions and a solution to the problem of parameter learning. In: Mana, N., Schwenker, F., Trentin, E. (eds.) ANNPR 2012. LNCS, vol. 7477, pp. 72–81. Springer, Heidelberg (2012)CrossRef
5.
Zurück zum Zitat Ceroni, A., Passerini, A., Vullo, A., Frasconi, P.: DISULFIND: a disulfide bonding state and cysteine connectivity prediction server. Nucleic Acids Res. 34(Web-Server-Issue), 177–181 (2006) Ceroni, A., Passerini, A., Vullo, A., Frasconi, P.: DISULFIND: a disulfide bonding state and cysteine connectivity prediction server. Nucleic Acids Res. 34(Web-Server-Issue), 177–181 (2006)
6.
Zurück zum Zitat Chung, W.-C., Yang, C.-B., Hor, C.-Y.: An effective tuning method for cysteine state classification. In: Proceedings of the National Computer Symposium, Workshop on Algorithms and Bioinformatics, Taipei, Taiwan, 27–28 November 2009 Chung, W.-C., Yang, C.-B., Hor, C.-Y.: An effective tuning method for cysteine state classification. In: Proceedings of the National Computer Symposium, Workshop on Algorithms and Bioinformatics, Taipei, Taiwan, 27–28 November 2009
7.
Zurück zum Zitat Crick, F.: Central dogma of molecular biology. Nature 227(5258), 561–563 (1970)CrossRef Crick, F.: Central dogma of molecular biology. Nature 227(5258), 561–563 (1970)CrossRef
8.
Zurück zum Zitat Fariselli, P., Riccobelli, P., Casadio, R.: Role of evolutionary information in predicting the disulfide-bonding state of cysteine in proteins. Proteins 36(3), 340–346 (1999)CrossRef Fariselli, P., Riccobelli, P., Casadio, R.: Role of evolutionary information in predicting the disulfide-bonding state of cysteine in proteins. Proteins 36(3), 340–346 (1999)CrossRef
9.
Zurück zum Zitat Frasconi, P., Passerini, A., Vullo, A.: A two-stage svm architecture for predicting the disulfide bonding state of cysteines. In: Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 25–34 (2002) Frasconi, P., Passerini, A., Vullo, A.: A two-stage svm architecture for predicting the disulfide bonding state of cysteines. In: Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 25–34 (2002)
10.
Zurück zum Zitat Freno, A., Trentin, E.: Hybrid Random Fields: A Scalable Approach to Structure and Parameter Learning in Probabilistic Graphical Models. ISRL, vol. 15. Springer, Heidelberg (2011)MATH Freno, A., Trentin, E.: Hybrid Random Fields: A Scalable Approach to Structure and Parameter Learning in Probabilistic Graphical Models. ISRL, vol. 15. Springer, Heidelberg (2011)MATH
11.
Zurück zum Zitat Freno, A., Trentin, E., Gori, M.: A hybrid random field model for scalable statistical learning. Neural Netw. 22, 603–613 (2009)CrossRefMATH Freno, A., Trentin, E., Gori, M.: A hybrid random field model for scalable statistical learning. Neural Netw. 22, 603–613 (2009)CrossRefMATH
12.
Zurück zum Zitat Ross Kindermann, J., Snell, L.: Markov Random Fields and Their Applications. American Mathematical Society, Providence (1980) Ross Kindermann, J., Snell, L.: Markov Random Fields and Their Applications. American Mathematical Society, Providence (1980)
13.
Zurück zum Zitat Pearl, J.: Bayesian networks: a model of self-activated memory for evidential reasoning. In: Proceedings of the 7th Conference of the Cognitive Science Society, pp. 329–334. University of California, Irvine, August 1985 Pearl, J.: Bayesian networks: a model of self-activated memory for evidential reasoning. In: Proceedings of the 7th Conference of the Cognitive Science Society, pp. 329–334. University of California, Irvine, August 1985
14.
Zurück zum Zitat Lawrence, R.: Rabiner: a tutorial on hidden markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–286 (1989)CrossRef Lawrence, R.: Rabiner: a tutorial on hidden markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–286 (1989)CrossRef
15.
Zurück zum Zitat Savojardo, C., Fariselli, P., Alhamdoosh, M., Martelli, P.L., Pierleoni, A., Casadio, R.: Improving the prediction of disulfide bonds in eukaryotes with machine lear-ning methods and protein subcellular localization. Bioinformatics 27(16), 2224–2230 (2011)CrossRef Savojardo, C., Fariselli, P., Alhamdoosh, M., Martelli, P.L., Pierleoni, A., Casadio, R.: Improving the prediction of disulfide bonds in eukaryotes with machine lear-ning methods and protein subcellular localization. Bioinformatics 27(16), 2224–2230 (2011)CrossRef
16.
Zurück zum Zitat Shoombuatong, W., Traisathit, P., Prasitwattanaseree, S., Tayapiwatana, C., Cutler, R.W., Chaijaruwanich, J.: Prediction of the disulphide bonding state of cysteines in proteins using conditional random fields. IJDMB 5(4), 449–464 (2011)CrossRef Shoombuatong, W., Traisathit, P., Prasitwattanaseree, S., Tayapiwatana, C., Cutler, R.W., Chaijaruwanich, J.: Prediction of the disulphide bonding state of cysteines in proteins using conditional random fields. IJDMB 5(4), 449–464 (2011)CrossRef
17.
Zurück zum Zitat Singh, R.: A review of algorithmic techniques for disulfide-bond determination. Brief. Funct. Genomic Proteomic 7(2), 157–172 (2008)CrossRef Singh, R.: A review of algorithmic techniques for disulfide-bond determination. Brief. Funct. Genomic Proteomic 7(2), 157–172 (2008)CrossRef
18.
Zurück zum Zitat Trentin, E., Bongini, M.: Towards a novel probabilistic graphical model of sequential data: fundamental notions and a solution to the problem of parameter learning. In: Mana, N., Schwenker, F., Trentin, E. (eds.) ANNPR 2012. LNCS, vol. 7477, pp. 72–81. Springer, Heidelberg (2012)CrossRef Trentin, E., Bongini, M.: Towards a novel probabilistic graphical model of sequential data: fundamental notions and a solution to the problem of parameter learning. In: Mana, N., Schwenker, F., Trentin, E. (eds.) ANNPR 2012. LNCS, vol. 7477, pp. 72–81. Springer, Heidelberg (2012)CrossRef
19.
Zurück zum Zitat Chen, Y.C., Lin, Y.S., Lin, C.J., Hwang, J.K.: Prediction of the bonding states of cysteines using the support vector machines based on multiple feature vectors and cysteine state sequences. Proteins 55(4), 1036–1042 (2004)CrossRef Chen, Y.C., Lin, Y.S., Lin, C.J., Hwang, J.K.: Prediction of the bonding states of cysteines using the support vector machines based on multiple feature vectors and cysteine state sequences. Proteins 55(4), 1036–1042 (2004)CrossRef
Metadaten
Titel
A Hybrid Recurrent Neural Network/Dynamic Probabilistic Graphical Model Predictor of the Disulfide Bonding State of Cysteines from the Primary Structure of Proteins
verfasst von
Marco Bongini
Vincenzo Laveglia
Edmondo Trentin
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46182-3_22