Abstract
An improved multiple linear regression (MLR) method is proposed to predict a protein's secondary structural content based on its primary sequence. The amino acid composition, the autocorrelation function, and the interaction function of side-chain mass derived from the primary sequence are taken into account. The average absolute errors of prediction over 704 unrelated proteins with the jackknife test are 0.088, 0.081, and 0.059 with standard deviations 0.073, 0.066, and 0.055 for α-helix, β-sheet, and coil, respectively. That the sum of predicted secondary structure content should be close to 1.0 was introduced as a criterion to evaluate whether the prediction is acceptable. While only the predictions with the sum of predicted secondary structure content between 0.99 and 1.01 are accepted (about 11% of all proteins), the absolute errors are 0.058 for α-helix, 0.054 for β-sheet, and 0.045 for coil.
Similar content being viewed by others
REFERENCES
Chandonia, J. M. and Karplus, M. (1999). Proteins 35, 293–306.
Chou, P. Y. and Fasman, G. D. (1974). Biochemistry 13, 222–245.
Eisenhaber, F., Frömmel, C., and Argos, P. (1996a). Proteins 25, 169–179.
Eisenhaber, F., Imperiale, F., Argos, P., and Frömmel, C. (1996b). Proteins 25, 157–168.
Fauchere, J. L. and Pliska, V. (1983). Eur J.Mol.Chem. 18, 369–375.
Kabsch, W. and Sander, C. (1983). Biopolymers 22, 2577–2637.
Krigbaum, W. R. and Knutton, S. P. (1973). Proc.Natl.Acad.Sci.USA 70, 2809–2813.
Li, X. and Pan, X. M. (2001). Proteins 42, 1–5.
Lim, V. I. (1974). J.Mol.Biol. 88, 857–872.
Muskal, S. M. and Kim, S. H. (1992). J.Mol.Biol. 225, 713–727.
Orengo, C. A., Bray, J. E., Hubbard, T., Loconte, L., and Sillitoe, I. (1999). Proteins (Suppl) 3, 149–170.
Wang, Z. X. and Yuan, Z. (2000). Proteins 38, 165–175.
Zhang, C. T., Zhang, Z., and He, Z. (1996). J.Protein Chem. 15, 775–786.
Zhang, C. T., Zhang, Z., and He, Z. (1998b). J.Protein Chem. 17, 261–272.
Zhang, C. T., Lin, Z., Zhang, Z., and Yan, M. (1998a). Protein Eng. 11, 971–979.
Zhang, Z., Sun, Z. R., and Zhang, C. T. (2001). J.Theor.Biol. 208, 65–78.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Lin, Z., Pan, XM. Accurate Prediction of Protein Secondary Structural Content. J Protein Chem 20, 217–220 (2001). https://doi.org/10.1023/A:1010967008838
Published:
Issue Date:
DOI: https://doi.org/10.1023/A:1010967008838