ABSTRACT
We present a new machine learning approach for 3D-QSAR, the task of predicting binding affinities of molecules to target proteins based on 3D structure. Our approach predicts binding affinity by using regression on substructures discovered by relational learning. We make two contributions to the state-of-the-art. First, we use multiple-instance (MI) regression, which represents a molecule as a set of 3D conformations, to model activity. Second, the relational learning component employs the "Score As You Use" (SAYU) method to select substructures for their ability to improve the regression model. This is the first application of SAYU to multiple-instance, real-valued prediction. We evaluate our approach on three tasks and demonstrate that (i) SAYU outperforms standard coverage measures when selecting features for regression, (ii) the MI representation improves accuracy over standard single feature-vector encodings and (iii) combining SAYU with MI regression is more accurate for 3D-QSAR than either approach by itself.
- Brint, A., & Willett, P. (1987). Algorithms for the identification of three-dimensional maximal common substructures. J. Chemical Informatics and Computer Sciences, 27, 152--158. Google ScholarDigital Library
- Cheng, J., Hatzis, C., Hayashi, H., Krogel, M.-A., Morishita, S., Page, D., & Sese, J. (2002). KDD Cup 2001 report. SIGKDD Explorations, 3, 47--64. Google ScholarDigital Library
- Cramer, R. D., Patterson, D. E., & Bunce, J. D. (1988). Comparative molecular field analysis (ComFA). Effect on binding of steroids to carrier proteins. Journal of the American Chemical Society, 110, 5959--5967.Google ScholarCross Ref
- Davis, J., Burnside, E., Dutra, I. C., Page, D., & Costa, V. S. (2005). An integrated approach to learning Bayesian networks of rules. Proceedings of the 16th European Conference on Machine Learning (pp. 84--95). Springer. Google ScholarDigital Library
- Dietterich, T. G., Lathrop, R. H., & Lozano-Perez, T. (1997). Solving the multiple instance problem with axis-parallel rectangles. Artificial Intelligence, 89, 31--71. Google ScholarDigital Library
- Finn, P., Muggleton, S., Page, D., & Srinivasan, A. (1998). Pharmacophore discovery using the inductive logic programming system PROGOL. Machine Learning, 30: Special issue on applications and the knowledge discovery process, Kohavi and Provost (Ed.s), 241--270. Google ScholarDigital Library
- Fletcher, R. (1980). Practical methods of optimization, vol. 1: Unconstrained Optimization, chapter 3. John Wiley and Sons.Google Scholar
- Jain, A., Dietterich, T., Lathrop, R., Chapman, D., Critchlow, R., Bauer, B., Webster, T., & Lozano-Péérez, T. (1994a). Compass: a shape-based machine learning tool for drug design. Journal of Computer-Aided Molecular Design, 8, 635--652.Google ScholarCross Ref
- Jain, A., Koile, K., Bauer, B., & Chapman, D. (1994b). Compass: Predicting biological activities from molecular surface properties. Journal of Medicinal Chemistry, 37, 2315--2327.Google ScholarCross Ref
- Landwehr, N., Kersting, K., & Raedt, L. D. (2005). nFOIL: Integrating Naive Bayes and FOIL. Proceedings of the 20th National Conference on Artificial Intelligence (pp. 795--800). Google ScholarDigital Library
- Landwehr, N., Passerini, A., Raedt, L. D., & Frasconi, P. (2006). kFOIL: Learning simple relational kernels. Proceedings of the 21st National Conference on Artificial Intelligence. Google ScholarDigital Library
- Marchand-Geneste, N., Watson, K., Alsberg, B., & King, R. (2002). New approach to pharmacophore mapping and QSAR analysis using inductive logic programming. Application to thermolysin inhibitors and glycogen phosphorylase b inhibitors. Journal of Medicinal Chemistry, 45, 399--409.Google ScholarCross Ref
- Maron, O. (1998). Learning from ambiguity. Doctoral dissertation, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA. Google ScholarDigital Library
- Martin, Y., Bures, M., Danaher, E., DeLazzer, J., Lico, I., & Pavlik, P. (1993). A fast new approach to pharmacophore mapping and its application to dopaminergic and benzodiazepine agonists. J. Computer-Aided Molecular Design, 7, 83--102.Google ScholarCross Ref
- McGaughey, G. B., & Mewshaw, R. E. (1999). Application of comparative molecular field analysis to dopamine d2 partial agonists. Bioorganic Medical Chemistry, 7, 2453--2456.Google ScholarCross Ref
- Muggleton, S. (1995). Inverse entailment and Progol. New Generation Computing, 13, 245--286.Google ScholarDigital Library
- Ray, S., & Page, D. (2001). Multiple instance regression. Proceedings of the 18th International Conference on Machine Learning (pp. 425--432). Morgan Kaufmann. Google ScholarDigital Library
- Srinivasan, A., Page, D., Camacho, R., & King, R. (2006). Quantitative pharmacophore models with Inductive Logic Programming. Machine Learning Journal, 64, 65--90. Google ScholarDigital Library
- Vapnik, V. (1999). The nature of statistical learning theory. Statistics for Engineering and Information Science. Springer. Google ScholarDigital Library
- An integrated approach to feature invention and model construction for drug activity prediction
Recommendations
Drug Repurposing: Targeting mTOR Inhibitors for Anticancer Activity
CSBio '17: Proceedings of the 8th International Conference on Computational Systems-Biology and BioinformaticsIn the search of safer and more effective drugs while reducing costs and increasing productivity of novel drug discovery, scientists are changing their focus to an approach known as drug repurposing. This involves finding a new therapeutic effect of an ...
Computational prediction of drug-drug interactions based on drugs functional similarities
Display Omitted A similarity based method is proposed for the prediction of drug interactions.Drug-drug interactions may occur based on common biological targets.Similarity measures of drug interactions are based on drugs functional similarity.Over 250,...
Comments