An integrated approach to feature invention and model construction for drug activity prediction

Authors:
Jesse Davis, University of Wisconsin-Madison
Vítor Santos Costa, Universidade do Porto, Portugal
Soumya Ray, Oregon State University
David Page, University of Wisconsin-Madison

ICML '07: Proceedings of the 24th international conference on Machine learning, June 2007, Pages 217–224, https://doi.org/10.1145/1273496.1273524

Published: 20 June 2007

ABSTRACT

We present a new machine learning approach for 3D-QSAR, the task of predicting binding affinities of molecules to target proteins based on 3D structure. Our approach predicts binding affinity by using regression on substructures discovered by relational learning. We make two contributions to the state-of-the-art. First, we use multiple-instance (MI) regression, which represents a molecule as a set of 3D conformations, to model activity. Second, the relational learning component employs the "Score As You Use" (SAYU) method to select substructures for their ability to improve the regression model. This is the first application of SAYU to multiple-instance, real-valued prediction. We evaluate our approach on three tasks and demonstrate that (i) SAYU outperforms standard coverage measures when selecting features for regression, (ii) the MI representation improves accuracy over standard single feature-vector encodings and (iii) combining SAYU with MI regression is more accurate for 3D-QSAR than either approach by itself.
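The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, under several assumptions. Each molecule is a bag (a NumPy array with one row per conformation and one binary column per candidate substructure); the relational search that invents substructures is replaced by a fixed list of precomputed columns; the multiple-instance regressor alternates between least squares and re-choosing one instance per bag; and a bag's prediction is taken to be the maximum over its conformations. All function names (`fit_mi_regression`, `predict_bag`, `sayu_select`) are hypothetical and do not come from the paper.

```python
import numpy as np

def with_bias(X):
    """Append a constant column so the linear model has an intercept."""
    return np.hstack([X, np.ones((len(X), 1))])

def fit_mi_regression(bags, y, n_iters=10):
    """Fit a linear model when each example is a bag of instances.

    Alternates between (1) least squares on one chosen instance per bag and
    (2) re-choosing, per bag, the instance the current model fits best.
    """
    chosen = [0] * len(bags)  # assumption: start from each bag's first instance
    w = None
    for _ in range(n_iters):
        X = with_bias(np.array([bag[i] for bag, i in zip(bags, chosen)]))
        w, *_ = np.linalg.lstsq(X, y, rcond=None)
        new_chosen = [int(np.argmin((with_bias(bag) @ w - t) ** 2))
                      for bag, t in zip(bags, y)]
        if new_chosen == chosen:
            break
        chosen = new_chosen
    return w

def predict_bag(w, bag):
    """Assumption: a molecule is as active as its best-scoring conformation."""
    return float(np.max(with_bias(bag) @ w))

def rmse(w, bags, y):
    """Root-mean-squared error of bag-level predictions."""
    preds = np.array([predict_bag(w, bag) for bag in bags])
    return float(np.sqrt(np.mean((preds - y) ** 2)))

def sayu_select(train, y_train, tune, y_tune, candidates, tol=1e-9):
    """Keep a candidate feature only if it lowers tuning-set RMSE."""
    selected = []
    # Baseline with no features: always predict the mean training activity.
    best = float(np.sqrt(np.mean((np.mean(y_train) - y_tune) ** 2)))
    for j in candidates:
        trial = selected + [j]
        w = fit_mi_regression([b[:, trial] for b in train], y_train)
        err = rmse(w, [b[:, trial] for b in tune], y_tune)
        if err < best - tol:  # score the feature by how it changes the model
            selected, best = trial, err
    return selected, best
```

The point of the sketch is the scoring rule in `sayu_select`: a feature is judged by the change it causes in the regression model's held-out error, not by a coverage count computed in isolation.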

Index Terms

1. Applied computing
  1. Life and medical sciences
2. Computing methodologies

Published in

ICML '07: Proceedings of the 24th international conference on Machine learning
June 2007, 1233 pages
ISBN: 9781595937933
DOI: 10.1145/1273496
Editor: Zoubin Ghahramani, University of Cambridge, United Kingdom
Copyright © 2007 ACM
Publisher: Association for Computing Machinery, New York, NY, United States
Overall Acceptance Rate: 140 of 548 submissions, 26%