skip to main content
10.1145/1102351.1102464acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Learning structured prediction models: a large margin approach

Published:07 August 2005Publication History

ABSTRACT

We consider large margin estimation in a broad range of prediction models where inference involves solving combinatorial optimization problems, for example, weighted graph-cuts or matchings. Our goal is to learn parameters such that inference using the model reproduces correct answers on the training data. Our method relies on the expressive power of convex optimization problems to compactly capture inference or solution optimality in structured prediction models. Directly embedding this structure within the learning formulation produces concise convex problems for efficient estimation of very complex and diverse models. We describe experimental results on a matching task, disulfide connectivity prediction, showing significant improvements over state-of-the-art methods.

References

  1. Altschul, S., Madden, T., Schaffer, A., Zhang, A., Miller, W., & Lipman (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleid Acids Res., 25, 3389--3402.Google ScholarGoogle ScholarCross RefCross Ref
  2. Altun, Y., Tsochantaridis, I., & Hofmann, T. (2003). Hidden markov support vector machines. Proc. ICML.Google ScholarGoogle Scholar
  3. Baldi, P., Cheng, J., & Vullo, A. (2004). Large-scale prediction of disulphide bond connectivity. Proc. NIPS.Google ScholarGoogle Scholar
  4. Belongie, S., Malik, J., & Puzicha, J. (2002). Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell., 24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Berman, H., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T., Weissig, H., Shindyalov, I., & Bourne, P. (2000). The protein data bank.Google ScholarGoogle Scholar
  6. Boyd, S., & Vandenberghe, L. (2004). Convex optimization. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Collins, M. (2002). Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. Proc. EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Duda, R. O., Hart, P. E., & Stork, D. G. (2000). Pattern classification. New York: Wiley Interscience. 2nd edition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Edmonds, J. (1965). Maximum matching and a polyhedron with 0-1 vertices. Journal of Research at the National Bureau of Standards, 69B, 125 130.Google ScholarGoogle ScholarCross RefCross Ref
  10. Fariselli, P., & Casadio, R. (2001). Prediction of disulfide connectivity in proteins. Bioinformatics, 17, 957--964.Google ScholarGoogle ScholarCross RefCross Ref
  11. Heuberger, C. (2004). Inverse combinatorial optimization: A survey on problems, methods, and results. Journal of Combinatorial Optimization, 8.Google ScholarGoogle ScholarCross RefCross Ref
  12. Kabsch, W., & Sander, C. (1983). Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features.Google ScholarGoogle Scholar
  13. Schrijver, A. (2003). Combinatorial optimization: Polyhedra and efficiency. Springer.Google ScholarGoogle Scholar
  14. Taskar, B. (2004). Learning structured prediction models: A large margin approach. Doctoral dissertation, Stanford University. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Taskar, B., Chatalbashev, V., & Koller, D. (2004a). Learning associative Markov networks. Proc. ICML. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Taskar, B., Guestrin, C., & Koller, D. (2003). Max margin Markov networks. Proc. NIPS.Google ScholarGoogle Scholar
  17. Taskar, B., Klein, D., Collins, M., Koller, D., & Manning, C. (2004b). Max margin parsing. Proc. EMNLP.Google ScholarGoogle Scholar
  18. Tsochantaridis, I., Hofmann, T., Joachims, T., & Altun, Y. (2004). Support vector machine learning for interdependent and structured output spaces. Proc. ICML. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Vullo, A., & Frasconi, P. (2004). Dirsulfide connectivity prediction using recursive neural networks and evolutionary information. Bioinformatics, 20, 653--659. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICML '05: Proceedings of the 22nd international conference on Machine learning
    August 2005
    1113 pages
    ISBN:1595931805
    DOI:10.1145/1102351

    Copyright © 2005 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 7 August 2005

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate140of548submissions,26%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader