skip to main content
10.1145/1014052.1014089acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Incorporating prior knowledge with weighted margin support vector machines

Published:22 August 2004Publication History

ABSTRACT

Like many purely data-driven machine learning methods, Support Vector Machine (SVM) classifiers are learned exclusively from the evidence presented in the training dataset; thus a larger training dataset is required for better performance. In some applications, there might be human knowledge available that, in principle, could compensate for the lack of data. In this paper, we propose a simple generalization of SVM: Weighted Margin SVM (WMSVMs) that permits the incorporation of prior knowledge. We show that Sequential Minimal Optimization can be used in training WMSVM. We discuss the issues of incorporating prior knowledge using this rather general formulation. The experimental results show that the proposed methods of incorporating prior knowledge is effective.

References

  1. K. Bennett and A. Demiriz. Semi-supervised support vector machines. In Advances in Neural Information Processing Systems 11, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. C. Chang and C. Lin. LIBSVM: a library for support vector machines (version 2.3), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. G. Fung and O. Mangasarian. Semi-supervised support vector machines for unlabeled data classification. Optimization Methods and Software, 15, 2001.Google ScholarGoogle Scholar
  4. G. Fung, O. L. Mangasarian, and J. Shavlik. Knowledge-based support vector machine classifiers. In Data Mining Institute Technical Report 01-09, Nov 2001.Google ScholarGoogle Scholar
  5. G. H. Golub and C. F. V. Loan. Matrix Computation. Johns Hopkins Univ Press, 1996.Google ScholarGoogle Scholar
  6. W. R. Hersh, C. Buckley, T. J. Leone, and D. H. Hickam. Ohsumed: An interactive retrieval evaluation and new large test collection for research, 1994.Google ScholarGoogle Scholar
  7. T. Joachims. Text categorization with support vector machines: learning with many relevant features. In C. Nedellec and C. Rouveirol, editors, Proceedings of ECML-98, 10th European Conference on Machine Learning, number 1398, pages 137--142, Chemnitz, DE, 1998. Springer Verlag, Heidelberg, DE. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T. Joachims. Transductive inference for text classification using support vector machines. In Proc. 16th International Conf. on Machine Learning, pages 200--209. Morgan Kaufmann, San Francisco, CA, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Joachims. Learning To Classify Text Using Support Vector Machines. Kluwer Academic Publishers, Boston, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Keerthi, S. Shevade, C. Bhattacharyya, and K. Murthy. Improvements to platt's smo algorithm for svm classifier design, 1999.Google ScholarGoogle Scholar
  11. W. Lam and C. Ho. Using a generalized instance set for automatic text categorization. In W. B. Croft, A. Moffat, C. J. van Rijsbergen, R. Wilkinson, and J. Zobel, editors, Proceedings of SIGIR-98, 21st ACM International Conference on Research and Development in Information Retrieval, pages 81--89, Melbourne, AU, 1998. ACM Press, New York, US. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Platt. Fast training of support vector machines using sequential minimal optimization. In B. Scholkopf, C. Burges, and A. Smola, editors, Advances in kernel methods - support vector learning. MIT Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Schapire, M. Rochery, M. Rahim, and N. Gupta. Incorporating prior knowledge into boosting. In Proceedings of the Nineteenth International Conference In Machine Learning, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. B. Scholkopf, P. Simard, A. Smola, and V. Vapnik. Prior knowledge in support vector kernels. In B. Scholkopf, C. Burges, and A. Smola, editors, Advances in kernel methods - support vector learning. MIT Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Tong and D. Koller. Support vector machine active learning with applications to text classification. In P. Langley, editor, Proceedings of ICML-00, 17th International Conference on Machine Learning, pages 999--1006, Stanford, US, 2000. Morgan Kaufmann Publishers, San Francisco, US. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. V. N. Vapnik. Statistical learning theory. John Wiley & Sons, New York, NY, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. V. N. Vapnik. The nature of statistical learning theory, 2nd Edition. Springer Verlag, Heidelberg, DE, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Yang and X. Liu. A re-examination of text categorization methods. In M. A. Hearst, F. Gey, and R. Tong, editors, Proceedings of SIGIR-99, 22nd ACM International Conference on Research and Development in Information Retrieval, pages 42--49, Berkeley, US, 1999. ACM Press, New York, US. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Zhang and Y. Yang. Robustness of regularized linear classification methods in text categorization. In Proceedings of SIGIR-2003, 26st ACM International Conference on Research and Development in Information Retrieval. ACM Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Incorporating prior knowledge with weighted margin support vector machines

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
        August 2004
        874 pages
        ISBN:1581138881
        DOI:10.1145/1014052

        Copyright © 2004 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 22 August 2004

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader