DOI: 10.1145/1143844.1143971

Inference with the Universum

Published: 25 June 2006

ABSTRACT

In this paper we study a new framework introduced by Vapnik (1998) and Vapnik (2006) that is an alternative capacity concept to the large margin approach. In the particular case of binary classification, we are given a set of labeled examples, and a collection of "non-examples" that do not belong to either class of interest. This collection, called the Universum, allows one to encode prior knowledge by representing meaningful concepts in the same domain as the problem at hand. We describe an algorithm to leverage the Universum by maximizing the number of observed contradictions, and show experimentally that this approach delivers accuracy improvements over using labeled data alone.
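For a concrete picture of how a Universum term can enter a training objective, the sketch below assumes the commonly used Universum-SVM style formulation: labeled examples incur a standard hinge loss, while Universum points incur an ε-insensitive loss that pulls the decision function toward zero on them, so the classifier stays non-committal on the "non-examples." This is an illustrative NumPy sketch under those assumptions, not the exact optimization solved in the paper; the names fit_universum, C, C_u, and eps are placeholders introduced here.

```python
# Sketch of a linear Universum-style classifier (an assumed formulation, not
# necessarily the paper's exact algorithm): hinge loss on labeled points plus an
# eps-insensitive loss that keeps the decision function near zero on Universum
# points. Trained by plain (sub)gradient descent for illustration only.
import numpy as np

def universum_objective(w, b, X, y, X_u, C=1.0, C_u=0.5, eps=0.1):
    """Regularizer + hinge loss on labeled data + eps-insensitive loss on Universum."""
    f_lab = X @ w + b
    f_uni = X_u @ w + b
    hinge = np.maximum(0.0, 1.0 - y * f_lab)          # labeled: enforce a margin
    near_zero = np.maximum(0.0, np.abs(f_uni) - eps)  # Universum: stay near the boundary
    return 0.5 * w @ w + C * hinge.sum() + C_u * near_zero.sum()

def fit_universum(X, y, X_u, C=1.0, C_u=0.5, eps=0.1, lr=1e-3, n_iter=2000, seed=0):
    rng = np.random.default_rng(seed)
    w = rng.normal(scale=0.01, size=X.shape[1])
    b = 0.0
    for _ in range(n_iter):
        f_lab = X @ w + b
        f_uni = X_u @ w + b
        # Subgradient of the hinge term: active where the margin is violated.
        active = y * f_lab < 1.0
        g_w = w - C * (y[active, None] * X[active]).sum(axis=0)
        g_b = -C * y[active].sum()
        # Subgradient of the eps-insensitive Universum term.
        out = np.abs(f_uni) > eps
        sgn = np.sign(f_uni[out])
        g_w += C_u * (sgn[:, None] * X_u[out]).sum(axis=0)
        g_b += C_u * sgn.sum()
        w -= lr * g_w
        b -= lr * g_b
    return w, b

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X_pos = rng.normal(loc=[2, 0], scale=1.0, size=(50, 2))
    X_neg = rng.normal(loc=[-2, 0], scale=1.0, size=(50, 2))
    X = np.vstack([X_pos, X_neg])
    y = np.array([1.0] * 50 + [-1.0] * 50)
    X_u = rng.normal(loc=[0, 3], scale=1.0, size=(30, 2))  # synthetic "non-examples"
    w, b = fit_universum(X, y, X_u)
    print(f"training accuracy: {(np.sign(X @ w + b) == y).mean():.2f}")
    print(f"objective: {universum_objective(w, b, X, y, X_u):.2f}")
```

The ε-insensitive term is one simple proxy for the paper's goal of maximizing contradictions on the Universum: the more Universum points fall inside the [-ε, ε] band of the decision function, the less the chosen hyperplane commits to either class on them.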

References

  1. Baird, H. (1990). Document image defect models. Proceedings, IAPR Workshop on Syntactic and Structural Pattern Recognition (pp. 38--46). Murray Hill, NJ.
  2. Bernardo, J. M., & Smith, A. F. M. (1994). Bayesian theory. John Wiley and Sons.
  3. Boser, B. E., Guyon, I. M., & Vapnik, V. (1992). A training algorithm for optimal margin classifiers. Proceedings of the 5th Annual ACM Workshop on Computational Learning Theory (pp. 144--152). Pittsburgh, PA: ACM Press.
  4. Grandvalet, Y., Canu, S., & Boucheron, S. (1997). Noise injection: Theoretical prospects. Neural Computation, 9, 1093--1108.
  5. Leen, T. K. (1995). From data distributions to regularization in invariant learning. Advances in Neural Information Processing Systems 7. Cambridge, MA: MIT Press.
  6. Lewis, D. D., Yang, Y., Rose, T., & Li, F. (2004). RCV1: A new benchmark collection for text categorization research. Journal of Machine Learning Research, 5, 361--397.
  7. Mangasarian, O. L. (1965). Linear and nonlinear separation of patterns by linear programming. Operations Research, 13, 444--452.
  8. Niyogi, P., Girosi, F., & Poggio, T. (1998). Incorporating prior information in machine learning by creating virtual examples. Proceedings of the IEEE, 86, 2196--2209.
  9. Schölkopf, B., Burges, C., & Vapnik, V. (1996). Incorporating invariances in support vector learning machines. Artificial Neural Networks --- ICANN'96 (pp. 47--52). Berlin: Springer Lecture Notes in Computer Science, Vol. 1112.
  10. Shawe-Taylor, J., Bartlett, P. L., Williamson, R. C., & Anthony, M. (1998). Structural risk minimization over data-dependent hierarchies. IEEE Transactions on Information Theory, 44, 1926--1940.
  11. Vapnik, V. (2006). Estimation of dependences based on empirical data. Berlin: Springer-Verlag. 2nd edition.
  12. Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.
  13. Zhong, P., & Fukushima, M. (2006). A new multi-class support vector algorithm. Optimization Methods and Software, 21, 359--372.

Published in

ICML '06: Proceedings of the 23rd international conference on Machine learning
June 2006, 1154 pages
ISBN: 1595933832
DOI: 10.1145/1143844

Copyright © 2006 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher: Association for Computing Machinery, New York, NY, United States

Published: 25 June 2006


Acceptance rate: ICML '06 accepted 140 of 548 submissions (26%).
