ABSTRACT
We address classification problems in which the training instances are governed by a distribution that may differ arbitrarily from the test distribution; such problems are referred to as classification under covariate shift. We derive a solution that is purely discriminative: neither the training nor the test distribution is modeled explicitly. We formulate the general problem of learning under covariate shift as an integrated optimization problem and derive a kernel logistic regression classifier for differing training and test distributions.
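The discriminative idea in the abstract can be illustrated with a simpler two-stage approximation: first train a classifier to distinguish training inputs from test inputs, use its odds as importance weights (since p_test(x)/p_train(x) is proportional to p(s=test|x)/p(s=train|x)), then fit a weighted logistic regression on the labeled training data. The sketch below, using only NumPy, follows this two-stage scheme; note that the paper itself integrates both steps into one optimization, and all function and variable names here are our own.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logreg(X, y, lr=0.1, steps=2000, l2=1e-3, sample_weight=None):
    """Plain gradient-descent logistic regression; y in {0, 1}."""
    w = np.zeros(X.shape[1])
    b = 0.0
    sw = np.ones(len(y)) if sample_weight is None else sample_weight
    for _ in range(steps):
        p = sigmoid(X @ w + b)
        g = sw * (p - y)                       # per-example (weighted) gradient signal
        w -= lr * (X.T @ g / len(y) + l2 * w)
        b -= lr * g.mean()
    return w, b

# Covariate shift: training inputs come from a different region than test inputs,
# but the labeling rule (here: y = 1 iff x > 0) is the same.
X_train = rng.normal(-1.0, 1.0, size=(200, 1))
y_train = (X_train[:, 0] > 0).astype(float)
X_test = rng.normal(+1.0, 1.0, size=(200, 1))

# Step 1: discriminate training vs. test inputs (label 0 = train, 1 = test).
X_all = np.vstack([X_train, X_test])
s = np.concatenate([np.zeros(len(X_train)), np.ones(len(X_test))])
w_s, b_s = fit_logreg(X_all, s)

# Step 2: importance weights proportional to p(s=1|x) / p(s=0|x).
p1 = sigmoid(X_train @ w_s + b_s)
weights = p1 / (1.0 - p1)
weights *= len(weights) / weights.sum()        # normalize to mean 1

# Step 3: weighted logistic regression on the labeled training data.
w, b = fit_logreg(X_train, y_train, sample_weight=weights)
```

Training examples lying in the region where test inputs are dense receive large weights, so the final classifier is tuned to the test distribution even though that distribution is never modeled as a density.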
Discriminative learning for differing training and test distributions