skip to main content
10.1145/1273496.1273507acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Discriminative learning for differing training and test distributions

Authors Info & Claims
Published:20 June 2007Publication History

ABSTRACT

We address classification problems for which the training instances are governed by a distribution that is allowed to differ arbitrarily from the test distribution---problems also referred to as classification under covariate shift. We derive a solution that is purely discriminative: neither training nor test distribution are modeled explicitly. We formulate the general problem of learning under covariate shift as an integrated optimization problem. We derive a kernel logistic regression classifier for differing training and test distributions.

References

  1. Bickel, S., & Scheffer, T. (2007). Dirichlet-enhanced spam filtering based on biased samples. Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  2. Dudik, M., Schapire, R., & Phillips, S. (2005). Correcting sample selection bias in maximum entropy density estimation. Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  3. Elkan, C. (2001). The foundations of cost-sensitive learning. Proceedings of the International Joint Conference on Artificial Intellligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Heckman, J. (1979). Sample selection bias as a specification error. Econometrica, 47, 153--161.Google ScholarGoogle ScholarCross RefCross Ref
  5. Huang, J., Smola, A., Gretton, A., Borgwardt, K., & Schöölkopf, B. (2007). Correcting sample selection bias by unlabeled data. Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  6. Japkowicz, N., & Stephen, S. (2002). The class imbalance problem: A systematic study. Intelligent Data Analysis, 6, 429--449. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Manski, C., & Lerman, S. (1977). The estimation of choice probabilities from choice based samples. Econometrica, 45, 1977--1988.Google ScholarGoogle ScholarCross RefCross Ref
  8. Shimodaira, H. (2000). Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90, 227--244.Google ScholarGoogle ScholarCross RefCross Ref
  9. Silverman, B. W. (1986). Density estimation for statistics and data analysis. Chapman & Hall, London.Google ScholarGoogle Scholar
  10. Sugiyama, M., & Müüller, K.-R. (2005). Model selection under covariate shift. Proceedings of the International Conference on Artificial Neural Networks. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Xue, Y., Liao, X., Carin, L., & Krishnapuram, B. (2007). Multi-task learning for classification with dirichlet process priors. Journal of Machine Learning Research, 8, 35--63. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Zadrozny, B. (2004). Learning and evaluating classifiers under sample selection bias. Proceedings of the International Conference on Machine Learning. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Discriminative learning for differing training and test distributions

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICML '07: Proceedings of the 24th international conference on Machine learning
        June 2007
        1233 pages
        ISBN:9781595937933
        DOI:10.1145/1273496

        Copyright © 2007 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 June 2007

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate140of548submissions,26%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader