Skip to main content
Top
Published in: International Journal of Computer Vision 1-2/2014

01-08-2014

Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition

Authors: Boqing Gong, Kristen Grauman, Fei Sha

Published in: International Journal of Computer Vision | Issue 1-2/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Domain adaptation aims to correct the mismatch in statistical properties between the source domain on which a classifier is trained and the target domain to which the classifier is to be applied. In this paper, we address the challenging scenario of unsupervised domain adaptation, where the target domain does not provide any annotated data to assist in adapting the classifier. Our strategy is to learn robust features which are resilient to the mismatch across domains and then use them to construct classifiers that will perform well on the target domain. To this end, we propose novel kernel learning approaches to infer such features for adaptation. Concretely, we explore two closely related directions. In the first direction, we propose unsupervised learning of a geodesic flow kernel (GFK). The GFK summarizes the inner products in an infinite sequence of feature subspaces that smoothly interpolates between the source and target domains. In the second direction, we propose supervised learning of a kernel that discriminatively combines multiple base GFKs. Those base kernels model the source and the target domains at fine-grained granularities. In particular, each base kernel pivots on a different set of landmarks—the most useful data instances that reveal the similarity between the source and the target domains, thus bridging them to achieve adaptation. Our approaches are computationally convenient, automatically infer important hyper-parameters, and are capable of learning features and classifiers discriminatively without demanding labeled data from the target domain. In extensive empirical studies on standard benchmark recognition datasets, our appraches yield state-of-the-art results compared to a variety of competing methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
Note that we assume the set of possible labels are the same across domains.
 
2
A similar idea was pursued in Gopalan et al. (2011). We contrast it to our work in Sect. 5.
 
3
The unit-ball condition allows the difference to be represented as a metric in the form of Eq. (13) and the universality ensures that the means are injective such that the difference in the means is zero if and only if the two distributions are the same. For more detailed theoretical analysis, please refer to Gretton et al. (2006).
 
4
Note that we do not require the landmarks to be i.i.d samples from \(P_S(X)\)—they only need to be representative samples of \(P_L(X)\).
 
5
In the supplementary material for our previously published work (Gong et al. 2012), we report our results on 31 categories common to Amazon, Webcam and DSLR, to compare directly to published results from the literature (Saenko et al. 2010; Kulis et al. 2011; Gopalan et al. 2011). Despite occasional discrepancies between the published results and the results obtained by our own experimentation on these 31 categories, they demonstrate the same trend—that our proposed methods significantly outperform competing approaches.
 
7
We did not use dslr as the source domain in these experiments as it is too small to select landmarks.
 
Literature
go back to reference Ando, R., & Zhang, T. (2005). A framework for learning predictive structures from multiple tasks and unlabeled data. JMLR, 6, 1817–1853.MATHMathSciNet Ando, R., & Zhang, T. (2005). A framework for learning predictive structures from multiple tasks and unlabeled data. JMLR, 6, 1817–1853.MATHMathSciNet
go back to reference Bay, H., Tuytelaars, T., & Van Gool, L. (2006). SURF: Speeded up robust features. In ECCV. Bay, H., Tuytelaars, T., & Van Gool, L. (2006). SURF: Speeded up robust features. In ECCV.
go back to reference Ben-David, S., Blitzer, J., Crammer, K., Kulesza, A., Pereira, F., & Wortman, J. (2010). A theory of learning from different domains. Machine Learning, 79, 151–175.CrossRefMathSciNet Ben-David, S., Blitzer, J., Crammer, K., Kulesza, A., Pereira, F., & Wortman, J. (2010). A theory of learning from different domains. Machine Learning, 79, 151–175.CrossRefMathSciNet
go back to reference Ben-David, S., Blitzer, J., Crammer, K., & Pereira, F. (2007). Analysis of representations for domain adaptation. In NIPS. Ben-David, S., Blitzer, J., Crammer, K., & Pereira, F. (2007). Analysis of representations for domain adaptation. In NIPS.
go back to reference Bergamo, A., & Torresani, L. (2010). Exploiting weakly-labeled web images to improve object classification: a domain adaptation approach. In NIPS. Bergamo, A., & Torresani, L. (2010). Exploiting weakly-labeled web images to improve object classification: a domain adaptation approach. In NIPS.
go back to reference Blitzer, J., Dredze, M., & Pereira, F. (2007). Biographies, Bolly-wood, boomboxes and blenders: Domain adaptation for sentiment classification. In ACL. Blitzer, J., Dredze, M., & Pereira, F. (2007). Biographies, Bolly-wood, boomboxes and blenders: Domain adaptation for sentiment classification. In ACL.
go back to reference Blitzer, J., Foster, D., & Kakade, S. (2011). Domain adaptation with coupled subspaces. In AISTATS. Blitzer, J., Foster, D., & Kakade, S. (2011). Domain adaptation with coupled subspaces. In AISTATS.
go back to reference Blitzer, J., McDonald, R., & Pereira, F. (2006). Domain adaptation with structural correspondence learning. In EMNLP. Blitzer, J., McDonald, R., & Pereira, F. (2006). Domain adaptation with structural correspondence learning. In EMNLP.
go back to reference Bruzzone, L., & Marconcini, M. (2010). Domain adaptation problems: A DASVM classification technique and a circular validation strategy. IEEE PAMI, 32(5), 770–787.CrossRef Bruzzone, L., & Marconcini, M. (2010). Domain adaptation problems: A DASVM classification technique and a circular validation strategy. IEEE PAMI, 32(5), 770–787.CrossRef
go back to reference Chen, M., Weinberger, K., & Blitzer, J. (2011). Co-training for domain adaptation. In NIPS. Chen, M., Weinberger, K., & Blitzer, J. (2011). Co-training for domain adaptation. In NIPS.
go back to reference Daumé, H., III. (2007). Frustratingly easy domain adaptation. In ACL. Daumé, H., III. (2007). Frustratingly easy domain adaptation. In ACL.
go back to reference Daumé, H., Kumar, A., & Saha, A. (2010). Co-regularization based semi-supervised domain adaptation. In NIPS. Daumé, H., Kumar, A., & Saha, A. (2010). Co-regularization based semi-supervised domain adaptation. In NIPS.
go back to reference Daumé, H, I. I. I., & Marcu, D. (2006). Domain adaptation for statistical classifiers. Journal of Artificial Intelligence Research, 26(1), 101–126.MATHMathSciNet Daumé, H, I. I. I., & Marcu, D. (2006). Domain adaptation for statistical classifiers. Journal of Artificial Intelligence Research, 26(1), 101–126.MATHMathSciNet
go back to reference Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., & Fei-Fei, L. (2009). ImageNet: A large-scale hierarchical image database. In CVPR. Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., & Fei-Fei, L. (2009). ImageNet: A large-scale hierarchical image database. In CVPR.
go back to reference Dollár, P., Wojek, C., Schiele, B., & Perona, P. (2009). Pedestrian detection: A benchmark. In CVPR. Dollár, P., Wojek, C., Schiele, B., & Perona, P. (2009). Pedestrian detection: A benchmark. In CVPR.
go back to reference Dredze, M., & Crammer, K. (2008). Online methods for multi-domain learning and adaptation. In Proceedings of the conference on empirical methods in natural language processing (EMNLP ’08) (pp. 689–697). Dredze, M., & Crammer, K. (2008). Online methods for multi-domain learning and adaptation. In Proceedings of the conference on empirical methods in natural language processing (EMNLP ’08) (pp. 689–697).
go back to reference Duan, L., Tsang, I., Xu, D., & Maybank, S. (2009). Domain transfer SVM for video concept detection. In CVPR. Duan, L., Tsang, I., Xu, D., & Maybank, S. (2009). Domain transfer SVM for video concept detection. In CVPR.
go back to reference Duan, L., Xu, D., & Tsang, I. (2012). Domain adaptation from multiple sources: A domain-dependent regularization approach. IEEE Transactions on Neural Networks and Learning Systems, 23(3), 504–518.CrossRef Duan, L., Xu, D., & Tsang, I. (2012). Domain adaptation from multiple sources: A domain-dependent regularization approach. IEEE Transactions on Neural Networks and Learning Systems, 23(3), 504–518.CrossRef
go back to reference Duan, L., Xu, D., Tsang, I., & Luo, J. (2010) Visual event recognition in videos by learning from web data. In CVPR. Duan, L., Xu, D., Tsang, I., & Luo, J. (2010) Visual event recognition in videos by learning from web data. In CVPR.
go back to reference Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J., Zisserman, A. (2007). The PASCAL visual object classes, challenge 2007. Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J., Zisserman, A. (2007). The PASCAL visual object classes, challenge 2007.
go back to reference Fei-Fei, L., Fergus, R., & Perona, P. (2007). Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. Comp Vis & Img Under, 106(1), 59–70.CrossRef Fei-Fei, L., Fergus, R., & Perona, P. (2007). Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. Comp Vis & Img Under, 106(1), 59–70.CrossRef
go back to reference Gong, B., Grauman, K., & Sha, F. (2013a). Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation. In ICML. Gong, B., Grauman, K., & Sha, F. (2013a). Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation. In ICML.
go back to reference Gong, B., Grauman, K., & Sha, F. (2013b). Reshaping visual datasets for domain adaptation. In NIPS. Gong, B., Grauman, K., & Sha, F. (2013b). Reshaping visual datasets for domain adaptation. In NIPS.
go back to reference Gong, B., Shi, Y., Sha, F., & Grauman, K. (2012). Geodesic flow kernel for unsupervised domain adaptation. In CVPR. Gong, B., Shi, Y., Sha, F., & Grauman, K. (2012). Geodesic flow kernel for unsupervised domain adaptation. In CVPR.
go back to reference Gopalan, R. (2013). Learning cross-domain information transfer for location recognition and clustering. In CVPR. Gopalan, R. (2013). Learning cross-domain information transfer for location recognition and clustering. In CVPR.
go back to reference Gopalan, R., Li, R., & Chellappa, R. (2011). Domain adaptation for object recognition: An unsupervised approach. In ICCV. Gopalan, R., Li, R., & Chellappa, R. (2011). Domain adaptation for object recognition: An unsupervised approach. In ICCV.
go back to reference Gretton, A., Borgwardt, K., Rasch, M., Schölkopf, B., Smola, A. (2006). A kernel method for the two-sample-problem. In NIPS. Gretton, A., Borgwardt, K., Rasch, M., Schölkopf, B., Smola, A. (2006). A kernel method for the two-sample-problem. In NIPS.
go back to reference Gretton, A., Smola, A., Huang, J., Schmittfull, M., Borgwardt, K., & Scholkopf, B. (2009). Covariate shift by kernel mean matching. In J. Quionero-Candela, M. Sugiyama, A. Schwaighofer, & N. Lawrence (Eds.), Dataset shift in machine learning. Cambridge: MIT Press. Gretton, A., Smola, A., Huang, J., Schmittfull, M., Borgwardt, K., & Scholkopf, B. (2009). Covariate shift by kernel mean matching. In J. Quionero-Candela, M. Sugiyama, A. Schwaighofer, & N. Lawrence (Eds.), Dataset shift in machine learning. Cambridge: MIT Press.
go back to reference Griffin, G., Holub, A., & Perona, P. (2007). Caltech-256 object category dataset. Tech. rep., Caltech. Griffin, G., Holub, A., & Perona, P. (2007). Caltech-256 object category dataset. Tech. rep., Caltech.
go back to reference Ham, J., Lee, D. D., Mika, S., Schölkopf, B. (2004). A kernel view of the dimensionality reduction of manifolds. In ICML. Ham, J., Lee, D. D., Mika, S., Schölkopf, B. (2004). A kernel view of the dimensionality reduction of manifolds. In ICML.
go back to reference Hamm, J., & Lee, D. (2008). Grassmann discriminant analysis: A unifying view on subspace-based learning. In ICML. Hamm, J., & Lee, D. (2008). Grassmann discriminant analysis: A unifying view on subspace-based learning. In ICML.
go back to reference Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning. Berlin: Springer.CrossRefMATH Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning. Berlin: Springer.CrossRefMATH
go back to reference Huang, J., Smola, A., Gretton, A., Borgwardt, K., & Scholkopf, B. (2006). Correcting sample selection bias by unlabeled data. In NIPS. Huang, J., Smola, A., Gretton, A., Borgwardt, K., & Scholkopf, B. (2006). Correcting sample selection bias by unlabeled data. In NIPS.
go back to reference Jain, V., & Learned-Miller, E. (2011). Online domain adaptation of a pre-trained cascade of classifiers. In CVPR. Jain, V., & Learned-Miller, E. (2011). Online domain adaptation of a pre-trained cascade of classifiers. In CVPR.
go back to reference Kulis, B,, Saenko, K., & Darrell, T. (2011). What you saw is not what you get: Domain adaptation using asymmetric kernel transforms. In CVPR. Kulis, B,, Saenko, K., & Darrell, T. (2011). What you saw is not what you get: Domain adaptation using asymmetric kernel transforms. In CVPR.
go back to reference Lanckriet, G. R. G., Cristianini, N., Bartlett, P., Ghaoui, L. E., & Jordan, M. (2004). Learning the kernel matrix with semidefinite programming. JMLR, 5, 27–72.MATH Lanckriet, G. R. G., Cristianini, N., Bartlett, P., Ghaoui, L. E., & Jordan, M. (2004). Learning the kernel matrix with semidefinite programming. JMLR, 5, 27–72.MATH
go back to reference Leggetter, C., & Woodland, P. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9(2), 171–185.CrossRef Leggetter, C., & Woodland, P. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9(2), 171–185.CrossRef
go back to reference Li, R., & Zickler, T. (2012). Discriminative virtual views for cross-view action recognition. In CVPR. Li, R., & Zickler, T. (2012). Discriminative virtual views for cross-view action recognition. In CVPR.
go back to reference Mansour, Y., Mohri, M., & Rostamizadeh, A. (2009a). Domain adaptation: Learning bounds and algorithms. Arxiv, preprint arXiv:09023430. Mansour, Y., Mohri, M., & Rostamizadeh, A. (2009a). Domain adaptation: Learning bounds and algorithms. Arxiv, preprint arXiv:09023430.
go back to reference Mansour, Y., Mohri, M., & Rostamizadeh, A. (2009b). Multiple source adaptation and the rényi divergence. In UAI. Mansour, Y., Mohri, M., & Rostamizadeh, A. (2009b). Multiple source adaptation and the rényi divergence. In UAI.
go back to reference Pan, S., Tsang, I., Kwok, J., & Yang, Q. (2009). Domain adaptation via transfer component analysis. IEEE Trans Neural Nets, 99, 1–12. Pan, S., Tsang, I., Kwok, J., & Yang, Q. (2009). Domain adaptation via transfer component analysis. IEEE Trans Neural Nets, 99, 1–12.
go back to reference Pan, S. J., & Yang, Q. (2010). A survey on transfer learning. IEEE Trans Knowledge and Data Engineering, 22(10), 1345–1359. Pan, S. J., & Yang, Q. (2010). A survey on transfer learning. IEEE Trans Knowledge and Data Engineering, 22(10), 1345–1359.
go back to reference Perronnin, F., Sánchez, J., & Liu, Y. (2010). Large-scale image categorization with explicit data embedding. In CVPR. Perronnin, F., Sánchez, J., & Liu, Y. (2010). Large-scale image categorization with explicit data embedding. In CVPR.
go back to reference Roweis, S. T., & Saul, L. K. (2000). Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500), 2323–2326.CrossRef Roweis, S. T., & Saul, L. K. (2000). Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500), 2323–2326.CrossRef
go back to reference Russell, B. C., Torralba, A., Murphy, K. P., & Freeman, W. T. (2008). LabelMe: A database and web-based tool for image annotation. IJCV, 77, 157–173.CrossRef Russell, B. C., Torralba, A., Murphy, K. P., & Freeman, W. T. (2008). LabelMe: A database and web-based tool for image annotation. IJCV, 77, 157–173.CrossRef
go back to reference Saenko, K., Kulis, B., Fritz, M., & Darrell, T. (2010). Adapting visual category models to new domains. In ECCV. Saenko, K., Kulis, B., Fritz, M., & Darrell, T. (2010). Adapting visual category models to new domains. In ECCV.
go back to reference Shi, Y,, & Sha, F. (2012). Information-theoretical learning of discriminative clusters for unsupervised domain adaptation. In ICML. Shi, Y,, & Sha, F. (2012). Information-theoretical learning of discriminative clusters for unsupervised domain adaptation. In ICML.
go back to reference Shimodaira, H. (2000). Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90(2), 227–244.CrossRefMATHMathSciNet Shimodaira, H. (2000). Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90(2), 227–244.CrossRefMATHMathSciNet
go back to reference Torralba, A., & Efros, A. (2011). Unbiased look at dataset bias. In CVPR. Torralba, A., & Efros, A. (2011). Unbiased look at dataset bias. In CVPR.
go back to reference Vedaldi, A., Gulshan, V., Varma, M., & Zisserman, A. (2009). Multiple kernels for object detection. In ICCV. Vedaldi, A., Gulshan, V., Varma, M., & Zisserman, A. (2009). Multiple kernels for object detection. In ICCV.
go back to reference Wang, M., & Wang, X. (2011). Automatic adaptation of a generic pedestrian detector to a specific traffic scene. In CVPR. Wang, M., & Wang, X. (2011). Automatic adaptation of a generic pedestrian detector to a specific traffic scene. In CVPR.
go back to reference Weinberger, K. Q., & Saul, L. K. (2006). Unsupervised learning of image manifolds by semidefinite programming. International Journal of Computer Vision, 70(1), 77–90.CrossRef Weinberger, K. Q., & Saul, L. K. (2006). Unsupervised learning of image manifolds by semidefinite programming. International Journal of Computer Vision, 70(1), 77–90.CrossRef
go back to reference Zheng, J., Liu, M. Y., Chellappa, R., & Phillips, P. J. (2012) A grassmann manifold-based domain adaptation approach. In ICPR. Zheng, J., Liu, M. Y., Chellappa, R., & Phillips, P. J. (2012) A grassmann manifold-based domain adaptation approach. In ICPR.
Metadata
Title
Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition
Authors
Boqing Gong
Kristen Grauman
Fei Sha
Publication date
01-08-2014
Publisher
Springer US
Published in
International Journal of Computer Vision / Issue 1-2/2014
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-014-0718-4

Other articles of this Issue 1-2/2014

International Journal of Computer Vision 1-2/2014 Go to the issue

Premium Partner