Skip to main content
Erschienen in: International Journal of Computer Vision 1-2/2014

01.08.2014

Asymmetric and Category Invariant Feature Transformations for Domain Adaptation

verfasst von: Judy Hoffman, Erik Rodner, Jeff Donahue, Brian Kulis, Kate Saenko

Erschienen in: International Journal of Computer Vision | Ausgabe 1-2/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

-1We address the problem of visual domain adaptation for transferring object models from one dataset or visual domain to another. We introduce a unified flexible model for both supervised and semi-supervised learning that allows us to learn transformations between domains. Additionally, we present two instantiations of the model, one for general feature adaptation/alignment, and one specifically designed for classification. First, we show how to extend metric learning methods for domain adaptation, allowing for learning metrics independent of the domain shift and the final classifier used. Furthermore, we go beyond classical metric learning by extending the method to asymmetric, category independent transformations. Our framework can adapt features even when the target domain does not have any labeled examples for some categories, and when the target and source features have different dimensions. Finally, we develop a joint learning framework for adaptive classifiers, which outperforms competing methods in terms of multi-class accuracy and scalability. We demonstrate the ability of our approach to adapt object recognition models under a variety of situations, such as differing imaging conditions, feature types, and codebooks. The experiments show its strong performance compared to previous approaches and its applicability to large-scale scenarios.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
Note that in general we could equally optimize a second loss function between the source and target data which considers instance level constraints. However, to distinguish ourselves from prior work which focused on learning a metric requiring instance constraints, we present our algorithms assuming only category level information to demonstrate the effectiveness of using only this coarser level of supervision.
 
2
Note that we present this result for the specific case of using the Frobenius norm regularizer, though in fact our analysis holds for the class of regularizers \(r({\varvec{W}})\) that can be written in terms of the singular values of \({\varvec{W}}\); that is, if \(\sigma _1, \ldots , \sigma _p\) are the singular values of \({\varvec{W}}\), then \(r({\varvec{W}})\) is of the form \(\sum _{j=1}^p r_j(\sigma _j)\) for some scalar functions \(r_j\), which is globally minimized by zero. For example, the squared Frobenius norm \(r({\varvec{W}}) = \frac{1}{2} \Vert {\varvec{W}}\Vert _F^2\) is a special case where \(r_j(\sigma _j) = \frac{1}{2} \sigma _j^2\).
 
3
The assumption that the kernel matrices are strictly positive definite is not a severe limitation. For the Gaussian RBF kernel, strict positive definiteness can always be assured and for other kernel functions, the matrices can be regularized by adding a scaled identity matrix.
 
Literatur
Zurück zum Zitat Argyriou, A., Micchelli, C. A., & Pontil, M. (2010). On spectral learning. Journal of Machine Learning Research, 11, 935–953.MATHMathSciNet Argyriou, A., Micchelli, C. A., & Pontil, M. (2010). On spectral learning. Journal of Machine Learning Research, 11, 935–953.MATHMathSciNet
Zurück zum Zitat Aytar, Y., & Zisserman, A. (2011). Tabula rasa: Model transfer for object category detection. In Proceedings of the international conference on computer vision (ICCV) (pp. 2252–2259). Aytar, Y., & Zisserman, A. (2011). Tabula rasa: Model transfer for object category detection. In Proceedings of the international conference on computer vision (ICCV) (pp. 2252–2259).
Zurück zum Zitat Ben-david, S., Blitzer, J., Crammer, K., & Pereira, O. (2007). Analysis of representations for domain adaptation. In Advances in neural information processing systems (NIPS) ( pp. 137–145). Cambridge: MIT Press. Ben-david, S., Blitzer, J., Crammer, K., & Pereira, O. (2007). Analysis of representations for domain adaptation. In Advances in neural information processing systems (NIPS) ( pp. 137–145). Cambridge: MIT Press.
Zurück zum Zitat Bergamo, A., & Torresani, L. (2010). Exploiting weakly-labeled web images to improve object classification: A domain adaptation approach. In Advances in neural information processing systems (NIPS) (pp. 181–189). Bergamo, A., & Torresani, L. (2010). Exploiting weakly-labeled web images to improve object classification: A domain adaptation approach. In Advances in neural information processing systems (NIPS) (pp. 181–189).
Zurück zum Zitat Blitzer, J., Dredze, M., & Pereira, F. (2007). Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. ACL, 7, 440–447. Blitzer, J., Dredze, M., & Pereira, F. (2007). Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. ACL, 7, 440–447.
Zurück zum Zitat Chopra, S., Balakrishnan, S., & Gopalan, R. (2013). Dlid: Deep learning for domain adaptation by interpolating between domains. In ICML workshop on challenges in representation learning. Chopra, S., Balakrishnan, S., & Gopalan, R. (2013). Dlid: Deep learning for domain adaptation by interpolating between domains. In ICML workshop on challenges in representation learning.
Zurück zum Zitat Dai, W., Chen, Y., Xue, G., Yang, Q., & Yu, Y. (2008). Translated learning: Transfer learning across different feature spaces. In Advances in neural information processing systems (NIPS) (pp. 353–360). Dai, W., Chen, Y., Xue, G., Yang, Q., & Yu, Y. (2008). Translated learning: Transfer learning across different feature spaces. In Advances in neural information processing systems (NIPS) (pp. 353–360).
Zurück zum Zitat Daume III, H. (2007). Frustratingly easy domain adaptation. In ACL (pp. 256–263). Daume III, H. (2007). Frustratingly easy domain adaptation. In ACL (pp. 256–263).
Zurück zum Zitat Davis, J., Kulis, B., Jain, P., Sra, S., & Dhillon, I. (2007). Information-theoretic metric learning. In Proceedings of the international conference on Machine learning (ICML) (pp. 209–216) . Davis, J., Kulis, B., Jain, P., Sra, S., & Dhillon, I. (2007). Information-theoretic metric learning. In Proceedings of the international conference on Machine learning (ICML) (pp. 209–216) .
Zurück zum Zitat Diethe, T., Hardoon, D. R., & Shawe-Taylor, J. (2010). Constructing nonlinear discriminants from multiple data views. In Machine learning and knowledge discovery in databases (pp. 328–343) Berlin: Springer. Diethe, T., Hardoon, D. R., & Shawe-Taylor, J. (2010). Constructing nonlinear discriminants from multiple data views. In Machine learning and knowledge discovery in databases (pp. 328–343) Berlin: Springer.
Zurück zum Zitat Donahue, J., Hoffman, J., Rodner, E., Saenko, K., & Darrell, T. (2013). Semi-supervised domain adaptation with instance constraints. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 668–675). Donahue, J., Hoffman, J., Rodner, E., Saenko, K., & Darrell, T. (2013). Semi-supervised domain adaptation with instance constraints. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 668–675).
Zurück zum Zitat Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., et al. (2014). Decaf: A deep convolutional activation feature for generic visual recognition. In International conference in machine learning (ICML). Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., et al. (2014). Decaf: A deep convolutional activation feature for generic visual recognition. In International conference in machine learning (ICML).
Zurück zum Zitat Duan, L., Tsang, I. W., Xu, D., & Maybank, S. J. (2009). Domain transfer svm for video concept detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1375–1381). Duan, L., Tsang, I. W., Xu, D., & Maybank, S. J. (2009). Domain transfer svm for video concept detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1375–1381).
Zurück zum Zitat Duan, L., Xu, D., & Tsang, I. W. (2012a). Learning with augmented features for heterogeneous domain adaptation. In Proceedings of the international conference on machine learning (pp. 711–718). Duan, L., Xu, D., & Tsang, I. W. (2012a). Learning with augmented features for heterogeneous domain adaptation. In Proceedings of the international conference on machine learning (pp. 711–718).
Zurück zum Zitat Duan, L., Xu, D., Tsang, I. W. H., & Luo, J. (2012b). Visual event recognition in videos by learning from web data. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(9), 1667–1680.CrossRef Duan, L., Xu, D., Tsang, I. W. H., & Luo, J. (2012b). Visual event recognition in videos by learning from web data. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(9), 1667–1680.CrossRef
Zurück zum Zitat Farhadi, A., & Tabrizi, M. K. (2008). Learning to recognize activities from the wrong view point. In Proceedings of the European conference on computer vision (ECCV) (pp. 154–166). Farhadi, A., & Tabrizi, M. K. (2008). Learning to recognize activities from the wrong view point. In Proceedings of the European conference on computer vision (ECCV) (pp. 154–166).
Zurück zum Zitat Farquhar, J., Hardoon, D., Meng, H., Shawe-taylor, J. S., & Szedmak, S. (2005). Two view learning: Svm-2k, theory and practice. In Advances in neural information processing systems (NIPS) (pp. 355–362). Farquhar, J., Hardoon, D., Meng, H., Shawe-taylor, J. S., & Szedmak, S. (2005). Two view learning: Svm-2k, theory and practice. In Advances in neural information processing systems (NIPS) (pp. 355–362).
Zurück zum Zitat Gong, B., Shi, Y., Sha, F., & Grauman, K. (2012). Geodesic flow kernel for unsupervised domain adaptation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2066–2073). Gong, B., Shi, Y., Sha, F., & Grauman, K. (2012). Geodesic flow kernel for unsupervised domain adaptation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2066–2073).
Zurück zum Zitat Gopalan, R., Li, R., & Chellappa, R. (2011). Domain adaptation for object recognition: An unsupervised approach. In Proceedings of the international conference on computer vision (ICCV) (pp. 999–1006). Gopalan, R., Li, R., & Chellappa, R. (2011). Domain adaptation for object recognition: An unsupervised approach. In Proceedings of the international conference on computer vision (ICCV) (pp. 999–1006).
Zurück zum Zitat Hoffman, J., Rodner, E., Donahue, J., Saenko, K., & Darrell, T. (2013). Efficient learning of domain-invariant image representations. In International conference on learning representations (ICLR). http://arxiv.org/abs/1301.3224 Hoffman, J., Rodner, E., Donahue, J., Saenko, K., & Darrell, T. (2013). Efficient learning of domain-invariant image representations. In International conference on learning representations (ICLR). http://​arxiv.​org/​abs/​1301.​3224
Zurück zum Zitat Jhuo, I. H., Liu, D., Chang, S. F., & Lee, D. T. (2012). Robust visual domain adaptation with low-rank reconstruction. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2168–2175). Jhuo, I. H., Liu, D., Chang, S. F., & Lee, D. T. (2012). Robust visual domain adaptation with low-rank reconstruction. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2168–2175).
Zurück zum Zitat Jiang, J., & Zhai, C. X. (2007). Instance weighting for domain adaptation in NLP. In ACL (pp. 264–271). Jiang, J., & Zhai, C. X. (2007). Instance weighting for domain adaptation in NLP. In ACL (pp. 264–271).
Zurück zum Zitat Jiang, W., Zavesky, E., Chang, S., & Loui, A. (2008). Cross-domain learning methods for high-level visual concept classification. In International conference on image processing (ICIP) (pp. 161–164). Jiang, W., Zavesky, E., Chang, S., & Loui, A. (2008). Cross-domain learning methods for high-level visual concept classification. In International conference on image processing (ICIP) (pp. 161–164).
Zurück zum Zitat Kan, M., Shan, S., Zhang, H., Lao, S., & Chen, X. (2012). Multi-view discriminant analysis. In Proceedings of the European computer vision conference (ECCV) (pp. 808–821). Berlin: Springer. Kan, M., Shan, S., Zhang, H., Lao, S., & Chen, X. (2012). Multi-view discriminant analysis. In Proceedings of the European computer vision conference (ECCV) (pp. 808–821). Berlin: Springer.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems.
Zurück zum Zitat Kulis, B., Jain, P., & Grauman, K. (2009). Fast similarity search for learned metrics. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(12), 2143–2157.CrossRef Kulis, B., Jain, P., & Grauman, K. (2009). Fast similarity search for learned metrics. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(12), 2143–2157.CrossRef
Zurück zum Zitat Kulis, B., Saenko, K., & Darrell, T. (2011). What you saw is not what you get: Domain adaptation using asymmetric kernel transforms. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1785–1792). Kulis, B., Saenko, K., & Darrell, T. (2011). What you saw is not what you get: Domain adaptation using asymmetric kernel transforms. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1785–1792).
Zurück zum Zitat Li, R., & Zickler, T. (2012). Discriminative virtual views for cross-view action recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2855–2862). Li, R., & Zickler, T. (2012). Discriminative virtual views for cross-view action recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2855–2862).
Zurück zum Zitat Li, X. (2007). Regularized adaptation: Theory, algorithms and applications. Ph.D. thesis, USA: University of Washington Li, X. (2007). Regularized adaptation: Theory, algorithms and applications. Ph.D. thesis, USA: University of Washington
Zurück zum Zitat Quadrianto, N., & Lampert, C. H. (2011). Learning multi-view neighborhood preserving projections. In Proceedings of the International Conference on Machine Learning (ICML) (pp. 425–432). Quadrianto, N., & Lampert, C. H. (2011). Learning multi-view neighborhood preserving projections. In Proceedings of the International Conference on Machine Learning (ICML) (pp. 425–432).
Zurück zum Zitat Rodner, E., Hoffman, J., Donahue, J., Darrell, T., Saenko, K. (2013). Towards adapting imagenet to reality: Scalable domain adaptation with implicit low-rank transformations. arXiv:1308.4200 (preprint). Rodner, E., Hoffman, J., Donahue, J., Darrell, T., Saenko, K. (2013). Towards adapting imagenet to reality: Scalable domain adaptation with implicit low-rank transformations. arXiv:​1308.​4200 (preprint).
Zurück zum Zitat Saenko, K., Kulis, B., Fritz, M., & Darrell, T. (2010). Adapting visual category models to new domains. In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 213–226). Saenko, K., Kulis, B., Fritz, M., & Darrell, T. (2010). Adapting visual category models to new domains. In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 213–226).
Zurück zum Zitat Sharma, A., Kumar, A., Daume, H., & Jacobs, D. W. (2012). Generalized multiview analysis: A discriminative latent space. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2160–2167). Sharma, A., Kumar, A., Daume, H., & Jacobs, D. W. (2012). Generalized multiview analysis: A discriminative latent space. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2160–2167).
Zurück zum Zitat Torralba, A., & Efros, A. (2011). Unbiased look at dataset bias. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1521–1528). Torralba, A., & Efros, A. (2011). Unbiased look at dataset bias. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1521–1528).
Zurück zum Zitat Yang, J., Yan, R., & Hauptmann, A. G. (2007). Cross-domain video concept detection using adaptive svms. In ACM Multimedia (pp 188–197). Yang, J., Yan, R., & Hauptmann, A. G. (2007). Cross-domain video concept detection using adaptive svms. In ACM Multimedia (pp 188–197).
Metadaten
Titel
Asymmetric and Category Invariant Feature Transformations for Domain Adaptation
verfasst von
Judy Hoffman
Erik Rodner
Jeff Donahue
Brian Kulis
Kate Saenko
Publikationsdatum
01.08.2014
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 1-2/2014
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-014-0719-3

Weitere Artikel der Ausgabe 1-2/2014

International Journal of Computer Vision 1-2/2014 Zur Ausgabe