International Journal of Computer Vision

Published: 01.08.2014

Asymmetric and Category Invariant Feature Transformations for Domain Adaptation

Authors: Judy Hoffman, Erik Rodner, Jeff Donahue, Brian Kulis, Kate Saenko

Published in: International Journal of Computer Vision | Issue 1-2/2014

Abstract

We address the problem of visual domain adaptation for transferring object models from one dataset or visual domain to another. We introduce a unified, flexible model for both supervised and semi-supervised learning that allows us to learn transformations between domains. Additionally, we present two instantiations of the model, one for general feature adaptation/alignment, and one specifically designed for classification. First, we show how to extend metric learning methods for domain adaptation, allowing for learning metrics independent of the domain shift and the final classifier used. Furthermore, we go beyond classical metric learning by extending the method to asymmetric, category-independent transformations. Our framework can adapt features even when the target domain does not have any labeled examples for some categories, and when the target and source features have different dimensions. Finally, we develop a joint learning framework for adaptive classifiers, which outperforms competing methods in terms of multi-class accuracy and scalability. We demonstrate the ability of our approach to adapt object recognition models under a variety of situations, such as differing imaging conditions, feature types, and codebooks. The experiments show its strong performance compared to previous approaches and its applicability to large-scale scenarios.

Note that in general we could equally optimize a second loss function between the source and target data which considers instance level constraints. However, to distinguish ourselves from prior work which focused on learning a metric requiring instance constraints, we present our algorithms assuming only category level information to demonstrate the effectiveness of using only this coarser level of supervision.

Note that we present this result for the specific case of using the Frobenius norm regularizer, though in fact our analysis holds for the class of regularizers \(r({\varvec{W}})\) that can be written in terms of the singular values of \({\varvec{W}}\); that is, if \(\sigma _1, \ldots , \sigma _p\) are the singular values of \({\varvec{W}}\), then \(r({\varvec{W}})\) is of the form \(\sum _{j=1}^p r_j(\sigma _j)\) for some scalar functions \(r_j\), each of which is globally minimized at zero. For example, the squared Frobenius norm \(r({\varvec{W}}) = \frac{1}{2} \Vert {\varvec{W}}\Vert _F^2\) is a special case where \(r_j(\sigma _j) = \frac{1}{2} \sigma _j^2\).
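The Frobenius special case stated above can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper name `spectral_regularizer`, the matrix size, and the use of one shared scalar function for every \(j\) are illustrative choices, not part of the paper. The trace (nuclear) norm, with \(r_j(\sigma _j) = \sigma _j\), is included only as a second well-known member of this class and is not taken from the text.

```python
import numpy as np


def spectral_regularizer(W, r):
    """Evaluate r(W) = sum_j r(sigma_j) over the singular values of W.

    Illustrative helper: the same scalar function r is used for every j.
    """
    sigma = np.linalg.svd(W, compute_uv=False)  # singular values only
    return float(np.sum(r(sigma)))


rng = np.random.default_rng(0)
W = rng.standard_normal((4, 6))  # arbitrary rectangular transformation

# Squared Frobenius norm written directly and via singular values.
frobenius_direct = 0.5 * np.linalg.norm(W, "fro") ** 2
frobenius_spectral = spectral_regularizer(W, lambda s: 0.5 * s ** 2)

# Trace (nuclear) norm as another regularizer of the same form (assumption:
# chosen here for illustration only).
nuclear_direct = np.linalg.norm(W, "nuc")
nuclear_spectral = spectral_regularizer(W, lambda s: s)

print(np.isclose(frobenius_direct, frobenius_spectral))
print(np.isclose(nuclear_direct, nuclear_spectral))
```

Both comparisons should agree up to floating-point error, since each norm depends on \({\varvec{W}}\) only through its singular values.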

The assumption that the kernel matrices are strictly positive definite is not a severe limitation. For the Gaussian RBF kernel, strict positive definiteness can always be assured and for other kernel functions, the matrices can be regularized by adding a scaled identity matrix.
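The regularization mentioned above, adding a scaled identity matrix to a kernel matrix, can be sketched as follows. This is a hedged illustration assuming NumPy; the function name `regularize_kernel`, the value of `eps`, and the rank-deficient linear kernel used as input are assumptions made for the example and do not come from the paper.

```python
import numpy as np


def regularize_kernel(K, eps=1e-3):
    """Return K + eps * I, which shifts every eigenvalue of K up by eps."""
    return K + eps * np.eye(K.shape[0])


rng = np.random.default_rng(0)
X = rng.standard_normal((5, 2))  # 5 points in 2 dimensions

# A linear kernel on 5 points in 2-D has rank at most 2, so it is only
# positive semidefinite, not strictly positive definite.
K_lin = X @ X.T
K_reg = regularize_kernel(K_lin, eps=1e-3)

min_eig_before = np.linalg.eigvalsh(K_lin).min()
min_eig_after = np.linalg.eigvalsh(K_reg).min()

# Cholesky factorization succeeds only for strictly positive definite input.
L = np.linalg.cholesky(K_reg)

print(min_eig_before, min_eig_after)
```

The smallest eigenvalue moves from approximately zero to approximately `eps`, so the regularized matrix is strictly positive definite; a smaller `eps` perturbs the kernel less but leaves it closer to singular.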

Title: Asymmetric and Category Invariant Feature Transformations for Domain Adaptation
Authors: Judy Hoffman, Erik Rodner, Jeff Donahue, Brian Kulis, Kate Saenko
Publication date: 01.08.2014
Publisher: Springer US
Published in: International Journal of Computer Vision / Issue 1-2/2014
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI: https://doi.org/10.1007/s11263-014-0719-3
