Skip to main content
Top

2017 | OriginalPaper | Chapter

Theoretical Analysis of Domain Adaptation with Optimal Transport

Authors : Ievgen Redko, Amaury Habrard, Marc Sebban

Published in: Machine Learning and Knowledge Discovery in Databases

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Domain adaptation (DA) is an important and emerging field of machine learning that tackles the problem occurring when the distributions of training (source domain) and test (target domain) data are similar but different. This kind of learning paradigm is of vital importance for future advances as it allows a learner to generalize the knowledge across different tasks. Current theoretical results show that the efficiency of DA algorithms depends on their capacity of minimizing the divergence between source and target probability distributions. In this paper, we provide a theoretical study on the advantages that concepts borrowed from optimal transportation theory [17] can bring to DA. In particular, we show that the Wasserstein metric can be used as a divergence measure between distributions to obtain generalization guarantees for three different learning settings: (i) classic DA with unsupervised target data (ii) DA combining source and target labeled data, (iii) multiple source DA. Based on the obtained results, we motivate the use of the regularized optimal transport and provide some algorithmic insights for multi-source domain adaptation. We also show when this theoretical analysis can lead to tighter inequalities than those of other existing frameworks. We believe that these results open the door to novel ideas and directions for DA.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
If \(h,f \in \mathcal {H}\) then \(h-f \in \mathcal {H}\) implying that \(l(h(x),f(x)) = \vert h(x) - f(x) \vert ^q\) is a nonlinear transform for \(h-f \in \mathcal {H}\).
 
2
For the sake of simplicity, we will further write \(\mathcal {H}\) meaning \(\mathcal {H}_{k_l}\) and l meaning \(l_{f,h}\).
 
3
We present the original version of this Theorem in the Supplementary material.
 
Literature
2.
go back to reference Ben-David, Sh., Blitzer, J., Crammer, K., Kulesza, A., Pereira, F., Vaughan, J.: A theory of learning from different domains. Mach. Learn. 79, 151–175 (2010) Ben-David, Sh., Blitzer, J., Crammer, K., Kulesza, A., Pereira, F., Vaughan, J.: A theory of learning from different domains. Mach. Learn. 79, 151–175 (2010)
3.
go back to reference Ben-David, Sh., Blitzer, J., Crammer, K., Pereira, O.: Analysis of representations for domain adaptation. In: NIPS (2007) Ben-David, Sh., Blitzer, J., Crammer, K., Pereira, O.: Analysis of representations for domain adaptation. In: NIPS (2007)
4.
go back to reference Bolley, Fr., Guillin, Ar., Villani, C.: Quantitative concentration inequalities for empirical measures on non-compact spaces. Prob. Theory Relat. Fields 137(3–4), 541–593 (2007) Bolley, Fr., Guillin, Ar., Villani, C.: Quantitative concentration inequalities for empirical measures on non-compact spaces. Prob. Theory Relat. Fields 137(3–4), 541–593 (2007)
5.
go back to reference Cortes, C., Mohri, M.: Domain adaptation and sample bias correction theory and algorithm for regression. Theoret. Comput. Sci. 519, 103–126 (2014)MathSciNetCrossRefMATH Cortes, C., Mohri, M.: Domain adaptation and sample bias correction theory and algorithm for regression. Theoret. Comput. Sci. 519, 103–126 (2014)MathSciNetCrossRefMATH
7.
go back to reference Courty, N., Flamary, R., Tuia, D., Rakotomamonjy, A.: Optimal transport for domain adaptation. IEEE Trans. Pattern Anal. Mach. Intell. 39(9), 1853–1865 (2017)CrossRef Courty, N., Flamary, R., Tuia, D., Rakotomamonjy, A.: Optimal transport for domain adaptation. IEEE Trans. Pattern Anal. Mach. Intell. 39(9), 1853–1865 (2017)CrossRef
8.
go back to reference Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: NIPS, pp. 2292–2300 (2013) Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: NIPS, pp. 2292–2300 (2013)
9.
go back to reference Cuturi, M., Doucet, A.: Fast computation of Wasserstein barycenters. In: ICML, pp. 685–693 (2014) Cuturi, M., Doucet, A.: Fast computation of Wasserstein barycenters. In: ICML, pp. 685–693 (2014)
10.
go back to reference Ding, Y.: Wasserstein-divergence transportation inequalities and polynomial concentration inequalities. Stat. Probab. Lett. 94(C), 77–85 (2014)MathSciNetCrossRefMATH Ding, Y.: Wasserstein-divergence transportation inequalities and polynomial concentration inequalities. Stat. Probab. Lett. 94(C), 77–85 (2014)MathSciNetCrossRefMATH
11.
go back to reference Fournier, N., Guillin, A.: On the rate of convergence in Wasserstein distance of the empirical measure. Probab. Theory Relat. Fields 162(3–4), 707 (2015)MathSciNetCrossRefMATH Fournier, N., Guillin, A.: On the rate of convergence in Wasserstein distance of the empirical measure. Probab. Theory Relat. Fields 162(3–4), 707 (2015)MathSciNetCrossRefMATH
12.
go back to reference Bergounioux, M., Abraham, I., Abraham, R., Carlier, G.: Tomographic reconstruction from a few views: a multi-marginal optimal transport approach. Appl. Math. Optim. 75(1), 1–19 (2016)MathSciNetMATH Bergounioux, M., Abraham, I., Abraham, R., Carlier, G.: Tomographic reconstruction from a few views: a multi-marginal optimal transport approach. Appl. Math. Optim. 75(1), 1–19 (2016)MathSciNetMATH
13.
go back to reference Kantorovich, L.: On the translocation of masses. C.R. (Doklady) Acad. Sci. URSS (N.S.) 37(10), 199–201 (1942)MathSciNetMATH Kantorovich, L.: On the translocation of masses. C.R. (Doklady) Acad. Sci. URSS (N.S.) 37(10), 199–201 (1942)MathSciNetMATH
14.
go back to reference Knott, M., Smith, C.S.: On a generalization of cyclic-monotonicity and distances among random vectors. Linear Algebra Appl. 199, 363–371 (1994)MathSciNetCrossRefMATH Knott, M., Smith, C.S.: On a generalization of cyclic-monotonicity and distances among random vectors. Linear Algebra Appl. 199, 363–371 (1994)MathSciNetCrossRefMATH
15.
go back to reference Mansour, Y., Mohri, M., Rostamizadeh, A.: Domain adaptation: learning bounds and algorithms. In: COLT (2009) Mansour, Y., Mohri, M., Rostamizadeh, A.: Domain adaptation: learning bounds and algorithms. In: COLT (2009)
16.
go back to reference Mansour, Y., Mohri, M., Rostamizadeh, A.: Multiple source adaptation and the rényi divergence. In: UAI, pp. 367–374 (2009) Mansour, Y., Mohri, M., Rostamizadeh, A.: Multiple source adaptation and the rényi divergence. In: UAI, pp. 367–374 (2009)
17.
go back to reference Monge, G.: Mémoire sur la théorie des déblais et des remblais. In: Histoire de l’Académie Royale des Sciences, pp. 666–704 (1781) Monge, G.: Mémoire sur la théorie des déblais et des remblais. In: Histoire de l’Académie Royale des Sciences, pp. 666–704 (1781)
18.
go back to reference Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef
19.
go back to reference Pass, B.: Uniqueness and monge solutions in the multimarginal optimal transportation problem. SIAM J. Math. Anal. 43(6), 2758–2775 (2011)MathSciNetCrossRefMATH Pass, B.: Uniqueness and monge solutions in the multimarginal optimal transportation problem. SIAM J. Math. Anal. 43(6), 2758–2775 (2011)MathSciNetCrossRefMATH
20.
go back to reference Pinsker, M.S.: Information and Information Stability of Random Variables and Processes. Holden-Day, San Francisco (1964)MATH Pinsker, M.S.: Information and Information Stability of Random Variables and Processes. Holden-Day, San Francisco (1964)MATH
22.
go back to reference Rubner, Y., Tomasi, C., Guibas, L.J.: The earth mover’s distance as a metric for image retrieval. Int. J. Comput. Vis. 40(2), 99–121 (2000)CrossRefMATH Rubner, Y., Tomasi, C., Guibas, L.J.: The earth mover’s distance as a metric for image retrieval. Int. J. Comput. Vis. 40(2), 99–121 (2000)CrossRefMATH
23.
go back to reference Saitoh, S.: Integral Transforms, Reproducing Kernels and their Applications. Pitman Research Notes in Mathematics Series (1997) Saitoh, S.: Integral Transforms, Reproducing Kernels and their Applications. Pitman Research Notes in Mathematics Series (1997)
24.
go back to reference Sejdinovic, D., Sriperumbudur, B.K., Gretton, A., Fukumizu, K.: Equivalence of distance-based and RKHS-based statistics in hypothesis testing. Ann. Stat. 41(5), 2263–2291 (2013)MathSciNetCrossRefMATH Sejdinovic, D., Sriperumbudur, B.K., Gretton, A., Fukumizu, K.: Equivalence of distance-based and RKHS-based statistics in hypothesis testing. Ann. Stat. 41(5), 2263–2291 (2013)MathSciNetCrossRefMATH
25.
27.
go back to reference Zhang, C., Zhang, L., Ye, J.: Generalization bounds for domain adaptation. In: NIPS (2012) Zhang, C., Zhang, L., Ye, J.: Generalization bounds for domain adaptation. In: NIPS (2012)
28.
go back to reference Perrot, M., Courty, N., Flamary, R., Habrard, A.: Mapping estimation for discrete optimal transport. In: NIPS, pp. 4197–4205 (2016) Perrot, M., Courty, N., Flamary, R., Habrard, A.: Mapping estimation for discrete optimal transport. In: NIPS, pp. 4197–4205 (2016)
Metadata
Title
Theoretical Analysis of Domain Adaptation with Optimal Transport
Authors
Ievgen Redko
Amaury Habrard
Marc Sebban
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-71246-8_45

Premium Partner