Skip to main content
Top

2021 | OriginalPaper | Chapter

CycleGAN Through the Lens of (Dynamical) Optimal Transport

Authors : Emmanuel de Bézenac, Ibrahim Ayed, Patrick Gallinari

Published in: Machine Learning and Knowledge Discovery in Databases. Research Track

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Unsupervised Domain Translation (UDT) is the problem of finding a meaningful correspondence between two given domains, without explicit pairings between elements of the domains. Following the seminal CycleGAN model, variants and extensions have been used successfully for a wide range of applications. However, although there have been some attempts, they remain poorly understood, and lack theoretical guarantees. In this work, we explore the implicit biases present in current approaches and demonstrate where and why they fail. By expliciting these biases, we show that UDT can be reframed as an Optimal Transport (OT) problem. Using the dynamical formulation of Optimal Transport, this allows us to improve the CycleGAN model into a simple and practical formulation which comes with theoretical guarantees and added robustness. Finally, we show how our improved model behaves on the CelebA dataset in a standard then in a more challenging setting, thus paving the way for new applications of UDT. Supplementary material is available at https://​arxiv.​org/​pdf/​1906.​01292.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The push-forward measure \(f_\sharp \rho \) is defined as \({f_\sharp \rho (B) = \rho (f^{-1}(B))}\), for any measurable set B. Said otherwise, we need T to map \({\alpha }\) to \({\beta }\) and S does the reverse.
 
2
A larger family of costs can be considered at the expense of some technicalities, see [13].
 
3
Which was pioneered in [5] and for which a detailed modern presentation is given in chapters 4 and 5 of [22].
 
4
Other schemes could be used, which would lead to other architectures, and could arguably be more suited for stability reasons but this is beyond the scope of this work.
 
Literature
1.
go back to reference Arora, S., Cohen, N., Hu, W., Luo, Y.: Implicit regularization in deep matrix factorization (2019) Arora, S., Cohen, N., Hu, W., Luo, Y.: Implicit regularization in deep matrix factorization (2019)
4.
go back to reference Benaim, S., Wolf, L.: One-sided unsupervised domain mapping (2017) Benaim, S., Wolf, L.: One-sided unsupervised domain mapping (2017)
6.
go back to reference Chen, T.Q., Rubanova, Y., Bettencourt, J., Duvenaud, D.K.: Neural ordinary differential equations. CoRR abs/1806.07366 (2018) Chen, T.Q., Rubanova, Y., Bettencourt, J., Duvenaud, D.K.: Neural ordinary differential equations. CoRR abs/1806.07366 (2018)
7.
go back to reference Choi, Y., Choi, M., Kim, M., Ha, J., Kim, S., Choo, J.: StarGAN: unified generative adversarial networks for multi-domain image-to-image translation (2018) Choi, Y., Choi, M., Kim, M., Ha, J., Kim, S., Choo, J.: StarGAN: unified generative adversarial networks for multi-domain image-to-image translation (2018)
8.
go back to reference Chung, Y., Weng, W., Tong, S., Glass, J.: Unsupervised cross-modal alignment of speech and text embedding spaces (2018) Chung, Y., Weng, W., Tong, S., Glass, J.: Unsupervised cross-modal alignment of speech and text embedding spaces (2018)
9.
go back to reference Courty, N., Flamary, R., Tuia, D., Rakotomamonjy, A.: Optimal transport for domain adaptation. CoRR abs/1507.00504 (2015) Courty, N., Flamary, R., Tuia, D., Rakotomamonjy, A.: Optimal transport for domain adaptation. CoRR abs/1507.00504 (2015)
10.
go back to reference Damodaran, B.B., Kellenberger, B., Flamary, R., Tuia, D., Courty, N.: DeepJDOT: deep joint distribution optimal transport for unsupervised domain adaptation (2018) Damodaran, B.B., Kellenberger, B., Flamary, R., Tuia, D., Courty, N.: DeepJDOT: deep joint distribution optimal transport for unsupervised domain adaptation (2018)
11.
go back to reference Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.S.: Fairness through awareness (2012) Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.S.: Fairness through awareness (2012)
12.
go back to reference Felix, R., Kumar, V.B.G., Reid, I., Carneiro, G.: Multi-modal cycle-consistent generalized zero-shot learning (2018) Felix, R., Kumar, V.B.G., Reid, I., Carneiro, G.: Multi-modal cycle-consistent generalized zero-shot learning (2018)
14.
go back to reference Galanti, T., Wolf, L., Benaim, S.: The role of minimal complexity functions in unsupervised learning of semantic mappings (2018) Galanti, T., Wolf, L., Benaim, S.: The role of minimal complexity functions in unsupervised learning of semantic mappings (2018)
15.
go back to reference Gong, R., Li, W., Chen, Y., Gool, L.V.: DLOW: domain flow for adaptation and generalization (2019) Gong, R., Li, W., Chen, Y., Gool, L.V.: DLOW: domain flow for adaptation and generalization (2019)
16.
go back to reference Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. arXiv (2016) Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. arXiv (2016)
17.
go back to reference Ji, Z., Telgarsky, M.: Gradient descent aligns the layers of deep linear networks (2019) Ji, Z., Telgarsky, M.: Gradient descent aligns the layers of deep linear networks (2019)
18.
go back to reference Lample, G., Conneau, A., Denoyer, L., Ranzato, M.: Unsupervised machine translation using monolingual corpora only (2018) Lample, G., Conneau, A., Denoyer, L., Ranzato, M.: Unsupervised machine translation using monolingual corpora only (2018)
19.
go back to reference Lu, G., Zhou, Z., Song, Y., Ren, K., Yu, Y.: Guiding the one-to-one mapping in cycleGAN via optimal transport (2018) Lu, G., Zhou, Z., Song, Y., Ren, K., Yu, Y.: Guiding the one-to-one mapping in cycleGAN via optimal transport (2018)
20.
go back to reference Ma, X., Trudinger, N., Wang, X.: Regularity of potential functions of the optimal transportation problem. Archive for Rational Mechanics and Analysis (2005) Ma, X., Trudinger, N., Wang, X.: Regularity of potential functions of the optimal transportation problem. Archive for Rational Mechanics and Analysis (2005)
21.
go back to reference Moriakov, N., Adler, J., Teuwen, J.: Kernel of cycleGAN as a principle homogeneous space (2020) Moriakov, N., Adler, J., Teuwen, J.: Kernel of cycleGAN as a principle homogeneous space (2020)
23.
go back to reference Soudry, D., Hoffer, E., Nacson, M.S., Srebro, N.: The implicit bias of gradient descent on separable data. J. Mach. Learn. Res. 19(1), 2822–2878 (2018)MathSciNetMATH Soudry, D., Hoffer, E., Nacson, M.S., Srebro, N.: The implicit bias of gradient descent on separable data. J. Mach. Learn. Res. 19(1), 2822–2878 (2018)MathSciNetMATH
24.
go back to reference Tripathy, S., Kannala, J., Rahtu, E.: Learning image-to-image translation using paired and unpaired training samples (2018) Tripathy, S., Kannala, J., Rahtu, E.: Learning image-to-image translation using paired and unpaired training samples (2018)
26.
go back to reference Weinan, E.: A proposal on machine learning via dynamical systems. Commun. Math. Stat. 5, 1–11 (2017)MathSciNetMATH Weinan, E.: A proposal on machine learning via dynamical systems. Commun. Math. Stat. 5, 1–11 (2017)MathSciNetMATH
27.
go back to reference Yang, C., Kim, T., Wang, R., Peng, H., Kuo, C.J.: ESTHER: extremely simple image translation through self-regularization (2018) Yang, C., Kim, T., Wang, R., Peng, H., Kuo, C.J.: ESTHER: extremely simple image translation through self-regularization (2018)
28.
go back to reference Yuan, Y., Liu, S., Zhang, J., Zhang, Y., Dong, C., Lin, L.: Unsupervised image super-resolution using cycle-in-cycle generative adversarial networks (2018) Yuan, Y., Liu, S., Zhang, J., Zhang, Y., Dong, C., Lin, L.: Unsupervised image super-resolution using cycle-in-cycle generative adversarial networks (2018)
29.
go back to reference Zhang, C., Bengio, S., Hardt, M., Recht, B., Vinyals, O.: Understanding deep learning requires rethinking generalization (2017) Zhang, C., Bengio, S., Hardt, M., Recht, B., Vinyals, O.: Understanding deep learning requires rethinking generalization (2017)
30.
go back to reference Zhang, H., et al.: StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks (2016) Zhang, H., et al.: StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks (2016)
Metadata
Title
CycleGAN Through the Lens of (Dynamical) Optimal Transport
Authors
Emmanuel de Bézenac
Ibrahim Ayed
Patrick Gallinari
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-86520-7_9

Premium Partner