Top

Neural Computing and Applications

Published in:

05-08-2021 | Original Article

Multidomain image-to-image translation model based on hidden space sharing

Authors: Ding Yuxin, Wang Longfei

Published in: Neural Computing and Applications | Issue 1/2022

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Image-to-image translation translates an image from one domain to another. The goal is to learn the translation relationship between different image domains. Compared with the translation models to be trained using paired training data, CycleGAN has the advantage of learning to translate between domains without paired input–output training examples. However, when using CycleGAN to translate images among multiple domains, the complexity of the model increases nonlinearly with the number of domains. To reduce the model complexity of CycleGAN-based translation models, we assume that there is a hidden space shared by different domains, and this space stores the common features of images. Then, we design a common encoder to learn image features in the hidden space. Based on the hidden space, we propose a translation model that scales linearly with the number of domains. To further improve the common feature representation accuracy, we introduce the adversarial component in the hidden space to learn the common features. We test the proposed models on different datasets, including painting style and season transfer datasets and achieve good results.

previous article Artificial neural network for prediction of thermal conductivity of rGO–metal oxide nanocomposite-based nanofluids

next article A novel nature-inspired maximum power point tracking (MPPT) controller based on ACO-ANN algorithm for photovoltaic (PV) system fed arc welding machines

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Goodfellow IJ, Pouget-Abadie J, Mirza M, et al (2014) Generative adversarial nets, In Proceedings of the International Conference on Neural Information Processing Systems, pp. 2672–2680, Montreal, MIT Press, Canada

Zhu J-Y, Park T, Isola P, et al. (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of International Conference on Computer Vision, Pages 2223–2230, Venice, Italy, IEEE.

Isola P, Zhu J-Y, Zhou T, et al (2017) Image-to-image translation with conditional adversarial networks. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Pages 5967–5976, Hawaii, USA. IEEE.

Gatys LA, Ecker AS, Bethge M (2016) Image style transfer using convolutional neural networks. In the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Pages 2414–2423, Las Vegas, USA.IEEE.

Gatys LA, Ecker AS, Bethge M (2015) A neural algorithm of artistic style. arXiv:1508.06576.

Ulyanov D, Lebedev V, Vedaldi A, et al. (2016) Texture Networks: Feed-forward Synthesis of Textures and Stylized Images. In the Proceedings of International Conference on Machine Learning. Pages 1349–1357, NY, USA. IMLS.

Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In the proc. of IEEE Conference on Computer Vision and Pattern Recognition. Pages 3431–3440, Boston, USA. IEEE.

Xu S, Zhu Q, Wang J (2020) Generative image completion with image-to-image translation. Neural Comput & Applic 32:7333–7345CrossRef

Karatsiolis S, Schizas CN, Petkov N (2020) Modular domain-to-domain translation network. Neural Comput & Applic 32:6779–6791CrossRef

10.

Pathak D, Krahenbuhl P, Donahue J, et al.(2016) Context encoders: Feature learning by inpainting. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Pages 2536–2544, Las Vegas , USA. IEEE.

11.

Zhang R, Isola P, Efros A A (2016) Colorful image colorization. In Proceedings of European Conference on Computer Vision. Pages 649–666, Amsterdam, Netherlands. Springer.

12.

Hertzmann A, Jacobs C E, Oliver N, et al (2001) Image analogies.p. In Proceedings of the ACM SIGGRAPH, Pages 327–340. Los Angeles, USA. ACM.

13.

Choi Y, Choi M, Kim M, et al. (2017) StarGAN: Unified Generative Adversarial Networks forMulti-Domain Image-to-Image Translation. ArXiv e-prints. arXiv:1711.09020v1.

14.

Hsu SY,Yang CY, Huang CC,et al. (2018) SemiStarGAN: Semi-supervised Generative Adversarial Networks for Multi-domain Image-to-Image Translation. Asian Conference on Computer Vision, pages 338–353.

15.

Anoosheh A, Agustsson E, Timofte R, Van Gool L (2018) ComboGAN: unrestrained scalability for image domain translation. IEEE/CVF Con Comput Vis Pattern Recognit Workshops (CVPRW) 2018:896–8967

16.

Jing Y, Yang Y, Feng Z, et al. (2017) Neural Style Transfer: A Review. arXiv:1705.04058.

17.

Salimans T, Goodfellow I, Zaremba W, et al. (2016) Improved techniques for training gans. In Proc. of International Conference on Neural Information Processing Systems. Pages 2234–2242, Barcelona, Spain. MIT Press.

18.

Donahue J, Kr¨ahenb¨uhl P, Darrell T (2016) Adversarial feature learning. arXiv:1605.09782

19.

Dumoulin V, Belghazi I et al. (2016) Adversarially learned inference. arXiv:1606.00704

20.

Szegedy C., Vanhoucke V., Ioffe S., et al. (2016) Rethinking the Inception Architecture for Computer Vision. IEEE Conference on Computer Vision and Pattern Recognition, Pages 1–10, Las Vegas, USA, IEEE.

21.

Efros AA, Freeman WT (2001) Image quilting for texture synthesis and transfer. In Proceedings of the 28th annual conference on Computer graphics and Interactive Techniques. Pages 341–346, Los Angeles, USA. ACM.

22.

Justin J, Alexandre A, Li Fei-Fei (2016) Perceptual losses for real-time style transfer and super-resolution. In the Proceedings of European Conference on Computer Vision. Pages 694–711, Amsterdam, Netherlands. Springer.

23.

Li C, Wand M (2016) Precomputed real-time texture synthesis with markovian generative adversarial networks. In Proc. of European Conference on Computer Vision. Pages 702–716, Amsterdam, Netherlands. Springer.

24.

Liu M-Y, Tuzel O(2016) Coupled generative adversarial networks. In Proc. of International Conference on Neural Information Processing Systems. Pages 469–477, Barcelona, Spain. MIT Press.

25.

Mikołaj B., Danica J. , Michael A., et al. (2018) Demystifying MMD GANs. International Conference on Learning Representations, Pages 1–36, Vancouver, Canada.

26.

Taigman Y, Polyak A, Wolf L (2017) Unsupervised cross-domain image generation. In Proc. 5th International Conference on Learning Representations, Pages 1–6, Toulon, France. ICLR.

Title: Multidomain image-to-image translation model based on hidden space sharing
Authors: Ding Yuxin
Wang Longfei
Publication date: 05-08-2021
Publisher: Springer London
Published in: Neural Computing and Applications / Issue 1/2022
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-021-06386-9

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Other articles of this Issue 1/2022

Automatic identification of the number of clusters in hierarchical clustering

Memristor-based circuit implementation of Competitive Neural Network based on online unsupervised Hebbian learning rule for pattern recognition

A conjugate gradient algorithm based on double parameter scaled Broyden–Fletcher–Goldfarb–Shanno update for optimization problems and image restoration

Deep learning for prediction of depressive symptoms in a large textual dataset

Asset criticality and risk prediction for an effective cybersecurity risk management of cyber-physical system

Context-guided feature enhancement network for automatic check-out

Premium Partner