Skip to main content
Erschienen in: Neural Computing and Applications 9/2020

17.08.2018 | Original Article

ClothingOut: a category-supervised GAN model for clothing segmentation and retrieval

verfasst von: Haijun Zhang, Yanfang Sun, Linlin Liu, Xinghao Wang, Liuwu Li, Wenyin Liu

Erschienen in: Neural Computing and Applications | Ausgabe 9/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a new framework, ClothingOut, which utilizes generative adversarial network (GAN) to generate tiled clothing images automatically. Specifically, we design a novel category-supervised GAN model by learning transformation rules between clothes on wearers and clothes that are tiled. Our method features in adding category attribute to a traditional GAN model. For model training, we built a large-scale dataset containing over 20,000 pairs of wearer images and their corresponding tiled clothing images. The learned model can be straightforwardly applied to video advertising and cross-scenario clothing image retrieval. We evaluated our generated images which can be regarded as the segmentation from the wearer images from two aspects: authenticity and retrieval performance. Experimental results demonstrate the effectiveness of our method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhang H, Cao X, Ho JKL, Chow TWS (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inf 13(2):520–531CrossRef Zhang H, Cao X, Ho JKL, Chow TWS (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inf 13(2):520–531CrossRef
2.
Zurück zum Zitat Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets, In: Advances in neural information processing systems, pp 2672–2680 Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets, In: Advances in neural information processing systems, pp 2672–2680
3.
Zurück zum Zitat Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks,’ arXiv:1511.06434 Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks,’ arXiv:​1511.​06434
5.
Zurück zum Zitat Isola P, Zhu J-Y, Zhou T, Efros AA (2017) Image-to-image translation with conditional adversarial networks. In: CVPR Isola P, Zhu J-Y, Zhou T, Efros AA (2017) Image-to-image translation with conditional adversarial networks. In: CVPR
6.
Zurück zum Zitat Girshick R, Donahue J, Darrell T, Malik J (2014) UC Berkeley and ICSI, Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR Girshick R, Donahue J, Darrell T, Malik J (2014) UC Berkeley and ICSI, Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR
7.
Zurück zum Zitat Uijlings JRR, van de Sande KEA, Gevers T, Smeulders AWM (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef Uijlings JRR, van de Sande KEA, Gevers T, Smeulders AWM (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef
8.
9.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems
10.
Zurück zum Zitat Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: CVPR Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: CVPR
12.
Zurück zum Zitat Wan J, Wang D, Hoi SCH, Wu P, Zhu J, Zhang Y, Li J (2014) Deep learning for content-based image retrieval: a comprehensive study. In: MM ’14 proceedings of the 22nd ACM international conference on multimedia, pp 157–166 Wan J, Wang D, Hoi SCH, Wu P, Zhu J, Zhang Y, Li J (2014) Deep learning for content-based image retrieval: a comprehensive study. In: MM ’14 proceedings of the 22nd ACM international conference on multimedia, pp 157–166
13.
Zurück zum Zitat Veit A, Kovacs B, Bell S, McAuley J, Bala K, Belongie S (2015) Learning visual clothing style with heterogeneous dyadic co-occurrences. In: ICCV Veit A, Kovacs B, Bell S, McAuley J, Bala K, Belongie S (2015) Learning visual clothing style with heterogeneous dyadic co-occurrences. In: ICCV
14.
Zurück zum Zitat Wang X, Shi Y, Kitani KM (2016) Deep supervised hashing with triplet labels. In: Asian conference on computer vision, pp 70–84 Wang X, Shi Y, Kitani KM (2016) Deep supervised hashing with triplet labels. In: Asian conference on computer vision, pp 70–84
15.
Zurück zum Zitat Liu Z, Luo P, Qiu S, Wang X, Tang X (2016) DeepFashion: powering Robust clothes recognition and retrieval with Rich annotations. In: CVPR Liu Z, Luo P, Qiu S, Wang X, Tang X (2016) DeepFashion: powering Robust clothes recognition and retrieval with Rich annotations. In: CVPR
16.
Zurück zum Zitat Lin K, Yang H-F, Hsiao J-H, Cheni C-S (2015) Deep learning of binary hash codes for fast image retrieval. In: CVPR Lin K, Yang H-F, Hsiao J-H, Cheni C-S (2015) Deep learning of binary hash codes for fast image retrieval. In: CVPR
17.
Zurück zum Zitat Zhu S, Fidler S, Urtasun R, Lin D, Loy CC (2017) Be your own prada: fashion synthesis with structural coherence. In: ICCV Zhu S, Fidler S, Urtasun R, Lin D, Loy CC (2017) Be your own prada: fashion synthesis with structural coherence. In: ICCV
18.
Zurück zum Zitat Wan X, Gupta A (2016) Generative image modeling using style and structure adversarial networks. In: European conference on computer vision, pp 318–335 Wan X, Gupta A (2016) Generative image modeling using style and structure adversarial networks. In: European conference on computer vision, pp 318–335
19.
Zurück zum Zitat Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention, pp 234–241 Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention, pp 234–241
20.
Zurück zum Zitat Zeiler Matthew D, Fergus Rob (2014) Visualizing and understanding convolutional networks. In: ECCV 2014: computer vision, pp 818–833 Zeiler Matthew D, Fergus Rob (2014) Visualizing and understanding convolutional networks. In: ECCV 2014: computer vision, pp 818–833
21.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. In: Computer vision and pattern recognition, arXiv:1409.1556v6 Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. In: Computer vision and pattern recognition, arXiv:​1409.​1556v6
22.
Zurück zum Zitat Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) SSD: single shot multibox detector. In: ECCV 2016: computer vision, pp 21–37 Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) SSD: single shot multibox detector. In: ECCV 2016: computer vision, pp 21–37
23.
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: CVPR He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: CVPR
24.
Zurück zum Zitat Zhang H, Chow TWS, Wu QMJ (2016) Organizing books and authors by multilayer SOM sign in or purchase. IEEE Trans Neural Netw Learn Syst 27(12):2537–2550CrossRef Zhang H, Chow TWS, Wu QMJ (2016) Organizing books and authors by multilayer SOM sign in or purchase. IEEE Trans Neural Netw Learn Syst 27(12):2537–2550CrossRef
25.
Zurück zum Zitat Zhang H, Xu T, Li H, Zhang S, Huang X, Wang X, Metaxas D (2016) StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks, arXiv:1612.03242 Zhang H, Xu T, Li H, Zhang S, Huang X, Wang X, Metaxas D (2016) StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks, arXiv:​1612.​03242
Metadaten
Titel
ClothingOut: a category-supervised GAN model for clothing segmentation and retrieval
verfasst von
Haijun Zhang
Yanfang Sun
Linlin Liu
Xinghao Wang
Liuwu Li
Wenyin Liu
Publikationsdatum
17.08.2018
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 9/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3691-y

Weitere Artikel der Ausgabe 9/2020

Neural Computing and Applications 9/2020 Zur Ausgabe