Skip to main content
Top

2016 | OriginalPaper | Chapter

Fast Cross-Scenario Clothing Retrieval Based on Indexing Deep Features

Authors : Zongmin Li, Yante Li, Yongbiao Gao, Yujie Liu

Published in: Advances in Multimedia Information Processing - PCM 2016

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we propose a new approach for large scale daily clothing retrieval. Fast clothing image search in cross scenarios is a challenging task due to the large amount of clothing images on the internet and visual differences between street photos (pictures of people wearing clothing taken in our daily life with complex background) and online shop photos (pictures of clothing items on people, captured by professionals in more controlled settings). We tackle the problem of cross-scenario clothing retrieval through clothing segmentation based on coarse-fine hierarchical superpixel segmentation and pose estimation to remove the background of clothing image and employ deep features representing the clothing item aimed at describing various clothing effectively. In addition, in order to speed up the retrieval process for large scale online clothing images, we adopt inverted indexing on deep feature by regarding deep features as Bag-of-Word model. In this way, we obtain similar clothing items far faster. Experiments demonstrate that our method significantly outperforms state-of-the-art approaches.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Deng, J., Dong, W., Socher, R, Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009) Deng, J., Dong, W., Socher, R, Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009)
2.
go back to reference Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
3.
go back to reference Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: International Conference on Computer Vision, pp. 1470–1477 (2003) Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: International Conference on Computer Vision, pp. 1470–1477 (2003)
4.
go back to reference Yamaguchi, K., Kiapour, M.H., Ortiz, L.E., Berg, T.L.: Parsing clothing in fashion photographs. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3570–3577 (2012) Yamaguchi, K., Kiapour, M.H., Ortiz, L.E., Berg, T.L.: Parsing clothing in fashion photographs. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3570–3577 (2012)
5.
go back to reference Rother, C., Kolmogorov, V., Blake, A.: Grabcut - interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. (TOG) 23, 309–314 (2004)CrossRef Rother, C., Kolmogorov, V., Blake, A.: Grabcut - interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. (TOG) 23, 309–314 (2004)CrossRef
6.
go back to reference Liu, S., Song, Z., Liu, G.: Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3330–3337 (2012) Liu, S., Song, Z., Liu, G.: Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3330–3337 (2012)
7.
go back to reference Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: IEEE Conference Computer Vision and Pattern Recognition, pp. 1385–1392 (2011) Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: IEEE Conference Computer Vision and Pattern Recognition, pp. 1385–1392 (2011)
8.
go back to reference Liu, S., et al.: Hi, magic closet, tell me what to wear! In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 619–628 (2012) Liu, S., et al.: Hi, magic closet, tell me what to wear! In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 619–628 (2012)
9.
go back to reference Fu, J., Wang, J., Li, Z., et al.: Efficient clothing retrieval with semantic preserving visual phrases. In: Proceedings of 11th Asian Conference on Computer Vision, pp. 420–431 (2013) Fu, J., Wang, J., Li, Z., et al.: Efficient clothing retrieval with semantic preserving visual phrases. In: Proceedings of 11th Asian Conference on Computer Vision, pp. 420–431 (2013)
10.
go back to reference Di, W., Wah, C., Bhardwaj, A., Piramuthu, R., Sundaresan, N.: Style finder: fine-grained clothing style recognition and retrieval. In: Computer Vision and Pattern Recognition Workshops, pp. 8–13 (2013) Di, W., Wah, C., Bhardwaj, A., Piramuthu, R., Sundaresan, N.: Style finder: fine-grained clothing style recognition and retrieval. In: Computer Vision and Pattern Recognition Workshops, pp. 8–13 (2013)
11.
go back to reference Chen, H. Gallagher, A. Girod, B.: Describing clothing by semantic attributes. In: Proceedings of the 12th European Conference on Computer Vision, pp. 609–623 (2012) Chen, H. Gallagher, A. Girod, B.: Describing clothing by semantic attributes. In: Proceedings of the 12th European Conference on Computer Vision, pp. 609–623 (2012)
12.
go back to reference Kalantidis, Y., Kennedy, L., Li, L.J.: Getting the look: clothing recognition and segmentation for automatic product suggestions in everyday photos. In: The 3rd ACM Conference on International Conference on Multimedia Retrieval, pp. 105–112 (2013) Kalantidis, Y., Kennedy, L., Li, L.J.: Getting the look: clothing recognition and segmentation for automatic product suggestions in everyday photos. In: The 3rd ACM Conference on International Conference on Multimedia Retrieval, pp. 105–112 (2013)
13.
go back to reference Malisiewicz, T., Gupta, A., Efros, A.A.: A. Ensemble of exemplar-SVMs for object detection. In: International Conference on Computer Vision, pp. 89–96 (2011) Malisiewicz, T., Gupta, A., Efros, A.A.: A. Ensemble of exemplar-SVMs for object detection. In: International Conference on Computer Vision, pp. 89–96 (2011)
14.
go back to reference Sutskever, I., Krizhevsky, A., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: Neural Information Processing Systems, pp. 1097–1105 (2012) Sutskever, I., Krizhevsky, A., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: Neural Information Processing Systems, pp. 1097–1105 (2012)
15.
go back to reference Babenko, A., Lempitsky, V.: The inverted multi-index. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3069–3076 (2012) Babenko, A., Lempitsky, V.: The inverted multi-index. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3069–3076 (2012)
16.
go back to reference Jurie, F., Nowak, E., Triggs, B.: Sampling strategies for bag of features image classification. In: European Conference on Computer Vision, pp. 490–503 (2006) Jurie, F., Nowak, E., Triggs, B.: Sampling strategies for bag of features image classification. In: European Conference on Computer Vision, pp. 490–503 (2006)
17.
go back to reference Kiapour, M.H., Lazebnik, S., Han, X.: Where to buy it: matching street clothing photos in online shops. In: IEEE International Conference on Computer Vision, pp. 3343–3351 (2015) Kiapour, M.H., Lazebnik, S., Han, X.: Where to buy it: matching street clothing photos in online shops. In: IEEE International Conference on Computer Vision, pp. 3343–3351 (2015)
18.
go back to reference Girshick, R., Donahue, J., Darrell, T.: Region based convolutional networks for accurate object detection and semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38, 142–158 (2015)CrossRef Girshick, R., Donahue, J., Darrell, T.: Region based convolutional networks for accurate object detection and semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38, 142–158 (2015)CrossRef
19.
go back to reference Kuang, Z., Li, Z., Lv, Q.: Modal function transformation for isometric 3D shape representation. Comput. Graph. 46, 209–220 (2015)CrossRef Kuang, Z., Li, Z., Lv, Q.: Modal function transformation for isometric 3D shape representation. Comput. Graph. 46, 209–220 (2015)CrossRef
20.
go back to reference Liu, R., Zhao, Y., Wei, S., Zhu, Z., Liao, L., Qiu, S.: Indexing of CNN features for large scale image search. CoRR, abs/1508.00217 (2015) Liu, R., Zhao, Y., Wei, S., Zhu, Z., Liao, L., Qiu, S.: Indexing of CNN features for large scale image search. CoRR, abs/1508.00217 (2015)
21.
go back to reference Uijlings, J., van Sande, K.E.A.: Selective search for object recognition. Int. J. Comput. Vis. 104, 154–171 (2013)CrossRef Uijlings, J., van Sande, K.E.A.: Selective search for object recognition. Int. J. Comput. Vis. 104, 154–171 (2013)CrossRef
22.
go back to reference Avrithis, Y., Kalantidis, Y.: Approximate Gaussian mixtures for large scale vocabularies. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 15–28. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33712-3_2 CrossRef Avrithis, Y., Kalantidis, Y.: Approximate Gaussian mixtures for large scale vocabularies. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 15–28. Springer, Heidelberg (2012). doi:10.​1007/​978-3-642-33712-3_​2 CrossRef
Metadata
Title
Fast Cross-Scenario Clothing Retrieval Based on Indexing Deep Features
Authors
Zongmin Li
Yante Li
Yongbiao Gao
Yujie Liu
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-48890-5_11