Multimedia Systems

1 February 2013 | Regular Paper

Combining global and local matching of multiple features for precise item image retrieval

Authors: Haojie Li, Xiaohui Wang, Jinhui Tang, Chunxia Zhao

Published in: Multimedia Systems | Issue 1/2013

Abstract

With the rapid growth of online shopping services, millions or even billions of commercial item images are available on the Internet. Effectively leveraging visual search to find the items that interest users is an important yet challenging task. Besides global appearance (e.g., color, shape or pattern), users often pay more attention to the local style of certain products, so an ideal visual item search engine should support detailed and precise search for similar images, which is beyond the capabilities of current search systems. In this paper, we propose a novel system named iSearch, which combines global and local matching of local features to retrieve item images precisely and interactively. We extract multiple local features, including scale-invariant feature transform (SIFT) descriptors, regional color moments and object contour fragments, to represent the visual appearance of items sufficiently, while supporting both global and local matching over large-scale image datasets. To this end, we develop an effective method for encoding and indexing contour fragments. To improve the matching robustness of local features, we encode the spatial context with grid representations and propose a simple but effective verification approach that uses triangle relation constraints for spatial consistency filtering. Experimental evaluations show promising results for our approach and system.
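As an illustration of one of the features named in the abstract, the following is a minimal sketch of color moments computed over the pixels of a single region. The abstract does not state the exact formulation used in iSearch, so this sketch assumes the commonly used three-moment definition (mean, standard deviation, and the signed cube root of the third central moment per channel). The function name `color_moments` and the pixel representation are hypothetical and chosen only for illustration.

```python
import math
from typing import List, Sequence, Tuple

# A pixel is assumed to be a 3-channel tuple (e.g., RGB or HSV values).
Pixel = Tuple[float, float, float]


def color_moments(pixels: Sequence[Pixel]) -> List[float]:
    """Return a 9-D descriptor: [mean, std, skewness] for each of 3 channels.

    This is the common textbook definition, assumed here; it is not
    confirmed to be the formulation used by the paper.
    """
    if not pixels:
        raise ValueError("region must contain at least one pixel")
    n = len(pixels)
    features: List[float] = []
    for channel in range(3):
        values = [p[channel] for p in pixels]
        mean = sum(values) / n
        variance = sum((v - mean) ** 2 for v in values) / n
        third = sum((v - mean) ** 3 for v in values) / n
        # Signed cube root keeps the sign of the third central moment.
        skewness = math.copysign(abs(third) ** (1.0 / 3.0), third)
        features.extend([mean, math.sqrt(variance), skewness])
    return features


# Hypothetical two-pixel region used only to exercise the function.
region = [(0, 0, 0), (2, 0, 0)]
descriptor = color_moments(region)
```

Two regions could then be compared by a distance between their 9-dimensional descriptors; which distance and which color space suit item images best is left open here.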

Title: Combining global and local matching of multiple features for precise item image retrieval
Authors: Haojie Li
Xiaohui Wang
Jinhui Tang
Chunxia Zhao
Publication date: 1 February 2013
Publisher: Springer-Verlag
Published in: Multimedia Systems / Issue 1/2013
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI: https://doi.org/10.1007/s00530-012-0265-1
