Skip to main content
Erschienen in: Multimedia Systems 3/2017

28.03.2016 | Regular Paper

Efficient logo recognition by local feature groups

verfasst von: Yujie Liu, Jun Wang, Zongmin Li, Hua Li

Erschienen in: Multimedia Systems | Ausgabe 3/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a method for efficient and scalable logo recognition. Using generalized Hough transform to identify local features that are invariant across images, we can efficiently add spatial information into groups of local features and enhance the discriminative power of local feature. Our method is more flexible and efficient compared with state-of-the-art methods that merge features into groups. To fully exploit the information that different logo images provide, we employ a reference-based image representation scheme to represent training and testing images. Experiments on challenging datasets show that our method is efficient and scalable and achieves state-of-the-art performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Romberg, S., Pueyo, L. G., Lienhart, R., Van Zwol, R.: Scalable logo recognition in real-world images. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval ACM, p. 25 (2011) Romberg, S., Pueyo, L. G., Lienhart, R., Van Zwol, R.: Scalable logo recognition in real-world images. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval ACM, p. 25 (2011)
2.
Zurück zum Zitat Torralba, A., Murphy, K. P., Freeman, W. T., Rubin, M.: Context-based vision system for place and object recognition. In: Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on IEEE, pp. 273–280 (2003) Torralba, A., Murphy, K. P., Freeman, W. T., Rubin, M.: Context-based vision system for place and object recognition. In: Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on IEEE, pp. 273–280 (2003)
3.
Zurück zum Zitat Dalal, N., & Triggs, B.: Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on IEEE, vol. 1, pp. 886–893 (2005) Dalal, N., & Triggs, B.: Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on IEEE, vol. 1, pp. 886–893 (2005)
4.
Zurück zum Zitat Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)CrossRef
5.
Zurück zum Zitat Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2008)CrossRef Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2008)CrossRef
6.
Zurück zum Zitat Rublee, E., Rabaud, V., Konolige, K., & Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Computer Vision (ICCV), 2011 IEEE International Conference on IEEE, pp. 2564–2571, (2011) Rublee, E., Rabaud, V., Konolige, K., & Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Computer Vision (ICCV), 2011 IEEE International Conference on IEEE, pp. 2564–2571, (2011)
7.
Zurück zum Zitat Sivic, J., & Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on IEEE, pp. 1470–1477 (2003) Sivic, J., & Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on IEEE, pp. 1470–1477 (2003)
8.
Zurück zum Zitat Tian, Q., Zhang, S., Zhou, W., Ji, R., Ni, B., Sebe, N.: Building descriptive and discriminative visual codebook for large-scale image applications. Multimed. Tools Appl 51(2), 441–477 (2011)CrossRef Tian, Q., Zhang, S., Zhou, W., Ji, R., Ni, B., Sebe, N.: Building descriptive and discriminative visual codebook for large-scale image applications. Multimed. Tools Appl 51(2), 441–477 (2011)CrossRef
9.
Zurück zum Zitat Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: Computer vision and pattern recognition, 2007. CVPR’07. IEEE Conference on, IEEE, pp. 1–8 (2007) Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: Computer vision and pattern recognition, 2007. CVPR’07. IEEE Conference on, IEEE, pp. 1–8 (2007)
10.
Zurück zum Zitat Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In Computer Vision and Pattern Recognition, 2007. CVPR’07. IEEE Conference on IEEE, pp. 1–8 (2007) Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In Computer Vision and Pattern Recognition, 2007. CVPR’07. IEEE Conference on IEEE, pp. 1–8 (2007)
11.
Zurück zum Zitat Jiang, Y., Meng, J., Yuan, J.: Grid-based local feature bundling for efficient object search and localization. In Image Processing (ICIP), 2011 18th IEEE International Conference on IEEE, pp. 113–116 (2011) Jiang, Y., Meng, J., Yuan, J.: Grid-based local feature bundling for efficient object search and localization. In Image Processing (ICIP), 2011 18th IEEE International Conference on IEEE, pp. 113–116 (2011)
12.
Zurück zum Zitat VC, H. P.:. U.S. Patent No. 3,069,654. Washington, DC: U.S. Patent and Trademark Office (1962) VC, H. P.:. U.S. Patent No. 3,069,654. Washington, DC: U.S. Patent and Trademark Office (1962)
13.
Zurück zum Zitat Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey vision conference, vol. 15, p. 50 (1988) Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey vision conference, vol. 15, p. 50 (1988)
14.
Zurück zum Zitat Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. Int. J. Comput. Vision 60(1), 63–86 (2004)CrossRef Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. Int. J. Comput. Vision 60(1), 63–86 (2004)CrossRef
15.
Zurück zum Zitat Morel, J.M., Yu, G.: ASIFT: a new framework for fully affine invariant image comparison. SIAM J. Imaging Sci. 2(2), 438–469 (2009)MathSciNetCrossRefMATH Morel, J.M., Yu, G.: ASIFT: a new framework for fully affine invariant image comparison. SIAM J. Imaging Sci. 2(2), 438–469 (2009)MathSciNetCrossRefMATH
16.
Zurück zum Zitat Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Van Gool, L.: A comparison of affine region detectors. Int. J. Comput. Vision 65(1–2), 43–72 (2005)CrossRef Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Van Gool, L.: A comparison of affine region detectors. Int. J. Comput. Vision 65(1–2), 43–72 (2005)CrossRef
17.
Zurück zum Zitat Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. Pattern Anal. Mach. Intell. IEEE Trans. 27(10), 1615–1630 (2005)CrossRef Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. Pattern Anal. Mach. Intell. IEEE Trans. 27(10), 1615–1630 (2005)CrossRef
18.
Zurück zum Zitat Wu, Z., Ke, Q., Isard, M., Sun, J.: Bundling features for large scale partial-duplicate web image search. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp. 25–32 (2009) Wu, Z., Ke, Q., Isard, M., Sun, J.: Bundling features for large scale partial-duplicate web image search. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp. 25–32 (2009)
19.
Zurück zum Zitat Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)CrossRef Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)CrossRef
20.
Zurück zum Zitat Fu, J., Wang, J., Zhang, Y., Lu, H.: Point-context descriptor based region search for logo recognition. In: Proceedings of the 4th International Conference on Internet Multimedia Computing and Service ACM, pp. 188–191 (2012) Fu, J., Wang, J., Zhang, Y., Lu, H.: Point-context descriptor based region search for logo recognition. In: Proceedings of the 4th International Conference on Internet Multimedia Computing and Service ACM, pp. 188–191 (2012)
21.
Zurück zum Zitat Kalantidis, Y., Pueyo, L. G., Trevisiol, M., van Zwol, R., Avrithis, Y.: Scalable triangulation-based logo recognition. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval ACM, p. 20 (2011) Kalantidis, Y., Pueyo, L. G., Trevisiol, M., van Zwol, R., Avrithis, Y.: Scalable triangulation-based logo recognition. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval ACM, p. 20 (2011)
22.
Zurück zum Zitat Romberg, S., & Lienhart, R.: Bundle min-hashing for logo recognition. In: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval ACM, pp. 113–120 (2013) Romberg, S., & Lienhart, R.: Bundle min-hashing for logo recognition. In: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval ACM, pp. 113–120 (2013)
23.
Zurück zum Zitat Zhang, Y., Jia, Z., Chen, T.: Image retrieval with geometry-preserving visual phrases. In: Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on IEEE, pp. 809–816 (2011) Zhang, Y., Jia, Z., Chen, T.: Image retrieval with geometry-preserving visual phrases. In: Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on IEEE, pp. 809–816 (2011)
24.
Zurück zum Zitat Zheng, L., Wang, S.: Visual phraselet: refining spatial constraints for large scale image search. Signal Process. Lett. IEEE 20(4), 391–394 (2013)MathSciNetCrossRef Zheng, L., Wang, S.: Visual phraselet: refining spatial constraints for large scale image search. Signal Process. Lett. IEEE 20(4), 391–394 (2013)MathSciNetCrossRef
25.
Zurück zum Zitat Liu, D., Hua, G., Viola, P., Chen, T.: Integrated feature selection and higher-order spatial feature extraction for object categorization. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE pp. 1–8 (2008) Liu, D., Hua, G., Viola, P., Chen, T.: Integrated feature selection and higher-order spatial feature extraction for object categorization. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE pp. 1–8 (2008)
26.
Zurück zum Zitat Zhang, S., Huang, Q., Hua, G., Jiang, S., Gao, W., Tian, Q.: Building contextual visual vocabulary for large-scale image applications. In: Proceedings of the international conference on Multimedia ACM, pp. 501–510 (2010) Zhang, S., Huang, Q., Hua, G., Jiang, S., Gao, W., Tian, Q.: Building contextual visual vocabulary for large-scale image applications. In: Proceedings of the international conference on Multimedia ACM, pp. 501–510 (2010)
27.
Zurück zum Zitat Kleban, J., Xie, X., Ma, W. Y.: Spatial pyramid mining for logo detection in natural scenes. In: Multimedia and Expo, 2008 IEEE International Conference on IEEE, pp. 1077–1080, (2008) Kleban, J., Xie, X., Ma, W. Y.: Spatial pyramid mining for logo detection in natural scenes. In: Multimedia and Expo, 2008 IEEE International Conference on IEEE, pp. 1077–1080, (2008)
28.
Zurück zum Zitat Zhang, S., Tian, Q., Hua, G., Huang, Q., Li, S.: Descriptive visual words and visual phrases for image applications. In: Proceedings of the 17th ACM international conference on Multimedia ACM, pp. 75–84 (2009) Zhang, S., Tian, Q., Hua, G., Huang, Q., Li, S.: Descriptive visual words and visual phrases for image applications. In: Proceedings of the 17th ACM international conference on Multimedia ACM, pp. 75–84 (2009)
29.
Zurück zum Zitat Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp. 1794–1801 (2009) Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp. 1794–1801 (2009)
30.
Zurück zum Zitat Arandjelović, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on IEEE, pp. 2911–2918 (2012) Arandjelović, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on IEEE, pp. 2911–2918 (2012)
31.
Zurück zum Zitat Revaud, J., Douze, M., Schmid, C.: Correlation-based burstiness for logo retrieval. In: Proceedings of the 20th ACM international conference on Multimedia ACM, pp. 965–968 (2012) Revaud, J., Douze, M., Schmid, C.: Correlation-based burstiness for logo retrieval. In: Proceedings of the 20th ACM international conference on Multimedia ACM, pp. 965–968 (2012)
32.
Zurück zum Zitat Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011) Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)
Metadaten
Titel
Efficient logo recognition by local feature groups
verfasst von
Yujie Liu
Jun Wang
Zongmin Li
Hua Li
Publikationsdatum
28.03.2016
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems / Ausgabe 3/2017
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-016-0508-7

Weitere Artikel der Ausgabe 3/2017

Multimedia Systems 3/2017 Zur Ausgabe