Skip to main content

2017 | OriginalPaper | Buchkapitel

A Text Recognition Augmented Deep Learning Approach for Logo Identification

verfasst von : Moushumi Medhi, Shubham Sinha, Rajiv Ranjan Sahay

Erschienen in: Computer Vision, Graphics, and Image Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Logo/brand name detection and recognition in unstructured and highly unpredictable natural images has always been a challenging problem. We notice that in most natural images logos are accompanied with associated text. Therefore, we address the problem of logo recognition by first detecting and isolating text of varying color, font size and orientation in the input image using affine invariant maximally stable extremal regions (MSERs). Using an off-the-shelf OCR, we identify the text associated with the logo image. Then an effective grouping technique is employed to combine the remaining stable regions based on spatial proximity of MSERs. Deep learning has the advantage that optimal features can be learned automatically from image pixel data. This motivates us to feed the clustered logo candidate image regions to a pre-trained deep convolutional neural network (DCNN) to generate a set of complex features which are further input to a multiclass support vector machine (SVM) for classification. We tested our proposed logo recognition system on 32 logo classes, and a non-logo class obtained by combining FlickrLogos-32 and MICC logo databases, amounting to a total of 23582 training and testing images. Our method yields robust recognition performance, outperforming state-of-the-art techniques achieving 97.8% precision, 95.7% recall and 95.7% average accuracy on the combined MICC and FlickrLogos-32 datasets and a precision of 98.6%, recall of 97.9% and average accuracy of 99.6% on only the FlickrLogos-32 dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alaei, A., Delalandre, M., Girard, N.: Logo detection using painting based representation and probability features. In: ICDAR, pp. 1235–1239 (2013) Alaei, A., Delalandre, M., Girard, N.: Logo detection using painting based representation and probability features. In: ICDAR, pp. 1235–1239 (2013)
2.
Zurück zum Zitat Boia, R., Florea, C., Florea, L., Dogaru, R.: Logo localization and recognition in natural images using homographic class graphs. Mach. Vis. Appl. 27(2), 287–301 (2016)CrossRef Boia, R., Florea, C., Florea, L., Dogaru, R.: Logo localization and recognition in natural images using homographic class graphs. Mach. Vis. Appl. 27(2), 287–301 (2016)CrossRef
3.
Zurück zum Zitat Romberg, S., Pueyo, L.G., Lienhart, R., Zwol, R.V.: Scalable logo recognition in real-world images. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval, pp. 965–968 (2011) Romberg, S., Pueyo, L.G., Lienhart, R., Zwol, R.V.: Scalable logo recognition in real-world images. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval, pp. 965–968 (2011)
4.
Zurück zum Zitat Chen, W., Lan, S., Xu, P.: Multiple feature fusion via hierarchical matching for TV logo recognition. In: Proceedings of the 8th International Congress on Image and Signal Processing, IEEE (2015) Chen, W., Lan, S., Xu, P.: Multiple feature fusion via hierarchical matching for TV logo recognition. In: Proceedings of the 8th International Congress on Image and Signal Processing, IEEE (2015)
5.
Zurück zum Zitat Sahbi, H., Ballan, L., Serra, G., Bimbo, A.: Context-dependent logo matching and recognition. IEEE Trans. Image Process. 22(3), 1018–1031 (2013). IEEECrossRefMATHMathSciNet Sahbi, H., Ballan, L., Serra, G., Bimbo, A.: Context-dependent logo matching and recognition. IEEE Trans. Image Process. 22(3), 1018–1031 (2013). IEEECrossRefMATHMathSciNet
6.
Zurück zum Zitat Zhang, Y., Zhang, S., Liang, W., Guo, Q.: Individualized matching based on logo density for scalable logo recognition. In: ICASSP, pp. 4324–4328 (2014) Zhang, Y., Zhang, S., Liang, W., Guo, Q.: Individualized matching based on logo density for scalable logo recognition. In: ICASSP, pp. 4324–4328 (2014)
7.
Zurück zum Zitat Hassanzadeh, S., Pourghassem, H.: Fast logo detection based on morphological features in document images. In: Proceedings of the 7th International Colloquium on Signal Processing and its Applications, pp. 283–286 (2011) Hassanzadeh, S., Pourghassem, H.: Fast logo detection based on morphological features in document images. In: Proceedings of the 7th International Colloquium on Signal Processing and its Applications, pp. 283–286 (2011)
8.
Zurück zum Zitat Hoi, S.C.H., Wu, X., Liu, H., Wu, Y., Wang, H., Xue, H., Wu, Q.: LOGO-net: largescale deep logo detection and brand recognition with deep region-based convolutional networks. arXiv:1511.02462 (2015) Hoi, S.C.H., Wu, X., Liu, H., Wu, Y., Wang, H., Xue, H., Wu, Q.: LOGO-net: largescale deep logo detection and brand recognition with deep region-based convolutional networks. arXiv:​1511.​02462 (2015)
9.
Zurück zum Zitat Iandola, F.N., Shen, A., Gao, P., Keutzer, K.: DeepLogo: hitting logo recognition with the deep neural network hammer. arXiv:1510.02131 (2015) Iandola, F.N., Shen, A., Gao, P., Keutzer, K.: DeepLogo: hitting logo recognition with the deep neural network hammer. arXiv:​1510.​02131 (2015)
10.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, J.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, J.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 1097–1105 (2012)
11.
Zurück zum Zitat Oliveira, G., Frazão, X., Pimentel, A., Ribeiro. B.: Automatic graphic logo detection via fast region-based convolutional networks. arXiv:1604.06083 (2016) Oliveira, G., Frazão, X., Pimentel, A., Ribeiro. B.: Automatic graphic logo detection via fast region-based convolutional networks. arXiv:​1604.​06083 (2016)
12.
Zurück zum Zitat Uijlings, J.R.R., Van De Sande, K.E.A., Gevers, T., Smeulders, A.W.M.: Selective search for object recognition. Intl. J. Comput. Vis. 104(2), 154–171 (2013). SpringerCrossRef Uijlings, J.R.R., Van De Sande, K.E.A., Gevers, T., Smeulders, A.W.M.: Selective search for object recognition. Intl. J. Comput. Vis. 104(2), 154–171 (2013). SpringerCrossRef
13.
Zurück zum Zitat Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2002)CrossRef Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2002)CrossRef
14.
Zurück zum Zitat Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.1007/978-3-319-10590-1_53 Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.​1007/​978-3-319-10590-1_​53
15.
Zurück zum Zitat Hancock, J.M.: Jaccard distance (Jaccard Index, Jaccard Similarity Coefficient). Dictionary Bioinform. Comput. Biol (2004) Hancock, J.M.: Jaccard distance (Jaccard Index, Jaccard Similarity Coefficient). Dictionary Bioinform. Comput. Biol (2004)
16.
Zurück zum Zitat Huang, W., Qiao, Y., Tang, X.: Robust scene text detection with convolution neural network induced MSER trees. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 497–511. Springer, Cham (2014). doi:10.1007/978-3-319-10593-2_33 Huang, W., Qiao, Y., Tang, X.: Robust scene text detection with convolution neural network induced MSER trees. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 497–511. Springer, Cham (2014). doi:10.​1007/​978-3-319-10593-2_​33
17.
Zurück zum Zitat de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the 4th International Conference on Computer Vision Theory and Applications, pp. 273–280 (2009) de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the 4th International Conference on Computer Vision Theory and Applications, pp. 273–280 (2009)
18.
Zurück zum Zitat Revaud, J., Douze, M., Schmid, C.: Correlation-based burstiness for logo retrieval. In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 965–968 (2012) Revaud, J., Douze, M., Schmid, C.: Correlation-based burstiness for logo retrieval. In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 965–968 (2012)
19.
Zurück zum Zitat Romberg, S., Lienhart, R.: Bundle min-hashing for logo recognition. In: Proceedings of the 3rd ACM Conf. on International Conference on Multimedia Accessed, pp. 113–120 (2013) Romberg, S., Lienhart, R.: Bundle min-hashing for logo recognition. In: Proceedings of the 3rd ACM Conf. on International Conference on Multimedia Accessed, pp. 113–120 (2013)
20.
Zurück zum Zitat Farajzadeh, N.: Exemplar-based logo and trademark recognition. Mach. Vis. Appl. 26(6), 791–805 (2015)CrossRef Farajzadeh, N.: Exemplar-based logo and trademark recognition. Mach. Vis. Appl. 26(6), 791–805 (2015)CrossRef
21.
Zurück zum Zitat Liu, Y., Wang, J., Li, Z., Li, H.: Efficient logo recognition by local feature groups. Multimedia Syst. 1–9 (2016) Liu, Y., Wang, J., Li, Z., Li, H.: Efficient logo recognition by local feature groups. Multimedia Syst. 1–9 (2016)
22.
Zurück zum Zitat Nair, V., Hinton, G.: Rectified linear units improve restricted boltzmann machines. In: ICML (2010) Nair, V., Hinton, G.: Rectified linear units improve restricted boltzmann machines. In: ICML (2010)
Metadaten
Titel
A Text Recognition Augmented Deep Learning Approach for Logo Identification
verfasst von
Moushumi Medhi
Shubham Sinha
Rajiv Ranjan Sahay
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-68124-5_13