Skip to main content

2019 | OriginalPaper | Buchkapitel

Combined Advertising Sign Classifier

verfasst von : Valentin Malykh, Aleksei Samarin

Erschienen in: Analysis of Images, Social Networks and Texts

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The article describes the problem of classifying photographs of advertising signs of commercial establishments according to the type of services provided. The proposed solution is based on the sharing of textual and visual features. We provide a composite model that includes a text recognition module and an extractor of visual characteristics to improve classification accuracy. We achieve \(F_1\) of 0.24 exceeding strong baseline quality for 10%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
We use publicly available pre-trained model which could be accessed here: https://​drive.​google.​com/​file/​d/​0B7XkCwpI5KDYNlN​UTTlSS21pQmM/​edit?​usp=​sharing.
 
Literatur
1.
Zurück zum Zitat Intasuwan, T., Kaewthong, J., Vittayakorn, S.: Text and object detection on billboards. In: 2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE), pp. 6–11 (July 2018) Intasuwan, T., Kaewthong, J., Vittayakorn, S.: Text and object detection on billboards. In: 2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE), pp. 6–11 (July 2018)
3.
Zurück zum Zitat Watve, A., Sural, S.: Soccer video processing for the detection of advertisement billboards. Pattern Recogn. Lett. 29(7), 994–1006 (2008)CrossRef Watve, A., Sural, S.: Soccer video processing for the detection of advertisement billboards. Pattern Recogn. Lett. 29(7), 994–1006 (2008)CrossRef
4.
Zurück zum Zitat Szegedy, C., et al.: Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9 (June 2015) Szegedy, C., et al.: Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9 (June 2015)
5.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (June 2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (June 2016)
6.
Zurück zum Zitat Howard, A.G., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications (2017) Howard, A.G., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications (2017)
8.
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (June 2009) Deng, J., Dong, W., Socher, R., Li, L., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (June 2009)
10.
Zurück zum Zitat Zhou, X., et al.: East: an efficient and accurate scene text detector. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2642–2651 (July 2017) Zhou, X., et al.: East: an efficient and accurate scene text detector. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2642–2651 (July 2017)
11.
Zurück zum Zitat Smith, R.: An overview of the Tesseract OCR engine. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 629–633 (September 2007) Smith, R.: An overview of the Tesseract OCR engine. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 629–633 (September 2007)
12.
Zurück zum Zitat Liu, T., Fang, S., Zhao, Y., Wang, P., Zhang, J.: Implementation of training convolutional neural networks. CoRR. arXiv:1506.01195 (2015) Liu, T., Fang, S., Zhao, Y., Wang, P., Zhang, J.: Implementation of training convolutional neural networks. CoRR. arXiv:​1506.​01195 (2015)
13.
Zurück zum Zitat Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality, vol. 26, p. 10 (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality, vol. 26, p. 10 (2013)
14.
Zurück zum Zitat Malykh, V.: Robust word vectors for Russian language. In: Proceedings of Artificial Intelligence and Natural Language AINL FRUCT 2016 Conference, Saint-Petersburg, Russia, pp. 10–12 (2016) Malykh, V.: Robust word vectors for Russian language. In: Proceedings of Artificial Intelligence and Natural Language AINL FRUCT 2016 Conference, Saint-Petersburg, Russia, pp. 10–12 (2016)
15.
Zurück zum Zitat Cucerzan, S., Brill, E.: Spelling correction as an iterative process that exploits the collective knowledge of web users, vol. 4, pp. 293–300 (2004) Cucerzan, S., Brill, E.: Spelling correction as an iterative process that exploits the collective knowledge of web users, vol. 4, pp. 293–300 (2004)
16.
Zurück zum Zitat Lang, K.: NewsWeeder: Learning to filter netnews. In: Proceedings of the Twelfth International Conference on Machine Learning, pp. 331–339 (1995)CrossRef Lang, K.: NewsWeeder: Learning to filter netnews. In: Proceedings of the Twelfth International Conference on Machine Learning, pp. 331–339 (1995)CrossRef
17.
Zurück zum Zitat Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proceedings of the 2011 International Conference on Computer Vision, ICCV 2011, pp. 1457–1464. IEEE Computer Society, Washington (2011) Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proceedings of the 2011 International Conference on Computer Vision, ICCV 2011, pp. 1457–1464. IEEE Computer Society, Washington (2011)
Metadaten
Titel
Combined Advertising Sign Classifier
verfasst von
Valentin Malykh
Aleksei Samarin
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-37334-4_16

Premium Partner