2017 | OriginalPaper | Book chapter

Script Identification in Natural Scene Images: A Dataset and Texture-Feature Based Performance Evaluation

Authors: Manisha Verma, Nitakshi Sood, Partha Pratim Roy, Balasubramanian Raman

Published in: Proceedings of International Conference on Computer Vision and Image Processing

Publisher: Springer Singapore

Abstract

Recognizing text with occlusion and perspective distortion in natural scenes is a challenging problem. In this work, we present a dataset of multilingual scripts and a performance evaluation of script identification on this dataset using texture features. We present a ‘Station Signboard’ database that contains railway signboards written in 5 different Indic scripts. The images pose challenges such as occlusion, perspective distortion, and illumination effects. We collected a total of 500 images, and the corresponding ground truths were created in a semi-automatic way. Next, a script identification technique is proposed for multilingual scene text recognition. Considering the inherent problems in scene images, local texture features are used for feature extraction and an SVM classifier is employed for script identification. In a preliminary experiment, the script identification performance is found to be 84% using the LBP feature with an SVM classifier.
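To make the texture-feature step concrete, here is a minimal sketch of a basic 3×3 local binary pattern (LBP) histogram in plain Python. The abstract does not say which LBP variant, neighbourhood size, or image preprocessing the authors used, so the 8-neighbour operator, the `>=` threshold convention, and the function names `lbp_code` and `lbp_histogram` are all assumptions made for illustration. The SVM step is left out; in a pipeline of this kind, the resulting 256-bin histogram would be the feature vector passed to an SVM implementation of your choice.

```python
from typing import List, Sequence

# Offsets of the 8 neighbours, clockwise starting at the top-left pixel.
# The starting point and direction are a convention chosen for this sketch.
NEIGHBOUR_OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
                     (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(image: Sequence[Sequence[int]], row: int, col: int) -> int:
    """Return the 8-bit LBP code of one interior pixel.

    Each neighbour contributes a 1 bit when its grey value is greater
    than or equal to the centre value, and a 0 bit otherwise.
    """
    centre = image[row][col]
    code = 0
    for bit, (d_row, d_col) in enumerate(NEIGHBOUR_OFFSETS):
        if image[row + d_row][col + d_col] >= centre:
            code |= 1 << bit
    return code


def lbp_histogram(image: Sequence[Sequence[int]]) -> List[float]:
    """Return a normalised 256-bin histogram of LBP codes.

    Border pixels are skipped because they lack a full 3x3 neighbourhood.
    """
    rows, cols = len(image), len(image[0])
    counts = [0] * 256
    for row in range(1, rows - 1):
        for col in range(1, cols - 1):
            counts[lbp_code(image, row, col)] += 1
    total = sum(counts)
    if total == 0:  # image too small to contain any interior pixel
        return [0.0] * 256
    return [count / total for count in counts]


if __name__ == "__main__":
    # Hypothetical 4x4 grey-level patch, used only to exercise the functions.
    patch = [[1, 2, 3, 4],
             [5, 6, 7, 8],
             [9, 1, 2, 3],
             [4, 5, 6, 7]]
    histogram = lbp_histogram(patch)
    print(len(histogram), round(sum(histogram), 6))
```

Normalising the histogram keeps the feature vector comparable across cropped word images of different sizes, which is one plausible reason to prefer it over raw counts in a setting like this one.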

Metadata
Title
Script Identification in Natural Scene Images: A Dataset and Texture-Feature Based Performance Evaluation
Authors
Manisha Verma
Nitakshi Sood
Partha Pratim Roy
Balasubramanian Raman
Copyright year
2017
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-2107-7_28
