Skip to main content

Script Identification from Camera-Captured Multi-script Scene Text Components

  • Conference paper
  • First Online:
Recent Developments in Machine Learning and Data Analytics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 740))

Abstract

Identification of script from multi-script text components of camera-captured images is an emerging research field. Here, challenges are mainly twofold: (1) typical challenges of camera-captured images like blur, uneven illumination, complex background, etc., and (2) challenges related to shape, size, and orientation of the texts written in different scripts. In this work, an effective set consisting of both shape-based and texture-based features is designed for script classification. An in-house scene text data set comprising 300 text boxes written in three scripts, namely Bangla, Devanagri, and Roman is prepared. Performance of this feature set is associated with five popular classifiers and highest accuracy of 90% is achieved with Multi-layer Perceptron (MLP) classifier, which is reasonably satisfactory considering the domain complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Neumann, L. Matas, J.: Real-time scene text localization and recognition. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on IEEE (2012)

    Google Scholar 

  2. Gómez, L., Karatzas, D.: Textproposals: a text-specific selective search algorithm for word spotting in the wild. Pattern Recogn. 70, 60–74 (2017)

    Article  Google Scholar 

  3. Singh, A.K., et al.: A simple and effective solution for script identification in the wild. In: Document Analysis Systems (DAS), 2016 12th IAPR Workshop on. IEEE (2016)

    Google Scholar 

  4. Li, Y., et al.: Characterness: an indicator of text in the wild. IEEE Trans. Image Process. 23(4), 1666–1677 (2014)

    Article  MathSciNet  Google Scholar 

  5. Yin, X.C., et al.: Multi-orientation scene text detection with adaptive clustering. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1930–1937 (2015)

    Article  Google Scholar 

  6. Sain, A., Bhunia, A.K., Roy, P.P., Pal, U.: Multi-oriented text detection and verification in video frames and scene images. Neurocomputing 275, 1531–1549 (2018)

    Article  Google Scholar 

  7. Xu, H., Xue, L., Su, F.: Scene text detection based on robust stroke width transform and deep belief network. In: Asian Conference on Computer Vision, pp. 195–209. Springer, Cham (2014)

    Google Scholar 

  8. Weinman, Jerod J., Learned-Miller, Erik, Hanson, Allen R.: Scene text recognition using similarity and a lexicon with sparse belief propagation. IEEE Trans. Pattern Anal. Mach. Intell. 31(10), 1733–1746 (2009)

    Article  Google Scholar 

  9. Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Computer Vision (ICCV), 2011 IEEE International Conference on, (pp. 1457–1464). IEEE (2011)

    Google Scholar 

  10. Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(11), 2298–2304 (2017)

    Article  Google Scholar 

  11. Ye, Qixiang, Doermann, David: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37(7), 1480–1500 (2015)

    Article  Google Scholar 

Download references

Acknowledgements

This work is partially supported by the CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India, PURSE-II and UPE-II, project. SB is partially funded by DBT grant (BT/PR16356/BID/7/596/2016) and UGC Research Award (F. 30-31/2016(SA-II)). RS, SB and AFM are partially funded by DST grant (EMR/2016/007213).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ram Sarkar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jajoo, M., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R. (2019). Script Identification from Camera-Captured Multi-script Scene Text Components. In: Kalita, J., Balas, V., Borah, S., Pradhan, R. (eds) Recent Developments in Machine Learning and Data Analytics. Advances in Intelligent Systems and Computing, vol 740. Springer, Singapore. https://doi.org/10.1007/978-981-13-1280-9_16

Download citation

Publish with us

Policies and ethics