Script Identification from Camera-Captured Multi-script Scene Text Components

Jajoo, Madhuram; Chakraborty, Neelotpal; Mollah, Ayatullah Faruk; Basu, Subhadip; Sarkar, Ram

doi:10.1007/978-981-13-1280-9_16

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 740)

Abstract

Identifying the script of multi-script text components in camera-captured images is an emerging research field. The challenges are mainly twofold: (1) those typical of camera-captured images, such as blur, uneven illumination, and complex backgrounds, and (2) those related to the shape, size, and orientation of text written in different scripts. In this work, an effective feature set combining shape-based and texture-based features is designed for script classification. An in-house scene text data set comprising 300 text boxes written in three scripts, namely Bangla, Devanagari, and Roman, is prepared. The feature set is evaluated with five popular classifiers, and the highest accuracy of 90% is achieved with a Multi-layer Perceptron (MLP) classifier, which is reasonably satisfactory considering the complexity of the domain.
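The abstract does not list the individual features, so the following is only a minimal sketch of what a combined shape-and-texture feature vector for a binarized text box could look like. Every feature here (`aspect_ratio`, `fill_density`, `max_row_fill`, and the transition counts used as a crude texture proxy) and the `toy` image are illustrative assumptions, not the authors' actual feature set.

```python
from typing import Dict, List

# Assumption: a text box is already binarized, 1 = text pixel, 0 = background.
Image = List[List[int]]


def shape_features(img: Image) -> Dict[str, float]:
    """Coarse shape descriptors of a binarized text component (illustrative)."""
    height, width = len(img), len(img[0])
    total = sum(sum(row) for row in img)
    # Fill ratio of the densest row: a long horizontal headline stroke,
    # as found in Bangla and Devanagari words, pushes this towards 1.
    max_row_fill = max(sum(row) for row in img) / width
    return {
        "aspect_ratio": width / height,
        "fill_density": total / (width * height),
        "max_row_fill": max_row_fill,
    }


def texture_features(img: Image) -> Dict[str, float]:
    """Mean number of 0/1 transitions per row and per column (texture proxy)."""
    def transitions(line) -> int:
        return sum(1 for a, b in zip(line, line[1:]) if a != b)

    rows = [transitions(r) for r in img]
    cols = [transitions(c) for c in zip(*img)]
    return {
        "row_transitions": sum(rows) / len(rows),
        "col_transitions": sum(cols) / len(cols),
    }


def feature_vector(img: Image) -> List[float]:
    """Concatenate both feature groups in a fixed (alphabetical) key order."""
    feats = {**shape_features(img), **texture_features(img)}
    return [feats[key] for key in sorted(feats)]


# Hypothetical 4x6 component with a full-width top stroke.
toy = [
    [1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 1, 0, 1, 0],
]

if __name__ == "__main__":
    print(feature_vector(toy))
```

A vector of this kind, computed per text box, is what would then be passed to a classifier such as an MLP; the classifier itself is left out here because the abstract gives no details of its configuration.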

Acknowledgements

This work is partially supported by the CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India, and by the PURSE-II and UPE-II projects. SB is partially funded by DBT grant (BT/PR16356/BID/7/596/2016) and UGC Research Award (F. 30-31/2016(SA-II)). RS, SB, and AFM are partially funded by DST grant (EMR/2016/007213).

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
Madhuram Jajoo, Neelotpal Chakraborty, Subhadip Basu & Ram Sarkar
Department of Computer Science and Engineering, Aliah University, Kolkata, 700160, India
Ayatullah Faruk Mollah

Corresponding author

Correspondence to Ram Sarkar.

Editor information

Editors and Affiliations

College of Engineering and Applied Science, University of Colorado Colorado Springs, Colorado Springs, CO, USA
Jugal Kalita
Automation and Applied Informatics, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
Department of Computer Applications, Sikkim Manipal University, Sikkim, India
Samarjeet Borah
Department of Computer Applications, Sikkim Manipal University, Sikkim, India
Ratika Pradhan

About this paper

Cite this paper

Jajoo, M., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R. (2019). Script Identification from Camera-Captured Multi-script Scene Text Components. In: Kalita, J., Balas, V., Borah, S., Pradhan, R. (eds) Recent Developments in Machine Learning and Data Analytics. Advances in Intelligent Systems and Computing, vol 740. Springer, Singapore. https://doi.org/10.1007/978-981-13-1280-9_16

DOI: https://doi.org/10.1007/978-981-13-1280-9_16
Published: 12 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1279-3
Online ISBN: 978-981-13-1280-9
eBook Packages: Intelligent Technologies and Robotics (R0)
