Skip to main content
main-content
Top

Hint

Swipe to navigate through the chapters of this book

2020 | OriginalPaper | Chapter

Text Detection Using Maximally Stable External Regions and Stroke Width Variation

share
SHARE

Abstract

There is a proverb “an image is worth, than ten thousand words”. So it is very difficult to explain a single image. But what happens if a single image itself contains some information in the form of text. Though it is easy to extract text from the structured image, it is difficult to retrieve it from unstructured image. Thus in this paper, we are providing an efficient and concrete algorithm to solve this problem. This algorithm consists of detecting candidate text region using maximally stable external regions (MSER). Then it removes false region based on basic geometric properties. Now, again removing false region based on stroke width variation (SWV) and finally merging all text regions for detection of the result. At last, recognition of detected text with the help of optical character recognition (OCR). All these methods are combined to give high performance of the proposed algorithm.
Literature
1.
go back to reference Tsai, S., Chen, D., Chandrasekhar, V., Takacs, G., Cheung, N.M., Vedantham, R., Grzeszczuk, R., Girod, B.: Mobile product recognition. In: Proceedings of ACM Multimedia 2010 (2010) Tsai, S., Chen, D., Chandrasekhar, V., Takacs, G., Cheung, N.M., Vedantham, R., Grzeszczuk, R., Girod, B.: Mobile product recognition. In: Proceedings of ACM Multimedia 2010 (2010)
2.
go back to reference Liang, J., Doermann, D., Li, H.P.: Camera-based analysis of text and documents: a survey. IJDAR 7(2–3), 84–104 (2005) CrossRef Liang, J., Doermann, D., Li, H.P.: Camera-based analysis of text and documents: a survey. IJDAR 7(2–3), 84–104 (2005) CrossRef
3.
go back to reference Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. IEEE Part IV LNCS 8692, 497–511 (2012) Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. IEEE Part IV LNCS 8692, 497–511 (2012)
4.
go back to reference Fabrizio, J., Cord, M., Marcotegui, B.: Text extraction from street level images. In: CMRT, pp. 199–204 (2009) Fabrizio, J., Cord, M., Marcotegui, B.: Text extraction from street level images. In: CMRT, pp. 199–204 (2009)
5.
go back to reference Yi, C., Tian, Y.: Assistive text reading from complex background for blind persons. In: Camera-Based Document Analysis and Recognition, pp. 15–28. Springer (2012) Yi, C., Tian, Y.: Assistive text reading from complex background for blind persons. In: Camera-Based Document Analysis and Recognition, pp. 15–28. Springer (2012)
6.
go back to reference Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR, pp. 2963–2970 (2010) Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR, pp. 2963–2970 (2010)
7.
go back to reference Saini, S., Marawaha, C.: Comparative study of text detection in natural scene images. In: IEEE International Conference on Recent Trends in Electronics Information & Communication Technology, RTEICT, pp. 1981–1985 (2016) Saini, S., Marawaha, C.: Comparative study of text detection in natural scene images. In: IEEE International Conference on Recent Trends in Electronics Information & Communication Technology, RTEICT, pp. 1981–1985 (2016)
9.
go back to reference Chen, H., et al.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 18th IEEE International Conference on Image Processing, ICIP. IEEE (2011) Chen, H., et al.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 18th IEEE International Conference on Image Processing, ICIP. IEEE (2011)
10.
go back to reference Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR. IEEE (2012) Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR. IEEE (2012)
11.
go back to reference Gonzalez, A., et al.: Text location in complex images. In: 21st International Conference on Pattern Recognition, ICPR. IEEE (2012) Gonzalez, A., et al.: Text location in complex images. In: 21st International Conference on Pattern Recognition, ICPR. IEEE (2012)
12.
go back to reference Li, Y., Lu, H.: Scene text detection via stroke width. In: 21st International Conference on Pattern Recognition ICPR. IEEE (2012) Li, Y., Lu, H.: Scene text detection via stroke width. In: 21st International Conference on Pattern Recognition ICPR. IEEE (2012)
13.
go back to reference Subramanian, K., Natarajan, P., Decerbo, M., Castañòn, D.: Character-stroke detection for text-localization and extraction. In: International Conference on Document Analysis and Recognition, ICDAR (2005) Subramanian, K., Natarajan, P., Decerbo, M., Castañòn, D.: Character-stroke detection for text-localization and extraction. In: International Conference on Document Analysis and Recognition, ICDAR (2005)
14.
go back to reference Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: CVPR, pp. 2963–2970 (2010) Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: CVPR, pp. 2963–2970 (2010)
15.
go back to reference Minetto, R., Thome, N., Cord, M., Fabrizio, J., Marcotegui, B.: Snoopertext: a multiresolution system for text detection in complex visual scenes. In: ICIP, pp. 3861–3864 (2010) Minetto, R., Thome, N., Cord, M., Fabrizio, J., Marcotegui, B.: Snoopertext: a multiresolution system for text detection in complex visual scenes. In: ICIP, pp. 3861–3864 (2010)
Metadata
Title
Text Detection Using Maximally Stable External Regions and Stroke Width Variation
Authors
Nishant Singh
Vivek Kumar
Charul Bhatnagar
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-39875-0_38