
2019 | OriginalPaper | Chapter

Text Extraction Using Sparse Representation over Learning Dictionaries

Authors: Thanh-Ha Do, Thi Minh Huyen Nguyen, K. C. Santosh

Published in: Recent Trends in Image Processing and Pattern Recognition

Publisher: Springer Singapore


Abstract

This paper presents a new approach to text detection using sparse representation over learned dictionaries. More specifically, the K-SVD algorithm is used to construct two dictionaries, one for the background and one for the text. Text detection is then performed by comparing the reconstruction errors of each image patch over the two dictionaries. Results on the ICDAR dataset show that the proposed method is competitive with state-of-the-art methods.
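
The decision rule described in the abstract (assign each patch to whichever dictionary reconstructs it with the smaller error) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the chapter's implementation: the two dictionaries are tiny hand-picked sets of unit-norm atoms rather than dictionaries learned with K-SVD, the sparse coder is plain matching pursuit restricted to one atom, patches are flattened to four pixels with their mean removed, and the names `TEXT_ATOMS`, `BACKGROUND_ATOMS`, `reconstruction_error` and `classify_patch` are hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def unit(v):
    # Scale a vector to unit Euclidean norm.
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]

def remove_mean(patch):
    # Subtract the average intensity so only the patch structure remains.
    mean = sum(patch) / len(patch)
    return [p - mean for p in patch]

def reconstruction_error(patch, atoms, n_nonzero=1):
    """Squared residual norm after greedy matching pursuit over unit-norm atoms."""
    residual = remove_mean(patch)
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        best = max(atoms, key=lambda d: abs(dot(residual, d)))
        coef = dot(residual, best)
        residual = [r - coef * b for r, b in zip(residual, best)]
    return dot(residual, residual)

def classify_patch(patch, text_atoms, background_atoms, n_nonzero=1):
    """Label a patch by the dictionary with the smaller reconstruction error."""
    err_text = reconstruction_error(patch, text_atoms, n_nonzero)
    err_background = reconstruction_error(patch, background_atoms, n_nonzero)
    # Ties (for example, a perfectly flat patch) fall back to "background".
    return "text" if err_text < err_background else "background"

# Hypothetical toy dictionaries (assumption: stand-ins for learned ones).
TEXT_ATOMS = [unit([1, -1, 1, -1]), unit([1, 1, -1, -1])]        # sharp alternations and edges
BACKGROUND_ATOMS = [unit([-3, -1, 1, 3]), unit([-1, 1, 1, -1])]  # gentle ramp and smooth bump

if __name__ == "__main__":
    print(classify_patch([0.9, 0.1, 0.9, 0.1], TEXT_ATOMS, BACKGROUND_ATOMS))  # text
    print(classify_patch([0.2, 0.3, 0.4, 0.5], TEXT_ATOMS, BACKGROUND_ATOMS))  # background
```

In a fuller pipeline under the same assumptions, the hand-picked atoms would be replaced by dictionaries trained on text and background patches, and the per-patch labels would be aggregated over the image to form text regions.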


Metadata
Title
Text Extraction Using Sparse Representation over Learning Dictionaries
Authors
Thanh-Ha Do
Thi Minh Huyen Nguyen
K. C. Santosh
Copyright Year
2019
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-9187-3_1
