Skip to main content

2016 | OriginalPaper | Buchkapitel

Smooth Stroke Width Transform for Text Detection

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The stroke width transform (SWT) is a generic operation for the task of detecting texts from natural images because the characters intrinsically have the elongated shape of nearly uniform width. The edge pairing technique was recently developed by Epshtein et al. and is popularly used due to its simplicity and effectiveness. However since the natural images are noisy and sensitive to variations, high degree of artifacts arises and it hinders subsequent processing of the text detection. This paper reformulates the SWT problem in a new way that searches for an optimal solution in 3-D space. We present an effective search algorithm called the aggregation approach, borrowed from the depth image reconstruction domain. The experiments showed that the algorithm produced a smooth SWT map which is better for subsequent processes.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Koo, H.I., Kim, D.H.: Scene text detection via connected component clustering and non-text filtering. IEEE Trans. Image Process. 22(6), 2296–2305 (2013)MathSciNetCrossRefMATH Koo, H.I., Kim, D.H.: Scene text detection via connected component clustering and non-text filtering. IEEE Trans. Image Process. 22(6), 2296–2305 (2013)MathSciNetCrossRefMATH
2.
Zurück zum Zitat Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Recogn. Mach. Intell. 36(5), 970–983 (2014)CrossRef Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Recogn. Mach. Intell. 36(5), 970–983 (2014)CrossRef
3.
Zurück zum Zitat Epshtein, B., Ofek, E., Wexler, Y., Detecting text in natural scenes with stroke width transform. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2963–2970 (2010) Epshtein, B., Ofek, E., Wexler, Y., Detecting text in natural scenes with stroke width transform. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2963–2970 (2010)
4.
Zurück zum Zitat Lucas, B.D., Kanade, T.: An iterative image registration technique with an application to stereo vision. In: International Joint Conference on Artificial Intelligence, pp. 674–679 (1981) Lucas, B.D., Kanade, T.: An iterative image registration technique with an application to stereo vision. In: International Joint Conference on Artificial Intelligence, pp. 674–679 (1981)
5.
Zurück zum Zitat Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 47(1), 7–42 (2002)CrossRefMATH Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 47(1), 7–42 (2002)CrossRefMATH
6.
Zurück zum Zitat Zhang, J., Kasturi, R.: Character energy and link energy-based text extraction in scene images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part II. LNCS, vol. 6493, pp. 308–320. Springer, Heidelberg (2011)CrossRef Zhang, J., Kasturi, R.: Character energy and link energy-based text extraction in scene images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part II. LNCS, vol. 6493, pp. 308–320. Springer, Heidelberg (2011)CrossRef
7.
Zurück zum Zitat Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans. Image Process. 20(9), 2594–2605 (2011)MathSciNetCrossRefMATH Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans. Image Process. 20(9), 2594–2605 (2011)MathSciNetCrossRefMATH
8.
Zurück zum Zitat Mosleh, A., et al.: Image text detection using a Bandlet-based edge detector and stroke width transform. In: British Machine Vision Conference (2012) Mosleh, A., et al.: Image text detection using a Bandlet-based edge detector and stroke width transform. In: British Machine Vision Conference (2012)
9.
Zurück zum Zitat Meng, Q., Song, Y., Zhang, Y., Liu, Y.: Text detection in natural scene with edge analysis. In: International Conference on Image Processing, pp. 4151–4155 (2013) Meng, Q., Song, Y., Zhang, Y., Liu, Y.: Text detection in natural scene with edge analysis. In: International Conference on Image Processing, pp. 4151–4155 (2013)
10.
Zurück zum Zitat Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In: International Conference on Computer Vision, pp. 1241–1248 (2013) Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In: International Conference on Computer Vision, pp. 1241–1248 (2013)
11.
Zurück zum Zitat Karthikeyan, S.: Jagadeesh, V., Manjunath, B.S.: Learning bottom-up text attention maps for text detection using stroke width transform. In: International Conference on Image Processing, pp. 3312–3316 (2013) Karthikeyan, S.: Jagadeesh, V., Manjunath, B.S.: Learning bottom-up text attention maps for text detection using stroke width transform. In: International Conference on Image Processing, pp. 3312–3316 (2013)
12.
Zurück zum Zitat Liu, S., Zhou, Y., Zhang, Y., Wang, Y., Lin, W.: Text detection in natural scene images with stroke width clustering and superpixel. In: Ooi, W.T., Snoek, C.G., Tan, H.K., Ho, C.-K., Huet, B., Ngo, C.-W. (eds.) PCM 2014. LNCS, vol. 8879, pp. 123–132. Springer, Heidelberg (2014) Liu, S., Zhou, Y., Zhang, Y., Wang, Y., Lin, W.: Text detection in natural scene images with stroke width clustering and superpixel. In: Ooi, W.T., Snoek, C.G., Tan, H.K., Ho, C.-K., Huet, B., Ngo, C.-W. (eds.) PCM 2014. LNCS, vol. 8879, pp. 123–132. Springer, Heidelberg (2014)
13.
Zurück zum Zitat Dong, W., Lian, Z., Tang, Y., Xiao, J.: Text detection in natural images using localized stroke width transform. In: He, X., Luo, S., Tao, D., Xu, C., Yang, J., Hasan, M.A. (eds.) MMM 2015, Part I. LNCS, vol. 8935, pp. 49–58. Springer, Heidelberg (2015) Dong, W., Lian, Z., Tang, Y., Xiao, J.: Text detection in natural images using localized stroke width transform. In: He, X., Luo, S., Tao, D., Xu, C., Yang, J., Hasan, M.A. (eds.) MMM 2015, Part I. LNCS, vol. 8935, pp. 49–58. Springer, Heidelberg (2015)
14.
Zurück zum Zitat Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1083–1090 (2012) Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1083–1090 (2012)
15.
Zurück zum Zitat Bhavadharani, R., Thilagavathy, A.: An efficient gaze-text-detection from images using stroke width transform. Int. J. Adv. Eng. Technol. Manag. Appl. Sci. 1(6), 1–8 (2014) Bhavadharani, R., Thilagavathy, A.: An efficient gaze-text-detection from images using stroke width transform. Int. J. Adv. Eng. Technol. Manag. Appl. Sci. 1(6), 1–8 (2014)
16.
Zurück zum Zitat Gong, M., Yang, R., Wang, L., Gong, M.: A performance study on different cost aggregation approaches used in real-time stereo matching. Int. J. Comput. Vis. 75(2), 283–296 (2007)CrossRef Gong, M., Yang, R., Wang, L., Gong, M.: A performance study on different cost aggregation approaches used in real-time stereo matching. Int. J. Comput. Vis. 75(2), 283–296 (2007)CrossRef
17.
Zurück zum Zitat Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: International Conference on Document Analysis and Recognition, pp. 1484–1493 (2013) Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: International Conference on Document Analysis and Recognition, pp. 1484–1493 (2013)
Metadaten
Titel
Smooth Stroke Width Transform for Text Detection
verfasst von
Il-Seok Oh
Jin-Seon Lee
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-44748-3_18

Premium Partner