Skip to main content
Top

2015 | OriginalPaper | Chapter

A Method for Binarization of Document Images from a Live Camera Stream

Author : Mattias Wahde

Published in: Agents and Artificial Intelligence

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper describes a method for binarization of document images from a live camera stream. The method is based on histogram matching over partial images (referred to as tiles). A method developed previously has been applied successfully to images with artificially added noise. Here, an improved method is presented, in which the user has more direct control over the specification of the binarizer. The resulting system is then taken a step further, by considering the more difficult case of binarization of live camera images. It is demonstrated that the improved method works well for this case, even when the image stream is obtained using a (slightly modified) low-cost web camera with low resolution. For typical images obtained this way, a standard OCR reader is capable of reading the binarized images, detecting around 87.5 % of all words without any error, and with mostly minor, correctable errors for the remaining words.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011) CrossRef Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011) CrossRef
2.
go back to reference González, A., Bergasa, L.: A text reading algorithm for natural images. Image vis. comput. 31, 255–274 (2013)CrossRef González, A., Bergasa, L.: A text reading algorithm for natural images. Image vis. comput. 31, 255–274 (2013)CrossRef
3.
go back to reference Stathis, P., Kavallieratou, E., Papamarkos, N.: An evaluation technique for binarization algorithms. J. Univ. Comput. Sci. 14(18), 3011–3030 (2008) Stathis, P., Kavallieratou, E., Papamarkos, N.: An evaluation technique for binarization algorithms. J. Univ. Comput. Sci. 14(18), 3011–3030 (2008)
4.
go back to reference Shi, J., Ray, N., Zhang, H.: Shape based local thresholding for binarization of document images. Pattern Recogn. Lett. 33, 24–32 (2012)CrossRef Shi, J., Ray, N., Zhang, H.: Shape based local thresholding for binarization of document images. Pattern Recogn. Lett. 33, 24–32 (2012)CrossRef
5.
go back to reference Valizadeh, M., Kabir, E.: An adaptive water flow model for binarization of degraded document images. Int. J. Doc. Anal. Recogn. 16(2), 165–176 (2013)CrossRef Valizadeh, M., Kabir, E.: An adaptive water flow model for binarization of degraded document images. Int. J. Doc. Anal. Recogn. 16(2), 165–176 (2013)CrossRef
6.
go back to reference Wahde, M.: A method for document image binarization based on histogram matching and repeated contrast enhancement. In: Duval, B., van der Herik, J., Loiseau, S., Filipe, J. (eds.) Proceedings of the 6th International Conference on Agents and Artificial Intelligence (ICAART 2014), pp. 34–41 (2014) Wahde, M.: A method for document image binarization based on histogram matching and repeated contrast enhancement. In: Duval, B., van der Herik, J., Loiseau, S., Filipe, J. (eds.) Proceedings of the 6th International Conference on Agents and Artificial Intelligence (ICAART 2014), pp. 34–41 (2014)
7.
go back to reference Chen, K.-N., Chen, C.-H., Chang, C.-C.: Efficient illumination compensation techniques for text images. Digit. Signal Process. 22, 726–733 (2012)CrossRef Chen, K.-N., Chen, C.-H., Chang, C.-C.: Efficient illumination compensation techniques for text images. Digit. Signal Process. 22, 726–733 (2012)CrossRef
8.
go back to reference Lu, S., Su, B., Tan, C.: Document image binarization using background estimation and stroke edges. Int. J. Doc. Anal. Recogn. 13(4), 303–314 (2010)CrossRef Lu, S., Su, B., Tan, C.: Document image binarization using background estimation and stroke edges. Int. J. Doc. Anal. Recogn. 13(4), 303–314 (2010)CrossRef
9.
go back to reference Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man. Cybern. 9, 62–66 (1979)CrossRef Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man. Cybern. 9, 62–66 (1979)CrossRef
10.
go back to reference Niblack, W.: An Introduction to Image Processing. Prentice-Hall, Englewood Cliffs (1986) Niblack, W.: An Introduction to Image Processing. Prentice-Hall, Englewood Cliffs (1986)
11.
go back to reference Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2010)CrossRef Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2010)CrossRef
12.
go back to reference Pele, O., Werman, M.: The Quadratic-Chi Histogram Distance Family. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 749–762. Springer, Heidelberg (2010) CrossRef Pele, O., Werman, M.: The Quadratic-Chi Histogram Distance Family. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 749–762. Springer, Heidelberg (2010) CrossRef
Metadata
Title
A Method for Binarization of Document Images from a Live Camera Stream
Author
Mattias Wahde
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-25210-0_9

Premium Partner