
2016 | Original Paper | Book Chapter

Information Density Based Image Binarization for Text Document Containing Graphics

Authors: Soma Datta, Nabendu Chaki, Sankhayan Choudhury

Published in: Computer Information Systems and Industrial Management

Publisher: Springer International Publishing


Abstract

In this work, a new clustering-based binarization technique is proposed. Clustering is driven by the information density of the input image. The input image is treated as a foreground of text and images over a background of random noise, ink marks, oil spots, and similar artifacts. Separating the foreground from the background with existing binarization techniques is often quite difficult: existing methods give good results when the input image contains only text. Experimental results indicate that the proposed method performs particularly well on degraded text documents that also contain graphic images. The USC-SIPI database is used for testing. The method is compared with iterative partitioning and Otsu’s method on seven different metrics.
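For orientation, one of the baselines named in the abstract, Otsu’s method, can be sketched in a few lines. This is a minimal illustration of the standard global-threshold formulation, not the clustering technique proposed in the chapter. It assumes an 8-bit grayscale input held in a NumPy array, and the function names `otsu_threshold` and `binarize` are chosen here for illustration.

```python
import numpy as np


def otsu_threshold(gray):
    """Return the gray level that maximizes between-class variance.

    Assumes `gray` is a uint8 array (values 0..255).
    """
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()

    # omega[t]: probability of the class with gray level <= t
    omega = np.cumsum(prob)
    # mu[t]: cumulative first moment up to gray level t
    mu = np.cumsum(prob * np.arange(256))
    mu_total = mu[-1]

    # Between-class variance for every candidate threshold t
    denom = omega * (1.0 - omega)
    with np.errstate(divide="ignore", invalid="ignore"):
        sigma_b = (mu_total * omega - mu) ** 2 / denom
    sigma_b[denom == 0] = 0.0  # thresholds that leave one class empty

    return int(np.argmax(sigma_b))


def binarize(gray):
    """Map pixels above the Otsu threshold to 1 and the rest to 0."""
    t = otsu_threshold(gray)
    return (gray > t).astype(np.uint8)


if __name__ == "__main__":
    # Synthetic two-level image: dark left half, bright right half
    img = np.full((4, 8), 50, dtype=np.uint8)
    img[:, 4:] = 200
    print(otsu_threshold(img))
    print(binarize(img))
```

A single global threshold like this tends to work when the histogram is clearly bimodal, which is the text-only case the abstract describes. Documents that mix text with graphics and background stains typically violate that assumption, which is the motivation the abstract gives for a density-driven clustering approach.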

Metadata
Title
Information Density Based Image Binarization for Text Document Containing Graphics
Authors
Soma Datta
Nabendu Chaki
Sankhayan Choudhury
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-45378-1_10
