Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 1/2015

01.03.2015 | Original Paper

Skew detection and correction based on an axes-parallel bounding box

verfasst von: Mahnaz Shafii, Maher Sid-Ahmed

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 1/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Skew detection and correction of a scanned document is a preprocessing step for optical character recognition systems. We present a novel approach in skew detection and correction of a typed document by minimizing the area of the axis-parallel bounding box. Advantage of our approach over existing methods is that our algorithm is script and content independent. Moreover, our algorithm is not subject to skew angle limitations. The performance of our algorithm was evaluated using images of different scripts with varying skew angles with and without graphical images. Our experiments show that our algorithm outperforms existing state-of-the-art skew detection algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Postl, W.: Detection of linear oblique structures and skew scan in digitized documents. In: Proc. 8th International Conference on Pattern Recognition, pp. 687–689 (1986) Postl, W.: Detection of linear oblique structures and skew scan in digitized documents. In: Proc. 8th International Conference on Pattern Recognition, pp. 687–689 (1986)
2.
Zurück zum Zitat Baird, H.S.: The skew angle of printed documents. In: Proc. SPSE 40th Symposium Hybrid Imaging, Rochester, NY, pp. 739–743 (1987) Baird, H.S.: The skew angle of printed documents. In: Proc. SPSE 40th Symposium Hybrid Imaging, Rochester, NY, pp. 739–743 (1987)
3.
Zurück zum Zitat Ciardiello, G., Scafuro, G., Degrandi, M.T., et al.: An experimental system for office document handling and text recognition. In: Proc. 9th International Conference on Pattern Recognition, pp. 739–743 (1988) Ciardiello, G., Scafuro, G., Degrandi, M.T., et al.: An experimental system for office document handling and text recognition. In: Proc. 9th International Conference on Pattern Recognition, pp. 739–743 (1988)
4.
Zurück zum Zitat Ishitani, Y.: Document skew detection based on local region complexity. In: Proc. 2nd International on Document Analysis and Recognition, Japan, pp. 49–52 (1993) Ishitani, Y.: Document skew detection based on local region complexity. In: Proc. 2nd International on Document Analysis and Recognition, Japan, pp. 49–52 (1993)
5.
Zurück zum Zitat Bloomberg, D.S., Kopec, G.E., Dasari, L.: Measuring document image skew and orientation. In: Document Recognition II SPIE, vol. 2422, pp. 302–316 (1995) Bloomberg, D.S., Kopec, G.E., Dasari, L.: Measuring document image skew and orientation. In: Document Recognition II SPIE, vol. 2422, pp. 302–316 (1995)
6.
Zurück zum Zitat Grafmuller, M., Beyerer, J.: Performance improvement of character recognition in industrial applications using prior knowledge for more reliable segmentation. Expert Syst. Appl. 40(17), 6955–6963 (2013)CrossRef Grafmuller, M., Beyerer, J.: Performance improvement of character recognition in industrial applications using prior knowledge for more reliable segmentation. Expert Syst. Appl. 40(17), 6955–6963 (2013)CrossRef
7.
Zurück zum Zitat Duda, R.O., Hart, P.E.: Use of Hough transformation to detect lines and curves in pictures. Commun. ACM 15, 11–15 (1972)CrossRefMATH Duda, R.O., Hart, P.E.: Use of Hough transformation to detect lines and curves in pictures. Commun. ACM 15, 11–15 (1972)CrossRefMATH
8.
Zurück zum Zitat Hough, P.V.C.: Machine analysis of bubble chamber pictures. In: 2nd International Conference on High-Energy Accelerators, pp. 554–558 (1959) Hough, P.V.C.: Machine analysis of bubble chamber pictures. In: 2nd International Conference on High-Energy Accelerators, pp. 554–558 (1959)
9.
Zurück zum Zitat Billard, D.H.: Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit. 13, 111–122 (1981)CrossRef Billard, D.H.: Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit. 13, 111–122 (1981)CrossRef
10.
Zurück zum Zitat Illingworth, J., Kittler, J.: A survery of the Hough transforms. Comput. Graph. Image Process. 44, 87–116 (1988)CrossRef Illingworth, J., Kittler, J.: A survery of the Hough transforms. Comput. Graph. Image Process. 44, 87–116 (1988)CrossRef
11.
Zurück zum Zitat Hinds, S.C., Fisher, J.L., D’Amoto, D.P.: A document skew detection method using run-length encoding and Hough transform. In: Proc. 10th Int’l Conf. on Pattern Recognition, New York, pp. 464–468 (1990) Hinds, S.C., Fisher, J.L., D’Amoto, D.P.: A document skew detection method using run-length encoding and Hough transform. In: Proc. 10th Int’l Conf. on Pattern Recognition, New York, pp. 464–468 (1990)
12.
Zurück zum Zitat Yu, B., Jain, A.K.: A robust and fast skew detection algorithm for generic documents. Pattern Recognit. 29(10), 1599–1629 (1996)CrossRef Yu, B., Jain, A.K.: A robust and fast skew detection algorithm for generic documents. Pattern Recognit. 29(10), 1599–1629 (1996)CrossRef
13.
Zurück zum Zitat Srihari, S.N., Govindaraju, V.: Analysis of textual images using the Hough transform. Mach. Vis. Appl. 2, 141–153 (1989)CrossRef Srihari, S.N., Govindaraju, V.: Analysis of textual images using the Hough transform. Mach. Vis. Appl. 2, 141–153 (1989)CrossRef
14.
Zurück zum Zitat Manjunath, V.N., Kumar, G.H., Shivakumara, P.: Skew detection technique for binary document images based on Hough transform. Int. J. Technol. 13(3), 194–200 (2006) Manjunath, V.N., Kumar, G.H., Shivakumara, P.: Skew detection technique for binary document images based on Hough transform. Int. J. Technol. 13(3), 194–200 (2006)
15.
Zurück zum Zitat Le, D.S., Thoma, G.R.,Wechsler, H.: Automated page orientation and skew angle detection for binary document images. Pattern Recognit. 27(10), 1325–1344 (1994) Le, D.S., Thoma, G.R.,Wechsler, H.: Automated page orientation and skew angle detection for binary document images. Pattern Recognit. 27(10), 1325–1344 (1994)
16.
Zurück zum Zitat Singh, C., Bhatia, N., Kaur, A.: Hough transform based fast skew detection and accurate skew correction methods. Pattern Recognit. 41(12), 3528–3546 (2008) Singh, C., Bhatia, N., Kaur, A.: Hough transform based fast skew detection and accurate skew correction methods. Pattern Recognit. 41(12), 3528–3546 (2008)
17.
Zurück zum Zitat Hashizume, A., Yeh, P.-S., Rosenfeld, A.: A method of detecting the orientation of aligned components. Pattern Recognit. Lett. 4, 125–132 (1986) Hashizume, A., Yeh, P.-S., Rosenfeld, A.: A method of detecting the orientation of aligned components. Pattern Recognit. Lett. 4, 125–132 (1986)
18.
Zurück zum Zitat Peake, G.S., Tan, T.N.: A general algorithm for document skew angle estimation. In: International Conference on Image Processing, pp. 230–233 (1997) Peake, G.S., Tan, T.N.: A general algorithm for document skew angle estimation. In: International Conference on Image Processing, pp. 230–233 (1997)
19.
Zurück zum Zitat Dey, P., Noushath, S.: e-PCP: a robust skew detection method for scanned document images. Pattern Recognit. 43(3), 937–948 (2010)CrossRefMATH Dey, P., Noushath, S.: e-PCP: a robust skew detection method for scanned document images. Pattern Recognit. 43(3), 937–948 (2010)CrossRefMATH
20.
Zurück zum Zitat Egozi, A., Dinstein, I.: Statistical mixture model for documents skew angle estimation. Pattern Recognit. Lett. 32(14), 1912–1921 (2011) Egozi, A., Dinstein, I.: Statistical mixture model for documents skew angle estimation. Pattern Recognit. Lett. 32(14), 1912–1921 (2011)
21.
Zurück zum Zitat Papandreou, A., Gatos, B.: A novel skew detection technique based on vertical projections. In: Proceedings of ICDAR, pp. 384–388 (2011) Papandreou, A., Gatos, B.: A novel skew detection technique based on vertical projections. In: Proceedings of ICDAR, pp. 384–388 (2011)
22.
Zurück zum Zitat Li, S., Shen, Q., Sun, J.: Skew detection using wavelet decomposition and projection profile analysis. Pattern Recognit. Lett. 28(5), 555–562 (2007)CrossRefMathSciNet Li, S., Shen, Q., Sun, J.: Skew detection using wavelet decomposition and projection profile analysis. Pattern Recognit. Lett. 28(5), 555–562 (2007)CrossRefMathSciNet
23.
Zurück zum Zitat Makridis, M., Nikolaou, N., Papamarkos, N.: An adaptive technique for global and local skew correction in color documents. Expert Syst. Appl. 37, 6832–6843 (2010)CrossRef Makridis, M., Nikolaou, N., Papamarkos, N.: An adaptive technique for global and local skew correction in color documents. Expert Syst. Appl. 37, 6832–6843 (2010)CrossRef
24.
Zurück zum Zitat Safabakhsh, R., Khadivi, S.: Document skew detection using minimum-area bounding rectangle. In: Information Technology: Coding and Computing, pp. 253–258 (2000) Safabakhsh, R., Khadivi, S.: Document skew detection using minimum-area bounding rectangle. In: Information Technology: Coding and Computing, pp. 253–258 (2000)
25.
Zurück zum Zitat Rudak, P., Lee, Y., Morgan, T.: Method for determining skew angle and location of a document in an over-scanned image. US 7,027,666 B2 (2006) Rudak, P., Lee, Y., Morgan, T.: Method for determining skew angle and location of a document in an over-scanned image. US 7,027,666 B2 (2006)
26.
Zurück zum Zitat Avila, B.T., Lins, R.D.: Efficient removal of noisy borders from monochromatic documents. In: International Conference on Image Analysis and Recognition, Portugal, pp. 249–256 (2004) Avila, B.T., Lins, R.D.: Efficient removal of noisy borders from monochromatic documents. In: International Conference on Image Analysis and Recognition, Portugal, pp. 249–256 (2004)
27.
Zurück zum Zitat Le, D.X., et al.: Automated borders detection and adaptive segmentation for binary document images. In: 13th International Conference on Pattern Recognition, Austria, pp. 737–741 (1996) Le, D.X., et al.: Automated borders detection and adaptive segmentation for binary document images. In: 13th International Conference on Pattern Recognition, Austria, pp. 737–741 (1996)
28.
Zurück zum Zitat Fan, K.C., Wang, Y.K., Lay, T.R.: Marginal noise removal of document images. Pattern Recognit. 35, 2593–2611 (2002)CrossRefMATH Fan, K.C., Wang, Y.K., Lay, T.R.: Marginal noise removal of document images. Pattern Recognit. 35, 2593–2611 (2002)CrossRefMATH
Metadaten
Titel
Skew detection and correction based on an axes-parallel bounding box
verfasst von
Mahnaz Shafii
Maher Sid-Ahmed
Publikationsdatum
01.03.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 1/2015
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-014-0230-y

Weitere Artikel der Ausgabe 1/2015

International Journal on Document Analysis and Recognition (IJDAR) 1/2015 Zur Ausgabe