Skip to main content
Top

2020 | OriginalPaper | Chapter

An Unconstrained Rotation Invariant Approach for Document Skew Estimation and Correction

Authors : H. N. Balachandra, K. Sanjay Nayak, C. Chakradhar Reddy, T. Shreekanth, Shankaraiah

Published in: New Trends in Computational Vision and Bio-inspired Computing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The OCR technology is gaining more and more importance in the digitalization of the documents, this is because of its functionality to convert the text data in the image to the machine-encoded text and this machine-encoded text can be further used for processing. The orientation of the digitized document is important for the OCR to recognize the data in the document veraciously. Sometimes due to manual error, the scanned document may not be properly oriented, this condition is called skew of an image. Deskewing is a procedure to align the image properly, before further processing the data in the image. There are many existing approaches for deskewing the image such as mathematical morphology, principal of connected components, projection profile technique, Fourier transform, Hough transform, Radon transform and KL Transform. These methods for deskewing have their own constraints with respect to font style, font size and are not rotation invariant. In this paper, we propose a method which can deskew an image with any degree of skewness using warp-affine transform, Hough transform and feedback of the OCR output. The warp-affine transform is used for adjusting the shape of the background image, Hough transform is used for checking the vertical symmetry of the text and feedback from OCR is used for checking the skewness of 180° and flipped document cases. The proposed method was evaluated on 40 images with the various skew angle and the performance was comparable with the existing techniques in the literature.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Yuan, B. and Tan, C.L “Skew estimation for scanned documents from noises”. Eighth International Conference on Document Analysis and Recognition, 2005, pp. 277–281. Yuan, B. and Tan, C.L “Skew estimation for scanned documents from noises”. Eighth International Conference on Document Analysis and Recognition, 2005, pp. 277–281.
2.
go back to reference Fabrizio, J “A precise skew estimation algorithm for document images using KNN clustering and Fourier transform”. IEEE International Conference on Image Processing (ICIP), October 2014, pp. 2585–2588. Fabrizio, J “A precise skew estimation algorithm for document images using KNN clustering and Fourier transform”. IEEE International Conference on Image Processing (ICIP), October 2014, pp. 2585–2588.
3.
go back to reference Nandini, N., Srikanta Murthy, K. and Hemantha Kumar, G “Estimation of skew angle in binary document images using Hough transform”. International Scholarly and Scientific Research & Innovation, 2008, pp.44–49. Nandini, N., Srikanta Murthy, K. and Hemantha Kumar, G “Estimation of skew angle in binary document images using Hough transform”. International Scholarly and Scientific Research & Innovation, 2008, pp.44–49.
4.
go back to reference Sarfraz, M., Zidouri, A. and Shahab, S.A “A novel approach for skew estimation of document images in OCR system”. International Conference on Computer Graphics, Imaging and Vision: New Trends, IEEE July 2005, pp. 175–180. Sarfraz, M., Zidouri, A. and Shahab, S.A “A novel approach for skew estimation of document images in OCR system”. International Conference on Computer Graphics, Imaging and Vision: New Trends, IEEE July 2005, pp. 175–180.
5.
go back to reference Kaur, M. and Jindal S. “An integrated skew detection and correction using Fast Fourier transform and DCT”. International Journal of Scientific & Technology Research, Dec 2013, vol 2(12), pp.164–169. Kaur, M. and Jindal S. “An integrated skew detection and correction using Fast Fourier transform and DCT”. International Journal of Scientific & Technology Research, Dec 2013, vol 2(12), pp.164–169.
6.
go back to reference Srihari, S.N. and Govindaraju, V “Analysis of textual images using the Hough transform”. Machine vision and Applications, 1989,2(3), pp.141–153. Srihari, S.N. and Govindaraju, V “Analysis of textual images using the Hough transform”. Machine vision and Applications, 1989,2(3), pp.141–153.
7.
go back to reference Zhu, X. and Yin, X “A new textual/non-textual Recognition”, 2002. Proceedings. 16th International Conference on (Vol. 1, pp. 480–482). IEEE. 2002’Science, Engineering and Applications (IJCSEA) 3, no. 3(2013). Zhu, X. and Yin, X “A new textual/non-textual Recognition”, 2002. Proceedings. 16th International Conference on (Vol. 1, pp. 480–482). IEEE. 2002’Science, Engineering and Applications (IJCSEA) 3, no. 3(2013).
8.
go back to reference Nguyen, D.T, Nguyen, T.M. and Nguyen, T.G. “A robust document skew estimation algorithm using mathematical morphology”. IEEE, October 2007, pp.496–503. Nguyen, D.T, Nguyen, T.M. and Nguyen, T.G.A robust document skew estimation algorithm using mathematical morphology”. IEEE, October 2007, pp.496–503.
9.
go back to reference Sarfraz, M., Mahmoud S.A. and Rasheed, Z “On skew estimation and correction of text.” IEEE Conference on Computer Graphics, Imaging and Visualization (CGIV), August 2007 pp.308–313. Sarfraz, M., Mahmoud S.A. and Rasheed, Z “On skew estimation and correction of text.” IEEE Conference on Computer Graphics, Imaging and Visualization (CGIV), August 2007 pp.308–313.
10.
go back to reference Ju, Z. and Gu, G “Algorithm of document skew detection based on character vertices”. Intelligent Information Technology Application, (IITA) 2009, Vol. 2, pp. 23–26. Ju, Z. and Gu, G “Algorithm of document skew detection based on character vertices”. Intelligent Information Technology Application, (IITA) 2009, Vol. 2, pp. 23–26.
11.
go back to reference Arvind, K.R., Kumar, J. and Ramakrishnan, A.G “Entropy based skew correction of document images”, International conference on Pattern recognition and Machine Intelligence, Springer, Berlin, Heidelberg, December 2007, pp. 495–502. Arvind, K.R., Kumar, J. and Ramakrishnan, A.G “Entropy based skew correction of document images”, International conference on Pattern recognition and Machine Intelligence, Springer, Berlin, Heidelberg, December 2007, pp. 495–502.
12.
go back to reference Caprari, R.S “Algorithm for text page up/down orientation determination. Pattern Recognition Letters” 2000, 21(4), pp.311–317. Caprari, R.S “Algorithm for text page up/down orientation determination. Pattern Recognition Letters” 2000, 21(4), pp.311–317.
Metadata
Title
An Unconstrained Rotation Invariant Approach for Document Skew Estimation and Correction
Authors
H. N. Balachandra
K. Sanjay Nayak
C. Chakradhar Reddy
T. Shreekanth
Shankaraiah
Copyright Year
2020
Publisher
Springer International Publishing
DOI
https://doi.org/10.1007/978-3-030-41862-5_60

Premium Partner