Skip to main content
Top
Published in: International Journal on Document Analysis and Recognition (IJDAR) 2/2023

09-01-2023 | Original Paper

Adaptive dewarping of severely warped camera-captured document images based on document map generation

Authors: C. H. Nachappa, N. Shobha Rani, Peeta Basa Pati, M. Gokulnath

Published in: International Journal on Document Analysis and Recognition (IJDAR) | Issue 2/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Automated dewarping of camera-captured handwritten documents is a challenging research problem in Computer Vision and Pattern Recognition. Most available systems assume the shape of the camera-captured image boundaries to be anywhere between trapezoidal and octahedral, with linear distortion in areas between the boundaries for dewarping. The majority of the state-of-the-art applications successfully dewarp the simple-to-medium range geometrical distortions with partial selection of control points by a user. The proposed work implements a fully automated technique for control point detection from simple-to-complex geometrical distortions in camera-captured document images. The input image is subject to preprocessing, corner point detection, document map generation, and rendering of the de-warped document image. The proposed algorithm has been tested on five different camera-captured document datasets (one internal and four external publicly available) consisting of 958 images. Both quantitative and qualitative evaluations have been performed to test the efficacy of the proposed system. On the quantitative front, an Intersection Over Union (IoU) score of 0.92, 0.88, and 0.80 for document map generation for low-, medium-, and high-complexity datasets, respectively. Additionally, accuracies of the recognized texts, obtained from a market leading OCR engine, are utilized for quantitative comparative analysis on document images before and after the proposed enhancement. Finally, the qualitative analysis visually establishes the system’s reliability by demonstrating improved readability even for severely distorted image samples.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. IJDAR 7(2), 84–104 (2005)CrossRef Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. IJDAR 7(2), 84–104 (2005)CrossRef
2.
go back to reference Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef
3.
go back to reference Wolberg, G.: Digital image warping, vol. 10662, pp. 90720–91264. IEEE computer society press, Los Alamitos (1990) Wolberg, G.: Digital image warping, vol. 10662, pp. 90720–91264. IEEE computer society press, Los Alamitos (1990)
4.
go back to reference Heckbert, P.S.: Fundamentals of texture mapping and image warping (1989) Heckbert, P.S.: Fundamentals of texture mapping and image warping (1989)
5.
go back to reference Zhang, Z., Tan, C.L.: Correcting document image warping based on regression of curved text lines. In: Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings, pp. 589–593. IEEE (2003) Zhang, Z., Tan, C.L.: Correcting document image warping based on regression of curved text lines. In: Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings, pp. 589–593. IEEE (2003)
6.
go back to reference Wagdy, M., Amin, K., Ibrahim, M.: Dewarping document image techniques: survey and comparative study. Int. J. Image Graph. 21(03), 2150031 (2021)CrossRef Wagdy, M., Amin, K., Ibrahim, M.: Dewarping document image techniques: survey and comparative study. Int. J. Image Graph. 21(03), 2150031 (2021)CrossRef
7.
go back to reference Wu, C., Agam, G.: Document image de-warping for text/graphics recognition. In: Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), pp. 348–357. Springer, Berlin (2002) Wu, C., Agam, G.: Document image de-warping for text/graphics recognition. In: Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), pp. 348–357. Springer, Berlin (2002)
8.
go back to reference Kil, T., Seo, W., Koo, H.I., Cho, N.I.: Robust document image dewarping method using text-lines and line segments. In: 2017 14Th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 865–870. IEEE (2017) Kil, T., Seo, W., Koo, H.I., Cho, N.I.: Robust document image dewarping method using text-lines and line segments. In: 2017 14Th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 865–870. IEEE (2017)
9.
go back to reference Sutapirat, D., Sookhanaphibarn, K., Lursinsap, C.: Model-based book dewarping method for content-independent document images. In: 2009 International Conference on Digital Image Processing, pp. 190–194. IEEE 2009. Sutapirat, D., Sookhanaphibarn, K., Lursinsap, C.: Model-based book dewarping method for content-independent document images. In: 2009 International Conference on Digital Image Processing, pp. 190–194. IEEE 2009.
10.
go back to reference Dehbovid, H., Razzazi, F., Alirezaii, S.: A novel method for de-warping in Persian document images captured by cameras. In: 2010 International Conference on Computer Information Systems and Industrial Management Applications (CISIM), pp. 614–619. IEEE (2010) Dehbovid, H., Razzazi, F., Alirezaii, S.: A novel method for de-warping in Persian document images captured by cameras. In: 2010 International Conference on Computer Information Systems and Industrial Management Applications (CISIM), pp. 614–619. IEEE (2010)
11.
go back to reference Bukhari, S.S., Shafait, F., Breuel, T.M.: Coupled Snakelet model for curled textline segmentation of camera-captured document images. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 61–65. IEEE (2009) Bukhari, S.S., Shafait, F., Breuel, T.M.: Coupled Snakelet model for curled textline segmentation of camera-captured document images. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 61–65. IEEE (2009)
12.
go back to reference Bukhari, S.S., Shafait, F., Breuel, T.M.: Dewarping of document images using coupled-snakes. In: Proceedings of Third International Workshop on Camera-Based Document Analysis and Recognition, Barcelona, Spain, pp. 34–41 (2009) Bukhari, S.S., Shafait, F., Breuel, T.M.: Dewarping of document images using coupled-snakes. In: Proceedings of Third International Workshop on Camera-Based Document Analysis and Recognition, Barcelona, Spain, pp. 34–41 (2009)
13.
go back to reference Bukhari, S.S., Shafait, F., Breuel, T.M.: Segmentation of curled textlines using active contours. In: 2008 The Eighth IAPR International Workshop on Document Analysis Systems, pp. 270–277. IEEE (2008) Bukhari, S.S., Shafait, F., Breuel, T.M.: Segmentation of curled textlines using active contours. In: 2008 The Eighth IAPR International Workshop on Document Analysis Systems, pp. 270–277. IEEE (2008)
14.
go back to reference Stamatopoulos, N., Gatos, B., Pratikakis, I.: A methodology for document image dewarping techniques performance evaluation. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 956–960. IEEE (2009) Stamatopoulos, N., Gatos, B., Pratikakis, I.: A methodology for document image dewarping techniques performance evaluation. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 956–960. IEEE (2009)
15.
go back to reference Kim, B.S., Koo, H.I., Cho, N.I.: Document dewarping via text-line based optimization. Pattern Recogn. 48(11), 3600–3614 (2015)CrossRef Kim, B.S., Koo, H.I., Cho, N.I.: Document dewarping via text-line based optimization. Pattern Recogn. 48(11), 3600–3614 (2015)CrossRef
16.
go back to reference Song, L., Wu, Y., Sun, B.: A robust and fast dewarping method of document images. In: 2010 International Conference on E-Product E-Service and E-Entertainment, pp. 1–4. IEEE (2010) Song, L., Wu, Y., Sun, B.: A robust and fast dewarping method of document images. In: 2010 International Conference on E-Product E-Service and E-Entertainment, pp. 1–4. IEEE (2010)
17.
go back to reference Ramanna, V.K.B., Bukhari, S.S., Dengel, A.: Document image dewarping using deep learning. In: ICPRAM, pp. 524–531 (2019) Ramanna, V.K.B., Bukhari, S.S., Dengel, A.: Document image dewarping using deep learning. In: ICPRAM, pp. 524–531 (2019)
18.
go back to reference Garai, A., Biswas, S., Mandal, S.: A theoretical justification of warping generation for dewarping using CNN. Pattern Recogn. 109, 107621 (2021)CrossRef Garai, A., Biswas, S., Mandal, S.: A theoretical justification of warping generation for dewarping using CNN. Pattern Recogn. 109, 107621 (2021)CrossRef
19.
go back to reference Xie, G.W., Yin, F., Zhang, X.Y., Liu, C.L.: Dewarping document image by displacement flow estimation with fully convolutional network. In: International Workshop on Document Analysis Systems, pp. 131–144. Springer, Cham (2020) Xie, G.W., Yin, F., Zhang, X.Y., Liu, C.L.: Dewarping document image by displacement flow estimation with fully convolutional network. In: International Workshop on Document Analysis Systems, pp. 131–144. Springer, Cham (2020)
20.
go back to reference Ulges, A., Lampert, C.H., Breuel, T.M.: Document image dewarping using robust estimation of curled text lines. In: Eighth International Conference on Document Analysis and Recognition (ICDAR'05), pp. 1001–1005. IEEE (2005) Ulges, A., Lampert, C.H., Breuel, T.M.: Document image dewarping using robust estimation of curled text lines. In: Eighth International Conference on Document Analysis and Recognition (ICDAR'05), pp. 1001–1005. IEEE (2005)
21.
go back to reference Ezaki, H., Uchida, S., Asano, A., Sakoe, H.: Dewarping of document image by global optimization. In: Eighth International Conference on Document Analysis and Recognition (ICDAR'05), pp. 302–306. IEEE (2005) Ezaki, H., Uchida, S., Asano, A., Sakoe, H.: Dewarping of document image by global optimization. In: Eighth International Conference on Document Analysis and Recognition (ICDAR'05), pp. 302–306. IEEE (2005)
22.
go back to reference Kakumanu, P., Bourbakis, N., Black, J., Panchanathan, S.: Document image dewarping based on line estimation for visually impaired. In: 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06), pp. 625–631. IEEE (2006) Kakumanu, P., Bourbakis, N., Black, J., Panchanathan, S.: Document image dewarping based on line estimation for visually impaired. In: 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06), pp. 625–631. IEEE (2006)
23.
go back to reference Koo, H.I., Kim, J., Cho, N.I.: Composition of a dewarped and enhanced document image from two view images. IEEE Trans. Image Process. 18(7), 1551–1562 (2009)MathSciNetCrossRefMATH Koo, H.I., Kim, J., Cho, N.I.: Composition of a dewarped and enhanced document image from two view images. IEEE Trans. Image Process. 18(7), 1551–1562 (2009)MathSciNetCrossRefMATH
24.
go back to reference Ghods, A.R., Mozaffari, S., Ahmadpanahi, F.: Document image dewarping using Kinect depth sensor. In: 2013 21st Iranian Conference on Electrical Engineering (ICEE), pp. 1–6. IEEE (2013) Ghods, A.R., Mozaffari, S., Ahmadpanahi, F.: Document image dewarping using Kinect depth sensor. In: 2013 21st Iranian Conference on Electrical Engineering (ICEE), pp. 1–6. IEEE (2013)
25.
go back to reference Dasgupta, T., Das, N., Nasipuri, M.: Multistage curvilinear coordinate transform based document image dewarping using a novel quality estimator. arXiv preprint arXiv:2003.06872 (2020) Dasgupta, T., Das, N., Nasipuri, M.: Multistage curvilinear coordinate transform based document image dewarping using a novel quality estimator. arXiv preprint arXiv:​2003.​06872 (2020)
26.
go back to reference Simon, G., &Tabbone, S.: Generic document image dewarping by probabilistic discretization of vanishing points. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 2344–2351. IEEE (2021) Simon, G., &Tabbone, S.: Generic document image dewarping by probabilistic discretization of vanishing points. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 2344–2351. IEEE (2021)
27.
go back to reference Wagdy, M., Amin, K., Ibrahim, M.: Detection and correction of multi-warping document image. Int. J. Image Graph. 2250034 (2021) Wagdy, M., Amin, K., Ibrahim, M.: Detection and correction of multi-warping document image. Int. J. Image Graph. 2250034 (2021)
28.
go back to reference Labib, M.W., Amin, K.M.: Warped document image correction based on checkboard pattern and geometric transformation. IJCI Int. J. Comput. Inf. 8(1), 30–54 (2021) Labib, M.W., Amin, K.M.: Warped document image correction based on checkboard pattern and geometric transformation. IJCI Int. J. Comput. Inf. 8(1), 30–54 (2021)
29.
go back to reference Xie, G.W., Yin, F., Zhang, X.Y., Liu, C.L.: Document dewarping with control points. In: International conference on document analysis and recognition, pp. 466–480. Springer, Cham (2021) Xie, G.W., Yin, F., Zhang, X.Y., Liu, C.L.: Document dewarping with control points. In: International conference on document analysis and recognition, pp. 466–480. Springer, Cham (2021)
30.
go back to reference Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd Int. Workshop on Camera-Based Document Analysis and Recognition, Curitiba, Brazil, pp. 181–188 (2007) Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd Int. Workshop on Camera-Based Document Analysis and Recognition, Curitiba, Brazil, pp. 181–188 (2007)
31.
go back to reference Stamatopoulos, N., Gatos, B., Pratikakis, I.: Performance evaluation methodology for document image dewarping techniques. IET Image Proc. 6(6), 738–745 (2012)CrossRef Stamatopoulos, N., Gatos, B., Pratikakis, I.: Performance evaluation methodology for document image dewarping techniques. IET Image Proc. 6(6), 738–745 (2012)CrossRef
32.
go back to reference Garai, A., Dutta, A., Biswas, S.: Automatic dewarping of camera-captured comic document images. Multimed. Tools Appl. 1–16 (2022) Garai, A., Dutta, A., Biswas, S.: Automatic dewarping of camera-captured comic document images. Multimed. Tools Appl. 1–16 (2022)
33.
go back to reference Bolelli, F.: Indexing of historical document images: Ad hoc dewarping technique for handwritten text. In: Italian Research Conference on Digital Libraries, pp. 45–55. Springer, Cham (2017) Bolelli, F.: Indexing of historical document images: Ad hoc dewarping technique for handwritten text. In: Italian Research Conference on Digital Libraries, pp. 45–55. Springer, Cham (2017)
34.
go back to reference Rowtula, V., Bhargavan, V., Kumar, M., Jawahar, C.V.: Scaling handwritten student assessments with a document image workflow system. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 2307–2314 (2018) Rowtula, V., Bhargavan, V., Kumar, M., Jawahar, C.V.: Scaling handwritten student assessments with a document image workflow system. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 2307–2314 (2018)
35.
go back to reference Vinod, H.C., Niranjan, S.K.: Camera captured document de-warping and de-skewing. J. Comput. Theor. Nanosci. 17(9–10), 4398–4403 (2020)CrossRef Vinod, H.C., Niranjan, S.K.: Camera captured document de-warping and de-skewing. J. Comput. Theor. Nanosci. 17(9–10), 4398–4403 (2020)CrossRef
36.
go back to reference Garai, A., Biswas, S., Mandal, S., Chaudhuri, B.B.: Automatic rectification of warped Bangla document images. IET Image Process. 14(1), 74–83 (2020)CrossRef Garai, A., Biswas, S., Mandal, S., Chaudhuri, B.B.: Automatic rectification of warped Bangla document images. IET Image Process. 14(1), 74–83 (2020)CrossRef
37.
go back to reference Dulla, A.: A dataset of warped historical Arabic documents (2018) Dulla, A.: A dataset of warped historical Arabic documents (2018)
38.
go back to reference Feng, H., Wang, Y., Zhou, W., Deng, J., Li, H.: Doctr: Document image transformer for geometric unwarping and illumination correction. arXiv preprint arXiv:2110.12942 (2021) Feng, H., Wang, Y., Zhou, W., Deng, J., Li, H.: Doctr: Document image transformer for geometric unwarping and illumination correction. arXiv preprint arXiv:​2110.​12942 (2021)
39.
go back to reference Bandyopadhyay, H., Dasgupta, T., Das, N., Nasipuri, M.: A gated and bifurcated stacked U-net module for document image dewarping. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 10548–10554. IEEE (2021) Bandyopadhyay, H., Dasgupta, T., Das, N., Nasipuri, M.: A gated and bifurcated stacked U-net module for document image dewarping. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 10548–10554. IEEE (2021)
40.
go back to reference Ma, K., Shu, Z., Bai, X., Wang, J., Samaras, D.: Docunet: document image unwarping via a stacked U-net. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4709 (2018) Ma, K., Shu, Z., Bai, X., Wang, J., Samaras, D.: Docunet: document image unwarping via a stacked U-net. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4709 (2018)
41.
go back to reference Dutta, A., Garai, A., Biswas, S., Das, A.K.: Segmentation of text lines using multi-scale CNN from warped printed and handwritten document images. Int. J. Document Anal. Recognit. (IJDAR) 24(4), 299–313 (2021)CrossRef Dutta, A., Garai, A., Biswas, S., Das, A.K.: Segmentation of text lines using multi-scale CNN from warped printed and handwritten document images. Int. J. Document Anal. Recognit. (IJDAR) 24(4), 299–313 (2021)CrossRef
42.
go back to reference Souibgui, M.A., Biswas, S., Jemni, S.K., Kessentini, Y., Fornés, A., Lladós, J., Pal, U.: DocEnTr: an end-to-end document image enhancement transformer. arXiv preprint arXiv:2201.10252 (2022) Souibgui, M.A., Biswas, S., Jemni, S.K., Kessentini, Y., Fornés, A., Lladós, J., Pal, U.: DocEnTr: an end-to-end document image enhancement transformer. arXiv preprint arXiv:​2201.​10252 (2022)
43.
go back to reference Luisier, F., Blu, T., Unser, M.: Image denoising in mixed Poisson–Gaussian noise. IEEE Trans. Image Process. 20(3), 696–708 (2010)MathSciNetCrossRefMATH Luisier, F., Blu, T., Unser, M.: Image denoising in mixed Poisson–Gaussian noise. IEEE Trans. Image Process. 20(3), 696–708 (2010)MathSciNetCrossRefMATH
44.
go back to reference Chen, J., Benesty, J., Huang, Y., Doclo, S.: New insights into the noise reduction Wiener filter. IEEE Trans. Audio Speech Lang. Process. 14(4), 1218–1234 (2006)CrossRef Chen, J., Benesty, J., Huang, Y., Doclo, S.: New insights into the noise reduction Wiener filter. IEEE Trans. Audio Speech Lang. Process. 14(4), 1218–1234 (2006)CrossRef
45.
go back to reference Read, S.M.: The deconvolution problem: an overview. Proc. IEEE 74(1), 82–85 (1986)CrossRef Read, S.M.: The deconvolution problem: an overview. Proc. IEEE 74(1), 82–85 (1986)CrossRef
46.
go back to reference Jamil, N., Sembok, T.M.T., Bakar, Z.A.: Noise removal and enhancement of binary images using morphological operations. In: 2008 International Symposium on Information Technology, vol. 4, pp. 1–6. IEEE (2008) Jamil, N., Sembok, T.M.T., Bakar, Z.A.: Noise removal and enhancement of binary images using morphological operations. In: 2008 International Symposium on Information Technology, vol. 4, pp. 1–6. IEEE (2008)
47.
go back to reference Bradley, D., Roth, G.: Adaptive thresholding using the integral image. J. Graph. Tools 12(2), 13–21 (2007)CrossRef Bradley, D., Roth, G.: Adaptive thresholding using the integral image. J. Graph. Tools 12(2), 13–21 (2007)CrossRef
48.
go back to reference Meng, G., Pan, C., Xiang, S., Duan, J., Zheng, N.: Metric rectification of curved document images. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 707–722 (2011)CrossRef Meng, G., Pan, C., Xiang, S., Duan, J., Zheng, N.: Metric rectification of curved document images. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 707–722 (2011)CrossRef
51.
go back to reference Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd Int. Workshop on Camera-Based Document Analysis and Recognition, pp. 181–188 (2007) Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd Int. Workshop on Camera-Based Document Analysis and Recognition, pp. 181–188 (2007)
Metadata
Title
Adaptive dewarping of severely warped camera-captured document images based on document map generation
Authors
C. H. Nachappa
N. Shobha Rani
Peeta Basa Pati
M. Gokulnath
Publication date
09-01-2023
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Document Analysis and Recognition (IJDAR) / Issue 2/2023
Print ISSN: 1433-2833
Electronic ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-022-00425-4

Other articles of this Issue 2/2023

International Journal on Document Analysis and Recognition (IJDAR) 2/2023 Go to the issue

Premium Partner