Skip to main content
Top
Published in: Mobile Networks and Applications 4/2021

30-04-2021

ClusTi: Clustering Method for Table Structure Recognition in Scanned Images

Authors: Arthur Zucker, Younes Belkada, Hanh Vu, Van Nam Nguyen

Published in: Mobile Networks and Applications | Issue 4/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

OCR (Optical Character Recognition) for scanned paper invoices is very challenging due to the variability of 19 invoice layouts, different information fields, large data tables, and low scanning quality. In this case, table structure recognition is a critical task in which all rows, columns, and cells must be accurately positioned and extracted. Existing methods such as DeepDeSRT only dealt with high-quality born-digital images (e.g., PDF) with low noise and apparent table structure. This paper proposes an efficient method called CluSTi (Clustering method for recognition of the Structure of Tables in invoice scanned Images). The contributions of CluSTi are three-fold. Firstly, it removes heavy noises in the table images using a clustering algorithm. Secondly, it extracts all text boxes using state-of-the-art text recognition. Thirdly, based on the horizontal and vertical clustering algorithm with optimized parameters, CluSTi groups the text boxes into their correct rows and columns, respectively. The method was evaluated on three datasets: i) 397 public scanned images; ii) 193 PDF document images from ICDAR 2013 competition dataset; and iii) 281 PDF document images from ICDAR 2019’s numeric tables. The evaluation results showed that CluSTi achieved an F1-score of 87.5%, 98.5%, and 94.5%, respectively. Our method also outperformed DeepDeSRT with an F1-score of 91.44% on only 34 images from the ICDAR 2013 competition dataset. To the best of our knowledge, CluSTi is the first method to tackle the table structure recognition problem on scanned images.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
2.
go back to reference Agrawal R, Gehrke J, Gunopulos D, Raghavan P (1998) Automatic subspace clustering of high dimensional data for data mining applications. ACM Agrawal R, Gehrke J, Gunopulos D, Raghavan P (1998) Automatic subspace clustering of high dimensional data for data mining applications. ACM
3.
go back to reference Baek Y, Lee B, Han D, Yun S, Lee H (2019) Character region awareness for text detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 9365–9374 Baek Y, Lee B, Han D, Yun S, Lee H (2019) Character region awareness for text detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 9365–9374
4.
go back to reference Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2001) Support vector clustering. J Mach Learn Res 2(Dec):125–137MATH Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2001) Support vector clustering. J Mach Learn Res 2(Dec):125–137MATH
5.
go back to reference Clinchant S, Déjean H., Meunier JL, Lang EM, Kleber F (2018) Comparing machine learning approaches for table recognition in historical register books. In: 2018 13th IAPR International workshop on document analysis systems (DAS). IEEE, pp 133–138 Clinchant S, Déjean H., Meunier JL, Lang EM, Kleber F (2018) Comparing machine learning approaches for table recognition in historical register books. In: 2018 13th IAPR International workshop on document analysis systems (DAS). IEEE, pp 133–138
6.
go back to reference Deng D, Liu H, Li X, Cai D (2018) Pixellink: Detecting scene text via instance segmentation Thirty-Second AAAI conference on artificial intelligence Deng D, Liu H, Li X, Cai D (2018) Pixellink: Detecting scene text via instance segmentation Thirty-Second AAAI conference on artificial intelligence
7.
go back to reference Ester M, Kriegel HP, Sander J, Xu X, et al. (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, pp 226–231 Ester M, Kriegel HP, Sander J, Xu X, et al. (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, pp 226–231
8.
go back to reference Farahmand A, Sarrafzadeh H, Shanbehzadeh J (2013) Document image noises and removal methods. In: Proceedings of the international multiconference of engineers and computer scientists(Ed.) Newswood Ltd, pp 436–440 Farahmand A, Sarrafzadeh H, Shanbehzadeh J (2013) Document image noises and removal methods. In: Proceedings of the international multiconference of engineers and computer scientists(Ed.) Newswood Ltd, pp 436–440
9.
go back to reference Fields CR (2001) Probabilistic models for segmenting and labeling sequence data. In: ICML 2001 Fields CR (2001) Probabilistic models for segmenting and labeling sequence data. In: ICML 2001
10.
11.
go back to reference Hartigan JA, Wong MA (1979) Algorithm as 136: A k-means clustering algorithm. J R Stat Soc Ser C Appl Stat 28(1):100–108MATH Hartigan JA, Wong MA (1979) Algorithm as 136: A k-means clustering algorithm. J R Stat Soc Ser C Appl Stat 28(1):100–108MATH
12.
go back to reference He P, Huang W, He T, Zhu Q, Qiao Y, Li X (2017) Single shot text detector with regional attention. In: Proceedings of the IEEE international conference on computer vision, pp 3047–3055 He P, Huang W, He T, Zhu Q, Qiao Y, Li X (2017) Single shot text detector with regional attention. In: Proceedings of the IEEE international conference on computer vision, pp 3047–3055
13.
go back to reference He T, Tian Z, Huang W, Shen C, Qiao Y, Sun C (2018) An end-to-end textspotter with explicit alignment and attention. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5020–5029 He T, Tian Z, Huang W, Shen C, Qiao Y, Sun C (2018) An end-to-end textspotter with explicit alignment and attention. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5020–5029
14.
go back to reference He W, Zhang XY, Yin F, Liu CL (2017) Deep direct regression for multi-oriented scene text detection. In: Proceedings of the IEEE international conference on computer vision, pp 745–753 He W, Zhang XY, Yin F, Liu CL (2017) Deep direct regression for multi-oriented scene text detection. In: Proceedings of the IEEE international conference on computer vision, pp 745–753
15.
go back to reference Hu J, Kashi RS, Lopresti DP, Wilfong G (2000) Table structure recognition and its evaluation. In: Document recognition and retrieval VIII. International society for optics and photonics, vol 4307, pp 44–55 Hu J, Kashi RS, Lopresti DP, Wilfong G (2000) Table structure recognition and its evaluation. In: Document recognition and retrieval VIII. International society for optics and photonics, vol 4307, pp 44–55
16.
go back to reference Jain AK, Maheswari S (2012) Survey of recent clustering techniques in data mining. Int J Comput Sci Manag Res 3(2):68–75 Jain AK, Maheswari S (2012) Survey of recent clustering techniques in data mining. Int J Comput Sci Manag Res 3(2):68–75
17.
go back to reference Kboubi F, Chabi AH, Ahmed MB (2005) Table recognition evaluation and combination methods. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 1237–1241 Kboubi F, Chabi AH, Ahmed MB (2005) Table recognition evaluation and combination methods. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 1237–1241
18.
go back to reference Le Vine N, Zeigenfuse M, Rowan M (2019) Extracting tables from documents using conditional generative adversarial networks and genetic algorithms. In: 2019 international joint conference on neural networks (IJCNN). IEEE, pp 1–8 Le Vine N, Zeigenfuse M, Rowan M (2019) Extracting tables from documents using conditional generative adversarial networks and genetic algorithms. In: 2019 international joint conference on neural networks (IJCNN). IEEE, pp 1–8
19.
go back to reference Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision. Springer, pp 21–37 Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision. Springer, pp 21–37
20.
go back to reference Liu X, Liang D, Yan S, Chen D, Qiao Y, Yan J (2018) Fots: Fast oriented text spotting with a unified network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5676–5685 Liu X, Liang D, Yan S, Chen D, Qiao Y, Yan J (2018) Fots: Fast oriented text spotting with a unified network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5676–5685
21.
go back to reference Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440 Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
22.
go back to reference Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: The IEEE conference on computer vision and pattern recognition (CVPR) Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: The IEEE conference on computer vision and pattern recognition (CVPR)
23.
go back to reference Paliwal SS, Vishwanath D, Rahul R, Sharma M, Vig L (2019) Tablenet: Deep learning model for end-to-end table detection and tabular data extraction from scanned document images. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 128–133 Paliwal SS, Vishwanath D, Rahul R, Sharma M, Vig L (2019) Tablenet: Deep learning model for end-to-end table detection and tabular data extraction from scanned document images. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 128–133
24.
go back to reference Qasim SR, Mahmood H, Shafait F (2019) Rethinking table recognition using graph neural networks. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 142–147 Qasim SR, Mahmood H, Shafait F (2019) Rethinking table recognition using graph neural networks. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 142–147
25.
go back to reference Rajalingam N, Ranjini K (2011) Hierarchical clustering algorithm-a comparative study. Int J Comput Appl 19(3):42–46 Rajalingam N, Ranjini K (2011) Hierarchical clustering algorithm-a comparative study. Int J Comput Appl 19(3):42–46
26.
go back to reference Rani1Y, Rohil H (2013) A study of hierarchical clustering algorithm. ter S & on Te SIT-2, p 113 Rani1Y, Rohil H (2013) A study of hierarchical clustering algorithm. ter S & on Te SIT-2, p 113
27.
go back to reference Rashid SF, Akmal A, Adnan M, Aslam AA, Dengel A (2017) Table recognition in heterogeneous documents using machine learning. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 777–782 Rashid SF, Akmal A, Adnan M, Aslam AA, Dengel A (2017) Table recognition in heterogeneous documents using machine learning. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 777–782
28.
go back to reference Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99 Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
29.
go back to reference Reynolds AP, Richards G, de la Iglesia B, Rayward-Smith VJ (2006) Clustering rules: a comparison of partitioning and hierarchical clustering algorithms. J Math Model Algorithms 5(4):475–504MathSciNetCrossRef Reynolds AP, Richards G, de la Iglesia B, Rayward-Smith VJ (2006) Clustering rules: a comparison of partitioning and hierarchical clustering algorithms. J Math Model Algorithms 5(4):475–504MathSciNetCrossRef
30.
go back to reference Rokach L, Maimon O (2005) Clustering methods. In: Data mining and knowledge discovery handbook. Springer, pp 321–352 Rokach L, Maimon O (2005) Clustering methods. In: Data mining and knowledge discovery handbook. Springer, pp 321–352
31.
go back to reference Sasaki Y, et al. (2007) The truth of the f-measure. Teach Tutor mater 1(5):1–5 Sasaki Y, et al. (2007) The truth of the f-measure. Teach Tutor mater 1(5):1–5
32.
go back to reference Satopaa V, Albrecht J, Irwin D, Raghavan B (2011) Finding a “kneedle” in a haystack: Detecting knee points in system behavior. In: 2011 31st international conference on distributed computing systems workshops. IEEE, pp 166–171 Satopaa V, Albrecht J, Irwin D, Raghavan B (2011) Finding a “kneedle” in a haystack: Detecting knee points in system behavior. In: 2011 31st international conference on distributed computing systems workshops. IEEE, pp 166–171
33.
go back to reference Scholkmann F, Boss J, Wolf M (2012) An efficient algorithm for automatic peak detection in noisy periodic and quasi-periodic signals. Algorithms 5(4):588–603CrossRef Scholkmann F, Boss J, Wolf M (2012) An efficient algorithm for automatic peak detection in noisy periodic and quasi-periodic signals. Algorithms 5(4):588–603CrossRef
34.
go back to reference Schreiber S, Agne S, Wolf I, Dengel A, Ahmed S (2017) Deepdesrt: Deep learning for detection and structure recognition of tables in document images. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 1162–1167 Schreiber S, Agne S, Wolf I, Dengel A, Ahmed S (2017) Deepdesrt: Deep learning for detection and structure recognition of tables in document images. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 1162–1167
35.
go back to reference Soille P (2013) Morphological image analysis: principles and applications. Springer Science & Business Media Soille P (2013) Morphological image analysis: principles and applications. Springer Science & Business Media
36.
go back to reference Sudana O, Putra D, Sudarma M, Hartati RS, Wirdiani A (2018) Image clustering of complex balinese character with dbscan algorithm. J Eng Technol 6(1):548–558 Sudana O, Putra D, Sudarma M, Hartati RS, Wirdiani A (2018) Image clustering of complex balinese character with dbscan algorithm. J Eng Technol 6(1):548–558
37.
go back to reference Xu Q, Wang Q, Xu C, Qu L (2017) Collective vertex classification using recursive neural network. arXiv:1701.06751 Xu Q, Wang Q, Xu C, Qu L (2017) Collective vertex classification using recursive neural network. arXiv:1701.​06751
38.
go back to reference Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16 (3):645–678CrossRef Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16 (3):645–678CrossRef
Metadata
Title
ClusTi: Clustering Method for Table Structure Recognition in Scanned Images
Authors
Arthur Zucker
Younes Belkada
Hanh Vu
Van Nam Nguyen
Publication date
30-04-2021
Publisher
Springer US
Published in
Mobile Networks and Applications / Issue 4/2021
Print ISSN: 1383-469X
Electronic ISSN: 1572-8153
DOI
https://doi.org/10.1007/s11036-021-01759-9

Other articles of this Issue 4/2021

Mobile Networks and Applications 4/2021 Go to the issue