Skip to main content
Erschienen in: Mobile Networks and Applications 4/2021

30.04.2021

ClusTi: Clustering Method for Table Structure Recognition in Scanned Images

verfasst von: Arthur Zucker, Younes Belkada, Hanh Vu, Van Nam Nguyen

Erschienen in: Mobile Networks and Applications | Ausgabe 4/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

OCR (Optical Character Recognition) for scanned paper invoices is very challenging due to the variability of 19 invoice layouts, different information fields, large data tables, and low scanning quality. In this case, table structure recognition is a critical task in which all rows, columns, and cells must be accurately positioned and extracted. Existing methods such as DeepDeSRT only dealt with high-quality born-digital images (e.g., PDF) with low noise and apparent table structure. This paper proposes an efficient method called CluSTi (Clustering method for recognition of the Structure of Tables in invoice scanned Images). The contributions of CluSTi are three-fold. Firstly, it removes heavy noises in the table images using a clustering algorithm. Secondly, it extracts all text boxes using state-of-the-art text recognition. Thirdly, based on the horizontal and vertical clustering algorithm with optimized parameters, CluSTi groups the text boxes into their correct rows and columns, respectively. The method was evaluated on three datasets: i) 397 public scanned images; ii) 193 PDF document images from ICDAR 2013 competition dataset; and iii) 281 PDF document images from ICDAR 2019’s numeric tables. The evaluation results showed that CluSTi achieved an F1-score of 87.5%, 98.5%, and 94.5%, respectively. Our method also outperformed DeepDeSRT with an F1-score of 91.44% on only 34 images from the ICDAR 2013 competition dataset. To the best of our knowledge, CluSTi is the first method to tackle the table structure recognition problem on scanned images.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
2.
Zurück zum Zitat Agrawal R, Gehrke J, Gunopulos D, Raghavan P (1998) Automatic subspace clustering of high dimensional data for data mining applications. ACM Agrawal R, Gehrke J, Gunopulos D, Raghavan P (1998) Automatic subspace clustering of high dimensional data for data mining applications. ACM
3.
Zurück zum Zitat Baek Y, Lee B, Han D, Yun S, Lee H (2019) Character region awareness for text detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 9365–9374 Baek Y, Lee B, Han D, Yun S, Lee H (2019) Character region awareness for text detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 9365–9374
4.
Zurück zum Zitat Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2001) Support vector clustering. J Mach Learn Res 2(Dec):125–137MATH Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2001) Support vector clustering. J Mach Learn Res 2(Dec):125–137MATH
5.
Zurück zum Zitat Clinchant S, Déjean H., Meunier JL, Lang EM, Kleber F (2018) Comparing machine learning approaches for table recognition in historical register books. In: 2018 13th IAPR International workshop on document analysis systems (DAS). IEEE, pp 133–138 Clinchant S, Déjean H., Meunier JL, Lang EM, Kleber F (2018) Comparing machine learning approaches for table recognition in historical register books. In: 2018 13th IAPR International workshop on document analysis systems (DAS). IEEE, pp 133–138
6.
Zurück zum Zitat Deng D, Liu H, Li X, Cai D (2018) Pixellink: Detecting scene text via instance segmentation Thirty-Second AAAI conference on artificial intelligence Deng D, Liu H, Li X, Cai D (2018) Pixellink: Detecting scene text via instance segmentation Thirty-Second AAAI conference on artificial intelligence
7.
Zurück zum Zitat Ester M, Kriegel HP, Sander J, Xu X, et al. (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, pp 226–231 Ester M, Kriegel HP, Sander J, Xu X, et al. (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, pp 226–231
8.
Zurück zum Zitat Farahmand A, Sarrafzadeh H, Shanbehzadeh J (2013) Document image noises and removal methods. In: Proceedings of the international multiconference of engineers and computer scientists(Ed.) Newswood Ltd, pp 436–440 Farahmand A, Sarrafzadeh H, Shanbehzadeh J (2013) Document image noises and removal methods. In: Proceedings of the international multiconference of engineers and computer scientists(Ed.) Newswood Ltd, pp 436–440
9.
Zurück zum Zitat Fields CR (2001) Probabilistic models for segmenting and labeling sequence data. In: ICML 2001 Fields CR (2001) Probabilistic models for segmenting and labeling sequence data. In: ICML 2001
10.
11.
Zurück zum Zitat Hartigan JA, Wong MA (1979) Algorithm as 136: A k-means clustering algorithm. J R Stat Soc Ser C Appl Stat 28(1):100–108MATH Hartigan JA, Wong MA (1979) Algorithm as 136: A k-means clustering algorithm. J R Stat Soc Ser C Appl Stat 28(1):100–108MATH
12.
Zurück zum Zitat He P, Huang W, He T, Zhu Q, Qiao Y, Li X (2017) Single shot text detector with regional attention. In: Proceedings of the IEEE international conference on computer vision, pp 3047–3055 He P, Huang W, He T, Zhu Q, Qiao Y, Li X (2017) Single shot text detector with regional attention. In: Proceedings of the IEEE international conference on computer vision, pp 3047–3055
13.
Zurück zum Zitat He T, Tian Z, Huang W, Shen C, Qiao Y, Sun C (2018) An end-to-end textspotter with explicit alignment and attention. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5020–5029 He T, Tian Z, Huang W, Shen C, Qiao Y, Sun C (2018) An end-to-end textspotter with explicit alignment and attention. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5020–5029
14.
Zurück zum Zitat He W, Zhang XY, Yin F, Liu CL (2017) Deep direct regression for multi-oriented scene text detection. In: Proceedings of the IEEE international conference on computer vision, pp 745–753 He W, Zhang XY, Yin F, Liu CL (2017) Deep direct regression for multi-oriented scene text detection. In: Proceedings of the IEEE international conference on computer vision, pp 745–753
15.
Zurück zum Zitat Hu J, Kashi RS, Lopresti DP, Wilfong G (2000) Table structure recognition and its evaluation. In: Document recognition and retrieval VIII. International society for optics and photonics, vol 4307, pp 44–55 Hu J, Kashi RS, Lopresti DP, Wilfong G (2000) Table structure recognition and its evaluation. In: Document recognition and retrieval VIII. International society for optics and photonics, vol 4307, pp 44–55
16.
Zurück zum Zitat Jain AK, Maheswari S (2012) Survey of recent clustering techniques in data mining. Int J Comput Sci Manag Res 3(2):68–75 Jain AK, Maheswari S (2012) Survey of recent clustering techniques in data mining. Int J Comput Sci Manag Res 3(2):68–75
17.
Zurück zum Zitat Kboubi F, Chabi AH, Ahmed MB (2005) Table recognition evaluation and combination methods. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 1237–1241 Kboubi F, Chabi AH, Ahmed MB (2005) Table recognition evaluation and combination methods. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 1237–1241
18.
Zurück zum Zitat Le Vine N, Zeigenfuse M, Rowan M (2019) Extracting tables from documents using conditional generative adversarial networks and genetic algorithms. In: 2019 international joint conference on neural networks (IJCNN). IEEE, pp 1–8 Le Vine N, Zeigenfuse M, Rowan M (2019) Extracting tables from documents using conditional generative adversarial networks and genetic algorithms. In: 2019 international joint conference on neural networks (IJCNN). IEEE, pp 1–8
19.
Zurück zum Zitat Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision. Springer, pp 21–37 Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision. Springer, pp 21–37
20.
Zurück zum Zitat Liu X, Liang D, Yan S, Chen D, Qiao Y, Yan J (2018) Fots: Fast oriented text spotting with a unified network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5676–5685 Liu X, Liang D, Yan S, Chen D, Qiao Y, Yan J (2018) Fots: Fast oriented text spotting with a unified network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5676–5685
21.
Zurück zum Zitat Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440 Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
22.
Zurück zum Zitat Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: The IEEE conference on computer vision and pattern recognition (CVPR) Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: The IEEE conference on computer vision and pattern recognition (CVPR)
23.
Zurück zum Zitat Paliwal SS, Vishwanath D, Rahul R, Sharma M, Vig L (2019) Tablenet: Deep learning model for end-to-end table detection and tabular data extraction from scanned document images. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 128–133 Paliwal SS, Vishwanath D, Rahul R, Sharma M, Vig L (2019) Tablenet: Deep learning model for end-to-end table detection and tabular data extraction from scanned document images. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 128–133
24.
Zurück zum Zitat Qasim SR, Mahmood H, Shafait F (2019) Rethinking table recognition using graph neural networks. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 142–147 Qasim SR, Mahmood H, Shafait F (2019) Rethinking table recognition using graph neural networks. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 142–147
25.
Zurück zum Zitat Rajalingam N, Ranjini K (2011) Hierarchical clustering algorithm-a comparative study. Int J Comput Appl 19(3):42–46 Rajalingam N, Ranjini K (2011) Hierarchical clustering algorithm-a comparative study. Int J Comput Appl 19(3):42–46
26.
Zurück zum Zitat Rani1Y, Rohil H (2013) A study of hierarchical clustering algorithm. ter S & on Te SIT-2, p 113 Rani1Y, Rohil H (2013) A study of hierarchical clustering algorithm. ter S & on Te SIT-2, p 113
27.
Zurück zum Zitat Rashid SF, Akmal A, Adnan M, Aslam AA, Dengel A (2017) Table recognition in heterogeneous documents using machine learning. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 777–782 Rashid SF, Akmal A, Adnan M, Aslam AA, Dengel A (2017) Table recognition in heterogeneous documents using machine learning. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 777–782
28.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99 Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
29.
Zurück zum Zitat Reynolds AP, Richards G, de la Iglesia B, Rayward-Smith VJ (2006) Clustering rules: a comparison of partitioning and hierarchical clustering algorithms. J Math Model Algorithms 5(4):475–504MathSciNetCrossRef Reynolds AP, Richards G, de la Iglesia B, Rayward-Smith VJ (2006) Clustering rules: a comparison of partitioning and hierarchical clustering algorithms. J Math Model Algorithms 5(4):475–504MathSciNetCrossRef
30.
Zurück zum Zitat Rokach L, Maimon O (2005) Clustering methods. In: Data mining and knowledge discovery handbook. Springer, pp 321–352 Rokach L, Maimon O (2005) Clustering methods. In: Data mining and knowledge discovery handbook. Springer, pp 321–352
31.
Zurück zum Zitat Sasaki Y, et al. (2007) The truth of the f-measure. Teach Tutor mater 1(5):1–5 Sasaki Y, et al. (2007) The truth of the f-measure. Teach Tutor mater 1(5):1–5
32.
Zurück zum Zitat Satopaa V, Albrecht J, Irwin D, Raghavan B (2011) Finding a “kneedle” in a haystack: Detecting knee points in system behavior. In: 2011 31st international conference on distributed computing systems workshops. IEEE, pp 166–171 Satopaa V, Albrecht J, Irwin D, Raghavan B (2011) Finding a “kneedle” in a haystack: Detecting knee points in system behavior. In: 2011 31st international conference on distributed computing systems workshops. IEEE, pp 166–171
33.
Zurück zum Zitat Scholkmann F, Boss J, Wolf M (2012) An efficient algorithm for automatic peak detection in noisy periodic and quasi-periodic signals. Algorithms 5(4):588–603CrossRef Scholkmann F, Boss J, Wolf M (2012) An efficient algorithm for automatic peak detection in noisy periodic and quasi-periodic signals. Algorithms 5(4):588–603CrossRef
34.
Zurück zum Zitat Schreiber S, Agne S, Wolf I, Dengel A, Ahmed S (2017) Deepdesrt: Deep learning for detection and structure recognition of tables in document images. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 1162–1167 Schreiber S, Agne S, Wolf I, Dengel A, Ahmed S (2017) Deepdesrt: Deep learning for detection and structure recognition of tables in document images. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), vol 1. IEEE, pp 1162–1167
35.
Zurück zum Zitat Soille P (2013) Morphological image analysis: principles and applications. Springer Science & Business Media Soille P (2013) Morphological image analysis: principles and applications. Springer Science & Business Media
36.
Zurück zum Zitat Sudana O, Putra D, Sudarma M, Hartati RS, Wirdiani A (2018) Image clustering of complex balinese character with dbscan algorithm. J Eng Technol 6(1):548–558 Sudana O, Putra D, Sudarma M, Hartati RS, Wirdiani A (2018) Image clustering of complex balinese character with dbscan algorithm. J Eng Technol 6(1):548–558
37.
Zurück zum Zitat Xu Q, Wang Q, Xu C, Qu L (2017) Collective vertex classification using recursive neural network. arXiv:1701.06751 Xu Q, Wang Q, Xu C, Qu L (2017) Collective vertex classification using recursive neural network. arXiv:1701.​06751
38.
Zurück zum Zitat Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16 (3):645–678CrossRef Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16 (3):645–678CrossRef
Metadaten
Titel
ClusTi: Clustering Method for Table Structure Recognition in Scanned Images
verfasst von
Arthur Zucker
Younes Belkada
Hanh Vu
Van Nam Nguyen
Publikationsdatum
30.04.2021
Verlag
Springer US
Erschienen in
Mobile Networks and Applications / Ausgabe 4/2021
Print ISSN: 1383-469X
Elektronische ISSN: 1572-8153
DOI
https://doi.org/10.1007/s11036-021-01759-9

Weitere Artikel der Ausgabe 4/2021

Mobile Networks and Applications 4/2021 Zur Ausgabe

Neuer Inhalt