2019 | Original Paper | Book Chapter

Attributed Paths for Layout-Based Document Retrieval

Written by: Divya Sharma, Gaurav Harit, Chiranjoy Chattopadhyay

Published in: Document Analysis and Recognition

Publisher: Springer Singapore

Abstract

Documents are rich in layout, and the entities of interest can be scattered across the page. Traditional layout matching has modeled layout structure as grids, graphs, and spatial histograms of patches. In this paper, we propose a new layout representation, which we call attributed paths. This representation admits a match measure based on string edit distance. Our experiments show that layout-based retrieval using attributed paths is computationally efficient and more effective. It also offers flexibility in tuning the match criterion. We demonstrate the effectiveness of attributed paths in layout-based retrieval tasks on datasets of floor plan images [14] and journal pages [1].
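
The abstract does not spell out how an attributed path is encoded or which edit costs are used, so the following is only a minimal sketch of the general idea: a weighted string edit distance over sequences of attributed elements. The `Node` type, its `label` and `area` attributes, the cost weights, and the gap penalty are all assumptions made for illustration, not the authors' definitions.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Node:
    """One element on a path. Both attributes are hypothetical."""
    label: str    # illustrative region type, e.g. "title", "text", "figure"
    area: float   # illustrative normalised region area in [0, 1]


def substitution_cost(a: Node, b: Node, label_weight: float = 1.0,
                      area_weight: float = 1.0) -> float:
    """Cost of aligning node a with node b.

    The two weights are assumed tuning knobs: raising label_weight
    penalises type mismatches more, raising area_weight penalises
    size differences more.
    """
    label_term = 0.0 if a.label == b.label else label_weight
    area_term = area_weight * abs(a.area - b.area)
    return label_term + area_term


def edit_distance(p: Sequence[Node], q: Sequence[Node],
                  sub: Callable[[Node, Node], float] = substitution_cost,
                  gap: float = 1.0) -> float:
    """Weighted edit distance between two paths (standard dynamic program)."""
    rows, cols = len(p) + 1, len(q) + 1
    d = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        d[i][0] = i * gap          # delete the first i nodes of p
    for j in range(1, cols):
        d[0][j] = j * gap          # insert the first j nodes of q
    for i in range(1, rows):
        for j in range(1, cols):
            d[i][j] = min(
                d[i - 1][j] + gap,                          # deletion
                d[i][j - 1] + gap,                          # insertion
                d[i - 1][j - 1] + sub(p[i - 1], q[j - 1]),  # substitution
            )
    return d[-1][-1]


query = [Node("title", 0.1), Node("text", 0.5), Node("figure", 0.3)]
candidate = [Node("title", 0.1), Node("text", 0.5)]
print(edit_distance(query, candidate))  # one deletion of the figure node: 1.0
```

Under these assumptions, retrieval would amount to ranking database pages by this distance to the query path, and passing a different `sub` function or `gap` value is one way the match criterion could be tuned.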

References
2. Beusekom, J.V.: Diploma thesis: Document layout analysis. Image Understanding and Pattern Recognition Group, Department of Computer Science, pp. 1–67 (2006)
3. Cesarini, F., Lastri, M., Marinai, S., Soda, G.: Encoding of modified XY trees for document classification. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition, pp. 1131–1136. IEEE (2001)
4. Collins-Thompson, K., Nickolov, R.: A clustering-based algorithm for automatic document separation. In: SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping Meaning, Tampere, Finland (2002)
5. Gao, H., Rusinol, M., Karatzas, D., Lladós, J.: Fast structural matching for document image retrieval through spatial databases. In: DRR, p. 90210N (2014)
6. Gordo, A., Valveny, E.: A rotation invariant page layout descriptor for document classification and retrieval. In: 10th International Conference on Document Analysis and Recognition, ICDAR 2009, pp. 481–485. IEEE (2009)
7. Hu, J., Kashi, R., Wilfong, G.: Document image layout comparison and classification. In: Proceedings of the Fifth International Conference on Document Analysis and Recognition, ICDAR 1999, pp. 285–288. IEEE (1999)
8. Kin-Chung Au, O., Tai, C.L., Cohen-Or, D., Zheng, Y., Fu, H.: Electors voting for fast automatic shape correspondence. In: Computer Graphics Forum, vol. 29, pp. 645–654. Wiley Online Library (2010)
9. Kumar, J., Ye, P., Doermann, D.: Learning document structure for retrieval and classification. In: 2012 21st International Conference on Pattern Recognition (ICPR), pp. 1558–1561. IEEE (2012)
10. Marinai, S., Marino, E., Soda, G.: Layout based document image retrieval by means of XY tree reduction. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 432–436. IEEE (2005)
11. Sebastian, T.B., Klein, P.N., Kimia, B.B.: On aligning curves. IEEE Trans. Pattern Anal. Mach. Intell. 25(1), 116–125 (2003)
12. Sebastian, T.B., Klein, P.N., Kimia, B.B.: Recognition of shapes by editing their shock graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26(5), 550–571 (2004)
13. Sharma, D., Chattopadhyay, C., Harit, G.: A unified framework for semantic matching of architectural floorplans. In: ICPR (2016)
14. Sharma, D., Gupta, N., Chattopadhyay, C., Mehta, S.: DANIEL: a deep architecture for automatic analysis and retrieval of building floor plans. In: ICDAR (2017)
15. Tzacheva, A., El-Sonbaty, Y., El-Kwae, E.A.: Document image matching using a maximal grid approach. In: Proceedings of the SPIE, vol. 4670, p. 122 (2002)
16. Zhu, S.C., Yuille, A.L.: FORMS: a flexible object recognition and modelling system. Int. J. Comput. Vis. 20(3), 187–212 (1996)
Metadata
Title
Attributed Paths for Layout-Based Document Retrieval
Written by
Divya Sharma
Gaurav Harit
Chiranjoy Chattopadhyay
Copyright Year
2019
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-9361-7_2