Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 1/2015

01.03.2015 | Original Paper

Towards the interactive transcription of handwritings: anytime anywhere document analysis

verfasst von: Björn Gottfried, Marius Wegner, Mathias Lawo

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 1/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper introduces the anytime anywhere document analysis methodology applied in the context of computer-aided transcription. Its utility is revealed for documents which are difficult to analyse, as in the case of handwritten texts. A special focus lies on the glyph separation problem which turns out to be particularly complicated. As automatic methods show fundamental limitations, a number of interactive methods are proposed which are based on the interplay between user and machine. These methods get along without any assumptions concerning underlying languages or appearances of texts. An evaluation in the context of palaeography and applied to a well-established data set illustrates how well handwritings are dealt with, although they offer distinct differences in their regularity and shape.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Brockhaus: Graphem. In: Der neue Brockhaus, vol. 2, p. 436. F. A. Brockhaus Wiesbaden (1979) Brockhaus: Graphem. In: Der neue Brockhaus, vol. 2, p. 436. F. A. Brockhaus Wiesbaden (1979)
2.
Zurück zum Zitat Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 18(7), 690–706 (1996)CrossRef Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 18(7), 690–706 (1996)CrossRef
3.
Zurück zum Zitat Chazalon, J., Coüasnon, B.: Iterative analysis of document collections enables efficient human-initiated interaction. In: Viard-Gaudin, C., Zanibbi, R. (eds.) Document Recognition and Retrieval XIX—DRR 2012, 19th Document Recognition and Retrieval Conference, Part of the IS&T-SPIE Electronic Imaging Symposium, Burlingame, California, USA, 25 Jan 2012, Proceedings, SPIE Proceedings, vol. 8297. SPIE (2012) Chazalon, J., Coüasnon, B.: Iterative analysis of document collections enables efficient human-initiated interaction. In: Viard-Gaudin, C., Zanibbi, R. (eds.) Document Recognition and Retrieval XIX—DRR 2012, 19th Document Recognition and Retrieval Conference, Part of the IS&T-SPIE Electronic Imaging Symposium, Burlingame, California, USA, 25 Jan 2012, Proceedings, SPIE Proceedings, vol. 8297. SPIE (2012)
4.
Zurück zum Zitat Chazalon, J., Coüasnon, B., Lemaitre, A.: Iterative analysis of pages in document collections for efficient user interaction. In: International Conference on Document Analysis and Recognition, ICDAR 2011, Beijing, China, 18–21 Sept 2011, pp. 503–507. IEEE (2011) Chazalon, J., Coüasnon, B., Lemaitre, A.: Iterative analysis of pages in document collections for efficient user interaction. In: International Conference on Document Analysis and Recognition, ICDAR 2011, Beijing, China, 18–21 Sept 2011, pp. 503–507. IEEE (2011)
5.
Zurück zum Zitat Clavier, E., Masini, G., Delalandre, M., Rigamonti, M., Tombre, K., Gardes, J.: Docmining: a cooperative platform for heterogeneous document interpretation according to user-defined scenarios. In: Lladós, J., Kwon, Y.B. (eds.) Graphics Recognition, Recent Advances and Perspectives, 5th InternationalWorkshop, GREC 2003, Barcelona, Spain, 30–31 July 2003. Revised Selected Papers, Lecture Notes in Computer Science, vol. 3088, pp. 13–24. Springer (2004) Clavier, E., Masini, G., Delalandre, M., Rigamonti, M., Tombre, K., Gardes, J.: Docmining: a cooperative platform for heterogeneous document interpretation according to user-defined scenarios. In: Lladós, J., Kwon, Y.B. (eds.) Graphics Recognition, Recent Advances and Perspectives, 5th InternationalWorkshop, GREC 2003, Barcelona, Spain, 30–31 July 2003. Revised Selected Papers, Lecture Notes in Computer Science, vol. 3088, pp. 13–24. Springer (2004)
6.
Zurück zum Zitat Fischer, A., Frinken, V., Fornés, A., Bunke, H.: Transcription alignment of latin manuscripts using hidden markov models. In: Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. HIP ’11, pp. 29–36. ACM, New York, NY, USA (2011) Fischer, A., Frinken, V., Fornés, A., Bunke, H.: Transcription alignment of latin manuscripts using hidden markov models. In: Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. HIP ’11, pp. 29–36. ACM, New York, NY, USA (2011)
7.
Zurück zum Zitat Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Ground truth creation for handwriting recognition in historical documents. In: Doermann, D.S., Govindaraju, V., Lopresti, D.P., Natarajan, P. (eds.) The Ninth IAPR International Workshop on Document Analysis Systems, DAS 2010, 9–11 June 2010, Boston, Massachusetts, USA, ACM International Conference Proceeding Series, pp. 3–10. ACM (2010) Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Ground truth creation for handwriting recognition in historical documents. In: Doermann, D.S., Govindaraju, V., Lopresti, D.P., Natarajan, P. (eds.) The Ninth IAPR International Workshop on Document Analysis Systems, DAS 2010, 9–11 June 2010, Boston, Massachusetts, USA, ACM International Conference Proceeding Series, pp. 3–10. ACM (2010)
8.
Zurück zum Zitat Fischer, A., Keller, A., Frinken, V., Bunke, H.: Lexicon-free handwritten word spotting using character hmms. Pattern Recogn. Lett. 33(7), 934–942 (2012)CrossRef Fischer, A., Keller, A., Frinken, V., Bunke, H.: Lexicon-free handwritten word spotting using character hmms. Pattern Recogn. Lett. 33(7), 934–942 (2012)CrossRef
9.
Zurück zum Zitat Gottfried, B.: Qualitative similarity measures—the case of two-dimensional outlines. Comput. Vis. Image Underst. 110(1), 117–133 (2008)CrossRef Gottfried, B.: Qualitative similarity measures—the case of two-dimensional outlines. Comput. Vis. Image Underst. 110(1), 117–133 (2008)CrossRef
10.
Zurück zum Zitat Gottfried, B., Meyer-Lerbs, L.: Towards the processing of historic documents. In: Bernadi, R. (ed.) Advanced Technologies for Digital Libraries, LNCS, pp. 15–28. Springer, Berlin (2011)CrossRef Gottfried, B., Meyer-Lerbs, L.: Towards the processing of historic documents. In: Bernadi, R. (ed.) Advanced Technologies for Digital Libraries, LNCS, pp. 15–28. Springer, Berlin (2011)CrossRef
11.
Zurück zum Zitat He, L., Chao, Y., Suzuki, K.: A run-based two-scan labeling algorithm. IEEE Trans. Image Process. 17(5), 749–756 (2008)CrossRefMathSciNet He, L., Chao, Y., Suzuki, K.: A run-based two-scan labeling algorithm. IEEE Trans. Image Process. 17(5), 749–756 (2008)CrossRefMathSciNet
12.
Zurück zum Zitat Hofmeister, W., Hofmeister-Winter, A.: Schriftzüge unter der High-Tech-Lupe. Theoretische Grundlagen und erste praktische Ergebnisse des Grazer Pilotprojekts DAmalS. In: Internatiohnales Jahrbuch für Editionswissenschaft, vol. 22, pp. 90–117 (2008) Hofmeister, W., Hofmeister-Winter, A.: Schriftzüge unter der High-Tech-Lupe. Theoretische Grundlagen und erste praktische Ergebnisse des Grazer Pilotprojekts DAmalS. In: Internatiohnales Jahrbuch für Editionswissenschaft, vol. 22, pp. 90–117 (2008)
13.
Zurück zum Zitat Kansal, H., Sanyal, S., Gupta, D.: Dewarping and deskewing of a document using affine transformation. In: Ranchordas, A., Araújo, H. (eds.) VISAPP (2), pp. 73–78. INSTICC Press, Setúbal (2009) Kansal, H., Sanyal, S., Gupta, D.: Dewarping and deskewing of a document using affine transformation. In: Ranchordas, A., Araújo, H. (eds.) VISAPP (2), pp. 73–78. INSTICC Press, Setúbal (2009)
14.
Zurück zum Zitat Kim, G., Govindaraju, V., Srihari, S.N.: An architecture for handwritten text recognition systems. IJDAR 2(1), 37–44 (1999)CrossRef Kim, G., Govindaraju, V., Srihari, S.N.: An architecture for handwritten text recognition systems. IJDAR 2(1), 37–44 (1999)CrossRef
15.
Zurück zum Zitat Lebourgeois, F., Emptoz, H.: DEBORA: Digital AccEss to BOoks of the RenAissance. IJDAR 9(2–4), 193–221 (2007)CrossRef Lebourgeois, F., Emptoz, H.: DEBORA: Digital AccEss to BOoks of the RenAissance. IJDAR 9(2–4), 193–221 (2007)CrossRef
16.
Zurück zum Zitat Lowe, K.A.: From quill to t-pen: palaeography, editing and their e-futures. Lit. Compass 9(12), 1004–1009 (2012)CrossRef Lowe, K.A.: From quill to t-pen: palaeography, editing and their e-futures. Lit. Compass 9(12), 1004–1009 (2012)CrossRef
17.
Zurück zum Zitat Moalla, I., Lebourgeois, F., Emptoz, H., Alimi, A.M.: Contribution to the discrimination of the medieval manuscript texts: application in the palaeography. In: Bunke, H., Spitz, A.L. (eds.) Document Analysis Systems, LNCS, pp. 25–37. Springer, Berlin (2006)CrossRef Moalla, I., Lebourgeois, F., Emptoz, H., Alimi, A.M.: Contribution to the discrimination of the medieval manuscript texts: application in the palaeography. In: Bunke, H., Spitz, A.L. (eds.) Document Analysis Systems, LNCS, pp. 25–37. Springer, Berlin (2006)CrossRef
18.
Zurück zum Zitat Nagy, G.: Twenty years of document image analysis in pami. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 38–62 (2000)CrossRef Nagy, G.: Twenty years of document image analysis in pami. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 38–62 (2000)CrossRef
19.
Zurück zum Zitat Ouwayed, N., Belaïd, A.: A general approach for multi-oriented text line extraction of handwritten documents. IJDAR 15(4), 297–314 (2012)CrossRef Ouwayed, N., Belaïd, A.: A general approach for multi-oriented text line extraction of handwritten documents. IJDAR 15(4), 297–314 (2012)CrossRef
20.
Zurück zum Zitat Peck, A.: Beginning GIMP: From Novice to Professional. Apress Inc., New York (2006) Peck, A.: Beginning GIMP: From Novice to Professional. Apress Inc., New York (2006)
21.
Zurück zum Zitat Philipps 1870, fol. 11r and fol. 144r. Staatsbibliothek zu Berlin, Preußischer Kulturbesitz, Department of manuscripts, (c. 1100) Philipps 1870, fol. 11r and fol. 144r. Staatsbibliothek zu Berlin, Preußischer Kulturbesitz, Department of manuscripts, (c. 1100)
22.
Zurück zum Zitat Plötz, T., Fink, G.A.: Markov models for offline handwriting recognition: a survey. IJDAR 12(4), 269–298 (2009)CrossRef Plötz, T., Fink, G.A.: Markov models for offline handwriting recognition: a survey. IJDAR 12(4), 269–298 (2009)CrossRef
23.
Zurück zum Zitat Ramel, J.Y., Sidére, N., Rayar, F.: Interactive layout analysis, content extraction and transcription of historical printed books using pattern redundancy analysis. Lit. Linguist. Comput. 28(2), 301–314 (2013) Ramel, J.Y., Sidére, N., Rayar, F.: Interactive layout analysis, content extraction and transcription of historical printed books using pattern redundancy analysis. Lit. Linguist. Comput. 28(2), 301–314 (2013)
24.
Zurück zum Zitat Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E.: Computer assisted transcription for ancient text images. In: Kamel, M.S., Campilho, A.C. (eds.) ICIAR, LNCS, pp. 1182–1193. Springer, Berlin (2007) Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E.: Computer assisted transcription for ancient text images. In: Kamel, M.S., Campilho, A.C. (eds.) ICIAR, LNCS, pp. 1182–1193. Springer, Berlin (2007)
25.
Zurück zum Zitat Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2000)CrossRef Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2000)CrossRef
26.
Zurück zum Zitat Serrano, N., Gimnez, A., Civera, J., Sanchis, A., Juan, A.: Interactive handwriting recognition with limited user effort. IJDAR 17(1), 47–59 (2013) Serrano, N., Gimnez, A., Civera, J., Sanchis, A., Juan, A.: Interactive handwriting recognition with limited user effort. IJDAR 17(1), 47–59 (2013)
27.
Zurück zum Zitat Serrano, N., Tarazón, L., Pérez, D., Terrades, O.R., Juan, A.: The gidoc prototype. In: Fred, A.L.N. (ed.) Pattern Recognition in Information Systems, Proceedings of the 10th International Workshop on Pattern Recognition in Information Systems, PRIS 2010, In Conjunction with ICEIS 2010, Funchal, Madeira, Portugal, June 2010, pp. 82–89. SciTePress (2010) Serrano, N., Tarazón, L., Pérez, D., Terrades, O.R., Juan, A.: The gidoc prototype. In: Fred, A.L.N. (ed.) Pattern Recognition in Information Systems, Proceedings of the 10th International Workshop on Pattern Recognition in Information Systems, PRIS 2010, In Conjunction with ICEIS 2010, Funchal, Madeira, Portugal, June 2010, pp. 82–89. SciTePress (2010)
28.
Zurück zum Zitat Smith, R.: A simple and efficient skew detection algorithm via text row accumulation. In: ICDAR, pp. 1145–1148. IEEE Computer Society (1995) Smith, R.: A simple and efficient skew detection algorithm via text row accumulation. In: ICDAR, pp. 1145–1148. IEEE Computer Society (1995)
29.
Zurück zum Zitat Worch, J.H., Lawo, M., Gottfried, B.: Glyph spotting for mediaeval handwritings by template matching. In: Concolato, C., Schmitz, P. (eds.) ACM Symposium on Document Engineering, pp. 213–216. ACM, New York (2012) Worch, J.H., Lawo, M., Gottfried, B.: Glyph spotting for mediaeval handwritings by template matching. In: Concolato, C., Schmitz, P. (eds.) ACM Symposium on Document Engineering, pp. 213–216. ACM, New York (2012)
30.
Zurück zum Zitat Wüthrich, M., Liwicki, M., Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Language model integration for the recognition of handwritten medieval documents. In: ICDAR, pp. 211–215. IEEE Computer Society (2009) Wüthrich, M., Liwicki, M., Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Language model integration for the recognition of handwritten medieval documents. In: ICDAR, pp. 211–215. IEEE Computer Society (2009)
31.
Zurück zum Zitat Yacoub, S.M., Saxena, V., Sami, S.N.: Perfectdoc: A ground truthing environment for complex documents. In: Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 Aug–1 Sept 2005, Seoul, Korea, pp. 452–457. IEEE Computer Society (2005) Yacoub, S.M., Saxena, V., Sami, S.N.: Perfectdoc: A ground truthing environment for complex documents. In: Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 Aug–1 Sept 2005, Seoul, Korea, pp. 452–457. IEEE Computer Society (2005)
Metadaten
Titel
Towards the interactive transcription of handwritings: anytime anywhere document analysis
verfasst von
Björn Gottfried
Marius Wegner
Mathias Lawo
Publikationsdatum
01.03.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 1/2015
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-014-0234-7

Weitere Artikel der Ausgabe 1/2015

International Journal on Document Analysis and Recognition (IJDAR) 1/2015 Zur Ausgabe