Skip to main content
Top

2011 | OriginalPaper | Chapter

5. Active Interaction and Learning in Handwritten Text Transcription

Authors : Dr. Alejandro Héctor Toselli, Dr. Enrique Vidal, Prof. Francisco Casacuberta

Published in: Multimodal Interactive Pattern Recognition and Applications

Publisher: Springer London

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Computer-assisted systems are being increasingly used in a variety of real-world tasks, though their application to handwritten text transcription in old manuscripts remains largely unexplored. The basic idea explored in this chapter is to follow a sequential, line-by-line transcription of the whole manuscript in which a continuously retrained system interacts with the user to efficiently transcribe each new line. User interaction is expensive in terms of time and cost. Our top priority is to take advantage of these interactions, while trying to reduce them as most as possible.
To this end, we study three different frameworks: (a) improve a recognition system from newly recognized transcriptions via adaptation techniques, using semi-supervised learning techniques; (b) study how to best adapt from limited user supervisions, which is related to active learning; and (c) develop a simple error estimate, which is used to let the user adjust the error in a computer-assisted transcription task. In addition, we test these approaches in the sequential transcription of two old text documents.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bertolami, R., & Bunke, H. (2008). Hidden Markov model-based ensemble methods for offline handwritten text line recognition. Pattern Recognition 41, 3452–3460. MATHCrossRef Bertolami, R., & Bunke, H. (2008). Hidden Markov model-based ensemble methods for offline handwritten text line recognition. Pattern Recognition 41, 3452–3460. MATHCrossRef
2.
go back to reference Kristjannson, T., Culotta, A., Viola, P., & McCallum, A. (2004). Interactive information extraction with constrained conditional random fields. In Proceedings of the 19th national conference on artificial intelligence (AAAI 2004) (pp. 412–418), San Jose, CA, USA. Kristjannson, T., Culotta, A., Viola, P., & McCallum, A. (2004). Interactive information extraction with constrained conditional random fields. In Proceedings of the 19th national conference on artificial intelligence (AAAI 2004) (pp. 412–418), San Jose, CA, USA.
3.
go back to reference Le Bourgeois, F., & Emptoz, H. (2007). DEBORA: Digital AccEss to BOoks of the RenAissance. International Journal on Document Analysis and Recognition, 9, 193–221. CrossRef Le Bourgeois, F., & Emptoz, H. (2007). DEBORA: Digital AccEss to BOoks of the RenAissance. International Journal on Document Analysis and Recognition, 9, 193–221. CrossRef
4.
go back to reference Likforman-Sulem, L., Zahour, A., & Taconet, B. (2007). Text line segmentation of historical documents: a survey. International Journal on Document Analysis and Recognition, 9, 123–138. CrossRef Likforman-Sulem, L., Zahour, A., & Taconet, B. (2007). Text line segmentation of historical documents: a survey. International Journal on Document Analysis and Recognition, 9, 123–138. CrossRef
5.
go back to reference Pérez, D., Tarazón, L., Serrano, N., Castro, F., Ramos-Terrades, O., & Juan, A. (2009). The GERMANA database. In Proceedings of the 10th international conference on document analysis and recognition (ICDAR 2009) (pp. 301–305), Barcelona, Spain. CrossRef Pérez, D., Tarazón, L., Serrano, N., Castro, F., Ramos-Terrades, O., & Juan, A. (2009). The GERMANA database. In Proceedings of the 10th international conference on document analysis and recognition (ICDAR 2009) (pp. 301–305), Barcelona, Spain. CrossRef
6.
go back to reference Plötz, T., & Fink, G. A. (2009). Markov models for offline handwriting recognition: a survey. International Journal on Document Analysis and Recognition, 12, 269–298. CrossRef Plötz, T., & Fink, G. A. (2009). Markov models for offline handwriting recognition: a survey. International Journal on Document Analysis and Recognition, 12, 269–298. CrossRef
7.
go back to reference Serrano, N., Pérez, D., Sanchis, A., & Juan, A. (2009). Adaptation from partially supervised handwritten text transcriptions. In Proceedings of the 11th international conference on multimodal interfaces and the 6th workshop on machine learning for multimodal interaction (ICMI-MLMI 2009) (pp. 289–292), Cambridge, MA, USA. CrossRef Serrano, N., Pérez, D., Sanchis, A., & Juan, A. (2009). Adaptation from partially supervised handwritten text transcriptions. In Proceedings of the 11th international conference on multimodal interfaces and the 6th workshop on machine learning for multimodal interaction (ICMI-MLMI 2009) (pp. 289–292), Cambridge, MA, USA. CrossRef
8.
go back to reference Serrano, N., Castro, F., & Juan, A. (2010). The RODRIGO database. In Proceedings of the 7th international conference on language resources and evaluation (LREC 2010) (pp. 2709–2712), Valleta, Malta. Serrano, N., Castro, F., & Juan, A. (2010). The RODRIGO database. In Proceedings of the 7th international conference on language resources and evaluation (LREC 2010) (pp. 2709–2712), Valleta, Malta.
9.
go back to reference Settles, B. (2009). Active learning literature survey (Computer Sciences Technical Report No. 1648). University of Wisconsin-Madison. Settles, B. (2009). Active learning literature survey (Computer Sciences Technical Report No. 1648). University of Wisconsin-Madison.
10.
go back to reference Tarazón, L., Pérez, D., Serrano, N., Alabau, V., Ramos-Terrades, O., Sanchis, A., & Juan, A. (2009). Confidence measures for error correction in interactive transcription of handwritten text. In Proceedings of the 15th international conference on image analysis and processing (ICIAP 2009) (pp. 567–574), Vietri sul Mare, Italy. Tarazón, L., Pérez, D., Serrano, N., Alabau, V., Ramos-Terrades, O., Sanchis, A., & Juan, A. (2009). Confidence measures for error correction in interactive transcription of handwritten text. In Proceedings of the 15th international conference on image analysis and processing (ICIAP 2009) (pp. 567–574), Vietri sul Mare, Italy.
11.
go back to reference Wessel, F., & Ney, H. (2005). Unsupervised training of acoustic models for large vocabulary continuous speech recognition. IEEE Transactions on Speech and Audio Processing, 13(1), 23–31. CrossRef Wessel, F., & Ney, H. (2005). Unsupervised training of acoustic models for large vocabulary continuous speech recognition. IEEE Transactions on Speech and Audio Processing, 13(1), 23–31. CrossRef
Metadata
Title
Active Interaction and Learning in Handwritten Text Transcription
Authors
Dr. Alejandro Héctor Toselli
Dr. Enrique Vidal
Prof. Francisco Casacuberta
Copyright Year
2011
Publisher
Springer London
DOI
https://doi.org/10.1007/978-0-85729-479-1_5