2019 | OriginalPaper | Book chapter

Modern vs Diplomatic Transcripts for Historical Handwritten Text Recognition

Authors: Verónica Romero, Alejandro H. Toselli, Enrique Vidal, Joan Andreu Sánchez, Carlos Alonso, Lourdes Marqués

Published in: New Trends in Image Analysis and Processing – ICIAP 2019

Publisher: Springer International Publishing

Abstract

Transcribing handwritten documents makes their contents accessible to the general public. So far, however, automatic transcription of historical documents has mostly focused on producing diplomatic transcripts, even though such transcripts are often understandable only by experts. The main difficulties come from the heavy use of extremely abridged and tangled abbreviations and of archaic or outdated word forms. Here we study different approaches to training optical models that recognize historical document images containing archaic and abbreviated handwritten text and produce modernized transcripts with expanded abbreviations. Experiments comparing the performance of the proposed approaches are carried out on a document collection related to Spanish naval commerce during the 15th to 19th centuries, which includes extremely difficult handwritten text images.
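
To make the distinction concrete, the following minimal Python sketch shows how a diplomatic transcript (abbreviations kept as written) differs from a modernized one (abbreviations expanded) for the same text line. It is only an illustration of the two kinds of target text: the chapter trains optical models to produce modernized output, and nothing here represents the authors' method. The table `EXPANSIONS`, the function `modernize`, and the sample line are hypothetical and are not taken from the chapter or its data set.

```python
# Hypothetical abbreviation table for illustration only;
# the entries are assumptions, not data from the chapter.
EXPANSIONS = {
    "dho": "dicho",
    "q": "que",
    "nro": "nuestro",
}


def modernize(diplomatic: str) -> str:
    """Expand known abbreviations token by token; other tokens are left unchanged."""
    return " ".join(EXPANSIONS.get(token, token) for token in diplomatic.split())


# The same (hypothetical) text line under both transcription conventions.
diplomatic_line = "el dho navio q salio"
modern_line = modernize(diplomatic_line)

print("diplomatic:", diplomatic_line)
print("modern:    ", modern_line)
```

A simple lookup like this cannot resolve ambiguous abbreviations or modernize archaic spellings, which may be one reason to learn the mapping from images to modernized text instead.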

Metadata
Title
Modern vs Diplomatic Transcripts for Historical Handwritten Text Recognition
Authors
Verónica Romero
Alejandro H. Toselli
Enrique Vidal
Joan Andreu Sánchez
Carlos Alonso
Lourdes Marqués
Copyright year
2019
DOI
https://doi.org/10.1007/978-3-030-30754-7_11