Skip to main content

2011 | OriginalPaper | Buchkapitel

2. Computer Assisted Transcription: General Framework

verfasst von : Dr. Alejandro Héctor Toselli, Dr. Enrique Vidal, Prof. Francisco Casacuberta

Erschienen in: Multimodal Interactive Pattern Recognition and Applications

Verlag: Springer London

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter described the common basics on which are grounded the computer assisted transcription approaches described in the three subsequent chapters: Chaps. 3, 4 and 5. Besides, a general overview is provided of the common features characterizing the up-to-date systems we have employed for handwritten text and speech recognition.
Specific mathematical formulation and modeling adequate for interactive transcription of handwritten text images and speech signals are derived from a particular instantiation of the interactive–predictive general framework already introduced in Sect. 1.​3.​3. Moreover, on this ground and by adopting the passive left-to-right interaction protocol described in Sect. 1.​4.​2, the two basic computer assisted handwriting and speech transcription approaches were developed (detailed in Chaps. 3 and 4, respectively), along with the evaluation measures used to assess their performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Barrachina, S., Bender, O., Casacuberta, F., Civera, J., Cubel, E., Khadivi, S., Ney, A. L. H., Tomás, J., & Vidal, E. (2009). Statistical approaches to computer-assisted translation. Computational Linguistics, 35(1), 3–28. MathSciNetCrossRef Barrachina, S., Bender, O., Casacuberta, F., Civera, J., Cubel, E., Khadivi, S., Ney, A. L. H., Tomás, J., & Vidal, E. (2009). Statistical approaches to computer-assisted translation. Computational Linguistics, 35(1), 3–28. MathSciNetCrossRef
2.
Zurück zum Zitat Jelinek, F. (1998). Statistical methods for speech recognition. Cambridge: MIT Press. Jelinek, F. (1998). Statistical methods for speech recognition. Cambridge: MIT Press.
3.
Zurück zum Zitat Katz, S. M. (1987). Estimation of probabilities from sparse data for the language model component of a speech recognizer. I.E.E.E. Transactions on Acoustics, Speech, and Signal Processing, ASSP-35, 400–401. CrossRef Katz, S. M. (1987). Estimation of probabilities from sparse data for the language model component of a speech recognizer. I.E.E.E. Transactions on Acoustics, Speech, and Signal Processing, ASSP-35, 400–401. CrossRef
4.
Zurück zum Zitat Kneser, R., & Ney, H. (1995). Improved backing-off for n-gram language modeling. In Proceedings of the international conference on acoustics, speech and signal processing (ICASSP) (Vol. 1, pp. 181–184). Kneser, R., & Ney, H. (1995). Improved backing-off for n-gram language modeling. In Proceedings of the international conference on acoustics, speech and signal processing (ICASSP) (Vol. 1, pp. 181–184).
5.
Zurück zum Zitat Liu, P., & Soong, F. K. (2006). Word graph based speech recognition error correction by handwriting input. In Proceedings of the international conference on multimodal interfaces (ICMI’06) (pp. 339–346), New York, NY, USA. New York: ACM. Liu, P., & Soong, F. K. (2006). Word graph based speech recognition error correction by handwriting input. In Proceedings of the international conference on multimodal interfaces (ICMI’06) (pp. 339–346), New York, NY, USA. New York: ACM.
6.
Zurück zum Zitat Serrano, N., Sanchis, A., & Juan, A. (2010). Balancing error and supervision effort in interactive–predictive handwritten text recognition. In Proceedings of the international conference on intelligent user interfaces (IUI’10) (pp. 373–376), Hong Kong, China. Serrano, N., Sanchis, A., & Juan, A. (2010). Balancing error and supervision effort in interactive–predictive handwritten text recognition. In Proceedings of the international conference on intelligent user interfaces (IUI’10) (pp. 373–376), Hong Kong, China.
Metadaten
Titel
Computer Assisted Transcription: General Framework
verfasst von
Dr. Alejandro Héctor Toselli
Dr. Enrique Vidal
Prof. Francisco Casacuberta
Copyright-Jahr
2011
Verlag
Springer London
DOI
https://doi.org/10.1007/978-0-85729-479-1_2

Neuer Inhalt