Skip to main content
Top

2016 | OriginalPaper | Chapter

Support System for Lecture Captioning Using Keyword Detection by Automatic Speech Recognition

Authors : Naofumi Ikeda, Yoshinori Takeuchi, Tetsuya Matsumoto, Hiroaki Kudo, Noboru Ohnishi

Published in: Computers Helping People with Special Needs

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We propose a support system for lecture captioning. The system can detect the keywords of a lecture and present them to captionists. The captionists can understand what an instructor said even when they cannot understand the keywords, and can input keywords rapidly by pressing the corresponding function key. The system detects the keywords by automatic speech recognition (ASR). To improve the detection rate of keywords, we adapt the language model of ASR using web documents. We collect 2,700 web documents, which include 1.2 million words and 5,800 sentences. We conducted an experiment to detect keywords of a real lecture and showed that the system can achieve higher F-measure of 0.957 than that of a base language model (0.871).

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Miyoshi, S., Kawano, S., Nishioka, T., Kato, N., Shirasawa, M., Murakami, H., Minagawa, H., Ishihara, Y., Naito, I., Wakatsuki, D., Kuroki, H., Kobayashi, M.: A basic study on supplementary visual information for real-time captionists in the lecture of information science. IEICE Trans. Inf. Syst. (Japanese edition) J91(D(9)), 2236–2246 (2008) Miyoshi, S., Kawano, S., Nishioka, T., Kato, N., Shirasawa, M., Murakami, H., Minagawa, H., Ishihara, Y., Naito, I., Wakatsuki, D., Kuroki, H., Kobayashi, M.: A basic study on supplementary visual information for real-time captionists in the lecture of information science. IEICE Trans. Inf. Syst. (Japanese edition) J91(D(9)), 2236–2246 (2008)
2.
go back to reference Kato, N., Kawano, S., Kuroki, H., Murakami, H., Nishioka, T., Wakatsuki, D., Minagawa, H., Shionome, T., Miyoshi, S., Shirasawa, M., Ishihara, Y.: Basic Study of Keyword Presentation System for Hearing Impaired Students. IEICE Technical Report ET2007-81 107(462), pp. 71–76 (2008). (in Japanese) Kato, N., Kawano, S., Kuroki, H., Murakami, H., Nishioka, T., Wakatsuki, D., Minagawa, H., Shionome, T., Miyoshi, S., Shirasawa, M., Ishihara, Y.: Basic Study of Keyword Presentation System for Hearing Impaired Students. IEICE Technical Report ET2007-81 107(462), pp. 71–76 (2008). (in Japanese)
3.
go back to reference Kawahara, T.: Recent progress of spontaneous speech recognition deployment in parliament and applications to lectures. J. Multimed. Educ. Res. 9(1), S1–S8 (2012). (in Japanese) Kawahara, T.: Recent progress of spontaneous speech recognition deployment in parliament and applications to lectures. J. Multimed. Educ. Res. 9(1), S1–S8 (2012). (in Japanese)
4.
go back to reference Miyoshi, S., Kuroki, H., Kawano, S., Shirasawa, M., Ishihara, Y., Kobayashi, M.: Support technique for real-time captionist to use speech recognition software. In: Miesenberger, K., Klaus, J., Zagler, W.L., Karshmer, A.I. (eds.) ICCHP 2008. LNCS, vol. 5105, pp. 647–650. Springer, Heidelberg (2008)CrossRef Miyoshi, S., Kuroki, H., Kawano, S., Shirasawa, M., Ishihara, Y., Kobayashi, M.: Support technique for real-time captionist to use speech recognition software. In: Miesenberger, K., Klaus, J., Zagler, W.L., Karshmer, A.I. (eds.) ICCHP 2008. LNCS, vol. 5105, pp. 647–650. Springer, Heidelberg (2008)CrossRef
5.
go back to reference Miyoshi, S., Kuroki, H., Kawano, S., Shirasawa, M., Ishihara, Y., Kobayashi, M.: Support Technique for Real-Time Captionist to Use Speech Recognition Software. Tsukuba University of Technology Techno Report 14, pp. 145–151 (2007). (in Japanese) Miyoshi, S., Kuroki, H., Kawano, S., Shirasawa, M., Ishihara, Y., Kobayashi, M.: Support Technique for Real-Time Captionist to Use Speech Recognition Software. Tsukuba University of Technology Techno Report 14, pp. 145–151 (2007). (in Japanese)
6.
go back to reference Munteanu, C., Penn, G., Beacker, R.: Web-based language modelling for automatic lecture transcription. In: Proceedings of 8th Annual Conference of the International Speech Communication Association, no. ThD.P3a-2, pp. 2353–2356 (2007) Munteanu, C., Penn, G., Beacker, R.: Web-based language modelling for automatic lecture transcription. In: Proceedings of 8th Annual Conference of the International Speech Communication Association, no. ThD.P3a-2, pp. 2353–2356 (2007)
7.
go back to reference Kawahara, T., Nemoto, Y., Akita, Y.: Automatic lecture transcription by exploiting presentation slide information for language model adaptation. In: Proceedings of ICASSP, pp. 4929–4932 (2008). (in Japanese) Kawahara, T., Nemoto, Y., Akita, Y.: Automatic lecture transcription by exploiting presentation slide information for language model adaptation. In: Proceedings of ICASSP, pp. 4929–4932 (2008). (in Japanese)
8.
go back to reference Furui, S.: Recent advances in spontaneous speech recognition and understanding. In: Proceedings of ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition, pp. 1–6 (2003). (in Japanese) Furui, S.: Recent advances in spontaneous speech recognition and understanding. In: Proceedings of ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition, pp. 1–6 (2003). (in Japanese)
10.
go back to reference Stolcke, A.: SRILM – An extensible language modeling toolkit. In: Proceedings of ICSLP (2002) Stolcke, A.: SRILM – An extensible language modeling toolkit. In: Proceedings of ICSLP (2002)
Metadata
Title
Support System for Lecture Captioning Using Keyword Detection by Automatic Speech Recognition
Authors
Naofumi Ikeda
Yoshinori Takeuchi
Tetsuya Matsumoto
Hiroaki Kudo
Noboru Ohnishi
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-41267-2_53