
2018 | OriginalPaper | Chapter

Implementation of Automatic Captioning System to Enhance the Accessibility of Meetings

Authors: Kosei Fume, Taira Ashikawa, Nayuko Watanabe, Hiroshi Fujimura

Published in: Computers Helping People with Special Needs

Publisher: Springer International Publishing


Abstract

For hearing-impaired people who need to follow meetings, expectations are growing for real-time captioning based on speech recognition technology as an information accessibility tool that can replace manual handwritten summaries. However, it is still difficult to provide automatic closed captioning that maintains a practical level of accuracy across different speakers and content. We therefore developed a web-based real-time closed captioning system that is easy to use in face-to-face conferences, lectures, forums, and similar settings, refining it through trials and feedback from hearing-impaired people in the company. This report outlines the system and presents the results of a simple evaluation conducted inside and outside the company.


Metadata

Copyright Year: 2018

DOI: https://doi.org/10.1007/978-3-319-94277-3_31