Skip to main content
Top

2022 | OriginalPaper | Chapter

Comparing the Accuracy of ACE and WER Caption Metrics When Applied to Live Television Captioning

Authors : Tian Wells, Dylan Christoffels, Christian Vogler, Raja Kushalnagar

Published in: Computers Helping People with Special Needs

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The development of caption metrics is relatively new in the accessibility research community. However, little work has been done comparing the effectiveness of newly developed caption metrics. More specifically, in low accuracy settings such as live television, where users report the most difficulty using captions. Through a user study with fifteen participants, we compared two caption metrics systems, Word Error Rate (WER) and Automated-Caption Evaluation (ACE), for their accuracy in evaluating caption quality in live television. We compared human-perceived quality statistics with each caption metric’s data. Analysis of the correlation between human statistics and each caption metric found that WER had a slightly higher correlation with participants. We found that ACE was more sensitive to errors that WER, and penalized captions more than participants. However, the difference in performance between WER and ACE was not statistically significant, and neither WER nor ACE are optimized for use with live television captioning. Future work should explore how caption metrics could be better optimized for use with live television.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Jang, P.J., Hauptmann, A.G.: Improving acoustic models with captioned multimedia speech. In: Proceedings IEEE International Conference on Multimedia Computing and Systems, vol. 2, pp. 767–771. IEEE, June 1999 Jang, P.J., Hauptmann, A.G.: Improving acoustic models with captioned multimedia speech. In: Proceedings IEEE International Conference on Multimedia Computing and Systems, vol. 2, pp. 767–771. IEEE, June 1999
2.
go back to reference Block, M.H., Okrand, M.: Real-time closed-captioned television as an educational tool. Am. Ann. Deaf 128(5), 636–641 (1983) Block, M.H., Okrand, M.: Real-time closed-captioned television as an educational tool. Am. Ann. Deaf 128(5), 636–641 (1983)
3.
go back to reference Apone, T., Brooks, M., O’Connell, T.: Caption Accuracy Metrics Project. Caption Viewer Survey: Error Ranking of Real-time Captions in Live Television News Programs. Boston (2010) Apone, T., Brooks, M., O’Connell, T.: Caption Accuracy Metrics Project. Caption Viewer Survey: Error Ranking of Real-time Captions in Live Television News Programs. Boston (2010)
4.
go back to reference Al Amin, A.: Audio-Visual Caption Evaluation Metric for People who are Deaf and Hard of Hearing (2020) Al Amin, A.: Audio-Visual Caption Evaluation Metric for People who are Deaf and Hard of Hearing (2020)
5.
go back to reference Apone, T., Botkin, B., Brooks, M., Goldberg, L.: Research into Automated Error Ranking of Real-time Captions in Live Television News Programs. The Carl and Ruth Shapiro Family National Center for Accessible Media at WGBH (NCAM) (2011) Apone, T., Botkin, B., Brooks, M., Goldberg, L.: Research into Automated Error Ranking of Real-time Captions in Live Television News Programs. The Carl and Ruth Shapiro Family National Center for Accessible Media at WGBH (NCAM) (2011)
6.
go back to reference Kafle, S., Huenerfauth, M.: Evaluating the usability of automatically generated captions for people who are deaf or hard of hearing. In: Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility, pp. 165–174, October 2017 Kafle, S., Huenerfauth, M.: Evaluating the usability of automatically generated captions for people who are deaf or hard of hearing. In: Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility, pp. 165–174, October 2017
Metadata
Title
Comparing the Accuracy of ACE and WER Caption Metrics When Applied to Live Television Captioning
Authors
Tian Wells
Dylan Christoffels
Christian Vogler
Raja Kushalnagar
Copyright Year
2022
DOI
https://doi.org/10.1007/978-3-031-08648-9_61