ABSTRACT
With the increase in exchange programs, many international students worldwide can face communication problems. During lectures, usually English language is used for the communication between students and teachers. However both sides, not necessarily being native speakers of English, may misunderstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocessing phase. Secondly, finding the document part corresponding to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correction by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.
- M. González, J. Moreno, J. L. Martínez, and P. Martínez. An Illustrated Methodology for Evaluating ASR Systems, pages 33--42. Springer Berlin Heidelberg, Berlin, Heidelberg, 2013. Google ScholarDigital Library
- J. R. Perez-Aguera, J. Arroyo, J. Greenberg, J. P. Iglesias, and V. Fresno. Using bm25f for semantic search. In Proceedings of the 3rd International Semantic Search Workshop, pages 2:1--2:8, 2010. Google ScholarDigital Library
- A. Rozovskaya. Automated methods for correcting errors in grammar and usage. Master's thesis, University of Illinois.Google Scholar
- S. K. Tessai Hayama, Hidetsugu Nanba. Alignment between a technical paper and presentation sheets using a hidden markov model. In Proceedings of the 2005 International Conference on Active Media Technology, 2005. (AMT 2005), pages 102--106, 2005.Google ScholarCross Ref
Index Terms
- Non-native English speakers' speech correction, based on domain focused document
Recommendations
English lexical stress produced by native (L1) Bengali speakers compared to native (L1) English speakers: an acoustic study
English lexical stress is acoustically related to combination of duration, intensity, fundamental frequency (F0) and vowel quality. Errors in any or all of these correlates could interfere with production of the stress contrast, but it is unknown which ...
Phonetic realization of English lexical stress by native (L1) Bengali speakers compared to native (L1) English speakers
Compares the use of acoustic correlates in the production of English lexical stress by English and Bengali speakers.Bengali speakers able to approximate English like patterns of duration, intensity, F0 of stressed and unstressed vowels.It is very ...
The impact of non-native English speakers’ phonological and prosodic features on automatic speech recognition accuracy
Highlights- Higher speech intensity and lower speech rates improve automatic speech recognition accuracy.
- Arab ESL teachers and students give more attention to pronunciation errors that do not affect intelligibility.
- Arabic-influenced ESL ...
AbstractThe present study examines the impact of Arab speakers’ phonological and prosodic features on the accuracy of automatic speech recognition (ASR) of non-native English speech. The authors first investigated the perceptions of 30 Egyptian ESL ...
Comments