Skip to main content
Top
Published in: International Journal of Speech Technology 2/2015

01-06-2015

Automatic articulation error detection tool for Punjabi language with aid for hearing impaired people

Authors: Shailendra Singh, Anshul Thakur, Dharam Vir

Published in: International Journal of Speech Technology | Issue 2/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Articulation errors are common in children but may also persist in adulthood. These errors decrease the speech intelligibility and hamper the ability of a person to communicate properly. A speech language pathologist carefully examines the speech to identify the articulation error patterns. It is exhaustive and time consuming process. This information about the patterns of articulation disorders is then used in treatment. In this paper, a noble approach has been presented to automatically detect the articulation error patterns in Punjabi speech. To provide support for hearing impaired people, a picture naming task has been developed. The proposed tool processes recorded spoken words and outputs the articulation error patterns i.e. substitution and omission of particular phonemes and their position in the spoken word. The basic principles of speech word recognition are used to establish the articulation errors in speech samples. The rigorous experimentation has shown that the proposed tool is accurate in identifying the articulation error patterns in controlled environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Baig, M., Masud, S., & Awais, M. (2006). Support vector machine based voice activity detection. In International Symposium on Intelligent Signal Processing Communications, Yonago (pp. 319–322). Baig, M., Masud, S., & Awais, M. (2006). Support vector machine based voice activity detection. In International Symposium on Intelligent Signal Processing Communications, Yonago (pp. 319–322).
go back to reference Bardidehl, M., Razzazii, F., & Ghassemian, H. (2007). An SVM based confidence measure for continuous speech recognition. In International Conference on Signal Processing Communications, Dubai (pp. 1015–1018). Bardidehl, M., Razzazii, F., & Ghassemian, H. (2007). An SVM based confidence measure for continuous speech recognition. In International Conference on Signal Processing Communications, Dubai (pp. 1015–1018).
go back to reference Bauman-Waengler, J. (2004). Articulatory and phonological impairments: A clinical focus (2nd ed.). Boston, MA: Pearson Publications. Bauman-Waengler, J. (2004). Articulatory and phonological impairments: A clinical focus (2nd ed.). Boston, MA: Pearson Publications.
go back to reference Bernthal, J. E., Bankson, N. W., & Flipsel, P. (2008). Articulation and phonological disorders (6th ed.). Boston, MA: Pearson Publications. Bernthal, J. E., Bankson, N. W., & Flipsel, P. (2008). Articulation and phonological disorders (6th ed.). Boston, MA: Pearson Publications.
go back to reference Chang, C., & Lin, C. (2011). LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2, 1–27.CrossRef Chang, C., & Lin, C. (2011). LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2, 1–27.CrossRef
go back to reference Chaudhari, U. V., & Picheny, M. (2009). Articulatory feature detection with support vector machines for integration into ASR and phone recognition (pp. 93–98). Merano: IEEE Work in Automatic Speech Recognition and Understanding, IEEE. Chaudhari, U. V., & Picheny, M. (2009). Articulatory feature detection with support vector machines for integration into ASR and phone recognition (pp. 93–98). Merano: IEEE Work in Automatic Speech Recognition and Understanding, IEEE.
go back to reference Chen, Y.-J. (2011). Identification of articulation error patterns using a novel dependence network. IEEE Transactions on Biomedical Engineering, 58, 3061–3068.CrossRef Chen, Y.-J. (2011). Identification of articulation error patterns using a novel dependence network. IEEE Transactions on Biomedical Engineering, 58, 3061–3068.CrossRef
go back to reference Cohen, I. (2004). Speech enhancement using a noncausal a priori SNR estimator. Signal Processing Letters, 11, 725–728.CrossRef Cohen, I. (2004). Speech enhancement using a noncausal a priori SNR estimator. Signal Processing Letters, 11, 725–728.CrossRef
go back to reference Dinz, P. S., Da Silva, E. A., & Netto, S. (2010). Digital signal processing: System analysis and design (2nd ed.). Cambridge: Cambridge University Press.CrossRef Dinz, P. S., Da Silva, E. A., & Netto, S. (2010). Digital signal processing: System analysis and design (2nd ed.). Cambridge: Cambridge University Press.CrossRef
go back to reference Elkins, A. C. (2011). Vocalic markers of deception and cognitive dissonance for automated emotion detection systems. Tuscon (AZ): The University of Arizona. Elkins, A. C. (2011). Vocalic markers of deception and cognitive dissonance for automated emotion detection systems. Tuscon (AZ): The University of Arizona.
go back to reference Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean-square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech and Signal Processing, 33, 443–445.CrossRef Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean-square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech and Signal Processing, 33, 443–445.CrossRef
go back to reference Ganapathiraju, A., Hamaker, J. E., & Picone, J. (2004). Applications of support vector machines to speech recognition. IEEE Transactions on Signal Processing, 52, 2348–2355.CrossRef Ganapathiraju, A., Hamaker, J. E., & Picone, J. (2004). Applications of support vector machines to speech recognition. IEEE Transactions on Signal Processing, 52, 2348–2355.CrossRef
go back to reference Georgoulas, G., Georgopoulos, V. C., & Stylios, C. D. (2006). Speech sound classification and detection of articulation disorders with support vector machines and wavelets. In 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, New York, NY (pp 2199–2202). Georgoulas, G., Georgopoulos, V. C., & Stylios, C. D. (2006). Speech sound classification and detection of articulation disorders with support vector machines and wavelets. In 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, New York, NY (pp 2199–2202).
go back to reference Hu, Y., & Loizou, P. C. (2006). Subjective comparison and evaluation of speech enhancement algorithms. In IEEE International Conference on Acoustics, Speech and Signal Processing, Toulouse (pp. I153–I156). Hu, Y., & Loizou, P. C. (2006). Subjective comparison and evaluation of speech enhancement algorithms. In IEEE International Conference on Acoustics, Speech and Signal Processing, Toulouse (pp. I153–I156).
go back to reference Kaklauskas, A., Vlasenko, A., Seniut, M., & Krutinis, M. (2009). Voice stress analyser system for E-testing. Riga: Ninth IEEE International Conference on Advanced Learning Technologies, IEEE. Kaklauskas, A., Vlasenko, A., Seniut, M., & Krutinis, M. (2009). Voice stress analyser system for E-testing. Riga: Ninth IEEE International Conference on Advanced Learning Technologies, IEEE.
go back to reference Muda, L., Begam, M., & Elamvazuthi, I. (2010). Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques. Journal of Computing, 2, 138–143. Muda, L., Begam, M., & Elamvazuthi, I. (2010). Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques. Journal of Computing, 2, 138–143.
go back to reference Natarajan, V. A., & Jothilakshmi, S. (2012). Segmentation of continuous speech into consonant and vowel units using formant frequencies. International Journal of Computer Applications, 56, 24–27.CrossRef Natarajan, V. A., & Jothilakshmi, S. (2012). Segmentation of continuous speech into consonant and vowel units using formant frequencies. International Journal of Computer Applications, 56, 24–27.CrossRef
go back to reference Patel, K., & Prasad, R. K. (2013). Speech recognition and verification using MFCC. International Journal of Emerging Science and Engineering, 1, 33–37. Patel, K., & Prasad, R. K. (2013). Speech recognition and verification using MFCC. International Journal of Emerging Science and Engineering, 1, 33–37.
go back to reference Rickie, A. D. (2006). Automatic frame length, frame overlap and hidden Markov model for speech recognition of animal vocalizations. Milwaukee, WI: Marquette University. Rickie, A. D. (2006). Automatic frame length, frame overlap and hidden Markov model for speech recognition of animal vocalizations. Milwaukee, WI: Marquette University.
go back to reference Schuller, B., Rigoll, G., & Lang, M. (2004) Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In IEEE Interantional Conference on Acoustics, Speech, and Signal Processing (pp. 577–580). Schuller, B., Rigoll, G., & Lang, M. (2004) Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In IEEE Interantional Conference on Acoustics, Speech, and Signal Processing (pp. 577–580).
go back to reference Sonkamble, B. A., & Doye, D. D. (2008). An overview of speech recognition system based on the support vector machines. In International Conference on Computer and Communication Engineering, IEEE, Kuala Lumpur (pp. 768–771). Sonkamble, B. A., & Doye, D. D. (2008). An overview of speech recognition system based on the support vector machines. In International Conference on Computer and Communication Engineering, IEEE, Kuala Lumpur (pp. 768–771).
go back to reference Vir, D. (2010). Punjabi articulation test. Patiala, India: Punjabi University. Vir, D. (2010). Punjabi articulation test. Patiala, India: Punjabi University.
Metadata
Title
Automatic articulation error detection tool for Punjabi language with aid for hearing impaired people
Authors
Shailendra Singh
Anshul Thakur
Dharam Vir
Publication date
01-06-2015
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 2/2015
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-014-9256-2

Other articles of this Issue 2/2015

International Journal of Speech Technology 2/2015 Go to the issue