Skip to main content
Erschienen in: Medical & Biological Engineering & Computing 7/2011

01.07.2011 | Original Article

Voiceless Arabic vowels recognition using facial EMG

verfasst von: Luay Fraiwan, Khaldon Lweesy, Ayat Al-Nemrawi, Sondos Addabass, Rasha Saifan

Erschienen in: Medical & Biological Engineering & Computing | Ausgabe 7/2011

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This work attempts to recognize the Arabic vowels based on facial electromyograph (EMG) signals, to be used for people with speech impairment and for human computer interface. Vowels were selected since they are the most difficult letters to recognize by people in Arabic language. Twenty subjects (7 females and 13 males) were asked to pronounce three Arabic vowels continuously in a random order. Facial EMG signals were recorded over three channels from the three main facial muscles that are responsible for speech. The EMG signals are then pre-processed to eliminate noise and interference signals. Segmentation procedure was implemented to extract the time event that corresponds to each vowel based on a moving standard deviation window. The accuracy of the segmentation procedure was found to be 94%. The recognition of the vowels was carried out by extracting features from the EMG in three domains: the temporal, the spectral, and the time frequency using the wavelet packet transform. Classification of the extracted features was then finally performed using different classification methods implemented in the WEKA software. The random forest classifier with time frequency features showed the best performance with an accuracy of 77% evaluated using a 10-fold cross-validation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Al-Jarrah O, Halawani A (2001) Recognition of gestures in Arabic sign language using neuro-fuzzy systems. Artif Intell 133:117–138CrossRef Al-Jarrah O, Halawani A (2001) Recognition of gestures in Arabic sign language using neuro-fuzzy systems. Artif Intell 133:117–138CrossRef
2.
Zurück zum Zitat Arjunan S, Weghorn H, Kumar D, Yau W (2006) Vowel recognition in English and German language using facial movement (SEMG) for speech control based HCI. In: Conferences in research and practice in information technology (CRPIT), vol 56, pp 13–18 Arjunan S, Weghorn H, Kumar D, Yau W (2006) Vowel recognition in English and German language using facial movement (SEMG) for speech control based HCI. In: Conferences in research and practice in information technology (CRPIT), vol 56, pp 13–18
3.
Zurück zum Zitat Asadpour V, Towhidkhah F, Homayounpour M (2006) Performance enhancement for audio-visual speaker identification using dynamic facial muscle model. Med Biol Eng Comput 44(10):919–930PubMedCrossRef Asadpour V, Towhidkhah F, Homayounpour M (2006) Performance enhancement for audio-visual speaker identification using dynamic facial muscle model. Med Biol Eng Comput 44(10):919–930PubMedCrossRef
4.
Zurück zum Zitat Bang-hua Y, Guo-zheng Y, Ting W, Rong-guo Y (2007) Subject-based feature extraction using fuzzy wavelet packet in brain-computer interfaces. Signal Process 87:1569–1574CrossRef Bang-hua Y, Guo-zheng Y, Ting W, Rong-guo Y (2007) Subject-based feature extraction using fuzzy wavelet packet in brain-computer interfaces. Signal Process 87:1569–1574CrossRef
5.
Zurück zum Zitat Betts B, Jorgensen C (2006) Small vocabulary communication and control using surface electromyography in an acoustically noisy environment. In: Proceedings of HICSS, Hawaii Betts B, Jorgensen C (2006) Small vocabulary communication and control using surface electromyography in an acoustically noisy environment. In: Proceedings of HICSS, Hawaii
6.
Zurück zum Zitat Betts B, Binsted K, Jorgensen C (2006) Small vocabulary speech recognition using electromyography. Interact Comput 18:1242–1259CrossRef Betts B, Binsted K, Jorgensen C (2006) Small vocabulary speech recognition using electromyography. Interact Comput 18:1242–1259CrossRef
7.
Zurück zum Zitat Caudill M (1989) Neural networks primer. Miller Freeman Publications, San Francisco, CA Caudill M (1989) Neural networks primer. Miller Freeman Publications, San Francisco, CA
8.
Zurück zum Zitat Daube J (2002) Clinical neurophysiology. Oxford University Press, Oxford Daube J (2002) Clinical neurophysiology. Oxford University Press, Oxford
9.
Zurück zum Zitat Englehart K, Hudgins B, Parker P, Stevenson M (1998) Time-frequency representation for classification of the transient myoelectric signal. In: Proceedings of the 20th annual international conference on engineering in medicine and biology, vol 5, pp 2627–2630 Englehart K, Hudgins B, Parker P, Stevenson M (1998) Time-frequency representation for classification of the transient myoelectric signal. In: Proceedings of the 20th annual international conference on engineering in medicine and biology, vol 5, pp 2627–2630
10.
Zurück zum Zitat Fraiwan L, Kasawneh N, Lweesy K (2009) Automatic sleep stage scoring with wavelet packets based on single EEG recording. In: Proceedings of the international conference on medical informatics and biomedical engineering (ICMIBE), Paris, vol 54, pp 513–516 Fraiwan L, Kasawneh N, Lweesy K (2009) Automatic sleep stage scoring with wavelet packets based on single EEG recording. In: Proceedings of the international conference on medical informatics and biomedical engineering (ICMIBE), Paris, vol 54, pp 513–516
11.
Zurück zum Zitat Hall M, Smith L (1998) Practical feature subset selection for machine learning. In: Proceedings of the 21st Australian computer science conference, pp 181–191 Hall M, Smith L (1998) Practical feature subset selection for machine learning. In: Proceedings of the 21st Australian computer science conference, pp 181–191
12.
Zurück zum Zitat Hannaford B, Lehman S (1986) Short time Fourier analysis of the electromyogram: fast movements and constant contraction. IEEE Trans Biomed Eng 33:1173–1181PubMedCrossRef Hannaford B, Lehman S (1986) Short time Fourier analysis of the electromyogram: fast movements and constant contraction. IEEE Trans Biomed Eng 33:1173–1181PubMedCrossRef
13.
Zurück zum Zitat Huang C, Chen C, Chung H (2005) The review of applications and measurements in facial electromyography. J Med Biol Eng 25(1):15–20 Huang C, Chen C, Chung H (2005) The review of applications and measurements in facial electromyography. J Med Biol Eng 25(1):15–20
14.
Zurück zum Zitat Itoh Y, Uematsu H, Nogata F, Nemoto T, Inamori A, Koide K, Matsuura H (2007) Finger curvature movement recognition interface technique using SEMG signals. J Achiev Mater Manuf Eng 23(2):43–46 Itoh Y, Uematsu H, Nogata F, Nemoto T, Inamori A, Koide K, Matsuura H (2007) Finger curvature movement recognition interface technique using SEMG signals. J Achiev Mater Manuf Eng 23(2):43–46
15.
Zurück zum Zitat Jang J (1993) ANFIS: adaptive-network-based fuzzy inference systems. IEEE Trans Syst Man Cybern 23(3):665–685CrossRef Jang J (1993) ANFIS: adaptive-network-based fuzzy inference systems. IEEE Trans Syst Man Cybern 23(3):665–685CrossRef
16.
Zurück zum Zitat Jou S, Schultz T, Walliczek M, Kraft F, Waibel A (2006) Towards continuous speech recognition using surface electromyography. In: Proceedings of ninth international conference on spoken language processing (ISCLP 2006), pp 573–576 Jou S, Schultz T, Walliczek M, Kraft F, Waibel A (2006) Towards continuous speech recognition using surface electromyography. In: Proceedings of ninth international conference on spoken language processing (ISCLP 2006), pp 573–576
17.
Zurück zum Zitat Khushaba R, Al-Jumaily A (2006) Fuzzy wavelet packet based feature extraction method for multifunction myoelectric control. Int J Biol Life Sci 2:3 Khushaba R, Al-Jumaily A (2006) Fuzzy wavelet packet based feature extraction method for multifunction myoelectric control. Int J Biol Life Sci 2:3
18.
Zurück zum Zitat Kim J, Jang W, Bein Z (1996) A dynamic gesture recognition system for Korean sign language (KSL). IEEE Trans Syst Man Cyber B 26(2):254–359CrossRef Kim J, Jang W, Bein Z (1996) A dynamic gesture recognition system for Korean sign language (KSL). IEEE Trans Syst Man Cyber B 26(2):254–359CrossRef
19.
Zurück zum Zitat Krzanowski W (1988) Principles of multivariate analysis. Oxford University Press, Oxford Krzanowski W (1988) Principles of multivariate analysis. Oxford University Press, Oxford
20.
Zurück zum Zitat Li PC, Chiang YY, Tsai KS, Young ST (2005) Genetic algorithm for the efficient selection of disyllabic word lists used in Mandarin speech discrimination tests. Med Biol Eng Comput 43(5):648–657PubMedCrossRef Li PC, Chiang YY, Tsai KS, Young ST (2005) Genetic algorithm for the efficient selection of disyllabic word lists used in Mandarin speech discrimination tests. Med Biol Eng Comput 43(5):648–657PubMedCrossRef
21.
Zurück zum Zitat Manabe H, Hiraiwa A, Sugimura T (2003) Unvoiced speech recognition using EMG-mime speech recognition. In: Proceedings of HFCS, Ft. Lauderdale, FL Manabe H, Hiraiwa A, Sugimura T (2003) Unvoiced speech recognition using EMG-mime speech recognition. In: Proceedings of HFCS, Ft. Lauderdale, FL
22.
Zurück zum Zitat Orfanidis J (1996) Introduction to signal processing. Prentice-Hall, Englewood Cliffs, NJ Orfanidis J (1996) Introduction to signal processing. Prentice-Hall, Englewood Cliffs, NJ
23.
Zurück zum Zitat Parsons T (1986) Voice and speech processing. McGraw-Hill, New York Parsons T (1986) Voice and speech processing. McGraw-Hill, New York
24.
Zurück zum Zitat Perrin E, Berger-Vachon C, Collet L (1999) Acoustical recognition of laryngeal pathology: a comparison of two strategies based on sets of features. Med Biol Eng Comput 37(5):652–658PubMedCrossRef Perrin E, Berger-Vachon C, Collet L (1999) Acoustical recognition of laryngeal pathology: a comparison of two strategies based on sets of features. Med Biol Eng Comput 37(5):652–658PubMedCrossRef
25.
Zurück zum Zitat Reaz M, Hussain M, Mohd-Yasin F (2006) Techniques of EMG signal analysis: detection, processing, classification and applications. Biol Proced Online 8(1):11–35CrossRef Reaz M, Hussain M, Mohd-Yasin F (2006) Techniques of EMG signal analysis: detection, processing, classification and applications. Biol Proced Online 8(1):11–35CrossRef
26.
Zurück zum Zitat Shanabeh T, Assaleh K, AL-Rousan M (2007) Spatio-temporal feature extraction techniques for isolated gesture recognition in Arabic sign language. IEEE Trans Syst Man Cybern B 26(2):254–359 Shanabeh T, Assaleh K, AL-Rousan M (2007) Spatio-temporal feature extraction techniques for isolated gesture recognition in Arabic sign language. IEEE Trans Syst Man Cybern B 26(2):254–359
28.
Zurück zum Zitat Staude G, Flachenecker C, Daumer M, Wolf W (2001) Onset detection in surface electromyographic signals: a systematic comparison of methods. EURASIP J Appl Signal Process 2:67–81 Staude G, Flachenecker C, Daumer M, Wolf W (2001) Onset detection in surface electromyographic signals: a systematic comparison of methods. EURASIP J Appl Signal Process 2:67–81
29.
Zurück zum Zitat Stylianou A, Luchies C, Insana M (2003) EMG onset detection using the maximum likelihood method. In: Summer bioengineering conference, Key Biscayne, FL, pp 1075–1076 Stylianou A, Luchies C, Insana M (2003) EMG onset detection using the maximum likelihood method. In: Summer bioengineering conference, Key Biscayne, FL, pp 1075–1076
30.
Zurück zum Zitat Van Boxtel G (2001) Optimal signal bandwidth for the recording of surface EMG activity of facial, jaw, oral, and neck muscles. Psychophysiology 38(1):22–34PubMedCrossRef Van Boxtel G (2001) Optimal signal bandwidth for the recording of surface EMG activity of facial, jaw, oral, and neck muscles. Psychophysiology 38(1):22–34PubMedCrossRef
31.
Zurück zum Zitat Vuskovic M, Du S (2005) Spectral moments for feature extraction from temporal signals. In: International conference on intelligent computing, pp 1063–1072 Vuskovic M, Du S (2005) Spectral moments for feature extraction from temporal signals. In: International conference on intelligent computing, pp 1063–1072
32.
Zurück zum Zitat Wang C, Gao W, Xuang Z (2001) A real-time large vocabulary continuous recognition system for Chinese sign language. In: Proceedings of IEEE pacific rim conference on multimedia, pp 150–157 Wang C, Gao W, Xuang Z (2001) A real-time large vocabulary continuous recognition system for Chinese sign language. In: Proceedings of IEEE pacific rim conference on multimedia, pp 150–157
33.
Zurück zum Zitat Witten I, Frank E (2005) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco, CA Witten I, Frank E (2005) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco, CA
34.
Zurück zum Zitat Xiao H, Qun Y, Waixi L, Jian Q (2008) Feature extraction of surface EMG signal based on wavelet coefficient entropy. Proceedings of the 2nd international conference on bioinformatics and biomedical engineering (ICBBE), pp 1758–1760 Xiao H, Qun Y, Waixi L, Jian Q (2008) Feature extraction of surface EMG signal based on wavelet coefficient entropy. Proceedings of the 2nd international conference on bioinformatics and biomedical engineering (ICBBE), pp 1758–1760
35.
Zurück zum Zitat Zecca M, Micera S, Carrozza M, Dario P (2002) Control of multifunctional prosthetic hands by processing the electromyographic signal. Crit Rev Biomed Eng 30:459–485PubMedCrossRef Zecca M, Micera S, Carrozza M, Dario P (2002) Control of multifunctional prosthetic hands by processing the electromyographic signal. Crit Rev Biomed Eng 30:459–485PubMedCrossRef
Metadaten
Titel
Voiceless Arabic vowels recognition using facial EMG
verfasst von
Luay Fraiwan
Khaldon Lweesy
Ayat Al-Nemrawi
Sondos Addabass
Rasha Saifan
Publikationsdatum
01.07.2011
Verlag
Springer-Verlag
Erschienen in
Medical & Biological Engineering & Computing / Ausgabe 7/2011
Print ISSN: 0140-0118
Elektronische ISSN: 1741-0444
DOI
https://doi.org/10.1007/s11517-011-0751-1

Weitere Artikel der Ausgabe 7/2011

Medical & Biological Engineering & Computing 7/2011 Zur Ausgabe

Premium Partner