Published in: International Journal of Speech Technology 4/2017

14 September 2017

Arabic stop consonants characterisation and classification using the normalized energy in frequency bands

Authors: Karim Tahiry, Badia Mounir, Ilham Mounir, Laila Elmazouzi, Abdelmajid Farchi

Abstract

In general, speech consists of sequences of consonants (fricatives, nasals and stops), vowels and glides. The classification of stop consonants remains one of the most challenging problems in speech recognition. In this paper, we propose a new approach based on the normalized energy in frequency bands during the closure and release phases in order to characterize and classify the Arabic stop consonants (/b/, /d/, /t/, /k/ and /q/) and to recognize the CV syllable. Classification experiments were performed using decision algorithms on stop consonants (C) and CV syllables extracted from an Arabic corpus. The results yielded an overall stop consonant classification rate of 90.27% and CV syllable recognition rates above 90% for all stops.
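The central feature named in the abstract, normalized energy in frequency bands, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names, the band edges, the frame length, and the use of a naive DFT on a single frame. The abstract does not state which bands the authors use or how the closure and release phases are segmented.

```python
import cmath
import math


def power_spectrum(frame):
    """One-sided power spectrum of a real-valued frame, via a naive DFT."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def normalized_band_energies(frame, sample_rate, band_edges_hz):
    """Energy in each band divided by the summed energy of all bands.

    band_edges_hz is a hypothetical list of increasing edges; band i covers
    the half-open interval [band_edges_hz[i], band_edges_hz[i + 1]).
    """
    n = len(frame)
    power = power_spectrum(frame)
    energies = []
    for low, high in zip(band_edges_hz[:-1], band_edges_hz[1:]):
        # Bin k of an n-point DFT sits at frequency k * sample_rate / n.
        band = sum(p for k, p in enumerate(power) if low <= k * sample_rate / n < high)
        energies.append(band)
    total = sum(energies)
    if total == 0:
        # Silent frame: return zeros rather than dividing by zero.
        return [0.0] * len(energies)
    return [e / total for e in energies]


# Illustrative input only: a 1 kHz tone sampled at 8 kHz, with three assumed bands.
tone = [math.sin(2 * math.pi * 1000 * t / 8000) for t in range(64)]
features = normalized_band_energies(tone, 8000, [0, 500, 1500, 4000])
```

In a pipeline of the kind the abstract outlines, a vector like `features` would be computed separately for the closure and release segments of each stop and then passed to a decision algorithm. Because each value is divided by the total, the features do not depend on the overall signal level.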

Metadata
Title
Arabic stop consonants characterisation and classification using the normalized energy in frequency bands
Authors
Karim Tahiry
Badia Mounir
Ilham Mounir
Laila Elmazouzi
Abdelmajid Farchi
Publication date
14 September 2017
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 4/2017
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-017-9454-9
