nach oben

International Journal of Speech Technology

Erschienen in:

14.09.2017

Arabic stop consonants characterisation and classification using the normalized energy in frequency bands

verfasst von: Karim Tahiry, Badia Mounir, Ilham Mounir, Laila Elmazouzi, Abdelmajid Farchi

Erschienen in: International Journal of Speech Technology | Ausgabe 4/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

In general, speech is made with sequences of consonants (fricatives, nasals and stops), vowels and glides. The classification of the stop consonants remains one of the most challenging problems in speech recognition. In this paper, we propose a new approach based on the normalized energy in frequency bands in the release and closure phases in order to characterize and classify the Arabic stop consonants (/b/, /d/, /t/, /k/ and /q/) and to recognize the CV syllable. Classification experiments were performed using decision algorithms on stop consonants C and CV syllables extracted from an Arabic corpus. The results yielded to an overall stop consonants classification of 90.27% and syllables CV recognition upper than 90% for all stops.

Vorheriger Artikel Factored front-end CMLLR for joint speaker and environment normalization under DNN-HMM

Nächster Artikel Spoken character classification using abductive network

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Abdelatty Ali, A., Spiegel, J. V., & Mueller, P. (2001). Acoustic-phonetic features for the automatic classification of stop consonants. IEEE Transactions on Speech and Audio Processing, 9(8), 833–841.CrossRef

AlDahri, S. S. (2012). A study of voice onset time for modern standard Arabic and classical Arabic. IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC 2012), Hong Kong.

AlDahri, S. S., & Alotaibi, Y. A. (2010). A cross language survey of VOT values for stops (/d/, /t/). IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2010), Xiamen, China.

Benguerel, A. P., & Bhatia, T. K. (1980). Hindi stop consonants: An acoustic and fiberscopic study. Phonetica, 37, 134–148.CrossRef

Blumstein, S. E., & Stevens, K. N. (1980). Perceptual invariance and onset spectra for stop consonants in different vowel environments. The Journal of the Acoustical Society of America, 67, 648–662.CrossRef

Bush, M. A., Kopec, G. E., & Zue, V. W. (1983). Selecting acoustic features for stop consonant identification. In Proc. ICASSP.

Chodroff, E., & Colin, W. (2014). Burst spectrum as a cue for the stop voicing contrast in American English. The Journal of the Acoustical Society of America, 136(5), 2762–2772.CrossRef

Cooper, F. S., Delattre, P. C., Liberman, A. M., Borst, J. M., & Gerstman, L. J. (1952). Some experiments on the perception of synthetic speech sounds. The Journal of the Acoustical Society of America, 24, 597–606.CrossRef

De Mori, R., & Flammia, G. (1993). Speaker-independent consonant classification in continuous speech with distinctive features and neural networks. The Journal of the Acoustical Society of America, 94(6), 3091–3103.CrossRef

Forrest, K., Weismer, G., Milenkovic, P., & Dougall, R. (1988). Statistical analysis of word-initial voiceless obstruents preliminary data. The Journal of the Acoustical Society of America, 84, 115–123.CrossRef

Fuchs, S. (2005). Articulatory correlates of the voicing contrast in alveolar obstruent production in German. Ph.D. Thesis. Queen Margaret University College, Edinburgh, UK.

Ghosh, P. K., & Narayanan, S. S. (2009). Closure duration analysis of incomplete stop consonants due to stop-stop interaction. The Journal of the Acoustical Society of America, 126, EL1–EL7.CrossRef

Jayan, A. R., Rajath, P. S., & Pandey, P. C. (2011). Detection of burst onset landmarks in speech using rate of change of spectral moments. National Conference on Communications - NCC.

Juneja, A., & Espy-Wilson, C. (2002). Segmentation of continuous speech using acoustic-phonetic parameters and statistical learning. Proceedings of the 9th International Conference on Neural Information Processing (pp. 726–730).

Kiefte, M. (2003). Temporal information in gated stop consonants. Speech Communication, 40, 315–333.CrossRef

Liberman, A. M., Delattre, P. C., & Cooper, F. S. (1958). Some cues for the distinction between voiced and voiceless stops in initial position. Language and Speech, 1(3), 153–167.CrossRef

Liberman, A. M., Delattre, P. C., Cooper, F. S., & Gerstman, L. J. (1954). The role of consonant–vowel transitions in the perception of the stop and nasal consonants. Psychological Monographs, 68(8), 1–13.CrossRef

Lisker, L., & Abramson, A. S. (1964). A cross language study of voicing in initial stops: Acoustical measurements. Word, 20(3), 384–422.CrossRef

Lisker, L., & Abramson, A. S. (1970). The voicing dimension: Some experiments in comparative phonetics. Proceedings of the 6th ICPhS, Prague, Czech Republic (pp. 563–567).

Liu, S. A. (1996). Landmark detection for distinctive feature based speech recognition. The Journal of the Acoustical Society of America, 100, 3417–3430.CrossRef

Mitleb, F. (2009). Voice onset time of Jordanian Arabic stops. The 3rd International Conference on Arabic Language Processing (CITALA’09), Rabat, Morocco (pp. 133–135).

Nittrouer, S. (1995). Children learn separate aspects of speech production at different rates: vidence from spectral moments. The Journal of the Acoustical Society of America, 97, 520–530.CrossRef

Oden, G. C., & Massaro, D. W. (1978). Integration of featural information in speech perception. Psychological Review, 85(3), 172–191.CrossRef

Rothenberg, M. (2009). Voice onset time vs. articulatory modelling for stop consonants. Journal of Logopedics Phoniatrics Vocology, 34, 171–180.CrossRef

Searle, C. J. (1979). Stop consonant discrimination based on human audition. The Journal of the Acoustical Society of America, 65(3), 799–809.CrossRef

Stevens, K. N. (1993). Models for the production and acoustics of stop consonants. Speech Communication, 13, 367–375.CrossRef

Suchato, A. (2004). Classification of stop place of articulation. Ph.D. Thesis, MIT.

Suchato, A., & Punyabukkana, P. (2005). Factors in classification of stop consonant place of articulation. Proc. Interspeech (pp. 2969–2972).

Sussman, H. M., Fruchter, D., Hilbert, J., & Sirosh, J. (1998). Linear correlates in the speech signal: The orderly output constraint. Behavioral and Brain Sciences, 21, 241–299.

Tahiry, K., Mounir, B., Mounir, I., & Farchi, A. (2016). Energy bands and spectral cues for Arabic vowels recognition. International Journal of Speech Technology, 19(4), 707–716.CrossRef

Zue, V. W. (1976). Acoustic characteristics of stop consonants: A controlled study. Ph.D. dissertation, MIT, Cambridge, MA.

Titel: Arabic stop consonants characterisation and classification using the normalized energy in frequency bands
verfasst von: Karim Tahiry
Badia Mounir
Ilham Mounir
Laila Elmazouzi
Abdelmajid Farchi
Publikationsdatum: 14.09.2017
Verlag: Springer US
Erschienen in: International Journal of Speech Technology / Ausgabe 4/2017
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-017-9454-9

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Jonas Klose/© Pine Valley Capital GmbH, Carina Kießling von der Strategieberatung Roland Berger/© Monika Walther Fotografie | ATZ, Beijing Auto Show 2024: Deutsche Hersteller wollen angreifen./© EKH-Pictures / Generated with AI / Stock.adobe.com, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 4/2017

A voice command detection system for aerospace applications

A novel method in audio message encryption based on a mixture of chaos function

Efficient compression and reconstruction of speech signals using compressed sensing

A heterogeneous speech feature vectors generation approach with hybrid hmm classifiers

Performance enhancement of speaker identification systems using speech encryption and cancelable features

Research on English machine translation system based on the internet

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.