International Journal of Speech Technology

4 March 2020

A genetic model for acoustic and phonetic decoding of standard arabic vowels in continuous speech

Author: M. Aissiou

Published in: International Journal of Speech Technology | Issue 2/2020

Abstract

This paper presents an Acoustic and Phonetic Decoding Model (APDM) for the automatic recognition of Standard Arabic (SA) plain (pure) and emphatic vowel sounds in continuous, naturally spoken speech, using Genetic Algorithms (GAs). SA vowels were selected because they are the most difficult phonemes to recognize. We used GAs because of their advantages in solving complicated optimization problems, because they yield results more rapidly than classical methods, and because they greatly reduce the computational cost. Our genetic APDM performs two operations automatically and in parallel: the concatenation of short-term parametric vectors during the segmentation of the speech continuum, and the classification of the acoustic segments of continuous, natural speech into different vowel classes. To perform this task, we used both Mel-Frequency Cepstral Coefficient (MFCC) and Linear Predictive Coding (LPC) analysis to extract vocal tract parametric coefficients from the speech signal. Among the available classifiers, we chose a distance-based one. This paper explains how we used the Manhattan distance as the decision rule in the GA evaluation to classify the discriminant parameter vectors. The analysed corpus contains hundreds of sentences covering all types of SA vowels in different contexts, recorded by several Algerian male and female speakers in a quite noisy environment. The corpus phonemes were classified successfully with an overall average rate of 98.02%.
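
To make the idea of a Manhattan-distance decision rule inside a GA evaluation concrete, here is a minimal sketch in Python. It is not the paper's implementation: the abstract does not describe the chromosome encoding, the genetic operators, or the feature dimensions, so everything below is an assumption for illustration. The function names (`manhattan`, `fitness`, `evolve_prototype`, `classify`), the truncation selection, the one-point crossover, the Gaussian mutation, and the two-dimensional toy vectors (stand-ins for MFCC or LPC coefficients) are all hypothetical.

```python
import random


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def fitness(candidate, samples):
    """GA evaluation (lower is better): total L1 distance to the class samples."""
    return sum(manhattan(candidate, s) for s in samples)


def evolve_prototype(samples, pop_size=30, generations=60, sigma=0.3, seed=0):
    """Toy GA that evolves one prototype vector for a class (assumed design)."""
    rng = random.Random(seed)
    dim = len(samples[0])
    lo = min(min(s) for s in samples)
    hi = max(max(s) for s in samples)
    # Random initial population inside the value range of the samples.
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda c: fitness(c, samples))
        parents = pop[: pop_size // 2]  # truncation selection, keeps the best
        children = []
        while len(parents) + len(children) < pop_size:
            p1, p2 = rng.sample(parents, 2)
            cut = rng.randrange(1, dim) if dim > 1 else 0
            child = p1[:cut] + p2[cut:]       # one-point crossover (new list)
            i = rng.randrange(dim)
            child[i] += rng.gauss(0.0, sigma)  # Gaussian mutation of one gene
            children.append(child)
        pop = parents + children
    return min(pop, key=lambda c: fitness(c, samples))


def classify(vector, prototypes):
    """Decision rule: pick the class whose prototype is nearest in L1 distance."""
    return min(prototypes, key=lambda label: manhattan(vector, prototypes[label]))


# Made-up 2-D feature vectors for two hypothetical vowel classes.
training = {
    "a": [[1.0, 1.2], [0.8, 1.0], [1.1, 0.9]],
    "i": [[5.0, 5.1], [4.9, 5.3], [5.2, 4.8]],
}
prototypes = {label: evolve_prototype(vecs) for label, vecs in training.items()}
print(classify([1.0, 1.0], prototypes))
print(classify([5.0, 5.0], prototypes))
```

In this sketch the Manhattan distance plays two roles: it scores candidate prototypes during evolution, and it serves as the nearest-prototype decision rule at classification time. A real system would operate on MFCC or LPC vectors of much higher dimension and would also have to handle the segmentation step, which is omitted here.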

Title: A genetic model for acoustic and phonetic decoding of standard arabic vowels in continuous speech
Author: M. Aissiou
Publication date: 4 March 2020
Publisher: Springer US
Published in: International Journal of Speech Technology / Issue 2/2020
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-020-09694-y
