Skip to main content
Erschienen in: International Journal of Speech Technology 2/2020

04.03.2020

A genetic model for acoustic and phonetic decoding of standard arabic vowels in continuous speech

verfasst von: M. Aissiou

Erschienen in: International Journal of Speech Technology | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents an Acoustic and Phonetic Decoding Model (APDM) for automatic recognition Standard Arab (S.A) plain (pure) and emphatic vowels sounds of continuous naturally spoken speech, using Genetic Algorithms (G.As). SA vowels were selected since they are the most difficult phonemes to recognize. We have used GAs because of their advantages in resolving complicated optimization problems and because the results are obtained more rapidly than in the classical methods. In addition, the computational cost is greatly reduced. Our Genetic APDM performs automatically and in parallel, the operation of concatenations of short-term parametric vectors during the speech continuum segmentation stage, and the classification of the acoustic segments of continuous and natural speech into different vowel classes. In order to perform our task, we have used both the Mel Frequency Cepstrum Coding (MFCC) and the Linear Prediction Coding (LPC) methods to extract vocal tract parametric coefficients from the speech signal successfully. Among a set of classifiers we have used the distance one. This paper explains how we have used the Manhattan distance as decision rule the GA evaluation to classify the discriminate parameters vectors. The analysed corpus contains hundreds of sentences composed of the all types of SA vowels in different contexts and recorded by several Algerian male and female speakers, in quite noisy environment. The Corpus phonemes were classified successfully with an overall average rate of 98.02%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Aissiou. M., & Guerti. M. (2006). Classification Génétique des Voyelles de l'Arabe Standard. International Confrence On Modelling and Diagnostics. ICCMD"06, Proceeding and CD, pp. 75. Annaba. Algerie. Aissiou. M., & Guerti. M. (2006). Classification Génétique des Voyelles de l'Arabe Standard. International Confrence On Modelling and Diagnostics. ICCMD"06, Proceeding and CD, pp. 75. Annaba. Algerie.
Zurück zum Zitat Al Ani, S. H. (1970). Arabic phonology an acoustical and physiological investigation. The Hague: Mouton.CrossRef Al Ani, S. H. (1970). Arabic phonology an acoustical and physiological investigation. The Hague: Mouton.CrossRef
Zurück zum Zitat Alghamdi, M. (2003). KACST arabic phonetic database. The Fifteenth International Congress of Phonetics Science, Barcelona. Alghamdi, M. (2003). KACST arabic phonetic database. The Fifteenth International Congress of Phonetics Science, Barcelona.
Zurück zum Zitat Cantineau, J. (1960). Cours de Phonétique Arabe. Paris: Klincksiek. Cantineau, J. (1960). Cours de Phonétique Arabe. Paris: Klincksiek.
Zurück zum Zitat Cohen, D. (1969). Statut phonologique de l’emphase en arabe. Word,25, 59–69.CrossRef Cohen, D. (1969). Statut phonologique de l’emphase en arabe. Word,25, 59–69.CrossRef
Zurück zum Zitat Davis, L. (1991). Handbook of genetic algorithms. New York: Van Nostrand Reinhold. Davis, L. (1991). Handbook of genetic algorithms. New York: Van Nostrand Reinhold.
Zurück zum Zitat De Jong, K. A., & Spears, W. M. (1991). Learning concept classification rules using genetic algorithms. International Joint Conference on Artificial Intelligence,1, 651–656.MATH De Jong, K. A., & Spears, W. M. (1991). Learning concept classification rules using genetic algorithms. International Joint Conference on Artificial Intelligence,1, 651–656.MATH
Zurück zum Zitat Duda, R. O., Hart, P. E., & Stork, D. G. (2000). Pattern classification (2nd ed.). New York: Wiley.MATH Duda, R. O., Hart, P. E., & Stork, D. G. (2000). Pattern classification (2nd ed.). New York: Wiley.MATH
Zurück zum Zitat Fre Woldu, K. (1981). Facts regarding Arabic Emphatic Consonants Production, Ruub, 7. Fre Woldu, K. (1981). Facts regarding Arabic Emphatic Consonants Production, Ruub, 7.
Zurück zum Zitat Goldberg, D. E. (1989). Genetic algorithms in search, optimisation and machine learning. Reading: Addison-Wesley.MATH Goldberg, D. E. (1989). Genetic algorithms in search, optimisation and machine learning. Reading: Addison-Wesley.MATH
Zurück zum Zitat Goldberg, D. E. (2002). Design of innovation: Lessons from and for competent genetic algorithms. Boston, MA: Kluwer.CrossRef Goldberg, D. E. (2002). Design of innovation: Lessons from and for competent genetic algorithms. Boston, MA: Kluwer.CrossRef
Zurück zum Zitat Hou, J., Rabiner, L., & Dusan, S. (2008). Parallel and hierarchical speech feature classification using frame and segment-based methods. Brisban: Interspeech. Hou, J., Rabiner, L., & Dusan, S. (2008). Parallel and hierarchical speech feature classification using frame and segment-based methods. Brisban: Interspeech.
Zurück zum Zitat Morris, J. (2008). Conditional random fields for integrating local discriminative classifiers. IEEE Transactions on Audio, Speech and Language Processing,16(1), 617–628.CrossRef Morris, J. (2008). Conditional random fields for integrating local discriminative classifiers. IEEE Transactions on Audio, Speech and Language Processing,16(1), 617–628.CrossRef
Zurück zum Zitat Namrata, D. (2013). Feature extraction methods LPC, PLP and MFCC in speech recognition. International Journal for Advance Research in Engineering and Technology,1(6), 1–5. Namrata, D. (2013). Feature extraction methods LPC, PLP and MFCC in speech recognition. International Journal for Advance Research in Engineering and Technology,1(6), 1–5.
Metadaten
Titel
A genetic model for acoustic and phonetic decoding of standard arabic vowels in continuous speech
verfasst von
M. Aissiou
Publikationsdatum
04.03.2020
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 2/2020
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-020-09694-y

Weitere Artikel der Ausgabe 2/2020

International Journal of Speech Technology 2/2020 Zur Ausgabe

Neuer Inhalt