
International Journal of Speech Technology

1 September 2014

Investigation Amazigh speech recognition using CMU tools

Authors: Hassan Satori, Fatima ElHaoussi

Published in: International Journal of Speech Technology | Issue 3/2014


Abstract

This paper describes the development of a speaker-independent, continuous automatic speech recognition system for Amazigh. The system is built on the Carnegie Mellon University (CMU) Sphinx tools. An in-house corpus, Amazigh_Alphadigits, was used in both the training and testing phases. The corpus was collected as part of this work and consists of speech recordings and their transcriptions from 60 Moroccan Berber speakers (30 male and 30 female), all native speakers of Tarifit Berber. The system achieved its best performance, 92.89 %, when trained with 16 Gaussian mixtures.
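To make the "16 Gaussian mixtures" setting concrete: in HMM-based recognizers of the kind Sphinx implements, the acoustic score of a state is commonly modeled as a weighted sum of Gaussian densities over the feature vector. The following is a minimal illustrative sketch of that computation only. It is not the authors' code and not the Sphinx implementation; the function names, the diagonal-covariance form, and the toy parameters are all assumptions made for illustration.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, weights, means, variances):
    """Log of sum_k w_k * N(x; mean_k, var_k), computed with log-sum-exp."""
    terms = [
        math.log(w) + log_gaussian_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    peak = max(terms)  # subtract the largest term for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


# Toy (hypothetical) mixture: 16 equally weighted components over
# 2-dimensional features. Real systems use higher-dimensional features
# and parameters estimated from training data.
NUM_COMPONENTS = 16
weights = [1.0 / NUM_COMPONENTS] * NUM_COMPONENTS
means = [[float(k), -float(k)] for k in range(NUM_COMPONENTS)]
variances = [[1.0, 1.0] for _ in range(NUM_COMPONENTS)]

score = gmm_log_likelihood([0.5, -0.5], weights, means, variances)
print(f"log-likelihood: {score:.4f}")
```

Increasing the number of components lets the mixture fit more complex feature distributions, at the cost of more parameters to estimate from the training corpus.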


The Amazigh speech corpus was collected by students during two three-month periods (March to May of 2011 and of 2012), within the framework of the graduate programs of the Polydisciplinary Faculty of Nador, Morocco.


Title: Investigation Amazigh speech recognition using CMU tools
Authors: Hassan Satori, Fatima ElHaoussi
Publication date: 1 September 2014
Publisher: Springer US
Published in: International Journal of Speech Technology / Issue 3/2014
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-014-9223-y
