Skip to main content
Top
Published in: International Journal of Speech Technology 4/2016

16-08-2016

Maghrebian dialect recognition based on support vector machines and neural network classifiers

Authors: Mohamed Hassine, Lotfi Boussaid, Hassani Messaoud

Published in: International Journal of Speech Technology | Issue 4/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper investigates the feed forward back propagation neural network (FFBPNN) and the support vector machine (SVM) for the classification of two Maghrebian dialects: Tunisian and Moroccan. The dialect used by the Moroccan speakers is called “La Darijja” and that of Tunisians is called “Darija”. An Automatic Speech Recognition System is implemented in order to identify ten Arabic digits (from zero to nine). The implementation of our present system consists of two phases: The features extraction using a variety of popular hybrid techniques and the classification phase using separately the FFBPNN and the SVM. The experimental results showed that the recognition rates with both approaches have reached 98.3 % with FFBPNN and 97.5 % with SVM.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Alorifis, F. S. (2008). Automatic identification of arabic dialects using hidden markov models, Thesis, University of Pittsburgh. Alorifis, F. S. (2008). Automatic identification of arabic dialects using hidden markov models, Thesis, University of Pittsburgh.
go back to reference Ameen, A., Uma, R., & Madhusudana, R. (2012). Speaker recognition system using combined vector quantization and discrete Hidden Markov model. International Journal Of Computational Engineering Research, 2(3), 2250–3005. Ameen, A., Uma, R., & Madhusudana, R. (2012). Speaker recognition system using combined vector quantization and discrete Hidden Markov model. International Journal Of Computational Engineering Research, 2(3), 2250–3005.
go back to reference Amour, M., Bouhjar, A., & Boukhris, F. (2004). Introduction to amazigh language. Paris: IRCAM. Amour, M., Bouhjar, A., & Boukhris, F. (2004). Introduction to amazigh language. Paris: IRCAM.
go back to reference Antoniol, G., Rollo, V. F., & Venturi, G. (2005). Linear Predictive Coding and Cepstrum coefficients for mining time variant information from software repositories. St. Louis: International Workshop on Mining Software Repositories.CrossRef Antoniol, G., Rollo, V. F., & Venturi, G. (2005). Linear Predictive Coding and Cepstrum coefficients for mining time variant information from software repositories. St. Louis: International Workshop on Mining Software Repositories.CrossRef
go back to reference Barkat-Defradas, M., Hamdi, R., & Pellegrino, F. (2004). From linguistic characterization to automatic identification of arabic dialects (pp. 29–30). Paris: MIDL. Barkat-Defradas, M., Hamdi, R., & Pellegrino, F. (2004). From linguistic characterization to automatic identification of arabic dialects (pp. 29–30). Paris: MIDL.
go back to reference Biadsy, F. Hirschberg, J. & Habash, N. (2009). Spoken arabic dialect identification using phonotactic modeling. In Proceedings of the Workshop on Computational Approaches to Semitic Languages at the meeting of the European Association for Computational Linguistics (EACL), Athens. Biadsy, F. Hirschberg, J. & Habash, N. (2009). Spoken arabic dialect identification using phonotactic modeling. In Proceedings of the Workshop on Computational Approaches to Semitic Languages at the meeting of the European Association for Computational Linguistics (EACL), Athens.
go back to reference Boril, H., Sangwan, A., Hansen, J, H.L. (2012). Arabic Dialect Identification—‘Is the Secret in the Silence and Other Observations”, Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering,University of Texas at Dallas, Richardson, INTERSPEECH. Boril, H., Sangwan, A., Hansen, J, H.L. (2012). Arabic Dialect Identification—‘Is the Secret in the Silence and Other Observations”, Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering,University of Texas at Dallas, Richardson, INTERSPEECH.
go back to reference Boukous, A. (1998). The Moroccan Sociolinguistic Situation. Plurilinguismes (Le Maroc) (Vol. 16, pp. 5–30). Paris: Centre d’Etudes et de Recherches en Planification Linguistique. Boukous, A. (1998). The Moroccan Sociolinguistic Situation. Plurilinguismes (Le Maroc) (Vol. 16, pp. 5–30). Paris: Centre d’Etudes et de Recherches en Planification Linguistique.
go back to reference Burges, C. J. C. (1998). A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2, 121–167.CrossRef Burges, C. J. C. (1998). A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2, 121–167.CrossRef
go back to reference Cobert, D. (2003). Darija, A language of Modernity, estidios dialectlogia norteafricana y andalusi, 2003. Cobert, D. (2003). Darija, A language of Modernity, estidios dialectlogia norteafricana y andalusi, 2003.
go back to reference Dasigi, P., & Diab, M. (2011). CODACT: towards identifying orthographic variants in dialectal arabic, In Proceedings of the 5th International Joint Conference on Natural Language Processing (pp. 318–326), Chiang Mai, Thailand, 8–13. Dasigi, P., & Diab, M. (2011). CODACT: towards identifying orthographic variants in dialectal arabic, In Proceedings of the 5th International Joint Conference on Natural Language Processing (pp. 318–326), Chiang Mai, Thailand, 8–13.
go back to reference Diab, M., Hacioglu, K., & Jurafsky, D. (2007). Automated methods for processing Arabic text: From Tokenization to base phrase chunking. In A. den van Bosch, A. den van Bosch, & A. Soudi (Eds.), Arabic computational morphology: Knowledge-based and empirical methods. New York: Springer. Diab, M., Hacioglu, K., & Jurafsky, D. (2007). Automated methods for processing Arabic text: From Tokenization to base phrase chunking. In A. den van Bosch, A. den van Bosch, & A. Soudi (Eds.), Arabic computational morphology: Knowledge-based and empirical methods. New York: Springer.
go back to reference Elfardy, H. & Diab, M. (2103) Sentence-level dialect identification in arabic.In Proceedings of the 51st AnnualMeeting of the Association for Computational Linguistics, ACL 2013, Sofia. Elfardy, H. & Diab, M. (2103) Sentence-level dialect identification in arabic.In Proceedings of the 51st AnnualMeeting of the Association for Computational Linguistics, ACL 2013, Sofia.
go back to reference Gunawan, W., & Hasegawa-Johnson, M. (2001). PLP coefficients can be quantized at 400 bps (pp. 1–4). Salt Lake City: ICASSP. Gunawan, W., & Hasegawa-Johnson, M. (2001). PLP coefficients can be quantized at 400 bps (pp. 1–4). Salt Lake City: ICASSP.
go back to reference Habash, N., Rambow, O., & Roth, R. (2009). MADA + TOKAN, A toolkit for Arabic tokenization, diacritization, morphological disambiguation, POS tagging, stemming and lemmatization. In K. Choukri, & B. Maegaard (Eds.) Proceedings of the second international conference on Arabic Language resources and tools. The MEDAR Consortium. Habash, N., Rambow, O., & Roth, R. (2009). MADA + TOKAN, A toolkit for Arabic tokenization, diacritization, morphological disambiguation, POS tagging, stemming and lemmatization. In K. Choukri, & B. Maegaard (Eds.) Proceedings of the second international conference on Arabic Language resources and tools. The MEDAR Consortium.
go back to reference Habash, N., Roth, R., Rambow, O., Eskander, R., & Tomeh, N. (2013). Morphological Analysis and Disambiguation for Dialectal Arabic, In Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: Human language technologies (NAACL-HLT), Atlanta. Habash, N., Roth, R., Rambow, O., Eskander, R., & Tomeh, N. (2013). Morphological Analysis and Disambiguation for Dialectal Arabic, In Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: Human language technologies (NAACL-HLT), Atlanta.
go back to reference Hamdi, R., (2007). Rhythmic variation in arabic dialects, Thèse Université 7 novembre de Carthage Tunisie. Hamdi, R., (2007). Rhythmic variation in arabic dialects, Thèse Université 7 novembre de Carthage Tunisie.
go back to reference Hassine, M., Boussaid, L., & Messaoud, H. (2015). Hybrid techniques for Arabic Letter recognition. International Journal of Intelligent Information Systems, 4(1), 27–34.CrossRef Hassine, M., Boussaid, L., & Messaoud, H. (2015). Hybrid techniques for Arabic Letter recognition. International Journal of Intelligent Information Systems, 4(1), 27–34.CrossRef
go back to reference Haykin, S. (2009). Neural networks and learning machines. New York: Prentice Hall. Haykin, S. (2009). Neural networks and learning machines. New York: Prentice Hall.
go back to reference Hermansky, H. (1990). Perceptual linear predictive (PLP) analysis for speech. The Journal of the Acoustical Society of America, 87, 1738–1752.CrossRef Hermansky, H. (1990). Perceptual linear predictive (PLP) analysis for speech. The Journal of the Acoustical Society of America, 87, 1738–1752.CrossRef
go back to reference Lindasalwa, M., Mumtaj, B., & Elamvazuthi, I. (2010). Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. Journal of Computing, 2(3) Lindasalwa, M., Mumtaj, B., & Elamvazuthi, I. (2010). Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. Journal of Computing, 2(3)
go back to reference MATLAB User’s Guide. Mathworks Inc., 2006. MATLAB User’s Guide. Mathworks Inc., 2006.
go back to reference Nour-Eddine, L., & Abdelkader, A. (2015). GMM-Based Maghreb Dialect Identification System. Journal of Information Processing Systems, 11(1), 22–38. Nour-Eddine, L., & Abdelkader, A. (2015). GMM-Based Maghreb Dialect Identification System. Journal of Information Processing Systems, 11(1), 22–38.
go back to reference Pasha, Arfath, Al-Badrashinyy, Mohamed, Diaby, Mona, El Kholy, Ahmed, Eskander, Ramy, Habash, Nizar, et al. (2014). MADAMIRA: A fast, comprehensive tool for morphological analysis and disambiguation of Arabic. New York: Columbia University, Center for Computational Learning Systems, LREC. Pasha, Arfath, Al-Badrashinyy, Mohamed, Diaby, Mona, El Kholy, Ahmed, Eskander, Ramy, Habash, Nizar, et al. (2014). MADAMIRA: A fast, comprehensive tool for morphological analysis and disambiguation of Arabic. New York: Columbia University, Center for Computational Learning Systems, LREC.
go back to reference Price, J. Sophomore student (2005) Design an automatic speech recognition system using maltab. In Progress report for: Chesapeake information based aeronautics consortium August 2005, University of Maryland Eastern Shore Princess Anne. Price, J. Sophomore student (2005) Design an automatic speech recognition system using maltab. In Progress report for: Chesapeake information based aeronautics consortium August 2005, University of Maryland Eastern Shore Princess Anne.
go back to reference Sadat, F., Kazemi, F., & Farzindar, A. (2014). Automatic dentification of Arabic Language Varieties and Dialects in Social Media, In Proceedings of the Second Workshop on Natural Language Processing for Social Media (SocialNLP) (pp. 22–27), Dublin, August 24 2014. Sadat, F., Kazemi, F., & Farzindar, A. (2014). Automatic dentification of Arabic Language Varieties and Dialects in Social Media, In Proceedings of the Second Workshop on Natural Language Processing for Social Media (SocialNLP) (pp. 22–27), Dublin, August 24 2014.
go back to reference Semet, G., & Treffo, G. (2002). Speech recognition based on MFCC coefficients, TIPE. Semet, G., & Treffo, G. (2002). Speech recognition based on MFCC coefficients, TIPE.
go back to reference Shlens J. (2003). A tutorial on principal component analysis, Derivation, Discussion and Singular Value Decomposition. Shlens J. (2003). A tutorial on principal component analysis, Derivation, Discussion and Singular Value Decomposition.
go back to reference Smola, A.J. & Scholkopf, B. (1998) A tutorial on support vector regression, Tech. rep., NeuroCOLT2 Technical Report NC2-TR-1998-030. Smola, A.J. & Scholkopf, B. (1998) A tutorial on support vector regression, Tech. rep., NeuroCOLT2 Technical Report NC2-TR-1998-030.
go back to reference Srinivasan, A. (2011). Speech recognition using hidden markov model. Applied Mathematical Sciences, 5(79), 3943–3948. Srinivasan, A. (2011). Speech recognition using hidden markov model. Applied Mathematical Sciences, 5(79), 3943–3948.
go back to reference Venkateswarlu, R.L.K., Kumari, R. V. & Vani Jayasri, G. (2011). Speech recognition using radial basis function neural network, IEEE. Venkateswarlu, R.L.K., Kumari, R. V. & Vani Jayasri, G. (2011). Speech recognition using radial basis function neural network, IEEE.
go back to reference Yu, H. & Kim, S. (2012). SVM tutorial: Classification, regression, and ranking. In Handbook of Natural Computing (pp. 479–506). Yu, H. & Kim, S. (2012). SVM tutorial: Classification, regression, and ranking. In Handbook of Natural Computing (pp. 479–506).
Metadata
Title
Maghrebian dialect recognition based on support vector machines and neural network classifiers
Authors
Mohamed Hassine
Lotfi Boussaid
Hassani Messaoud
Publication date
16-08-2016
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 4/2016
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-016-9360-6

Other articles of this Issue 4/2016

International Journal of Speech Technology 4/2016 Go to the issue