Skip to main content
Top

2018 | OriginalPaper | Chapter

An Ensemble Learning Based Bangla Phoneme Identification System Using LSF-G Features

Authors : Himadri Mukherjee, Sourav Ganguly, Santanu Phadikar, Kaushik Roy

Published in: Advanced Computational and Communication Paradigms

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Technology has evolved a lot in the last decade, and various devices have come up for assisting us in our day-to-day life. There has always been a need for simplifying the User Interfaces (UI) of such devices so that they can be easily interacted with, and a speech based UI can be a potential solution. Speech recognition is the task of identification of words from voice signals. Every language consists of a set of atomic sounds called Phonemes which builds up the entire vocabulary of that language. Speech recognition in Bangla is rather a complicated task due to the complex nature of the language like the presence of compound characters. In this paper, a Bangla Phoneme recognition system is proposed towards the development of a Bangla Speech recognition system based on Line Spectral Frequency-Grade (LSF-G) features derived from standard line spectral frequency values. The system has been tested on a Bangla Swarabarna Phoneme dataset of 3290 clips and an accuracy of 94.01% has been obtained with an Ensemble learning based approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Dudley, H.: The Vocoder. Bell Labs Rec. 17, 122–126 (1939) Dudley, H.: The Vocoder. Bell Labs Rec. 17, 122–126 (1939)
3.
go back to reference Dudley, H., Riesz, R.R., Watkins, S.A.: A synthetic speaker. J. Franklin Institute 227, 739–764 (1939)CrossRef Dudley, H., Riesz, R.R., Watkins, S.A.: A synthetic speaker. J. Franklin Institute 227, 739–764 (1939)CrossRef
4.
go back to reference Forgie, J.W., Forgie, C.D.: Results obtained from a vowel recognition computer program. J. Acoust. Soc. Am. 31, 1480–1489 (1959)CrossRef Forgie, J.W., Forgie, C.D.: Results obtained from a vowel recognition computer program. J. Acoust. Soc. Am. 31, 1480–1489 (1959)CrossRef
5.
go back to reference Desai, N., Dhameliya, K., Desai, V.: Feature extraction and classification techniques for speech recognition: a review. Int. J. Emerg. Technol. Adv. Eng. 3(12), 367–371 (2013) Desai, N., Dhameliya, K., Desai, V.: Feature extraction and classification techniques for speech recognition: a review. Int. J. Emerg. Technol. Adv. Eng. 3(12), 367–371 (2013)
9.
go back to reference Besacier, L., Barnard, E., Karpov, A., Schultz, T.: Automatic speech recognition for under-resourced languages: a survey. Speech Commun. (2013) Besacier, L., Barnard, E., Karpov, A., Schultz, T.: Automatic speech recognition for under-resourced languages: a survey. Speech Commun. (2013)
10.
go back to reference Pramanik, M., Kido, K.: Bengali speech: formant structures of single vowels and initial vowels of words. In: Proceedings of ICASSP, vol. 1, pp. 178–181 (1976) Pramanik, M., Kido, K.: Bengali speech: formant structures of single vowels and initial vowels of words. In: Proceedings of ICASSP, vol. 1, pp. 178–181 (1976)
11.
go back to reference Hasnat, M.A., Mowla, J., Khan M.: Isolated and continuous Bangla speech recognition: implementation performance and application perspective. In: Proceedings of SNLP (2007) Hasnat, M.A., Mowla, J., Khan M.: Isolated and continuous Bangla speech recognition: implementation performance and application perspective. In: Proceedings of SNLP (2007)
12.
go back to reference Hasanat, A., Karim, M.R., Rahman, M.S., Iqbal, M.Z.: Recognition of spoken letters in Bangla. In: Proceedings of ICCIT (2002) Hasanat, A., Karim, M.R., Rahman, M.S., Iqbal, M.Z.: Recognition of spoken letters in Bangla. In: Proceedings of ICCIT (2002)
13.
go back to reference Ali, MdA, Hossain, M., Bhuiyan, M.N.: Automatic speech recognition technique for Bangla words. Int. J. Adv. Sci. Technol. 50, 51–59 (2013) Ali, MdA, Hossain, M., Bhuiyan, M.N.: Automatic speech recognition technique for Bangla words. Int. J. Adv. Sci. Technol. 50, 51–59 (2013)
14.
go back to reference Firoze, A., Arifin, M.S., Quadir, R., Rahman, R.M.: Bangla isolated word speech recognition. Proceedings of ICEIS 2, 73–82 (2011) Firoze, A., Arifin, M.S., Quadir, R., Rahman, R.M.: Bangla isolated word speech recognition. Proceedings of ICEIS 2, 73–82 (2011)
15.
go back to reference Kotwal, M.R.A., Hossain, Md.S., Hassan, F., Muhammad, G., Huda, M.N., Rahman, C.M.: Bangla Phoneme recognition using hybrid features. In: Proceedings of ICECE (2010) Kotwal, M.R.A., Hossain, Md.S., Hassan, F., Muhammad, G., Huda, M.N., Rahman, C.M.: Bangla Phoneme recognition using hybrid features. In: Proceedings of ICECE (2010)
16.
go back to reference Hossain, K.K., Hossain, MdJ, Ferdousi, A., Khan, MdF: Comparative study of recognition tools as back-ends for Bangla Phoneme recognition. IJRCAR 2(12), 36–40 (2014) Hossain, K.K., Hossain, MdJ, Ferdousi, A., Khan, MdF: Comparative study of recognition tools as back-ends for Bangla Phoneme recognition. IJRCAR 2(12), 36–40 (2014)
18.
go back to reference Paliwal, K.K., Alsteris., L.: Usefulness of phase spectrum in human speech perception. In: Interspeech, pp. 2117–2120 (2003) Paliwal, K.K., Alsteris., L.: Usefulness of phase spectrum in human speech perception. In: Interspeech, pp. 2117–2120 (2003)
19.
go back to reference Paliwal, K.K.: On the use of line spectral frequency parameters for speech recognition. Digit. Signal Process. 2, 80–87 (1992)CrossRef Paliwal, K.K.: On the use of line spectral frequency parameters for speech recognition. Digit. Signal Process. 2, 80–87 (1992)CrossRef
20.
go back to reference Dietterich, T.G.: Ensemble Learning. The handbook of brain theory and neural networks, vol. 2, pp. 110–125 (2002) Dietterich, T.G.: Ensemble Learning. The handbook of brain theory and neural networks, vol. 2, pp. 110–125 (2002)
Metadata
Title
An Ensemble Learning Based Bangla Phoneme Identification System Using LSF-G Features
Authors
Himadri Mukherjee
Sourav Ganguly
Santanu Phadikar
Kaushik Roy
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8237-5_20