Skip to main content

2017 | OriginalPaper | Buchkapitel

READ—A Bangla Phoneme Recognition System

verfasst von : Himadri Mukherjee, Chayan Halder, Santanu Phadikar, Kaushik Roy

Erschienen in: Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Speech Recognition is a challenging task especially for a multilingual country like India as the speakers are habituated in using mixed language and accent. Bangla is a very popular language in East Asia and a fully functional Automated Speech Recognition System (ASR) for it is yet to be developed. Every language embodies a set of sounds called phoneme set, which is the building block for the words of that language. READ (Record Extract Approximate Distinguish) is a Bangla phoneme recognition system, proposed toward the development of a Bangla ASR. To start with, Mel Scale Cepstral Coefficient (MFCC) features have been used for testing on a database of 1400 Bangla vowel phonemes and an accuracy of 98.35% has been obtained.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat L. Muda, M. Begam and I. Elamvazuthi, “Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques”, Journal of Computing, Vol. 2, Issue 3, pp 138–143, 2010 L. Muda, M. Begam and I. Elamvazuthi, “Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques”, Journal of Computing, Vol. 2, Issue 3, pp 138–143, 2010
2.
Zurück zum Zitat H. Dudley, “The Vocoder”, Bell Labs Record, Vol. 17, pp. 122–126, 1939. H. Dudley, “The Vocoder”, Bell Labs Record, Vol. 17, pp. 122–126, 1939.
3.
Zurück zum Zitat H. Dudley, R. R. Riesz and S. A. Watkins, “A Synthetic Speaker”, J. Franklin Institute, Vol. 227, pp. 739–764, 1939. H. Dudley, R. R. Riesz and S. A. Watkins, “A Synthetic Speaker”, J. Franklin Institute, Vol. 227, pp. 739–764, 1939.
4.
Zurück zum Zitat J. W. Forgie and C. D. Forgie, “Results obtained from a Vowel Recognition Computer Program”, The Journal of the Acoustical Society of America, Vol. 31, pp. 1480–1489, 1959. J. W. Forgie and C. D. Forgie, “Results obtained from a Vowel Recognition Computer Program”, The Journal of the Acoustical Society of America, Vol. 31, pp. 1480–1489, 1959.
5.
Zurück zum Zitat N. Desai, K. Dhameliya and V. Desai, “Feature Extraction and Classification Techniques for Speech Recognition: A Review”, International Journal of Emerging Technology and Advanced Engineering, Vol. 3, Issue 12, pp. 367–371, 2013 N. Desai, K. Dhameliya and V. Desai, “Feature Extraction and Classification Techniques for Speech Recognition: A Review”, International Journal of Emerging Technology and Advanced Engineering, Vol. 3, Issue 12, pp. 367–371, 2013
9.
Zurück zum Zitat M. Pramanik and K. Kido, “Bengali Speech : Formant Structures of Single Vowels And Initial Vowels of Words”, In Proc. of ICASSP, Vol. 1, pp. 178–181, 1976. M. Pramanik and K. Kido, “Bengali Speech : Formant Structures of Single Vowels And Initial Vowels of Words”, In Proc. of ICASSP, Vol. 1, pp. 178–181, 1976.
10.
Zurück zum Zitat Lewis, M. Paul, G. F. Simons, and C. D. Fennig, “Ethnologue: Languages of the World”, Nineteenth edition. Dallas, Texas: SIL International, (eds.), 2016. Lewis, M. Paul, G. F. Simons, and C. D. Fennig, “Ethnologue: Languages of the World”, Nineteenth edition. Dallas, Texas: SIL International, (eds.), 2016.
11.
Zurück zum Zitat Md. A. Ali, M. Hossain and M. N. Bhuiyan, “Automatic Speech Recognition Technique for Bangla Words”, International Journal of Advanced Science and Technology Vol. 50, pp. 51–59, 2013 Md. A. Ali, M. Hossain and M. N. Bhuiyan, “Automatic Speech Recognition Technique for Bangla Words”, International Journal of Advanced Science and Technology Vol. 50, pp. 51–59, 2013
12.
Zurück zum Zitat A. Firoze, M. S. Arifin, R. Quadir and R. M. Rahman, “Bangla Isolated Word Speech Recognition”, In Proc. of ICEIS, Vol. 2, pp. 73–82, 2011 A. Firoze, M. S. Arifin, R. Quadir and R. M. Rahman, “Bangla Isolated Word Speech Recognition”, In Proc. of ICEIS, Vol. 2, pp. 73–82, 2011
13.
Zurück zum Zitat M. A. Hasnat, J. Mowla and M. Khan, “Isolated and Continuous Bangla Speech Recognition: Implementation Performance and application perspective”, In Proc. of SNLP, 2007. M. A. Hasnat, J. Mowla and M. Khan, “Isolated and Continuous Bangla Speech Recognition: Implementation Performance and application perspective”, In Proc. of SNLP, 2007.
14.
Zurück zum Zitat K. K. Hossain, Md. J. Hossain, A. ferdousi and Md. F. Khan, “Comparative Study of Recognition Tools as Back-Ends for Bangla Phoneme Recognition”, IJRCAR, Vol. 2, Issue 12, pp. 36–40, 2014 K. K. Hossain, Md. J. Hossain, A. ferdousi and Md. F. Khan, “Comparative Study of Recognition Tools as Back-Ends for Bangla Phoneme Recognition”, IJRCAR, Vol. 2, Issue 12, pp. 36–40, 2014
15.
Zurück zum Zitat R. Karim, M. S. Rahman and M. Z Iqbal, “Recognition of spoken letters in Bangla”, in Proc. of ICCIT, 2002. R. Karim, M. S. Rahman and M. Z Iqbal, “Recognition of spoken letters in Bangla”, in Proc. of ICCIT, 2002.
16.
Zurück zum Zitat M. R. A. Kotwal, Md. S. Hossain, F. Hassan, G. Muhammad, M. N. Huda and C. M. Rahman, “Bangla Phoneme Recognition Using Hybrid Features”, In Proc. of ICECE, 2010 M. R. A. Kotwal, Md. S. Hossain, F. Hassan, G. Muhammad, M. N. Huda and C. M. Rahman, “Bangla Phoneme Recognition Using Hybrid Features”, In Proc. of ICECE, 2010
18.
Zurück zum Zitat M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann and I. H. Witten, “The WEKA Data Mining Software: An Update”, SIGKDD Explorations, Vol. 11, Issue 1, pp. 10–18, 2009 M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann and I. H. Witten, “The WEKA Data Mining Software: An Update”, SIGKDD Explorations, Vol. 11, Issue 1, pp. 10–18, 2009
Metadaten
Titel
READ—A Bangla Phoneme Recognition System
verfasst von
Himadri Mukherjee
Chayan Halder
Santanu Phadikar
Kaushik Roy
Copyright-Jahr
2017
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-3153-3_59

Premium Partner