Skip to main content
Top

2022 | OriginalPaper | Chapter

Word Estimation in Continuous Colloquial Bengali Speech

Author : Suman Das

Published in: Computational Advancement in Communication, Circuits and Systems

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Word segmentation is a crucial part in any speech to text conversion. Many works have been done on popular languages, especially on English, but a very few work has been carried out on Bengali language, especially on colloquial speech. In our work, we present a simple pitch profile-based technique to find the words in a Bengali speech. We extract the feature of the existence of words based on the pitch profile of a speech. To find the pitch profile, we have used the state phase technique. A simple deviation of a 20 ms window is studied to find the pitch. In order to reshape the pitch, power profile of the speech is used. Then apply morphology to make the profile more robust. Finally, we cluster the data and use silhouette index to select the number clusters present in the data which in turn estimate the word boundaries. The algorithm is tested over continuous colloquial Bengali speech.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Rabiner L, Juang BH, Yegnanarayana B (2009) Fundamental of speech recognition. Pearson Rabiner L, Juang BH, Yegnanarayana B (2009) Fundamental of speech recognition. Pearson
2.
go back to reference Chowdhury S (2006) Concatenative text-to-speech synthesis: a study on standard colloquial Bengali, Ph.D. thesis. Statistical Institute, Kolkata Chowdhury S (2006) Concatenative text-to-speech synthesis: a study on standard colloquial Bengali, Ph.D. thesis. Statistical Institute, Kolkata
3.
go back to reference Acoustics of Bangla Speech Sounds, Asoke Kumar Datta. Springer Publications Acoustics of Bangla Speech Sounds, Asoke Kumar Datta. Springer Publications
4.
go back to reference Das Mandal SK (2007) Role of shape parameters in speech recognition: a study on standard colloquial Bengali (SCB), Ph.D. thesis. Jadavpur University Das Mandal SK (2007) Role of shape parameters in speech recognition: a study on standard colloquial Bengali (SCB), Ph.D. thesis. Jadavpur University
5.
go back to reference Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65CrossRef Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65CrossRef
Metadata
Title
Word Estimation in Continuous Colloquial Bengali Speech
Author
Suman Das
Copyright Year
2022
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-16-4035-3_33