Skip to main content
Top

2018 | OriginalPaper | Chapter

Phonetic Transcription Comparison for Emotional Database for Speech Synthesis

Authors : Mukta Gahlawat, Amita Malik, Poonam Bansal

Published in: Speech and Language Processing for Human-Machine Communications

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Phonetics transcription is the process of representing the speech unit into phonetic alphabets. This is necessary step for doing speech synthesis. It involves segmentation and labelling of sound files. Transcription at phonetic level can be performed either manually or automatically. Both ways are implemented on different expressions like happy, neutral and sad. Comparisons using various parameters like pitch, power and formants are made for various emotions. Additionally, pros and cons of using manual and automatic segmentations are also discussed on the basis of result received on expressive speech corpus.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Kvale, K.: Segmentation and labelling of speech PhD Thesis submitted at Department of Telecommunication of The Norwegian Institute of Technology (1993) Kvale, K.: Segmentation and labelling of speech PhD Thesis submitted at Department of Telecommunication of The Norwegian Institute of Technology (1993)
2.
go back to reference Toledano, D.T., Luis, A., Gómez, H., Grande, L.V.: Automatic phonetic segmentation. IEEE Trans. Speech Audio Process. 11(6) (Nov 2003) Toledano, D.T., Luis, A., Gómez, H., Grande, L.V.: Automatic phonetic segmentation. IEEE Trans. Speech Audio Process. 11(6) (Nov 2003)
3.
go back to reference Wesenick, M.B., Kipp, A.: Estimating the Quality of Phonetic Transcriptions and Segmentations of Speech Signals, these Proceedings of ICSLP, Philadelphia/ USA (1996) Wesenick, M.B., Kipp, A.: Estimating the Quality of Phonetic Transcriptions and Segmentations of Speech Signals, these Proceedings of ICSLP, Philadelphia/ USA (1996)
4.
go back to reference Sudhakar, B., Raj, R.B.: Automatic speech segmentation to improve speech synthesis performance. In: 2013 International Conference on Circuits, Power and Computing Technologies [ICCPCT-2013], pp. 835–839 Sudhakar, B., Raj, R.B.: Automatic speech segmentation to improve speech synthesis performance. In: 2013 International Conference on Circuits, Power and Computing Technologies [ICCPCT-2013], pp. 835–839
5.
go back to reference Szklanny, K., Wójtowski, M.: Automatic segmentation quality improvement for realization of unit selection speech synthesis. In: IEEE Conference HSI, pp. 251–256 (2008) Szklanny, K., Wójtowski, M.: Automatic segmentation quality improvement for realization of unit selection speech synthesis. In: IEEE Conference HSI, pp. 251–256 (2008)
6.
go back to reference Gallardo-Antolín, A., Barra-Chicote, R., Schröder, M., Krstulovic, S., Montero, J.M.: Automatic phonetic segmentation of Spanish emotional speech. In: Interspeech, pp. 2905–2908. ISCA (2007) Gallardo-Antolín, A., Barra-Chicote, R., Schröder, M., Krstulovic, S., Montero, J.M.: Automatic phonetic segmentation of Spanish emotional speech. In: Interspeech, pp. 2905–2908. ISCA (2007)
9.
go back to reference Young, S., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book, , Version 2.1. Cambridge University (1997) Young, S., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book, , Version 2.1. Cambridge University (1997)
10.
go back to reference Salvi, G.: HTK Tutorial. K.T.H. Royal Institute of Technology, Department of Speech, Music and Hearing, Drottning Kristinas v. 31, SE-100 44, Stockholm, Sweden Salvi, G.: HTK Tutorial. K.T.H. Royal Institute of Technology, Department of Speech, Music and Hearing, Drottning Kristinas v. 31, SE-100 44, Stockholm, Sweden
Metadata
Title
Phonetic Transcription Comparison for Emotional Database for Speech Synthesis
Authors
Mukta Gahlawat
Amita Malik
Poonam Bansal
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-6626-9_21