Skip to main content
Top

2016 | OriginalPaper | Chapter

Prosody Analysis of Malay Language Storytelling Corpus

Authors : Izzad Ramli, Noraini Seman, Norizah Ardi, Nursuriati Jamil

Published in: Speech and Computer

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, the prosody of the storytelling speech corpus is analyzed. The main objective of the analysis is to develop prosody rules to convert neutral speech to storytelling speech. The speech corpus (neutral and storytelling speech) contains 464 speech sentences, 4,656 words, and 10,928 syllables. It was recorded by three female storytellers, one male professional speaker, two female speakers and two male speakers. The prosodic features considered for analysis are tempo, pause (sentence and phrase-level), duration, intensity, and pitch. Further analysis of the word categories exist in storytelling speech such as verb, adverb, adjective, noun, conjunction and amplifier are also conducted. The global prosody analysis showed that mean prosodic of storytelling is higher than neutral speech, especially intensity and pitch. Investigation on the word categories showed that words categorized as adverb, adjective, amplifier and conjunctions have significant number of prominent syllables. Meanwhile, nouns and verbs do not have significant difference between neutral and storytelling speech. Positions of the words (i.e. initial, middle, last) in a phrase for different word categories also proved to have different increasing factor in duration, pitch and intensity.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Khaw, Y.J., Tan, T., Sciences, C.: Preparation of MaDiTS corpus for Malay dialect translation and speech synthesis system. In: Speech, language and Audio in Multimedia Workshop (SLAM 2014), pp. 53–57 (2014) Khaw, Y.J., Tan, T., Sciences, C.: Preparation of MaDiTS corpus for Malay dialect translation and speech synthesis system. In: Speech, language and Audio in Multimedia Workshop (SLAM 2014), pp. 53–57 (2014)
2.
go back to reference Gelin, R., D’Alessandro, C., Le, Q.: Towards a storytelling humanoid robot. In: AAAI Fall Symposium Series on Dialog with Robots, pp. 137–138 (2010) Gelin, R., D’Alessandro, C., Le, Q.: Towards a storytelling humanoid robot. In: AAAI Fall Symposium Series on Dialog with Robots, pp. 137–138 (2010)
3.
go back to reference Theune, M., Meijs, K., Heylen, D., Ordelman, R.: Generating expressive speech for storytelling applications. IEEE Trans. Audio Speech Lang. Process. 14, 1099–1108 (2006)CrossRef Theune, M., Meijs, K., Heylen, D., Ordelman, R.: Generating expressive speech for storytelling applications. IEEE Trans. Audio Speech Lang. Process. 14, 1099–1108 (2006)CrossRef
4.
go back to reference Sarkar, P., Haque, A., Dutta, A.K., Gurunath Reddy, M., Harikrishna, D.M., Dhara, P., Verma, R., Narendra, N.P., Sunil Kr., S.B., Yadav, J., Rao, K.S.: Designing Prosody Rule-set for Converting Neutral TTS Speech to storytelling style speech for Indian Languages: Bengali, Hindi and Telugu, p. 4 (2014) Sarkar, P., Haque, A., Dutta, A.K., Gurunath Reddy, M., Harikrishna, D.M., Dhara, P., Verma, R., Narendra, N.P., Sunil Kr., S.B., Yadav, J., Rao, K.S.: Designing Prosody Rule-set for Converting Neutral TTS Speech to storytelling style speech for Indian Languages: Bengali, Hindi and Telugu, p. 4 (2014)
5.
go back to reference Mustafa, M.B., Don, Z.M., Ainon, R.N., Zainuddin, R., Knowles, G.: Developing an HMM-based speech synthesis system for Malay: a comparison of iterative and isolated unit training. IEICE Trans. Inf. Syst. 97(5), 1273–1282 (2014)CrossRef Mustafa, M.B., Don, Z.M., Ainon, R.N., Zainuddin, R., Knowles, G.: Developing an HMM-based speech synthesis system for Malay: a comparison of iterative and isolated unit training. IEICE Trans. Inf. Syst. 97(5), 1273–1282 (2014)CrossRef
6.
go back to reference Maekawa, K., Koiso, H., Furui, S., Isahara, H.: Spontaneous speech corpus of Japanese. In: Proceedings LREC2000 (Second International Conference on Language Resources and Evaluation), vol. 2, pp. 947–952 , May 2000 Maekawa, K., Koiso, H., Furui, S., Isahara, H.: Spontaneous speech corpus of Japanese. In: Proceedings LREC2000 (Second International Conference on Language Resources and Evaluation), vol. 2, pp. 947–952 , May 2000
7.
go back to reference Verma, R., Sarkar, P., Rao, K. S.: Conversion of neutral speech to storytelling style speech. In: 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR), pp. 1–6. IEEE, January 2015 Verma, R., Sarkar, P., Rao, K. S.: Conversion of neutral speech to storytelling style speech. In: 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR), pp. 1–6. IEEE, January 2015
8.
go back to reference Roekhaut, S., Goldman, J., Simon, A.C.: A Model for Varying Speaking Style in TTS systems, pp. 4–7 (2010) Roekhaut, S., Goldman, J., Simon, A.C.: A Model for Varying Speaking Style in TTS systems, pp. 4–7 (2010)
9.
go back to reference Sproat, R., Alm, C.O., Sproat, R.: Perceptions of emotions in expressive Perceptions of Emotions in Expressive Storytelling. In: INTERSPEECH, pp. 533–536 (2005) Sproat, R., Alm, C.O., Sproat, R.: Perceptions of emotions in expressive Perceptions of Emotions in Expressive Storytelling. In: INTERSPEECH, pp. 533–536 (2005)
10.
go back to reference Doukhan, D., Rilliard, A., Rosset, S., Adda-decker, M., Alessandro, C.: Prosodic analysis of a corpus of tales. In: INTERSPEECH, pp. 3129–3132 (2011) Doukhan, D., Rilliard, A., Rosset, S., Adda-decker, M., Alessandro, C.: Prosodic analysis of a corpus of tales. In: INTERSPEECH, pp. 3129–3132 (2011)
11.
go back to reference Pvribil, J., Pvribilová, A.: Application of expressive speech in TTS system with cepstral description. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds.) HH and HM Interaction. LNCS (LNAI), vol. 5042, pp. 200–212. Springer, Heidelberg (2008)CrossRef Pvribil, J., Pvribilová, A.: Application of expressive speech in TTS system with cepstral description. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds.) HH and HM Interaction. LNCS (LNAI), vol. 5042, pp. 200–212. Springer, Heidelberg (2008)CrossRef
12.
go back to reference Montaño, R., Alías, F., Ferrer, J.: Prosodic analysis of storytelling discourse modes and narrative situations oriented to Text-to-Speech synthesis. In: 8th ISCA Workshop on Speech Synthesis, pp. 171–176 (2013) Montaño, R., Alías, F., Ferrer, J.: Prosodic analysis of storytelling discourse modes and narrative situations oriented to Text-to-Speech synthesis. In: 8th ISCA Workshop on Speech Synthesis, pp. 171–176 (2013)
13.
go back to reference Boersma, P.: Praat, a system for doing phonetics by computer. Glot int. 5(9/10), 341–345 (2002) Boersma, P.: Praat, a system for doing phonetics by computer. Glot int. 5(9/10), 341–345 (2002)
14.
go back to reference Bulut, M., Narayanan, S.: On the robustness of overall F0- only modifications to the perception of emotions in speech. J. Acoust. Soc. Am. 123, 4547–4558 (2008)CrossRef Bulut, M., Narayanan, S.: On the robustness of overall F0- only modifications to the perception of emotions in speech. J. Acoust. Soc. Am. 123, 4547–4558 (2008)CrossRef
Metadata
Title
Prosody Analysis of Malay Language Storytelling Corpus
Authors
Izzad Ramli
Noraini Seman
Norizah Ardi
Nursuriati Jamil
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-43958-7_68

Premium Partner