Skip to main content
Top

2017 | OriginalPaper | Chapter

A Framework to Analyze Quality of Service (QoS) for Text-To-Speech (TTS) Services

Authors : Mohd Farhan Md Fudzee, Mohamud Hassan, Hairulnizam Mahdin, Shahreen Kasim, Jemal Abawajy

Published in: Recent Advances on Soft Computing and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Quality of service (QoS) evaluation is vital for text-to-speech (TTS) web service applications. Most of the current solutions focus on either evaluating functional or nonfunctional attributes of the TTS. In this paper, we propose a QoS framework to evaluate and analyze the perceived QoS that combines general and specific mechanisms for measuring both functional and nonfunctional requirements of speech quality. General mechanism measures the response time of TTS services while specific mechanism measures intelligibility and naturalness through subjective quality measurements, which are mapped onto mean opinion score (MOS). The result shows the workability of the framework, tested by predetermined users to three services: service1 (Fromtexttospeech) resulting 47.84%; service2 and service3 (NaturalReader and Yakitome) are 31.62 and 21.53% respectively. The TTS services evaluation can be to enhance the user experience.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Patil, M., Kawitkar, R.S.: “Syllable” concatenation for text to speech synthesis for Devnagari script. Int. J. Adv. Res. Eng. Comput. Sci. Softw. 2(9), 180–184 (2012) Patil, M., Kawitkar, R.S.: “Syllable” concatenation for text to speech synthesis for Devnagari script. Int. J. Adv. Res. Eng. Comput. Sci. Softw. 2(9), 180–184 (2012)
2.
go back to reference Md Fudzee, M.F., Abawajy, J.: A protocol for discovering content adaptation services. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds.) ICA3PP 2011. LNCS, vol. 7017, pp. 235–244. Springer, Heidelberg (2011). doi:10.1007/978-3-642-24669-2CrossRef Md Fudzee, M.F., Abawajy, J.: A protocol for discovering content adaptation services. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds.) ICA3PP 2011. LNCS, vol. 7017, pp. 235–244. Springer, Heidelberg (2011). doi:10.​1007/​978-3-642-24669-2CrossRef
3.
go back to reference Wang, L., et al.: Evaluating text-to-speech intelligibility using template constrained generalized posterior probability. U.S. Patent Application (2012) Wang, L., et al.: Evaluating text-to-speech intelligibility using template constrained generalized posterior probability. U.S. Patent Application (2012)
4.
go back to reference Remes, U., Reima, K., Mikko, K.: Objective evaluation measures for speaker adaptive HMM-TTS systems. In: Proceedings of 8th ISCA Speech Synthesis Workshop (2013) Remes, U., Reima, K., Mikko, K.: Objective evaluation measures for speaker adaptive HMM-TTS systems. In: Proceedings of 8th ISCA Speech Synthesis Workshop (2013)
5.
go back to reference Möller, S., Wai, Y.C., Cote, N., Falk, T., Raake, A., Waltermann, A.: Speech quality estimation: models and trends. IEEE Sign. Process. Mag. 28, 18–28 (2011)CrossRef Möller, S., Wai, Y.C., Cote, N., Falk, T., Raake, A., Waltermann, A.: Speech quality estimation: models and trends. IEEE Sign. Process. Mag. 28, 18–28 (2011)CrossRef
6.
go back to reference Egger, S., et al.: Waiting times in quality of experience for web based services. In: 2012 Fourth International Workshop on Quality of Multimedia Experience (QoMEX). IEEE (2012) Egger, S., et al.: Waiting times in quality of experience for web based services. In: 2012 Fourth International Workshop on Quality of Multimedia Experience (QoMEX). IEEE (2012)
7.
go back to reference Streijl, C.R., Winkler, S., Hands, D.S.: Mean Opinion Score (MOS) revisited: methods and applications, limitations and alternatives. Multimedia Syst. 22, 213–227 (2014)CrossRef Streijl, C.R., Winkler, S., Hands, D.S.: Mean Opinion Score (MOS) revisited: methods and applications, limitations and alternatives. Multimedia Syst. 22, 213–227 (2014)CrossRef
8.
go back to reference Md Fudzee, M.F., Abawajy, J.: Request-driven cross-media content adaptation technique. In: Ragab, K., Helmy, T., Hassanien, A.E. (eds.) Developing Advanced Web Services Through P2P Computing and Autonomous Agents: Trends and Innovations, chap. 6, pp. 91–113. IGI Global (2010) Md Fudzee, M.F., Abawajy, J.: Request-driven cross-media content adaptation technique. In: Ragab, K., Helmy, T., Hassanien, A.E. (eds.) Developing Advanced Web Services Through P2P Computing and Autonomous Agents: Trends and Innovations, chap. 6, pp. 91–113. IGI Global (2010)
9.
go back to reference Eyben, F., et al.: Unsupervised clustering of emotion and voice styles for expressive TTS. In: International Conference on IEEE Acoustics, Speech and Signal Processing (ICASSP) (2012) Eyben, F., et al.: Unsupervised clustering of emotion and voice styles for expressive TTS. In: International Conference on IEEE Acoustics, Speech and Signal Processing (ICASSP) (2012)
10.
go back to reference Md Fudzee, M.F., Abawajy, J.: Management of Service level agreement for service-oriented content adaptation platform. In: Network and Traffic Engineering in Emerging Distributed Computing Applications, pp. 21–42 (2012) Md Fudzee, M.F., Abawajy, J.: Management of Service level agreement for service-oriented content adaptation platform. In: Network and Traffic Engineering in Emerging Distributed Computing Applications, pp. 21–42 (2012)
Metadata
Title
A Framework to Analyze Quality of Service (QoS) for Text-To-Speech (TTS) Services
Authors
Mohd Farhan Md Fudzee
Mohamud Hassan
Hairulnizam Mahdin
Shahreen Kasim
Jemal Abawajy
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-51281-5_59

Premium Partner