Skip to main content
Top

2016 | OriginalPaper | Chapter

Algorithm of Allophone Borders Correction in Automatic Segmentation of Acoustic Units

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In concatenative speech synthesis the fundamental factor with heavy influence on synthesized speech quality is the database of acoustic units. In case of bases received in automatic way, the key matter is suitable marking the borders of acoustic units. This article describes the algorithm of correction of acoustic units borders appointive in automatic way. It is based on two factors specified and tested here. It also describes worked out method of grade of acoustic units database, which allows to observe the influence of introduced correction on the base quality.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Almpanidis, G., Kotropoulos, C.: Automatic phonemic segmentation using the Bayesian information criterion with generalised gamma priors. In: Proceedings of EUSIPCO (2007) Almpanidis, G., Kotropoulos, C.: Automatic phonemic segmentation using the Bayesian information criterion with generalised gamma priors. In: Proceedings of EUSIPCO (2007)
2.
go back to reference Szklanny, K., Oliver, D.: Creation and analysis of a Polish speech database for use in unit selection speech synthesis. In: LREC Conference, Genova (2006) Szklanny, K., Oliver, D.: Creation and analysis of a Polish speech database for use in unit selection speech synthesis. In: LREC Conference, Genova (2006)
3.
go back to reference Dutoit, T.: An Introduction to Text-to-Speech Synthesis. Kluwer Academic Publishers, Dordrecht (1997)CrossRef Dutoit, T.: An Introduction to Text-to-Speech Synthesis. Kluwer Academic Publishers, Dordrecht (1997)CrossRef
4.
go back to reference Taylor, P.: Text-to-Speech Synthesis. Cambridge University Press, Cambridge (2009)CrossRef Taylor, P.: Text-to-Speech Synthesis. Cambridge University Press, Cambridge (2009)CrossRef
5.
go back to reference Van Santen, J.P.H., Sproat, R., Olive, J., Hirshberg, J.: Progress in Speech Synthesis. Springer, New York (1997)CrossRef Van Santen, J.P.H., Sproat, R., Olive, J., Hirshberg, J.: Progress in Speech Synthesis. Springer, New York (1997)CrossRef
6.
go back to reference Szpilewski, E., Piórkowska, B., Rafałko, J., Lobanov, B., Kiselov, V., Tsirulnik, L.: Polish TTS in multi-voice slavonic languages speech synthesis system. In: Proceedings of 9th International Conference Speech and Computer, SPECOM 2004, Saint-Petersburg, Russia, pp. 565–570 (2004) Szpilewski, E., Piórkowska, B., Rafałko, J., Lobanov, B., Kiselov, V., Tsirulnik, L.: Polish TTS in multi-voice slavonic languages speech synthesis system. In: Proceedings of 9th International Conference Speech and Computer, SPECOM 2004, Saint-Petersburg, Russia, pp. 565–570 (2004)
7.
go back to reference Jassem, W.: Podstawy fonetyki akustycznej, wyd. PWN, Warszawa (1973) Jassem, W.: Podstawy fonetyki akustycznej, wyd. PWN, Warszawa (1973)
8.
go back to reference Lobanov, B., Piórkowska, B., Rafałko, J., Cyrulnik, L.: Peaлизaция мeжъязыкoвыx paзличий интoнaции зaвиepшённocти и нeзaвиepшённocти в cинтeзaтope pyccкoй и пoлcкoй peчи пo тeкcтy. In: Proceedings of International Conference on Computational Linguistics and Intellectual Technologies, Dialogue 2005, Zvenigorod, Russia, pp. 356–362 (2005) Lobanov, B., Piórkowska, B., Rafałko, J., Cyrulnik, L.: Peaлизaция мeжъязыкoвыx paзличий интoнaции зaвиepшённocти и нeзaвиepшённocти в cинтeзaтope pyccкoй и пoлcкoй peчи пo тeкcтy. In: Proceedings of International Conference on Computational Linguistics and Intellectual Technologies, Dialogue 2005, Zvenigorod, Russia, pp. 356–362 (2005)
9.
go back to reference Matoušek, J.: Building a new czech text-to-speech system using triphone-based speech units. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2000. LNCS (LNAI), vol. 1902, pp. 223–228. Springer, Heidelberg (2000)CrossRef Matoušek, J.: Building a new czech text-to-speech system using triphone-based speech units. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2000. LNCS (LNAI), vol. 1902, pp. 223–228. Springer, Heidelberg (2000)CrossRef
10.
go back to reference Rafalko, J.: The algorithms of automation of the process of creating acoustic units databases in the Polish speech synthesis. In: Atanassov, K.T., et al. (eds.) Novel Developments in Uncertainty Representation and Processing. AISC, vol. 401, pp. 373–383. Springer, Heidelberg(2015)CrossRef Rafalko, J.: The algorithms of automation of the process of creating acoustic units databases in the Polish speech synthesis. In: Atanassov, K.T., et al. (eds.) Novel Developments in Uncertainty Representation and Processing. AISC, vol. 401, pp. 373–383. Springer, Heidelberg(2015)CrossRef
11.
go back to reference Skrelin, P.A.: Allophone-based concatenative speech synthesis system for Russian. In: Matoušek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds.) TSD 1999. LNCS (LNAI), vol. 1692, pp. 156–159. Springer, Heidelberg (1999)CrossRef Skrelin, P.A.: Allophone-based concatenative speech synthesis system for Russian. In: Matoušek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds.) TSD 1999. LNCS (LNAI), vol. 1692, pp. 156–159. Springer, Heidelberg (1999)CrossRef
Metadata
Title
Algorithm of Allophone Borders Correction in Automatic Segmentation of Acoustic Units
Author
Janusz Rafałko
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-45378-1_41

Premium Partner