2008 | OriginalPaper | Book chapter

20. Rule-Based Speech Synthesis

Written by: Rolf Carlson, Prof., Björn Granström, Prof.

Published in: Springer Handbook of Speech Processing

Publisher: Springer Berlin Heidelberg

Abstract

In this chapter, we review some of the issues in rule-based synthesis and specifically discuss formant synthesis. Formant synthesis and the theory behind it have played an important role both in the scientific progress in understanding how humans talk and in the development of the first speech technology applications. Its flexibility and small footprint make the approach still of interest and a valuable complement to the currently dominant methods based on concatenative data-driven synthesis. As already mentioned in the overview by Schroeter (Chap. 19), we also see a new trend to combine the rule-based and data-driven approaches. Formant features from a database can be used both to optimize a rule-based formant synthesis system and to optimize the search for good units in a concatenative system.

Metadata
Title
Rule-Based Speech Synthesis
Written by
Rolf Carlson, Prof.
Björn Granström, Prof.
Copyright year
2008
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-540-49127-9_20
