Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model

Maeda, Shinji

doi:10.1007/978-94-009-2037-8_6

Shinji Maeda³

Part of the book series: NATO ASI Series ((ASID,volume 55))

607 Accesses
156 Citations

Abstract

Temporal variations observed in the vocal-tract profiles during continuous speech were studied using a factor analysis. Roughly 1000 frames of cineradiographic and labiofilm data corresponding to 10 French sentences uttered by two speakers have been analyzed. The factor analysis permits us to describe the observed profiles as the sum of a small number of linear components. Since these components can be interpreted by articulatory terms, the linear model is considered as an articulatory one. For example, the tongue profiles can be specified by the following four parameters; the mandibular and tongue-dorsal positions, the dorsal shape, and the apex position. The entire vocal-tract configurations including the frontal lip-opening shapes can be described with reasonable accuracy by as few as seven parameters in total.

With this model the temporal variations are described in terms of the frame-by-frame samples of these articulatory parameters. We observe that “target” parameter values for the same vowel vary significantly presumably due to different phonetic contexts. An acoustic calculation with the model predicts that a particular pair of articulators can compensate acoustically each other. For example, by an appropriate adjustment of the tongue-dorsal position, the model is capable of producing the same F1-F2 pattern for different jaw position, or vice-versa. The compensation between the jaw and the dorsal positions, however, is possible only for unrounded vowels. In the case of rounded vowels, the jaw position can be compensated by the lip aperture.

The measured “target” values of the paired parameters indicated a linear relationship, suggesting that the speakers actually exploit the inter-articulator compensation in the speech production. This explains the observed large “target” value variability. The comparison of parameter trajectories for the same sentences uttered by the two speakers indicates similitude rather than difference, suggesting that the manner of the production involving the compensatory articulation could be relatively invariant.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bothorel, A., Simon, P., Wioland, F., et Zerling, J. P. (1986). Cinéradiographie des Voyelles et Consonnes du Français, Travaux de l’Institut de Phonétique de Strasbourg.
Google Scholar
Coker, C. and Fujimura, O. (1966). “Model for specification of the vocal tract area function,” J. Acoust. Soc. Am., 40, 1271.
Article Google Scholar
Coker, C. (1976). “A model of articulatory dynamics and control,” Proc. IEEE, 64(4), 452–460.
Article Google Scholar
Condax, I. D. (1979). “Mandible position and stress,” Proceedings of ASA50, J. J. Wolfs and D. H. Klatt (editors), 131–134.
Google Scholar
Gauffin, J. and Sundberg, J. (1978). “Pharyngeal constrictions,” Phonetica, 35, 157–168.
Article Google Scholar
Gay, T., Lindblom, B., and Lubker J. (1981). “Production of bite-block vowels: Acoustic equivalence by selective compensation,” J. Acoust. soc. Am., 69(3), 802–810.
Article Google Scholar
Harshman, R., Ladefoged, P., and Goldstein, L. (1977). “Factor analysis of tongue shapes,” J. Acoust. Soc. Am., 62(3), 693–707.
Article Google Scholar
Heinz, J. M. and Stevens, K. N. (1964). “On the derivation of area functions and acoustic spectra from cineradiographic films of speech,” J. Acoust. Soc. Am., 36, 1037.
Article Google Scholar
Henke, W. L. (1967). “Preliminaries to speech synthesis based upon an articulatory model,” Proc. IEEE Conf. Speech Commun. Process., 170–182.
Google Scholar
Hughes, O. M. and Abbs, J. H. (1976). “Labial-mandibular coordination in the production of speech: Implication for the operation of motor equivalence,” Phonetica, 33, 199–221.
Article Google Scholar
Jackson, M. T. T. (1988). “Analysis of tongue positions: Language-specific and cross-linguistic models,” J. Acoust. Soc. Am., 84(1), 124–143.
Article Google Scholar
Kiritani, S., Sekimoto, S., Imagawa, H., and Fujisaki, H. (1977). “Parameter description of the tongue movements for vowels,” Contribution papers of 9-th ICA, Madrid, Vol. 1, I13, 419.
Google Scholar
Liljencrants, J. (1971). “Fourier series description of the tongue profile,” Speech Transmission Laboratory, Royal Institute of Technology, Stockholm, Sweden, QPSR-4, 9–18.
Google Scholar
Lindblom, B. E. F. and Sundberg, J. E. F. (1971). “Acoustical consequences of lip, tongue, jaw, and larynx movement,” J. Acoust. Soc. Am., 4 (Part 2), 1166–1179.
Article Google Scholar
Lindblom, B., Lubker, J., and Gay, T. (1979). “Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation,” J. Phonet. 7, 147–161.
Google Scholar
Maeda, S. (1978). “Une analyses statistique sur les positions de la tongue: Etude préliminaire sur les voyelles françaises,” 9èmes JEP, GALF, 191–199.
Google Scholar
Maeda, S. (1979). “Une modèle articulatoire de la tongue avec des composantes linéaires,” l0émes JEP, GALF, 152–164.
Google Scholar
Maeda, S. (1982). “A digital simulation method of vocal-tract system,” Speech Communication, 1, 199–229.
Article Google Scholar
Majid, R., Boë, L.J., and Perrier, P. (1986). “Fonctions de sensibilité, modèle articulatoire et voyelles du français,” 15émes JEP, GALF, 59–63.
Google Scholar
Martin, J. G. and Bunnell, H. T. (1982). “Perception of anticipatory coarticulation effects in vowel-stop consonant-vowel sequences,” J. Exp. Psychol.: Human Percept. Perform. 8, 473–488.
Article Google Scholar
Mermelstein, P. (1973). “Articulatory model for the study of speech production,” J. Acoust. Soc. Am., 53, 1070–1082.
Article Google Scholar
Öhman, S. E. F. (1966). “Coarticulation in VCV utterances: Spectrographs measurements,” J. Acoust. Soc. Am., 39, 151–168.
Article Google Scholar
Overall, J. E. (1962). “Orthogonal factors and uncorrelated factor scores,” Psychological Reports, 10, 651–662.
Google Scholar
Perrier, P. et Boë, L. J. (1989). “Passage de la coupe sagittale à la fonction d’aire: les zones de faibles dimensions,” J. Acoustique, 2, 59–67.
Google Scholar
Stone, M. (1981). “Evidence for a rhythm pattern in speech production: Observations of jaw movement,” J. Phonetics, 9, 109–120.
Google Scholar
Vaissière, J. (1987). “The use of allophonic variations of /a/ in automatic continuous speech recognition of French,” Speech Communication Group Working Papers, Research Laboratory of Electronics, Massachusetts Institute of Technology, Volume V, 15–25.
Google Scholar
Zerling, J. P. (1979). Articulation et Coarticulation dans les Groupes Occlusive-Voyelle en Francais, Doctorat de Troisième Cycle (phonétique), Université de Nancy II.
Google Scholar

Download references

Author information

Authors and Affiliations

Département de Recherche en Communication par la Parole, Centre National d’Etudes des Télécommunications (CNET), 22301, Lannion, France
Shinji Maeda

Authors

Shinji Maeda
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Linguistic Science, University of Reading, Reading, UK
William J. Hardcastle
C.N.R.S., Aix-en-Provence, France
Alain Marchal

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Maeda, S. (1990). Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model. In: Hardcastle, W.J., Marchal, A. (eds) Speech Production and Speech Modelling. NATO ASI Series, vol 55. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-2037-8_6

Download citation

DOI: https://doi.org/10.1007/978-94-009-2037-8_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-7414-8
Online ISBN: 978-94-009-2037-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics