Abstract
Temporal variations observed in the vocal-tract profiles during continuous speech were studied using a factor analysis. Roughly 1000 frames of cineradiographic and labiofilm data corresponding to 10 French sentences uttered by two speakers have been analyzed. The factor analysis permits us to describe the observed profiles as the sum of a small number of linear components. Since these components can be interpreted by articulatory terms, the linear model is considered as an articulatory one. For example, the tongue profiles can be specified by the following four parameters; the mandibular and tongue-dorsal positions, the dorsal shape, and the apex position. The entire vocal-tract configurations including the frontal lip-opening shapes can be described with reasonable accuracy by as few as seven parameters in total.
With this model the temporal variations are described in terms of the frame-by-frame samples of these articulatory parameters. We observe that “target” parameter values for the same vowel vary significantly presumably due to different phonetic contexts. An acoustic calculation with the model predicts that a particular pair of articulators can compensate acoustically each other. For example, by an appropriate adjustment of the tongue-dorsal position, the model is capable of producing the same F1-F2 pattern for different jaw position, or vice-versa. The compensation between the jaw and the dorsal positions, however, is possible only for unrounded vowels. In the case of rounded vowels, the jaw position can be compensated by the lip aperture.
The measured “target” values of the paired parameters indicated a linear relationship, suggesting that the speakers actually exploit the inter-articulator compensation in the speech production. This explains the observed large “target” value variability. The comparison of parameter trajectories for the same sentences uttered by the two speakers indicates similitude rather than difference, suggesting that the manner of the production involving the compensatory articulation could be relatively invariant.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bothorel, A., Simon, P., Wioland, F., et Zerling, J. P. (1986). Cinéradiographie des Voyelles et Consonnes du Français, Travaux de l’Institut de Phonétique de Strasbourg.
Coker, C. and Fujimura, O. (1966). “Model for specification of the vocal tract area function,” J. Acoust. Soc. Am., 40, 1271.
Coker, C. (1976). “A model of articulatory dynamics and control,” Proc. IEEE, 64(4), 452–460.
Condax, I. D. (1979). “Mandible position and stress,” Proceedings of ASA50, J. J. Wolfs and D. H. Klatt (editors), 131–134.
Gauffin, J. and Sundberg, J. (1978). “Pharyngeal constrictions,” Phonetica, 35, 157–168.
Gay, T., Lindblom, B., and Lubker J. (1981). “Production of bite-block vowels: Acoustic equivalence by selective compensation,” J. Acoust. soc. Am., 69(3), 802–810.
Harshman, R., Ladefoged, P., and Goldstein, L. (1977). “Factor analysis of tongue shapes,” J. Acoust. Soc. Am., 62(3), 693–707.
Heinz, J. M. and Stevens, K. N. (1964). “On the derivation of area functions and acoustic spectra from cineradiographic films of speech,” J. Acoust. Soc. Am., 36, 1037.
Henke, W. L. (1967). “Preliminaries to speech synthesis based upon an articulatory model,” Proc. IEEE Conf. Speech Commun. Process., 170–182.
Hughes, O. M. and Abbs, J. H. (1976). “Labial-mandibular coordination in the production of speech: Implication for the operation of motor equivalence,” Phonetica, 33, 199–221.
Jackson, M. T. T. (1988). “Analysis of tongue positions: Language-specific and cross-linguistic models,” J. Acoust. Soc. Am., 84(1), 124–143.
Kiritani, S., Sekimoto, S., Imagawa, H., and Fujisaki, H. (1977). “Parameter description of the tongue movements for vowels,” Contribution papers of 9-th ICA, Madrid, Vol. 1, I13, 419.
Liljencrants, J. (1971). “Fourier series description of the tongue profile,” Speech Transmission Laboratory, Royal Institute of Technology, Stockholm, Sweden, QPSR-4, 9–18.
Lindblom, B. E. F. and Sundberg, J. E. F. (1971). “Acoustical consequences of lip, tongue, jaw, and larynx movement,” J. Acoust. Soc. Am., 4 (Part 2), 1166–1179.
Lindblom, B., Lubker, J., and Gay, T. (1979). “Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation,” J. Phonet. 7, 147–161.
Maeda, S. (1978). “Une analyses statistique sur les positions de la tongue: Etude préliminaire sur les voyelles françaises,” 9èmes JEP, GALF, 191–199.
Maeda, S. (1979). “Une modèle articulatoire de la tongue avec des composantes linéaires,” l0émes JEP, GALF, 152–164.
Maeda, S. (1982). “A digital simulation method of vocal-tract system,” Speech Communication, 1, 199–229.
Majid, R., Boë, L.J., and Perrier, P. (1986). “Fonctions de sensibilité, modèle articulatoire et voyelles du français,” 15émes JEP, GALF, 59–63.
Martin, J. G. and Bunnell, H. T. (1982). “Perception of anticipatory coarticulation effects in vowel-stop consonant-vowel sequences,” J. Exp. Psychol.: Human Percept. Perform. 8, 473–488.
Mermelstein, P. (1973). “Articulatory model for the study of speech production,” J. Acoust. Soc. Am., 53, 1070–1082.
Öhman, S. E. F. (1966). “Coarticulation in VCV utterances: Spectrographs measurements,” J. Acoust. Soc. Am., 39, 151–168.
Overall, J. E. (1962). “Orthogonal factors and uncorrelated factor scores,” Psychological Reports, 10, 651–662.
Perrier, P. et Boë, L. J. (1989). “Passage de la coupe sagittale à la fonction d’aire: les zones de faibles dimensions,” J. Acoustique, 2, 59–67.
Stone, M. (1981). “Evidence for a rhythm pattern in speech production: Observations of jaw movement,” J. Phonetics, 9, 109–120.
Vaissière, J. (1987). “The use of allophonic variations of /a/ in automatic continuous speech recognition of French,” Speech Communication Group Working Papers, Research Laboratory of Electronics, Massachusetts Institute of Technology, Volume V, 15–25.
Zerling, J. P. (1979). Articulation et Coarticulation dans les Groupes Occlusive-Voyelle en Francais, Doctorat de Troisième Cycle (phonétique), Université de Nancy II.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1990 Kluwer Academic Publishers
About this chapter
Cite this chapter
Maeda, S. (1990). Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model. In: Hardcastle, W.J., Marchal, A. (eds) Speech Production and Speech Modelling. NATO ASI Series, vol 55. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-2037-8_6
Download citation
DOI: https://doi.org/10.1007/978-94-009-2037-8_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-7414-8
Online ISBN: 978-94-009-2037-8
eBook Packages: Springer Book Archive