2009 | OriginalPaper | Chapter
Articulatory Synthesis of Speech and Singing: State of the Art and Suggestions for Future Research
Authors : Bernd J. Kröger, Peter Birkholz
Published in: Multimodal Signals: Cognitive and Algorithmic Issues
Publisher: Springer Berlin Heidelberg
Activate our intelligent search to find suitable subject content or patents.
Select sections of text to find matching patents with Artificial Intelligence. powered by
Select sections of text to find additional relevant content using AI-assisted search. powered by
Articulatory synthesis of speech and singing aims for modeling the production process of speech and singing as human-like or natural as possible. The state of the art is described for all modules of articulatory synthesis systems, i.e. vocal tract models, acoustic models, glottis models, noise source models, and control models generating articulator movements and phonatory control information. While a lot of knowledge is available for the production and for the high quality acoustic realization of
static
spoken and sung sounds it is suggested to improve the quality of control models especially for the generation of articulatory
movements
. Thus the main problem which should be addressed for improving articulatory synthesis over the next years is the development of high quality control concepts. It is suggested to use action based control concepts and to gather control knowledge by imitating natural speech acquisition and singing acquisition scenarios. It is emphasized that teacher-learner interaction and production, perception, and compre hension of auditory as well as of visual and somatosensory infor mation (multi modal information) should be included in the acquisition (i.e. training or learning) procedures.