1999 | OriginalPaper | Buchkapitel
Talker Normalization with Articulatory Analysis-by-Synthesis
verfasst von : Richard S. McGowan
Erschienen in: Computational Models of Speech Pattern Processing
Verlag: Springer Berlin Heidelberg
Enthalten in: Professional Book Archive
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
Internal articulatory models are used in analysis-by-synthesis to recover the movement of the speech articulators from speech acoustics. The kind of articulatory information that is recovered depends on the application and the available data. In the laboratory some articulatory data may be available along with acoustic data, and in automatic speech recognition only acoustic data is available. While there is more data available in the former than in the latter case, the amount of information sought in recovery is different in the two cases. In the laboratory physically realistic articulatory trajectories are sought, while recovery in automatic speech recognition may simply require transforming the acoustic signal to an abstract articualtory representaion employed by statistical models for subsequent categorization. Both applications require that the internal articulatory models be normalized for each talker, either for realistic recovery or for robust statistical behavior. A method for constructing mappings between the human and the internal model, while simultaneously adjusting the internal model for acoustic matching is presented. The method is tested on x-ray microbeam data taken on human subjects.