1992 | OriginalPaper | Buchkapitel
Improved Broad Phonetic Classification and Segmentation with an Auditory Model
verfasst von : L. Depuydt, J. P. Martens, L. Van Immerseel
Erschienen in: Speech Recognition and Understanding
Verlag: Springer Berlin Heidelberg
Enthalten in: Professional Book Archive
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
We describe a broad phonetic classification and segmentation algorithm based on neural networks and dynamic programming. The basics of our algorithm are outlined in another paper [7], so here we will focus on the introduction of auditory model features replacing the mel-scale parameters. Our auditory model incorporates critical band filtering, short time adaptation and temporal analysis of the auditory nerve responses. Unlike previously proposed synchrony models, it emphasizes the envelope rather than the instantaneous frequency as the carrier of perceptually relevant information.