2013 | OriginalPaper | Book chapter
Results for Variable Speaker and Recording Conditions on Spoken IR in Finnish
Written by: Ville T. Turunen, Mikko Kurimo, Sami Keronen
Published in: Speech and Computer
Publisher: Springer International Publishing
The performance of current spoken information retrieval (IR) systems depends on how well automatic speech recognition (ASR) can provide transcripts of the material for indexing. Besides the design of the ASR system itself, ASR performance is strongly affected by the recording conditions, speakers, speaking style, and speech content. However, the average word error rate in ASR is not a relevant measure for spoken IR, where only the extracted index terms or keywords matter. In this paper, we measure spoken IR performance on varied material, ranging from controlled single-speaker news reading to real-world broadcasts with variable conditions, speakers, and background noise. We also study the effect of multicondition acoustic models and online adaptation, as well as the controlled addition of background babble noise. The experiments are performed in Finnish, an agglutinative and highly inflected language, using morph-based language modelling.
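To illustrate why average word error rate can be a poor proxy for retrieval quality, the following is a minimal toy sketch, not the evaluation used in the paper. It compares a word-level error rate with a simple recall over index terms. The function names, the English example sentences, and the stop-word list are all assumptions made for illustration; the paper itself works with Finnish and morph-based units.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def index_term_recall(reference, hypothesis, stop_words):
    """Fraction of reference index terms that survive in the hypothesis."""
    ref_terms = set(reference.split()) - stop_words
    hyp_terms = set(hypothesis.split()) - stop_words
    return len(ref_terms & hyp_terms) / len(ref_terms)


# Hypothetical data: one reference transcript and two ASR outputs.
STOP_WORDS = {"the", "a", "of", "in"}
reference = "the president visited the city of helsinki"
hyp_a = "a president visited a city in helsinki"    # errors in function words only
hyp_b = "the resident visited the pity of helsinki"  # errors in content words

for name, hyp in (("A", hyp_a), ("B", hyp_b)):
    wer = word_error_rate(reference, hyp)
    recall = index_term_recall(reference, hyp, STOP_WORDS)
    print(f"hypothesis {name}: WER={wer:.2f}, index-term recall={recall:.2f}")
```

In this toy case, hypothesis A has the higher word error rate (three substitutions out of seven words) yet keeps every index term, while hypothesis B has fewer word errors but loses half of the index terms. That contrast is the kind of mismatch the abstract refers to.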