Skip to main content
main-content

Über dieses Buch

Extraction and Representation of Prosodic Features for Speech Processing Applications deals with prosody from speech processing point of view with topics including:

The significance of prosody for speech processing applicationsWhy prosody need to be incorporated in speech processing applicationsDifferent methods for extraction and representation of prosody for applications such as speech synthesis, speaker recognition, language recognition and speech recognition

This book is for researchers and students at the graduate level.

Inhaltsverzeichnis

Frontmatter

Chapter 1. Significance of Prosody for Speaker, Language and Speech Recognition

Speech signal carries characteristics of the speaker, language and the sound unit, and it is difficult separate out features specific to speaker, language and sound unit. Human beings recognize speaker, language and speech using multiple cues present in speech and evidence combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language and speech recognition systems mostly rely on spectral/cepstral features which are affected by channel mismatch and noise. Therefore incorporation of prosody into these automatic recognition tasks will make them more robust and human like. In this chapter, the term prosody and its significance for speaker, language and speech recognition tasks are discussed. Human way of recognition is discussed follwed by the speaker-specific, language-specific and speech-specific aspects of prosody.
Leena Mary

Chapter 2. Automatic Extraction of Prosody for Speaker, Language and Speech Recognition

The discussions in Chapter 1 is on the automatic extraction of prosodic features for recognizing speaker, language and speech. In this chapter, different techniques suggested for automatic extraction of prosodic features are described. The techniques are broadly classified as ASR free and ASR based approaches. Techniques are further classified on the basis of segmentation approach.
Leena Mary

Chapter 3. Modeling and Integration of Prosody for Speaker, Language and Speech Recognition

Methods for extraction and representation of prosodic features were discussed in the previous chapter. Now the extracted prosodic features should be represented in a manner useful for the specific recognition task. Since prosody reflects the speaker-specific, language-specific and sound-specific. Representation should be such a way that it brings out the differences among speaker/language/sound units. Then it should be modeled appropriately. Different modeling techniques have been employed by the researchers for building prosodic models. Various methods of integration have been employed for combining evidence from prosodic models with evidence from other knowledge sources such as acoustic models and language models.
Leena Mary

Backmatter

Weitere Informationen

BranchenIndex Online

Die B2B-Firmensuche für Industrie und Wirtschaft: Kostenfrei in Firmenprofilen nach Lieferanten, Herstellern, Dienstleistern und Händlern recherchieren.

Whitepaper

- ANZEIGE -

Globales Erdungssystem in urbanen Kabelnetzen

Bedingt durch die Altersstruktur vieler Kabelverteilnetze mit der damit verbundenen verminderten Isolationsfestigkeit oder durch fortschreitenden Kabelausbau ist es immer häufiger erforderlich, anstelle der Resonanz-Sternpunktserdung alternative Konzepte für die Sternpunktsbehandlung umzusetzen. Die damit verbundenen Fehlerortungskonzepte bzw. die Erhöhung der Restströme im Erdschlussfall führen jedoch aufgrund der hohen Fehlerströme zu neuen Anforderungen an die Erdungs- und Fehlerstromrückleitungs-Systeme. Lesen Sie hier über die Auswirkung von leitfähigen Strukturen auf die Stromaufteilung sowie die Potentialverhältnisse in urbanen Kabelnetzen bei stromstarken Erdschlüssen. Jetzt gratis downloaden!

Bildnachweise