Skip to main content
main-content

Über dieses Buch

Extraction and Representation of Prosodic Features for Speech Processing Applications deals with prosody from speech processing point of view with topics including:

The significance of prosody for speech processing applicationsWhy prosody need to be incorporated in speech processing applicationsDifferent methods for extraction and representation of prosody for applications such as speech synthesis, speaker recognition, language recognition and speech recognition

This book is for researchers and students at the graduate level.

Inhaltsverzeichnis

Frontmatter

Chapter 1. Significance of Prosody for Speaker, Language and Speech Recognition

Speech signal carries characteristics of the speaker, language and the sound unit, and it is difficult separate out features specific to speaker, language and sound unit. Human beings recognize speaker, language and speech using multiple cues present in speech and evidence combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language and speech recognition systems mostly rely on spectral/cepstral features which are affected by channel mismatch and noise. Therefore incorporation of prosody into these automatic recognition tasks will make them more robust and human like. In this chapter, the term prosody and its significance for speaker, language and speech recognition tasks are discussed. Human way of recognition is discussed follwed by the speaker-specific, language-specific and speech-specific aspects of prosody.

Leena Mary

Chapter 2. Automatic Extraction of Prosody for Speaker, Language and Speech Recognition

The discussions in Chapter 1 is on the automatic extraction of prosodic features for recognizing speaker, language and speech. In this chapter, different techniques suggested for automatic extraction of prosodic features are described. The techniques are broadly classified as ASR free and ASR based approaches. Techniques are further classified on the basis of segmentation approach.

Leena Mary

Chapter 3. Modeling and Integration of Prosody for Speaker, Language and Speech Recognition

Methods for extraction and representation of prosodic features were discussed in the previous chapter. Now the extracted prosodic features should be represented in a manner useful for the specific recognition task. Since prosody reflects the speaker-specific, language-specific and sound-specific. Representation should be such a way that it brings out the differences among speaker/language/sound units. Then it should be modeled appropriately. Different modeling techniques have been employed by the researchers for building prosodic models. Various methods of integration have been employed for combining evidence from prosodic models with evidence from other knowledge sources such as acoustic models and language models.

Leena Mary

Backmatter

Weitere Informationen

BranchenIndex Online

Die B2B-Firmensuche für Industrie und Wirtschaft: Kostenfrei in Firmenprofilen nach Lieferanten, Herstellern, Dienstleistern und Händlern recherchieren.

Whitepaper

- ANZEIGE -

INDUSTRIE 4.0

Der Hype um Industrie 4.0 hat sich gelegt – nun geht es an die Umsetzung. Das Whitepaper von Protolabs zeigt Unternehmen und Führungskräften, wie sie die 4. Industrielle Revolution erfolgreich meistern. Es liegt an den Herstellern, die besten Möglichkeiten und effizientesten Prozesse bereitzustellen, die Unternehmen für die Herstellung von Produkten nutzen können. Lesen Sie mehr zu: Verbesserten Strukturen von Herstellern und Fabriken | Konvergenz zwischen Soft- und Hardwareautomatisierung | Auswirkungen auf die Neuaufstellung von Unternehmen | verkürzten Produkteinführungszeiten
Jetzt gratis downloaden!

Bildnachweise