Skip to main content

2015 | Buch

Language Identification Using Excitation Source Features

insite
SUCHEN

Über dieses Buch

This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Implicit processing of linear prediction (LP) residual and (ii) Explicit parameterization of linear prediction residual. The book discusses how in implicit processing approach, excitation source features are derived from LP residual, Hilbert envelope (magnitude) of LP residual and Phase of LP residual; and in explicit parameterization approach, LP residual signal is processed in spectral domain to extract the relevant language specific features. The authors further extract source features from these modes, which are combined for enhancing the performance of LID systems. The proposed excitation source features are also investigated for LID in background noisy environments. Each chapter of this book provides the motivation for exploring the specific feature for LID task, and subsequently discuss the methods to extract those features and finally suggest appropriate models to capture the language specific knowledge from the proposed features. Finally, the book discuss about various combinations of spectral and source features, and the desired models to enhance the performance of LID systems.

Inhaltsverzeichnis

Frontmatter
Chapter 1. Introduction
Abstract
This chapter introduces the basic goal of language identification (LID) and its impacts on real-life applications. A brief overview of the basic features used for developing LID systems has been given and different categories of LID systems are also discussed here. Eventually, the primary issues in developing LID systems and the major contributions of this book towards solving those issues have been highlighted.
K. Sreenivasa Rao, Dipanjan Nandi
Chapter 2. Language Identification—A Brief Review
Abstract
This chapter provides compendious reviews about both the explicit and implicit LID systems present in the literature. Existing works related to language identification in Indian context are briefly discussed. The related works about the excitation source features are also presented here. Various speech features and models proposed in the context of language identification are briefly reviewed in this chapter. The motivation for the present work from the existing literature is briefly discussed.
K. Sreenivasa Rao, Dipanjan Nandi
Chapter 3. Implicit Excitation Source Features for Language Identification
Abstract
This chapter discusses about the proposed approaches to model the implicit features of excitation source information for language identification. Excitation source features such as raw LP residual samples, its magnitude and phase components are processed at three different levels: sub-segmental, segmental and supra-segmental levels to capture different aspects of excitation source information for LID task. Further, LID systems are developed by combining the evidences obtained from LID systems built using individual features.
K. Sreenivasa Rao, Dipanjan Nandi
Chapter 4. Parametric Excitation Source Features for Language Identification
Abstract
This chapter describes the proposed methods to extract parametric features at sub-segmental, segmental and supra-segmental levels to capture the language-specific excitation source information. In this work, glottal pulse, spectral and epoch parameters are used for representing sub-segmental, segmental and supra-segmental information present in excitation source signal. Further, these individual features are combined at score level to enhance the accuracy of LID systems by exploiting the non-overlapping information present among the features.
K. Sreenivasa Rao, Dipanjan Nandi
Chapter 5. Complementary and Robust Nature of Excitation Source Features for Language Identification
Abstract
This chapter discusses about the combination of implicit and parametric features of excitation source to enhance the LID accuracy. Further, complementary nature of excitation source and vocal tract features is exploited for improving the LID accuracy. The robustness of proposed language-specific excitation source features is investigated on various noisy background environments.
K. Sreenivasa Rao, Dipanjan Nandi
Chapter 6. Summary and Conclusion
Abstract
This chapter summerizes the overall contents of the book. Major contributions and future scope of work have been highlighted.
K. Sreenivasa Rao, Dipanjan Nandi
Backmatter
Metadaten
Titel
Language Identification Using Excitation Source Features
verfasst von
K. Sreenivasa Rao
Dipanjan Nandi
Copyright-Jahr
2015
Electronic ISBN
978-3-319-17725-0
Print ISBN
978-3-319-17724-3
DOI
https://doi.org/10.1007/978-3-319-17725-0

Neuer Inhalt