2019 | Book

Audio Processing and Speech Recognition

Concepts, Techniques and Research Overviews

Authors: Prof. Dr. Soumya Sen, Anjan Dutta, Dr. Nilanjan Dey

Publisher: Springer Singapore

Book Series: SpringerBriefs in Applied Sciences and Technology

About this book

This book offers an overview of audio processing, including the latest advances in the methodologies used in audio processing and speech recognition. First, it discusses the importance of audio indexing and the classical information retrieval problem, and presents two major indexing techniques, namely Large Vocabulary Continuous Speech Recognition (LVCSR) and Phonetic Search. It then offers brief insights into the human speech production system and its modeling, which are required to produce artificial speech. It also discusses the various components of an automatic speech recognition (ASR) system. Describing the chronological developments in ASR systems, and briefly examining the statistical models used in ASR as well as the related mathematical deductions, the book summarizes a number of state-of-the-art classification techniques and their application in audio/speech classification. By providing insights into various aspects of audio/speech processing and speech recognition, this book appeals to a wide audience, from researchers and postgraduate students to those new to the field.


Table of Contents

Frontmatter
Chapter 1. Audio Indexing
Abstract
Audio is available from various sources, such as recordings of meetings, newscasts, and telephone conversations. With ongoing technological progress, more and more digital audio, video, and images are being captured and stored every day. The amount of audio data on the web and in other information storehouses is increasing exponentially. Using this huge volume of multimedia data efficiently requires an effective search technique.
Soumya Sen, Anjan Dutta, Nilanjan Dey
Chapter 2. Speech Processing and Recognition System
Abstract
In the first decade of the twentieth century, scientists in the Bell System realized that universal services such as telephony were becoming feasible owing to a large-scale technological revolution [1].
Soumya Sen, Anjan Dutta, Nilanjan Dey
Chapter 3. Feature Extraction
Abstract
Feature extraction is a prerequisite for classifying any audio or speech signal. The analog speech signal s(t) is sampled a number of times per second so that it can be stored on a recording device or simply on a computer.
Soumya Sen, Anjan Dutta, Nilanjan Dey
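The sampling step mentioned in the Chapter 3 abstract can be illustrated with a minimal sketch. It is not taken from the book: the function names, the 440 Hz test tone, and the 8000 Hz sampling rate are assumptions chosen purely for illustration.

```python
import math


def sample_signal(s, sample_rate_hz, duration_s):
    """Return the discrete samples s[n] = s(n / sample_rate_hz)."""
    num_samples = int(round(sample_rate_hz * duration_s))
    return [s(n / sample_rate_hz) for n in range(num_samples)]


def tone(t, freq_hz=440.0):
    """Hypothetical stand-in for an analog signal s(t): a pure sine tone."""
    return math.sin(2.0 * math.pi * freq_hz * t)


# Assumed values: 8000 samples per second, 10 ms of signal.
samples = sample_signal(tone, sample_rate_hz=8000, duration_s=0.01)
print(len(samples))
```

A higher sampling rate stores more values per second and so preserves more of the original waveform, at the cost of more storage.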
Chapter 4. Audio Classification
Abstract
Classification falls under supervised learning. Supervised learning is the process of learning from a given training dataset in which both the input data and the corresponding output data are provided. Decision rules are designed by observing the training dataset and are then used to determine the category or class of future inputs. Classification is the process of assigning an individual item or dataset to one of a number of existing categories or classes, depending on the characteristics or features of the input data.
Soumya Sen, Anjan Dutta, Nilanjan Dey
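The supervised setting described in the Chapter 4 abstract can be sketched with a nearest-centroid classifier. This is one simple technique picked for illustration, not necessarily one the book covers; the two-dimensional feature vectors and the "speech"/"music" labels are hypothetical toy values.

```python
import math


def train_centroids(features, labels):
    """Learn one mean feature vector (centroid) per class from labeled data."""
    sums, counts = {}, {}
    for x, y in zip(features, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def classify(centroids, x):
    """Decision rule: assign x to the class whose centroid is closest."""
    return min(centroids, key=lambda y: math.dist(centroids[y], x))


# Hypothetical training set: inputs paired with their known output classes.
train_x = [[0.10, 0.80], [0.12, 0.75], [0.40, 0.30], [0.45, 0.35]]
train_y = ["speech", "speech", "music", "music"]

centroids = train_centroids(train_x, train_y)
print(classify(centroids, [0.15, 0.70]))
```

The training step plays the role of designing the decision rules from labeled examples, and `classify` applies those rules to an unseen input.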
Chapter 5. Conclusion
Abstract
Audio/speech processing is a special case of digital signal processing (DSP), applied to process and analyze speech signals. Typical applications of speech processing include speech recognition, speech coding, speaker authentication, speech enhancement, noise detection and removal, speech synthesis, and text-to-speech conversion. This book provides an in-depth discussion of audio processing and automatic speech recognition.
Soumya Sen, Anjan Dutta, Nilanjan Dey
Metadata
Title
Audio Processing and Speech Recognition
Authors
Prof. Dr. Soumya Sen
Anjan Dutta
Dr. Nilanjan Dey
Copyright Year
2019
Publisher
Springer Singapore
Electronic ISBN
978-981-13-6098-5
Print ISBN
978-981-13-6097-8
DOI
https://doi.org/10.1007/978-981-13-6098-5