Weitere Kapitel dieses Buchs durch Wischen aufrufen
The automatic detection of people’s identity from their voices is part of modern telecommunication services. This generally requires the telephone transmission of the speech to remote servers that perform the recognition task. The transmission may introduce severe distortions that degrade the system performance and hence represents one of the major challenges speech technologies are currently facing. Similarly, humans also cope with the difficulty of reliably identifying talkers from speech transmitted over communication channels, particularly if the utterance heard is of short duration. The present book addresses the evaluation of the human and of the automatic speaker recognition performances under different channel distortions caused by bandwidth limitation, codecs, and electro-acoustic user interfaces, among other impairments. The main contribution of this work is the demonstration of the benefits of communication channels of extended bandwidth, together with an insight into how speaker-specific characteristics of speech are preserved through different transmissions. This book intends to assist students, researchers, and engineers to assess the speaker recognition performance employing transmitted speech. Particularly interesting for network engineers, this work provides sufficient motivation for considering speaker recognition as a criterion for the migration from narrowband to enhanced bandwidths, such as wideband and super-wideband. This book was written in the context of my PhD project, a research agreement between the Telekom Innovation Laboratories and the Technische Universität Berlin (TU Berlin), Germany, and the University of Canberra (UC), Australia.
Bitte loggen Sie sich ein, um Zugang zu diesem Inhalt zu erhalten
Sie möchten Zugang zu diesem Inhalt erhalten? Dann informieren Sie sich jetzt über unsere Produkte:
Laura Fernández Gallardo
- Springer Singapore
- Chapter 1
Neuer Inhalt/© Filograph | Getty Images | iStock