2021 | OriginalPaper | Book chapter

Use of Median Timbre Features for Speaker Identification of Whispering Sound

Written by: Vijay M. Sardar, Manisha L. Jadhav, Saurabh H. Deshmukh

Published in: Techno-Societal 2020

Publisher: Springer International Publishing

Abstract

Identifying a speaker from whispered speech is more difficult than from neutral speech, because voiced phonation is absent in a whisper. The performance of a speaker identification system depends largely on selecting audio features suited to the type of database and the application. This paper examines the available audio features and emphasizes a set of selected timbral features ranked by a Hybrid Selection Algorithm. A limited number of timbral features, namely MFCC, roll-off, brightness, roughness, and irregularity, are found to outperform the other features examined when tested on the CHAIN database. The possibility of using median-based features is also investigated. Using median timbral features improved speaker identification accuracy by 2.4% compared with timbral features alone in the whisper-train, whisper-test scenario.
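The abstract does not spell out how the median-based features are computed. As a minimal sketch, assuming they are obtained by taking the per-feature median across the frames of an utterance, the following Python snippet contrasts mean and median aggregation. The function name `aggregate_frames` and all numeric values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from statistics import mean, median


def aggregate_frames(frames, reducer):
    """Collapse per-frame feature vectors into one utterance-level vector."""
    if not frames:
        raise ValueError("need at least one frame")
    # zip(*frames) groups the values of each feature across all frames.
    return [reducer(column) for column in zip(*frames)]


# Hypothetical per-frame values for two features (roll-off in Hz, brightness).
# The numbers are made up; the last frame mimics a short noise burst.
frames = [
    [3000.0, 0.40],
    [3100.0, 0.42],
    [2900.0, 0.41],
    [3050.0, 0.39],
    [9000.0, 0.95],
]

mean_vector = aggregate_frames(frames, mean)
median_vector = aggregate_frames(frames, median)

print("mean:  ", mean_vector)
print("median:", median_vector)
```

In this toy data the single outlier frame pulls the mean roll-off well above the typical frames, while the median stays close to them. Robustness of this kind is one plausible reason to try median statistics on noisy whispered speech, though the abstract itself does not state the authors' motivation.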

Title: Use of Median Timbre Features for Speaker Identification of Whispering Sound
Authors: Vijay M. Sardar, Manisha L. Jadhav, Saurabh H. Deshmukh
Publisher: Springer International Publishing
Book: Techno-Societal 2020
Print ISBN: 978-3-030-69920-8
Electronic ISBN: 978-3-030-69921-5
Copyright year: 2021
DOI: https://doi.org/10.1007/978-3-030-69921-5_4
