nach oben

Erschienen in:

2012 | OriginalPaper | Buchkapitel

9. Characterization of Noise Associated with Forensic Speech Samples

verfasst von : Jiju P. V., M.Sc., C. P. Singh, Ph.D., R. M. Sharma, Ph.D.

Erschienen in: Forensic Speaker Recognition

Verlag: Springer New York

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

For speech enhancement, different methods have been developed in the past decades. This study has been carried out for characterization of various noises associated with forensic speech samples and their classification to find specific set of filtering technique for speech recognition and speaker identification. Noisy speech samples are collected from the exhibits received in case examination in the laboratory for this study. The experiment is performed in a two-fold way: enhancing the speech for (i) speech recognition and (ii) speaker identification. The original and simulated samples are subjected to various filtering techniques, namely, FFT Filter, noise reduction, noise gate, notch filter, bandpass, butterworth filter, digital equalizer and parametric equalizer for speech recognition. For speaker identification, noise reduction, noise gate, notch filter, bandpass and butterworth filter are applied to the noisy speech samples. Characterization of noise embedded with the noisy speech samples were attained based on the application of these filtering techniques and subsequent analysis performed on them using Computerized Speech Laboratory (CSL). For speech recognition, maximum SNR improvement was achieved by FFT filter on samples Noisy Speech-I (Direct Recording), Noisy Speech-II (Telephonic Landline Recording) and Noisy Speech-III (Mobile Phone Recording). The corresponding improvements in SNR for original and simulated samples were 3.81, 7.57, 5.62 dB and 4.39, 6.26, 5.57 dB respectively. FFT filter, when applied to the Noisy Speech-I, Noisy Speech-II and Noisy Speech-III of original noisy speech samples, have given an improvement of 75, 71 and 48%, whereas simulated noisy speech samples gave an improvement of 82, 78 and 52%. For speaker identification, maximum improvement was achieved by noise reduction filter when applied to the Noisy Speech-I, Noisy Speech-II and Noisy Speech-III of original noisy speech samples, have given an improvement of 60, 64 and 52% whereas simulated noisy speech samples gave an improvement of 64, 70 and 54%. Statistical study of improvised original noisy speech and simulated noisy speech samples after filtering have revealed the degree of efficiency of different filters for Speaker Identification and how far they are dependable in forensic adverse contexts. For Speech Recognition, the degree of efficiency of filters in enhancing the speech signal is found to be in a descending order; viz. FFT Filter, Noise reduction, Noise gate, Notch filter, Bandpass, Butterworth filter, Digital equalizer and Parametric equalizer. The degree of efficiency of filters in enhancing the speech signal for Speaker Identification is found to be in a descending order; viz. Noise Reduction, Noise Gate, Notch filter, Bandpass, and Butterworth filter.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Robust Speaker Recognition in Noisy Environments: Using Dynamics of Speaker-Specific Prosody

Nächstes Kapitel Speech Processing for Robust Speaker Recognition: Analysis and Advancements for Whispered Speech

Koenig BE (1986) Spectrographic voice identification: a forensic survey. J Acoust Soc Am, 79(6):2CrossRef

Atal B (1974) Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. J Acoust Soc Am 55:1304–1312CrossRef

Hermansky H, Morgan N, Bayya A, Kohn P (1991) Compensation for the effect of communication channel in auditory-like analysis of speech (RASTA-PLP). Proceedings of European conference on speech technology, Genova, Italy, pp 1367–1371

Boll SF (1979) Suppression of acoustic noise in speech using spectral subtraction, institute of electrical and electronics engineers. Trans ASSP 27(2)113–120CrossRef

Vaseghi SV (1996) Advanced signal processing and digital noise reduction publisher. Wiley & Teubner, West SussexCrossRef

Manohar K, Rao P (2004) Reduction of burst noises in STSA speech enhancement. Proceedings of international symposium on speech technology and processing systems and oriental COCOSDA-2004, New Delhi, vol 2, pp 254–257

Pal SK, Saxena PK (2000) Enhancement of highly noisy speech signals. J Discrete Math Sci Cryptogr, 3(1–3), 157–172MathSciNetMATH

Cole D, Moody M, Sreedharan S (1997) Robust enhancement of reverberant speech using iterative noise removal, proceeding of ESCA, Eurospeech, Rhodes, Greece, ISSN 1018-4074, 2603–2606

Filiz B, Kumar S, Srinivas N (2000) Noise reduction and echo cancellation front-end for speech codecs, institute of electrical and electronics engineers. Trans Speech Audio Process 11(1):1–13

10.

Manohar K, Rao P (2005) Reduction of burst noises in STSA speech enhancement proceedings of COCOSDA, 254257

11.

Healy EW et al (2007) The effect of smoothing filter slope and spectral frequency on temporal speech information. JASA 121(2):1177–1181MathSciNetCrossRef

12.

Bai MR et al (2007) Comparative study of audio spatializers for dual loudspeaker mobile phones. JASA 121(1):298–309CrossRef

13.

14.

Xi J, Lin Z, Yang Z, Chicharo C (2004) Noise reduction for chaotic signals based on new approach of measuring the signal determinacy. Proceedings of EUSIPCO, pp 293–296

15.

Istvan P (2008) Speech enhancement in the reconstructed phase-space Info Commun J LXIII(7):41–45

16.

Shannon BJ, Paliwal KK (2006) Role of phase estimation in speech enhancement. INTERSPEECH–ICSLP, pp 1423–1426

17.

Wang DL, Lim JS (1982) The unimportance of phase in speech enhancement. IEEE Trans Acoust Speech Signal Process 30:679–681CrossRef

18.

Lyons G, Paliwal KK (2008) Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement. INTERSPEECH, pp 387–390

19.

Falk T, Stadler S, Kleijn WB, Chan G (2007) Noise suppression based on extending a speech-dominated modulation band. Proceedings of ICSLP, pp 970–973

20.

Drullman R, Festen JM, Plomp R (1994) Effect of temporal envelope smearing on speech reception. JASA 95:2670–2680CrossRef

21.

Drullman R, Festen JM, Plomp R (1994) Effect of reducing slow temporal modulations on speech reception. JASA 95:2670–2680CrossRef

22.

Arai T, Pavel M, Hermansky H, Avendano C (1996) Intelligibility of speech with filtered time trajectories of spectral envelopes. Proceedings of ICSLP, pp 2490–2493

23.

Hermansky H, Wan EA, Avendano C (1995) Speech enhancement based on temporal processing. Proceedings of ICASSP, pp 405–408

24.

Hermansky H, Wan E, Avendao C (1994) Noise suppression in cellular communications. 2nd IEEE Workshop IVTTA, pp 85–85

25.

Hollien H, Fitzgerald JT (1977) Speech enhancement techniques for crime lab use. Proceedings, international conference on crime countermeasures, science and engineering, Oxford

26.

Maithani S (2004) Noisy speech analysis for speech recognition and enhancement. Proceedings of international symposium on speech technology and processing systems and oriental COCOSDA-2004, New Delhi, vol 2, pp 192–197

27.

Ephraim Y, Malah D (1985) Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Trans Acoust Speech Signal Process ASSP-33:443–445CrossRef

28.

Lim JS, Oppenheim AV (1979) Enhancement and bandwidth compression of noisy speech. Proc IEEE 67:1586–1604CrossRef

29.

Lim JS, Oppenheim AV (1978) All-pole modelling of degraded speech. IEEE Trans Acoust Speech Signal Process ASSP–26:197–210

30.

Ephraim Y (1992) Statistical model based speech enhancement systems. Proc IEEE 80:1526–1555CrossRef

31.

Singh CP, Jiju PV (2007) Noise handling in forensic acoustics-encountering with real noise. CBI Bull XV(1–3):25–33

32.

Jiju PV, Singh CP, Sharma RM (2009) Study on the selection of specific filters for enhancement of recorded speech for speaker Identification. Open Forensic Sci J 2:29–33, 1874-4028/09 Bentham Open

Titel: Characterization of Noise Associated with Forensic Speech Samples
verfasst von: Jiju P. V., M.Sc.
C. P. Singh, Ph.D.
R. M. Sharma, Ph.D.
Verlag: Springer New York
Buch: Forensic Speaker Recognition
Print ISBN: 978-1-4614-0262-6

Electronic ISBN: 978-1-4614-0263-3

Copyright-Jahr: 2012
DOI: https://doi.org/10.1007/978-1-4614-0263-3_9

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Die Gewinner und Laudatoren des Sustainability Award in Automotive 2024/© Uli Regenscheit | ATZlive, Search Icon, Banner Hanser, Suresh Vittal/© Alteryx, Additiv gefertigte Teile/© Marina_Skoropadskaya | Getty Images | iStock, Warnschild "Land unter"/© Bluedesign / Fotolia, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade, chassis.tech plus 2023/© [M] ATZlive / TÜV SÜD PRODUCT SERVICE GMBH, adäsion-Webinar-Matinee/© krystiannawrocki_ Getty Images

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.