
2012 | OriginalPaper | Book chapter

9. Characterization of Noise Associated with Forensic Speech Samples

Written by: Jiju P. V., M.Sc., C. P. Singh, Ph.D., R. M. Sharma, Ph.D.

Published in: Forensic Speaker Recognition

Publisher: Springer New York


Abstract

Many speech enhancement methods have been developed over the past decades. This study characterizes the noises associated with forensic speech samples and classifies them in order to identify a specific set of filtering techniques for speech recognition and speaker identification. The noisy speech samples were collected from exhibits received for case examination in the laboratory. The experiment was twofold: enhancing the speech for (i) speech recognition and (ii) speaker identification. For speech recognition, the original and simulated samples were subjected to various filtering techniques, namely FFT filter, noise reduction, noise gate, notch filter, bandpass, Butterworth filter, digital equalizer and parametric equalizer. For speaker identification, noise reduction, noise gate, notch filter, bandpass and Butterworth filter were applied to the noisy speech samples. The noise embedded in the noisy speech samples was characterized on the basis of these filtering techniques and the subsequent analysis performed with the Computerized Speech Laboratory (CSL). For speech recognition, the maximum SNR improvement was achieved by the FFT filter on the samples Noisy Speech-I (Direct Recording), Noisy Speech-II (Telephonic Landline Recording) and Noisy Speech-III (Mobile Phone Recording). The corresponding SNR improvements were 3.81, 7.57 and 5.62 dB for the original samples and 4.39, 6.26 and 5.57 dB for the simulated samples. Applied to Noisy Speech-I, Noisy Speech-II and Noisy Speech-III, the FFT filter gave an improvement of 75, 71 and 48% for the original noisy speech samples and 82, 78 and 52% for the simulated noisy speech samples.
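An SNR improvement in decibels is conventionally the difference between the signal-to-noise ratio after filtering and the ratio before it. The abstract does not state how SNR was measured with CSL, so the following is only a minimal sketch of that arithmetic, assuming separate speech and noise-only segments are available. The function names and the toy sample values are illustrative and are not taken from the chapter.

```python
import math

def mean_power(samples):
    """Average power (mean of squared amplitudes) of a sample sequence."""
    return sum(s * s for s in samples) / len(samples)

def snr_db(speech, noise):
    """SNR in dB from a speech segment and a noise-only segment."""
    return 10.0 * math.log10(mean_power(speech) / mean_power(noise))

def snr_improvement_db(speech, noise_before, noise_after):
    """Gain in SNR (dB) when the noise floor changes from before to after."""
    return snr_db(speech, noise_after) - snr_db(speech, noise_before)

# Toy values only (not data from the study): halving the noise amplitude
# quarters its power, which corresponds to roughly a 6.02 dB improvement.
speech = [0.5, -0.5, 0.5, -0.5]
noise_before = [0.1, -0.1, 0.1, -0.1]
noise_after = [0.05, -0.05, 0.05, -0.05]
improvement = snr_improvement_db(speech, noise_before, noise_after)
```

On this scale, an improvement of about 6 dB corresponds to reducing the noise power to a quarter of its original value.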
For speaker identification, the maximum improvement was achieved by the noise reduction filter. Applied to Noisy Speech-I, Noisy Speech-II and Noisy Speech-III, it gave an improvement of 60, 64 and 52% for the original noisy speech samples and 64, 70 and 54% for the simulated noisy speech samples. A statistical study of the enhanced original and simulated noisy speech samples after filtering revealed the degree of efficiency of the different filters for speaker identification and how far they can be relied upon in adverse forensic contexts. For speech recognition, the filters rank in descending order of efficiency in enhancing the speech signal as follows: FFT filter, noise reduction, noise gate, notch filter, bandpass, Butterworth filter, digital equalizer and parametric equalizer. For speaker identification, the descending order of efficiency is: noise reduction, noise gate, notch filter, bandpass and Butterworth filter.
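Of the filters listed, the noise gate is the simplest to illustrate: it passes portions of the signal whose level exceeds a threshold and attenuates the rest. The abstract gives no settings for the gate used in the study, so the sketch below is a generic frame-based gate. The threshold, frame length, attenuation and sample values are all hypothetical.

```python
import math

def noise_gate(samples, threshold, frame_len=4, attenuation=0.0):
    """Attenuate every frame whose RMS level falls below `threshold`.

    All parameter values here are illustrative assumptions, not the
    settings used in the chapter.
    """
    out = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(s * s for s in frame) / len(frame))
        # Frames at or above the threshold pass unchanged; others are scaled.
        gain = 1.0 if rms >= threshold else attenuation
        out.extend(s * gain for s in frame)
    return out

# Toy signal: one low-level (noise-only) frame followed by one louder frame.
quiet = [0.01, -0.02, 0.01, -0.01]
loud = [0.4, -0.5, 0.45, -0.4]
gated = noise_gate(quiet + loud, threshold=0.1)
```

In this toy case the quiet frame is silenced and the loud frame is returned unchanged. A practical gate would normally add attack and release smoothing so that the gain does not switch abruptly between frames.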

Metadata
Title
Characterization of Noise Associated with Forensic Speech Samples
Written by
Jiju P. V., M.Sc.
C. P. Singh, Ph.D.
R. M. Sharma, Ph.D.
Copyright year
2012
Publisher
Springer New York
DOI
https://doi.org/10.1007/978-1-4614-0263-3_9
