Published in: Microsystem Technologies 3/2019

16.01.2019 | Technical Paper

FPGA based dual microphone speech enhancement

Authors: Tanmay Biswas, Sudhindu Bikash Mandal, Debasri Saha, Amlan Chakrabarti

Abstract

This paper proposes an efficient reconfigurable hardware design for dual-microphone speech enhancement that combines sound source localization with multi-band spectral subtraction to eliminate background noise. First, a time delay estimation algorithm based on the phase transform (PHAT) estimates the time difference between the two microphone signals. A PHAT-based filter can reach high SNR gains, which makes it well suited to localizing a sound source in a microphone array system. After the delay between the signals has been compensated, multi-band spectral subtraction separates the speech from the background noise in each frequency band. The design has been implemented on a Spartan-6 LX45 FPGA, and the implementation results are presented in terms of subjective and objective evaluations. We also compared the angular separation between the microphones with the resultant angle between the microphones and observed that the proposed design is highly accurate at every corresponding angle. The objective and subjective evaluations establish that the design provides better throughput than existing state-of-the-art work. The evaluation indicates that the proposed hardware is feasible for hand-held devices in environments with background noise.
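The first stage described in the abstract, PHAT-weighted delay estimation followed by delay compensation, can be illustrated in software. The following is a minimal NumPy sketch, not the authors' FPGA implementation: the function names (`gcc_phat`, `delay_to_angle`), the far-field geometry, the speed of sound of 343 m/s, and the sample rate and microphone spacing used in the example are all assumptions made for illustration.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay of `sig` relative to `ref` in seconds (GCC-PHAT).

    A positive result means `sig` lags `ref`. Hypothetical helper for
    illustration only; it is not taken from the paper.
    """
    n = len(sig) + len(ref)                    # zero-pad to avoid circular wrap-around
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    cross = SIG * np.conj(REF)                 # cross-power spectrum
    cross /= np.maximum(np.abs(cross), 1e-12)  # PHAT weighting: keep phase only
    cc = np.fft.irfft(cross, n=n)              # generalized cross-correlation

    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))
    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(cc)) - max_shift
    return shift / fs

def delay_to_angle(tau, mic_distance, c=343.0):
    """Convert a delay to an arrival angle in degrees from broadside.

    Assumes a far-field source and a speed of sound `c` in m/s.
    """
    ratio = np.clip(c * tau / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(ratio)))

# Example with assumed values: 16 kHz sampling, microphones 10 cm apart.
fs = 16000
rng = np.random.default_rng(0)
ref = rng.standard_normal(4096)                  # stand-in for microphone 1
delay = 3                                        # true delay in samples
sig = np.concatenate((np.zeros(delay), ref[:-delay]))  # microphone 2 lags by 3 samples

tau = gcc_phat(sig, ref, fs, max_tau=0.1 / 343.0)
aligned = np.roll(sig, -int(round(tau * fs)))    # compensate the delay before enhancement
angle = delay_to_angle(tau, mic_distance=0.1)
```

Limiting the search to `max_tau` (the microphone spacing divided by the speed of sound) restricts the peak search to physically possible delays. A hardware design would typically replace the floating-point FFTs with fixed-point cores, which this sketch does not attempt to model.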

Metadata
Title
FPGA based dual microphone speech enhancement
Authors
Tanmay Biswas
Sudhindu Bikash Mandal
Debasri Saha
Amlan Chakrabarti
Publication date
16.01.2019
Publisher
Springer Berlin Heidelberg
Published in
Microsystem Technologies / Issue 3/2019
Print ISSN: 0946-7076
Electronic ISSN: 1432-1858
DOI
https://doi.org/10.1007/s00542-019-04299-1