nach oben

Microsystem Technologies

Erschienen in:

16.01.2019 | Technical Paper

FPGA based dual microphone speech enhancement

verfasst von: Tanmay Biswas, Sudhindu Bikash Mandal, Debasri Saha, Amlan Chakrabarti

Erschienen in: Microsystem Technologies | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

This paper proposes an efficient reconfigurable hardware design of dual microphone speech enhancement technique using sound source localization and multi band spectral subtraction methods with elimination of background noise. Firstly, we have used a time delay of arrival algorithm using phase transform (PHAT) to achieve the time difference between the microphone signals. PHAT based filter can reach high SNR gains, which makes it very suitable for localizing the sound source in a microphone array system. After adjustment of the delay between the signals, multi band spectral subtraction technique enhances the signal from the background noise environment in each of the frequency bands. Our design has been implemented in Spartan6 Lx45 FPGA and we have presented the implementation results in terms of subjective and objective evaluations. We have also compared the angular separation between the microphones to the resultant angle between the microphones and observed that proposed design provides very sharp accuracy in every corresponding angle. The objective and subjective evaluation established that our design provides better throughput compared to the existing state of the art research works. The evaluation infers that our proposed hardware induce feasibility for hand-held devices in background noisy environment.

Vorheriger Artikel Extended finite element method (XFEM) analysis of fiber reinforced composites for prediction of micro-crack propagation and delaminations in progressive damage: a review

Nächster Artikel Acoustic mode confinement using coupled cavity structures in UHF unreleased MEMS resonators

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Audumbar KA, Khade RH (2017) High speed FPGA-based data acquisition system. Microprocess Microsyst 49:87–94. https://doi.org/10.1016/j.micpro.2016.11.006 (ISSN 0141-9331)

Benesty J (2000) Adaptive Eigenvalue decomposition algorithm for passive acoustic source localization. Acoustical Society of America

Biswas T, Pal C, Mandal SB, Chakrabarti A (2014) Audio de-noising by spectral subtraction technique implemented on reconfigurable hardware. In: 2014 Seventh international conference on contemporary computing (IC3). Noida, pp 236–241. https://doi.org/10.1109/IC3.2014.6897179

Biswas T, Mandal SB, Saha D, Chakrabarti A (2017) Dual microphone sound source localization using reconfigurable hardware. In: Mandal J, Dutta P, Mukhopadhyay S (eds) Computational intelligence, communications, and business analytics. CICBA 2017. Communications in computer and information science, vol 775. Springer, Singapore

Boll S (1979) Suppression of acoustic noise in speech using spectral subtraction. Acoust Speech Signal Process IEEE Trans 27(2):113–120. https://doi.org/10.1109/TASSP.1979.1163209 ISSN=0096-3518CrossRef

Brandstein M, Silverman H (1997) A practical methodology for speech source localization with microphone arrays. Comput Speech Lang 11(2):91–126CrossRef

Carter GC (1993) Tutorial overview of coherence and time delay estimation. In: Coherence and time delay estimation-an applied tutorial for research. development, test, and evaluation engineers vol 1, pp 1–27

Champagne B, Bédard S, Stéphenne A (1996) Performance of time delay estimation in the presence of room reverberation. IEEE Trans Speech Audio Process 4:148–152CrossRef

Halupka D, Rabi AS, Aarabi P, Sheikholeslami A (2007) Low-power dual-microphone speech enhancement using field programmable gate arrays. IEEE Trans Signal Process 55(7):3526–3535. https://doi.org/10.1109/TSP.2007.893918 MathSciNetCrossRef

Hu Y, Loizou PC (2000) Perceptual evaluation of speech quality (PESQ), and objective method for end-toend of speech quality assessment of narrowband telephone network and speech codecs. ITU-T Rec, p 862

Java Program Techniques for Games (2012) Kinect 15. Kinect Mike, 14th March 2012. https://msdn.microsoft.com/en-us/library/jj131033.aspx

Kamath S, Loizou P (2002) A multiband spectral subtraction method for enhancing speech corrupted by colored noise. Acoustics, speech, and signal processing (ICASSP), 2002 IEEE international conference on , vol 4, pp IV-4164, IV-4164 (13–17 May 02)

Knapp CH, Carter GC (1976) The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol ASSP-24, pp 320–327

Kumar P (2012) Generating, optimizing and verifying HDL code with MATLAB and Simulink. MathWorks. www.mathworks.com/products/hdl-verifier

Li D, Hu YH (2003) Energy based collaborative source localization using acoustic micro-sensor array. EURASIP J Appl Signal Process 2003(4):321–337MATH

McAllister J (2010) FPGA-based DSP. Handbook of signal processing systems. Springer, US, pp 363–392

Nabi W, Aloui N, Cherif A (2016) Speech enhancement in dual microphone mobile phones using Kalman filter. Appl Acoust 109:1–4. https://doi.org/10.1016/j.apacoust.2016.02.009 (ISSN 0003-682X)CrossRef

System Generator for DSP User Guide (2009) UG640 (v11.4) 2 Dec 2009. www.xilinx.com/support/sw-manual

Taff LG (1997) Target localization from bearings-only observations. IEEE Transactions on Aerospace and Electronic Systems, vol 3, no 1. McGrawHill, New York, pp 15–64

Varga A, Steeneken H (1993) Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Commun 12(3):247–251CrossRef

VLSI (2016) The 29th conference on VLSI design and 15th conference on embedded systems—design contest

Xilinx product specifications (2011) Spartan-6 family overview (October 2011)

Xu Y, Du J, Dai L-R, Lee C-H (2015) A regression approach to speech enhancement based on deep neural networks. IEEEACM Trans Audio Speech Lang Process 23:7–19CrossRef

Zhang Y, Zhao Y (2013) Real and imaginary modulation spectral subtraction for speech enhancement. Speech Commun 55(4):509–522. https://doi.org/10.1016/j.specom.2012.09.005 (ISSN 0167-6393)CrossRef

Titel: FPGA based dual microphone speech enhancement
verfasst von: Tanmay Biswas
Sudhindu Bikash Mandal
Debasri Saha
Amlan Chakrabarti
Publikationsdatum: 16.01.2019
Verlag: Springer Berlin Heidelberg
Erschienen in: Microsystem Technologies / Ausgabe 3/2019
Print ISSN: 0946-7076
Elektronische ISSN: 1432-1858
DOI: https://doi.org/10.1007/s00542-019-04299-1

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Internationaler Motorenkongress/© [M] ATZlive | Chisnikov / Fotolia.com, Search Icon, Banner Hanser, Benedikt Bonnmann von Adesso/© Adesso, Teilzeit/© Fokussiert / stock.adobe.com, Hans-Joachim Lefeld/© Lucht Probst Associates GmbH, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, 2023_Antrieb/© supervisuell, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade, chassis.tech plus 2023/© [M] ATZlive / TÜV SÜD PRODUCT SERVICE GMBH

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 3/2019

Effectiveness of Taguchi method for the optimization of narrowband optical filters based on grating waveguides

Experimental and numerical investigation of a large stroke compliant revolute joint

Simulation and experimental investigation on tree concentration gradient generator with U-shape microchannel

Cost-effective fabrication of ionic polymer based artificial muscles for catheter-guidewire maneuvering application

A high accuracy fluxgate DC current sensor applicable to two-wire electric appliances

Ultrasensitive micro ion selective sensor arrays for multiplex heavy metal ions detection

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.