Skip to main content

2014 | OriginalPaper | Buchkapitel

14. Structured Sparse Bayesian Modelling for Audio Restoration

verfasst von : James Murphy, Simon Godsill

Erschienen in: Compressed Sensing & Sparse Filtering

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter shows how sparse solutions can be obtained for a range of problems in a Bayesian setting by using prior models on sparsity structure. As an example, a model to remove impulse and background noise from audio signals via their representation in time-frequency space using Gabor wavelets is presented. A number of prior models for the sparse structure of the signal in this space are introduced, including simple Bernoulli priors on each coefficient, Markov chains linking neighbouring coefficients in time or frequency, and Markov random fields, imposing two dimensional coherence on the coefficients. The effect of each of these priors on the reconstruction of a corrupted audio signal is shown. Impulse removal is also covered, with similar sparsity priors being applied to the location of impulse noise in the audio signal. Inference is performed by sampling from the posterior distribution of the model variables using a Gibbs sampler.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Balian R (1981) Un principe dincertitude fort en théorie du signal ou en mécanique quantique. CR Acad Sci Paris 292(2):1357–1361 Balian R (1981) Un principe dincertitude fort en théorie du signal ou en mécanique quantique. CR Acad Sci Paris 292(2):1357–1361
2.
Zurück zum Zitat Boll S (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust Speech Signal Process 27(2):113–120CrossRef Boll S (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust Speech Signal Process 27(2):113–120CrossRef
3.
Zurück zum Zitat Candès EJ, Romberg J, Tao T (2006) Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans Inf Theory 52(2):489–509CrossRefMATH Candès EJ, Romberg J, Tao T (2006) Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans Inf Theory 52(2):489–509CrossRefMATH
4.
Zurück zum Zitat Candès EJ, Tao T (2006) Near-optimal signal recovery from random projections: universal encoding strategies? IEEE Trans Inf Theory 52(12):5406–5425CrossRef Candès EJ, Tao T (2006) Near-optimal signal recovery from random projections: universal encoding strategies? IEEE Trans Inf Theory 52(12):5406–5425CrossRef
7.
Zurück zum Zitat Erkelens JS, Heusdens R (2008) Tracking of nonstationary noise based on data-driven recursive noise power estimation. IEEE Trans Audio Speech Lang Process 16(6):1112–1123CrossRef Erkelens JS, Heusdens R (2008) Tracking of nonstationary noise based on data-driven recursive noise power estimation. IEEE Trans Audio Speech Lang Process 16(6):1112–1123CrossRef
8.
Zurück zum Zitat Feichtinger HG, Strohmer T (1998) Gabor analysis algorithms: theory and applications. Birkhäuser, Boston Feichtinger HG, Strohmer T (1998) Gabor analysis algorithms: theory and applications. Birkhäuser, Boston
9.
Zurück zum Zitat Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 6:721–741CrossRefMATH Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 6:721–741CrossRefMATH
10.
Zurück zum Zitat Gilks WR, Gilks WR, Richardson S, Spiegelhalter DJ (1996) Markov chain Monte Carlo in practice. Chapman & Hall/CRC, London Gilks WR, Gilks WR, Richardson S, Spiegelhalter DJ (1996) Markov chain Monte Carlo in practice. Chapman & Hall/CRC, London
11.
Zurück zum Zitat Godsill SJ , Rayner PJW (1998) Digital audio restoration:a statistical model-based approach. Springer, Berlin (ISBN 3 540 76222 1, Sept 1998) Godsill SJ , Rayner PJW (1998) Digital audio restoration:a statistical model-based approach. Springer, Berlin (ISBN 3 540 76222 1, Sept 1998)
12.
Zurück zum Zitat Godsill SJ (2010) The shifted inverse-gamma model for noise floor estimation in archived audio recordings. Appl Signal Process 90.991-999(Special Issue on Preservation of Ethnological Recordings) Godsill SJ (2010) The shifted inverse-gamma model for noise floor estimation in archived audio recordings. Appl Signal Process 90.991-999(Special Issue on Preservation of Ethnological Recordings)
13.
Zurück zum Zitat Gustafsson S, Martin R, Jax P, Vary P (2002) A psychoacoustic approach to combined acoustic echo cancellation and noise reduction. IEEE Trans Speech Audio Process 10(5):245–256CrossRef Gustafsson S, Martin R, Jax P, Vary P (2002) A psychoacoustic approach to combined acoustic echo cancellation and noise reduction. IEEE Trans Speech Audio Process 10(5):245–256CrossRef
14.
Zurück zum Zitat Low F (1985) Complete sets of wave packets. A passion for physics-essays in honor of Geoofrey Chew. World Scientific, Singapore, pp 17–22 Low F (1985) Complete sets of wave packets. A passion for physics-essays in honor of Geoofrey Chew. World Scientific, Singapore, pp 17–22
15.
Zurück zum Zitat Mallat SG, Zhang Z (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans sig process 41(12):3397–3415CrossRefMATH Mallat SG, Zhang Z (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans sig process 41(12):3397–3415CrossRefMATH
16.
Zurück zum Zitat McGrory CA, Titterington DM, Reeves R et al (2009) DM Titterington, R. Reeves, and A.N. Pettitt. Variational Bayes for estimating the parameters of a hidden Potts model. Stat Comput 19(3):329–340 McGrory CA, Titterington DM, Reeves R et al (2009) DM Titterington, R. Reeves, and A.N. Pettitt. Variational Bayes for estimating the parameters of a hidden Potts model. Stat Comput 19(3):329–340
17.
Zurück zum Zitat Murphy J, Godsill S (2011) Joint Bayesian removal of impulse and background noise. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 261–264 Murphy J, Godsill S (2011) Joint Bayesian removal of impulse and background noise. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 261–264
18.
Zurück zum Zitat Murphy J, (2013) Sparse audio restoration in hidden states, hidden structures: bayesian learning in time series models, PhD Thesis, Cambridge University Murphy J, (2013) Sparse audio restoration in hidden states, hidden structures: bayesian learning in time series models, PhD Thesis, Cambridge University
19.
Zurück zum Zitat Niss M (2005) History of the Lenz-Ising model 1920–1950: from ferromagnetic to cooperative phenomena. Arch Hist Exact Sci 59(3):267–318MathSciNetCrossRefMATH Niss M (2005) History of the Lenz-Ising model 1920–1950: from ferromagnetic to cooperative phenomena. Arch Hist Exact Sci 59(3):267–318MathSciNetCrossRefMATH
20.
Zurück zum Zitat Qian S, Chen D ( 1993) Discrete Gabor transform. IEEE Trans Sig Process 41(7):2429–2438 Qian S, Chen D ( 1993) Discrete Gabor transform. IEEE Trans Sig Process 41(7):2429–2438
21.
Zurück zum Zitat Soon IY, Koh SN, Yeo CK (1998) Noisy speech enhancement using discrete cosine transform. Speech Commun 24(3):249–257CrossRef Soon IY, Koh SN, Yeo CK (1998) Noisy speech enhancement using discrete cosine transform. Speech Commun 24(3):249–257CrossRef
22.
Zurück zum Zitat Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B (Methodol) 58: 267–288 Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B (Methodol) 58: 267–288
23.
Zurück zum Zitat Wolfe PJ, Godsill SJ, Ng WJ (2004) Bayesian variable selection and regularisation for time-frequency surface estimation. J R Stat Soc Ser B 66(3):575–589 Read paper (with discussion) Wolfe PJ, Godsill SJ, Ng WJ (2004) Bayesian variable selection and regularisation for time-frequency surface estimation. J R Stat Soc Ser B 66(3):575–589 Read paper (with discussion)
24.
Zurück zum Zitat Wolfe PJ, Godsill SJ (2005) Interpolation of missing data values for audio signal restoration using a Gabor regression model. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), pp. 517–520 Wolfe PJ, Godsill SJ (2005) Interpolation of missing data values for audio signal restoration using a Gabor regression model. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), pp. 517–520
Metadaten
Titel
Structured Sparse Bayesian Modelling for Audio Restoration
verfasst von
James Murphy
Simon Godsill
Copyright-Jahr
2014
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-642-38398-4_14

Neuer Inhalt