Skip to main content

2018 | OriginalPaper | Buchkapitel

An AFK-SVD Sparse Representation Approach for Speech Signal Processing

verfasst von : Fenglian Li, Xueying Zhang, Hongle Zhang, Yu-Chu Tian

Erschienen in: Advances in Intelligent Information Hiding and Multimedia Signal Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sparse representation is a common issue in many signal processing problems. In speech signal processing, how to sparsely represent a speech signal by dictionary learning for improving transmission efficiency has attracted considerable attention in recent years. K-SVD algorithm for dictionary learning is a typical method. But it requires to know the dictionary size prior to dictionary training. A suitable dictionary size can effectively avoid the problem of under-representation or over-representation, which affects the quality of reconstruction speech significantly. To tackle this problem, an Adaptive dictionary size Feedback filtering K-SVD (AFK-SVD) approach is presented in this paper for dictionary leaning. The proposed method first selects the dictionary size adaptively based on the speech signal feasure prior to dictionary learning, and then filters out the noise caused by over-representation. The approach has two unique features: (1) a learning model is constructed based on the training set specifically for adaptive determination of a range of the dictionary size; and (2) a two-level feedback filter measure is developed for removal of speech distortion caused by over-representation. The speech signals from TIMIT speech data sets are used to demonstrate the presented AFK-SVD approach. Experimental results showed that, in comparison with K-SVD, the proposed AFK-SVD method can improve the quality of the reconstructed speech signal in PESQ by 0.8 and SNR by 3 - 7 dB in average.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Aharon, M., Elad, M., Bruckstein, A.: \( rm k \)-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 54(11), 4311–4322 (2006)CrossRef Aharon, M., Elad, M., Bruckstein, A.: \( rm k \)-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 54(11), 4311–4322 (2006)CrossRef
2.
Zurück zum Zitat Zhou, J., Wang, J.: Fabric defect detection using adaptive dictionaries. Text. Res. J. 83(17), 1846–1859 (2013)CrossRef Zhou, J., Wang, J.: Fabric defect detection using adaptive dictionaries. Text. Res. J. 83(17), 1846–1859 (2013)CrossRef
3.
Zurück zum Zitat Bierman, R., Singh, R.: Influence of dictionary size on the lossless compression of microarray images Twentieth IEEE International Symposium on Computer-Based Medical Systems: CBMS 2007. IEEE (2007) Bierman, R., Singh, R.: Influence of dictionary size on the lossless compression of microarray images Twentieth IEEE International Symposium on Computer-Based Medical Systems: CBMS 2007. IEEE (2007)
4.
Zurück zum Zitat Sun, Y., Gomez, F., Schmidhuber, J.: On the size of the online kernel sparsification dictionary. arXiv preprint arXiv: 1206.4623 (2012) Sun, Y., Gomez, F., Schmidhuber, J.: On the size of the online kernel sparsification dictionary. arXiv preprint arXiv:​ 1206.​4623 (2012)
5.
Zurück zum Zitat Zhou, Y., et al.: Immune K-SVD algorithm for dictionary learning in speech denoising. Neurocomputing 137, 223–233 (2014)CrossRef Zhou, Y., et al.: Immune K-SVD algorithm for dictionary learning in speech denoising. Neurocomputing 137, 223–233 (2014)CrossRef
6.
Zurück zum Zitat Zhou, Y., Zhao, H., Lie, P.X.: Detection from speech analysis based on K–SVD deep belief network model. In: International Conference on Intelligent Computing, pp. 189–196. Springer (2015) Zhou, Y., Zhao, H., Lie, P.X.: Detection from speech analysis based on K–SVD deep belief network model. In: International Conference on Intelligent Computing, pp. 189–196. Springer (2015)
7.
Zurück zum Zitat Tjoa, S.K., et al.: Harmonic variable-size dictionary learning for music source separation. In: 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). IEEE (2010) Tjoa, S.K., et al.: Harmonic variable-size dictionary learning for music source separation. In: 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). IEEE (2010)
Metadaten
Titel
An AFK-SVD Sparse Representation Approach for Speech Signal Processing
verfasst von
Fenglian Li
Xueying Zhang
Hongle Zhang
Yu-Chu Tian
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-63859-1_23