Published in: International Journal of Speech Technology 3/2016

28.05.2016

Simultaneous speech coding and de-noising in a dictionary based quantized CS framework

Authors: Vinitha Ramdas, Sai Subrahmanyam R. K. Gorthi, Deepak Mishra


Abstract

Speech compression, or speech coding, is essential for communicating speech signals effectively in resource-limited scenarios, and researchers have been working to achieve ever lower transmission bit rates (BR) without much compromise in speech quality. Medium-BR hybrid speech coding schemes have attracted considerable interest in recent years, most of them based on CELP, the basic medium bit-rate coding scheme. In this work, we provide insight into the capabilities of compressive sensing (CS) in speech processing and propose a novel idea in the quantized framework. This paper demonstrates three major aspects: (1) inherent de-noising of noisy speech by the CS-based coder along with compression; (2) quantization of CS measurements to achieve medium transmission bit rates; and (3) enhancement of the coder's quality and compression performance through better sparse representations of speech using dictionaries. The results indicate that the proposed scheme offers better compression than basic Gaussian-codebook CELP. The CS scheme has the added advantage of inherent noise suppression and is more robust to background noise than medium bit-rate speech coding systems based on parameter extraction.
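The pipeline the abstract outlines (random CS measurements, quantization of those measurements, sparse reconstruction in a dictionary) can be illustrated with a minimal Python sketch. This is not the authors' coder. A fixed DCT basis stands in for a learned dictionary, plain matching pursuit stands in for whichever recovery algorithm the paper uses, and the frame length, measurement count, 6-bit uniform quantizer, and all function names are illustrative assumptions.

```python
import math
import random

def dct_atom(k, n):
    """k-th orthonormal DCT-II basis vector of length n (stand-in dictionary atom)."""
    scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
    return [scale * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def quantize(values, bits, vmax):
    """Uniform quantizer on [-vmax, vmax]; returns integer indices and the step size."""
    levels = 2 ** bits
    step = 2.0 * vmax / levels
    idx = [min(max(int(math.floor((v + vmax) / step)), 0), levels - 1) for v in values]
    return idx, step

def dequantize(idx, step, vmax):
    """Map each index back to the midpoint of its quantization cell."""
    return [(q + 0.5) * step - vmax for q in idx]

def matching_pursuit(columns, y, n_iter):
    """Greedy sparse recovery of y over the given atoms (columns of Phi * Psi)."""
    energy = [dot(c, c) for c in columns]
    coeffs = [0.0] * len(columns)
    residual = list(y)
    history = [math.sqrt(dot(residual, residual))]
    for _ in range(n_iter):
        # Pick the atom with the largest normalized correlation to the residual.
        best = max(range(len(columns)),
                   key=lambda j: abs(dot(columns[j], residual)) / math.sqrt(energy[j]))
        gain = dot(columns[best], residual) / energy[best]
        coeffs[best] += gain
        residual = [r - gain * c for r, c in zip(residual, columns[best])]
        history.append(math.sqrt(dot(residual, residual)))
    return coeffs, history

def demo(n=64, m=32, bits=6, seed=0):
    rng = random.Random(seed)
    psi = [dct_atom(k, n) for k in range(n)]
    # Toy "speech frame": three active DCT atoms plus weak additive noise.
    active = {3: 1.0, 7: -0.6, 12: 0.4}
    clean = [sum(a * psi[k][i] for k, a in active.items()) for i in range(n)]
    noisy = [c + rng.gauss(0.0, 0.02) for c in clean]
    # Encoder: m < n random Gaussian measurements, then uniform quantization.
    phi = [[rng.gauss(0.0, 1.0 / math.sqrt(m)) for _ in range(n)] for _ in range(m)]
    y = [dot(row, noisy) for row in phi]
    vmax = max(abs(v) for v in y)  # assumed to be sent as side information
    idx, step = quantize(y, bits, vmax)
    # Decoder: dequantize, then recover a few coefficients in the dictionary.
    y_hat = dequantize(idx, step, vmax)
    columns = [[dot(row, atom) for row in phi] for atom in psi]
    coeffs, history = matching_pursuit(columns, y_hat, n_iter=10)
    recon = [sum(coeffs[k] * psi[k][i] for k in range(n)) for i in range(n)]
    return {"idx": idx, "step": step, "y": y, "y_hat": y_hat,
            "history": history, "recon": recon, "bits_per_frame": m * bits}

if __name__ == "__main__":
    out = demo()
    print("bits per frame:", out["bits_per_frame"])
    print("residual norm per iteration:", [round(h, 4) for h in out["history"]])
```

In this sketch the bit budget per frame is simply the number of measurements times the bits per measurement. Any noise suppression comes only from keeping a small number of dictionary atoms at the decoder, since components that the chosen atoms cannot represent are left in the residual. How closely this mirrors the paper's actual design is not something the abstract settles.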


Metadata
Title
Simultaneous speech coding and de-noising in a dictionary based quantized CS framework
Authors
Vinitha Ramdas
Sai Subrahmanyam R. K. Gorthi
Deepak Mishra
Publication date
28.05.2016
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 3/2016
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-016-9345-5
