Skip to main content
Top

2017 | OriginalPaper | Chapter

10. Frequency Domain Coding

Author : Tom Bäckström

Published in: Speech Coding

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Signals which are sufficiently stationary permit highly efficient coding in the frequency domain. Such signals include speech signals such as sustained vowels and prolonged fricatives, as well as generic audio signals such as music and mixed material. The main components of frequency domain coding methods include windowing, a time-frequency transform, perceptual modelling and entropy coding of the spectral components. This chapter gives an overview of such transform domain coding methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference 3GPP. TS 26.445, EVS Codec Detailed Algorithmic Description; 3GPP Technical Specification (Release 12), 2014 3GPP. TS 26.445, EVS Codec Detailed Algorithmic Description; 3GPP Technical Specification (Release 12), 2014
2.
go back to reference Allen, J.: Short-term spectral analysis, and modification by discrete fourier transform. IEEE Trans. Acoust. Speech Signal Process. 25, 235–238 (1977)CrossRefMATH Allen, J.: Short-term spectral analysis, and modification by discrete fourier transform. IEEE Trans. Acoust. Speech Signal Process. 25, 235–238 (1977)CrossRefMATH
3.
go back to reference Bosi, M., Goldberg, R.E.: Introduction to Digital Audio Coding and Standards. Kluwer Academic Publishers, Dordrecht (2003)CrossRef Bosi, M., Goldberg, R.E.: Introduction to Digital Audio Coding and Standards. Kluwer Academic Publishers, Dordrecht (2003)CrossRef
4.
go back to reference Bäckström, T.: Comparison of windowing in speech and audio coding. In: Proceedings of WASPAA, New Paltz, USA (2013) Bäckström, T.: Comparison of windowing in speech and audio coding. In: Proceedings of WASPAA, New Paltz, USA (2013)
5.
go back to reference Bäckström, T.: Vandermonde factorization of Toeplitz matrices and applications in filtering and warping. IEEE Trans. Signal Process. 61(24), 6257–6263 (2013)MathSciNetCrossRef Bäckström, T.: Vandermonde factorization of Toeplitz matrices and applications in filtering and warping. IEEE Trans. Signal Process. 61(24), 6257–6263 (2013)MathSciNetCrossRef
6.
go back to reference Bäckström, T., Helmrich, C.R.: Decorrelated innovative codebooks for ACELP using factorization of autocorrelation matrix. In: Proceedings of Interspeech, pp. 2794–2798 (2014) Bäckström, T., Helmrich, C.R.: Decorrelated innovative codebooks for ACELP using factorization of autocorrelation matrix. In: Proceedings of Interspeech, pp. 2794–2798 (2014)
7.
go back to reference Bäckström, T., Helmrich, C.R.: Arithmetic coding of speech and audio spectra using TCX based on linear predictive spectral envelopes. In: Proceedings of ICASSP, pp. 5127–5131 (2015) Bäckström, T., Helmrich, C.R.: Arithmetic coding of speech and audio spectra using TCX based on linear predictive spectral envelopes. In: Proceedings of ICASSP, pp. 5127–5131 (2015)
8.
go back to reference Edler, B.: Codierung von audiosignalen mit Überlappender transformation und adaptiven fensterfunktionen. Frequenz 43(9), 252–256 (1989)CrossRef Edler, B.: Codierung von audiosignalen mit Überlappender transformation und adaptiven fensterfunktionen. Frequenz 43(9), 252–256 (1989)CrossRef
9.
go back to reference Eksler, V., Jelínek, M., Salami, R.: Efficient handling of mode switching and speech transitions in the EVS codec. In: Proceedings of ICASSP, Brisbane, Australia, IEEE (2015) Eksler, V., Jelínek, M., Salami, R.: Efficient handling of mode switching and speech transitions in the EVS codec. In: Proceedings of ICASSP, Brisbane, Australia, IEEE (2015)
10.
go back to reference Fischer, T.: A pyramid vector quantizer. IEEE Trans. Inf. Theory, IT-32(4), 568–583 (1986) Fischer, T.: A pyramid vector quantizer. IEEE Trans. Inf. Theory, IT-32(4), 568–583 (1986)
11.
go back to reference Fuchs, G., Subbaraman, V., Multrus, M.: Efficient context adaptive entropy coding for real-time applications. In: Proceedings of ICASSP, IEEE, pp. 493–496 (2011) Fuchs, G., Subbaraman, V., Multrus, M.: Efficient context adaptive entropy coding for real-time applications. In: Proceedings of ICASSP, IEEE, pp. 493–496 (2011)
12.
go back to reference Gersho, A., Gray, R.M.: Vector Quantization and Signal Compression. Springer, Berlin (1992) Gersho, A., Gray, R.M.: Vector Quantization and Signal Compression. Springer, Berlin (1992)
13.
go back to reference Harris, F.J.: On the use of windows for harmonic analysis with the discrete fourier transform. Proc. IEEE 66(1), 51–83 (1978)CrossRef Harris, F.J.: On the use of windows for harmonic analysis with the discrete fourier transform. Proc. IEEE 66(1), 51–83 (1978)CrossRef
14.
go back to reference Huffman, D.A.: A method for the construction of minimum redundancy codes. Proc. IRE 40(9), 1098–1101 (1952)CrossRefMATH Huffman, D.A.: A method for the construction of minimum redundancy codes. Proc. IRE 40(9), 1098–1101 (1952)CrossRefMATH
15.
go back to reference ISO/IEC 23003–3:2012. MPEG-D (MPEG audio technologies), Part 3: Unified speech and audio coding (2012) ISO/IEC 23003–3:2012. MPEG-D (MPEG audio technologies), Part 3: Unified speech and audio coding (2012)
16.
go back to reference Malvar, H.S.: Lapped transforms for efficient transform/subband coding. IEEE Trans. Acoust. Speech Signal Process. 38(6), 969–978 (1990)CrossRef Malvar, H.S.: Lapped transforms for efficient transform/subband coding. IEEE Trans. Acoust. Speech Signal Process. 38(6), 969–978 (1990)CrossRef
17.
go back to reference Malvar, H.S.: Signal Processing with Lapped Transforms. Artech House, Inc. (1992) Malvar, H.S.: Signal Processing with Lapped Transforms. Artech House, Inc. (1992)
18.
go back to reference Mitra, S.K.: Digital Signal Processing: A Computer-Based Approach. McGraw-Hill, New York (1998) Mitra, S.K.: Digital Signal Processing: A Computer-Based Approach. McGraw-Hill, New York (1998)
19.
go back to reference Mäkinen, J., Bessette, B., Bruhn, S., Ojala, P., Salami, R., Taleb, A.: AMR-WB+: a new audio coding standard for 3rd generation mobile audio services. Proc. ICASSP 2, 1109–1112 (2005) Mäkinen, J., Bessette, B., Bruhn, S., Ojala, P., Salami, R., Taleb, A.: AMR-WB+: a new audio coding standard for 3rd generation mobile audio services. Proc. ICASSP 2, 1109–1112 (2005)
21.
go back to reference Sanchez, V.E., Adoul, J.-P.: Low-delay wideband speech coding using a new frequency domain approach. In: Proceedings of ICASSP, IEEE, vol. 2, pp. 415–418 (1993) Sanchez, V.E., Adoul, J.-P.: Low-delay wideband speech coding using a new frequency domain approach. In: Proceedings of ICASSP, IEEE, vol. 2, pp. 415–418 (1993)
22.
go back to reference Svedberg, J., Grancharov, V., Sverrisson, S., Norvell, E., Toftgård, T., Pobloth, H., Bruhn, S.: MDCT audio coding with pulse vector quantizers. In: Proceedings of ICASSP, pp. 5937–5941 (2015) Svedberg, J., Grancharov, V., Sverrisson, S., Norvell, E., Toftgård, T., Pobloth, H., Bruhn, S.: MDCT audio coding with pulse vector quantizers. In: Proceedings of ICASSP, pp. 5937–5941 (2015)
23.
go back to reference Valin, J.-M., Maxwell, G., Terriberry, T.B., Vos, K.: High-quality, low-delay music coding in the OPUS codec. In: Audio Engineering Society Convention 135. Audio Engineering Society (2013) Valin, J.-M., Maxwell, G., Terriberry, T.B., Vos, K.: High-quality, low-delay music coding in the OPUS codec. In: Audio Engineering Society Convention 135. Audio Engineering Society (2013)
24.
go back to reference Witten, I.H., Neal, R.M., Cleary, J.G.: Arithmetic coding for data compression. Commun. ACM 30(6), 520–540 (1987)CrossRef Witten, I.H., Neal, R.M., Cleary, J.G.: Arithmetic coding for data compression. Commun. ACM 30(6), 520–540 (1987)CrossRef
Metadata
Title
Frequency Domain Coding
Author
Tom Bäckström
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-50204-5_10