Skip to main content

2014 | OriginalPaper | Buchkapitel

13. Basic Audio Compression Techniques

verfasst von : Ze-Nian Li, Mark S. Drew, Jiangchuan Liu

Erschienen in: Fundamentals of Multimedia

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this chapter, compression of audio information is reviewed, with special consideration paid to speech compression. To begin with, we recall some of the issues covered in Chap. 6 on digital audio in multimedia. Here, this is combined with techniques that exploit the temporal redundancy present in audio signals. We extend the Pulse Code Modulation (PCM) scheme to DPCM, prepending the word “Differential,” as briefly introduced in Chap. 6 but fleshed out here. Specifically, in this chapter, we look at ADPCM, Vocoders, and more general Speech Compression: LPC, CELP, MBE, and MELP. Adaptive DPCM is ADPCM. In speech coding, a number of standards have evolved and we set these out here, including some of their fundamental strategies. We then go on to study coders (encoding/decoding algorithms) specifically aimed at speech compression. The properties of Vocoders are examined, including the notion of phase insensitivity, channels, and formants. Next, LPC (Linear Predictive Coding) vocoders are discussed, followed by CELP (Code Excited Linear Prediction), a more complex family of coders. Hybrid Excitation Vocoders are another large class of speech coders, and we round the discussion off by having a look at MBE (Multi-Band Excitation) and MELP (Multiband Excitation Linear Predictive) vocoders.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat N.S. Jayant, P. Noll, Digital Coding of Waveforms (Prentice-Hall, Upper Saddle River, 1984) N.S. Jayant, P. Noll, Digital Coding of Waveforms (Prentice-Hall, Upper Saddle River, 1984)
2.
Zurück zum Zitat J.C. Bellamy, Digital Telephony (Wiley, Hoboken, 2000) J.C. Bellamy, Digital Telephony (Wiley, Hoboken, 2000)
3.
Zurück zum Zitat T.E. Tremain, The government standard linear predictive coding algorithm: LPC-10. Speech Technol. 1(2), 40–49 (1982) T.E. Tremain, The government standard linear predictive coding algorithm: LPC-10. Speech Technol. 1(2), 40–49 (1982)
4.
Zurück zum Zitat J.P. Campbell Jr., T.E. Tremain, V.C. Welch, in Advances in Speech Coding, The DOD 4.8 kbps Standard (Proposed Federal Standard 1016), (Kluwer Academic Publishers, Boston, 1991) J.P. Campbell Jr., T.E. Tremain, V.C. Welch, in Advances in Speech Coding, The DOD 4.8 kbps Standard (Proposed Federal Standard 1016), (Kluwer Academic Publishers, Boston, 1991)
6.
Zurück zum Zitat GSM enhanced full rate (EFR) speech transcoding (GSM 06.60). European Telecommunications Standards Institute v.8.0.1 (1999) GSM enhanced full rate (EFR) speech transcoding (GSM 06.60). European Telecommunications Standards Institute v.8.0.1 (1999)
10.
Zurück zum Zitat D.W. Griffin, J.S. Lim, Multi-band excitation vocoder. IEEE Trans. ASSP 36(8), 1223–1235 (1988)CrossRefMATH D.W. Griffin, J.S. Lim, Multi-band excitation vocoder. IEEE Trans. ASSP 36(8), 1223–1235 (1988)CrossRefMATH
11.
Zurück zum Zitat M.S. Brandstein, P.A. Monta, J.C. Hardwick, J.S. Lim, A real-time implementation of the improved MBE speech coder. Int. Conf. on Acoustics, Speech, and Signal Proc. (1990), pp. 5–8 M.S. Brandstein, P.A. Monta, J.C. Hardwick, J.S. Lim, A real-time implementation of the improved MBE speech coder. Int. Conf. on Acoustics, Speech, and Signal Proc. (1990), pp. 5–8
12.
Zurück zum Zitat T.P. Barnwellm III, A.V. McCree, Mixed excitation LPC vocoder model for low bit rate speech coding. IEEE Trans. Speech Audio Proc. 3(4), 242–250 (1995) T.P. Barnwellm III, A.V. McCree, Mixed excitation LPC vocoder model for low bit rate speech coding. IEEE Trans. Speech Audio Proc. 3(4), 242–250 (1995)
Metadaten
Titel
Basic Audio Compression Techniques
verfasst von
Ze-Nian Li
Mark S. Drew
Jiangchuan Liu
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-05290-8_13

Premium Partner