
2017 | Original Paper | Book Chapter

The Perceptual Lossless Quantization of Spatial Parameter for 3D Audio Signals

Authors: Gang Li, Xiaochen Wang, Li Gao, Ruimin Hu, Dengshi Li

Published in: MultiMedia Modeling

Publisher: Springer International Publishing


Abstract

With the development of multichannel audio systems, 3D audio systems have entered everyday life. However, the growing number of channels poses challenges for storing and transmitting large amounts of data. Spatial Audio Coding (SAC), the mainstream 3D audio coding technology, is key to reproducing 3D multichannel audio signals with efficient compression. The Just Noticeable Difference (JND) characteristics of the human auditory system can be exploited to reduce spatial perceptual redundancy when quantizing the spatial parameters in SAC. However, current SAC quantization methods do not fully exploit the JND characteristics. In this paper, we propose a Perceptual Lossless Quantization of Spatial Parameter (PLQSP) method, in which the azimuthal and elevational quantization step sizes of the spatial parameters are combined with the JNDs. Both objective and subjective experiments were conducted to demonstrate the efficiency of the PLQSP method. Compared with the reference methods SLQP-L and SLQP-H, PLQSP reduces the quantization codebook size by 16.99% and 27.79%, respectively, while preserving similar listening quality.
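The idea of tying quantization step sizes to JNDs can be illustrated with a minimal sketch. This is not the PLQSP method from the chapter: the JND curve `jnd_deg`, the step rule in `build_codebook`, and the 0 to 180 degree azimuth range are all hypothetical choices made only for illustration. The sketch places azimuth codewords so that each step equals the assumed JND at the current codeword, which yields coarser steps where the assumed JND is larger and a smaller codebook than a uniform grid at the finest JND.

```python
import bisect
import math


def jnd_deg(azimuth_deg):
    """Hypothetical azimuth JND in degrees (an assumption, not measured data):
    about 1 degree at the front and back, about 5 degrees at the side."""
    return 1.0 + 4.0 * abs(math.sin(math.radians(azimuth_deg)))


def build_codebook(jnd, start=0.0, stop=180.0):
    """Place codewords so each step equals the JND at the current codeword.
    This step rule is an illustrative assumption."""
    codebook = [start]
    while codebook[-1] < stop:
        nxt = codebook[-1] + jnd(codebook[-1])
        codebook.append(min(nxt, stop))  # clip the last codeword to the range end
    return codebook


def quantize(value, codebook):
    """Return the codeword nearest to value (codebook must be sorted)."""
    i = bisect.bisect_left(codebook, value)
    if i == 0:
        return codebook[0]
    if i == len(codebook):
        return codebook[-1]
    lo, hi = codebook[i - 1], codebook[i]
    return lo if value - lo <= hi - value else hi


# JND-driven codebook versus a uniform grid at the finest assumed JND (1 degree).
perceptual = build_codebook(jnd_deg)
uniform = [float(a) for a in range(0, 181)]

print(len(uniform), len(perceptual))
print(quantize(90.0, perceptual))
```

Under these assumptions the quantization error of any azimuth stays below the assumed JND at that azimuth, which is the sense in which such a codebook could be called perceptually lossless.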


Metadata
Title
The Perceptual Lossless Quantization of Spatial Parameter for 3D Audio Signals
Authors
Gang Li
Xiaochen Wang
Li Gao
Ruimin Hu
Dengshi Li
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-51814-5_32
