Skip to main content

2018 | OriginalPaper | Buchkapitel

Primary-Ambient Extraction Based on Channel Pair for 5.1 Channel Audio Using Least Square

verfasst von : Dingyan Song, Ge Gao, Yi Chen, Xi Hu

Erschienen in: Advances in Multimedia Information Processing – PCM 2017

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

According to the growth of reality demand of digital media, the 5.1 surround is widely used and researched. To further improve the listening experience of the 5.1 channel audio, the primary-ambient extraction (PAE) is introduced to facilitate flexible rendering in spatial audio reproduction. The common multichannel PAE approach is principle component analysis (PCA), which suffers from high extraction errors and long computation time. In this letter, we proposed a novel approach based on channel pair for 5.1 channel audio, which considers the five channels as a set of channel pairs. Then a linear estimation framework is applied at any one time to only one pair, which converts the problem of PAE into the estimation of weight matrix, thus the weight of each component can be computed by using the Least Square. The experimental results indicate that the novel approach significantly outperforms the existing approach PCA.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lee, K., Son, C., Kim, D.: Immersive virtual sound for beyond 5.1 channel audio. In: Audio Engineering Society Convention 128. Audio Engineering Society (2010) Lee, K., Son, C., Kim, D.: Immersive virtual sound for beyond 5.1 channel audio. In: Audio Engineering Society Convention 128. Audio Engineering Society (2010)
2.
Zurück zum Zitat Stefanakis, N., Mouchtaris, A.: Foreground suppression for capturing and reproduction of crowded acoustic environments. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp. 51–55 (2015) Stefanakis, N., Mouchtaris, A.: Foreground suppression for capturing and reproduction of crowded acoustic environments. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp. 51–55 (2015)
3.
Zurück zum Zitat Kowalczyk, K., Thiergart, O., Taseska, M., et al.: Parametric spatial sound processing: a flexible and efficient solution to sound scene acquisition, modification, and reproduction. IEEE Signal Process. Mag. 32(2), 31–42 (2015)CrossRef Kowalczyk, K., Thiergart, O., Taseska, M., et al.: Parametric spatial sound processing: a flexible and efficient solution to sound scene acquisition, modification, and reproduction. IEEE Signal Process. Mag. 32(2), 31–42 (2015)CrossRef
4.
Zurück zum Zitat Menzer, F., Faller, C.: Stereo-to-binaural conversion using interaural coherence matching. In: Audio Engineering Society Convention 128. Audio Engineering Society (2010) Menzer, F., Faller, C.: Stereo-to-binaural conversion using interaural coherence matching. In: Audio Engineering Society Convention 128. Audio Engineering Society (2010)
5.
Zurück zum Zitat He, J.J.: Spatial audio reproduction with primary ambient extraction. Springer, Singapore (2016) He, J.J.: Spatial audio reproduction with primary ambient extraction. Springer, Singapore (2016)
6.
Zurück zum Zitat Goodwin, M.M., Jot, J.M.: Primary-ambient signal decomposition and vector-based localization for spatial audio coding and enhancement. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007. IEEE, pp. 1: I-9-I-12 (2007) Goodwin, M.M., Jot, J.M.: Primary-ambient signal decomposition and vector-based localization for spatial audio coding and enhancement. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007. IEEE, pp. 1: I-9-I-12 (2007)
7.
Zurück zum Zitat Ibrahim, K.M., Allam, M.: Primary-Ambient Extraction in Audio Signals Using Adaptive Weighting and Principal Component Analysis (2016) Ibrahim, K.M., Allam, M.: Primary-Ambient Extraction in Audio Signals Using Adaptive Weighting and Principal Component Analysis (2016)
8.
Zurück zum Zitat Pulkki, V.: Virtual sound source positioning using vector base amplitude panning. J. Audio Eng. Soc. 45(6), 456–466 (1997) Pulkki, V.: Virtual sound source positioning using vector base amplitude panning. J. Audio Eng. Soc. 45(6), 456–466 (1997)
9.
Zurück zum Zitat He, J., Tan, E.L., Gan, W.S.: Linear estimation based primary-ambient extraction for stereo audio signals. IEEE/ACM Trans. Audio Speech Lang. Process. 22(2), 505–517 (2014)CrossRef He, J., Tan, E.L., Gan, W.S.: Linear estimation based primary-ambient extraction for stereo audio signals. IEEE/ACM Trans. Audio Speech Lang. Process. 22(2), 505–517 (2014)CrossRef
10.
Zurück zum Zitat Recommendation I.: Multichannel stereophonic sound system with and without accompanying picture. International Telecommunication Union, 775–1 (1992) Recommendation I.: Multichannel stereophonic sound system with and without accompanying picture. International Telecommunication Union, 775–1 (1992)
11.
Zurück zum Zitat Breebaart, J., Faller, C.: Spatial Audio Processing: MPEG Surround and Other Applications. Wiley, Chicago (2008) Breebaart, J., Faller, C.: Spatial Audio Processing: MPEG Surround and Other Applications. Wiley, Chicago (2008)
12.
Zurück zum Zitat Goodwin, M.M.: Geometric signal decompositions for spatial audio enhancement. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2008. IEEE, pp. 409–412 (2008) Goodwin, M.M.: Geometric signal decompositions for spatial audio enhancement. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2008. IEEE, pp. 409–412 (2008)
13.
Zurück zum Zitat Faller, C.: Multiple-loudspeaker playback of stereo signals. J. Audio Eng. Soc. 54(11), 1051–1064 (2006) Faller, C.: Multiple-loudspeaker playback of stereo signals. J. Audio Eng. Soc. 54(11), 1051–1064 (2006)
14.
Zurück zum Zitat Browning, T.R.: Applying the design structure matrix to system decomposition and integration problems: a review and new directions[J]. IEEE Trans. Eng. Manage. 48(3), 292–306 (2001)CrossRef Browning, T.R.: Applying the design structure matrix to system decomposition and integration problems: a review and new directions[J]. IEEE Trans. Eng. Manage. 48(3), 292–306 (2001)CrossRef
15.
Zurück zum Zitat Kendall, G.S.: The decorrelation of audio signals and its impact on spatial imagery[J]. Comput. Music J. 19(4), 71–87 (1995)CrossRef Kendall, G.S.: The decorrelation of audio signals and its impact on spatial imagery[J]. Comput. Music J. 19(4), 71–87 (1995)CrossRef
16.
Zurück zum Zitat He, J., Gan, W.S., Tan, E.L.: Primary-ambient extraction using ambient spectrum estimation for immersive spatial audio reproduction[J]. IEEE/ACM Trans. Audio Speech Lang. Process. 23(9), 1431–1444 (2015)CrossRef He, J., Gan, W.S., Tan, E.L.: Primary-ambient extraction using ambient spectrum estimation for immersive spatial audio reproduction[J]. IEEE/ACM Trans. Audio Speech Lang. Process. 23(9), 1431–1444 (2015)CrossRef
17.
Zurück zum Zitat Jeffress, L.A.: A place theory of sound localization. Journal of comparative and physiological psychology 41(1), 35–39 (1948)CrossRef Jeffress, L.A.: A place theory of sound localization. Journal of comparative and physiological psychology 41(1), 35–39 (1948)CrossRef
Metadaten
Titel
Primary-Ambient Extraction Based on Channel Pair for 5.1 Channel Audio Using Least Square
verfasst von
Dingyan Song
Ge Gao
Yi Chen
Xi Hu
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-77383-4_62

Neuer Inhalt