Skip to main content
Erschienen in:
Buchtitelbild

2017 | OriginalPaper | Buchkapitel

1. Introduction

verfasst von : JianJun He

Erschienen in: Spatial Audio Reproduction with Primary Ambient Extraction

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter gives a brief introduction on the motivation of the work on spatial audio reproduction using a sound scene decomposition technique referred to as primary ambient extraction.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
[ASI08]
Zurück zum Zitat Arai YC, Sakakibara S, Ito A, Ohshima K, Sakakibara T, Nishi T et al (2008) Intra-operative natural sound decreases salivary amylase activity of patients undergoing inguinal hernia repair under epidural anesthesia. Acta Anaesthesiol Scand 52(7):987–990 Arai YC, Sakakibara S, Ito A, Ohshima K, Sakakibara T, Nishi T et al (2008) Intra-operative natural sound decreases salivary amylase activity of patients undergoing inguinal hernia repair under epidural anesthesia. Acta Anaesthesiol Scand 52(7):987–990
[AvJ04]
Zurück zum Zitat Avendano C, Jot JM (2004) A frequency-domain approach to multichannel upmix. J Audio Eng Soc 52(7/8):740–749 Avendano C, Jot JM (2004) A frequency-domain approach to multichannel upmix. J Audio Eng Soc 52(7/8):740–749
[BaS07]
Zurück zum Zitat Bai MR, Shih GY (2007) Upmixing and downmixing two-channel stereo audio for consumer electronics. IEEE Trans Consum Electron 53(3):1011–1019 Bai MR, Shih GY (2007) Upmixing and downmixing two-channel stereo audio for consumer electronics. IEEE Trans Consum Electron 53(3):1011–1019
[Beg00]
Zurück zum Zitat Begault DR (2000) 3-D sound for virtual reality and multimedia. AP Professional, Cambridge, MA Begault DR (2000) 3-D sound for virtual reality and multimedia. AP Professional, Cambridge, MA
[BWG10]
Zurück zum Zitat Begault DR, Wenzel EM, Godfroy M, Miller JD, Anderson MR (2010) Applying spatial audio to human interfaces: 25 years of NASA experience. In Proceedings of 40th AES international conference on spatial audio, Tokyo Begault DR, Wenzel EM, Godfroy M, Miller JD, Anderson MR (2010) Applying spatial audio to human interfaces: 25 years of NASA experience. In Proceedings of 40th AES international conference on spatial audio, Tokyo
[BrF07]
Zurück zum Zitat Breebaart J, Faller C (2007) Spatial audio processing: MPEG surround and other applications. Wiley, Hoboken, NJ Breebaart J, Faller C (2007) Spatial audio processing: MPEG surround and other applications. Wiley, Hoboken, NJ
[BrS08]
Zurück zum Zitat Breebaart J, Schuijers E (2008) Phantom materialization: a novel method to enhance stereo audio reproduction on headphones. IEEE Trans Audio, Speech, Lang Process 16(8):1503–1511 Breebaart J, Schuijers E (2008) Phantom materialization: a novel method to enhance stereo audio reproduction on headphones. IEEE Trans Audio, Speech, Lang Process 16(8):1503–1511
[DLH03]
Zurück zum Zitat Diette GB, Lechtzin N, Haponik E, Devrotes A, Rubin HR (2003) Distraction therapy with nature sights and sounds reduces pain during flexible bronchoscopy: a complementary approach to routine analgesia. Chest J 123(3):941–948 Diette GB, Lechtzin N, Haponik E, Devrotes A, Rubin HR (2003) Distraction therapy with nature sights and sounds reduces pain during flexible bronchoscopy: a complementary approach to routine analgesia. Chest J 123(3):941–948
[Fal04]
Zurück zum Zitat Faller C (2004) Coding of spatial audio compatible with different playback formats. In Proceedings of 117th AES Convention, San Francisco, CA Faller C (2004) Coding of spatial audio compatible with different playback formats. In Proceedings of 117th AES Convention, San Francisco, CA
[Fal06]
Zurück zum Zitat Faller C (2006) Multiple-loudspeaker playback of stereo signals. J Audio Eng Soc 54(11):1051–1064 Faller C (2006) Multiple-loudspeaker playback of stereo signals. J Audio Eng Soc 54(11):1051–1064
[Fal07]
Zurück zum Zitat Faller C (2007) Matrix surround revisited. In: Proceedings of 30th AES international conference, Saariselka, Finland Faller C (2007) Matrix surround revisited. In: Proceedings of 30th AES international conference, Saariselka, Finland
[FaB03]
Zurück zum Zitat Faller C, Baumgarte F (2003) Binaural cue coding-Part II: schemes and applications. IEEE Trans Speech Audio Process 11(6):520–531 Faller C, Baumgarte F (2003) Binaural cue coding-Part II: schemes and applications. IEEE Trans Speech Audio Process 11(6):520–531
[FaB11]
Zurück zum Zitat Faller C, Breebaart J (2011) Binaural reproduction of stereo signals using upmixing and diffuse rendering. In Proceedings of 131th audio engineering society convention, New York Faller C, Breebaart J (2011) Binaural reproduction of stereo signals using upmixing and diffuse rendering. In Proceedings of 131th audio engineering society convention, New York
[GTK11]
Zurück zum Zitat Gan WS, Tan EL, Kuo SM (2011) Audio projection: directional sound and its application in immersive communication. IEEE Sig Process Mag 28(1):43–57 Gan WS, Tan EL, Kuo SM (2011) Audio projection: directional sound and its application in immersive communication. IEEE Sig Process Mag 28(1):43–57
[Ger92]
Zurück zum Zitat Gerzon MA (1992) Optimal reproduction matricies for multispeaker stereo. J Audio Eng Soc 40(7/8):571–589 Gerzon MA (1992) Optimal reproduction matricies for multispeaker stereo. J Audio Eng Soc 40(7/8):571–589
[GoJ06a]
Zurück zum Zitat Goodwin M, Jot J-M (2006) A frequency-domain framework for spatial audio coding based on universal spatial cues. In: Proceedings of 120th AES convention Goodwin M, Jot J-M (2006) A frequency-domain framework for spatial audio coding based on universal spatial cues. In: Proceedings of 120th AES convention
[GoJ06b]
Zurück zum Zitat Goodwin M, Jot JM (2007) Binaural 3-D audio rendering based on spatial audio scene coding. In: Proceedings of 123rd AES convention, New York Goodwin M, Jot JM (2007) Binaural 3-D audio rendering based on spatial audio scene coding. In: Proceedings of 123rd AES convention, New York
[GoJ07b]
Zurück zum Zitat Goodwin M, Jot JM (2007) Primary-ambient signal decomposition and vector-based localization for spatial audio coding and enhancement. In: Proceedings of ICASSP, Hawaii, pp 9–12 Goodwin M, Jot JM (2007) Primary-ambient signal decomposition and vector-based localization for spatial audio coding and enhancement. In: Proceedings of ICASSP, Hawaii, pp 9–12
[GoJ08]
Zurück zum Zitat Goodwin M, Jot JM (2008) Spatial audio scene coding. In: Proceedings of 125th AES convention, San Francisco Goodwin M, Jot JM (2008) Spatial audio scene coding. In: Proceedings of 125th AES convention, San Francisco
[HTG14]
Zurück zum Zitat He J, Tan EL, Gan WS (2014) Linear estimation based primary-ambient extraction for stereo audio signals. IEEE/ACM Trans Audio, Speech, Lang Process 22(2):505–517 He J, Tan EL, Gan WS (2014) Linear estimation based primary-ambient extraction for stereo audio signals. IEEE/ACM Trans Audio, Speech, Lang Process 22(2):505–517
[HGT14]
Zurück zum Zitat He J, Gan WS, Tan EL (2014) A study on the frequency-domain primary-ambient extraction for stereo audio signals. In: Proceedings of ICASSP, Florence, Italy, pp. 2892–2896 He J, Gan WS, Tan EL (2014) A study on the frequency-domain primary-ambient extraction for stereo audio signals. In: Proceedings of ICASSP, Florence, Italy, pp. 2892–2896
[HHK14]
Zurück zum Zitat Herre J, Hilpert J, Kuntz A, Plogsties J (2014) MPEG-H audio—the new standard for universal spatial/3D audio coding. J Audio Eng Soc 62(12):821–830 Herre J, Hilpert J, Kuntz A, Plogsties J (2014) MPEG-H audio—the new standard for universal spatial/3D audio coding. J Audio Eng Soc 62(12):821–830
[Hol08]
Zurück zum Zitat Holman T (2008) Surround sound up and running, 2nd edn. Focal Press, MA Holman T (2008) Surround sound up and running, 2nd edn. Focal Press, MA
[ITU93]
[ITU12b]
Zurück zum Zitat ITU (2012) Report ITU-R BS.2159-4: multichannel sound technology in home and broadcasting applications ITU (2012) Report ITU-R BS.2159-4: multichannel sound technology in home and broadcasting applications
[JPL10]
Zurück zum Zitat Jeon SW, Park YC, Lee S, Youn D (2010) Robust representation of spatial sound in stereo-to-multichannel upmix. In: Proceedings of 128th AES Convention, London, UK Jeon SW, Park YC, Lee S, Youn D (2010) Robust representation of spatial sound in stereo-to-multichannel upmix. In: Proceedings of 128th AES Convention, London, UK
[JoF11]
Zurück zum Zitat Jot JM, Fejzo Z (2011) Beyond surround sound—creation, coding and reproduction of 3-D audio soundtracks. In: Proceedings of 131st AES Convention, New York, NY Jot JM, Fejzo Z (2011) Beyond surround sound—creation, coding and reproduction of 3-D audio soundtracks. In: Proceedings of 131st AES Convention, New York, NY
[JLP99]
Zurück zum Zitat Jot JM, Larcher V, Pernaux JM (1999) A comparative study of 3-d audio encoding and rendering techniques. In: Proceedings of 16th AES International Conference, Rovaniemi, Finland Jot JM, Larcher V, Pernaux JM (1999) A comparative study of 3-d audio encoding and rendering techniques. In: Proceedings of 16th AES International Conference, Rovaniemi, Finland
[JMG07]
Zurück zum Zitat Jot JM, Merimaa J, Goodwin M, Krishnaswamy A, Laroche J (2007) Spatial audio scene coding in a universal two-channel 3-D stereo format. In: Proceedings of 123rd AES Convention, New York, NY Jot JM, Merimaa J, Goodwin M, Krishnaswamy A, Laroche J (2007) Spatial audio scene coding in a universal two-channel 3-D stereo format. In: Proceedings of 123rd AES Convention, New York, NY
[KKM15]
Zurück zum Zitat Kleczkowski P, Krol A, Malecki P (2015) Multichannel sound reproduction quality improves with angular separation of direct and reflected sounds. J Audio Eng Soc 63(6):427–442 Kleczkowski P, Krol A, Malecki P (2015) Multichannel sound reproduction quality improves with angular separation of direct and reflected sounds. J Audio Eng Soc 63(6):427–442
[KTT15]
Zurück zum Zitat Kowalczyk K, Thiergart O, Taseska M, Del Galdo G, Pulkki V, Habets EAP (2015) Parametric spatial sound processing. IEEE Sig Process Mag 32(2):31–42 Kowalczyk K, Thiergart O, Taseska M, Del Galdo G, Pulkki V, Habets EAP (2015) Parametric spatial sound processing. IEEE Sig Process Mag 32(2):31–42
[LBP14]
Zurück zum Zitat Lee T, Baek Y, Park YC, Youn DH (2014) Stereo upmix-based binaural auralization for mobile devices. IEEE Trans Consum Electron 60(3):411–419 Lee T, Baek Y, Park YC, Youn DH (2014) Stereo upmix-based binaural auralization for mobile devices. IEEE Trans Consum Electron 60(3):411–419
[LMG05]
Zurück zum Zitat Loomis JM, Marston JR, Golledge RG, Klatzky RL (2005) Personal guidance system for people with visual impairment: a comparison of spatial displays for route guidance. J Vis Impair blind, 99(4):219–232 Loomis JM, Marston JR, Golledge RG, Klatzky RL (2005) Personal guidance system for people with visual impairment: a comparison of spatial displays for route guidance. J Vis Impair blind, 99(4):219–232
[MeF10]
Zurück zum Zitat Menzer F, Faller C (2010) Stereo-to-binaural conversion using interaural coherence matching. In: Proceedings of 128th AES Convention, London, UK Menzer F, Faller C (2010) Stereo-to-binaural conversion using interaural coherence matching. In: Proceedings of 128th AES Convention, London, UK
[Pul97]
Zurück zum Zitat Pulkki V (1997) Virtual sound source positioning using vector base amplitude panning. J Audio Eng Soc 45(6):456–466 Pulkki V (1997) Virtual sound source positioning using vector base amplitude panning. J Audio Eng Soc 45(6):456–466
[Pul07]
Zurück zum Zitat Pulkki V (2007) Spatial sound reproduction with directional audio coding. J Audio Eng Soc 55(6):503–516 Pulkki V (2007) Spatial sound reproduction with directional audio coding. J Audio Eng Soc 55(6):503–516
[Rum01]
Zurück zum Zitat Rumsey F (2001) Spatial Audio. Focal Press, Oxford, UK Rumsey F (2001) Spatial Audio. Focal Press, Oxford, UK
[Rum02]
Zurück zum Zitat Rumsey F (2002) Spatial quality evaluation for reproduced sound: terminology, meaning, and a scene-based paradigm. J Audio Eng Soc 50(9):651–666 Rumsey F (2002) Spatial quality evaluation for reproduced sound: terminology, meaning, and a scene-based paradigm. J Audio Eng Soc 50(9):651–666
[Rum10]
Zurück zum Zitat Rumsey F (2010) Time-frequency processing for spatial audio. J Audio Eng Soc 58(7/8):655–659 Rumsey F (2010) Time-frequency processing for spatial audio. J Audio Eng Soc 58(7/8):655–659
[Rum11]
Zurück zum Zitat Rumsey F (2011) Spatial audio: eighty years after Blumlein. J Audio Eng Soc 59(1/2):57–62 Rumsey F (2011) Spatial audio: eighty years after Blumlein. J Audio Eng Soc 59(1/2):57–62
[Rum13]
Zurück zum Zitat Rumsey F (2013) Spatial audio processing: upmix, downmix, shake it all about. J Audio Eng Soc 61(6):474–478 Rumsey F (2013) Spatial audio processing: upmix, downmix, shake it all about. J Audio Eng Soc 61(6):474–478
[SPL10]
Zurück zum Zitat Särkämö T, Pihko E, Laitinen S, Forsblom A, Soinila S, Mikkonen M, et al (2010) Music and speech listening enhance the recovery of early sensory processing after stroke. J Cog Neurosci 22(12):2716–2727 Särkämö T, Pihko E, Laitinen S, Forsblom A, Soinila S, Mikkonen M, et al (2010) Music and speech listening enhance the recovery of early sensory processing after stroke. J Cog Neurosci 22(12):2716–2727
[StM15]
Zurück zum Zitat Stefanakis N, Mouchtaris A (2015) Foreground suppression for capturing and reproduction of crowded acoustic environments. In: Proceedings of ICASSP, Brisbane, Australia, pp 51–55 Stefanakis N, Mouchtaris A (2015) Foreground suppression for capturing and reproduction of crowded acoustic environments. In: Proceedings of ICASSP, Brisbane, Australia, pp 51–55
[SHT15]
Zurück zum Zitat Sunder K, He J, Tan EL, Gan WS (2015) Natural sound rendering for headphones: integration of signal processing techniques. IEEE Sig Process Mag 32(2):100–113 Sunder K, He J, Tan EL, Gan WS (2015) Natural sound rendering for headphones: integration of signal processing techniques. IEEE Sig Process Mag 32(2):100–113
[TaG12]
Zurück zum Zitat Tan EL, Gan WS (2012) Reproduction of immersive sound using directional and conventional loudspeakers. J Acoust Soc Am 131(4):3215–3215 Tan EL, Gan WS (2012) Reproduction of immersive sound using directional and conventional loudspeakers. J Acoust Soc Am 131(4):3215–3215
[TGC12]
Zurück zum Zitat Tan EL, Gan WS, Chen CH (2012) Spatial sound reproduction using conventional and parametric loudspeakers. In: Proceedings of APSIPA ASC, Hollywood, CA Tan EL, Gan WS, Chen CH (2012) Spatial sound reproduction using conventional and parametric loudspeakers. In: Proceedings of APSIPA ASC, Hollywood, CA
[UsB07]
Zurück zum Zitat Usher J, Benesty J (2007) Enhancement of spatial sound quality: a new reverberation-extraction audio upmixer. IEEE Trans Audio, Speech, Lang Process 15(7):2141–2150 Usher J, Benesty J (2007) Enhancement of spatial sound quality: a new reverberation-extraction audio upmixer. IEEE Trans Audio, Speech, Lang Process 15(7):2141–2150
[ZiR03]
Zurück zum Zitat Zielinski SK, Rumsey F (2003) Effects of down-mix algorithms on quality of surround sound. J Audio Eng Soc 51(9):780–798 Zielinski SK, Rumsey F (2003) Effects of down-mix algorithms on quality of surround sound. J Audio Eng Soc 51(9):780–798
Metadaten
Titel
Introduction
verfasst von
JianJun He
Copyright-Jahr
2017
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-1551-9_1

Neuer Inhalt