Skip to main content
Top
Published in: Neural Computing and Applications 7/2011

01-10-2011 | ICONIP2009

Decoding ambisonic signals to irregular quad loudspeaker configuration based on hybrid ANN and modified tabu search

Authors: P. W. M. Tsang, K. W. K. Cheung, A. C. S. Leung

Published in: Neural Computing and Applications | Issue 7/2011

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Past research has proven that a first-order B-format ambisonic signal can be used to partially reconstruct the original sound field through a collection of arbitrary positioned loudspeakers. This is achieved by setting the gain of each loudspeaker to be the weighted sum of the three components in the B-format signal. Deduction of the weighting factors (a.k.a. the decoding parameters) has been successfully accomplished with the use of the Modified Tabu Search (MTS), and later with the Heuristic Genetic Algorithm (HGA) which provides higher precision and stability. Despite the favorable outcome, both methods involve large amount of iterations and the computation time is lengthy. In this paper, we propose a scheme to overcome this problem based on the integration of Neural Network Estimation (NNE) and the MTS. Compared to HGA, the new approach is about two orders of magnitude faster, and at the same time capable of attaining similar precision in determining the decoding parameters.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Rumsey F (2001) Spatial audio. Focal Press, pp 111–118 Rumsey F (2001) Spatial audio. Focal Press, pp 111–118
2.
go back to reference Gerzon MA (1970) Surround-sound from 2-channel stereo. HiFi News, August Gerzon MA (1970) Surround-sound from 2-channel stereo. HiFi News, August
3.
go back to reference Gerzon MA (1974) Surround-sound psychoacoustics. Wireless World. Wireless World, December Gerzon MA (1974) Surround-sound psychoacoustics. Wireless World. Wireless World, December
4.
go back to reference Gerzon MA (1976) Multidirectional sound reproduction systems. United States Patent, 3,997,725, December Gerzon MA (1976) Multidirectional sound reproduction systems. United States Patent, 3,997,725, December
5.
go back to reference Gerzon MA (1985) Ambisonics in multichannel broadcasting and video. J Audio Eng Soc 33(11):859–871 Gerzon MA (1985) Ambisonics in multichannel broadcasting and video. J Audio Eng Soc 33(11):859–871
6.
go back to reference Fellgett P (1975) Ambisonics. Part one: general system description. Studio Sound, pp 20–40 Fellgett P (1975) Ambisonics. Part one: general system description. Studio Sound, pp 20–40
7.
go back to reference Malham D (2007) Higher order Ambisonic systems., Space in Music–Music in Space (Mphil thesis), University of York Malham D (2007) Higher order Ambisonic systems., Space in Music–Music in Space (Mphil thesis), University of York
9.
go back to reference Gerzon MA (1983) Decoders for feeding irregular loudspeaker arrays. United States Patent 4,414,430, November 1983 Gerzon MA (1983) Decoders for feeding irregular loudspeaker arrays. United States Patent 4,414,430, November 1983
10.
go back to reference Farina A (1998) Software Implementation of B-format encoding and decoding. Pre-prints of the 104th AES Conv., Amsterdam Farina A (1998) Software Implementation of B-format encoding and decoding. Pre-prints of the 104th AES Conv., Amsterdam
11.
go back to reference Wiggins B et al (2003) The design and optimization of surround sound decoders using heuristic methods. In: Proceedings of UKSim’03, pp 106–114 Wiggins B et al (2003) The design and optimization of surround sound decoders using heuristic methods. In: Proceedings of UKSim’03, pp 106–114
12.
go back to reference Wiggins B (2004) An investigation into the real-time manipulation and control of three-dimensional sound fields. PhD thesis, University of Derby, Derby Wiggins B (2004) An investigation into the real-time manipulation and control of three-dimensional sound fields. PhD thesis, University of Derby, Derby
13.
go back to reference Gerzon MA (1992) General metatheory of auditory localization, 92nd Conv. Audio Eng. Soc, Vienna Gerzon MA (1992) General metatheory of auditory localization, 92nd Conv. Audio Eng. Soc, Vienna
14.
go back to reference Moore D, Wakefield J (2006) An enhanced approach to surround sound decoder design. In: Proc. of Comp. and Engg. Ann, Res. (CEARC’06). University of Huddersfield, Huddersfield, pp 1–6 Moore D, Wakefield J (2006) An enhanced approach to surround sound decoder design. In: Proc. of Comp. and Engg. Ann, Res. (CEARC’06). University of Huddersfield, Huddersfield, pp 1–6
15.
go back to reference Moore D, Wakefield J (2007) The design of improved first order ambisonic decoders by the application of range removal and importance in a heuristic search algorithm. In: 31st Audio Engg. Soc. Int’l Conf., June 2007 Moore D, Wakefield J (2007) The design of improved first order ambisonic decoders by the application of range removal and importance in a heuristic search algorithm. In: 31st Audio Engg. Soc. Int’l Conf., June 2007
16.
go back to reference Tsang PWM, Cheung KWK (2009) Development of a re-configurable ambisonic decoder for irregular loudspeaker configuration. IET Circuits Devices Syst 3(4):197–203 Tsang PWM, Cheung KWK (2009) Development of a re-configurable ambisonic decoder for irregular loudspeaker configuration. IET Circuits Devices Syst 3(4):197–203
17.
go back to reference Herrera F et al (2003) A taxonomy for the crossover operator for real-coded genetic algorithms: an experimental study. Int J Intel Syst 18:309–338 Herrera F et al (2003) A taxonomy for the crossover operator for real-coded genetic algorithms: an experimental study. Int J Intel Syst 18:309–338
18.
go back to reference Tsang PWM, Cheung, WK, Leung CS (2009) Decoding ambisonic signals to irregular loudspeaker configuration based on artificial neural network. Neural Inform Process 5864:273–280 (Springer, Berlin) Tsang PWM, Cheung, WK, Leung CS (2009) Decoding ambisonic signals to irregular loudspeaker configuration based on artificial neural network. Neural Inform Process 5864:273–280 (Springer, Berlin)
19.
go back to reference Rumelhart DE, Hinton GE, Williams RJ (1986) “Learning internal representations by error propagation”, parallel data processing, vol 1, chap 8. The M.I.T. Press, Cambridge, pp 318–362 Rumelhart DE, Hinton GE, Williams RJ (1986) “Learning internal representations by error propagation”, parallel data processing, vol 1, chap 8. The M.I.T. Press, Cambridge, pp 318–362
20.
go back to reference Hagan MT, Menhaj M (1994) Training feed-forward networks with the Marquardt algorithm. IEEE Trans Neural Netw 5(6):989–993CrossRef Hagan MT, Menhaj M (1994) Training feed-forward networks with the Marquardt algorithm. IEEE Trans Neural Netw 5(6):989–993CrossRef
21.
go back to reference Hagan MT, Demuth HB, Beale MH (1996) Neural network design. PWS Publishing, Boston, MA Hagan MT, Demuth HB, Beale MH (1996) Neural network design. PWS Publishing, Boston, MA
22.
go back to reference Glover F, Laguna M (1997) Tabu search. Kluwer, London Glover F, Laguna M (1997) Tabu search. Kluwer, London
23.
go back to reference Wiggins B (2007) The generation of panning laws for irregular speaker arrays using heuristic methods. AES 31st International Conference Wiggins B (2007) The generation of panning laws for irregular speaker arrays using heuristic methods. AES 31st International Conference
24.
go back to reference Poletti MA (2000) A unified theory of horizontal holographic sound systems. JAES 48(12):1155–1182 Poletti MA (2000) A unified theory of horizontal holographic sound systems. JAES 48(12):1155–1182
25.
go back to reference Betlehem T, Abhayapala TD (2005) Theory and design of sound field reproduction in reverberant rooms. ASAJ 117(4):2100–2111 Betlehem T, Abhayapala TD (2005) Theory and design of sound field reproduction in reverberant rooms. ASAJ 117(4):2100–2111
26.
go back to reference Noisternig M, Musil T, Sontacchi A, Höldrich R (2003) 3D binaural sound reproduction using a virtual ambisonic approach. VECIMS 2003. In: lnt Sym. Vir. Env. Hum. Comp. Int. and Meas. Sys. pp 174–178 Noisternig M, Musil T, Sontacchi A, Höldrich R (2003) 3D binaural sound reproduction using a virtual ambisonic approach. VECIMS 2003. In: lnt Sym. Vir. Env. Hum. Comp. Int. and Meas. Sys. pp 174–178
Metadata
Title
Decoding ambisonic signals to irregular quad loudspeaker configuration based on hybrid ANN and modified tabu search
Authors
P. W. M. Tsang
K. W. K. Cheung
A. C. S. Leung
Publication date
01-10-2011
Publisher
Springer-Verlag
Published in
Neural Computing and Applications / Issue 7/2011
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-010-0397-1

Other articles of this Issue 7/2011

Neural Computing and Applications 7/2011 Go to the issue

Premium Partner