2015 | OriginalPaper | Chapter

Prediction Model of Multi-channel Audio Quality Based on Multiple Linear Regression

Authors: Jing Wang, Yi Zhao, Wenzhi Li, Fei Wang, Zesong Fei, Xiang Xie

Published in: Advances in Multimedia Information Processing -- PCM 2015

Publisher: Springer International Publishing

Abstract

Perceived audio quality is an important metric for measuring the perceptual degradation of multi-channel audio signals, especially in coding and rendering systems. Conventional objective quality measures such as PEAQ (Perceptual Evaluation of Audio Quality) are limited in their ability to describe both the basic audio quality and the spatial impression. A novel model is proposed to predict the subjective quality of 5.1-channel audio systems. The evaluation covers two attributes: basic audio quality and surround effects. Multiple Linear Regression (MLR) combined with Principal Component Analysis (PCA) is used to establish the prediction model, which maps objective parameters to subjective audio quality. The data set for model training and testing is obtained from formal listening tests under different coding conditions. Preliminary experimental results with 5.1-channel audio show that the proposed model predicts multi-channel audio quality more accurately than the conventional PEAQ method when both basic audio quality and surround effects are considered.
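The combination of PCA and MLR named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The feature matrix is synthetic random data standing in for objective parameters, and the target vector stands in for listening-test scores. The function names (`fit_pca_mlr`, `predict_pca_mlr`), the number of retained components, and the train/test split are all assumptions made for illustration.

```python
import numpy as np

def fit_pca_mlr(X, y, n_components):
    """Fit PCA on standardized features, then least-squares MLR on the PC scores."""
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0                       # guard against constant columns
    Z = (X - mean) / std
    # PCA via SVD: rows of Vt are the principal directions, ordered by variance.
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    components = Vt[:n_components].T          # shape (n_features, n_components)
    scores = Z @ components
    # Multiple linear regression with an intercept column.
    A = np.column_stack([np.ones(len(scores)), scores])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return {"mean": mean, "std": std, "components": components, "coef": coef}

def predict_pca_mlr(model, X):
    """Project new features onto the stored components and apply the regression."""
    Z = (X - model["mean"]) / model["std"]
    scores = Z @ model["components"]
    return model["coef"][0] + scores @ model["coef"][1:]

# Synthetic stand-in data (assumption): 60 items, 6 objective parameters.
rng = np.random.default_rng(0)
n_items, n_params = 60, 6
X = rng.normal(size=(n_items, n_params))
X[:, 3] = X[:, 0] + 0.05 * rng.normal(size=n_items)   # two correlated parameters
true_w = rng.normal(size=n_params)
y = X @ true_w + 0.1 * rng.normal(size=n_items)       # stand-in for subjective scores

train, test = slice(0, 40), slice(40, None)
model = fit_pca_mlr(X[train], y[train], n_components=4)
pred = predict_pca_mlr(model, X[test])

# Two common agreement measures between predicted and reference scores.
corr = np.corrcoef(pred, y[test])[0, 1]
rmse = np.sqrt(np.mean((pred - y[test]) ** 2))
print(f"correlation = {corr:.3f}, RMSE = {rmse:.3f}")
```

Running PCA before the regression decorrelates the inputs, which keeps the least-squares fit stable when several objective parameters carry overlapping information. How many components to retain is a tuning choice that this sketch fixes arbitrarily.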

Metadata
Title: Prediction Model of Multi-channel Audio Quality Based on Multiple Linear Regression
Authors: Jing Wang, Yi Zhao, Wenzhi Li, Fei Wang, Zesong Fei, Xiang Xie
Copyright Year: 2015
DOI: https://doi.org/10.1007/978-3-319-24075-6_66