Skip to main content
Erschienen in: Soft Computing 1/2011

01.01.2011 | Focus

Evolutionary speech quality estimation in VoIP

verfasst von: Adil Raja, R. M. A. Azad, Colin Flanagan, Conor Ryan

Erschienen in: Soft Computing | Ausgabe 1/2011

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Estimating the quality of Voice over Internet Protocol (VoIP) as perceived by humans is considered a formidable task. This is partly due to the relatively large number of variables that are involved as determinants of quality. Moreover, discerning the significance of one variable over the other is difficult. In this paper a novel approach based on genetic programming (GP) is presented. It maps the effect of network traffic parameters on listeners’ perception of speech quality. The ITU-T Recommendation P.862 (PESQ) algorithm is used as a reference model in this research. The GP discovered models that provide effective VoIP quality estimation are highly correlated to ITU-T Recommendation P.862 (PESQ). They also outperform the ITU-T Recommendation P.563 in estimating the effect that packet loss has on speech quality. The GP discovered models prove suited to real-time and in vivo evaluation of VoIP calls. Additionally, they are deployable on a wide variety of hardware platforms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
2
Adaptive operator probabilities are discussed on page 31 of the GPLab manual.
 
Literatur
Zurück zum Zitat Cole RG, Rosenbluth JH (2001) Voice over ip performance monitoring. SIGCOMM Comput Commun Rev 31(2):9–24CrossRef Cole RG, Rosenbluth JH (2001) Voice over ip performance monitoring. SIGCOMM Comput Commun Rev 31(2):9–24CrossRef
Zurück zum Zitat Davis L (1989) Adapting operator probabilities in genetic algorithms. In: Proceedings of the third international conference on genetic algorithms, San Mateo, CA Davis L (1989) Adapting operator probabilities in genetic algorithms. In: Proceedings of the third international conference on genetic algorithms, San Mateo, CA
Zurück zum Zitat ETSI EN 301 704 V7.2.1. Digital cellular telecommunications system; Adaptive Multi-Rate (AMR) speech transcoding ETSI EN 301 704 V7.2.1. Digital cellular telecommunications system; Adaptive Multi-Rate (AMR) speech transcoding
Zurück zum Zitat Gustafson S, Burke EK, Krasnogor N (2005) On improving genetic programming for symbolic regression. In: Corne D et al (ed) Proceedings of the 2005 IEEE congress on evolutionary computation, vol 1. IEEE Press, Edinburgh, UK, pp 912–919 Gustafson S, Burke EK, Krasnogor N (2005) On improving genetic programming for symbolic regression. In: Corne D et al (ed) Proceedings of the 2005 IEEE congress on evolutionary computation, vol 1. IEEE Press, Edinburgh, UK, pp 912–919
Zurück zum Zitat Hoene C, Karl H, Wolisz A (2004) A perceptual quality model for adaptive VOIP applications. In: Proceedings of international symposium on performance evaluation of computer and telecommunication systems (SPECTS), vol 4. San Jose, California, USA, pp 2573–2577 Hoene C, Karl H, Wolisz A (2004) A perceptual quality model for adaptive VOIP applications. In: Proceedings of international symposium on performance evaluation of computer and telecommunication systems (SPECTS), vol 4. San Jose, California, USA, pp 2573–2577
Zurück zum Zitat ITU-T (1996a) Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP). International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.729 ITU-T (1996a) Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP). International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.729
Zurück zum Zitat ITU-T (1996b) Dual rate speech coder for multimedia communication transmitting at 5.3 and 6.3 kbit/s. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.723.1 ITU-T (1996b) Dual rate speech coder for multimedia communication transmitting at 5.3 and 6.3 kbit/s. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.723.1
Zurück zum Zitat ITU-T (2005a) The E-model, a computational model for use in transmission planning. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.107 ITU-T (2005a) The E-model, a computational model for use in transmission planning. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.107
Zurück zum Zitat ITU-T (2005b) Network model for evaluating multimedia transmission performance over internet protocol. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.1050 ITU-T (2005b) Network model for evaluating multimedia transmission performance over internet protocol. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation G.1050
Zurück zum Zitat ITU-T (2005c) Single-ended method for objective speech quality assessment in narrow-band telephony applications. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation P.563 ITU-T (2005c) Single-ended method for objective speech quality assessment in narrow-band telephony applications. International Telecommunications Union, Geneva, Switzerland. ITU-T Recommendation P.563
Zurück zum Zitat Keijzer M (2003) Improving symbolic regression with interval arithmetic and linear scaling. In: Ryan C, Soule T, Keijzer M, Tsang E, Poli R, Costa E (eds) Genetic programming. Proceedings of EuroGP’2003, vol 2610 of LNCS. Springer-Verlag, Essex, pp 70–82 Keijzer M (2003) Improving symbolic regression with interval arithmetic and linear scaling. In: Ryan C, Soule T, Keijzer M, Tsang E, Poli R, Costa E (eds) Genetic programming. Proceedings of EuroGP’2003, vol 2610 of LNCS. Springer-Verlag, Essex, pp 70–82
Zurück zum Zitat Keijzer M (2004) Scaled symbolic regression. Genetic Program Evolvable Mach 5(3):259–269CrossRef Keijzer M (2004) Scaled symbolic regression. Genetic Program Evolvable Mach 5(3):259–269CrossRef
Zurück zum Zitat Koza JR (1992) Genetic programming: on the programming of computers by means of natural selection. MIT Press, CambridgeMATH Koza JR (1992) Genetic programming: on the programming of computers by means of natural selection. MIT Press, CambridgeMATH
Zurück zum Zitat Luke S, Panait L (2002) Lexicographic parsimony pressure. In: Langdon WB et al (ed) GECCO 2002: proceedings of the genetic and evolutionary computation conference. New York, pp 829–836 Luke S, Panait L (2002) Lexicographic parsimony pressure. In: Langdon WB et al (ed) GECCO 2002: proceedings of the genetic and evolutionary computation conference. New York, pp 829–836
Zurück zum Zitat Mitchell T (1997) Machine learning. McGraw Hill, New YorkMATH Mitchell T (1997) Machine learning. McGraw Hill, New YorkMATH
Zurück zum Zitat Mohamed S, Cervantes-Perez F, Afifi H (2001) Integrating networks measurements and speech quality subjective scores for control purposes. In: Annual joint conference of the IEEE computer and communications societies (INFOCOM), pp 641–649 Mohamed S, Cervantes-Perez F, Afifi H (2001) Integrating networks measurements and speech quality subjective scores for control purposes. In: Annual joint conference of the IEEE computer and communications societies (INFOCOM), pp 641–649
Zurück zum Zitat Mohamed S, Rubino G, Varela M (2004) A method for quantitative evaluation of audio quality over packet networks and its comparison with existing techniques. In: Measurement of speech and audio quality in networks (MESAQIN) Mohamed S, Rubino G, Varela M (2004) A method for quantitative evaluation of audio quality over packet networks and its comparison with existing techniques. In: Measurement of speech and audio quality in networks (MESAQIN)
Zurück zum Zitat Raja A, Azad RMA, Flanagan C, Picovici D, Ryan C (2006) Non-intrusive quality evaluation of VOIP using genetic programming. In: First international conference on bio inspired models of network, information and computer systems, vol 4, pp 2573–2577 Raja A, Azad RMA, Flanagan C, Picovici D, Ryan C (2006) Non-intrusive quality evaluation of VOIP using genetic programming. In: First international conference on bio inspired models of network, information and computer systems, vol 4, pp 2573–2577
Zurück zum Zitat Raja A, Azad RMA, Flanagan C, Ryan C (2007) Real-time, non-intrusive evaluation of VoIP. In: Ebner M, O’Neill M, Ekárt A, Vanneschi L, Isabel Esparcia-Alcázar A (eds) Proceedings of the 10th European Conference on Genetic Programming, volume 4445 of Lecture Notes in Computer Science. Springer, Valencia, Spain, pp. 217–228 Raja A, Azad RMA, Flanagan C, Ryan C (2007) Real-time, non-intrusive evaluation of VoIP. In: Ebner M, O’Neill M, Ekárt A, Vanneschi L, Isabel Esparcia-Alcázar A (eds) Proceedings of the 10th European Conference on Genetic Programming, volume 4445 of Lecture Notes in Computer Science. Springer, Valencia, Spain, pp. 217–228
Zurück zum Zitat Sun L, Ifeachor EC (2002) Perceived speech quality prediction for voice over ip-based networks. In: IEEE international conference on communications (ICC), vol 4, pp 2573–2577 Sun L, Ifeachor EC (2002) Perceived speech quality prediction for voice over ip-based networks. In: IEEE international conference on communications (ICC), vol 4, pp 2573–2577
Zurück zum Zitat Sun L, Ifeachor EC (2006) Voice quality prediction models and their application in VoIP networks. IEEE Trans Multimed 8(4):809–820CrossRef Sun L, Ifeachor EC (2006) Voice quality prediction models and their application in VoIP networks. IEEE Trans Multimed 8(4):809–820CrossRef
Zurück zum Zitat Sun L, Wade G, Lines BM, Ifeachor EC (2001) Impact of packet loss location on perceived speech quality. In: 2nd IP-telephony workshop. Columbia University, New York Sun L, Wade G, Lines BM, Ifeachor EC (2001) Impact of packet loss location on perceived speech quality. In: 2nd IP-telephony workshop. Columbia University, New York
Zurück zum Zitat Thorpe L, Yang W (1996) Performance of current perceptual objective speech quality measures. In: IEEE international speech coding, vol 1, pp 144–146 Thorpe L, Yang W (1996) Performance of current perceptual objective speech quality measures. In: IEEE international speech coding, vol 1, pp 144–146
Metadaten
Titel
Evolutionary speech quality estimation in VoIP
verfasst von
Adil Raja
R. M. A. Azad
Colin Flanagan
Conor Ryan
Publikationsdatum
01.01.2011
Verlag
Springer-Verlag
Erschienen in
Soft Computing / Ausgabe 1/2011
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-009-0521-2

Weitere Artikel der Ausgabe 1/2011

Soft Computing 1/2011 Zur Ausgabe