Models of Human Perception

BT Technology Journal

Abstract

As communications systems have increased in complexity it has become increasingly difficult to measure their performance objectively. In particular, when signals are compressed for more efficient transmission, conventional engineering metrics fail to predict the performance experienced by the end user — the customer. A new generation of objective measurement techniques is emerging based on models of human perception. Such models are potentially able to predict subjective performance for a wide range of transmission technologies. This paper introduces auditory and visual perceptual models and describes the development at BT Laboratories of models suitable for performance assessment. Early applications of the models are also described.




Cite this article

Rix, A.W., Bourret, A. & Hollier, M.P. Models of Human Perception. BT Technology Journal 17, 24–34 (1999). https://doi.org/10.1023/A:1009662506355


  • DOI: https://doi.org/10.1023/A:1009662506355
