skip to main content
research-article

Video quality for face detection, recognition, and tracking

Published:02 September 2011Publication History
Skip Abstract Section

Abstract

Many distributed multimedia applications rely on video analysis algorithms for automated video and image processing. Little is known, however, about the minimum video quality required to ensure an accurate performance of these algorithms. In an attempt to understand these requirements, we focus on a set of commonly used face analysis algorithms. Using standard datasets and live videos, we conducted experiments demonstrating that the algorithms show almost no decrease in accuracy until the input video is reduced to a certain critical quality, which amounts to significantly lower bitrate compared to the quality commonly acceptable for human vision. Since computer vision percepts video differently than human vision, existing video quality metrics, designed for human perception, cannot be used to reason about the effects of video quality reduction on accuracy of video analysis algorithms. We therefore investigate two alternate video quality metrics, blockiness and mutual information, and show how they can be used to estimate the critical video qualities for face analysis algorithms.

References

  1. Bradski, G. R. 1998. Computer vision face tracking as a component of a perceptual user interface. In Proceedings of the 4th IEEE Workshop on Applications of Computer Vision (WACV'98). IEEE, Los Alamitos, CA, 214--219. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Collins, R., Lipton, A., Kanade, T., Fujiyoshi, H., Duggins, D., Tsin, Y., Tolliver, D., Enomoto, N., and Hasegawa, O. 2000. A system for video surveillance and monitoring. Tech. rep. CMU-RI-TR-00-12, Robotics Institute, Carnegie Mellon University.Google ScholarGoogle Scholar
  3. Eisert, P. and Girod, B. 1998. Model-based coding of facial image sequences at varying illumination conditions. In Proceedings of the IMDSP Workshop, 199--122.Google ScholarGoogle Scholar
  4. Eisert, P., Wiegand, T., and Girod, B. 2000. Model-aided coding: A new approach to incor- porate facial animationinto motion-compensated video coding. IEEE Trans. Circuits Syst.Video Technol. 10, 344--358. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Eleftheriadis, A. and Anastassiou, D. 1995. Constrained and general dynamic rate shaping of compressed digital video. In Proceedings of the IEEE International Conference on Image Processing (ICIP'95), IEEE, Los Alamitos, CA, 396--399. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Girgensohn, A., Kimber, D., Vaughan, J., Yang, T., Shipman, F., Turner, T., Rieffel, E., Wilcox, L., Chen, F., and Dunnigan, T. 2007. DOTS: Support for effective video surveillance. In Proceedings of the 15th ACM International Conference on Multimedia (ACMMM'07). ACM, New York, 423--432. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Grother, P. J., Micheals, R. J., and Phillips, P. 2003. Face recognition vendor test 2002 performance metrics. In Proceedings of the 4th International Conference on Audio Visual Based Person Authentication (AVBPA'03). 937--945. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hakeem, A., Shafique, K., and Shah, M. 2005. Anobject-based video coding framework for video sequences obtained from static cameras. In Proceedings of the 13th ACM International Conference on Multimedia (ACMMM'05). ACM, New York, 608--617. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Javed, O., Rasheed, Z., Alatas, O., and Shah, M. 2003. KNIGHT M : A real-time surveillance system for multiple overlapping and non-overlapping cameras. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'03), IEEE, Los Alamitos, CA, 649--652. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kim, J., Wang, Y., and Chang, S. 2003. Content-adaptive utility-based video adaptation. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'03). Vol. 3, IEEE, Los Alamitos, CA, 281--284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kim, M. and Altunbasak, Y. 2001. Optimal dynamic rate shaping for compressed video stream- ing. In Proceedings of the International Conference on Networking (ICN'01). 786--794. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Korshunov, P. and Ooi, W. T. 2005. Critical video quality for distributed automated video surveillance. In Proceedings of the 13th ACM International Conference on Multimedia (ACMMM'05). ACM, New York, 151--160. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Lu, J., Plataniotis, K. N., and Venetsanopoulos, A. N. 2003. Regularized discriminant analysis for the small sample size problem in face recognition. Patt. Recogn. Lett. 24, 3079--3087. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Muijs, R. and Kirenko, I. 2005. A no-reference blocking artifact. measure for adaptive video processing. In Proceedings of the 13th European Singal Processing Conference (EUSIPCO'05).Google ScholarGoogle Scholar
  15. Nair, V. and Clark, J. J. 2002. Automated visual surveillance using hidden markov models. In Proceedings of the 15th International Conference on Vision Interface (VI'02), 88--92.Google ScholarGoogle Scholar
  16. Rangaswami, R., Dimitrijevi, Z., Kakligian, K., Chang, E., and Wang, Y. 2004. The SfinX video surveillance system. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'04), IEEE, Los Alamitos, CA.Google ScholarGoogle Scholar
  17. Rouse, P. and Hemami, S. 2008. Analyzing the role of visual structure in the recognition of natural image content with multi-scale SSIM. In Proceedings of SPIE Human Vision and Electronic Imaging XIII Conference. Vol. 6806, 680615.1--680615.14.Google ScholarGoogle Scholar
  18. Rowley, H., Baluja, S., and Kanade, T. 1998. Neural network-based face detection. IEEE Trans. Patt. Anal, Mach. Intell. 20, 23--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Sanchez, V., Basu, A., and Mandal, M. 2004. Prioritized region of interest coding in JPEG2000. In Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04). Vol. 2, 799--802. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Schumeyer, R., Heredia, E. A., and Barner, K. E. 1997. Region of interest priority coding for sign language videoconferencing. In Proceedings of the First IEEE Workshop on Multimedia Signal Processing (MMSP'05). IEEE, Los Alamitos, CA, 531--536.Google ScholarGoogle Scholar
  21. Shannon, C. 1948. A mathematical theory of communication. Bell Syst. Tech. J. 27, 379--423.Google ScholarGoogle ScholarCross RefCross Ref
  22. Smolic, A., Makai, B., and Sikora, T. 1999. Real-time estimation of long-term 3-D motion parameters for snhcface animation and model-based coding applications. IEEE Trans. Circuits Syst.Video Technol. 2, 255--263. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Viola, P. and Jones, M. 2001. Robust real-time face detection. In Proceedings of the ICCV Workshop on Statistical and Computation Theories of Vision (ICCV'01). Vol. 2, 747.Google ScholarGoogle Scholar
  24. Wu, Y., Jiao, L., Wu, G., Chang, E., and Wang, Y. 2003. Invariant feature extraction and biased statistical inference for video surveillance. In Proceedings of the IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS'03), IEEE, Los Alamitos, CA, 284--289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Yuan, X., Sun, Z., Varol, Y., and Bebis, G. 2003. A distributed visual surveillance system. In Proceedings of the IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS'03). IEEE, Los Alamitos, CA. 199--204. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Video quality for face detection, recognition, and tracking

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Multimedia Computing, Communications, and Applications
          ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 7, Issue 3
          August 2011
          117 pages
          ISSN:1551-6857
          EISSN:1551-6865
          DOI:10.1145/2000486
          Issue’s Table of Contents

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 2 September 2011
          • Accepted: 1 November 2009
          • Revised: 1 July 2009
          • Received: 1 December 2008
          Published in tomm Volume 7, Issue 3

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader