skip to main content
10.1145/2522848.2531738acmconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
research-article

Partial least squares regression on grassmannian manifold for emotion recognition

Published:09 December 2013Publication History

ABSTRACT

In this paper, we propose a method for video-based human emotion recognition. For each video clip, all frames are represented as an image set, which can be modeled as a linear subspace to be embedded in Grassmannian manifold. After feature extraction, Class-specific One-to-Rest Partial Least Squares (PLS) is learned on video and audio data respectively to distinguish each class from the other confusing ones. Finally, an optimal fusion of classifiers learned from both modalities (video and audio) is conducted at decision level. Our method is evaluated on the Emotion Recognition In The Wild Challenge (EmotiW 2013). The experimental results on both validation set and blind test set are presented for comparison. The final accuracy achieved on test set outperforms the baseline by 26%.

References

  1. H. Cevikalp and B. Triggs. Face recognition based on image sets. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 2567--2573, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  2. A. Dhall, R. Goecke, J. Joshi, M. Wagner, and T. Gedeon. Emotion recognition in the wild challenge 2013. In ACM International Conference on Multimodal Interaction (ICMI), pages 2496--2503. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Dhall, R. Goecke, S. Lucey, and T. Gedeon. Collecting large, richly annotated facial-expression databases from movies. IEEE MultiMedia, 19(3):0034, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. F. Eyben, M. Wöllmer, and B. Schuller. Opensmile: the munich versatile and fast open-source audio feature extractor. In ACM International Conference on Multimedia, pages 1459--1462. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. Guo and G. Mu. Simultaneous dimensionality reduction and human age estimation via kernel partial least squares regression. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 657--664, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Hamm and D. D. Lee. Grassmann discriminant analysis: a unifying view on subspace-based learning. In Proceedings of the 25th international conference on Machine learning, pages 376--383, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. T. Harandi, C. Sanderson, S. Shirazi, and B. C. Lovell. Graph embedding discriminant analysis on grassmannian manifolds for improved image set matching. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 2705--2712, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T.-K. Kim, J. Kittler, and R. Cipolla. Discriminative learning and recognition of image set classes using canonical correlations. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(6):1005--1018, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Koelstra, M. Pantic, and I. Patras. A dynamic texture-based approach to recognition of facial actions and their temporal models. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 32(11):1940--1954, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Liu, S. Li, S. Shan, and X. Chen. Au-aware deep networks for facial expression recognition. In IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2013.Google ScholarGoogle Scholar
  11. M. Pantic and L. J. M. Rothkrantz. Automatic analysis of facial expressions: The state of the art. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 22(12):1424--1445, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Rosipal and N. Krämer. Overview and recent advances in partial least squares. In Subspace, Latent Structure and Feature Selection, pages 34--51. Springer, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. A. Turk and A. P. Pentland. Face recognition using eigenfaces. In Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, pages 586--591, 1991.Google ScholarGoogle Scholar
  14. R. Wang, H. Guo, L. S. Davis, and Q. Dai. Covariance discriminative learning: A natural and efficient approach to image set classification. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 2496--2503, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Wang, S. Shan, X. Chen, and W. Gao. Manifold-manifold distance with application to face recognition based on image set. In Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, pages 1--8, 2008.Google ScholarGoogle Scholar
  16. T. Wang and P. Shi. Kernel grassmannian distances and discriminant analysis for face recognition from image sets. Pattern Recognition Letters, 30(13):1161--1165, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. H. Wold. Partial least squares. Encyclopedia of statistical sciences, 1985.Google ScholarGoogle Scholar
  18. P. Yang, Q. Liu, and D. N. Metaxas. Boosting coded dynamic features for facial action units and facial expression recognition. In Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, pages 1--6, 2007.Google ScholarGoogle Scholar
  19. Y. Zhang and J. Qiang. Active and dynamic information fusion for facial expression understanding from image sequences. Pattern Analysis and Machine Intelligence, IEEE Transaction on, 27(5):699--714, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. G. Zhao and M. Pietikainen. Dynamic texture recognition using local binary patterns with an application to facial expressions. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(6):915--928, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Partial least squares regression on grassmannian manifold for emotion recognition

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICMI '13: Proceedings of the 15th ACM on International conference on multimodal interaction
          December 2013
          630 pages
          ISBN:9781450321297
          DOI:10.1145/2522848

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 9 December 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          ICMI '13 Paper Acceptance Rate49of133submissions,37%Overall Acceptance Rate453of1,080submissions,42%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader