DOI: 10.1145/2502081.2502223
research-article

The social signal interpretation (SSI) framework: multimodal signal processing and recognition in real-time

Published: 21 October 2013

ABSTRACT

Automatic detection and interpretation of social signals carried by voice, gestures, facial expressions, etc. will play a key role in next-generation interfaces, as it paves the way towards more intuitive and natural human-computer interaction. This paper introduces Social Signal Interpretation (SSI), a framework for real-time recognition of social signals. SSI supports a large range of sensor devices, filter and feature algorithms, as well as machine learning and pattern recognition tools. It encourages developers to add new components using SSI's C++ API, but also addresses front-end users by offering an XML interface to build pipelines with a text editor. SSI is freely available under GPL at http://openssi.net.
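
As a rough illustration of the kind of processing chain the abstract describes (sensor, filter, feature extraction, recognition), the following Python sketch wires four small stages together. It is not SSI's C++ API or its XML pipeline format: every function name, the frame size, and the fixed threshold are hypothetical stand-ins chosen only to show how such stages might compose.

```python
from typing import Iterable, Iterator, List

Frame = List[float]


def frames(samples: Iterable[float], size: int) -> Iterator[Frame]:
    """Hypothetical 'sensor' stage: cut a sample stream into fixed-size frames.

    A trailing incomplete frame is dropped.
    """
    buf: Frame = []
    for s in samples:
        buf.append(s)
        if len(buf) == size:
            yield buf
            buf = []


def remove_offset(frame: Frame) -> Frame:
    """Hypothetical 'filter' stage: subtract the frame mean (DC offset)."""
    mean = sum(frame) / len(frame)
    return [x - mean for x in frame]


def energy(frame: Frame) -> float:
    """Hypothetical 'feature' stage: mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame)


def classify(value: float, threshold: float = 0.5) -> str:
    """Hypothetical 'recognition' stage: a fixed threshold stands in for a trained model."""
    return "active" if value > threshold else "idle"


def run_pipeline(samples: Iterable[float], size: int = 4) -> List[str]:
    """Chain the stages: frame -> filter -> feature -> label."""
    return [classify(energy(remove_offset(f))) for f in frames(samples, size)]


# One quiet frame followed by one high-energy frame.
signal = [0.0, 0.1, 0.0, 0.1, 2.0, -2.0, 2.0, -2.0]
labels = run_pipeline(signal)
print(labels)  # ['idle', 'active']
```

A real-time system would process frames as they arrive from a device instead of collecting them in a list; the list is used here only to keep the sketch short and easy to check.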


Published in

MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN: 9781450324045
DOI: 10.1145/2502081

Copyright © 2013 ACM

Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

MM '13 paper acceptance rate: 47 of 235 submissions, 20%
Overall acceptance rate: 995 of 4,171 submissions, 24%
