research-article

The social signal interpretation (SSI) framework: multimodal signal processing and recognition in real-time

Authors:
Johannes Wagner, Florian Lingenfelser, Tobias Baur, Ionut Damian, Felix Kistler, Elisabeth André

Lab for Human Centered Multimedia, University of Augsburg, Augsburg, Germany

MM '13: Proceedings of the 21st ACM international conference on Multimedia, October 2013, Pages 831–834, https://doi.org/10.1145/2502081.2502223

Published: 21 October 2013

ABSTRACT

Automatic detection and interpretation of social signals carried by voice, gestures, facial expressions, etc. will play a key role in next-generation interfaces, as it paves the way towards more intuitive and natural human-computer interaction. This paper introduces Social Signal Interpretation (SSI), a framework for real-time recognition of social signals. SSI supports a large range of sensor devices, filter and feature algorithms, as well as machine learning and pattern recognition tools. It encourages developers to add new components using SSI's C++ API, but also addresses front-end users by offering an XML interface to build pipelines with a text editor. SSI is freely available under GPL at http://openssi.net.

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
  2. Machine learning
2. Hardware
  1. Communication hardware, interfaces and storage
    1. Signal processing systems

Published in
MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN: 9781450324045
DOI: 10.1145/2502081
General Chairs: Alejandro (Alex) Jaimes (Yahoo!, Spain), Nicu Sebe (University of Trento, Italy), Nozha Boujemaa (INRIA, France)
Program Chairs: Daniel Gatica-Perez (IDIAP & EPFL, Switzerland), David A. Shamma (Yahoo!, USA), Marcel Worring (University of Amsterdam, The Netherlands), Roger Zimmermann (National University of Singapore, Singapore)
Copyright © 2013 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
multimodal fusion
open source framework
real-time pattern recognition
social signal processing
Acceptance Rates
MM '13 paper acceptance rate: 47 of 235 submissions, 20%
Overall acceptance rate: 995 of 4,171 submissions, 24%