ABSTRACT
Small group interaction occurs often in workplace and education settings. The dynamic progression of such interaction is an essential factor in determining the group's final performance outcomes. The personality of each individual within the group is reflected in his/her interpersonal behaviors with other members of the group as they engage in these task-oriented interactions. In this work, we propose an interlocutor-modulated attention BLSTM (IM-aBLSTM) architecture that models an individual's vocal behaviors during small group interactions in order to automatically infer his/her personality traits. The interlocutor-modulated attention mechanism jointly optimizes the relevant interpersonal vocal behaviors of the other members of the group during interactions. Specifically, we evaluate our proposed IM-aBLSTM on one of the largest small group interaction databases, the ELEA corpus. Our framework achieves a promising unweighted recall accuracy of 87.9% in ten different binary personality trait prediction tasks, which outperforms the best results previously reported on the same database by 10.4% absolute. Finally, by analyzing the interpersonal vocal behaviors in the regions of high attention weights, we observe several distinct intra- and inter-personal vocal behavior patterns that vary as a function of personality traits.
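The headline metric can be illustrated with a small sketch. It assumes that "unweighted recall" here means unweighted average recall (UAR), the mean of the per-class recalls, which is a common choice for binary tasks with imbalanced classes. The function name and the toy labels below are hypothetical and are not taken from the ELEA corpus or from the paper's experiments.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls, so every class counts equally.

    Assumption: this is the metric the abstract calls "unweighted recall".
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    total = defaultdict(int)    # samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1

    recalls = [correct[label] / total[label] for label in total]
    return sum(recalls) / len(recalls)


# Toy, made-up labels for one binary trait (1 = high, 0 = low).
y_true = [1, 1, 1, 1, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.3f}, accuracy = {accuracy:.3f}")
```

In this toy case the recall is 3/4 for the majority class and 1/2 for the minority class, so the UAR is lower than plain accuracy, which is dominated by the larger class.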
Index Terms
- Using Interlocutor-Modulated Attention BLSTM to Predict Personality Traits in Small Group Interaction