Skip to main content

2016 | OriginalPaper | Buchkapitel

Emotion in Robots Using Convolutional Neural Networks

verfasst von : Mehdi Ghayoumi, Arvind K. Bansal

Erschienen in: Social Robotics

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

These years, emotion recognition has been one of the hot topics in computer science and especially in Human-Robot Interaction (HRI) and Robot-Robot Interaction (RRI). By emotion (recognition and expression), robots can recognize human behavior and emotion better and can communicate in a more human way. On that point are some research for unimodal emotion system for robots, but because, in the real world, Human emotions are multimodal then multimodal systems can work better for the recognition. Yet, beside this multimodality feature of human emotion, using a flexible and reliable learning method can help robots to recognize better and makes more beneficial interaction. Deep learning showed its force in this area and here our model is a multimodal method which use 3 main traits (Facial Expression, Speech and gesture) for emotion (recognition and expression) in robots. We implemented the model for six basic emotion states and there are some other states of emotion, such as mix emotions, which are really laborious to be picked out by robots. Our experiments show that a significant improvement of identification accuracy is accomplished when we use convolutional Neural Network (CNN) and multimodal information system, from 91 % reported in the previous research [27] to 98.8 %.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ekman, P., Friesen, W.V.: Constants across cultures in the face and emotion. J. Pers. Soc. Psychol. 17(2), 124–129 (1971)CrossRef Ekman, P., Friesen, W.V.: Constants across cultures in the face and emotion. J. Pers. Soc. Psychol. 17(2), 124–129 (1971)CrossRef
2.
Zurück zum Zitat Gu, Y., Mai, X., Luo, Y.-J.: Do bodily expressions compete with facial expressions? Time course of integration of emotional signals from the face and the body. PLoS One 8(7), 736–762 (2013) Gu, Y., Mai, X., Luo, Y.-J.: Do bodily expressions compete with facial expressions? Time course of integration of emotional signals from the face and the body. PLoS One 8(7), 736–762 (2013)
3.
Zurück zum Zitat Adolphs, R.: Neural systems for recognizing emotion. Current Opinion in Neurobiology 12(2), 169–177 (2002)CrossRef Adolphs, R.: Neural systems for recognizing emotion. Current Opinion in Neurobiology 12(2), 169–177 (2002)CrossRef
4.
Zurück zum Zitat Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef
5.
Zurück zum Zitat Ghayoumi, M., Bansal, A.K.: Architecture of Emotion in Robots Using Convolutional Neural Networks. RSS, USA (2016) Ghayoumi, M., Bansal, A.K.: Architecture of Emotion in Robots Using Convolutional Neural Networks. RSS, USA (2016)
6.
Zurück zum Zitat Ghayoumi, M., Bansal, A.K.: Multimodal architecture for emotion in robots using deep learning. In: Future Technologies Conference, San Francisco, United States (2016) Ghayoumi, M., Bansal, A.K.: Multimodal architecture for emotion in robots using deep learning. In: Future Technologies Conference, San Francisco, United States (2016)
7.
Zurück zum Zitat Gunes, H., Piccardi, M.: A bimodal face and body gesture database for automatic analysis of human nonverbal affective behavior. In: Proceeding of ICPR 2006 the 18th International Conference on Pattern Recognition, Hong Kong, China (2006) Gunes, H., Piccardi, M.: A bimodal face and body gesture database for automatic analysis of human nonverbal affective behavior. In: Proceeding of ICPR 2006 the 18th International Conference on Pattern Recognition, Hong Kong, China (2006)
8.
Zurück zum Zitat Bänziger, T., Pirker, H., Scherer, K.: Gemep - Geneva multimodal emotion portrayals: a corpus for the study of multimodal emotional expressions. In: Deviller, L., et al. (eds.) Proceedings of LREC 2006 Workshop on Corpora for Research on Emotion and Affect, pp. 15–19, Genoa (2006) Bänziger, T., Pirker, H., Scherer, K.: Gemep - Geneva multimodal emotion portrayals: a corpus for the study of multimodal emotional expressions. In: Deviller, L., et al. (eds.) Proceedings of LREC 2006 Workshop on Corpora for Research on Emotion and Affect, pp. 15–19, Genoa (2006)
9.
Zurück zum Zitat Douglas-Cowie, E., Campbell, N., Cowie, R., Roach, P.: Emotional speech: towards a new generation of databases. Speech Commun. 40(1), 33–60 (2003)CrossRefMATH Douglas-Cowie, E., Campbell, N., Cowie, R., Roach, P.: Emotional speech: towards a new generation of databases. Speech Commun. 40(1), 33–60 (2003)CrossRefMATH
10.
Zurück zum Zitat Gunes, H., Piccardi, M.: Bimodal emotion recognition from expressive face and body gestures. J. Network Computer Appl. 30(4), 1334–1345 (2006)CrossRef Gunes, H., Piccardi, M.: Bimodal emotion recognition from expressive face and body gestures. J. Network Computer Appl. 30(4), 1334–1345 (2006)CrossRef
11.
Zurück zum Zitat el Kaliouby, R., Robinson, P.: Generalization of a vision-based computational model of mind-reading. In: Proceedings of First International Conference on Affective Computing and Intelligent Interfaces, pp. 582–589 (2005) el Kaliouby, R., Robinson, P.: Generalization of a vision-based computational model of mind-reading. In: Proceedings of First International Conference on Affective Computing and Intelligent Interfaces, pp. 582–589 (2005)
12.
Zurück zum Zitat Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., Taylor, J.G.: Emotion recognition in human-computer interaction. IEEE Signal Process. Magazine 18(1), 32–80 (2001)CrossRef Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., Taylor, J.G.: Emotion recognition in human-computer interaction. IEEE Signal Process. Magazine 18(1), 32–80 (2001)CrossRef
13.
Zurück zum Zitat Pontiac, M., Rothkrantz, L.J.M.: Automatic analysis of facial expressions: the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1424–1445 (2000)CrossRef Pontiac, M., Rothkrantz, L.J.M.: Automatic analysis of facial expressions: the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1424–1445 (2000)CrossRef
14.
Zurück zum Zitat Mehrabian, A.: Silent Messages - A Wealth of Information about Nonverbal Communication (Body Language). Personality & Emotion Tests & Software: Psychological Books & Articles of Popular Interest (2009) Mehrabian, A.: Silent Messages - A Wealth of Information about Nonverbal Communication (Body Language). Personality & Emotion Tests & Software: Psychological Books & Articles of Popular Interest (2009)
15.
Zurück zum Zitat Ghayoumi, M., Bansal, A. K.; Real emotion recognition algorithm by detecting symmetry patterns with Dihedral group. In: MCSI (2016) Ghayoumi, M., Bansal, A. K.; Real emotion recognition algorithm by detecting symmetry patterns with Dihedral group. In: MCSI (2016)
16.
Zurück zum Zitat Schultz, W.: Neural coding of basic reward terms of animal learning theory, game theory microeconomics and behavioral ecology. Cur. Opin. Neurobiol. 14(2), 139–147 (2004)CrossRef Schultz, W.: Neural coding of basic reward terms of animal learning theory, game theory microeconomics and behavioral ecology. Cur. Opin. Neurobiol. 14(2), 139–147 (2004)CrossRef
17.
Zurück zum Zitat Panksepp, J.: Affective Neuroscience. Oxford University Press, New York (1998) Panksepp, J.: Affective Neuroscience. Oxford University Press, New York (1998)
18.
Zurück zum Zitat Laird, J.: The Soar Cognitive Architecture. MIT Press, Cambridge (2012) Laird, J.: The Soar Cognitive Architecture. MIT Press, Cambridge (2012)
19.
Zurück zum Zitat Friesen, E., Ekman, P.: Facial action coding system: a technique for the measurement of facial movement, Palo Alto (1978) Friesen, E., Ekman, P.: Facial action coding system: a technique for the measurement of facial movement, Palo Alto (1978)
20.
Zurück zum Zitat Viola, P., Jones, M.J.: Robust real-time face detection. Int. J. Comput. Vis. 57(2), 137–154 (2004)CrossRef Viola, P., Jones, M.J.: Robust real-time face detection. Int. J. Comput. Vis. 57(2), 137–154 (2004)CrossRef
21.
Zurück zum Zitat Abrishami Moghaddam, H., Ghayoumi, M.: Facial image feature extraction using support vector machines. In: Proceeding VISAPP, Setubal, Portugal (2006) Abrishami Moghaddam, H., Ghayoumi, M.: Facial image feature extraction using support vector machines. In: Proceeding VISAPP, Setubal, Portugal (2006)
22.
Zurück zum Zitat Ghayoumi, M., Bansal, A.K.: An integrated approach for efficient analysis of facial expressions. In: SIGMAP, (2014) Ghayoumi, M., Bansal, A.K.: An integrated approach for efficient analysis of facial expressions. In: SIGMAP, (2014)
23.
Zurück zum Zitat Susskind, J.M., Hinton, G.E., Movellan, J.R., Anderson, A.K.: Generating facial expressions with deep belief nets. Affective Computing, Emotion Model. Synth. Recogn., 421-440 (2008) Susskind, J.M., Hinton, G.E., Movellan, J.R., Anderson, A.K.: Generating facial expressions with deep belief nets. Affective Computing, Emotion Model. Synth. Recogn., 421-440 (2008)
24.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G. E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G. E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
25.
Zurück zum Zitat Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: Computer Vision and Pattern Recognition (CVPR), pp. 1891–1898. IEEE (2014) Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: Computer Vision and Pattern Recognition (CVPR), pp. 1891–1898. IEEE (2014)
26.
Zurück zum Zitat Ghayoumi, M., Bansal, A.: Unifying geometric features and facial action units for improved performance of facial expression analysis, CSSCC (2015) Ghayoumi, M., Bansal, A.: Unifying geometric features and facial action units for improved performance of facial expression analysis, CSSCC (2015)
27.
Zurück zum Zitat Ghayoumi, M., Tafar, M., Bansal, A. K.: Towards formal multimodal analysis of emotions for affective computing. DMS (2016) Ghayoumi, M., Tafar, M., Bansal, A. K.: Towards formal multimodal analysis of emotions for affective computing. DMS (2016)
28.
Zurück zum Zitat Huan, Y.: Wu, Ao., Zhang, G., Li, Y.: Extraction of adaptive wavelet packet filter-bank-based acoustic feature for emotion recognition. IET Signal Process. 9(4), 341–348 (2015)CrossRef Huan, Y.: Wu, Ao., Zhang, G., Li, Y.: Extraction of adaptive wavelet packet filter-bank-based acoustic feature for emotion recognition. IET Signal Process. 9(4), 341–348 (2015)CrossRef
29.
Zurück zum Zitat Kwon, O. W., Chan, K., Hao, J., Lee, T. W.: Emotion recognition by speech signals. In: 8th International Conference on Speech Communication and Technology (2003) Kwon, O. W., Chan, K., Hao, J., Lee, T. W.: Emotion recognition by speech signals. In: 8th International Conference on Speech Communication and Technology (2003)
30.
Zurück zum Zitat Lee, C.M., Narayanan, S.S.: Towards detecting emotions in spoken dialog. IEEE Trans. Speech Audio Process. 13(2), 293–303 (2005)CrossRef Lee, C.M., Narayanan, S.S.: Towards detecting emotions in spoken dialog. IEEE Trans. Speech Audio Process. 13(2), 293–303 (2005)CrossRef
31.
Zurück zum Zitat Mitra, S., Acharya, T.: Gesture recognition: a survey. IEEE Trans. Syst. Man Cybern. 37(3), 311–324 (2007)CrossRef Mitra, S., Acharya, T.: Gesture recognition: a survey. IEEE Trans. Syst. Man Cybern. 37(3), 311–324 (2007)CrossRef
32.
Zurück zum Zitat Glowinski, D., Dael, N., Camurri, A., Volpe, G., Mortillaro, M., Scherer, K.: Toward a minimal representation of affective gestures. IEEE Trans. Affect. Comput. 2(2), 106–118 (2011)CrossRef Glowinski, D., Dael, N., Camurri, A., Volpe, G., Mortillaro, M., Scherer, K.: Toward a minimal representation of affective gestures. IEEE Trans. Affect. Comput. 2(2), 106–118 (2011)CrossRef
33.
Zurück zum Zitat Camurri, A., Lagerlö, I., Volpe, G.: Recognizing emotion from dance movement: comparison of spectator recognition and automated techniques. Int. J. Hum. Comput. Stud. 59(1), 213–225 (2003)CrossRef Camurri, A., Lagerlö, I., Volpe, G.: Recognizing emotion from dance movement: comparison of spectator recognition and automated techniques. Int. J. Hum. Comput. Stud. 59(1), 213–225 (2003)CrossRef
34.
Zurück zum Zitat Castellano, G., Villalba, S.D., Camurri, A.: Recognising human emotions from body movement and gesture dynamics. In: Paiva, A.C., Prada, R., Picard, R.W. (eds.) ACII 2007. LNCS, vol. 4738, pp. 71–82. Springer, Heidelberg (2007)CrossRef Castellano, G., Villalba, S.D., Camurri, A.: Recognising human emotions from body movement and gesture dynamics. In: Paiva, A.C., Prada, R., Picard, R.W. (eds.) ACII 2007. LNCS, vol. 4738, pp. 71–82. Springer, Heidelberg (2007)CrossRef
35.
Zurück zum Zitat Ghayoumi, M.: A Review of Multimodal Biometric Systems Fusion Methods and Its Applications. ICIS, USA (2015) Ghayoumi, M.: A Review of Multimodal Biometric Systems Fusion Methods and Its Applications. ICIS, USA (2015)
36.
Zurück zum Zitat Ghayoumi, M., Khan, J., Pourebadi Khotbesara, M., Bauer, E., Hossain, A.: Follower Robot with an Optimized Gesture Recognition System. RSS, USA (2016) Ghayoumi, M., Khan, J., Pourebadi Khotbesara, M., Bauer, E., Hossain, A.: Follower Robot with an Optimized Gesture Recognition System. RSS, USA (2016)
Metadaten
Titel
Emotion in Robots Using Convolutional Neural Networks
verfasst von
Mehdi Ghayoumi
Arvind K. Bansal
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-47437-3_28