Skip to main content
Erschienen in: Cognitive Computation 3/2018

26.12.2017

Emotional Human-Machine Conversation Generation Based on Long Short-Term Memory

verfasst von: Xiao Sun, Xiaoqi Peng, Shuai Ding

Erschienen in: Cognitive Computation | Ausgabe 3/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With the rise in popularity of artificial intelligence, the technology of verbal communication between man and machine has received an increasing amount of attention, but generating a good conversation remains a difficult task. The key factor in human-machine conversation is whether the machine can give good responses that are appropriate not only at the content level (relevant and grammatical) but also at the emotion level (consistent emotional expression). In our paper, we propose a new model based on long short-term memory, which is used to achieve an encoder-decoder framework, and we address the emotional factor of conversation generation by changing the model’s input using a series of input transformations: a sequence without an emotional category, a sequence with an emotional category for the input sentence, and a sequence with an emotional category for the output responses. We perform a comparison between our work and related work and find that we can obtain slightly better results with respect to emotion consistency. Although in terms of content coherence our result is lower than those of related work, in the present stage of research, our method can generally generate emotional responses in order to control and improve the user’s emotion. Our experiment shows that through the introduction of emotional intelligence, our model can generate responses appropriate not only in content but also in emotion.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado G, Davis A, Dean J, Devin M. 2016. Tensorflow: large-scale machine learning on heterogeneous distributed systems. Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado G, Davis A, Dean J, Devin M. 2016. Tensorflow: large-scale machine learning on heterogeneous distributed systems.
2.
Zurück zum Zitat André E, Rehm M, Minker W, Bühler D. Endowing spoken language dialogue systems with emotional intelligence. Affective dialogue systems, tutorial and research workshop, ADS 2004, Kloster Irsee, Germany, June 14-16, 2004, proceedings, p. 178–187; 2004. André E, Rehm M, Minker W, Bühler D. Endowing spoken language dialogue systems with emotional intelligence. Affective dialogue systems, tutorial and research workshop, ADS 2004, Kloster Irsee, Germany, June 14-16, 2004, proceedings, p. 178–187; 2004.
4.
Zurück zum Zitat Bahdanau D, Cho K, Bengio Y. 2014. Neural machine translation by jointly learning to align and translate. Computer Science. Bahdanau D, Cho K, Bengio Y. 2014. Neural machine translation by jointly learning to align and translate. Computer Science.
6.
Zurück zum Zitat Cassell J, Pelachaud C, Badler N, Steedman M, Achorn B, Becket T, Douville B, Prevost S, Stone M. Animated conversation: rule-based generation of facial expression, gesture & spoken intonation for multiple conversational agents. Conference on computer graphics and interactive techniques; 1994. p. 413–420. Cassell J, Pelachaud C, Badler N, Steedman M, Achorn B, Becket T, Douville B, Prevost S, Stone M. Animated conversation: rule-based generation of facial expression, gesture & spoken intonation for multiple conversational agents. Conference on computer graphics and interactive techniques; 1994. p. 413–420.
8.
Zurück zum Zitat Cho K, Merrienboer BV, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. 2014. Learning phrase representations using rnn encoder-decoder for statistical machine translation. Computer Science. Cho K, Merrienboer BV, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. 2014. Learning phrase representations using rnn encoder-decoder for statistical machine translation. Computer Science.
9.
Zurück zum Zitat Frederking RE. A rule-based conversation participant. Meeting of the association for computational linguistics; 2007. Frederking RE. A rule-based conversation participant. Meeting of the association for computational linguistics; 2007.
10.
11.
Zurück zum Zitat Graves A, Ndez S, Schmidhuber J. Rgen bidirectional LSTM networks for improved phoneme classification and recognition. Artificial neural networks: formal MODELS and their applications—ICANN 2005, International conference, warsaw, poland, September 11-15, 2005, proceedings, p. 799–804; 2005. Graves A, Ndez S, Schmidhuber J. Rgen bidirectional LSTM networks for improved phoneme classification and recognition. Artificial neural networks: formal MODELS and their applications—ICANN 2005, International conference, warsaw, poland, September 11-15, 2005, proceedings, p. 799–804; 2005.
13.
Zurück zum Zitat Hochreiter S, Schmidhuber J. 1996. LSTM can solve hard long time lag problems. Advances in neural information processing systems p. 473–479. Hochreiter S, Schmidhuber J. 1996. LSTM can solve hard long time lag problems. Advances in neural information processing systems p. 473–479.
14.
Zurück zum Zitat Ji Z, Lu Z, Li H. 2014. An information retrieval approach to short text conversation. Computer Science. Ji Z, Lu Z, Li H. 2014. An information retrieval approach to short text conversation. Computer Science.
15.
Zurück zum Zitat Kadish D, Kummer N, Dulic A, Najjaran H. The empathy machine. International conference on entertainment computing; 2012. p. 461–464. Kadish D, Kummer N, Dulic A, Najjaran H. The empathy machine. International conference on entertainment computing; 2012. p. 461–464.
17.
Zurück zum Zitat Li J, Galley M, Brockett C, Gao J, Dolan B. 2015. A diversity-promoting objective function for neural conversation models. Computer Science. Li J, Galley M, Brockett C, Gao J, Dolan B. 2015. A diversity-promoting objective function for neural conversation models. Computer Science.
18.
Zurück zum Zitat Mayer JD, Salovey P. 1997. What is emotional intelligence?. Mayer JD, Salovey P. 1997. What is emotional intelligence?.
19.
Zurück zum Zitat Mikolov T, Chen K, Corrado G. 2014. word2vec. Mikolov T, Chen K, Corrado G. 2014. word2vec.
21.
Zurück zum Zitat PASCHKE A, BOLEY H. Rule responder: rule-based agents for the semantic-pragmatic web. Int J Artif Intell Tools 2012;20(06):1043–1081.CrossRef PASCHKE A, BOLEY H. Rule responder: rule-based agents for the semantic-pragmatic web. Int J Artif Intell Tools 2012;20(06):1043–1081.CrossRef
22.
Zurück zum Zitat Pascual B, Gurruchaga M, Ginebra MP, Gil FJ, Planell JA, Goñi I. A neural network approach to context-sensitive generation of conversational responses. Trans R Soc Trop Med Hyg 2015;51(6):502–504. Pascual B, Gurruchaga M, Ginebra MP, Gil FJ, Planell JA, Goñi I. A neural network approach to context-sensitive generation of conversational responses. Trans R Soc Trop Med Hyg 2015;51(6):502–504.
24.
Zurück zum Zitat Pham NV, Terada T, Tsukamoto M, Nishio S. An information retrieval system for supporting casual conversation in wearable computing environments. International workshop on smart appliances and wearable computing; 2005. p. 477–483. Pham NV, Terada T, Tsukamoto M, Nishio S. An information retrieval system for supporting casual conversation in wearable computing environments. International workshop on smart appliances and wearable computing; 2005. p. 477–483.
25.
Zurück zum Zitat Picard RW. Affective computing. Igi Global 1997;1(1):71–73. Picard RW. Affective computing. Igi Global 1997;1(1):71–73.
27.
Zurück zum Zitat Polzin T, Waibel A. 2000. Emotion - sensitive human - computer interfaces. The Isca Workshop on Speech & Emotion. Polzin T, Waibel A. 2000. Emotion - sensitive human - computer interfaces. The Isca Workshop on Speech & Emotion.
28.
Zurück zum Zitat Prendinger H, Mori J, Ishizuka M. Using human physiology to evaluate subtle expressivity of a virtual quizmaster in a mathematical game. Int J Hum Comput Stud 2005;62(2):231–245.CrossRef Prendinger H, Mori J, Ishizuka M. Using human physiology to evaluate subtle expressivity of a virtual quizmaster in a mathematical game. Int J Hum Comput Stud 2005;62(2):231–245.CrossRef
29.
Zurück zum Zitat Ptaszynski M, Dybala P, Shi W, Rzepka R, Araki K. Towards context aware emotional intelligence in machines: computing contextual appropriateness of affective states. IJCAI 2009, Proceedings of the international joint conference on artificial intelligence, Pasadena, California, USA; 2009. p. 1469–1474. Ptaszynski M, Dybala P, Shi W, Rzepka R, Araki K. Towards context aware emotional intelligence in machines: computing contextual appropriateness of affective states. IJCAI 2009, Proceedings of the international joint conference on artificial intelligence, Pasadena, California, USA; 2009. p. 1469–1474.
30.
Zurück zum Zitat R L, N P, IV S, L C, J P. 2015b. Incorporating unstructured textual knowledge sources into neural dialogue systems. R L, N P, IV S, L C, J P. 2015b. Incorporating unstructured textual knowledge sources into neural dialogue systems.
31.
Zurück zum Zitat Salovey P, Mayer JD. Emotional intelligence. Imagin Cogn Personality 1990;9(6):217–236. Salovey P, Mayer JD. Emotional intelligence. Imagin Cogn Personality 1990;9(6):217–236.
32.
Zurück zum Zitat Shang L, Lu Z, Li H. 2015. Neural responding machine for short-text conversation. Computer Science. Shang L, Lu Z, Li H. 2015. Neural responding machine for short-text conversation. Computer Science.
33.
Zurück zum Zitat Skowron M. Affect listeners: acquisition of affective states by means of conversational systems. Development of multimodal interfaces: active listening and synchrony, second COST 2102 international training school, Dublin, Ireland, March 23-27, 2009, revised selected papers; 2010. p. 169–181. Skowron M. Affect listeners: acquisition of affective states by means of conversational systems. Development of multimodal interfaces: active listening and synchrony, second COST 2102 international training school, Dublin, Ireland, March 23-27, 2009, revised selected papers; 2010. p. 169–181.
35.
Zurück zum Zitat Sundermeyer M, Schlüter R, Ney H. LSTM neural networks for language modeling. Interspeech; 2012. p. 601–608. Sundermeyer M, Schlüter R, Ney H. LSTM neural networks for language modeling. Interspeech; 2012. p. 601–608.
36.
Zurück zum Zitat Sutskever I, Vinyals O, Le Q. Sequence to sequence learning with neural networks. Adv Neural Inf Proces Syst 2014;4:3104– 3112. Sutskever I, Vinyals O, Le Q. Sequence to sequence learning with neural networks. Adv Neural Inf Proces Syst 2014;4:3104– 3112.
37.
Zurück zum Zitat Takahashi T, Mera K, Kurosawa Y, Takezawa T. Natural language dialog system considering speaker’s emotion for open-ended conversation. J Acoust Soc Am 2016;140(4):3400–3400.CrossRef Takahashi T, Mera K, Kurosawa Y, Takezawa T. Natural language dialog system considering speaker’s emotion for open-ended conversation. J Acoust Soc Am 2016;140(4):3400–3400.CrossRef
38.
Zurück zum Zitat Treur J. On human aspects in ambient intelligence. Berlin: Springer; 2008.CrossRef Treur J. On human aspects in ambient intelligence. Berlin: Springer; 2008.CrossRef
39.
Zurück zum Zitat Vinyals O, Le Q. 2015. A neural conversational model. Computer Science. Vinyals O, Le Q. 2015. A neural conversational model. Computer Science.
40.
Zurück zum Zitat Wen TH, Vandyke D, Mrksic N, Gasic M, Rojasbarahona LM, Su PH, Ultes S, Young S. 2016. A network-based end-to-end trainable task-oriented dialogue system. Wen TH, Vandyke D, Mrksic N, Gasic M, Rojasbarahona LM, Su PH, Ultes S, Young S. 2016. A network-based end-to-end trainable task-oriented dialogue system.
41.
Zurück zum Zitat Xing C, Wu W, Wu Y, Liu J, Huang Y, Zhou M, Ma WY. 2016. Topic aware neural response generation. Xing C, Wu W, Wu Y, Liu J, Huang Y, Zhou M, Ma WY. 2016. Topic aware neural response generation.
42.
Zurück zum Zitat Xiong K, Cui A, Zhang Z, Li M. 2016. Neural contextual conversation learning with labeled question-answering pairs. Xiong K, Cui A, Zhang Z, Li M. 2016. Neural contextual conversation learning with labeled question-answering pairs.
43.
Zurück zum Zitat Yan R, Song Y, Wu H. Learning to respond with deep neural networks for retrieval-based human-computer conversation system. International ACM SIGIR conference on research and development in information retrieval; 2016. p. 55–64. Yan R, Song Y, Wu H. Learning to respond with deep neural networks for retrieval-based human-computer conversation system. International ACM SIGIR conference on research and development in information retrieval; 2016. p. 55–64.
44.
Zurück zum Zitat Zhou H, Huang M, Zhang T, Zhu X, Liu B. 2017. Emotional chatting machine: emotional conversation generation with internal and external memory. Zhou H, Huang M, Zhang T, Zhu X, Liu B. 2017. Emotional chatting machine: emotional conversation generation with internal and external memory.
Metadaten
Titel
Emotional Human-Machine Conversation Generation Based on Long Short-Term Memory
verfasst von
Xiao Sun
Xiaoqi Peng
Shuai Ding
Publikationsdatum
26.12.2017
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 3/2018
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-017-9539-4

Weitere Artikel der Ausgabe 3/2018

Cognitive Computation 3/2018 Zur Ausgabe