Skip to main content
Top
Published in: World Wide Web 3/2020

07-06-2019

End-to-End latent-variable task-oriented dialogue system with exact log-likelihood optimization

Authors: Haotian Xu, Haiyun Peng, Haoran Xie, Erik Cambria, Liuyang Zhou, Weiguo Zheng

Published in: World Wide Web | Issue 3/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We propose an end-to-end dialogue model based on a hierarchical encoder-decoder, which employed a discrete latent variable to learn underlying dialogue intentions. The system is able to model the structure of utterances dominated by statistics of the language and the dependencies among utterances in dialogues without manual dialogue state design. We argue that the latent discrete variable interprets the intentions that guide machine responses generation. We also propose a model which can be refined autonomously with reinforcement learning, due to that intention selection at each dialogue turn can be formulated as a sequential decision-making process. Our experiments show that exact MLE optimized model is much more robust than neural variational inference on dialogue success rate with limited BLEU sacrifice.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
Similar to LIDM, all dialogues are pre-processed by delexicalization [9]. Based on the ontology, slot-value specific words are substituted with their corresponding generic tokens.
 
2
We concatenate hidden states of bidirectional encoder RNNs of the last step to initialize hidden states of decoder RNNs
 
3
Except for training with REINFORCE models
 
Literature
1.
go back to reference Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. Computer Science (2014) Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. Computer Science (2014)
3.
go back to reference Cuayáhuitl, H., Yu, S., Williamson, A., Carse, J.: Deep reinforcement learning for multi-domain dialogue systems. arXiv:1611.08675 (2016) Cuayáhuitl, H., Yu, S., Williamson, A., Carse, J.: Deep reinforcement learning for multi-domain dialogue systems. arXiv:1611.​08675 (2016)
4.
go back to reference Das, R., Dhuliawala, S., Zaheer, M., Vilnis, L., Durugkar, I., Krishnamurthy, A., Smola, A., McCallum, A.: Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning. arXiv:1711.05851 (2017) Das, R., Dhuliawala, S., Zaheer, M., Vilnis, L., Durugkar, I., Krishnamurthy, A., Smola, A., McCallum, A.: Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning. arXiv:1711.​05851 (2017)
5.
go back to reference Dhingra, B., Li, L., Li, X., Gao, J., Chen, Y.N., Ahmed, F., Deng, L.: Towards end-to-end reinforcement learning of dialogue agents for information access. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 484–495 (2017) Dhingra, B., Li, L., Li, X., Gao, J., Chen, Y.N., Ahmed, F., Deng, L.: Towards end-to-end reinforcement learning of dialogue agents for information access. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 484–495 (2017)
6.
go back to reference Eric, M., Manning, C.D.: A copy-augmented sequence-to-sequence architecture gives good performance on task-oriented dialogue. arXiv:1701.04024(2017) Eric, M., Manning, C.D.: A copy-augmented sequence-to-sequence architecture gives good performance on task-oriented dialogue. arXiv:1701.​04024(2017)
7.
go back to reference Gašić, M., Breslin, C., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P., Young, S.: On-Line policy optimisation of bayesian spoken dialogue systems via human interaction. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8367–8371. IEEE (2013) Gašić, M., Breslin, C., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P., Young, S.: On-Line policy optimisation of bayesian spoken dialogue systems via human interaction. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8367–8371. IEEE (2013)
8.
go back to reference Gruber, A., Weiss, Y., Rosen-Zvi, M.: Hidden topic markov models. In: Artificial Intelligence and Statistics, pp. 163–170 (2007) Gruber, A., Weiss, Y., Rosen-Zvi, M.: Hidden topic markov models. In: Artificial Intelligence and Statistics, pp. 163–170 (2007)
9.
go back to reference Henderson, M., Thomson, B., Young, S.: Word-based dialog state tracking with recurrent neural networks. In: Meeting of the Special Interest Group on Discourse and Dialogue, pp. 292–299 (2014) Henderson, M., Thomson, B., Young, S.: Word-based dialog state tracking with recurrent neural networks. In: Meeting of the Special Interest Group on Discourse and Dialogue, pp. 292–299 (2014)
10.
go back to reference Higgins, I., Matthey, L., Pal, A., Burgess, C., Glorot, X., Botvinick, M., Mohamed, S., Lerchner, A.: beta-vae: Learning basic visual concepts with a constrained variational framework. In: Proceedings of International Conference on Learning Representations (ICLR) (2017) Higgins, I., Matthey, L., Pal, A., Burgess, C., Glorot, X., Botvinick, M., Mohamed, S., Lerchner, A.: beta-vae: Learning basic visual concepts with a constrained variational framework. In: Proceedings of International Conference on Learning Representations (ICLR) (2017)
11.
go back to reference Kingma, D., Ba, J.: Adam: a Method for Stochastic Optimization. In: The International Conference on Learning Representations (ICLR) (2015) Kingma, D., Ba, J.: Adam: a Method for Stochastic Optimization. In: The International Conference on Learning Representations (ICLR) (2015)
12.
go back to reference Liu, B., Tur, G., Hakkani-Tur, D., Shah, P., Heck, L.: End-to-end optimization of task-oriented dialogue model with deep reinforcement learning. arXiv:1711.10712 (2017) Liu, B., Tur, G., Hakkani-Tur, D., Shah, P., Heck, L.: End-to-end optimization of task-oriented dialogue model with deep reinforcement learning. arXiv:1711.​10712 (2017)
13.
go back to reference Madotto, A., Wu, C.S., Fung, P.: Mem2seq: Effectively incorporating knowledge bases into end-to-end task-oriented dialog systems. arXiv:1804.08217(2018) Madotto, A., Wu, C.S., Fung, P.: Mem2seq: Effectively incorporating knowledge bases into end-to-end task-oriented dialog systems. arXiv:1804.​08217(2018)
14.
go back to reference Mnih, A., Gregor, K.: Neural variational inference and learning in belief networks. In: Proceedings of the 34th International Conference on Machine Learning (ICML). 1402.0030 (2014) Mnih, A., Gregor, K.: Neural variational inference and learning in belief networks. In: Proceedings of the 34th International Conference on Machine Learning (ICML). 1402.​0030 (2014)
15.
go back to reference Mrkšić, N., Séaghdha, D.Ó., Thomson, B., Gasic, M., Su, P.H., Vandyke, D., Wen, T.H., Young, S.: Multi-domain dialog state tracking using recurrent neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (vol. 2: Short Papers), vol. 2, pp. 794–799 (2015) Mrkšić, N., Séaghdha, D.Ó., Thomson, B., Gasic, M., Su, P.H., Vandyke, D., Wen, T.H., Young, S.: Multi-domain dialog state tracking using recurrent neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (vol. 2: Short Papers), vol. 2, pp. 794–799 (2015)
16.
go back to reference Mrksic, N., Séaghdha, D. Ó., Wen, T., Thomson, B., Young, S.J.: Neural belief tracker: Data-driven dialogue state tracking. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pp. 1777–1788 (2017) Mrksic, N., Séaghdha, D. Ó., Wen, T., Thomson, B., Young, S.J.: Neural belief tracker: Data-driven dialogue state tracking. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pp. 1777–1788 (2017)
17.
go back to reference Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Meeting on Association for Computational Linguistics, pp. 311–318 (2002) Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Meeting on Association for Computational Linguistics, pp. 311–318 (2002)
18.
go back to reference Rojas-Barahona, L.M., Gasic, M., Mrksic, N., Su, P., Ultes, S., Wen, T., Young, S.J., Vandyke, D.: A network-based end-to-end trainable task-oriented dialogue system. in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017, Valencia, Spain, April 3-7, 2017, vol. 1: Long Papers, pp. 438–449 (2017) Rojas-Barahona, L.M., Gasic, M., Mrksic, N., Su, P., Ultes, S., Wen, T., Young, S.J., Vandyke, D.: A network-based end-to-end trainable task-oriented dialogue system. in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017, Valencia, Spain, April 3-7, 2017, vol. 1: Long Papers, pp. 438–449 (2017)
19.
go back to reference Serban, I.V., Sordoni, A., Bengio, Y., Courville, A.C., Pineau, J.: Building End-To-End dialogue systems using generative hierarchical neural network models. In: AAAI, pp. 3776–3784 (2016) Serban, I.V., Sordoni, A., Bengio, Y., Courville, A.C., Pineau, J.: Building End-To-End dialogue systems using generative hierarchical neural network models. In: AAAI, pp. 3776–3784 (2016)
20.
go back to reference Serban, I.V., Sordoni, A., Lowe, R., Charlin, L., Pineau, J., Courville, A.C., Bengio, Y.: A hierarchical latent variable encoder-decoder model for generating dialogues. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI), pp. 3295–3301 (2017) Serban, I.V., Sordoni, A., Lowe, R., Charlin, L., Pineau, J., Courville, A.C., Bengio, Y.: A hierarchical latent variable encoder-decoder model for generating dialogues. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI), pp. 3295–3301 (2017)
21.
go back to reference Sordoni, A., Bengio, Y., Vahabi, H., Lioma, C., Grue Simonsen, J., Nie, J.Y.: A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 553–562. ACM (2015) Sordoni, A., Bengio, Y., Vahabi, H., Lioma, C., Grue Simonsen, J., Nie, J.Y.: A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 553–562. ACM (2015)
22.
go back to reference Su, P.H., Vandyke, D., Gasic, M., Kim, D., Mrksic, N., Wen, T.H., Young, S.: Learning from real users: Rating dialogue success with neural networks for reinforcement learning in spoken dialogue systems. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH) (2015) Su, P.H., Vandyke, D., Gasic, M., Kim, D., Mrksic, N., Wen, T.H., Young, S.: Learning from real users: Rating dialogue success with neural networks for reinforcement learning in spoken dialogue systems. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH) (2015)
23.
go back to reference Wen, T., Gasic, M., Mrksic, N., Su, P., Vandyke, D., Young, S.J.: Semantically conditioned lstm-based natural language generation for spoken dialogue systems. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1711–1721. 1508.01745 (2015) Wen, T., Gasic, M., Mrksic, N., Su, P., Vandyke, D., Young, S.J.: Semantically conditioned lstm-based natural language generation for spoken dialogue systems. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1711–1721. 1508.​01745 (2015)
24.
go back to reference Wen, T.H., Miao, Y., Blunsom, P., Young, S.: Latent intention dialogue models. In: Precup, D., Teh, Y.W. (eds.) Proceedings of the 34th International Conference on Machine Learning, Proceedings of Machine Learning Research, vol 70, pp. 3732–3741. PMLR, International Convention Centre, Sydney, Australia. http://proceedings.mlr.press/v70/wen17a.html (2017) Wen, T.H., Miao, Y., Blunsom, P., Young, S.: Latent intention dialogue models. In: Precup, D., Teh, Y.W. (eds.) Proceedings of the 34th International Conference on Machine Learning, Proceedings of Machine Learning Research, vol 70, pp. 3732–3741. PMLR, International Convention Centre, Sydney, Australia. http://​proceedings.​mlr.​press/​v70/​wen17a.​html (2017)
25.
go back to reference Williams, J., Raux, A., Henderson, M.: The dialog state tracking challenge series: a review. Dialogue & Discourse 7(3), 4–33 (2016) Williams, J., Raux, A., Henderson, M.: The dialog state tracking challenge series: a review. Dialogue & Discourse 7(3), 4–33 (2016)
26.
go back to reference Williams, J.D., Asadi, K., Zweig, G.: Hybrid code networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning. arXiv:1702.03274 (2017) Williams, J.D., Asadi, K., Zweig, G.: Hybrid code networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning. arXiv:1702.​03274 (2017)
27.
go back to reference Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 8(3-4), 229–256 (1992)CrossRef Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 8(3-4), 229–256 (1992)CrossRef
28.
go back to reference Wu, C.S., Madotto, A., Winata, G.I., Fung, P.: End-To-End dynamic query memory network for entity-value independent task-oriented dialog. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6154–6158. IEEE (2018) Wu, C.S., Madotto, A., Winata, G.I., Fung, P.: End-To-End dynamic query memory network for entity-value independent task-oriented dialog. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6154–6158. IEEE (2018)
29.
go back to reference Young, S., Gašić, M., Thomson, B., Williams, J.D.: Pomdp-based statistical spoken dialog systems: A review. Proc. IEEE 101(5), 1160–1179 (2013)CrossRef Young, S., Gašić, M., Thomson, B., Williams, J.D.: Pomdp-based statistical spoken dialog systems: A review. Proc. IEEE 101(5), 1160–1179 (2013)CrossRef
30.
go back to reference Young, T., Cambria, E., Chaturvedi, I., Zhou, H., Biswas, S., Huang, M.: Augmenting End-To-End dialogue systems with commonsense knowledge. In: AAAI, pp. 4970–4977 (2018) Young, T., Cambria, E., Chaturvedi, I., Zhou, H., Biswas, S., Huang, M.: Augmenting End-To-End dialogue systems with commonsense knowledge. In: AAAI, pp. 4970–4977 (2018)
31.
go back to reference Zhang, Y., Dai, H., Kozareva, Z., Smola, A.J., Song, L.: Variational reasoning for question answering with knowledge graph. arXiv:1709.04071(2017) Zhang, Y., Dai, H., Kozareva, Z., Smola, A.J., Song, L.: Variational reasoning for question answering with knowledge graph. arXiv:1709.​04071(2017)
32.
go back to reference Zhao, T., Lu, A., Lee, K., Eskénazi, M.: Generative encoder-decoder models for task-oriented spoken dialog systems with chatting capability. In: 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL). 1706.08476 (2017) Zhao, T., Lu, A., Lee, K., Eskénazi, M.: Generative encoder-decoder models for task-oriented spoken dialog systems with chatting capability. In: 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL). 1706.​08476 (2017)
33.
go back to reference Zhao, T., Xie, K., Eskenazi, M.: Rethinking action spaces for reinforcement learning in end-to-end dialog agents with latent variable models. arXiv:1902.08858 (2019) Zhao, T., Xie, K., Eskenazi, M.: Rethinking action spaces for reinforcement learning in end-to-end dialog agents with latent variable models. arXiv:1902.​08858 (2019)
Metadata
Title
End-to-End latent-variable task-oriented dialogue system with exact log-likelihood optimization
Authors
Haotian Xu
Haiyun Peng
Haoran Xie
Erik Cambria
Liuyang Zhou
Weiguo Zheng
Publication date
07-06-2019
Publisher
Springer US
Published in
World Wide Web / Issue 3/2020
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-019-00688-8

Other articles of this Issue 3/2020

World Wide Web 3/2020 Go to the issue

Premium Partner