skip to main content
10.1145/3278721.3278736acmconferencesArticle/Chapter ViewAbstractPublication PagesaiesConference Proceedingsconference-collections
research-article
Public Access

Rationalization: A Neural Machine Translation Approach to Generating Natural Language Explanations

Published:27 December 2018Publication History

ABSTRACT

We introduce \em AI rationalization, an approach for generating explanations of autonomous system behavior as if a human had performed the behavior. We describe a rationalization technique that uses neural machine translation to translate internal state-action representations of an autonomous agent into natural language. We evaluate our technique in the Frogger game environment, training an autonomous game playing agent to rationalize its action choices using natural language. A natural language training corpus is collected from human players thinking out loud as they play the game. We motivate the use of rationalization as an approach to explanation generation and show the results of two experiments evaluating the effectiveness of rationalization. Results of these evaluations show that neural machine translation is able to accurately generate rationalizations that describe agent behavior, and that rationalizations are more satisfying to humans than other alternative methods of explanation.

References

  1. Andreas, J.; Dragan, A. D.; and Klein, D. 2017. Translating neuralese. CoRR abs/1704.06960.Google ScholarGoogle Scholar
  2. Aronson, J. 1995. A pragmatic view of thematic analysis. The qualitative report 2(1):1--3.Google ScholarGoogle Scholar
  3. Core, M.; Lane, H. C.; van Lent, M.; Gomboc, D.; Solomon, S.; and Rosenberg, M. 2006. Building Explainable Artificial Intelligence Systems. In Proceedings of the 18th Innovative Applications of Artificial Intelligence Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Dorst, K., and Cross, N. 2001. Creativity in the design process: co-evolution of problem--solution. Design studies 22(5):425--437.Google ScholarGoogle Scholar
  5. Fonteyn, M. E.; Kuipers, B.; and Grobe, S. J. 1993. A description of think aloud method and protocol analysis. Qualitative Health Research 3(4):430--441.Google ScholarGoogle ScholarCross RefCross Ref
  6. Goldman, A. I., et al. 2012. Theory of mind. The Oxford handbook of philosophy of cognitive science 402--424.Google ScholarGoogle Scholar
  7. Hollander, M.; Wolfe, D. A.; and Chicken, E. 2013. Nonparametric statistical methods. John Wiley & Sons.Google ScholarGoogle Scholar
  8. Krause, J.; Perer, A.; and Ng, K. 2016. Interacting with predictions: Visual inspection of black-box machine learning models. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 5686--5697. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Laird, J., and van Lent, M. 2001. Human-level aiâ's killer application: Interactive computer games. AI Magazine 22(2):15--25.Google ScholarGoogle Scholar
  10. Litman, L.; Robinson, J.; and Abberbock, T. 2017. Turkprime. com: A versatile crowdsourcing data acquisition platform for the behavioral sciences. Behavior research methods 49(2):433--442.Google ScholarGoogle Scholar
  11. Luong, M.-T.; Pham, H.; and Manning, C. D. 2015. Effective approaches to attention-based neural machine translation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 1412--1421. Lisbon, Portugal: Association for Computational Linguistics.Google ScholarGoogle Scholar
  12. Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.; Veness, J.; Bellemare, M.; Graves, A.; Riedmiller, M.; Fidjeland, A.; Ostrovski, G.; Petersen, S.; Beattie, C.; Sadik, A.; Antonoglou, I.; King, H.; Kumaran, D.; Wierstra, D.; Legg, S.; and Hassabis, D. 2015. Human-level control through deep reinforcement learning. Nature 518(7540):529--533.Google ScholarGoogle ScholarCross RefCross Ref
  13. Papineni, K.; Roukos, S.; Ward, T.; and Zhu, W.-J. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, 311--318. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ribeiro, M. T.; Singh, S.; and Guestrin, C. 2016. Why should i trust you?: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135--1144. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Strauss, A., and Corbin, J. 1994. Grounded theory methodology. Handbook of qualitative research 17:273--85.Google ScholarGoogle Scholar
  16. van Lent, M.; ; Carpenter, P.; McAlinden, R.; and Brobst, P. 2005. Increasing replayability with deliberative and reactive planning. In 1st Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE), 135--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. van Lent, M.; Fisher, W.; and Mancuso, M. 2004. An explainable artificial intelligence system for small-unit tactical behavior. In Proceedings of the 16th conference on Innovative Applications of Artifical Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Watkins, C., and Dayan, P. 1992. Q-learning. Machine Learning 8(3--4):279-292. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Weston, J.; Bordes, A.; Chopra, S.; Rush, A. M.; van Merriënboer, B.; Joulin, A.; and Mikolov, T. 2015. Towards ai-complete question answering: A set of prerequisite toy tasks. arXiv preprint arXiv:1502.05698.Google ScholarGoogle Scholar
  20. Yosinski, J.; Clune, J.; Fuchs, T.; and Lipson, H. 2015. Understanding neural networks through deep visualization. In In ICMLWorkshop on Deep Learning. Citeseer.Google ScholarGoogle Scholar
  21. Zeiler, M. D., and Fergus, R. 2014. Visualizing and understanding convolutional networks. In European conference on computer vision, 818--833. Springer.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    AIES '18: Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society
    December 2018
    406 pages
    ISBN:9781450360128
    DOI:10.1145/3278721

    Copyright © 2018 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 27 December 2018

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    AIES '18 Paper Acceptance Rate61of162submissions,38%Overall Acceptance Rate61of162submissions,38%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader