Skip to main content
Top

2019 | OriginalPaper | Chapter

Testing Chatbots Using Metamorphic Relations

Authors : Josip Bozic, Franz Wotawa

Published in: Testing Software and Systems

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Modern-day demands for services often require an availability on a 24/7 basis as well as online accessibility around the globe. For this sake, personalized software systems, called chatbots, are applied. Chatbots offer services, goods or information in natural language. These programs respond to the user in real-time and offer an intuitive and simple interface to interact with. Advantages like these makes them increasingly popular. Chatbots can even act as substitutes for humans for specific purposes. Since the chatbot market is growing, chatbots might outperform and replace classical web applications in the future. For this reason, ensuring correct functionality of chatbots is of high and increasing importance. However, since different implementations and user behavior result in unpredictable results, the chatbot’s output is difficult to predict and classify as well. In fact, testing of chatbots represents a challenge because of the unavailability of a test oracle. In this paper, we introduce a metamorphic testing approach for chatbots. In general, metamorphic testing can be applied to situations where no expected values are available. In addition, we discuss how to obtain test cases for chatbots, i.e. sequences of interactions with a chatbot, in an according manner. We demonstrate our approach using a hotel booking system and discuss first experimental results.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
4.
go back to reference Bozic, J., Tazl, O.A., Wotawa, F.: Chatbot testing using AI planning. In: Proceedings of the International Conference on Artificial Intelligence Testing (AITest) (2019) Bozic, J., Tazl, O.A., Wotawa, F.: Chatbot testing using AI planning. In: Proceedings of the International Conference on Artificial Intelligence Testing (AITest) (2019)
5.
go back to reference Brandtzæg, P.B., Følstad, A.: Why people use chatbots. In: Proceedings of the 4th International Conference on Internet Science (INSCI 2017) (2017) Brandtzæg, P.B., Følstad, A.: Why people use chatbots. In: Proceedings of the 4th International Conference on Internet Science (INSCI 2017) (2017)
6.
go back to reference Chen, J., Wang, Y., Guo, Y., Jiang, M.: A metamorphic testing approach for event sequences. PLoS ONE 14(2), e0212476 (2019)CrossRef Chen, J., Wang, Y., Guo, Y., Jiang, M.: A metamorphic testing approach for event sequences. PLoS ONE 14(2), e0212476 (2019)CrossRef
7.
go back to reference Chen, T.Y., Cheung, S.C., Yiu, S.M.: Metamorphic testing: a new approach for generating next test cases. Technical report HKUST-CS98-01, Department of Computer Science, Hong Kong University of Science and Technology, Hong Kong (1998) Chen, T.Y., Cheung, S.C., Yiu, S.M.: Metamorphic testing: a new approach for generating next test cases. Technical report HKUST-CS98-01, Department of Computer Science, Hong Kong University of Science and Technology, Hong Kong (1998)
8.
go back to reference Chen, T.Y., et al.: Metamorphic testing: a review of challenges and opportunities. ACM Comput. Surv. (CSUR) 51(1), 4 (2018)CrossRef Chen, T.Y., et al.: Metamorphic testing: a review of challenges and opportunities. ACM Comput. Surv. (CSUR) 51(1), 4 (2018)CrossRef
9.
go back to reference Chen, T.Y., et al.: Metamorphic testing for cybersecurity. Computer 49(6), 48–55 (2016)CrossRef Chen, T.Y., et al.: Metamorphic testing for cybersecurity. Computer 49(6), 48–55 (2016)CrossRef
10.
go back to reference Dwarakanath, A., et al.: Identifying implementation bugs in machine learning based image classifiers using metamorphic testing. In: Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2018) (2018) Dwarakanath, A., et al.: Identifying implementation bugs in machine learning based image classifiers using metamorphic testing. In: Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2018) (2018)
11.
go back to reference Guichard, J., Ruane, E., Smith, R., Bean, D., Ventresque, A.: Assessing the robustness of conversational agents using paraphrases. IEEE, University College Dublin (2019) Guichard, J., Ruane, E., Smith, R., Bean, D., Ventresque, A.: Assessing the robustness of conversational agents using paraphrases. IEEE, University College Dublin (2019)
12.
go back to reference Lindvall, M., Porter, A., Magnusson, G., Schulze, C.: Metamorphic model-based testing of autonomous systems. In: Proceedings of the 2nd International Workshop on Metamorphic Testing (MET 2017) (2017) Lindvall, M., Porter, A., Magnusson, G., Schulze, C.: Metamorphic model-based testing of autonomous systems. In: Proceedings of the 2nd International Workshop on Metamorphic Testing (MET 2017) (2017)
13.
go back to reference Mauldin, M.L.: ChatterBots, TinyMuds and the turing test: entering the loebner prize competition. In: AAAI 1994 Proceedings of the Twelfth National Conference on Artificial Intelligence, vol. 1, pp. 16–21 (1994) Mauldin, M.L.: ChatterBots, TinyMuds and the turing test: entering the loebner prize competition. In: AAAI 1994 Proceedings of the Twelfth National Conference on Artificial Intelligence, vol. 1, pp. 16–21 (1994)
14.
go back to reference Ruane, E., Faure, T., Smith, R., Bean, D., Carson-Berndsen, J., Ventresque, A.: BoTest: a framework to test the quality of conversational agents using divergent input examples. In: Proceedings of the 23rd International Conference on Intelligent User Interfaces Companion (IUI 2018 Companion) (2018) Ruane, E., Faure, T., Smith, R., Bean, D., Carson-Berndsen, J., Ventresque, A.: BoTest: a framework to test the quality of conversational agents using divergent input examples. In: Proceedings of the 23rd International Conference on Intelligent User Interfaces Companion (IUI 2018 Companion) (2018)
15.
go back to reference Saha, P., Kanewala, U.: Fault detection effectiveness of metamorphic relations developed for testing supervised classifiers. In: Proceedings of the International Conference on Artificial Intelligence Testing (AITest) (2019) Saha, P., Kanewala, U.: Fault detection effectiveness of metamorphic relations developed for testing supervised classifiers. In: Proceedings of the International Conference on Artificial Intelligence Testing (AITest) (2019)
16.
go back to reference Segura, S., Durán, A., Sánchez, A.B., Le Berre, D., Lonca, E., Ruiz-Cortés, A.: Automated metamorphic testing of variability analysis tools. Softw. Test. Verif. Reliab. 25(2), 138–163 (2015)CrossRef Segura, S., Durán, A., Sánchez, A.B., Le Berre, D., Lonca, E., Ruiz-Cortés, A.: Automated metamorphic testing of variability analysis tools. Softw. Test. Verif. Reliab. 25(2), 138–163 (2015)CrossRef
17.
go back to reference Segura, S., Fraser, G., Sánchez, A.B., Ruiz-Cortés, A.: A survey on metamorphic testing. IEEE Trans. Softw. Eng. 42(9), 805–824 (2016)CrossRef Segura, S., Fraser, G., Sánchez, A.B., Ruiz-Cortés, A.: A survey on metamorphic testing. IEEE Trans. Softw. Eng. 42(9), 805–824 (2016)CrossRef
18.
go back to reference Segura, S., Hierons, R.M., Benavides, D., Ruiz-Cortés, A.: Automated test data generation on the analyses of feature models: a metamorphic testing approach. In: Proceedings of the 2010 Third International Conference on Software Testing, Verification and Validation (2010) Segura, S., Hierons, R.M., Benavides, D., Ruiz-Cortés, A.: Automated test data generation on the analyses of feature models: a metamorphic testing approach. In: Proceedings of the 2010 Third International Conference on Software Testing, Verification and Validation (2010)
19.
go back to reference Segura, S., Parejo, J.A., Troya, J., Ruiz-Cortés, A.: Metamorphic testing of RESTful web APIs. IEEE Trans. Softw. Eng. 44(11), 1083–1099 (2018)CrossRef Segura, S., Parejo, J.A., Troya, J., Ruiz-Cortés, A.: Metamorphic testing of RESTful web APIs. IEEE Trans. Softw. Eng. 44(11), 1083–1099 (2018)CrossRef
20.
go back to reference Shawar, B.A., Atwell, E.: Using corpora in machine-learning chatbot systems. Int. J. Corpus Linguist. 10, 489–516 (2005)CrossRef Shawar, B.A., Atwell, E.: Using corpora in machine-learning chatbot systems. Int. J. Corpus Linguist. 10, 489–516 (2005)CrossRef
21.
go back to reference Tian, Y., Pei, K., Jana, S., Ray, B.: DeepTest: automated testing of deep-neural-network-driven autonomous cars. In: Proceedings of the 40th International Conference on Software Engineering (2018) Tian, Y., Pei, K., Jana, S., Ray, B.: DeepTest: automated testing of deep-neural-network-driven autonomous cars. In: Proceedings of the 40th International Conference on Software Engineering (2018)
22.
go back to reference Vasconcelos, M., Candello, H., Pinhanez, C., dos Santos, T.: Bottester: testing conversational systems with simulated users. In: Proceedings of the XVI Brazilian Symposium on Human Factors in Computing Systems (IHC 2017) (2017) Vasconcelos, M., Candello, H., Pinhanez, C., dos Santos, T.: Bottester: testing conversational systems with simulated users. In: Proceedings of the XVI Brazilian Symposium on Human Factors in Computing Systems (IHC 2017) (2017)
23.
go back to reference Walker, M.A., Litman, D.J., Kamm, C.A., Abella, A.: PARADISE: a framework for evaluating spoken dialogue agents. In: Proceedings of the 35th Annual General Meeting of the Association for Computational Linguistics, ACL/EACL 1997 (1997) Walker, M.A., Litman, D.J., Kamm, C.A., Abella, A.: PARADISE: a framework for evaluating spoken dialogue agents. In: Proceedings of the 35th Annual General Meeting of the Association for Computational Linguistics, ACL/EACL 1997 (1997)
24.
go back to reference Wallace, R.S.: The elements of AIML style. In: ALICE A.I. Foundation (2003) Wallace, R.S.: The elements of AIML style. In: ALICE A.I. Foundation (2003)
25.
go back to reference Weyuker, E.: On testing non-testable programs. Comput. J. 25(4), 465–470 (1982)CrossRef Weyuker, E.: On testing non-testable programs. Comput. J. 25(4), 465–470 (1982)CrossRef
26.
go back to reference Xie, X., Ho, J.W.K., Murphy, C., Kaiser, G., Xu, B., Chen, T.Y.: Testing and validating machine learning classifiers by metamorphic testing. J. Syst. Softw. 84(4), 544–558 (2011)CrossRef Xie, X., Ho, J.W.K., Murphy, C., Kaiser, G., Xu, B., Chen, T.Y.: Testing and validating machine learning classifiers by metamorphic testing. J. Syst. Softw. 84(4), 544–558 (2011)CrossRef
27.
go back to reference Yan, B., Yecies, B., Zhou, Z.Q.: Metamorphic relations for data validation: a case study of translated text messages. In: Proceedings of the 4th International Workshop on Metamorphic Testing (MET 2019) (2019) Yan, B., Yecies, B., Zhou, Z.Q.: Metamorphic relations for data validation: a case study of translated text messages. In: Proceedings of the 4th International Workshop on Metamorphic Testing (MET 2019) (2019)
28.
go back to reference Zhang, M., Zhang, Y., Zhang, L., Liu, C., Khurshid, S.: DeepRoad: GAN-based metamorphic testing and input validation framework for autonomous driving systems. In: Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering (ASE 2018) (2018) Zhang, M., Zhang, Y., Zhang, L., Liu, C., Khurshid, S.: DeepRoad: GAN-based metamorphic testing and input validation framework for autonomous driving systems. In: Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering (ASE 2018) (2018)
29.
go back to reference Zhou, Z.Q., Sun, L.: Metamorphic testing of driverless cars. Commun. ACM 62(3), 61–67 (2019)CrossRef Zhou, Z.Q., Sun, L.: Metamorphic testing of driverless cars. Commun. ACM 62(3), 61–67 (2019)CrossRef
Metadata
Title
Testing Chatbots Using Metamorphic Relations
Authors
Josip Bozic
Franz Wotawa
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-31280-0_3

Premium Partner