Skip to main content
Top

2021 | OriginalPaper | Chapter

Alice: A General-Purpose Virtual Assistant Framework

Authors : Soon-Chang Poh, Yi-Fei Tan, Chee-Pun Ooi, Wooi-Haw Tan, Albert Quek, Chee-Yong Gan, Yew-Chun Lee, Zhun-Hau Yap, Chin-Leei Cham

Published in: Computational Science and Technology

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, a virtual assistant framework called Alice is presented. This virtual assistant is a combination of 3D avatar, face detection, face recognition and face expression recognition with a voice assistant that similar to Amazon’s Alexa. The 3D avatar (Alice) is a female character animated using Unity and the lip is animated to sync with the speech to make it looks like speaking. Besides that, the 3D avatar can display different facial expressions such as happy, sad and upset. Face detection and recognition makes the system aware of the human user’s identity. Whereas, face expression recognition enables the system to detect the facial expression of the human user. Whenever there is a question being asked, the system will use Speech-to-Text system to convert human speech to text and Natural Language Processing to interpret the intent behind the text. Based on the result of interpretation, the system decides which audio file to be used as response. Then, a realistic artificial voice is generated as response to the human user. The system can access database based on user’s identity to retrieve information about that user. This may create a personalized experience for the human user. This framework can be customized for other applications for different fields. For this Alice framework, two applications have been developed namely a question answering chatbot and a customer service agent.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Adam M, Wessel M, Benlian A (2020) AI-based chatbots in customer service and their effects on user compliance. Electron markets Adam M, Wessel M, Benlian A (2020) AI-based chatbots in customer service and their effects on user compliance. Electron markets
2.
go back to reference Luo X, Tong S, Fang Z, Qu Z (2019) Frontiers: machines versus humans: the impact of artificial intelligence chatbot disclosure on customer purchases. Mark Sci 38(6):937–947 Luo X, Tong S, Fang Z, Qu Z (2019) Frontiers: machines versus humans: the impact of artificial intelligence chatbot disclosure on customer purchases. Mark Sci 38(6):937–947
3.
go back to reference Herrera A, Yaguachi L, Piedra N (2019) Building conversational interface for customer support applied to open campus an open online course provider. In: 2019 IEEE 19th international conference on advanced learning technologies (ICALT), pp. 11–13 Herrera A, Yaguachi L, Piedra N (2019) Building conversational interface for customer support applied to open campus an open online course provider. In: 2019 IEEE 19th international conference on advanced learning technologies (ICALT), pp. 11–13
4.
go back to reference Patel NP, Parikh DR, Patel DA, Patel RR (2019) AI and web-based human-like interactive university chatbot (UNIBOT). In: 2019 3rd international conference on electronics, communication and aerospace technology (ICECA), pp. 148–150 Patel NP, Parikh DR, Patel DA, Patel RR (2019) AI and web-based human-like interactive university chatbot (UNIBOT). In: 2019 3rd international conference on electronics, communication and aerospace technology (ICECA), pp. 148–150
5.
go back to reference Wu EH, Lin C, Ou Y, Liu C, Wang W, Chao C (2020) Advantages and constraints of a hybrid model K-12 e-learning assistant chatbot. IEEE Acc 8:77788–77801 Wu EH, Lin C, Ou Y, Liu C, Wang W, Chao C (2020) Advantages and constraints of a hybrid model K-12 e-learning assistant chatbot. IEEE Acc 8:77788–77801
6.
go back to reference Hoy MB (2018) Alexa, siri, cortana, and more: an introduction to voice assistants. Med Ref Ser Q 37(1):81–88 Hoy MB (2018) Alexa, siri, cortana, and more: an introduction to voice assistants. Med Ref Ser Q 37(1):81–88
7.
go back to reference Këpuska V, Bohouta G (2018) Next-generation of virtual personal assistants (microsoft cortana, apple siri, amazon alexa and google home). In: 2018 IEEE 8th annual computing and communication workshop and conference (CCWC), pp. 99–103 Këpuska V, Bohouta G (2018) Next-generation of virtual personal assistants (microsoft cortana, apple siri, amazon alexa and google home). In: 2018 IEEE 8th annual computing and communication workshop and conference (CCWC), pp. 99–103
8.
go back to reference Gelbukh A (2005) Natural language processing. In: Fifth international conference on hybrid intelligent systems (HIS’05). Rio de Janeiro, Brazil Gelbukh A (2005) Natural language processing. In: Fifth international conference on hybrid intelligent systems (HIS’05). Rio de Janeiro, Brazil
9.
go back to reference Yu AW, Dohan D, Luong M, Zhao R, Chen K, Norouzi M, Le QV (2018) Qanet Combining local convolution with global self-attention for reading comprehension. In: Proc ICLR Yu AW, Dohan D, Luong M, Zhao R, Chen K, Norouzi M, Le QV (2018) Qanet Combining local convolution with global self-attention for reading comprehension. In: Proc ICLR
10.
go back to reference Hofmann S, Reinecke M (2009) Cognitive–behavioral therapy with adults. Cambridge University Press Hofmann S, Reinecke M (2009) Cognitive–behavioral therapy with adults. Cambridge University Press
11.
go back to reference Fitzpatrick KK, Darcy A, Vierhile M (2017) Delivering cognitive behavior therapy to young adults with symptoms of depression and anxiety using a fully automated conversational agent (Woebot): a randomized controlled trial. JMIR Mental Health 4(2) Fitzpatrick KK, Darcy A, Vierhile M (2017) Delivering cognitive behavior therapy to young adults with symptoms of depression and anxiety using a fully automated conversational agent (Woebot): a randomized controlled trial. JMIR Mental Health 4(2)
12.
go back to reference Wan Y, Chiu C, Liang K, Chang P (2019) Midoriko chatbot: LSTM-based emotional 3D avatar. In: 2019 IEEE 8th global conference on consumer electronics (GCCE). Osaka, Japan, pp. 937–940 Wan Y, Chiu C, Liang K, Chang P (2019) Midoriko chatbot: LSTM-based emotional 3D avatar. In: 2019 IEEE 8th global conference on consumer electronics (GCCE). Osaka, Japan, pp. 937–940
13.
go back to reference Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780 Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
14.
go back to reference Angga PA, Fachri WE, Elevanita A, Suryadi, Agushinta RD (2015) Design of chatbot with 3D avatar, voice interface, and facial expression. In: 2015 international conference on science in information technology (ICSITech), pp. 326–330 Angga PA, Fachri WE, Elevanita A, Suryadi, Agushinta RD (2015) Design of chatbot with 3D avatar, voice interface, and facial expression. In: 2015 international conference on science in information technology (ICSITech), pp. 326–330
15.
go back to reference Arsenijevic U, Jovic M (2019) Artificial intelligence marketing: chatbots. In: 2019 international conference on artificial intelligence: applications and innovations (IC-AIAI), pp. 19–193 Arsenijevic U, Jovic M (2019) Artificial intelligence marketing: chatbots. In: 2019 international conference on artificial intelligence: applications and innovations (IC-AIAI), pp. 19–193
18.
go back to reference King DE (2009) Dlib-ml: a machine learning toolkit. J Mach Learn Res 10:1755–1758 King DE (2009) Dlib-ml: a machine learning toolkit. J Mach Learn Res 10:1755–1758
19.
go back to reference Cao Q, Shen L, Xie W, Parkhi OM, Zisserman A (2018) VGGFace2: a dataset for recognising faces across pose and age. In: 2018 13th IEEE international conference on automatic face & gesture recognition (FG 2018). Xi’an, pp. 67–74 Cao Q, Shen L, Xie W, Parkhi OM, Zisserman A (2018) VGGFace2: a dataset for recognising faces across pose and age. In: 2018 13th IEEE international conference on automatic face & gesture recognition (FG 2018). Xi’an, pp. 67–74
23.
go back to reference Knyazev B, Shvetsov R, Efremova N, Kuharenko A (2018) Leveraging large face recognition data for emotion classification. In: IEEE international conference on automatic face and gesture recognition (FG 2018). Xi’an, pp. 692–696 Knyazev B, Shvetsov R, Efremova N, Kuharenko A (2018) Leveraging large face recognition data for emotion classification. In: IEEE international conference on automatic face and gesture recognition (FG 2018). Xi’an, pp. 692–696
24.
go back to reference Kim JY, Liu C, Calvo RA, McCabe K, Taylor SCR, Schuller BW, Wu K (2019) A comparison of online automatic speech recognition systems and the nonverbal responses to unintelligible speech. arXiv:1904.12403 Kim JY, Liu C, Calvo RA, McCabe K, Taylor SCR, Schuller BW, Wu K (2019) A comparison of online automatic speech recognition systems and the nonverbal responses to unintelligible speech. arXiv:1904.12403
Metadata
Title
Alice: A General-Purpose Virtual Assistant Framework
Authors
Soon-Chang Poh
Yi-Fei Tan
Chee-Pun Ooi
Wooi-Haw Tan
Albert Quek
Chee-Yong Gan
Yew-Chun Lee
Zhun-Hau Yap
Chin-Leei Cham
Copyright Year
2021
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-33-4069-5_31

Premium Partner