Skip to main content
Erschienen in: Neural Computing and Applications 21/2021

17.06.2021 | Original Article

Building a Vietnamese question answering system based on knowledge graph and distributed CNN

verfasst von: Trung Phan, Phuc Do

Erschienen in: Neural Computing and Applications | Ausgabe 21/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Question answering system (QAS) can be applied everywhere such as in schools, hospitals, banks, e-commerce websites. A smart QAS that can replace people is what people expect. Therefore, there are a lot of studies to build, develop, and improve QAS. However, QAS used for low-resource languages like Vietnamese is still very limited. So, in this paper, we present a method for building Vietnamese QAS. Except for specific Vietnamese language processes, most of our solutions can also be applied to other languages. We build QAS based on knowledge graph (KG) and convolutional neural network (CNN). KG provides knowledge and deducing ability for QAS. CNN is used to classify questions in the natural language to identify the correct answer to a given question. Moreover, we also use distributed architecture to train the CNN model. On the other hands, we also propose a solution to speed up searching for answers in a large KG by partitioning and indexing KG by using the DM-Tree structure. Besides, we also present experimental results and evaluation results of our model using common metrics to prove the effectiveness of our solution.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Holotescu C (2016) Moocbuddy: A chatbot for personalized learning with moocs. In: RoCHI Holotescu C (2016) Moocbuddy: A chatbot for personalized learning with moocs. In: RoCHI
2.
Zurück zum Zitat Fadhil A, Villafiorita A (2017) An adaptive learning with gamification & conversational uis: The rise of cibopolibot In: Adjunct Publication of the 25th Conference on User Modeling, Adaptation and Personalization, ser. UMAP ’17, Bratislava, Slovakia: Association for Computing Machinery, 408–412, ISBN: 9781450350679. [Online]. Available: https://doi.org/10.1145/3099023.3099112 Fadhil A, Villafiorita A (2017) An adaptive learning with gamification & conversational uis: The rise of cibopolibot In: Adjunct Publication of the 25th Conference on User Modeling, Adaptation and Personalization, ser. UMAP ’17, Bratislava, Slovakia: Association for Computing Machinery, 408–412, ISBN: 9781450350679. [Online]. Available: https://​doi.​org/​10.​1145/​3099023.​3099112
3.
Zurück zum Zitat Page LC, Gehlbach H (2017) How an artificially intelligent virtual assistant helps students navigate the road to college. AERA Open 3:233285841774922CrossRef Page LC, Gehlbach H (2017) How an artificially intelligent virtual assistant helps students navigate the road to college. AERA Open 3:233285841774922CrossRef
5.
Zurück zum Zitat Wahyudi ML, Khodra AS, Prihatmanto, Machbub C (2018) A Question Answering System Using Graph-Pattern Association Rules (QAGPAR) on YAGO Knowledge Base. In: 2018 International Conference on Information Technology Systems and Innovation, ICITSI 2018 - Proceedings, ISBN: 9781538656938. https://doi.org/10.1109/ICITSI.2018.8696046. arXiv:1902.00624 Wahyudi ML, Khodra AS, Prihatmanto, Machbub C (2018) A Question Answering System Using Graph-Pattern Association Rules (QAGPAR) on YAGO Knowledge Base. In: 2018 International Conference on Information Technology Systems and Innovation, ICITSI 2018 - Proceedings, ISBN: 9781538656938. https://​doi.​org/​10.​1109/​ICITSI.​2018.​8696046. arXiv:1902.00624
7.
Zurück zum Zitat Brandtzæg PB, Følstad A (2018) Chatbots: Changing user needs and motivations. Interactions 25:38–43CrossRef Brandtzæg PB, Følstad A (2018) Chatbots: Changing user needs and motivations. Interactions 25:38–43CrossRef
12.
Zurück zum Zitat Afrae B, Mohamed BA, Boudhir AA (2020) A question answering system with a sequence to sequence grammatical correction. In: Proceedings of the 3rd International Conference on Networking, Information Systems Security, ser. NISS2020, Marrakech, Morocco: Association for Computing Machinery, ISBN: 9781450376341. [Online]. Available: https://doi.org/10.1145/3386723.3387894 Afrae B, Mohamed BA, Boudhir AA (2020) A question answering system with a sequence to sequence grammatical correction. In: Proceedings of the 3rd International Conference on Networking, Information Systems Security, ser. NISS2020, Marrakech, Morocco: Association for Computing Machinery, ISBN: 9781450376341. [Online]. Available: https://​doi.​org/​10.​1145/​3386723.​3387894
13.
Zurück zum Zitat Wang Y, Chen Q, He C, Liu H, Wu X (2020) Knowledge base question answering system based on knowledge graph representation learning. In: Proceedings of the 2020 the 4th International Conference on Innovation in Artificial Intelligence, ser. ICIAI 2020, Xiamen, China: Association for Computing Machinery, 170–179, ISBN: 9781450376587. [Online]. Available: https://doi.org/10.1145/3390557.3394296 Wang Y, Chen Q, He C, Liu H, Wu X (2020) Knowledge base question answering system based on knowledge graph representation learning. In: Proceedings of the 2020 the 4th International Conference on Innovation in Artificial Intelligence, ser. ICIAI 2020, Xiamen, China: Association for Computing Machinery, 170–179, ISBN: 9781450376587. [Online]. Available: https://​doi.​org/​10.​1145/​3390557.​3394296
14.
Zurück zum Zitat Bhagat P, Prajapati SK, Seth A (2020) Initial lessons from building an ivrbased automated question-answering system. In: Proceedings of the 2020 International Conference on Information and Communication Technologies and Development, ser. ICTD2020, Guayaquil, Ecuador: Association for Computing Machinery, ISBN: 9781450387620. [Online]. Available: https://doi.org/10.1145/3392561.3397581 Bhagat P, Prajapati SK, Seth A (2020) Initial lessons from building an ivrbased automated question-answering system. In: Proceedings of the 2020 International Conference on Information and Communication Technologies and Development, ser. ICTD2020, Guayaquil, Ecuador: Association for Computing Machinery, ISBN: 9781450387620. [Online]. Available: https://​doi.​org/​10.​1145/​3392561.​3397581
15.
16.
Zurück zum Zitat Pham ST, Nguyen DT (2016) A computational and inferential method for analyzing the semantics of phrase and sentence in Vietnamese Question Answering System Model (VietQASM). In: Proceedings - AMS 2015: Asia Modelling Symposium 2015 - Asia 9th International Conference on Mathematical Modelling and Computer Simulation, ISBN: 9781467383233. https://doi.org/10.1109/AMS.2015.26 Pham ST, Nguyen DT (2016) A computational and inferential method for analyzing the semantics of phrase and sentence in Vietnamese Question Answering System Model (VietQASM). In: Proceedings - AMS 2015: Asia Modelling Symposium 2015 - Asia 9th International Conference on Mathematical Modelling and Computer Simulation, ISBN: 9781467383233. https://​doi.​org/​10.​1109/​AMS.​2015.​26
17.
Zurück zum Zitat Lafferty J, McCallum A, Pereira FCN (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data. ICML ’01 Proceedings of the Eighteenth International Conference on Machine Learning, ISSN: 1750-2799. https://doi.org/10.1038/nprot.2006.61. arXiv:arXiv:1011.4088v1 Lafferty J, McCallum A, Pereira FCN (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data. ICML ’01 Proceedings of the Eighteenth International Conference on Machine Learning, ISSN: 1750-2799. https://​doi.​org/​10.​1038/​nprot.​2006.​61. arXiv:arXiv:1011.4088v1
18.
Zurück zum Zitat Le-Hong P, Bui D-T (2018) A factoid question answering system for vietnamese. In: Companion Proceedings of the The Web Conference 2018, ser.WWW ’18, Lyon, France: International World Wide Web Conferences Steering Committee, 1049–1055, ISBN: 9781450356404. [Online]. Available: https://doi.org/10.1145/3184558.3191535 Le-Hong P, Bui D-T (2018) A factoid question answering system for vietnamese. In: Companion Proceedings of the The Web Conference 2018, ser.WWW ’18, Lyon, France: International World Wide Web Conferences Steering Committee, 1049–1055, ISBN: 9781450356404. [Online]. Available: https://​doi.​org/​10.​1145/​3184558.​3191535
20.
Zurück zum Zitat Allam AMN, Haggag MH (2016) The question answering systems : A survey. International Journal of Research and Reviews in Information Sciences (IJRRIS) 2(3) Allam AMN, Haggag MH (2016) The question answering systems : A survey. International Journal of Research and Reviews in Information Sciences (IJRRIS) 2(3)
22.
Zurück zum Zitat Sandhini S, Binu R (2018) Classification of question answering systems: A survey. In: Emerging Trends in Engineering, Science and Technology for Society, Energy and Environment - Proceedings of the International Conference in Emerging Trends in Engineering, Science and Technology, ICETEST 2018, ISBN: 9780815357605 Sandhini S, Binu R (2018) Classification of question answering systems: A survey. In: Emerging Trends in Engineering, Science and Technology for Society, Energy and Environment - Proceedings of the International Conference in Emerging Trends in Engineering, Science and Technology, ICETEST 2018, ISBN: 9780815357605
24.
Zurück zum Zitat Lei D, Chen X, Zhao J (2018) Opening the black box of deep learning, May 22. arXiv:1805.08355v1 [cs.LG] Lei D, Chen X, Zhao J (2018) Opening the black box of deep learning, May 22. arXiv:1805.08355v1 [cs.LG]
25.
Zurück zum Zitat Chang DT (2018) Concept-oriented deep learning, 5. arXiv:1806.01756v1 [cs.AI] Chang DT (2018) Concept-oriented deep learning, 5. arXiv:1806.01756v1 [cs.AI]
26.
Zurück zum Zitat Bhandare A, Bhide M, Gokhale P, Chandavarkar R (2016) Applications of Convolutional Neural Networks. Int J Comput Sci Inf Technol 7(5):2206–2215 Bhandare A, Bhide M, Gokhale P, Chandavarkar R (2016) Applications of Convolutional Neural Networks. Int J Comput Sci Inf Technol 7(5):2206–2215
29.
31.
Zurück zum Zitat Nguyen DQ, Vu T, Nguyen DQ, Dras M, Johnson M (2017) From word segmentation to POS tagging for Vietnamese. In: Proceedings of the Australasian Language Technology Association Workshop 2017, Brisbane, Australia, pp 108–113. [Online]. Available: https://www.aclweb.org/anthology/U17-1013 Nguyen DQ, Vu T, Nguyen DQ, Dras M, Johnson M (2017) From word segmentation to POS tagging for Vietnamese. In: Proceedings of the Australasian Language Technology Association Workshop 2017, Brisbane, Australia, pp 108–113. [Online]. Available: https://​www.​aclweb.​org/​anthology/​U17-1013
32.
Zurück zum Zitat Nguyen DQ, Nguyen DQ, Vu T, Dras M, Johnson M (2018) A fast and accurate vietnamese word segmenter. In: Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (eds.) Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018. European Language Resources Association (ELRA). [Online]. Available: http://www.lrec-conf.org/proceedings/lrec2018/summaries/55.html Nguyen DQ, Nguyen DQ, Vu T, Dras M, Johnson M (2018) A fast and accurate vietnamese word segmenter. In: Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (eds.) Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018. European Language Resources Association (ELRA). [Online]. Available: http://​www.​lrec-conf.​org/​proceedings/​lrec2018/​summaries/​55.​html
37.
Zurück zum Zitat Ciaccia P, Patella M, Zezula P (1997) M-tree: An efficient access method for similarity search in metric spaces. In: Proceedings of the 23rd International Conference on Very Large Databases, VLDB 1997, ISBN: 1558604707 Ciaccia P, Patella M, Zezula P (1997) M-tree: An efficient access method for similarity search in metric spaces. In: Proceedings of the 23rd International Conference on Very Large Databases, VLDB 1997, ISBN: 1558604707
38.
Zurück zum Zitat Do P, Hong TP, To HD (2020) Dmtree: A novel indexing method for finding similarities in large vector sets. Int J Adv Comput Sci Appl 11(4):0110483 Do P, Hong TP, To HD (2020) Dmtree: A novel indexing method for finding similarities in large vector sets. Int J Adv Comput Sci Appl 11(4):0110483
39.
Zurück zum Zitat Cloudera I (2017) Spark Guide. ISBN: 1650362048 Cloudera I (2017) Spark Guide. ISBN: 1650362048
40.
Zurück zum Zitat Databricks (2017) A Gentle Introduction To Apache Spark. Communication Databricks (2017) A Gentle Introduction To Apache Spark. Communication
41.
Zurück zum Zitat Drabas T, Lee D (2017) Learning PySpark. ISBN: 1786463709 Drabas T, Lee D (2017) Learning PySpark. ISBN: 1786463709
Metadaten
Titel
Building a Vietnamese question answering system based on knowledge graph and distributed CNN
verfasst von
Trung Phan
Phuc Do
Publikationsdatum
17.06.2021
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 21/2021
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-021-06126-z

Weitere Artikel der Ausgabe 21/2021

Neural Computing and Applications 21/2021 Zur Ausgabe