Skip to main content
Erschienen in:

2021 | OriginalPaper | Buchkapitel

Building a Knowledge Graph of Vietnam Tourism from Text

verfasst von : Phuc Do, Hung Le

Erschienen in: Computational Science and Technology

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Most data in the world is in form of text. Therefore, we can say text stores large amount of the knowledge of human beings. Extracting useful knowledge from text, however, is not a simple task. In this paper, we present a complete pipeline to extract knowledge from paragraph. This pipeline combines state-of-the-art systems in order to yield optimal results. There are some other Knowledge Graphs such as Google Knowledge Graph, YAGO, or DBpedia. Most of the data in these Knowledge Graphs is in English. On the other hand, the results from our system is used to build a new Knowledge Graph in Vietnamese of Vietnam Tourism. We use the rich resources language like English to process a low resources language like Vietnamese. We utilize the NLP tools of English such as Google translate, Stanford parser, Co-referencing, ClausIE, MinIE. We develop Google Search to find the text describing the entities in the Internet. This text is in Vietnamese. Then, we translate the Vietnamese text into English text and use English NLP tools to extract triples. Finally, we translate the triples back into Vietnamese and build the knowledge graph of Vietnam tourism. We conduct experiment and discover the advantages and disadvantages of our method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ehrlinger L, Woss W (2016) Towards a definition of knowledge graphs Ehrlinger L, Woss W (2016) Towards a definition of knowledge graphs
2.
Zurück zum Zitat Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Cistac P, Rault T, Louf R, Funtowicz M, Brew J (2019) Transformers: state-of-the-art natural language processing. ArXiv Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Cistac P, Rault T, Louf R, Funtowicz M, Brew J (2019) Transformers: state-of-the-art natural language processing. ArXiv
3.
Zurück zum Zitat Gashteovski K, Gemulla R, Del Corro L (2017) MinIE: minimizing facts in open information extraction. In: Proceedings of the 2017 conference on empirical methods in natural language processing, pp 2630–2640 Gashteovski K, Gemulla R, Del Corro L (2017) MinIE: minimizing facts in open information extraction. In: Proceedings of the 2017 conference on empirical methods in natural language processing, pp 2630–2640
4.
Zurück zum Zitat Webber J (2012) A programmatic introduction to Neo4j. In: Proceedings of the 3rd annual conference on systems, programming, and applications: software for humanity, pp 217–218 Webber J (2012) A programmatic introduction to Neo4j. In: Proceedings of the 3rd annual conference on systems, programming, and applications: software for humanity, pp 217–218
5.
Zurück zum Zitat Suchanek F, Kasneci G, Weikum G (2007) YAGO: a core of semantic knowledge. In: 16th international world wide web conference, WWW2007, pp 697–706 Suchanek F, Kasneci G, Weikum G (2007) YAGO: a core of semantic knowledge. In: 16th international world wide web conference, WWW2007, pp 697–706
6.
Zurück zum Zitat Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes P, Hellmann S, Morsey M, Van Kleef P, Auer S, Bizer C (2014) DBpedia—a large-scale. Multilingual knowledge base extracted from Wikipedia, semantic web journal Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes P, Hellmann S, Morsey M, Van Kleef P, Auer S, Bizer C (2014) DBpedia—a large-scale. Multilingual knowledge base extracted from Wikipedia, semantic web journal
7.
Zurück zum Zitat Corro L, Gemulla R (2013) ClausIE: clause-based open information extraction. In: WWW 2013—proceedings of the 22nd international conference on world wide web, pp 355–366 Corro L, Gemulla R (2013) ClausIE: clause-based open information extraction. In: WWW 2013—proceedings of the 22nd international conference on world wide web, pp 355–366
8.
Zurück zum Zitat Do P (2019) SparkHINlog: extension of sparkDatalog for heterogeneous information network. J Intell Fuzzy Syst 37(6):7555–7566CrossRef Do P (2019) SparkHINlog: extension of sparkDatalog for heterogeneous information network. J Intell Fuzzy Syst 37(6):7555–7566CrossRef
9.
Zurück zum Zitat Vu T, Nguyen DQ, Nguyen D, Dras M, Johnson M (2018) VnCoreNLP: a vietnamese natural language processing toolkit. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: demonstrations, pp 56–60 Vu T, Nguyen DQ, Nguyen D, Dras M, Johnson M (2018) VnCoreNLP: a vietnamese natural language processing toolkit. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: demonstrations, pp 56–60
10.
Zurück zum Zitat Vu T, Nguyen DQ, Nguyen D, Dras M, Johnson M (2017) From word segmentation to POS tagging for Vietnamese. In: Proceedings of the 15th annual workshop of the Australasian Language technology association, pp 108–113 Vu T, Nguyen DQ, Nguyen D, Dras M, Johnson M (2017) From word segmentation to POS tagging for Vietnamese. In: Proceedings of the 15th annual workshop of the Australasian Language technology association, pp 108–113
Metadaten
Titel
Building a Knowledge Graph of Vietnam Tourism from Text
verfasst von
Phuc Do
Hung Le
Copyright-Jahr
2021
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-33-4069-5_1