Skip to main content
Top

2025 | OriginalPaper | Chapter

iText2KG: Incremental Knowledge Graphs Construction Using Large Language Models

Authors : Yassir Lairgi, Ludovic Moncla, Rémy Cazabet, Khalid Benabdeslem, Pierre Cléau

Published in: Web Information Systems Engineering – WISE 2024

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Most available data is unstructured, making it challenging to access valuable information. Automatically building Knowledge Graphs (KGs) is crucial for structuring data and making it accessible, allowing users to search for information effectively. KGs also facilitate insights, inference, and reasoning. Traditional NLP methods, such as named entity recognition and relation extraction, are key in information retrieval but face limitations, including predefined entity types and the need for supervised learning. Current research leverages large language models’ capabilities, such as zero- or few-shot learning. However, unresolved and semantically duplicated entities and relations still pose challenges, leading to inconsistent graphs and requiring extensive post-processing. Additionally, most approaches are topic-dependent. In this paper, we propose iText2KG (The code and the dataset are available at https://​github.​com/​AuvaLab/​itext2kg), a method for incremental, topic-independent KG construction without post-processing. This plug-and-play, zero-shot method is applicable across a wide range of KG construction scenarios and comprises four modules: Documents Distiller, Incremental Entities Extractor, Incremental Relations Extractor, and Graph Integrator. Our method demonstrates superior performance compared to baseline methods across three scenarios: converting scientific papers to graphs, websites to graphs, and CVs to graphs.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Carta, S., Giuliani, A., Piano, L., Podda, A.S., Pompianu, L., Tiddia, S.G.: Iterative zero-shot LLM prompting for knowledge graph construction. arXiv preprint arXiv:2307.01128 (2023) Carta, S., Giuliani, A., Piano, L., Podda, A.S., Pompianu, L., Tiddia, S.G.: Iterative zero-shot LLM prompting for knowledge graph construction. arXiv preprint arXiv:​2307.​01128 (2023)
2.
go back to reference Ding, L., Zhou, S., Xiao, J., Han, J.: Automated construction of theme-specific knowledge graphs. arXiv preprint arXiv:2404.19146 (2024) Ding, L., Zhou, S., Xiao, J., Han, J.: Automated construction of theme-specific knowledge graphs. arXiv preprint arXiv:​2404.​19146 (2024)
3.
go back to reference Eberendu, A.C., et al.: Unstructured data: an overview of the data of big data. Int. J. Comput. Trends Technol. 38(1), 46–50 (2016)CrossRef Eberendu, A.C., et al.: Unstructured data: an overview of the data of big data. Int. J. Comput. Trends Technol. 38(1), 46–50 (2016)CrossRef
4.
go back to reference Hu, Y., Zou, F., Han, J., Sun, X., Wang, Y.: LLM-TIKG: threat intelligence knowledge graph construction utilizing large language model. Available at SSRN 4671345 (2023) Hu, Y., Zou, F., Han, J., Sun, X., Wang, Y.: LLM-TIKG: threat intelligence knowledge graph construction utilizing large language model. Available at SSRN 4671345 (2023)
5.
go back to reference Jin, B., Liu, G., Han, C., Jiang, M., Ji, H., Han, J.: Large language models on graphs: a comprehensive survey. arXiv preprint arXiv:2312.02783 (2023) Jin, B., Liu, G., Han, C., Jiang, M., Ji, H., Han, J.: Large language models on graphs: a comprehensive survey. arXiv preprint arXiv:​2312.​02783 (2023)
6.
go back to reference Kabal, O., Harazallah, M., Guillet, F., Ichise, R.: Enhancing domain-independent knowledge graph construction through OpenIE cleaning and LLMs validation (G-T2KG). In: 28th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2024) (2024), to appear Kabal, O., Harazallah, M., Guillet, F., Ichise, R.: Enhancing domain-independent knowledge graph construction through OpenIE cleaning and LLMs validation (G-T2KG). In: 28th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2024) (2024), to appear
7.
go back to reference Kommineni, V.K., König-Ries, B., Samuel, S.: From human experts to machines: an LLM supported approach to ontology and knowledge graph construction. arXiv preprint arXiv:2403.08345 (2024) Kommineni, V.K., König-Ries, B., Samuel, S.: From human experts to machines: an LLM supported approach to ontology and knowledge graph construction. arXiv preprint arXiv:​2403.​08345 (2024)
9.
go back to reference Nasar, Z., Jaffry, S.W., Malik, M.K.: Named entity recognition and relation extraction: state-of-the-art. ACM Comput. Surv. (CSUR) 54(1), 1–39 (2021)CrossRef Nasar, Z., Jaffry, S.W., Malik, M.K.: Named entity recognition and relation extraction: state-of-the-art. ACM Comput. Surv. (CSUR) 54(1), 1–39 (2021)CrossRef
11.
go back to reference Sun, Z., Ting, Y.S., Liang, Y., Duan, N., Huang, S., Cai, Z.: Knowledge graph in astronomical research with large language models: quantifying driving forces in interdisciplinary scientific discovery. arXiv preprint arXiv:2406.01391 (2024) Sun, Z., Ting, Y.S., Liang, Y., Duan, N., Huang, S., Cai, Z.: Knowledge graph in astronomical research with large language models: quantifying driving forces in interdisciplinary scientific discovery. arXiv preprint arXiv:​2406.​01391 (2024)
12.
go back to reference Wornow, M., Lozano, A., Dash, D., Jindal, J., Mahaffey, K.W., Shah, N.H.: Zero-shot clinical trial patient matching with LLMs. arXiv preprint arXiv:2402.05125 (2024) Wornow, M., Lozano, A., Dash, D., Jindal, J., Mahaffey, K.W., Shah, N.H.: Zero-shot clinical trial patient matching with LLMs. arXiv preprint arXiv:​2402.​05125 (2024)
13.
go back to reference Zhang, Y., et al.: AttacKG+: Boosting attack knowledge graph construction with large language models. arXiv preprint arXiv:2405.04753 (2024) Zhang, Y., et al.: AttacKG+: Boosting attack knowledge graph construction with large language models. arXiv preprint arXiv:​2405.​04753 (2024)
14.
go back to reference Zhu, Y., et al.: LLMs for knowledge graph construction and reasoning: recent capabilities and future opportunities. arXiv preprint arXiv:2305.13168 (2023) Zhu, Y., et al.: LLMs for knowledge graph construction and reasoning: recent capabilities and future opportunities. arXiv preprint arXiv:​2305.​13168 (2023)
Metadata
Title
iText2KG: Incremental Knowledge Graphs Construction Using Large Language Models
Authors
Yassir Lairgi
Ludovic Moncla
Rémy Cazabet
Khalid Benabdeslem
Pierre Cléau
Copyright Year
2025
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-96-0573-6_16

Premium Partner