
Knowledge-Enhanced Vietnamese Paraphrase Identification

  • 2026
  • OriginalPaper
  • Chapter
Published in:


Abstract

This chapter addresses Vietnamese Paraphrase Identification (PI), the task of deciding whether two sentences express the same meaning, which underpins many natural language processing applications. It highlights challenges specific to Vietnamese, such as its syllable-based orthography and six-tone system, which complicate word segmentation and disambiguation. The authors propose enhancing pretrained language models (PLMs) with external knowledge, specifically named entity information from Wikipedia obtained through Wikipedia2Vec. The reported experiments show significant improvements in PI accuracy over strong baselines such as BERT and its variants. The chapter also describes the corpora used, including the vnPara corpus and an augmented version enriched with diverse entities, and evaluates several PLMs in detail, analyzing how effective knowledge-enhanced models are for Vietnamese PI. The approach is presented as applicable to other languages and NLP tasks.
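The idea of enriching a sentence representation with entity knowledge can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how entity information is fused with the PLM, so concatenation with mean pooling is an assumption made here for illustration. The `ENTITY_VECTORS` table is a hypothetical stand-in for a Wikipedia2Vec lookup, the short sentence vectors stand in for PLM outputs, and the function names are invented for this example.

```python
from math import sqrt

# Hypothetical stand-in for a Wikipedia2Vec entity lookup (toy 3-d values).
ENTITY_VECTORS = {
    "Hà Nội": [0.9, 0.1, 0.0],
    "Việt Nam": [0.8, 0.2, 0.1],
    "Sài Gòn": [0.1, 0.9, 0.2],
}


def mean_entity_vector(entities, dim=3):
    """Average the vectors of known entities; zero vector if none are found."""
    found = [ENTITY_VECTORS[e] for e in entities if e in ENTITY_VECTORS]
    if not found:
        return [0.0] * dim
    return [sum(column) / len(found) for column in zip(*found)]


def enrich(sentence_vector, entities):
    """Concatenate a (toy) PLM sentence vector with the pooled entity vector.

    Concatenation is an assumed fusion strategy, chosen for simplicity.
    """
    return list(sentence_vector) + mean_entity_vector(entities)


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Two sentences with identical toy PLM vectors but different entities.
about_hanoi = enrich([0.5, 0.5], ["Hà Nội"])
about_saigon = enrich([0.5, 0.5], ["Sài Gòn"])
about_hanoi_vn = enrich([0.5, 0.5], ["Hà Nội", "Việt Nam"])
```

In this toy setup, the two sentences that mention related entities end up closer to each other than the pair that mentions different cities, even though their text-only vectors are identical. A real system would pass the enriched pair representation to a trained classifier instead of comparing cosine similarities directly.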


Title: Knowledge-Enhanced Vietnamese Paraphrase Identification
Authors: Minh Lu Xuan, Minh Nguyen Hong, Loc Nguyen Xuan, Thai Do Thanh, Duc Bui Tien, Quang Tran Minh
Copyright Year: 2026
Publisher: Springer Nature Singapore
DOI: https://doi.org/10.1007/978-981-95-4960-3_8