Skip to main content
Top

14. Graph-Based Hierarchical Record Clustering for Unsupervised Entity Resolution

  • 2022
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The chapter discusses the critical problem of entity resolution in data cleaning, curation, and integration, focusing on unsupervised methods. It introduces a graph-based hierarchical record clustering technique, GDWM, which modifies the Data Washing Machine (DWM) algorithm to enhance accuracy and efficiency. The method integrates graph-based transitive closure and Modularity optimization, eliminating the need for iterative reiterations and threshold setting. The chapter also presents experiments on synthetic benchmark datasets, demonstrating the effectiveness and superior performance of GDWM compared to the original DWM. The results highlight significant improvements in precision, recall, F1 scores, and execution time, making GDWM a standout solution in the field of unsupervised entity resolution.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
Graph-Based Hierarchical Record Clustering for Unsupervised Entity Resolution
Authors
Islam Akef Ebeid
John R. Talburt
Md Abdus Salam Siddique
Copyright Year
2022
DOI
https://doi.org/10.1007/978-3-030-97652-1_14
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH, Ferrari electronic AG/© Ferrari electronic AG