Skip to main content
Erschienen in: Journal on Data Semantics 3-4/2021

16.10.2021 | Original Article

Measuring Clusters of Labels in an Embedding Space to Refine Relations in Ontology Alignment

verfasst von: Molka Tounsi Dhouib, Catherine Faron, Andrea G. B. Tettamanzi

Erschienen in: Journal on Data Semantics | Ausgabe 3-4/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Ontology alignment plays a key role in the management of heterogeneous data sources and metadata. In this context, various ontology alignment techniques have been proposed to discover correspondences between the entities of different ontologies. This paper proposes a new ontology alignment approach based on a set of rules exploiting the embedding space and measuring clusters of labels to discover the relationship between entities. We tested our system on the OAEI conference complex alignment benchmark track and then applied it to aligning ontologies in a real-world case study. The experimental results show that the combination of word embedding and a measure of dispersion of the clusters of labels, which we call the radius measure, makes it possible to determine, with good accuracy, not only equivalence relations, but also hierarchical relations between entities.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alshargi F, Shekarpour S, Soru T, Sheth A (2018a) Concept2vec: metrics for evaluating quality of embeddings for ontological concepts. arXiv:1803.04488 Alshargi F, Shekarpour S, Soru T, Sheth A (2018a) Concept2vec: metrics for evaluating quality of embeddings for ontological concepts. arXiv:​1803.​04488
2.
Zurück zum Zitat Alshargi F, Shekarpour S, Soru T, Sheth AP (2018b) Metrics for evaluating quality of embeddings for Ontological concepts. Proceedings of the AAAI 2019 Spring Symposium on Combining Machine Learning with Knowledge Engineering (AAAI-MAKE 2019) Alshargi F, Shekarpour S, Soru T, Sheth AP (2018b) Metrics for evaluating quality of embeddings for Ontological concepts. Proceedings of the AAAI 2019 Spring Symposium on Combining Machine Learning with Knowledge Engineering (AAAI-MAKE 2019)
3.
Zurück zum Zitat Ardjani F, Bouchiha D, Malki M (2015) Ontology-alignment techniques: survey and analysis. Int J Modern Educ Comput Sci 7:11 Ardjani F, Bouchiha D, Malki M (2015) Ontology-alignment techniques: survey and analysis. Int J Modern Educ Comput Sci 7:11
4.
Zurück zum Zitat Aumueller D, Do HH, Massmann S, Rahm E (2005) Schema and ontology matching with coma++. In: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pp 906–908 Aumueller D, Do HH, Massmann S, Rahm E (2005) Schema and ontology matching with coma++. In: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pp 906–908
5.
Zurück zum Zitat Chen M, Tian Y, Yang M, Zaniolo C (2016) Multilingual knowledge graph embeddings for cross-lingual knowledge alignment. arXiv:1611.03954 Chen M, Tian Y, Yang M, Zaniolo C (2016) Multilingual knowledge graph embeddings for cross-lingual knowledge alignment. arXiv:​1611.​03954
6.
Zurück zum Zitat Cruz IF, Antonelli FP, Stroe C (2009) Agreementmaker: efficient matching for large real-world schemas and ontologies. Proce VLDB Endow 2(2):1586–1589CrossRef Cruz IF, Antonelli FP, Stroe C (2009) Agreementmaker: efficient matching for large real-world schemas and ontologies. Proce VLDB Endow 2(2):1586–1589CrossRef
8.
Zurück zum Zitat Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv:​1810.​04805
9.
Zurück zum Zitat Do HH, Rahm E (2002) Coma-a system for flexible combination of schema matching approaches. In: VLDB’02: proceedings of the 28th international conference on very large databases, Elsevier, pp 610–621 Do HH, Rahm E (2002) Coma-a system for flexible combination of schema matching approaches. In: VLDB’02: proceedings of the 28th international conference on very large databases, Elsevier, pp 610–621
10.
Zurück zum Zitat Doan A, Halevy AY (2005) Semantic integration research in the database community: a brief survey. AI Mag 26(1):83–83 Doan A, Halevy AY (2005) Semantic integration research in the database community: a brief survey. AI Mag 26(1):83–83
11.
Zurück zum Zitat Ehrig M, Staab S (2004) Qom–quick ontology mapping. In: International semantic web conference. Springer, pp 683–697 Ehrig M, Staab S (2004) Qom–quick ontology mapping. In: International semantic web conference. Springer, pp 683–697
12.
Zurück zum Zitat Euzenat J, Valtchev P (2004) Similarity-based ontology alignment in owl-lite. In: Proceedings of the 16th European conference on artificial intelligence (ECAI). IOS press, pp 333–337 Euzenat J, Valtchev P (2004) Similarity-based ontology alignment in owl-lite. In: Proceedings of the 16th European conference on artificial intelligence (ECAI). IOS press, pp 333–337
13.
Zurück zum Zitat Euzenat J, Shvaiko P et al (2007) Ontology matching, vol 18. Springer, BerlinMATH Euzenat J, Shvaiko P et al (2007) Ontology matching, vol 18. Springer, BerlinMATH
14.
Zurück zum Zitat Giunchiglia F, Shvaiko P, Yatskevich M (2004) S-match: an algorithm and an implementation of semantic matching. In: European semantic web symposium. Springer, pp 61–75 Giunchiglia F, Shvaiko P, Yatskevich M (2004) S-match: an algorithm and an implementation of semantic matching. In: European semantic web symposium. Springer, pp 61–75
15.
Zurück zum Zitat Giunchiglia F, Yatskevich M, Shvaiko P (2007) Semantic matching: algorithms and implementation. In: Journal on data semantics IX. Springer, pp 1–38 Giunchiglia F, Yatskevich M, Shvaiko P (2007) Semantic matching: algorithms and implementation. In: Journal on data semantics IX. Springer, pp 1–38
17.
Zurück zum Zitat Gromann D, Declerck T (2018) Comparing pretrained multilingual word embeddings on an ontology alignment task. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018) Gromann D, Declerck T (2018) Comparing pretrained multilingual word embeddings on an ontology alignment task. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018)
18.
Zurück zum Zitat Hassen W (2012) Medley results for oaei 2012. In: Proceedings of the 7th international conference on ontology matching-volume 946, CEUR-WS. org, pp 168–172 Hassen W (2012) Medley results for oaei 2012. In: Proceedings of the 7th international conference on ontology matching-volume 946, CEUR-WS. org, pp 168–172
19.
Zurück zum Zitat Jean-Mary YR, Shironoshita EP, Kabuka MR (2009) Ontology matching with semantic verification. J Web Seman 7(3):235–251CrossRef Jean-Mary YR, Shironoshita EP, Kabuka MR (2009) Ontology matching with semantic verification. J Web Seman 7(3):235–251CrossRef
20.
Zurück zum Zitat Jian N, Hu W, Cheng G, Qu Y (2005) Falcon-ao: Aligning ontologies with falcon. In: Proceedings of K-CAP workshop on integrating ontologies, pp 85–91 Jian N, Hu W, Cheng G, Qu Y (2005) Falcon-ao: Aligning ontologies with falcon. In: Proceedings of K-CAP workshop on integrating ontologies, pp 85–91
21.
Zurück zum Zitat Jiang S, Lowd D, Kafle S, Dou D (2016) Ontology matching with knowledge rules. In: Transactions on large-scale data-and knowledge-centered systems XXVIII. Springer, pp 75–95 Jiang S, Lowd D, Kafle S, Dou D (2016) Ontology matching with knowledge rules. In: Transactions on large-scale data-and knowledge-centered systems XXVIII. Springer, pp 75–95
22.
Zurück zum Zitat Kalfoglou Y, Schorlemmer M (2003) Ontology mapping: the state of the art. Knowl Eng Rev 18(1):1–31CrossRef Kalfoglou Y, Schorlemmer M (2003) Ontology mapping: the state of the art. Knowl Eng Rev 18(1):1–31CrossRef
23.
Zurück zum Zitat Kolyvakis P, Kalousis A, Kiritsis D (2018) Deepalignment: unsupervised ontology matching with refined word vectors. In: Proceedings of the 2018 conference of the north american chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers), pp 787–798 Kolyvakis P, Kalousis A, Kiritsis D (2018) Deepalignment: unsupervised ontology matching with refined word vectors. In: Proceedings of the 2018 conference of the north american chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers), pp 787–798
24.
Zurück zum Zitat Lastra-Díaz JJ, Goikoetxea J, Taieb MAH, García-Serrano A, Aouicha MB, Agirre E (2019) A reproducible survey on word embeddings and ontology-based methods for word similarity: linear combinations outperform the state of the art. Eng Appl Artif Intell 85:645–665CrossRef Lastra-Díaz JJ, Goikoetxea J, Taieb MAH, García-Serrano A, Aouicha MB, Agirre E (2019) A reproducible survey on word embeddings and ontology-based methods for word similarity: linear combinations outperform the state of the art. Eng Appl Artif Intell 85:645–665CrossRef
25.
Zurück zum Zitat Lesk M (1986) Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th annual international conference on systems documentation, Citeseer, pp 24–26 Lesk M (1986) Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th annual international conference on systems documentation, Citeseer, pp 24–26
26.
Zurück zum Zitat Li J, Tang J, Li Y, Luo Q (2008) Rimom: a dynamic multistrategy ontology alignment framework. IEEE Trans Knowl Data Eng 21(8):1218–1232 Li J, Tang J, Li Y, Luo Q (2008) Rimom: a dynamic multistrategy ontology alignment framework. IEEE Trans Knowl Data Eng 21(8):1218–1232
27.
Zurück zum Zitat Madhavan J, Bernstein PA, Rahm E (2001) Generic schema matching with cupid. In: vldb, Citeseer, vol 1, pp 49–58 Madhavan J, Bernstein PA, Rahm E (2001) Generic schema matching with cupid. In: vldb, Citeseer, vol 1, pp 49–58
28.
Zurück zum Zitat Martin L, Muller B, Suárez PJO, Dupont Y, Romary L, de la Clergerie ÉV, Seddah D, Sagot B (2019) Camembert: a tasty french language model. arXiv:1911.03894 Martin L, Muller B, Suárez PJO, Dupont Y, Romary L, de la Clergerie ÉV, Seddah D, Sagot B (2019) Camembert: a tasty french language model. arXiv:​1911.​03894
29.
Zurück zum Zitat Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119 Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119
30.
Zurück zum Zitat Mohammadi M, Atashin AA, Hofman W, Tan Y (2018) Comparison of ontology alignment systems across single matching task via the Mcnemar’s test. ACM Trans Knowl Discov Data (TKDD) 12(4):51 Mohammadi M, Atashin AA, Hofman W, Tan Y (2018) Comparison of ontology alignment systems across single matching task via the Mcnemar’s test. ACM Trans Knowl Discov Data (TKDD) 12(4):51
31.
Zurück zum Zitat Monge AE, Elkan C et al (1996) The field matching problem: algorithms and applications. Kdd 2:267–270 Monge AE, Elkan C et al (1996) The field matching problem: algorithms and applications. Kdd 2:267–270
32.
Zurück zum Zitat Ngo D, Bellahsene Z (2012) Yam++: a multi-strategy based approach for ontology matching task. In: International conference on knowledge engineering and knowledge management. Springer, pp 421–425 Ngo D, Bellahsene Z (2012) Yam++: a multi-strategy based approach for ontology matching task. In: International conference on knowledge engineering and knowledge management. Springer, pp 421–425
33.
Zurück zum Zitat Nkisi-Orji I, Wiratunga N, Massie S, Hui KY, Heaven R (2018) Ontology alignment based on word embedding and random forest classification. In: Joint European conference on machine learning and knowledge discovery in databases. Springer, pp 557–572 Nkisi-Orji I, Wiratunga N, Massie S, Hui KY, Heaven R (2018) Ontology alignment based on word embedding and random forest classification. In: Joint European conference on machine learning and knowledge discovery in databases. Springer, pp 557–572
34.
Zurück zum Zitat Noy NF, Musen MA (2001) Anchor-prompt: Using non-local context for semantic matching. In: OIS@ IJCAI Noy NF, Musen MA (2001) Anchor-prompt: Using non-local context for semantic matching. In: OIS@ IJCAI
35.
Zurück zum Zitat Ochieng P, Kyanda S (2018) Large-scale ontology matching: state-of-the-art analysis. ACM Comput Surv (CSUR) 51(4):75CrossRef Ochieng P, Kyanda S (2018) Large-scale ontology matching: state-of-the-art analysis. ACM Comput Surv (CSUR) 51(4):75CrossRef
36.
Zurück zum Zitat Otero-Cerdeira L, Rodríguez-Martínez FJ, Gómez-Rodríguez A (2015) Ontology matching: a literature review. Expert Syst Appl 42(2):949–971CrossRef Otero-Cerdeira L, Rodríguez-Martínez FJ, Gómez-Rodríguez A (2015) Ontology matching: a literature review. Expert Syst Appl 42(2):949–971CrossRef
37.
Zurück zum Zitat Parrochia D, Neuville P (2014) Taxinomie et réalité: vers une métaclassification. ISTE Group Parrochia D, Neuville P (2014) Taxinomie et réalité: vers une métaclassification. ISTE Group
38.
Zurück zum Zitat Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L (2018) Deep contextualized word representations. arXiv:1802.05365 Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L (2018) Deep contextualized word representations. arXiv:​1802.​05365
39.
Zurück zum Zitat Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10(4):334–350CrossRef Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10(4):334–350CrossRef
40.
Zurück zum Zitat Ristoski P, Faralli S, Ponzetto SP, Paulheim H (2017) Large-scale taxonomy induction using entity and word embeddings. In: Proceedings of the international conference on web intelligence. ACM, pp 81–87 Ristoski P, Faralli S, Ponzetto SP, Paulheim H (2017) Large-scale taxonomy induction using entity and word embeddings. In: Proceedings of the international conference on web intelligence. ACM, pp 81–87
41.
Zurück zum Zitat Ritze D, Meilicke C, Šváb-Zamazal O, Stuckenschmidt H (2009) A pattern-based ontology matching approach for detecting complex correspondences. In: ISWC workshop on ontology matching, chantilly (VA US), pp 25–36 Ritze D, Meilicke C, Šváb-Zamazal O, Stuckenschmidt H (2009) A pattern-based ontology matching approach for detecting complex correspondences. In: ISWC workshop on ontology matching, chantilly (VA US), pp 25–36
42.
Zurück zum Zitat Ritze D, Völker J, Meilicke C, Sváb-Zamazal O (2010) Linguistic analysis for complex ontology matching. In: CEUR workshop proceedings, RWTH, vol 689, Paper–1 Ritze D, Völker J, Meilicke C, Sváb-Zamazal O (2010) Linguistic analysis for complex ontology matching. In: CEUR workshop proceedings, RWTH, vol 689, Paper–1
43.
Zurück zum Zitat Schmidt D, Basso R, Trojahn C, Vieira R (2018) Matching domain and top-level ontologies exploring word sense disambiguation and word embedding. In: Ontology matching: OM-2018: proceedings of the ISWC workshop, p 1 Schmidt D, Basso R, Trojahn C, Vieira R (2018) Matching domain and top-level ontologies exploring word sense disambiguation and word embedding. In: Ontology matching: OM-2018: proceedings of the ISWC workshop, p 1
44.
Zurück zum Zitat Shvaiko P, Euzenat J (2005) A survey of schema-based matching approaches. In: Journal on data semantics IV. Springer, pp 146–171 Shvaiko P, Euzenat J (2005) A survey of schema-based matching approaches. In: Journal on data semantics IV. Springer, pp 146–171
45.
Zurück zum Zitat Shvaiko P, Euzenat J (2011) Ontology matching: state of the art and future challenges. IEEE Trans Knowl Data Eng 25(1):158–176CrossRef Shvaiko P, Euzenat J (2011) Ontology matching: state of the art and future challenges. IEEE Trans Knowl Data Eng 25(1):158–176CrossRef
46.
Zurück zum Zitat Sun M, Zhu H, Xie R, Liu Z (2017) Iterative entity alignment via joint knowledge embeddings. In: International joint conference on artificial intelligence. AAAI Press Sun M, Zhu H, Xie R, Liu Z (2017) Iterative entity alignment via joint knowledge embeddings. In: International joint conference on artificial intelligence. AAAI Press
47.
Zurück zum Zitat Sun Z, Hu W, Zhang Q, Qu Y (2018) Bootstrapping entity alignment with knowledge graph embedding. IJCAI 18:4396–4402 Sun Z, Hu W, Zhang Q, Qu Y (2018) Bootstrapping entity alignment with knowledge graph embedding. IJCAI 18:4396–4402
48.
Zurück zum Zitat Thieblin E (2019) Task-oriented complex alignments on conference organisation Thieblin E (2019) Task-oriented complex alignments on conference organisation
49.
Zurück zum Zitat Thiéblin E, Haemmerlé O, Hernandez N, Trojahn C (2017) Un jeu de données d’évaluation de correspondances complexes entre ontologies Thiéblin E, Haemmerlé O, Hernandez N, Trojahn C (2017) Un jeu de données d’évaluation de correspondances complexes entre ontologies
50.
Zurück zum Zitat Thiéblin É, Haemmerlé O, Hernandez N, Trojahn C (2018) Task-oriented complex ontology alignment: two alignment evaluation sets. In: European semantic web conference. Springer, pp 655–670 Thiéblin É, Haemmerlé O, Hernandez N, Trojahn C (2018) Task-oriented complex ontology alignment: two alignment evaluation sets. In: European semantic web conference. Springer, pp 655–670
51.
Zurück zum Zitat Vieira R, Revoredo K (2017) Using word semantics on entity names for correspondence set generation. In: OM@ ISWC, pp 223–224 Vieira R, Revoredo K (2017) Using word semantics on entity names for correspondence set generation. In: OM@ ISWC, pp 223–224
52.
Zurück zum Zitat Zhang Y, Wang X, Lai S, He S, Liu K, Zhao J, Lv X (2014) Ontology matching with word embeddings. In: Chinese computational linguistics and natural language processing based on naturally annotated big data. Springer, pp 34–45 Zhang Y, Wang X, Lai S, He S, Liu K, Zhao J, Lv X (2014) Ontology matching with word embeddings. In: Chinese computational linguistics and natural language processing based on naturally annotated big data. Springer, pp 34–45
Metadaten
Titel
Measuring Clusters of Labels in an Embedding Space to Refine Relations in Ontology Alignment
verfasst von
Molka Tounsi Dhouib
Catherine Faron
Andrea G. B. Tettamanzi
Publikationsdatum
16.10.2021
Verlag
Springer Berlin Heidelberg
Erschienen in
Journal on Data Semantics / Ausgabe 3-4/2021
Print ISSN: 1861-2032
Elektronische ISSN: 1861-2040
DOI
https://doi.org/10.1007/s13740-021-00137-8

Weitere Artikel der Ausgabe 3-4/2021

Journal on Data Semantics 3-4/2021 Zur Ausgabe

Premium Partner