Skip to main content

2014 | OriginalPaper | Buchkapitel

Adapting Gloss Vector Semantic Relatedness Measure for Semantic Similarity Estimation: An Evaluation in the Biomedical Domain

verfasst von : Ahmad Pesaranghader, Azadeh Rezaei, Ali Pesaranghader

Erschienen in: Semantic Technology

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automatic methods of ontology alignment are essential for establishing interoperability across web services. These methods are needed to measure semantic similarity between two ontologies’ entities to discover reliable correspondences. While existing similarity measures suffer from some difficulties, semantic relatedness measures tend to yield better results; even though they are not completely appropriate for the ‘equivalence’ relationship (e.g. “blood” and “bleeding” related but not similar). We attempt to adapt Gloss Vector relatedness measure for similarity estimation. Generally, Gloss Vector uses angles between entities’ gloss vectors for relatedness calculation. After employing Pearson’s chi-squared test for statistical elimination of insignificant features to optimize entities’ gloss vectors, by considering concepts’ taxonomy, we enrich them for better similarity measurement. Discussed measures get evaluated in the biomedical domain using MeSH, MEDLINE and dataset of 301 concept pairs. We conclude Adapted Gloss Vector similarity results are more correlated with human judgment of similarity compared to other measures.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Muthaiyah, S., Kerschberg, L.: A hybrid ontology mediation approach for the semantic web. Int. J. E-Bus. Res. 4, 79–91 (2008)CrossRef Muthaiyah, S., Kerschberg, L.: A hybrid ontology mediation approach for the semantic web. Int. J. E-Bus. Res. 4, 79–91 (2008)CrossRef
2.
Zurück zum Zitat Chen, B., Foster, G., Kuhn, R.: Bilingual sense similarity for statistical machine translation. In: Proceedings of the ACL, pp. 834–843 (2010) Chen, B., Foster, G., Kuhn, R.: Bilingual sense similarity for statistical machine translation. In: Proceedings of the ACL, pp. 834–843 (2010)
3.
Zurück zum Zitat Pesaranghader, A., Mustapha, N., Pesaranghader, A.: Applying semantic similarity measures to enhance topic-specific web crawling. In: Proceedings of the 13th International Conference on Intelligent Systems Design and Applications (ISDA’13), pp. 205–212 (2013) Pesaranghader, A., Mustapha, N., Pesaranghader, A.: Applying semantic similarity measures to enhance topic-specific web crawling. In: Proceedings of the 13th International Conference on Intelligent Systems Design and Applications (ISDA’13), pp. 205–212 (2013)
4.
Zurück zum Zitat Gruber, T.R.: Toward principles for the design of ontologies used for knowledge sharing. Int. J. Hum Comput Stud. 43, 907–928 (1995)CrossRef Gruber, T.R.: Toward principles for the design of ontologies used for knowledge sharing. Int. J. Hum Comput Stud. 43, 907–928 (1995)CrossRef
5.
Zurück zum Zitat Firth, J.R.: A synopsis of linguistic theory 1930–1955. In: Firth, J.R. (ed.) Studies in Linguistic Analysis, pp. 1–32. Blackwell, Oxford (1957) Firth, J.R.: A synopsis of linguistic theory 1930–1955. In: Firth, J.R. (ed.) Studies in Linguistic Analysis, pp. 1–32. Blackwell, Oxford (1957)
6.
Zurück zum Zitat Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice-cream cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, New York, USA, pp. 24–26 (1986) Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice-cream cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, New York, USA, pp. 24–26 (1986)
7.
Zurück zum Zitat Banerjee, S., Pedersen, T.: An adapted Lesk algorithm for word sense disambiguation using WordNet. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 136–145. Springer, Heidelberg (2002) Banerjee, S., Pedersen, T.: An adapted Lesk algorithm for word sense disambiguation using WordNet. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 136–145. Springer, Heidelberg (2002)
8.
Zurück zum Zitat Patwardhan, S., Pedersen, T: Using WordNet-based context vectors to estimate the semantic relatedness of concepts. In: Proceedings of the EACL 2006 Workshop (2006) Patwardhan, S., Pedersen, T: Using WordNet-based context vectors to estimate the semantic relatedness of concepts. In: Proceedings of the EACL 2006 Workshop (2006)
9.
Zurück zum Zitat Liu, Y., McInnes, B.T., Pedersen, T., Melton-Meaux, G., Pakhomov. S.: Semantic relatedness study using second order co-occurrence vectors computed from biomedical corpora, UMLS and WordNet. In: Proceedings of the 2nd ACM SIGHIT IHI (2012) Liu, Y., McInnes, B.T., Pedersen, T., Melton-Meaux, G., Pakhomov. S.: Semantic relatedness study using second order co-occurrence vectors computed from biomedical corpora, UMLS and WordNet. In: Proceedings of the 2nd ACM SIGHIT IHI (2012)
10.
Zurück zum Zitat Pesaranghader, A., Pesaranghader, A., Rezaei, A.: Applying latent semantic analysis to optimize second-order co-occurrence vectors for semantic relatedness measurement. In: Proceedings of the 1st International Conference on Mining Intelligence and Knowledge Exploration (MIKE’13), pp. 588–599 (2013) Pesaranghader, A., Pesaranghader, A., Rezaei, A.: Applying latent semantic analysis to optimize second-order co-occurrence vectors for semantic relatedness measurement. In: Proceedings of the 1st International Conference on Mining Intelligence and Knowledge Exploration (MIKE’13), pp. 588–599 (2013)
11.
Zurück zum Zitat Pesaranghader, A., Pesaranghader, A., Rezaei, A.: Augmenting concept definition in gloss vector semantic relatedness measure using Wikipedia articles. In: Proceedings of the 1st International Conference on Data Engineering (DeEng-2013), pp. 623–630 (2014) Pesaranghader, A., Pesaranghader, A., Rezaei, A.: Augmenting concept definition in gloss vector semantic relatedness measure using Wikipedia articles. In: Proceedings of the 1st International Conference on Data Engineering (DeEng-2013), pp. 623–630 (2014)
12.
Zurück zum Zitat Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and application of a metric on semantic nets. IEEE Trans. Syst. Man Cybern. 19, 17–30 (1989)CrossRef Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and application of a metric on semantic nets. IEEE Trans. Syst. Man Cybern. 19, 17–30 (1989)CrossRef
13.
Zurück zum Zitat Caviedes, J., Cimino, J.: Towards the development of a conceptual distance metric for the UMLS. J. Biomed. Inf. 372, 77–85 (2004)CrossRef Caviedes, J., Cimino, J.: Towards the development of a conceptual distance metric for the UMLS. J. Biomed. Inf. 372, 77–85 (2004)CrossRef
14.
Zurück zum Zitat Wu, Z., Palmer, M.: Verb semantics and lexical selections. In: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, (1994) Wu, Z., Palmer, M.: Verb semantics and lexical selections. In: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, (1994)
15.
Zurück zum Zitat Leacock, C., Chodorow, M.: Combining local context and WordNet similarity for word sense identification. In: Fellbaum, C. (ed.) WordNet: An Electronic Lexical Database, pp. 265–283. MIT press, Cambridge (1998) Leacock, C., Chodorow, M.: Combining local context and WordNet similarity for word sense identification. In: Fellbaum, C. (ed.) WordNet: An Electronic Lexical Database, pp. 265–283. MIT press, Cambridge (1998)
16.
Zurück zum Zitat Zhong, J., Zhu, H., Li, J., Yu, Y.: Conceptual graph matching for semantic search. In: Priss, U., Corbett, D.R., Angelova, G. (eds.) ICCS 2002. LNCS (LNAI), vol. 2393, pp. 92–106. Springer, Heidelberg (2002)CrossRef Zhong, J., Zhu, H., Li, J., Yu, Y.: Conceptual graph matching for semantic search. In: Priss, U., Corbett, D.R., Angelova, G. (eds.) ICCS 2002. LNCS (LNAI), vol. 2393, pp. 92–106. Springer, Heidelberg (2002)CrossRef
17.
Zurück zum Zitat Nguyen, H.A., Al-Mubaid, H.: New ontology-based semantic similarity measure for the biomedical domain. In: Proceedings of IEEE International Conference on Granular Computing GrC’06, pp. 623–628 (2006) Nguyen, H.A., Al-Mubaid, H.: New ontology-based semantic similarity measure for the biomedical domain. In: Proceedings of IEEE International Conference on Granular Computing GrC’06, pp. 623–628 (2006)
18.
Zurück zum Zitat Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence (1995) Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence (1995)
19.
Zurück zum Zitat Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. In: International Conference on Research in Computational Linguistics (1997) Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. In: International Conference on Research in Computational Linguistics (1997)
20.
Zurück zum Zitat Lin, D.: An Information-theoretic definition of similarity. In: 15th International Conference on Machine Learning, Madison, USA, (1998) Lin, D.: An Information-theoretic definition of similarity. In: 15th International Conference on Machine Learning, Madison, USA, (1998)
21.
Zurück zum Zitat Pesaranghader, A., Muthaiyah, S.: Definition-based information content vectors for semantic similarity measurement. In: Noah, S.A., Abdullah, A., Arshad, H., Abu Bakar, A., Othman, Z.A., Sahran, S., Omar, N., Othman, Z. (eds.) M-CAIT 2013. CCIS, vol. 378, pp. 268–282. Springer, Heidelberg (2013) Pesaranghader, A., Muthaiyah, S.: Definition-based information content vectors for semantic similarity measurement. In: Noah, S.A., Abdullah, A., Arshad, H., Abu Bakar, A., Othman, Z.A., Sahran, S., Omar, N., Othman, Z. (eds.) M-CAIT 2013. CCIS, vol. 378, pp. 268–282. Springer, Heidelberg (2013)
22.
Zurück zum Zitat Pakhomov, S., McInnes, B., Adam, T., Liu, Y., Pedersen, T., Melton, G.: Semantic similarity and relatedness between clinical terms: an experimental study. In: Proceedings of AMIA, pp. 572–576 (2010) Pakhomov, S., McInnes, B., Adam, T., Liu, Y., Pedersen, T., Melton, G.: Semantic similarity and relatedness between clinical terms: an experimental study. In: Proceedings of AMIA, pp. 572–576 (2010)
Metadaten
Titel
Adapting Gloss Vector Semantic Relatedness Measure for Semantic Similarity Estimation: An Evaluation in the Biomedical Domain
verfasst von
Ahmad Pesaranghader
Azadeh Rezaei
Ali Pesaranghader
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-06826-8_11

Neuer Inhalt