2015 | OriginalPaper | Book chapter

Explorations of Cross-Disciplinary Term Similarity

Written by: Hanif Sheikhadbolkarim, Laurianne Sitbon

Published in: Information Retrieval Technology

Publisher: Springer International Publishing


Abstract

This paper presents initial explorations into how to compute term similarity across different domains or, in the present case, scientific disciplines. In particular, we explore polysemy across disciplines, where the same term can have different meanings in different disciplines. This can lead to confusion or erroneous query expansion if the domain is not properly identified. Typical bag-of-words systems are not equipped to highlight such differences, because each term has a single representation. Identifying the synonymy of terms across domains is also difficult for typical bag-of-words systems, because they rely on surrounding words, which usually also differ across domains. Yet discovering such similarities across domains can support tasks such as literature discovery. We propose an approach that integrates knowledge-based distances into a distributional semantics framework and demonstrate its efficiency on a hand-crafted dataset.
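The polysemy problem described in the abstract can be illustrated with a toy sketch. This is not the authors' method: the two corpora, the window size, and the helper names `context_vector` and `cosine` are hypothetical, chosen only to show how a term's co-occurrence vector can differ by discipline and how pooling the text collapses both senses into one representation. The knowledge-based distances mentioned in the abstract are not modelled here, since the abstract does not specify them.

```python
from collections import Counter
from math import sqrt


def context_vector(term, sentences, window=2):
    """Count the words that occur within +/- `window` tokens of `term`."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, token in enumerate(tokens):
            if token != term:
                continue
            lo, hi = max(0, i - window), i + window + 1
            # Collect the neighbours, skipping the target term itself.
            counts.update(
                t for j, t in enumerate(tokens[lo:hi], start=lo) if j != i
            )
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical toy corpora standing in for two disciplines.
biology = [
    "the cell membrane regulates transport",
    "each cell divides during mitosis",
]
telecom = [
    "the cell tower covers the area",
    "each cell serves mobile handsets",
]

# One vector per discipline keeps the two senses of "cell" apart.
bio_vec = context_vector("cell", biology)
tel_vec = context_vector("cell", telecom)

# A single pooled vector, as in a plain bag-of-words model, merges them.
pooled_vec = context_vector("cell", biology + telecom)

# The only shared context words here are function words ("the", "each").
cross_similarity = cosine(bio_vec, tel_vec)
print(f"similarity of 'cell' across disciplines: {cross_similarity:.3f}")
```

In this toy setting the cross-discipline similarity is low even though the surface term is identical, while the pooled vector is simply the sum of both discipline vectors and no longer distinguishes the senses. The reverse case, cross-disciplinary synonyms with disjoint contexts, would also score low under this measure, which is the gap the abstract says knowledge-based distances are meant to address.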


Metadata
Title
Explorations of Cross-Disciplinary Term Similarity
Written by
Hanif Sheikhadbolkarim
Laurianne Sitbon
Copyright year
2015
DOI
https://doi.org/10.1007/978-3-319-28940-3_34
