Skip to main content

2016 | OriginalPaper | Buchkapitel

Benchmarking Semantic Capabilities of Analogy Querying Algorithms

verfasst von : Christoph Lofi, Athiq Ahamed, Pratima Kulkarni, Ravi Thakkar

Erschienen in: Database Systems for Advanced Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Enabling semantically rich query paradigms is one of the core challenges of current information systems research. In this context, due to their importance and ubiquity in natural language, analogy queries are of particular interest. Current developments in natural language processing and machine learning resulted in some very promising algorithms relying on deep learning neural word embeddings which might contribute to finally realizing analogy queries. However, it is still quite unclear how well these algorithms work from a semantic point of view. One of the problems is that there is no clear consensus on the intended semantics of analogy queries. Furthermore, there are no suitable benchmark dataset available respecting the semantic properties of real-life analogies. Therefore, in this, paper, we discuss the challenges of benchmarking the semantics of analogy query algorithms with a special focus on neural embeddings. We also introduce the AGS analogy benchmark dataset which rectifies many weaknesses of established datasets. Finally, our experiments evaluating state-of-the-art algorithms underline the need for further research in this promising field.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lofi, C.: Analogy queries in information systems – a new challenge. J. Inf. Knowl. Manage. 12, 1350021 (2013)CrossRef Lofi, C.: Analogy queries in information systems – a new challenge. J. Inf. Knowl. Manage. 12, 1350021 (2013)CrossRef
2.
Zurück zum Zitat Hofstadter, D.R.: Analogy as the core of cognition. In: The Analogical Mind, pp. 499–538 (2001) Hofstadter, D.R.: Analogy as the core of cognition. In: The Analogical Mind, pp. 499–538 (2001)
3.
Zurück zum Zitat Gentner, D.: Why we’re so smart. In: Language in Mind: Advances in the Study of Language and Thought, pp. 195–235. MIT Press (2003) Gentner, D.: Why we’re so smart. In: Language in Mind: Advances in the Study of Language and Thought, pp. 195–235. MIT Press (2003)
4.
Zurück zum Zitat Mikolov, T., Yih, W., Zweig, G.: Linguistic regularities in continuous space word representations. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language (NAACL-HLT), Atlanta, USA (2013) Mikolov, T., Yih, W., Zweig, G.: Linguistic regularities in continuous space word representations. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language (NAACL-HLT), Atlanta, USA (2013)
5.
Zurück zum Zitat Gentner, D., Holyoak, K.J., Kokinov, B.N. (eds.): The Analogical Mind: Perspectives from Cognitive Science. MIT Press, Cambridge (2001) Gentner, D., Holyoak, K.J., Kokinov, B.N. (eds.): The Analogical Mind: Perspectives from Cognitive Science. MIT Press, Cambridge (2001)
6.
Zurück zum Zitat Itkonen, E.: Analogy as Structure and Process: Approaches in Linguistics, Cognitive Psychology and Philosophy of Science. John Benjamins Pub. Co., Amsterdam (2005)CrossRef Itkonen, E.: Analogy as Structure and Process: Approaches in Linguistics, Cognitive Psychology and Philosophy of Science. John Benjamins Pub. Co., Amsterdam (2005)CrossRef
7.
Zurück zum Zitat Shelley, C.: Multiple Analogies in Science and Philosophy. John Benjamins Pub., Amsterdam (2003)CrossRef Shelley, C.: Multiple Analogies in Science and Philosophy. John Benjamins Pub., Amsterdam (2003)CrossRef
8.
Zurück zum Zitat Kant, I.: Critique of Judgement. Hackett, Indianapolis (1790) Kant, I.: Critique of Judgement. Hackett, Indianapolis (1790)
9.
10.
Zurück zum Zitat Gentner, D.: Structure-mapping: a theoretical framework for analogy. Cogn. Sci. 7, 155–170 (1983)CrossRef Gentner, D.: Structure-mapping: a theoretical framework for analogy. Cogn. Sci. 7, 155–170 (1983)CrossRef
11.
Zurück zum Zitat Gentner, D., Gunn, V.: Structural alignment facilitates the noticing of differences. Mem. Cogn. 29, 565–577 (2001)CrossRef Gentner, D., Gunn, V.: Structural alignment facilitates the noticing of differences. Mem. Cogn. 29, 565–577 (2001)CrossRef
12.
Zurück zum Zitat Lofi, C., Nieke, C.: Modeling analogies for human-centered information systems. In: Jatowt, A., et al. (eds.) SocInfo 2013. LNCS, vol. 8238, pp. 1–15. Springer, Heidelberg (2013)CrossRef Lofi, C., Nieke, C.: Modeling analogies for human-centered information systems. In: Jatowt, A., et al. (eds.) SocInfo 2013. LNCS, vol. 8238, pp. 1–15. Springer, Heidelberg (2013)CrossRef
13.
Zurück zum Zitat Blythe, J., Veloso, M.: Analogical replay for efficient conditional planning. In: National Conference on Artificial Intelligence (AAAI), Providence, Rhode Island, USA (1997) Blythe, J., Veloso, M.: Analogical replay for efficient conditional planning. In: National Conference on Artificial Intelligence (AAAI), Providence, Rhode Island, USA (1997)
14.
Zurück zum Zitat Leake, D.: Case-Based Reasoning: Experiences, Lessons, and Future Directions. MIT Press, Cambridge (1996) Leake, D.: Case-Based Reasoning: Experiences, Lessons, and Future Directions. MIT Press, Cambridge (1996)
15.
Zurück zum Zitat Forbus, K.D., Mostek, T., Ferguson, R.: Analogy ontology for integrating analogical processing and first-principles reasoning. In: National Conference on Artificial Intelligence (AAAI), Edmonton, Alberta, Canada (2002) Forbus, K.D., Mostek, T., Ferguson, R.: Analogy ontology for integrating analogical processing and first-principles reasoning. In: National Conference on Artificial Intelligence (AAAI), Edmonton, Alberta, Canada (2002)
16.
Zurück zum Zitat Bollegala, D.T., Matsuo, Y., Ishizuka, M.: Measuring the similarity between implicit semantic relations from the web. In: International Conference on World Wide Web (WWW), Madrid, Spain (2009) Bollegala, D.T., Matsuo, Y., Ishizuka, M.: Measuring the similarity between implicit semantic relations from the web. In: International Conference on World Wide Web (WWW), Madrid, Spain (2009)
17.
Zurück zum Zitat Davidov, D.: Unsupervised discovery of generic relationships using pattern clusters and its evaluation by automatically generated SAT analogy questions. In: Association for Computational Linguistics: Human Language Technologies (ACL:HLT), Columbus, Ohio, USA (2008) Davidov, D.: Unsupervised discovery of generic relationships using pattern clusters and its evaluation by automatically generated SAT analogy questions. In: Association for Computational Linguistics: Human Language Technologies (ACL:HLT), Columbus, Ohio, USA (2008)
18.
19.
Zurück zum Zitat Lofi, C.: Measuring semantic similarity and relatedness with distributional and knowledge-based approaches. Database Soc. Jpn. J. 14, 1–9 (2016) Lofi, C.: Measuring semantic similarity and relatedness with distributional and knowledge-based approaches. Database Soc. Jpn. J. 14, 1–9 (2016)
20.
Zurück zum Zitat Ștefănescu, D., Banjade, R., Rus, V.: Latent semantic analysis models on Wikipedia and TASA. In: Language Resources Evaluation Conference (LREC), Reykjavik, Island (2014) Ștefănescu, D., Banjade, R., Rus, V.: Latent semantic analysis models on Wikipedia and TASA. In: Language Resources Evaluation Conference (LREC), Reykjavik, Island (2014)
21.
Zurück zum Zitat Mnih, A., Hinton, G.E.: A scalable hierarchical distributed language model. Adv. Neural Inf. Process. Syst. 21, 1081–1088 (2009) Mnih, A., Hinton, G.E.: A scalable hierarchical distributed language model. Adv. Neural Inf. Process. Syst. 21, 1081–1088 (2009)
22.
Zurück zum Zitat Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Conference on Empirical Methods on Natural Language Processing (EMNLP), Doha, Qatar (2014) Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Conference on Empirical Methods on Natural Language Processing (EMNLP), Doha, Qatar (2014)
23.
Zurück zum Zitat Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)MATH Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)MATH
25.
Zurück zum Zitat Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013)
26.
Zurück zum Zitat Gao, B., Bian, J., Liu, T.-Y.: WordRep: a benchmark for research on learning word representations. In: ICML Workshop on Knowledge-Powered Deep Learning for Text Mining, Beijing, China (2014) Gao, B., Bian, J., Liu, T.-Y.: WordRep: a benchmark for research on learning word representations. In: ICML Workshop on Knowledge-Powered Deep Learning for Text Mining, Beijing, China (2014)
27.
Zurück zum Zitat Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan, Z., Wolfman, G., Ruppin, E.: Placing search in context: the concept revisited. In: International Conference on World Wide Web (WWW), Hong Kong, China (2001) Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan, Z., Wolfman, G., Ruppin, E.: Placing search in context: the concept revisited. In: International Conference on World Wide Web (WWW), Hong Kong, China (2001)
28.
Zurück zum Zitat Hill, F., Reichart, R., Korhonen, A.: SimLex-999: evaluating semantic models with (genuine) similarity estimation. Prepr. Publ. arXiv. arXiv:1408.3456 2014 Hill, F., Reichart, R., Korhonen, A.: SimLex-999: evaluating semantic models with (genuine) similarity estimation. Prepr. Publ. arXiv. arXiv:​1408.​3456 2014
29.
Zurück zum Zitat Lofi, C., Selke, J., Balke, W.-T.: Information extraction meets crowdsourcing: a promising couple. Datenbank-Spektrum. 12, 109–120 (2012)CrossRef Lofi, C., Selke, J., Balke, W.-T.: Information extraction meets crowdsourcing: a promising couple. Datenbank-Spektrum. 12, 109–120 (2012)CrossRef
Metadaten
Titel
Benchmarking Semantic Capabilities of Analogy Querying Algorithms
verfasst von
Christoph Lofi
Athiq Ahamed
Pratima Kulkarni
Ravi Thakkar
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-32025-0_29

Premium Partner