Skip to main content

2020 | OriginalPaper | Buchkapitel

Wikipedia-Based Entity Linking for the Digital Library of Polish and Poland-Related News Pamphlets

verfasst von : Maciej Ogrodniczuk, Włodzimierz Gruszczyński

Erschienen in: Digital Libraries at Times of Massive Societal Transition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The paper presents a series of experiments related to enhancing the content of digital library items with links to relevant Wikipedia entries that could offer the reader additional background information. Two methods of gathering such links are investigated: a Wikifier-based solution and search in Wikipedia using its integrated engine. The results are additionally filtered using frequency information from a large corpus and additional rules.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Słownik geograficzny Królestwa Polskiego i innych krajów słowiańskich (Geographical Dictionary of the Kingdom of Poland), Warszawa (1880). (in Polish) Słownik geograficzny Królestwa Polskiego i innych krajów słowiańskich (Geographical Dictionary of the Kingdom of Poland), Warszawa (1880). (in Polish)
3.
Zurück zum Zitat Bunescu, R., Paşca, M.: Using encyclopedic knowledge for named entity disambiguation. In: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy. Association for Computational Linguistics (2006). https://www.aclweb.org/anthology/E06-1002 Bunescu, R., Paşca, M.: Using encyclopedic knowledge for named entity disambiguation. In: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy. Association for Computational Linguistics (2006). https://​www.​aclweb.​org/​anthology/​E06-1002
4.
Zurück zum Zitat Cucerzan, S.: Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic, pp. 708–716. Association for Computational Linguistics (2007). https://www.aclweb.org/anthology/D07-1074 Cucerzan, S.: Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic, pp. 708–716. Association for Computational Linguistics (2007). https://​www.​aclweb.​org/​anthology/​D07-1074
5.
Zurück zum Zitat Konopczyński, W.: Polski słownik biograficzny (Polish Biographical Dictionary). Polska Akademia Umiejętności (1935). (in Polish) Konopczyński, W.: Polski słownik biograficzny (Polish Biographical Dictionary). Polska Akademia Umiejętności (1935). (in Polish)
6.
Zurück zum Zitat Milne, D., Witten, I.H.: Learning to link with Wikipedia. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM 2008, New York, NY, USA, pp. 509–518. Association for Computing Machinery (2008). https://doi.org/10.1145/1458082.1458150 Milne, D., Witten, I.H.: Learning to link with Wikipedia. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM 2008, New York, NY, USA, pp. 509–518. Association for Computing Machinery (2008). https://​doi.​org/​10.​1145/​1458082.​1458150
7.
Zurück zum Zitat Moro, A., Raganato, A., Navigli, R.: Entity linking meets word sense disambiguation: a unified approach. Trans. Assoc. Comput. Linguist. (TACL) 2, 231–244 (2014)CrossRef Moro, A., Raganato, A., Navigli, R.: Entity linking meets word sense disambiguation: a unified approach. Trans. Assoc. Comput. Linguist. (TACL) 2, 231–244 (2014)CrossRef
9.
Zurück zum Zitat Ogrodniczuk, M., Gruszczyński, W.: Digital library of Poland-related old ephemeral prints: preserving multilingual cultural heritage. In: Proceedings of the Workshop on Language Technologies for Digital Humanities and Cultural Heritage, Hissar, Bulgaria, pp. 27–33 (2011). http://www.aclweb.org/anthology/W11-4105 Ogrodniczuk, M., Gruszczyński, W.: Digital library of Poland-related old ephemeral prints: preserving multilingual cultural heritage. In: Proceedings of the Workshop on Language Technologies for Digital Humanities and Cultural Heritage, Hissar, Bulgaria, pp. 27–33 (2011). http://​www.​aclweb.​org/​anthology/​W11-4105
10.
12.
Zurück zum Zitat Przepiórkowski, A., Bańko, M., Górski, R.L., Lewandowska-Tomaszczyk, B. (eds.): Narodowy Korpus Języka Polskiego (National Corpus of Polish). Wydawnictwo Naukowe PWN, Warsaw (2012). (in Polish) Przepiórkowski, A., Bańko, M., Górski, R.L., Lewandowska-Tomaszczyk, B. (eds.): Narodowy Korpus Języka Polskiego (National Corpus of Polish). Wydawnictwo Naukowe PWN, Warsaw (2012). (in Polish)
14.
Zurück zum Zitat Zawadzki, K.: Gazety ulotne polskie i Polski dotyczące z XVI, XVII i XVIII wieku (Polish and Poland-related Ephemeral Prints from the 16th-18th Centuries). National Ossoliński Institute, Polish Academy of Sciences, Wrocław (1990). (in Polish) Zawadzki, K.: Gazety ulotne polskie i Polski dotyczące z XVI, XVII i XVIII wieku (Polish and Poland-related Ephemeral Prints from the 16th-18th Centuries). National Ossoliński Institute, Polish Academy of Sciences, Wrocław (1990). (in Polish)
Metadaten
Titel
Wikipedia-Based Entity Linking for the Digital Library of Polish and Poland-Related News Pamphlets
verfasst von
Maciej Ogrodniczuk
Włodzimierz Gruszczyński
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-64452-9_7