2015 | Original Paper | Book Chapter

Extraction of Historical Events from Wikipedia

Written by: Daniel Hienert, Francesco Luciano

Published in: The Semantic Web: ESWC 2012 Satellite Events

Publisher: Springer Berlin Heidelberg

Abstract

The DBpedia project extracts structured information from Wikipedia and makes it available on the web. It gathers this information mainly from infoboxes, which hold structured data about a Wikipedia article. Much information, however, appears only in the article body and is not yet included in DBpedia. In this paper we focus on extracting historical events from Wikipedia articles, which cover about 2,500 years and are available in different languages. We have extracted about 121,000 events with more than 325,000 links to DBpedia entities, and we provide access to this data via a Web API, a SPARQL endpoint, a Linked Data interface, and a timeline application.
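To make the idea of an event with links to DBpedia entities concrete, here is a minimal sketch in Python. It is not the authors' extraction pipeline. It assumes an event arrives as a single wikitext bullet line, and the function names `dbpedia_uri` and `parse_event`, the output dictionary layout, and the sample line are all hypothetical. The only conventions relied on are the standard `[[Target|Label]]` wiki link syntax and the usual DBpedia resource URI pattern (page title with spaces replaced by underscores).

```python
import re

# Matches [[Target]], [[Target|Label]] and [[Target#Section|Label]].
WIKILINK = re.compile(r"\[\[([^\]|#]+)(?:#[^\]|]*)?(?:\|([^\]]*))?\]\]")


def dbpedia_uri(title: str) -> str:
    """Map a Wikipedia page title to the conventional DBpedia resource URI."""
    return "http://dbpedia.org/resource/" + title.strip().replace(" ", "_")


def parse_event(year: int, line: str) -> dict:
    """Turn one wikitext bullet line into a plain-text event with entity links.

    Hypothetical helper: the input shape (one bullet per event) is an
    assumption made for illustration, not something stated in the abstract.
    """
    # Collect the link targets before the markup is stripped.
    entities = [dbpedia_uri(m.group(1)) for m in WIKILINK.finditer(line)]
    # Replace each link with its visible label, or its target if unlabeled.
    text = WIKILINK.sub(lambda m: m.group(2) or m.group(1), line)
    # Drop the leading bullet marker and surrounding whitespace.
    text = text.lstrip("* ").strip()
    return {"year": year, "text": text, "entities": entities}


if __name__ == "__main__":
    sample = "* [[June 15]] – [[John, King of England|King John]] seals [[Magna Carta]]."
    print(parse_event(1215, sample))
```

Each wiki link in the line yields one entity URI, which is how a single event can carry several links; this is consistent with the abstract reporting more links (about 325,000) than events (about 121,000).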

Metadata
Title
Extraction of Historical Events from Wikipedia
Written by
Daniel Hienert
Francesco Luciano
Copyright year
2015
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-46641-4_2