2017 | OriginalPaper | Book chapter

Extracting Wikipedia Data to Enrich Spatial Information

Written by: Jörg Roth

Published in: Innovations for Community Services

Publisher: Springer International Publishing

Abstract

Freely available geo data allow developers to create new kinds of services related to the user's location. Even though current geo data sources offer high coverage and quality, they do not contain all the information that new services require, because they usually focus on object geometries and object types. Important information is often missing. For example, city entries mainly contain the city's name and border, but not the name of the mayor, the amount of taxes, the year of foundation, the number of districts, etc. These data are available in online encyclopedias such as Wikipedia, but there is no obvious way to relate the two sources. Our objective was thus to create an automatic import that takes Wikipedia articles describing geo objects and extracts all relevant data. To extract processable values, we identify property types such as dates, money values, powers, heights, and sizes. This makes it possible to use these data for further computation, e.g. to search for maxima, compute averages and sums, or formulate comparative conditions in queries.
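
The idea of typed property values can be illustrated with a minimal Python sketch. It is not the chapter's implementation: the patterns, the `TypedValue` class, the `parse_value` function, and the sample records ("City A", "City B") are all hypothetical and chosen only for illustration. It assumes property values arrive as short English-formatted strings and shows how normalising them to numbers makes maxima and averages computable.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class TypedValue:
    kind: str     # "year", "length" or "money"
    value: float  # normalised: year number, metres, or euros


# Hypothetical patterns for illustration; real article values are far messier.
_YEAR = re.compile(r"^\s*(\d{3,4})\s*$")
_LENGTH = re.compile(r"^\s*(\d[\d.,]*)\s*(km|m)\s*$")
_MONEY = re.compile(r"^\s*(\d[\d.,]*)\s*(million|billion)?\s*(?:€|EUR)\s*$")
_SCALE = {None: 1.0, "million": 1e6, "billion": 1e9}


def _number(text: str) -> float:
    # Assumes English formatting: "," groups thousands, "." is the decimal point.
    return float(text.replace(",", ""))


def parse_value(raw: str) -> Optional[TypedValue]:
    """Return a typed, normalised value, or None if no pattern applies."""
    try:
        if m := _YEAR.match(raw):
            return TypedValue("year", float(m.group(1)))
        if m := _LENGTH.match(raw):
            factor = 1000.0 if m.group(2) == "km" else 1.0
            return TypedValue("length", _number(m.group(1)) * factor)
        if m := _MONEY.match(raw):
            return TypedValue("money", _number(m.group(1)) * _SCALE[m.group(2)])
    except ValueError:
        pass  # digits were present but did not form a valid number
    return None


# Made-up sample records, not taken from any real article.
records = {
    "City A": {"founded": "1237", "elevation": "368 m", "budget": "2.5 million EUR"},
    "City B": {"founded": "985", "elevation": "0.5 km", "budget": "900,000 EUR"},
}

typed = {
    city: {key: parse_value(text) for key, text in props.items()}
    for city, props in records.items()
}

# Once values are numeric, comparisons and aggregates are straightforward.
highest = max(typed, key=lambda city: typed[city]["elevation"].value)
mean_elevation = sum(p["elevation"].value for p in typed.values()) / len(typed)
```

Normalising every length to metres and every amount to euros is one possible design choice; it keeps comparisons such as "elevation above 400 m" independent of how the value was written in the article.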

Metadata
Title
Extracting Wikipedia Data to Enrich Spatial Information
Written by
Jörg Roth
Copyright year
2017
DOI
https://doi.org/10.1007/978-3-319-60447-3_1