Published in: GeoInformatica 4/2014

1 October 2014

Publishing deep web geographic data

Authors: Helena Piccinini, Marco A. Casanova, Luiz André P. P. Leme, Antonio L. Furtado

Abstract

This article introduces a design process, called W-RayS, to describe Deep Web geographic data and to publish the descriptions both on the Web of Data and on the Surface Web. The article also outlines a toolkit that supports the process and discusses an experiment in which the toolkit was used to publish data stored in a large map server. Briefly, to describe geographic data in vector format, the designer should first specify views over the underlying geographic database that capture the basic characteristics of the geographic objects and their topological relationships represented in the vector data. The same idea is applied to raster data, but using a gazetteer or any other geographic database that covers the same area as the raster data. Then, the designer should map the view definitions to an RDF schema, following the Linked Data principles. The descriptions of the geographic data are therefore formalized as sets of RDF triples synthesized from the conventional data. To publish geographic data descriptions on the Web of Data, the designer may decide to materialize the RDF triples and store them in a repository or create a SPARQL endpoint to access the triples on demand. To publish geographic data descriptions on the Surface Web, W-RayS offers the designer tools to transform the RDF triples to natural language sentences, organized as static Web pages with embedded RDFa. The inclusion of RDFa preserves the structure of the data and allows more specific queries, processed by engines that analyze Web pages with RDFa.
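
The pipeline described in the abstract (view row, then RDF triples, then a natural language sentence with embedded RDFa) can be illustrated with a minimal sketch. This is not the W-RayS toolkit: the namespace `http://example.org/geo/`, the property names `name` and `locatedIn`, the view columns, and the sentence template are all hypothetical, chosen only to show the shape of each step.

```python
from html import escape

# Hypothetical namespace; the real W-RayS schema is not reproduced here.
EX = "http://example.org/geo/"


def row_to_triples(row):
    """Map one row of an assumed view (id, name, state) to RDF triples.

    Triples are plain (subject IRI, predicate IRI, literal) tuples.
    """
    subject = EX + "biome/" + str(row["id"])
    return [
        (subject, EX + "name", row["name"]),
        (subject, EX + "locatedIn", row["state"]),
    ]


def to_ntriples(triples):
    """Serialize triples with literal objects as N-Triples lines."""
    lines = []
    for s, p, o in triples:
        # Escape backslashes and double quotes inside the literal.
        literal = o.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'<{s}> <{p}> "{literal}" .')
    return "\n".join(lines)


def row_to_rdfa_sentence(row):
    """Render the same row as an English sentence annotated with RDFa.

    The sentence template is an assumption made for illustration.
    """
    subject = escape(EX + "biome/" + str(row["id"]))
    name_prop = escape(EX + "name")
    loc_prop = escape(EX + "locatedIn")
    name = escape(row["name"])
    state = escape(row["state"])
    return (
        f'<p about="{subject}">'
        f'The <span property="{name_prop}">{name}</span> biome '
        f'is located in <span property="{loc_prop}">{state}</span>.'
        "</p>"
    )


if __name__ == "__main__":
    example_row = {"id": 1, "name": "Cerrado", "state": "Goiás"}
    print(to_ntriples(row_to_triples(example_row)))
    print(row_to_rdfa_sentence(example_row))
```

The triples could be materialized in a repository or served on demand, while the annotated sentence would be placed in a static Web page; both outputs are derived from the same row, so the page text and its RDFa stay consistent with the published triples.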

Footnotes
3
A Part-of-Speech Tagger marks (“tags”) a word in a text (within a corpus) with its corresponding part of speech, based on its definition and its relationship with adjacent and related words or phrases in a clause or paragraph. Part-of-speech is a linguistic category of words or lexical items that is usually defined by the syntactic or morphologic behavior of the lexical item in question. Common linguistic categories include nouns and verbs.

5
The definition of the RDF schema is available at www.inf.puc-rio.br/~hpiccinini/wray/biome.owl

Metadata
Title
Publishing deep web geographic data
Authors
Helena Piccinini
Marco A. Casanova
Luiz André P. P. Leme
Antonio L. Furtado
Publication date
1 October 2014
Publisher
Springer US
Published in
GeoInformatica / Issue 4/2014
Print ISSN: 1384-6175
Electronic ISSN: 1573-7624
DOI
https://doi.org/10.1007/s10707-013-0201-3