Skip to main content
Top
Published in: GeoInformatica 4/2014

01-10-2014

Publishing deep web geographic data

Authors: Helena Piccinini, Marco A. Casanova, Luiz André P. P. Leme, Antonio L. Furtado

Published in: GeoInformatica | Issue 4/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This article introduces a design process, called W-RayS, to describe Deep Web geographic data and to publish the descriptions both on the Web of Data and on the Surface Web. The article also outlines a toolkit that supports the process and discusses an experiment in which the toolkit was used to publish data stored in a large map server. Briefly, to describe geographic data in vector format, the designer should first specify views over the underlying geographic database that capture the basic characteristics of the geographic objects and their topological relationships represented in the vector data. The same idea is applied to raster data, but using a gazetteer or any other geographic database that covers the same area as the raster data. Then, the designer should map the view definitions to an RDF schema, following the Linked Data principles. The descriptions of the geographic data are therefore formalized as sets of RDF triples synthesized from the conventional data. To publish geographic data descriptions on the Web of Data, the designer may decide to materialize the RDF triples and store them in a repository or create a SPARQL endpoint to access the triples on demand. To publish geographic data descriptions on the Surface Web, W-RayS offers the designer tools to transform the RDF triples to natural language sentences, organized as static Web pages with embedded RDFa. The inclusion of RDFa preserves the structure of the data and allows more specific queries, processed by engines that analyze Web pages with RDFa.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
3
A Part-of-Speech Tagger marks (“tags”) a word in a text (within a corpus) with its corresponding part of speech, based on its definition and its relationship with adjacent and related words or phrases in a clause or paragraph. Part-of-speech is a linguistic category of words or lexical items that is usually defined by the syntactic or morphologic behavior of the lexical item in question. Common linguistic categories include nouns and verbs.
 
5
The definition of the RDF schema is available at www.​inf.​puc-rio.​br/​~hpiccinini/​wray/​biome.​owl
 
Literature
2.
go back to reference Madhavan J, Afanasiev L, Antova L, Halevy A (2009) Harnessing the Deep Web: Present and Future (Vol. cs.DB). Presented at the Fourth Biennial Conference on Innovative Data Systems Research Madhavan J, Afanasiev L, Antova L, Halevy A (2009) Harnessing the Deep Web: Present and Future (Vol. cs.DB). Presented at the Fourth Biennial Conference on Innovative Data Systems Research
4.
5.
go back to reference Martins B, Silva MJ, Chaves M (2007). O sistema CaGE no HAREM-reconhecimento de entidades geográficas em textos em língua portuguesa. Linguateca Martins B, Silva MJ, Chaves M (2007). O sistema CaGE no HAREM-reconhecimento de entidades geográficas em textos em língua portuguesa. Linguateca
6.
go back to reference Szekely P, Knoblock CA, Gupta S, Taheriyan M, Wu B (2011) Exploiting semantics of web services for geospatial data fusion (pp. 32–39). Presented at the Proceedings of the 1st ACM SIGSPATIAL International Workshop on Spatial Semantics and Ontologies, ACM Press. doi:10.1145/2068976.2068981 Szekely P, Knoblock CA, Gupta S, Taheriyan M, Wu B (2011) Exploiting semantics of web services for geospatial data fusion (pp. 32–39). Presented at the Proceedings of the 1st ACM SIGSPATIAL International Workshop on Spatial Semantics and Ontologies, ACM Press. doi:10.​1145/​2068976.​2068981
8.
go back to reference Madhavan J, Ko D, Kot L, Ganapathy V, Rasmussen A, Halevy A (2008) Google’s Deep Web crawl (Vol. 1, pp. 1241–1252). Presented at the Proceedings of the VLDB Endowment, VLDB Endowment Madhavan J, Ko D, Kot L, Ganapathy V, Rasmussen A, Halevy A (2008) Google’s Deep Web crawl (Vol. 1, pp. 1241–1252). Presented at the Proceedings of the VLDB Endowment, VLDB Endowment
9.
go back to reference Maiti A, Dasgupta A, Zhang N, Das G (2009) HDSampler: revealing data behind web form interfaces (pp. 1131–1134). Presented at the Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, ACM. doi:10.1145/1559845.1560001 Maiti A, Dasgupta A, Zhang N, Das G (2009) HDSampler: revealing data behind web form interfaces (pp. 1131–1134). Presented at the Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, ACM. doi:10.​1145/​1559845.​1560001
12.
go back to reference Cafarella MJ, Halevy A, Khoussainova N (2009) Data integration for the relational web. Proceedings of the VLDB Endowment (PVLDB) 2(1):1090–1101CrossRef Cafarella MJ, Halevy A, Khoussainova N (2009) Data integration for the relational web. Proceedings of the VLDB Endowment (PVLDB) 2(1):1090–1101CrossRef
13.
go back to reference He B, Zhang Z, Chang K. C.-C (2005) MetaQuerier: querying structured web sources on-the-fly (pp. 927–929). Presented at the Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, ACM Request Permissions. doi:10.1145/1066157.1066291 He B, Zhang Z, Chang K. C.-C (2005) MetaQuerier: querying structured web sources on-the-fly (pp. 927–929). Presented at the Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, ACM Request Permissions. doi:10.​1145/​1066157.​1066291
14.
go back to reference He H, Meng W, Yu C, Wu Z (2005) WISE-Integrator: a system for extracting and integrating complex web search interfaces of the deep web (pp. 1314–1317). Presented at the Proceedings of the 31st International Conference on Very Large Data Bases He H, Meng W, Yu C, Wu Z (2005) WISE-Integrator: a system for extracting and integrating complex web search interfaces of the deep web (pp. 1314–1317). Presented at the Proceedings of the 31st International Conference on Very Large Data Bases
15.
go back to reference Kabisch T, Dragut E. C, Yu C, Leser U (2010) Deep web integration with VisQI (Vol. 3, pp. 1613–1616). Presented at the Proceedings of the VLDB Endowment (PVLDB) Kabisch T, Dragut E. C, Yu C, Leser U (2010) Deep web integration with VisQI (Vol. 3, pp. 1613–1616). Presented at the Proceedings of the VLDB Endowment (PVLDB)
16.
go back to reference Rajaraman A (2009) Kosmix: high-performance topic exploration using the deep web (Vol. 2, pp. 1524–1529). Presented at the Proceedings of the VLDB Endowment (PVLDB) Rajaraman A (2009) Kosmix: high-performance topic exploration using the deep web (Vol. 2, pp. 1524–1529). Presented at the Proceedings of the VLDB Endowment (PVLDB)
21.
go back to reference Zheng Z (2002) AnswerBus question answering system (pp. 399–404). Presented at the Proceedings of the 2nd International Conference on Human Language Technology Research Zheng Z (2002) AnswerBus question answering system (pp. 399–404). Presented at the Proceedings of the 2nd International Conference on Human Language Technology Research
22.
go back to reference Nguyen TH, Nguyen H, Freire J (2010) PruSM: a prudent schema matching approach for web forms (pp. 1385–1388). Presented at the Proceedings of the 19th ACM international conference on Information and knowledge management, ACM Request Permissions. doi:10.1145/1871437.1871627 Nguyen TH, Nguyen H, Freire J (2010) PruSM: a prudent schema matching approach for web forms (pp. 1385–1388). Presented at the Proceedings of the 19th ACM international conference on Information and knowledge management, ACM Request Permissions. doi:10.​1145/​1871437.​1871627
25.
go back to reference Hewlett D, Kalyanpur A, Kolovski V, Halaschek-Wiener C (2005) Effective NL paraphrasing of ontologies on the Semantic Web. Presented at the Proceedings of the Workshop on End-User Semantic Web Interaction of the 4th International Semantic Web Conference Hewlett D, Kalyanpur A, Kolovski V, Halaschek-Wiener C (2005) Effective NL paraphrasing of ontologies on the Semantic Web. Presented at the Proceedings of the Workshop on End-User Semantic Web Interaction of the 4th International Semantic Web Conference
27.
go back to reference Hollink L, Schreiber G, Wielemaker J, Wielinga B (2003) Semantic annotation of image collections. Proceedings of the Workshop on Knowledge Markup and Semantic Annotation of the Second International Conference on Knowledge Capture Hollink L, Schreiber G, Wielemaker J, Wielinga B (2003) Semantic annotation of image collections. Proceedings of the Workshop on Knowledge Markup and Semantic Annotation of the Second International Conference on Knowledge Capture
31.
go back to reference Auer S, Dietzold S, Lehmann J, Hellmann S, Aumueller D (2009) Triplify: light-weight linked data publication from relational databases (pp. 621–630). Presented at the Proceedings of the 18th International Conference on World Wide Web, ACM. doi:10.1145/1526709.1526793 Auer S, Dietzold S, Lehmann J, Hellmann S, Aumueller D (2009) Triplify: light-weight linked data publication from relational databases (pp. 621–630). Presented at the Proceedings of the 18th International Conference on World Wide Web, ACM. doi:10.​1145/​1526709.​1526793
32.
go back to reference Bizer C, Seaborne A (2004) D2RQ - Treating Non-RDF Databases as Virtual RDF Graphs. Presented at the Proceedings of the 3rd International Semantic Web Conference Bizer C, Seaborne A (2004) D2RQ - Treating Non-RDF Databases as Virtual RDF Graphs. Presented at the Proceedings of the 3rd International Semantic Web Conference
33.
go back to reference Cullot N, Ghawi R, Yétongnon K (2007) DB2OWL: A Tool for Automatic Database-to-Ontology Mapping. (pp. 491–494). Presented at the Proceedings of the 15th Italian Symposium on Advanced Database Systems Cullot N, Ghawi R, Yétongnon K (2007) DB2OWL: A Tool for Automatic Database-to-Ontology Mapping. (pp. 491–494). Presented at the Proceedings of the 15th Italian Symposium on Advanced Database Systems
34.
go back to reference Cerbah F (2008) Learning Highly Structured Semantic Repositories from Relational Databases (Vol. 5021, pp. 777–781). Presented at the Proceedings of the 5th European Semantic Web Conference. doi:10.1007/978-3-540-68234-9_57 Cerbah F (2008) Learning Highly Structured Semantic Repositories from Relational Databases (Vol. 5021, pp. 777–781). Presented at the Proceedings of the 5th European Semantic Web Conference. doi:10.​1007/​978-3-540-68234-9_​57
35.
go back to reference Knoblock C, Szekely P, Ambite J, Goel A, Gupta S, Lerman K, Muslea M, Taheriyan M, Mallick P (2012) Semi-automatically mapping structured sources into the semantic web. (pp. 375–390). Presented at the Proceedings of the 9th International Conference on the Semantic Web: Research and Applications, Springer-Verlag. doi:10.1007/978-3-642-30284-8_32 Knoblock C, Szekely P, Ambite J, Goel A, Gupta S, Lerman K, Muslea M, Taheriyan M, Mallick P (2012) Semi-automatically mapping structured sources into the semantic web. (pp. 375–390). Presented at the Proceedings of the 9th International Conference on the Semantic Web: Research and Applications, Springer-Verlag. doi:10.​1007/​978-3-642-30284-8_​32
36.
go back to reference Bizer C, Heath T, Berners-Lee T (2009) Linked Data - The Story So Far. IGI Global. International Journal on Semantic Web and Information Systems, 5(3) Bizer C, Heath T, Berners-Lee T (2009) Linked Data - The Story So Far. IGI Global. International Journal on Semantic Web and Information Systems, 5(3)
44.
go back to reference Figueredo LAGA, Masello J (2005) SIDRA - Aggregate Database – Definition and Loading. Diretoria de Informática, IBGE, Rio de Janeiro, Brazil Figueredo LAGA, Masello J (2005) SIDRA - Aggregate Database – Definition and Loading. Diretoria de Informática, IBGE, Rio de Janeiro, Brazil
45.
go back to reference Piccinini H, Lemos M, Casanova MA, Furtado AL (2010) W-Ray: A Strategy to Publish Deep Web Geographic Data (Vol. 6413, pp. 2–11). Presented at the Proceedings of the Workshop on Semantic and Conceptual Issues in GIS of the 29th International Conference on Conceptual Modeling. doi:10.1007/978-3-642-16385-2_2 Piccinini H, Lemos M, Casanova MA, Furtado AL (2010) W-Ray: A Strategy to Publish Deep Web Geographic Data (Vol. 6413, pp. 2–11). Presented at the Proceedings of the Workshop on Semantic and Conceptual Issues in GIS of the 29th International Conference on Conceptual Modeling. doi:10.​1007/​978-3-642-16385-2_​2
Metadata
Title
Publishing deep web geographic data
Authors
Helena Piccinini
Marco A. Casanova
Luiz André P. P. Leme
Antonio L. Furtado
Publication date
01-10-2014
Publisher
Springer US
Published in
GeoInformatica / Issue 4/2014
Print ISSN: 1384-6175
Electronic ISSN: 1573-7624
DOI
https://doi.org/10.1007/s10707-013-0201-3

Other articles of this Issue 4/2014

GeoInformatica 4/2014 Go to the issue