2009 | OriginalPaper | Chapter
Can RDB2RDF Tools Feasibily Expose Large Science Archives for Data Integration?
Authors: Alasdair J. G. Gray, Norman Gray, Iadh Ounis
Published in: The Semantic Web: Research and Applications
Publisher: Springer Berlin Heidelberg
Many science archive centres publish very large volumes of image, simulation, and experiment data. In order to integrate and analyse the available data, scientists need to be able to (i) identify and locate all the data relevant to their work; (ii) understand the multiple heterogeneous data models in which the data is published; and (iii) interpret and process the data they retrieve.
RDF has been shown to be a generally successful framework within which to perform such data integration work. It can be equally successful in the context of scientific data, if it is demonstrably practical to expose that data as RDF.
In this paper we investigate the capabilities of RDF to enable the integration of scientific data sources. Specifically, we discuss the suitability of SPARQL for expressing scientific queries, and the performance of several triple stores and RDB2RDF tools for executing queries over a moderately sized sample of a large astronomical data set. We found that further research into, and improvements to, SPARQL and RDB2RDF tools are required before they can efficiently expose existing science archives for data integration.