ABSTRACT
An ever-increasing number of real-life applications produce spatiotemporal data that record the position of moving objects (persons, cars, vessels, aircrafts, etc.). In order to provide integrated views with other relevant data sources (e.g., weather, vessel databases, etc.), this data is represented in RDF and stored in knowledge bases with the following notable features: (a) the data is dynamic, since new spatio-temporal data objects are recorded every second, and (b) the size of the data is vast and can easily lead to scalability issues. As a result, this raises the need for efficient management of large-scale, dynamic, spatio-temporal RDF data. In this paper, we propose boosting the performance of spatio-temporal RDF queries by compressing the spatio-temporal information of each RDF entity into a unique integer value. We exploit this encoding in a filter-and-refine framework for processing of spatio-temporal RDF data efficiently. By means of an extensive evaluation on real-life data sets, we demonstrate the merits of our framework.
- Ibrahim Abdelaziz, Razen Harbi, Semih Salihoglu, Panos Kalnis, and Nikos Mamoulis. 2015. SPARTex: A Vertex-Centric Framework for RDF Data Analytics. PVLDB 8, 12 (2015), 1880--1891. Google ScholarDigital Library
- Christophe Claramunt, Cyril Ray, Elena Camossi, Anne-Laure Jousselme, Melita Hadzagic, Gennady L. Andrienko, Natalia V. Andrienko, Yannis Theodoridis, George A. Vouros, and Loïc Salmon. 2017. Maritime data integration and analysis: recent progress and research challenges. In Proceedings of the 20th International Conference on Extending Database Technology, EDBT 2017, Venice, Italy, March 21-24, 2017. 192--197.Google Scholar
- Sairam Gurajada, Stephan Seufert, Iris Miliaraki, and Martin Theobald. 2014. TriAD: A distributed shared-nothing RDF engine based on asynchronous message passing. In Proceedings of SIGMOD. 289--300. Google ScholarDigital Library
- Mohammad Hammoud, Dania Abed Rabbou, Reza Nouri, Seyed-Mehdi-Reza Beheshti, and Sherif Sakr. 2015. DREAM: Distributed RDF Engine with Adaptive Query Planner and Minimal Communication. PVLDB 8, 6 (2015), 654--665. Google ScholarDigital Library
- Razen Harbi, Ibrahim Abdelaziz, Panos Kalnis, and Nikos Mamoulis. 2015. Evaluating SPARQL Queries on Massive RDF Datasets. PVLDB 8, 12 (2015), 1848--1851. Google ScholarDigital Library
- Jiewen Huang, Daniel J. Abadi, and Kun Ren. 2011. Scalable SPARQL Querying of Large RDF Graphs. PVLDB 4, 11 (2011), 1123--1134.Google ScholarDigital Library
- Christian S. Jensen, Dan Lin, and Beng Chin Ooi. 2004. Query and Update Efficient B+-Tree Based Indexing of Moving Objects. In (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, Toronto, Canada, August 31 - September 3 2004. 768--779. Google ScholarDigital Library
- Christian S. Jensen, Dalia Tiesyte, and Nerius Tradisauskas. 2006. Robust B+-Tree-Based Indexing of Moving Objects. In 7th International Conference on Mobile Data Management (MDM 2006), Nara, Japan, May 9--13, 2006. 12. Google ScholarDigital Library
- Ibrahim Kamel and Christos Faloutsos. 1994. Hilbert R-tree: An Improved R-tree using Fractals. In VLDB'94, Proceedings of 20th International Conference on Very Large Data Bases, September 12--15, 1994, Santiago de Chile, Chile. 500--509. Google ScholarDigital Library
- Manolis Koubarakis and Kostis Kyzirakos. 2010. Modeling and Querying Metadata in the Semantic Sensor Web: The Model stRDF and the Query Language stSPARQL. In Proceedings of ESWC. 425--439. Google ScholarDigital Library
- Kostis Kyzirakos, Manos Karpathiotakis, Konstantina Bereta, George Garbis, Charalampos Nikolaou, Panayiotis Smeros, Stella Giannakopoulou, Kallirroi Dogani, and Manolis Koubarakis. 2013. The Spatiotemporal RDF Store Strabon. In Proceedings of SSTD. 496--500.Google ScholarCross Ref
- John Liagouris, Nikos Mamoulis, Panagiotis Bouros, and Manolis Terrovitis. 2014. An Effective Encoding Scheme for Spatial RDF Data. PVLDB 7 (2014), 1271--1282. Google ScholarDigital Library
- Bongki Moon, H. V. Jagadish, Christos Faloutsos, and Joel H. Saltz. 2001. Analysis of the Clustering Properties of the Hilbert Space-Filling Curve. IEEE Trans. Knowl. Data Eng. 13, 1 (2001), 124--141. Google ScholarDigital Library
- Thomas Neumann and Gerhard Weikum. 2010. The RDF-3X engine for scalable management of RDF data. VLDB J. 19, 1 (2010), 91--113. Google ScholarDigital Library
- Peng Peng, Lei Zou, M. Tamer Özsu, Lei Chen, and Dongyan Zhao. 2016. Processing SPARQL queries over distributed RDF graphs. VLDB J. 25, 2 (2016), 243--268. Google ScholarDigital Library
- Matthew Perry, Prateek Jain, and Amit P. Sheth. 2011. SPARQL-ST: Extending SPARQL to Support Spatiotemporal Queries. In Geospatial Semantics and the Semantic Web. 61--86.Google Scholar
- Dong Wang, Lei Zou, Yansong Feng, Xuchuan Shen, Jilei Tian, and Dongyan Zhao. 2013. S-store: An Engine for Large RDF Graph Integrating Spatial Information. In Database Systems for Advanced Applications, 18th International Conference, DASFAA 2013, Wuhan, China, April 22--25, 2013. Proceedings, Part II. 31--47.Google Scholar
- Dong Wang, Lei Zou, and Dongyan Zhao. 2014. g<sup>st</sup>-Store: An Engine for Large RDF Graph Integrating Spatio-temporal Information. In Proceedings of EDBT. 652--655.Google Scholar
- Cathrin Weiss, Panagiotis Karras, and Abraham Bernstein. 2008. Hexastore: sextuple indexing for semantic web data management. PVLDB 1, 1 (2008), 1008--1019. Google ScholarDigital Library
- Kai Zeng, Jiacheng Yang, Haixun Wang, Bin Shao, and Zhongyuan Wang. 2013. A Distributed Graph Engine for Web Scale RDF Data. PVLDB 6, 4 (2013), 265--276. Google ScholarDigital Library
- Xiaofei Zhang, Lei Chen, Yongxin Tong, and Min Wang. 2013. EAGRE: Towards scalable I/O efficient SPARQL query evaluation on the cloud. In Proceedings of ICDE. 565--576. Google ScholarDigital Library
Index Terms
- Efficient spatio-temporal RDF query processing in large dynamic knowledge bases
Recommendations
Dynamic and fast processing of queries on large-scale RDF data
As RDF data continue to gain popularity, we witness the fast growing trend of RDF datasets in both the number of RDF repositories and the size of RDF datasets. Many known RDF datasets contain billions of RDF triples (subject, predicate and object). One ...
Scalable join processing on very large RDF graphs
SIGMOD '09: Proceedings of the 2009 ACM SIGMOD International Conference on Management of dataWith the proliferation of the RDF data format, engines for RDF query processing are faced with very large graphs that contain hundreds of millions of RDF triples. This paper addresses the resulting scalability problems. Recent prior work along these ...
Towards distributed processing of RDF path queries
A technical infrastructure for storing, querying and managing RDF data is a key element in the current semantic web development. Systems like Jena, Sesame or the ICS-FORTH RDF Suite are widely used for building semantic web applications. Currently, none ...
Comments