Skip to main content

2017 | OriginalPaper | Buchkapitel

Pre-processing and Indexing Techniques for Constellation Queries in Big Data

verfasst von : Amir Khatibi, Fabio Porto, Joao Guilherme Rittmeyer, Eduardo Ogasawara, Patrick Valduriez, Dennis Shasha

Erschienen in: Big Data Analytics and Knowledge Discovery

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Geometric patterns are defined by a spatial distribution of a set of objects. They can be found in many spatial datasets as in seismic, astronomy, and transportation. A particular interesting geometric pattern is exhibited by the Einstein cross, which is an astronomical phenomenon in which a single quasar is observed as four distinct sky objects when captured by earth telescopes. Finding such crosses, as well as other geometric patterns, collectively refered to as constellation queries, is a challenging problem as the potential number of sets of elements that compose shapes is exponentially large in the size of the dataset and the query pattern. In this paper we propose algorithms to optimize the computation of constellation queries. Our techniques involve pre-processing the query to reduce its dimensionality as well as indexing the data to fasten stars neighboring computation using a PH-tree. We have implemented our techniques in Spark and evaluated our techniques by a series of experiments. The PH-tree indexing showed very good results and guarantees query answer completeness.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Brilhante, I.R., de Macêdo, J.A.F., Nardini, F.M., Perego, R., Renso, C.: On planning sightseeing tours with tripbuilder. Inf. Process. Manage. 51(2), 1–15 (2015)CrossRef Brilhante, I.R., de Macêdo, J.A.F., Nardini, F.M., Perego, R., Renso, C.: On planning sightseeing tours with tripbuilder. Inf. Process. Manage. 51(2), 1–15 (2015)CrossRef
2.
Zurück zum Zitat Overbye, D.: Astronomers observe supernova and find they are watching reruns. New York Times, USA (2015) Overbye, D.: Astronomers observe supernova and find they are watching reruns. New York Times, USA (2015)
3.
Zurück zum Zitat Porto, F., Khatibi, A., Nobre, J.R., Ogasawara, E., Valduriez, P., Shasha, D.: Constellation queries over big data. eprint arXiv:1703.02638 - Bibliographic Code: 2017arXiv170302638P, March 2017 Porto, F., Khatibi, A., Nobre, J.R., Ogasawara, E., Valduriez, P., Shasha, D.: Constellation queries over big data. eprint arXiv:​1703.​02638 - Bibliographic Code: 2017arXiv170302638P, March 2017
4.
Zurück zum Zitat Papadias, N.M.D., Delis, V.: Algorithms for querying by spatial structure. In: Proceedings of the 24th VLDB Conference, pp. 546–557 (1998) Papadias, N.M.D., Delis, V.: Algorithms for querying by spatial structure. In: Proceedings of the 24th VLDB Conference, pp. 546–557 (1998)
5.
Zurück zum Zitat Brucato, A.M., Beltran, J.F., Meliou, A.: Scalable package queries in relational database systems. Proc. VLDB Endowment 9, 576–597 (2016)CrossRef Brucato, A.M., Beltran, J.F., Meliou, A.: Scalable package queries in relational database systems. Proc. VLDB Endowment 9, 576–597 (2016)CrossRef
6.
Zurück zum Zitat Zäschke, T., Zimmerli, C., Norrie, M.C.: The PH-tree: a space-efficient storage structure and multi-dimensional index. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 397–408 (2014) Zäschke, T., Zimmerli, C., Norrie, M.C.: The PH-tree: a space-efficient storage structure and multi-dimensional index. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 397–408 (2014)
Metadaten
Titel
Pre-processing and Indexing Techniques for Constellation Queries in Big Data
verfasst von
Amir Khatibi
Fabio Porto
Joao Guilherme Rittmeyer
Eduardo Ogasawara
Patrick Valduriez
Dennis Shasha
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-64283-3_12