Skip to main content
Erschienen in: Cluster Computing 4/2017

09.08.2017

A two-phase data space partitioning for efficient skyline computation

verfasst von: Aziz Nasridinov, Jong-Hyeok Choi, Young-Ho Park

Erschienen in: Cluster Computing | Ausgabe 4/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The skyline has attracted a lot of attention due to its wide application in various fields. However, the skyline computation is a challenging issue as there is a high probability that today’s applications deal with large and high-dimensional data. As skyline computation for such huge amount of data consumes much time, parallel and distributed skyline computations are considered. State-of-the-art methods for parallel and distributed skyline computations use various data space partitioning techniques. However, these methods are not efficient, as in certain cases, these methods perform unnecessary skyline computations in a partitioned space, where local-skyline tuples do not contribute to the global-skyline. This may impose additional processing overload and enlarge the overall skyline computation time. In this paper, we propose a novel data space partitioning method for parallel and distributed skyline computation that consists of two-phases: diagonal and entropy score curve based partitioning. The proposed method produces a small set of local-skyline tuples and leads to a more sophisticated merging step. The experiment results demonstrate that the proposed method reduces the number of comparisons and processing time of skyline computation in large amount of data when compared with the existing state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhu, H., Zhu, P., Li, X., Liu, Q.: Top-k skyline groups queries. In: Proceedings of 20th International Conference on Extending Database Technology (EDBT), pp. 442–445 (2017) Zhu, H., Zhu, P., Li, X., Liu, Q.: Top-k skyline groups queries. In: Proceedings of 20th International Conference on Extending Database Technology (EDBT), pp. 442–445 (2017)
2.
Zurück zum Zitat Asudeh, A., Thirumuruganathan, S., Zhang, N., Das, G.: Discovering the skyline of web databases. Proc. VLDB Endow. 9(7), 600–611 (2016)CrossRef Asudeh, A., Thirumuruganathan, S., Zhang, N., Das, G.: Discovering the skyline of web databases. Proc. VLDB Endow. 9(7), 600–611 (2016)CrossRef
3.
Zurück zum Zitat Park, Y., Min, J.K., Shim, K.: Parallel computation of skyline and reverse skyline queries using MapReduce. Proc. VLDB Endow. 6(14), 2002–2013 (2013)CrossRef Park, Y., Min, J.K., Shim, K.: Parallel computation of skyline and reverse skyline queries using MapReduce. Proc. VLDB Endow. 6(14), 2002–2013 (2013)CrossRef
4.
Zurück zum Zitat Wu, P., Zhang, C., Feng, Y., Zhao, B.Y., Agrawal, D., Abbadi, A.E.: Parallelizing skyline queries for scalable distribution. In: Proceedings of the 10th international conference on Advances in Database Technology (EDBT), pp. 112–130 (2006) Wu, P., Zhang, C., Feng, Y., Zhao, B.Y., Agrawal, D., Abbadi, A.E.: Parallelizing skyline queries for scalable distribution. In: Proceedings of the 10th international conference on Advances in Database Technology (EDBT), pp. 112–130 (2006)
5.
Zurück zum Zitat Vlachou, A., Doulkeridis, C., Kotidis, Y.: Angle-based space partitioning for efficient parallel skyline computation. In: Proceedings of the International conference on Management of data (SIGMOD), pp. 227–238 (2008) Vlachou, A., Doulkeridis, C., Kotidis, Y.: Angle-based space partitioning for efficient parallel skyline computation. In: Proceedings of the International conference on Management of data (SIGMOD), pp. 227–238 (2008)
6.
Zurück zum Zitat Borzsonyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proceedings of the 17th International Conference on Data Engineering (ICDE), pp. 421–430 (2001) Borzsonyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proceedings of the 17th International Conference on Data Engineering (ICDE), pp. 421–430 (2001)
7.
Zurück zum Zitat Chomicki, J., Godfrey, P., Gryz, J., Liang, D.: Skyline with presorting. In: Proceedings of the 19th International Conference on Data Engineering (ICDE), pp. 717–719 (2003) Chomicki, J., Godfrey, P., Gryz, J., Liang, D.: Skyline with presorting. In: Proceedings of the 19th International Conference on Data Engineering (ICDE), pp. 717–719 (2003)
8.
Zurück zum Zitat Chomicki, J., Ciaccia, P., Meneghetti, N.: Skyline queries, front and back. ACM SIGMOD Record, vol. 42, no. 3 (2013) Chomicki, J., Ciaccia, P., Meneghetti, N.: Skyline queries, front and back. ACM SIGMOD Record, vol. 42, no. 3 (2013)
9.
Zurück zum Zitat Choi, J.H., Lee, Y.J., Shin, H.S., Nasridinov, A.: An efficient computation of skyline queries using hash tables. Adv. Sci. Lett. 22(9), 2348–2353 (2016)CrossRef Choi, J.H., Lee, Y.J., Shin, H.S., Nasridinov, A.: An efficient computation of skyline queries using hash tables. Adv. Sci. Lett. 22(9), 2348–2353 (2016)CrossRef
10.
Zurück zum Zitat Bartolini, I., Ciaccia, P., Patella, M.: Efficient sort-based skyline evaluation. ACM Trans. Database Syst. 33(4), 31–49 (2008)CrossRef Bartolini, I., Ciaccia, P., Patella, M.: Efficient sort-based skyline evaluation. ACM Trans. Database Syst. 33(4), 31–49 (2008)CrossRef
11.
Zurück zum Zitat Bartolini, I., Ciaccia, P., Patella, M.: SaLSa: computing the skyline without scanning the whole sky. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management (CIKM), pp. 405–414 (2006) Bartolini, I., Ciaccia, P., Patella, M.: SaLSa: computing the skyline without scanning the whole sky. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management (CIKM), pp. 405–414 (2006)
12.
Zurück zum Zitat Ihm, S.Y., Lee, K.E., Nasridinov, A., Heo, J.S., Park, Y.H.: Approximate convex skyline: a partitioned layer-based index for efficient processing top-k queries. Knowl.-Based Syst. 61, 13–28 (2014)CrossRef Ihm, S.Y., Lee, K.E., Nasridinov, A., Heo, J.S., Park, Y.H.: Approximate convex skyline: a partitioned layer-based index for efficient processing top-k queries. Knowl.-Based Syst. 61, 13–28 (2014)CrossRef
13.
Zurück zum Zitat Son, Y., Ihm, S.Y., Nasridinov, A., Park, Y.H.: Adaptive convex skyline: a threshold-based project partitioned layer-based index for efficient-processing top-k queries in entrepreneurship applications. J. Supercomput. 72(11), 4262–4275 (2016)CrossRef Son, Y., Ihm, S.Y., Nasridinov, A., Park, Y.H.: Adaptive convex skyline: a threshold-based project partitioned layer-based index for efficient-processing top-k queries in entrepreneurship applications. J. Supercomput. 72(11), 4262–4275 (2016)CrossRef
14.
Zurück zum Zitat Zhange, B., Zhou, S., Guan, J.: Adapting skyline computation to the MapReduce framework: algorithms and experiments. In: Proceeding of the 16th International Conference on Database Systems for Advanced Applications (DASFAA), pp. 403–414 (2011) Zhange, B., Zhou, S., Guan, J.: Adapting skyline computation to the MapReduce framework: algorithms and experiments. In: Proceeding of the 16th International Conference on Database Systems for Advanced Applications (DASFAA), pp. 403–414 (2011)
15.
Zurück zum Zitat Park, Y., Min, J.K., Shim, K.: Efficient processing of skyline queries using MapReduce. IEEE Trans. Knowl. Data Eng. 29(5), 1031–1044 (2017)CrossRef Park, Y., Min, J.K., Shim, K.: Efficient processing of skyline queries using MapReduce. IEEE Trans. Knowl. Data Eng. 29(5), 1031–1044 (2017)CrossRef
16.
Zurück zum Zitat Koh, J.L., Chen, C.C., Chan, C.Y., Chen, A.L.P.: MapReduce skyline query processing with partitioning and distributed dominance tests. Inf. Sci. 375, 114–137 (2017)CrossRef Koh, J.L., Chen, C.C., Chan, C.Y., Chen, A.L.P.: MapReduce skyline query processing with partitioning and distributed dominance tests. Inf. Sci. 375, 114–137 (2017)CrossRef
17.
Zurück zum Zitat Wu, J., Chen, L., Yu, Q., Kuang, L., Wang, Y., Wu, Z.: Selecting skyline services for QoS-aware composition by upgrading MapReduce paradigm. Cluster Computing 16(4), 693–706 (2013)CrossRef Wu, J., Chen, L., Yu, Q., Kuang, L., Wang, Y., Wu, Z.: Selecting skyline services for QoS-aware composition by upgrading MapReduce paradigm. Cluster Computing 16(4), 693–706 (2013)CrossRef
18.
Zurück zum Zitat Bogh, K.S., Chester, S., Assent, I.: SkyAlign: a portable, work-efficient skyline algorithm for multicore and GPU architectures. VLDB J. 25(6), 817–841 (2016)CrossRef Bogh, K.S., Chester, S., Assent, I.: SkyAlign: a portable, work-efficient skyline algorithm for multicore and GPU architectures. VLDB J. 25(6), 817–841 (2016)CrossRef
19.
Zurück zum Zitat Bogh, K.S., Chester, S., Assent, I.: Work-efficient parallel skyline computation for the GPU. Proc. VLDB Endow. 8(9), 962–973 (2015)CrossRef Bogh, K.S., Chester, S., Assent, I.: Work-efficient parallel skyline computation for the GPU. Proc. VLDB Endow. 8(9), 962–973 (2015)CrossRef
20.
Zurück zum Zitat Hayward, R., McDiarmid, C.: Average case analysis of heap building by repeated insertion. J. Algorithms 12(1), 126–153 (1991)CrossRefMATHMathSciNet Hayward, R., McDiarmid, C.: Average case analysis of heap building by repeated insertion. J. Algorithms 12(1), 126–153 (1991)CrossRefMATHMathSciNet
Metadaten
Titel
A two-phase data space partitioning for efficient skyline computation
verfasst von
Aziz Nasridinov
Jong-Hyeok Choi
Young-Ho Park
Publikationsdatum
09.08.2017
Verlag
Springer US
Erschienen in
Cluster Computing / Ausgabe 4/2017
Print ISSN: 1386-7857
Elektronische ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-017-1070-6

Weitere Artikel der Ausgabe 4/2017

Cluster Computing 4/2017 Zur Ausgabe