Skip to main content
Erschienen in: Knowledge and Information Systems 3/2019

23.04.2018 | Regular Paper

Dynamic skyline computation on massive data

verfasst von: Xixian Han, Bailing Wang, Guojun Lai

Erschienen in: Knowledge and Information Systems | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In many applications, dynamic skyline query is an important operation to find the interesting tuples in a potentially huge data space. Given the query point, dynamic skyline query returns tuples which are not dynamically dominated by other tuples. It is found that the existing algorithms cannot process dynamic skyline query on massive data efficiently. This paper proposes a novel dynamic-sorted-list-based DDS algorithm to efficiently compute dynamic skyline results on massive data. Given the query point, the dynamic sorted list of each attribute is not materialized but generated dynamically by the sorted list of the attribute. DDS retrieves the tuples in the involved dynamic sorted lists in the round-robin fashion until the early termination condition is satisfied, and computes the dynamic skyline results by retrieving the candidates. The pruning operation is devised to reduce the number of the retrieved candidates. The extensive experimental results, conducted on synthetic and real-life data sets, show that DDS outperforms the existing algorithms significantly.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bloom B (1970) Space/time trade-offs in hash coding with allowable errors. Commun ACM 13(7):422–426CrossRefMATH Bloom B (1970) Space/time trade-offs in hash coding with allowable errors. Commun ACM 13(7):422–426CrossRefMATH
2.
Zurück zum Zitat Börzsönyi S, Kossmann D, Stocker K (2001) The skyline operator. In: Proceedings of the 17th international conference on data engineering, pp 421–430 Börzsönyi S, Kossmann D, Stocker K (2001) The skyline operator. In: Proceedings of the 17th international conference on data engineering, pp 421–430
3.
Zurück zum Zitat Chen L, Lian X (2009) Efficient processing of metric skyline queries. IEEE Trans Knowl Data Eng 21(3):351–365CrossRef Chen L, Lian X (2009) Efficient processing of metric skyline queries. IEEE Trans Knowl Data Eng 21(3):351–365CrossRef
4.
Zurück zum Zitat Chomicki J, Godfrey P, Gryz J et al (2003) Skyline with presorting. In: Proceedings of the 19th international conference on data engineering, pp 717–719 Chomicki J, Godfrey P, Gryz J et al (2003) Skyline with presorting. In: Proceedings of the 19th international conference on data engineering, pp 717–719
5.
Zurück zum Zitat Dellis E, Seeger B (2007) Efficient computation of reverse skyline queries. In: Proceedings of the 33rd international conference on very large data bases, VLDB ’07, pp 291–302 Dellis E, Seeger B (2007) Efficient computation of reverse skyline queries. In: Proceedings of the 33rd international conference on very large data bases, VLDB ’07, pp 291–302
6.
Zurück zum Zitat Dellis E, Vlachou A, Vladimirskiy I et al (2006) Constrained subspace skyline computation. In: Proceedings of the 15th ACM international conference on information and knowledge management, CIKM ’06, pp 415–424 Dellis E, Vlachou A, Vladimirskiy I et al (2006) Constrained subspace skyline computation. In: Proceedings of the 15th ACM international conference on information and knowledge management, CIKM ’06, pp 415–424
7.
Zurück zum Zitat Deng K, Zhou X, Shen H (2007) Multi-source skyline query processing in road networks. In: Proceedings of the 23rd international conference on data engineering, ICDE 2007, pp 796–805 Deng K, Zhou X, Shen H (2007) Multi-source skyline query processing in road networks. In: Proceedings of the 23rd international conference on data engineering, ICDE 2007, pp 796–805
8.
Zurück zum Zitat Endres M, Kießling W (2014) High parallel skyline computation over low-cardinality domains. In: Proceedings of the 18th East European conference on advances in databases and information systems, pp 97–111 Endres M, Kießling W (2014) High parallel skyline computation over low-cardinality domains. In: Proceedings of the 18th East European conference on advances in databases and information systems, pp 97–111
9.
Zurück zum Zitat Endres M, Kießling W (2015) Parallel skyline computation exploiting the lattice structure. J Database Manag 26(4):18–43CrossRef Endres M, Kießling W (2015) Parallel skyline computation exploiting the lattice structure. J Database Manag 26(4):18–43CrossRef
10.
Zurück zum Zitat Endres M, Preisinger T (2017) Beyond skylines: explicit preferences. In: Proceedings of the 22nd international conference database systems for advanced applications, part I, pp 327–342 Endres M, Preisinger T (2017) Beyond skylines: explicit preferences. In: Proceedings of the 22nd international conference database systems for advanced applications, part I, pp 327–342
11.
Zurück zum Zitat Endres M, Roocks P, Kießling W (2015) Scalagon: an efficient skyline algorithm for all seasons. In: Proceedings of the 20th international conference database systems for advanced applications, part II, pp 292–308 Endres M, Roocks P, Kießling W (2015) Scalagon: an efficient skyline algorithm for all seasons. In: Proceedings of the 20th international conference database systems for advanced applications, part II, pp 292–308
12.
Zurück zum Zitat Fagin R, Lotem A, Naor M (2001) Optimal aggregation algorithms for middleware. In: Proceedings of the 20th ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, PODS ’01, pp 102–113 Fagin R, Lotem A, Naor M (2001) Optimal aggregation algorithms for middleware. In: Proceedings of the 20th ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, PODS ’01, pp 102–113
13.
Zurück zum Zitat Fuhry D, Jin R, Zhang D (2009) Efficient skyline computation in metric space. In: Proceedings of the 12th international conference on extending database technology: advances in database technology, EDBT ’09, pp 1042–1051 Fuhry D, Jin R, Zhang D (2009) Efficient skyline computation in metric space. In: Proceedings of the 12th international conference on extending database technology: advances in database technology, EDBT ’09, pp 1042–1051
14.
Zurück zum Zitat Godfrey P, Shipley R, Gryz J (2007) Algorithms and analyses for maximal vector computation. VLDB J 16(1):5–28CrossRef Godfrey P, Shipley R, Gryz J (2007) Algorithms and analyses for maximal vector computation. VLDB J 16(1):5–28CrossRef
15.
Zurück zum Zitat Han X, Li J, Gao H (2015) TDEP: efficiently processing top-k dominating query on massive data. Knowl Inf Syst 43(3):689–718CrossRef Han X, Li J, Gao H (2015) TDEP: efficiently processing top-k dominating query on massive data. Knowl Inf Syst 43(3):689–718CrossRef
16.
Zurück zum Zitat Han X, Li J, Gao H (2017) Efficient top-k dominating computation on massive data. IEEE Trans Knowl Data Eng 29(6):1199–1211CrossRef Han X, Li J, Gao H (2017) Efficient top-k dominating computation on massive data. IEEE Trans Knowl Data Eng 29(6):1199–1211CrossRef
17.
Zurück zum Zitat Han X, Li J, Yang D et al (2013) Efficient skyline computation on big data. IEEE Trans Knowl Data Eng 25(11):2521–2535CrossRef Han X, Li J, Yang D et al (2013) Efficient skyline computation on big data. IEEE Trans Knowl Data Eng 25(11):2521–2535CrossRef
18.
Zurück zum Zitat Han X, Liu X, Li J et al (2016) TKAP: efficiently processing top-k query on massive data by adaptive pruning. Knowl Inf Syst 47(2):301–328CrossRef Han X, Liu X, Li J et al (2016) TKAP: efficiently processing top-k query on massive data by adaptive pruning. Knowl Inf Syst 47(2):301–328CrossRef
19.
Zurück zum Zitat Islam M, Liu C (2016) Know your customer: computing k-most promising products for targeted marketing. VLDB J 25(4):545–570CrossRef Islam M, Liu C (2016) Know your customer: computing k-most promising products for targeted marketing. VLDB J 25(4):545–570CrossRef
20.
Zurück zum Zitat Kossmann D, Ramsak F, Rost S (2002) Shooting stars in the sky: an online algorithm for skyline queries. In: Proceedings of the 28th international conference on very large data bases, pp 275–286 Kossmann D, Ramsak F, Rost S (2002) Shooting stars in the sky: an online algorithm for skyline queries. In: Proceedings of the 28th international conference on very large data bases, pp 275–286
21.
Zurück zum Zitat Lee J, Hwang S (2014) Scalable skyline computation using a balanced pivot selection technique. Inf Syst 39:1–21CrossRef Lee J, Hwang S (2014) Scalable skyline computation using a balanced pivot selection technique. Inf Syst 39:1–21CrossRef
22.
Zurück zum Zitat Lee K, Lee W, Zheng B et al (2010) Z-sky: an efficient skyline query processing framework based on z-order. VLDB J 19(3):333–362CrossRef Lee K, Lee W, Zheng B et al (2010) Z-sky: an efficient skyline query processing framework based on z-order. VLDB J 19(3):333–362CrossRef
23.
Zurück zum Zitat Lu H, Jensen C (2012) Upgrading uncompetitive products economically. In: IEEE 28th international conference on data engineering, pp 977–988 Lu H, Jensen C (2012) Upgrading uncompetitive products economically. In: IEEE 28th international conference on data engineering, pp 977–988
24.
Zurück zum Zitat Morse M, Patel J, Jagadish H (2007) Efficient skyline computation over low-cardinality domains. In: Proceedings of the 33rd international conference on very large data bases, pp 267–278 Morse M, Patel J, Jagadish H (2007) Efficient skyline computation over low-cardinality domains. In: Proceedings of the 33rd international conference on very large data bases, pp 267–278
25.
Zurück zum Zitat Papadias D, Tao Y, Fu G et al (2005) Progressive skyline computation in database systems. ACM Trans Database Syst 30(1):41–82CrossRef Papadias D, Tao Y, Fu G et al (2005) Progressive skyline computation in database systems. ACM Trans Database Syst 30(1):41–82CrossRef
26.
Zurück zum Zitat Sacharidis D, Bouros P, Sellis T (2008) Caching dynamic skyline queries. In: Proceedings of the 20th international conference on scientific and statistical database management, SSDBM ’08, pp 455–472 Sacharidis D, Bouros P, Sellis T (2008) Caching dynamic skyline queries. In: Proceedings of the 20th international conference on scientific and statistical database management, SSDBM ’08, pp 455–472
27.
Zurück zum Zitat Sharifzadeh M, Shahabi C (2006) The spatial skyline queries. In: Proceedings of the 32nd international conference on very large data bases, VLDB ’06, pp 751–762 Sharifzadeh M, Shahabi C (2006) The spatial skyline queries. In: Proceedings of the 32nd international conference on very large data bases, VLDB ’06, pp 751–762
28.
Zurück zum Zitat Sheng C, Tao Y (2011) On finding skylines in external memory. In: Proceedings of the 30th ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, pp 107–116 Sheng C, Tao Y (2011) On finding skylines in external memory. In: Proceedings of the 30th ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, pp 107–116
29.
Zurück zum Zitat Tan K, Eng P, Ooi B (2001) Efficient progressive skyline computation. In: Proceedings of the 27th international conference on very large data bases, VLDB ’01, pp 301–310 Tan K, Eng P, Ooi B (2001) Efficient progressive skyline computation. In: Proceedings of the 27th international conference on very large data bases, VLDB ’01, pp 301–310
30.
Zurück zum Zitat Tao Y, Xiao X, Pei J (2007) Efficient skyline and top-k retrieval in subspaces. IEEE Trans Knowl Data Eng 19(8):1072–1088CrossRef Tao Y, Xiao X, Pei J (2007) Efficient skyline and top-k retrieval in subspaces. IEEE Trans Knowl Data Eng 19(8):1072–1088CrossRef
31.
Zurück zum Zitat Wang W, Wang E, Chen A (2011) Dynamic skylines considering range queries. In: Database systems for advanced applications: 16th international conference, DASFAA 2011, pp 235–250 Wang W, Wang E, Chen A (2011) Dynamic skylines considering range queries. In: Database systems for advanced applications: 16th international conference, DASFAA 2011, pp 235–250
32.
Zurück zum Zitat Wang W, Zhang J, Sun M et al (2017) Efficient parallel spatial skyline evaluation using mapreduce. In: Proceedings of the 20th international conference on extending database technology, pp 426–437 Wang W, Zhang J, Sun M et al (2017) Efficient parallel spatial skyline evaluation using mapreduce. In: Proceedings of the 20th international conference on extending database technology, pp 426–437
33.
Zurück zum Zitat Wang Z, Gao Y, Liu Q et al (2016) Efficient group-by reverse skyline computation. World Wide Web 19(6):1023–1049CrossRef Wang Z, Gao Y, Liu Q et al (2016) Efficient group-by reverse skyline computation. World Wide Web 19(6):1023–1049CrossRef
34.
Zurück zum Zitat Wu X, Tao Y, Wong R et al (2009) Finding the influence set through skylines. In: Proceedings of the 12th international conference on extending database technology: advances in database technology, EDBT ’09, pp 1030–1041 Wu X, Tao Y, Wong R et al (2009) Finding the influence set through skylines. In: Proceedings of the 12th international conference on extending database technology: advances in database technology, EDBT ’09, pp 1030–1041
35.
Zurück zum Zitat Zhang S, Mamoulis N, Cheung D (2009) Scalable skyline computation using object-based space partitioning. In: Proceedings of the 2009 ACM SIGMOD international conference on management of data, SIGMOD ’09, pp 483–494 Zhang S, Mamoulis N, Cheung D (2009) Scalable skyline computation using object-based space partitioning. In: Proceedings of the 2009 ACM SIGMOD international conference on management of data, SIGMOD ’09, pp 483–494
Metadaten
Titel
Dynamic skyline computation on massive data
verfasst von
Xixian Han
Bailing Wang
Guojun Lai
Publikationsdatum
23.04.2018
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 3/2019
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-018-1193-y

Weitere Artikel der Ausgabe 3/2019

Knowledge and Information Systems 3/2019 Zur Ausgabe