Skip to main content

2016 | OriginalPaper | Buchkapitel

Top-k Queries Over Uncertain Scores

verfasst von : Qing Liu, Debabrota Basu, Talel Abdessalem, Stéphane Bressan

Erschienen in: On the Move to Meaningful Internet Systems: OTM 2016 Conferences

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Modern recommendation systems leverage some forms of collaborative user or crowd sourced collection of information. For instance, services like TripAdvisor, Airbnb and HungyGoWhere rely on user-generated content to describe and classify hotels, vacation rentals and restaurants. By nature of such independent collection of information, the multiplicity, diversity and varying quality of the information collected result in uncertainty. Objects, such as the services offered by hotels, vacation rentals and restaurants, have uncertain scores for their various features.
In this context, ranking of uncertain data becomes a crucial issue. Several data models for uncertain data and several semantics for probabilistic top-k queries have been proposed in the literature. We consider here a model of objects with uncertain scores given as probability distributions and the semantics proposed by the state of the art reference work of Soliman, Hyas and Ben-David.
In this paper, we explore the design space of Metropolis-Hastings Markov chain Monte Carlo algorithms for answering probabilistic top-k queries over a database of objects with uncertain scores. We are able to devise several algorithms that yield better performance than the reference algorithm. We empirically and comparatively prove the effectiveness and efficiency of these new algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Amarilli, A., Amsterdamer, Y., Milo, T.: Uncertainty in crowd data sourcing under structural constraints. In: Han, W.-S., Lee, M.L., Muliantara, A., Sanjaya, N.A., Thalheim, B., Zhou, S. (eds.) DASFAA 2014. LNCS, vol. 8505, pp. 351–359. Springer, Heidelberg (2014). doi:10.1007/978-3-662-43984-5_27 Amarilli, A., Amsterdamer, Y., Milo, T.: Uncertainty in crowd data sourcing under structural constraints. In: Han, W.-S., Lee, M.L., Muliantara, A., Sanjaya, N.A., Thalheim, B., Zhou, S. (eds.) DASFAA 2014. LNCS, vol. 8505, pp. 351–359. Springer, Heidelberg (2014). doi:10.​1007/​978-3-662-43984-5_​27
2.
Zurück zum Zitat Cormode, G., Li, F., Yi, K.: Semantics of ranking queries for probabilistic data and expected ranks. In: ICDE, pp. 305–316 (2009) Cormode, G., Li, F., Yi, K.: Semantics of ranking queries for probabilistic data and expected ranks. In: ICDE, pp. 305–316 (2009)
3.
Zurück zum Zitat Davidson, S.B., Khanna, S., Milo, T., Roy, S.: Using the crowd for top-k and group-by queries. In ICDT, pp. 225–236 (2013) Davidson, S.B., Khanna, S., Milo, T., Roy, S.: Using the crowd for top-k and group-by queries. In ICDT, pp. 225–236 (2013)
4.
Zurück zum Zitat Ge, T., Zdonik, S., Madden, S.: Top-k queries on uncertain data: on score distribution and typical answers. In: SIGMOD, pp. 375–388. ACM (2009) Ge, T., Zdonik, S., Madden, S.: Top-k queries on uncertain data: on score distribution and typical answers. In: SIGMOD, pp. 375–388. ACM (2009)
5.
Zurück zum Zitat Hua, M., Pei, J., Zhang, W., Lin, X.: Ranking queries on uncertain data: a probabilistic threshold approach. In: SIGMOD, pp. 673–686. ACM (2008) Hua, M., Pei, J., Zhang, W., Lin, X.: Ranking queries on uncertain data: a probabilistic threshold approach. In: SIGMOD, pp. 673–686. ACM (2008)
6.
Zurück zum Zitat Jestes, J., Cormode, G., Li, F., Yi, K.: Semantics of ranking queries for probabilistic data. TKDE 23(12), 1903–1917 (2011) Jestes, J., Cormode, G., Li, F., Yi, K.: Semantics of ranking queries for probabilistic data. TKDE 23(12), 1903–1917 (2011)
7.
Zurück zum Zitat Li, J., Deshpande, A.: Ranking continuous probabilistic datasets. VLDB 3(1–2), 638–649 (2010) Li, J., Deshpande, A.: Ranking continuous probabilistic datasets. VLDB 3(1–2), 638–649 (2010)
8.
Zurück zum Zitat Li, J., Saha, B., Deshpande, A.: A unified approach to ranking in probabilistic databases. VLDB 2(1), 502–513 (2009) Li, J., Saha, B., Deshpande, A.: A unified approach to ranking in probabilistic databases. VLDB 2(1), 502–513 (2009)
9.
Zurück zum Zitat Newman, M.E., Barkema, G.T., Newman, M.: Monte Carlo Methods in Statistical Physics, vol. 13. Clarendon Press, Oxford (1999)MATH Newman, M.E., Barkema, G.T., Newman, M.: Monte Carlo Methods in Statistical Physics, vol. 13. Clarendon Press, Oxford (1999)MATH
10.
Zurück zum Zitat O’Leary, D.P.: Multidimensional integration: partition and conquer. Comput. Sci. Eng. 6(6), 58–66 (2004)CrossRef O’Leary, D.P.: Multidimensional integration: partition and conquer. Comput. Sci. Eng. 6(6), 58–66 (2004)CrossRef
11.
Zurück zum Zitat Re, C., Dalvi, N., Suciu, D.: Efficient top-k query evaluation on probabilistic data. In: ICDE, pp. 886–895 (2007) Re, C., Dalvi, N., Suciu, D.: Efficient top-k query evaluation on probabilistic data. In: ICDE, pp. 886–895 (2007)
12.
Zurück zum Zitat Soliman, M.A., Ilyas, I.F.: Ranking with uncertain scores. In: ICDE, pp. 317–328 (2009) Soliman, M.A., Ilyas, I.F.: Ranking with uncertain scores. In: ICDE, pp. 317–328 (2009)
13.
Zurück zum Zitat Soliman, M.A., Ilyas, I.F., Ben-David, S.: Supporting ranking queries on uncertain and incomplete data. VLDB J. 19(4), 477–501 (2010)CrossRef Soliman, M.A., Ilyas, I.F., Ben-David, S.: Supporting ranking queries on uncertain and incomplete data. VLDB J. 19(4), 477–501 (2010)CrossRef
14.
Zurück zum Zitat Soliman, M.A., Ilyas, I.F., Chang, KC.-C.: Top-k query processing in uncertain databases. In: ICDE, pp. 896–905 (2007) Soliman, M.A., Ilyas, I.F., Chang, KC.-C.: Top-k query processing in uncertain databases. In: ICDE, pp. 896–905 (2007)
15.
Zurück zum Zitat Wang, C., Yuan, L.Y., You, J.-H., Zaiane, O.R., Pei, J.: On pruning for top-k ranking in uncertain databases. VLDB 4(10), 598–609 (2011) Wang, C., Yuan, L.Y., You, J.-H., Zaiane, O.R., Pei, J.: On pruning for top-k ranking in uncertain databases. VLDB 4(10), 598–609 (2011)
16.
Zurück zum Zitat Yi, K., Li, F., Kollios, G., Srivastava, D.: Efficient processing of top-k queries in uncertain databases with x-relations. TKDE 20(12), 1669–1682 (2008) Yi, K., Li, F., Kollios, G., Srivastava, D.: Efficient processing of top-k queries in uncertain databases with x-relations. TKDE 20(12), 1669–1682 (2008)
17.
Zurück zum Zitat Zhang, X., Li, G., Feng, J.: Crowdsourced top-k algorithms: an experimental evaluation. VLDB 9(8), 612–623 (2016) Zhang, X., Li, G., Feng, J.: Crowdsourced top-k algorithms: an experimental evaluation. VLDB 9(8), 612–623 (2016)
Metadaten
Titel
Top-k Queries Over Uncertain Scores
verfasst von
Qing Liu
Debabrota Basu
Talel Abdessalem
Stéphane Bressan
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-48472-3_14