Skip to main content

2017 | OriginalPaper | Buchkapitel

Trading Off Popularity for Diversity in the Results Sets of Keyword Queries on Linked Data

verfasst von : Ananya Dass, Dimitri Theodoratos

Erschienen in: Web Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Keyword search is the most popular technique for querying the ever growing repositories of RDF graph data on the Web. However, keyword queries are ambiguous. As a consequence, they typically produce on linked data a huge number of candidate results corresponding to a plethora of alternative query interpretations. Current approaches ignore the diversity of the result interpretations and might fail to satisfy the users who are looking for less popular results. In this paper, we propose a novel approach for keyword search result diversification on RDF graphs. Our approach instead of diversifying the query results per se, diversifies the interpretations of the query (i.e., pattern graphs). We model the problem as an optimization problem aiming at selecting k pattern graphs which maximize an objective function balancing relevance and diversity. We devise metrics to assess the relevance and diversity of a set of pattern graphs, and we design a greedy heuristic algorithm to generate a relevant and diverse list of k pattern graphs for a given keyword query. The experimental results show the effectiveness of our approach and proposed metrics and also the efficiency of our algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: WSDM, pp. 5–14. ACM (2009) Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: WSDM, pp. 5–14. ACM (2009)
2.
Zurück zum Zitat Aksoy, C., Dass, A., Theodoratos, D., Wu, X.: Diversification of keyword query result patterns. In: Cui, B., Zhang, N., Xu, J., Lian, X., Liu, D. (eds.) WAIM 2016. LNCS, vol. 9659, pp. 171–183. Springer, Cham (2016). doi:10.1007/978-3-319-39958-4_14 Aksoy, C., Dass, A., Theodoratos, D., Wu, X.: Diversification of keyword query result patterns. In: Cui, B., Zhang, N., Xu, J., Lian, X., Liu, D. (eds.) WAIM 2016. LNCS, vol. 9659, pp. 171–183. Springer, Cham (2016). doi:10.​1007/​978-3-319-39958-4_​14
3.
Zurück zum Zitat Bikakis, N., Giannopoulos, G., Liagouris, J., Skoutas, D., Dalamagas, T., Sellis, T.: RDivF: diversifying keyword search on RDF graphs. In: Aalberg, T., Papatheodorou, C., Dobreva, M., Tsakonas, G., Farrugia, C.J. (eds.) TPDL 2013. LNCS, vol. 8092, pp. 413–416. Springer, Heidelberg (2013). doi:10.1007/978-3-642-40501-3_49 CrossRef Bikakis, N., Giannopoulos, G., Liagouris, J., Skoutas, D., Dalamagas, T., Sellis, T.: RDivF: diversifying keyword search on RDF graphs. In: Aalberg, T., Papatheodorou, C., Dobreva, M., Tsakonas, G., Farrugia, C.J. (eds.) TPDL 2013. LNCS, vol. 8092, pp. 413–416. Springer, Heidelberg (2013). doi:10.​1007/​978-3-642-40501-3_​49 CrossRef
4.
Zurück zum Zitat Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR, pp. 335–336 (1998) Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR, pp. 335–336 (1998)
5.
Zurück zum Zitat Carterette, B.: An analysis of NP-completeness in novelty and diversity ranking. Inf. Retrieval 14(1), 89–106 (2011)CrossRef Carterette, B.: An analysis of NP-completeness in novelty and diversity ranking. Inf. Retrieval 14(1), 89–106 (2011)CrossRef
6.
Zurück zum Zitat Chen, H., Karger, D.R.: Less is more: probabilistic models for retrieving fewer relevant documents. In: SIGIR, pp. 429–436. ACM (2006) Chen, H., Karger, D.R.: Less is more: probabilistic models for retrieving fewer relevant documents. In: SIGIR, pp. 429–436. ACM (2006)
7.
Zurück zum Zitat Dass, A., Aksoy, C., Dimitriou, A., Theodoratos, D.: Exploiting semantic result clustering to support keyword search on linked data. In: Benatallah, B., Bestavros, A., Manolopoulos, Y., Vakali, A., Zhang, Y. (eds.) WISE 2014. LNCS, vol. 8786, pp. 448–463. Springer, Cham (2014). doi:10.1007/978-3-319-11749-2_34 Dass, A., Aksoy, C., Dimitriou, A., Theodoratos, D.: Exploiting semantic result clustering to support keyword search on linked data. In: Benatallah, B., Bestavros, A., Manolopoulos, Y., Vakali, A., Zhang, Y. (eds.) WISE 2014. LNCS, vol. 8786, pp. 448–463. Springer, Cham (2014). doi:10.​1007/​978-3-319-11749-2_​34
8.
Zurück zum Zitat Dass, A., Aksoy, C., Dimitriou, A., Theodoratos, D.: Keyword pattern graph relaxation for selective result space expansion on linked data. In: Cimiano, P., Frasincar, F., Houben, G.-J., Schwabe, D. (eds.) ICWE 2015. LNCS, vol. 9114, pp. 287–306. Springer, Cham (2015). doi:10.1007/978-3-319-19890-3_19 CrossRef Dass, A., Aksoy, C., Dimitriou, A., Theodoratos, D.: Keyword pattern graph relaxation for selective result space expansion on linked data. In: Cimiano, P., Frasincar, F., Houben, G.-J., Schwabe, D. (eds.) ICWE 2015. LNCS, vol. 9114, pp. 287–306. Springer, Cham (2015). doi:10.​1007/​978-3-319-19890-3_​19 CrossRef
9.
Zurück zum Zitat Dass, A., Aksoy, C., Dimitriou, A., Theodoratos, D., Wu, X.: Diversifying the results of keyword queries on linked data. In: Cellary, W., Mokbel, M.F., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds.) WISE 2016. LNCS, vol. 10041, pp. 199–207. Springer, Cham (2016). doi:10.1007/978-3-319-48740-3_14 CrossRef Dass, A., Aksoy, C., Dimitriou, A., Theodoratos, D., Wu, X.: Diversifying the results of keyword queries on linked data. In: Cellary, W., Mokbel, M.F., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds.) WISE 2016. LNCS, vol. 10041, pp. 199–207. Springer, Cham (2016). doi:10.​1007/​978-3-319-48740-3_​14 CrossRef
10.
Zurück zum Zitat Dass, A., Dimitriou, A., Aksoy, C., Theodoratos, D.: Incorporating Cohesiveness into keyword search on linked data. In: Wang, J., Cellary, W., Wang, D., Wang, H., Chen, S.-C., Li, T., Zhang, Y. (eds.) WISE 2015. LNCS, vol. 9419, pp. 47–62. Springer, Cham (2015). doi:10.1007/978-3-319-26187-4_4 CrossRef Dass, A., Dimitriou, A., Aksoy, C., Theodoratos, D.: Incorporating Cohesiveness into keyword search on linked data. In: Wang, J., Cellary, W., Wang, D., Wang, H., Chen, S.-C., Li, T., Zhang, Y. (eds.) WISE 2015. LNCS, vol. 9419, pp. 47–62. Springer, Cham (2015). doi:10.​1007/​978-3-319-26187-4_​4 CrossRef
11.
Zurück zum Zitat Demidova, E., Fankhauser, P., Zhou, X., Nejdl, W.: DivQ: diversification for keyword search over structured databases. In: SIGIR, pp. 331–338. ACM (2010) Demidova, E., Fankhauser, P., Zhou, X., Nejdl, W.: DivQ: diversification for keyword search over structured databases. In: SIGIR, pp. 331–338. ACM (2010)
12.
Zurück zum Zitat Drosou, M., Pitoura, E.: Search result diversification. ACM SIGMOD Rec. 39(1), 41–47 (2010)CrossRef Drosou, M., Pitoura, E.: Search result diversification. ACM SIGMOD Rec. 39(1), 41–47 (2010)CrossRef
13.
Zurück zum Zitat Elbassuoni, S., Ramanath, M., Schenkel, R., Weikum, G.: Searching RDF graphs with SPARQL and keywords. IEEE Data Eng. Bull. 33(1), 16–24 (2010) Elbassuoni, S., Ramanath, M., Schenkel, R., Weikum, G.: Searching RDF graphs with SPARQL and keywords. IEEE Data Eng. Bull. 33(1), 16–24 (2010)
14.
Zurück zum Zitat Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: WWW, pp. 381–390. ACM (2009) Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: WWW, pp. 381–390. ACM (2009)
15.
Zurück zum Zitat Hasan, M., Mueen, A., Tsotras, V., Keogh, E.: Diversifying query results on semi-structured data. In: CIKM, pp. 2099–2103. ACM (2012) Hasan, M., Mueen, A., Tsotras, V., Keogh, E.: Diversifying query results on semi-structured data. In: CIKM, pp. 2099–2103. ACM (2012)
16.
Zurück zum Zitat Li, G., et al.: Ease: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: SIGMOD, pp. 903–914 (2008) Li, G., et al.: Ease: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: SIGMOD, pp. 903–914 (2008)
17.
Zurück zum Zitat Li, J., Liu, C., Yu, J.X.: Context-based diversification for keyword queries over XML data. Proc. KDE 27(3), 660–672 (2015) Li, J., Liu, C., Yu, J.X.: Context-based diversification for keyword queries over XML data. Proc. KDE 27(3), 660–672 (2015)
18.
Zurück zum Zitat Radlinski, F., Dumais, S.: Improving personalized web search using result diversification. In: SIGIR, pp. 691–692. ACM (2006) Radlinski, F., Dumais, S.: Improving personalized web search using result diversification. In: SIGIR, pp. 691–692. ACM (2006)
19.
Zurück zum Zitat Ruotsalo, T., Frosterus, M.: Semantic entity search diversification. In: ICSC, pp. 32–39 (2013) Ruotsalo, T., Frosterus, M.: Semantic entity search diversification. In: ICSC, pp. 32–39 (2013)
20.
Zurück zum Zitat Tran, T., Wang, H., Rudolph, S., Cimiano, P.: Top-k exploration of query candidates for efficient keyword search on graph-shaped (RDF) data. In: ICDE (2009) Tran, T., Wang, H., Rudolph, S., Cimiano, P.: Top-k exploration of query candidates for efficient keyword search on graph-shaped (RDF) data. In: ICDE (2009)
21.
Zurück zum Zitat Zhang, M., Hurley, N.: Avoiding monotony: improving the diversity of recommendation lists. In Recommender Systems, pp. 123–130 (2008) Zhang, M., Hurley, N.: Avoiding monotony: improving the diversity of recommendation lists. In Recommender Systems, pp. 123–130 (2008)
22.
Zurück zum Zitat Ziegler, C.-N., McNee, S.M., Konstan, J.A., Lausen, G.: Improving recommendation lists through topic diversification. In: WWW, pp. 22–32. ACM (2005) Ziegler, C.-N., McNee, S.M., Konstan, J.A., Lausen, G.: Improving recommendation lists through topic diversification. In: WWW, pp. 22–32. ACM (2005)
Metadaten
Titel
Trading Off Popularity for Diversity in the Results Sets of Keyword Queries on Linked Data
verfasst von
Ananya Dass
Dimitri Theodoratos
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-60131-1_9

Premium Partner