Skip to main content

2016 | OriginalPaper | Buchkapitel

Diversification of Keyword Query Result Patterns

verfasst von : Cem Aksoy, Ananya Dass, Dimitri Theodoratos, Xiaoying Wu

Erschienen in: Web-Age Information Management

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Keyword search allows the users to search for information on tree data without making use of a complex query language and without knowing the schema of the data sources. However, keyword queries are usually ambiguous in expressing the user intent. Most of the current keyword search approaches either filter or use a scoring function to rank the candidate result set. These techniques do not differentiate the results and might return to the user a result set which is not the intended. To address this problem, we introduce in this paper an original approach for diversification of keyword search results on tree data which aims at returning a subset of the candidate result set trading off relevance for diversity. We formally define the problem of diversification of patterns of keyword search results on tree data as an optimization problem. We introduce relevance and diversity measures on result pattern sets. We design a greedy heuristic algorithm that chooses top-k most relevant and diverse result patterns for a given keyword query. Our experimental results show that the introduced relevance and diversity measures can be used effectively and that our algorithm can efficiently compute a set of result patterns for keyword queries which is both relevant and diverse.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: WSDM, pp. 5–14 (2009) Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: WSDM, pp. 5–14 (2009)
2.
Zurück zum Zitat Aksoy, C., Dass, A., Theodoratos, D., Wu, X.: Clustering query results to support keyword search on tree data. In: Li, F., Li, G., Hwang, S., Yao, B., Zhang, Z. (eds.) WAIM 2014. LNCS, vol. 8485, pp. 213–224. Springer, Heidelberg (2014) Aksoy, C., Dass, A., Theodoratos, D., Wu, X.: Clustering query results to support keyword search on tree data. In: Li, F., Li, G., Hwang, S., Yao, B., Zhang, Z. (eds.) WAIM 2014. LNCS, vol. 8485, pp. 213–224. Springer, Heidelberg (2014)
3.
Zurück zum Zitat Aksoy, C., Dimitriou, A., Theodoratos, D.: Reasoning with patterns to effectively answer XML keyword queries. VLDB J. 24(3), 441–465 (2015)CrossRef Aksoy, C., Dimitriou, A., Theodoratos, D.: Reasoning with patterns to effectively answer XML keyword queries. VLDB J. 24(3), 441–465 (2015)CrossRef
4.
Zurück zum Zitat Aksoy, C., Dimitriou, A., Theodoratos, D., Wu, X.: XReason: A semantic approach that reasons with patterns to answer XML keyword queries. In: DASFAA, pp. 299–314 (2013) Aksoy, C., Dimitriou, A., Theodoratos, D., Wu, X.: XReason: A semantic approach that reasons with patterns to answer XML keyword queries. In: DASFAA, pp. 299–314 (2013)
5.
Zurück zum Zitat Angel, A., Koudas, N.: Efficient diversity-aware search. In: SIGMOD, pp. 781–792 (2011) Angel, A., Koudas, N.: Efficient diversity-aware search. In: SIGMOD, pp. 781–792 (2011)
6.
Zurück zum Zitat Bao, Z., Ling, T.W., Chen, B., Lu, J.: Effective XML keyword search with relevance oriented ranking. In: ICDE, pp. 517–528 (2009) Bao, Z., Ling, T.W., Chen, B., Lu, J.: Effective XML keyword search with relevance oriented ranking. In: ICDE, pp. 517–528 (2009)
7.
Zurück zum Zitat Bao, Z., Lu, J., Ling, T.W., Chen, B.: Towards an effective XML keyword search. IEEE Trans. Knowl. Data Eng. 22(8), 1077–1092 (2010)CrossRef Bao, Z., Lu, J., Ling, T.W., Chen, B.: Towards an effective XML keyword search. IEEE Trans. Knowl. Data Eng. 22(8), 1077–1092 (2010)CrossRef
8.
Zurück zum Zitat Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR, pp. 335–336 (1998) Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR, pp. 335–336 (1998)
9.
Zurück zum Zitat Carterette, B.: An analysis of NP-completeness in novelty and diversity ranking. Inf. Retr. 14(1), 89–106 (2011)CrossRef Carterette, B.: An analysis of NP-completeness in novelty and diversity ranking. Inf. Retr. 14(1), 89–106 (2011)CrossRef
10.
Zurück zum Zitat Clarke, C.L., Kolla, M., Cormack, G.V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I.: Novelty and diversity in information retrieval evaluation. In: SIGIR, pp. 659–666 (2008) Clarke, C.L., Kolla, M., Cormack, G.V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I.: Novelty and diversity in information retrieval evaluation. In: SIGIR, pp. 659–666 (2008)
11.
Zurück zum Zitat Demidova, E., Fankhauser, P., Zhou, X., Nejdl, W.: DivQ: Diversification for keyword search over structured databases. In: SIGIR, pp. 331–338 (2010) Demidova, E., Fankhauser, P., Zhou, X., Nejdl, W.: DivQ: Diversification for keyword search over structured databases. In: SIGIR, pp. 331–338 (2010)
12.
Zurück zum Zitat Drosou, M., Pitoura, E.: Search result diversification. SIGMOD Rec. 39(1), 41–47 (2010)CrossRef Drosou, M., Pitoura, E.: Search result diversification. SIGMOD Rec. 39(1), 41–47 (2010)CrossRef
13.
Zurück zum Zitat Erkut, E., Ulkusal, Y., Yenicerioglu, O.: A comparison of p-dispersion heuristics. Comput. Oper. Res. 21(10), 1103–1113 (1994)CrossRefMATH Erkut, E., Ulkusal, Y., Yenicerioglu, O.: A comparison of p-dispersion heuristics. Comput. Oper. Res. 21(10), 1103–1113 (1994)CrossRefMATH
14.
Zurück zum Zitat Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: WWW, pp. 381–390 (2009) Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: WWW, pp. 381–390 (2009)
15.
Zurück zum Zitat Hasan, M., Mueen, A., Tsotras, V., Keogh, E.: Diversifying query results on semi-structured data. In: CIKM, pp. 2099–2103 (2012) Hasan, M., Mueen, A., Tsotras, V., Keogh, E.: Diversifying query results on semi-structured data. In: CIKM, pp. 2099–2103 (2012)
16.
Zurück zum Zitat Li, J., Liu, C., Yu, J.: Context-based diversification for keyword queries over XML data. IEEE Trans. Knowl. Data Eng. 27(3), 660–672 (2015)CrossRef Li, J., Liu, C., Yu, J.: Context-based diversification for keyword queries over XML data. IEEE Trans. Knowl. Data Eng. 27(3), 660–672 (2015)CrossRef
17.
Zurück zum Zitat Li, J., Liu, C., Zhou, R., Wang, W.: Suggestion of promising result types for XML keyword search. In: EDBT, pp. 561–572 (2010) Li, J., Liu, C., Zhou, R., Wang, W.: Suggestion of promising result types for XML keyword search. In: EDBT, pp. 561–572 (2010)
18.
Zurück zum Zitat Liu, Z., Natarajan, S., Chen, Y.: Query expansion based on clustered results. Proc. VLDB Endow. 4(6), 350–361 (2011)CrossRef Liu, Z., Natarajan, S., Chen, Y.: Query expansion based on clustered results. Proc. VLDB Endow. 4(6), 350–361 (2011)CrossRef
19.
Zurück zum Zitat Liu, Z., Sun, P., Chen, Y.: Structured search result differentiation. PVLDB 2(1), 313–324 (2009) Liu, Z., Sun, P., Chen, Y.: Structured search result differentiation. PVLDB 2(1), 313–324 (2009)
20.
Zurück zum Zitat Radlinski, F., Dumais, S.: Improving personalized web search using result diversification. In: SIGIR, pp. 691–692 (2006) Radlinski, F., Dumais, S.: Improving personalized web search using result diversification. In: SIGIR, pp. 691–692 (2006)
21.
Zurück zum Zitat Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef
22.
Zurück zum Zitat Yu, C., Lakshmanan, L., Amer-Yahia, S.: Recommendation diversification using explanations. In: ICDE, pp. 1299–1302 (2009) Yu, C., Lakshmanan, L., Amer-Yahia, S.: Recommendation diversification using explanations. In: ICDE, pp. 1299–1302 (2009)
23.
Zurück zum Zitat Zhang, M., Hurley, N.: Avoiding monotony: Improving the diversity of recommendation lists. In: RecSys, pp. 123–130 (2008) Zhang, M., Hurley, N.: Avoiding monotony: Improving the diversity of recommendation lists. In: RecSys, pp. 123–130 (2008)
24.
Zurück zum Zitat Zhang, Y., Callan, J., Minka, T.: Novelty and redundancy detection in adaptive filtering. In: SIGIR, pp. 81–88 (2002) Zhang, Y., Callan, J., Minka, T.: Novelty and redundancy detection in adaptive filtering. In: SIGIR, pp. 81–88 (2002)
Metadaten
Titel
Diversification of Keyword Query Result Patterns
verfasst von
Cem Aksoy
Ananya Dass
Dimitri Theodoratos
Xiaoying Wu
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-39958-4_14

Neuer Inhalt