skip to main content
research-article

Exemplar queries: give me an example of what you need

Published:01 January 2014Publication History
Skip Abstract Section

Abstract

Search engines are continuously employing advanced techniques that aim to capture user intentions and provide results that go beyond the data that simply satisfy the query conditions. Examples include the personalized results, related searches, similarity search, popular and relaxed queries. In this work we introduce a novel query paradigm that considers a user query as an example of the data in which the user is interested. We call these queries exemplar queries and claim that they can play an important role in dealing with the information deluge. We provide a formal specification of the semantics of such queries and show that they are fundamentally different from notions like queries by example, approximate and related queries. We provide an implementation of these semantics for graph-based data and present an exact solution with a number of optimizations that improve performance without compromising the quality of the answers. We also provide an approximate solution that prunes the search space and achieves considerably better time-performance with minimal or no impact on effectiveness. We experimentally evaluate the effectiveness and efficiency of these solutions with synthetic and real datasets, and illustrate the usefulness of exemplar queries in practice.

References

  1. R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying search results. In WSDM, pages 5--14, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Anagnostopoulos, L. Becchetti, C. Castillo, and A. Gionis. An optimization framework for query recommendation. In WSDM, pages 161--170, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Baeza-Yates, P. Boldi, and C. Castillo. Generalizing pagerank: damping functions for link-based ranking algorithms. In SIGIR, pages 308--315, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. I. Bedini, B. Elser, and Y. Velegrakis. The trento big data platform for public administration and large companies: Use cases and opportunities. PVLDB, 6(11):1166--1167, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Bergamaschi, E. Domnori, F. Guerra, R. Trillo Lado, and Y. Velegrakis. Keyword search over relational databases: a meta-data approach. In SIGMOD, pages 565--576, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Bergamaschi, F. Guerra, S. Rota, and Y. Velegrakis. A hidden markov model approach to keyword-based search over relational databases. In ER, pages 411--420, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Bhatia, D. Majumdar, and P. Mitra. Query suggestions in the absence of query logs. In SIGIR, pages 795--804, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. Boldi, F. Bonchi, C. Castillo, and S. Vigna. Query reformulation mining: models, patterns, and applications. IR, 14(3):257--289, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Bordino, G. De Francisci Morales, I. Weber, and F. Bonchi. From machu_picchu to rafting the urubamba river: anticipating information needs via the entity-query graph. In WSDM, pages 275--284, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Chakrabarti. Dynamic personalized pagerank in entity-relation graphs. In WWW, pages 571--580, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. X. Dong, A. Y. Halevy, and J. Madhavan. Reference Reconciliation in Complex Information Spaces. In SIGMOD, pages 85--96, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Z. Dou, S. Hu, Y. Luo, R. Song, and J. Wen. Finding dimensions for queries. In CIKM, pages 1311--1320, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. X. Gao, B. Xiao, D. Tao, and X. Li. A survey of graph edit distance. Pattern Analysis and applications, 13(1):113--129, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Gauch and J. B. Smith. Search improvement via automatic query reformulation. TOIS, 9(3):249--280, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. H. Haveliwala. Topic-sensitive pagerank. In WWW, pages 517--526, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. R. Henzinger, T. A. Henzinger, and P. W. Kopke. Computing simulations on finite and infinite graphs. In FOCS, pages 453--462. IEEE, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. B. Jansen, D. Booth, and A. Spink. Determining the informational, navigational, and transactional intent of web queries. Information Processing & Management, 44(3):1251--1266, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. Jeh and J. Widom. Scaling personalized web search. In WWW, pages 271--279, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Khan, N. Li, X. Yan, Z. Guan, S. Chakraborty, and S. Tao. Neighborhood based fast graph search in large networks. In SIGMOD, pages 901--912, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Khan, Y. Wu, C. C. Aggarwal, and X. Yan. Nema: Fast graph search with label similarity. In PVLDB, pages 181--192, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. N. Lao and W. W. Cohen. Fast query execution for retrieval models based on path-constrained random walks. In KDD, pages 881--888, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. Mishra and N. Koudas. Interactive query refinement. In EDBT, pages 862--873, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. D. Mottin, A. Marascu, S. B. Roy, G. Das, T. Palpanas, and Y. Velegrakis. A probabilistic optimization framework for the empty-answer problem. PVLDB, 6(14):1762--1773, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. D. Mottin, T. Palpanas, and Y. Velegrakis. Entity Ranking Using Click-Log Information. IDA Journal, 17(5):837--856, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. V. M. Ngo and T. H. Cao. Ontology-based query expansion with latently related named entities for semantic text search. In IJIIDS, volume 283, pages 41--52. 2010.Google ScholarGoogle ScholarCross RefCross Ref
  26. L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical Report 1999-66, Stanford InfoLab, November 1999.Google ScholarGoogle Scholar
  27. D. Park. Concurrency and automata on infinite sequences. Springer, 1981.Google ScholarGoogle ScholarCross RefCross Ref
  28. Y. Qiu and H.-P. Frei. Concept based query expansion. In SIGIR, pages 160--169, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. C. E. Shannon. A mathematical theory of communication. SIG-MOBILE Mob. Comput. Commun. Rev., 5(1):3--55, Jan. 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. D. Vallet and H. Zaragoza. Inferring the most important types of a query: a semantic approach. In SIGIR, pages 857--858, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. X. Wang, X. Ding, A. K. H. Tung, S. Ying, and H. Jin. An efficient graph indexing method. In ICDE, pages 210--221, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. X. Wang and C. Zhai. Mining term association patterns from search logs for effective query reformulation. In CIKM, pages 479--488, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. X. Yan, P. S. Yu, and J. Han. Graph indexing: a frequent structure-based approach. In SIGMOD, pages 335--346, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. P. Zhao and J. Han. On graph query optimization in large networks. VLDB J., 3(1-2):340--351, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. M. M. Zloof. Query by example. In AFIPS NCC, pages 431--438, 1975. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Exemplar queries: give me an example of what you need
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image Proceedings of the VLDB Endowment
      Proceedings of the VLDB Endowment  Volume 7, Issue 5
      Janary 2014
      100 pages
      ISSN:2150-8097
      Issue’s Table of Contents

      Publisher

      VLDB Endowment

      Publication History

      • Published: 1 January 2014
      Published in pvldb Volume 7, Issue 5

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader