DOI: 10.1145/1516360.1516406
research-article

Data clouds: summarizing keyword search results over structured data

Published: 24 March 2009

ABSTRACT

Keyword searches are attractive because they make it easy for users to search structured databases. Tag clouds, in turn, are popular for navigating and visualizing unstructured data because they can dynamically highlight the most significant concepts and hidden relationships in the underlying content. In this paper, we propose coupling the flexibility of keyword searches over structured data with the summarization and navigation capabilities of tag clouds to help users access a database. We propose using clouds over structured data (data clouds) to summarize the results of keyword searches and to guide users in refining their searches. The cloud presents the most significant words associated with the search results. Our keyword search model allows searching for entities that can span multiple tables in the database, rather than just tuples, as existing keyword searches over databases do. We present several methods to compute scores both for the entities and for the terms in the search results. We describe algorithms for keyword searches with data clouds, and we present our system, CourseCloud, which offers a unified search and browse interface to a course database. We present experimental results showing (a) the appropriateness of the methods used for scoring terms, (b) the performance of the proposed algorithms, and (c) the effectiveness of CourseCloud compared to typical search and browse interfaces to a course database.
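
As a rough illustration of the idea, the sketch below runs a keyword search over a few toy course entities and builds a cloud from the terms that co-occur with the query in the results. Everything in it is an assumption made for illustration: the abstract does not specify the scoring methods, so raw term frequency stands in for them here, and the entity layout, the `ENTITIES` data, and the function names `search` and `data_cloud` are hypothetical rather than taken from CourseCloud.

```python
from collections import Counter

# Hypothetical toy entities: each one gathers text from several tables
# (course, instructor, comments) into a single searchable unit.
ENTITIES = {
    "cs101": {"title": "Introduction to Programming",
              "instructor": "Ada Smith",
              "comments": "fun programming projects in Java"},
    "cs245": {"title": "Database Systems",
              "instructor": "Bo Lee",
              "comments": "query processing and Java programming assignments"},
    "hist10": {"title": "American History",
               "instructor": "Cy Jones",
               "comments": "essays on American politics"},
}

STOPWORDS = {"to", "in", "and", "on", "of", "the"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [w for w in text.lower().split() if w not in STOPWORDS]


def entity_terms(entity):
    """Collect the terms from every attribute of an entity."""
    terms = []
    for value in entity.values():
        terms.extend(tokenize(value))
    return terms


def search(query):
    """Return ids of entities that contain every query keyword."""
    keywords = tokenize(query)
    return [eid for eid, entity in ENTITIES.items()
            if all(k in entity_terms(entity) for k in keywords)]


def data_cloud(query, top_k=5):
    """Score non-query terms in the results by raw frequency (an assumption)."""
    keywords = set(tokenize(query))
    counts = Counter()
    for eid in search(query):
        counts.update(t for t in entity_terms(ENTITIES[eid])
                      if t not in keywords)
    return counts.most_common(top_k)


print(search("programming"))         # ['cs101', 'cs245']
print(data_cloud("programming", 1))  # [('java', 2)]
```

Here "java" rises to the top of the cloud because it appears in both matching entities, which hints at how a cloud term could suggest a refinement such as "programming java". A real system would likely weight terms and entities more carefully than this frequency count does.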

    • Published in

      EDBT '09: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
      March 2009
      1180 pages
      ISBN: 9781605584225
      DOI: 10.1145/1516360

      Copyright © 2009 ACM

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      Overall Acceptance Rate: 7 of 10 submissions, 70%
