ABSTRACT
Keyword searches are attractive because they make it easy for users to search structured databases. Tag clouds, on the other hand, are popular for navigating and visualizing unstructured data because they dynamically highlight the most significant concepts and hidden relationships in the underlying content. In this paper, we propose coupling the flexibility of keyword searches over structured data with the summarization and navigation capabilities of tag clouds to help users access a database. We propose using clouds over structured data (data clouds) to summarize the results of keyword searches and to guide users in refining their searches. The cloud presents the most significant words associated with the search results. Our keyword search model allows searching for entities that can span multiple tables in the database, rather than just tuples as in existing keyword searches over databases. We present several methods for computing the scores of both the entities and the terms in the search results. We describe algorithms for keyword searches with data clouds, and we present our system, CourseCloud, which offers a unified search and browse interface to a course database. We present experimental results showing (a) the appropriateness of the methods used for scoring terms, (b) the performance of the proposed algorithms, and (c) the effectiveness of CourseCloud compared to typical search and browse interfaces to a course database.
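The idea of a data cloud can be illustrated with a minimal sketch. The abstract does not specify the scoring methods, so everything below is an assumption made for illustration: entities are represented as already-flattened text (standing in for values gathered from several tables), an entity's score is the number of query-term occurrences it contains, and a term's score is its frequency in each matching entity weighted by that entity's score. The names `keyword_search`, `data_cloud`, `STOPWORDS`, and the toy `courses` data are hypothetical and are not taken from CourseCloud.

```python
import re
from collections import Counter

# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = {"to", "in", "and", "of", "the"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def keyword_search(entities, query):
    """Return (entity_id, score) pairs for entities matching the query.

    Assumption: the score is the number of query-term occurrences in
    the entity's flattened text. Results are ordered by score, then id.
    """
    terms = tokenize(query)
    results = []
    for entity_id, text in entities.items():
        tokens = tokenize(text)
        score = sum(tokens.count(t) for t in terms)
        if score > 0:
            results.append((entity_id, score))
    return sorted(results, key=lambda r: (-r[1], r[0]))


def data_cloud(entities, results, query, k=5):
    """Return the k highest-scoring terms over the search results.

    Assumption: a term's score is the sum, over matching entities, of
    (term frequency in the entity) * (entity score). Query terms and
    stopwords are left out of the cloud.
    """
    query_terms = set(tokenize(query))
    scores = Counter()
    for entity_id, entity_score in results:
        for term, tf in Counter(tokenize(entities[entity_id])).items():
            if term not in query_terms and term not in STOPWORDS:
                scores[term] += tf * entity_score
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


# Toy course "entities", each flattened into one text field.
courses = {
    "CS101": "Introduction to programming in Java. "
             "Java programming basics and objects",
    "CS145": "Introduction to databases. SQL, relational databases "
             "and Java database programming",
    "HIST20": "History of American art",
}

results = keyword_search(courses, "java")
cloud = data_cloud(courses, results, "java", k=3)
```

In this sketch, picking a cloud term and appending it to the query would narrow the result set, which is one simple way to read the refinement loop described above.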
- Data clouds: summarizing keyword search results over structured data