skip to main content
10.1145/1458082.1458112acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Can all tags be used for search?

Published:26 October 2008Publication History

ABSTRACT

Collaborative tagging has become an increasingly popular means for sharing and organizing Web resources, leading to a huge amount of user generated metadata. These tags represent quite a few different aspects of the resources they describe and it is not obvious whether and how these tags or subsets of them can be used for search. This paper is the first to present an in-depth study of tagging behavior for very different kinds of resources and systems - Web pages (Del.icio.us), music (Last.fm), and images (Flickr) - and compares the results with anchor text characteristics. We analyze and classify sample tags from these systems, to get an insight into what kinds of tags are used for different resources, and provide statistics on tag distributions in all three tagging environments. Since even relevant tags may not add new information to the search procedure, we also check overlap of tags with content, with metadata assigned by experts and from other sources. We discuss the potential of different kinds of tags for improving search, comparing them with user queries posted to search engines as well as through a user survey. The results are promising and provide more insight into both the use of different kinds of tags for improving search and possible extensions of tagging systems to support the creation of potentially search-relevant tags.

References

  1. M. Ames and M. Naaman. Why we tag: motivations for annotation in mobile and online media. In Proceedings SIGCHI, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Bao, X. Wu, B. Fei, G. Xue, Z. Su, and Y. Yu. Optimizing web search using social annotations. In Proceedings WWW, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107--117, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Cohen. A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20(1):37--46, 1960.Google ScholarGoogle ScholarCross RefCross Ref
  5. B. D. Davison. Topical locality in the web. In Proceedings SIGIR, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. B. D. Eugenio and M. Glass. The kappa statistic: A second look. Computational Linguistics, 30(1):95--101, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. C. S. Firan, W. Nejdl, and R. Paiu. The benefit of using tag-based profiles. In Proceedings LA-WEB, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. A. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems. Journal of Information Science, 32(2):198--208, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H. Halpin, V. Robu, and H. Shepherd. The complex dynamics of collaborative tagging. In Proceedings WWW, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Heymann, G. Koutrika, and H. Garcia-Molina. Can social bookmarking improve web search? In Proceedings WSDM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. Information retrieval in folksonomies: Search and ranking. In Proceedings ESWC, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M.-Y. Kan. Web page categorization without the web page. In Proceedings WWW, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. P. Knees, T. Pohle, M. Schedl, and G. Widmer. A music search engine built upon audio-based and web-based similarity measures. In Proceedings SIGIR, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Kraft and J. Zien. Mining anchor text for query refinement. In Proceedings WWW, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Marlow, M. Naaman, D. Boyd, and M. Davis. Ht06, tagging paper, taxonomy, flickr, academic article, to read. In Proceedings HYPERTEXT, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. O. A. McBryan. GENVL and WWWW: Tools for Taming the Web. In Proceedings WWW, 1994.Google ScholarGoogle ScholarCross RefCross Ref
  17. M. Naaman, S. Harada, Q. Wang, H. Garcia-Molina, and A. Paepcke. Context data in geo-referenced digital photo collections. In Proceedings MULTIMEDIA '04, pages 196--203, New York, NY, USA, 2004. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. Pass, A. Chowdhury, and C. Torgeson. A picture of search. In Proceedings Infoscale, 2006.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Sen, S. K. Lam, A. M. Rashid, D. Cosley, D. Frankowski, J. Osterhouse, F. M. Harper, and J. Riedl. tagging, communities, vocabulary, evolution. In Proceedings CSCW, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Shen, J.-T. Sun, Q. Yang, and Z. Chen. A comparison of implicit and explicit links for web page classification. In Proceedings WWW, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Z. Xu, Y. Fu, J. Mao, and D. Su. Towards the semantic web: Collaborative tag suggestions. WWW Workshop on Collaborative Web Tagging, 2006.Google ScholarGoogle Scholar
  22. A. Zollers. Emerging motivations for tagging: Expression, performance, and activism. WWW Workshop on Tagging and Metadata for Social Information Organization, 2007.Google ScholarGoogle Scholar

Index Terms

  1. Can all tags be used for search?

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader