DOI: 10.1145/1390334.1390425
research-article

Social tag prediction

Published: 20 July 2008

ABSTRACT

In this paper, we look at the "social tag prediction" problem. Given a set of objects, and a set of tags applied to those objects by users, can we predict whether a given tag could/should be applied to a particular object? We investigated this question using one of the largest crawls of the social bookmarking system del.icio.us gathered to date. For URLs in del.icio.us, we predicted tags based on page text, anchor text, surrounding hosts, and other tags applied to the URL. We found an entropy-based metric that captures the generality of a particular tag and informs an analysis of how well that tag can be predicted. We also found that tag-based association rules can produce very high-precision predictions and give a deeper understanding of the relationships between tags. Our results have implications both for the study of tagging systems as potential information retrieval tools and for the design of such systems.
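
To make the two ideas in the abstract concrete, here is a minimal sketch in Python. It is not the paper's implementation: the abstract does not give the exact definitions, so the toy data `url_tags`, the function names, the thresholds, and the choice of Shannon entropy over co-occurring tags are all assumptions made for illustration. The first function mines single-tag rules of the form "tag A implies tag B" using the standard support and confidence measures; the second computes one plausible entropy-style generality score.

```python
from collections import Counter
from itertools import permutations
from math import log2

# Hypothetical toy data: each URL maps to the set of tags users applied to it.
url_tags = {
    "u1": {"python", "programming", "tutorial"},
    "u2": {"python", "programming"},
    "u3": {"python", "programming", "web"},
    "u4": {"programming", "java"},
    "u5": {"web", "design"},
}

def pair_rules(url_tags, min_support=2, min_confidence=0.8):
    """Return {(a, b): (support, confidence)} for rules 'a implies b'.

    support    = number of URLs tagged with both a and b
    confidence = support / number of URLs tagged with a
    """
    tag_count = Counter()
    pair_count = Counter()
    for tags in url_tags.values():
        tag_count.update(tags)
        # Ordered pairs, so a -> b and b -> a are counted as separate rules.
        pair_count.update(permutations(sorted(tags), 2))
    rules = {}
    for (a, b), n_ab in pair_count.items():
        confidence = n_ab / tag_count[a]
        if n_ab >= min_support and confidence >= min_confidence:
            rules[(a, b)] = (n_ab, confidence)
    return rules

def cooccurrence_entropy(url_tags, tag):
    """Shannon entropy (bits) of the tags that co-occur with `tag`.

    Assumption: a tag that co-occurs with many different tags is treated
    as more general. The paper's actual metric may be defined differently.
    """
    co = Counter()
    for tags in url_tags.values():
        if tag in tags:
            co.update(tags - {tag})
    total = sum(co.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * log2(n / total) for n in co.values())

rules = pair_rules(url_tags)
```

On this toy data the only rule that clears both thresholds is "python implies programming": the reverse direction has confidence 3/4, which falls below the 0.8 cutoff. Raising `min_confidence` trades recall for precision, which is the kind of high-precision regime the abstract describes.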


Published in: SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
July 2008, 934 pages
ISBN: 9781605581644
DOI: 10.1145/1390334

Copyright © 2008 ACM


Publisher: Association for Computing Machinery, New York, NY, United States


Overall Acceptance Rate: 792 of 3,983 submissions, 20%
