skip to main content
10.1145/584792.584884acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Personalized web search by mapping user queries to categories

Published:04 November 2002Publication History

ABSTRACT

Current web search engines are built to serve all users, independent of the needs of any individual user. Personalization of web search is to carry out retrieval for each user incorporating his/her interests. We propose a novel technique to map a user query to a set of categories, which represent the user's search intention. This set of categories can serve as a context to disambiguate the words in the user's query. A user profile and a general profile are learned from the user's search history and a category hierarchy respectively. These two profiles are combined to map a user query into a set of categories. Several learning and combining algorithms are evaluated and found to be effective. Among the algorithms to learn a user profile, we choose the Rocchio-based method for its simplicity, efficiency and its ability to be adaptive. Experimental results indicate that our technique to personalize web search is both effective and efficient.

References

  1. J. Allan. Incremental relevance feedback for information filtering. SIGIR, 1996 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Balabanovic and Y. Shoham. Learning information retrieval agents: Experiments with automated Web browsing. In On-line Working Notes of the AAAI Spring Symposium Series on Information Gathering from Distributed, Heterogeneous Environments, 1995.Google ScholarGoogle Scholar
  3. K. Bollacker, S. Lawrence, and C. Lee Giles. A system for automatic personalized tracking of scientific literature on the web. ACM DL, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Budzik and J. K. Hammond. Watson: Anticipating and contextualizing information needs. In Proceedings of the Sixty-second Annual Meeting of the American Society for Information Science, 1999Google ScholarGoogle Scholar
  5. U. Çetintemel, M. J. Franklin, and C. Lee Giles. Self-Adaptive User Profiles for Large-Scale Data Delivery.ICDE, 2000Google ScholarGoogle Scholar
  6. L. Chen and K. Sycara. WebMate: A Personal Agent for Browsing and Searching. Autonomous Agents and Multi Agent Systems, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Deerwester, S. T. Dumais, G. Furnas, T. Landauer, and R. Harshman. Indexing by latent semantic analysis. JASIS, 18(2), 1990.Google ScholarGoogle Scholar
  8. R. Dolin, D. Agrawal, A. El Abbadi and J. Pearlman. Using Automated Classification for Summarizating and Selecting Heterogeneous Information Sources. D-Lib Magazine, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. Foltz and S. T. Dumais. Personalized information delivery: An analysis of information filtering methods. CACM,1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. W. Frakes, and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Gauch, G. Wang, M. Gomez. ProFusion: Intelligent Fusion from Multiple, Distributed Search Engines. Journal of Universal Computer Science, 2(9), 1996Google ScholarGoogle Scholar
  12. E. Glover, G. Flake, S. Lawrence, W. Birmingham, A. Kruger, C. Giles, and D. Pennock. Improving Category Specific Web Search by Learning Query Modifications. SAINT, 2001 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. G. H. Golub and C. F. Van Loan. Matrix Computations. Third Edition, 1996 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. L. Gravano, and H. Garcia-Molina. Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies. VLDB, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Grossman and O. Frieder. Information Retrieval: Algorithms and Heuristics. 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. E. Howe and D. Dreilinger. SavvySearch: A meta-search engine that learns which search engines to query. AI Magazine, 18(2), 1997.Google ScholarGoogle Scholar
  17. P. Ipeirotis, L. Gravano, and M. Sahami. Probe, Count, and Classify: Categorizing Hidden Web Databases. ACM SIGMOD, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Joachims, T., Freitag, D., and Mitchell, T. Webwatcher: A tour guide for the World Wide Web. IJCAI, 1997Google ScholarGoogle Scholar
  19. D. Koller and M. Sahami. Hierarchically classifying documents using very few words. ICML, 1997 Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Y. Labrou and T. Finin. Yahoo! as an ontology: using Yahoo! categories to describe documents. CIKM, 1999 Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. H. Lieberman. Letizia: An agent that assists Web browsing. IJCAI, 1995.Google ScholarGoogle Scholar
  22. W. Meng, W. Wang, H. Sun and C. Yu. Concept Hierarchy Based Text Database Categorization. International Journal on Knowledge and Information Systems, March 2002.Google ScholarGoogle Scholar
  23. T. Mitchell. Machine Learning, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Pazzani and D. Billsus. Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. A. L. Powell, J. C. French, J. P. Callan and M. Connell. The impact of database selection on distributed searching. SIGIR, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. Pretschner and S. Gauch. Ontology based personalized search. ICTAI, 1999 Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. Rocchio. Relevance feedback in information retrieval. In The smart retrieval system: Experiments in automatic document processing, 1971.Google ScholarGoogle Scholar
  28. S. Robertson and I. Soboroff. The TREC-10 Filtering Track Final Report. TREC-10, 2001.Google ScholarGoogle Scholar
  29. G. Salton and M. J. McGill. Introduction to Modern Information Retrieval, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. D. H. Widyantoro, T. R. Ioerger and J. Yen. An adaptive algorithm for learning changes in user interests. CIKM, 1999 Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. W. Yan and H. Garcia-Molina. SIFT -- A Tool for Wide-Area Information Dissemination. USENIX Technical Conference, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Y. Yang and C. G. Chute. An example-based mapping method for text categorization and retrieval. TOIS, 1994 Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Y. Yang. Noise Reduction in a Statistical Approach to Text Categorization. SIGIR 1995 Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Y. Yang and X. Liu, A re-examination of text categorization methods. SIGIR 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. C. Yu, W. Meng, W. Wu and K. Liu. Efficient and Effective Metasearch for Text Databases Incorporating Linkages among Documents. ACM SIGMOD, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Personalized web search by mapping user queries to categories

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management
        November 2002
        704 pages
        ISBN:1581134924
        DOI:10.1145/584792

        Copyright © 2002 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 November 2002

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader