ABSTRACT
Current web search engines are built to serve all users, independent of the needs of any individual user. Personalization of web search is to carry out retrieval for each user incorporating his/her interests. We propose a novel technique to map a user query to a set of categories, which represent the user's search intention. This set of categories can serve as a context to disambiguate the words in the user's query. A user profile and a general profile are learned from the user's search history and a category hierarchy respectively. These two profiles are combined to map a user query into a set of categories. Several learning and combining algorithms are evaluated and found to be effective. Among the algorithms to learn a user profile, we choose the Rocchio-based method for its simplicity, efficiency and its ability to be adaptive. Experimental results indicate that our technique to personalize web search is both effective and efficient.
- J. Allan. Incremental relevance feedback for information filtering. SIGIR, 1996 Google ScholarDigital Library
- M. Balabanovic and Y. Shoham. Learning information retrieval agents: Experiments with automated Web browsing. In On-line Working Notes of the AAAI Spring Symposium Series on Information Gathering from Distributed, Heterogeneous Environments, 1995.Google Scholar
- K. Bollacker, S. Lawrence, and C. Lee Giles. A system for automatic personalized tracking of scientific literature on the web. ACM DL, 1999. Google ScholarDigital Library
- J. Budzik and J. K. Hammond. Watson: Anticipating and contextualizing information needs. In Proceedings of the Sixty-second Annual Meeting of the American Society for Information Science, 1999Google Scholar
- U. Çetintemel, M. J. Franklin, and C. Lee Giles. Self-Adaptive User Profiles for Large-Scale Data Delivery.ICDE, 2000Google Scholar
- L. Chen and K. Sycara. WebMate: A Personal Agent for Browsing and Searching. Autonomous Agents and Multi Agent Systems, 1998. Google ScholarDigital Library
- S. Deerwester, S. T. Dumais, G. Furnas, T. Landauer, and R. Harshman. Indexing by latent semantic analysis. JASIS, 18(2), 1990.Google Scholar
- R. Dolin, D. Agrawal, A. El Abbadi and J. Pearlman. Using Automated Classification for Summarizating and Selecting Heterogeneous Information Sources. D-Lib Magazine, 1998. Google ScholarDigital Library
- W. Foltz and S. T. Dumais. Personalized information delivery: An analysis of information filtering methods. CACM,1992. Google ScholarDigital Library
- W. Frakes, and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. 1992. Google ScholarDigital Library
- S. Gauch, G. Wang, M. Gomez. ProFusion: Intelligent Fusion from Multiple, Distributed Search Engines. Journal of Universal Computer Science, 2(9), 1996Google Scholar
- E. Glover, G. Flake, S. Lawrence, W. Birmingham, A. Kruger, C. Giles, and D. Pennock. Improving Category Specific Web Search by Learning Query Modifications. SAINT, 2001 Google ScholarDigital Library
- G. H. Golub and C. F. Van Loan. Matrix Computations. Third Edition, 1996 Google ScholarDigital Library
- L. Gravano, and H. Garcia-Molina. Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies. VLDB, 1995. Google ScholarDigital Library
- D. Grossman and O. Frieder. Information Retrieval: Algorithms and Heuristics. 1998. Google ScholarDigital Library
- A. E. Howe and D. Dreilinger. SavvySearch: A meta-search engine that learns which search engines to query. AI Magazine, 18(2), 1997.Google Scholar
- P. Ipeirotis, L. Gravano, and M. Sahami. Probe, Count, and Classify: Categorizing Hidden Web Databases. ACM SIGMOD, 2001. Google ScholarDigital Library
- Joachims, T., Freitag, D., and Mitchell, T. Webwatcher: A tour guide for the World Wide Web. IJCAI, 1997Google Scholar
- D. Koller and M. Sahami. Hierarchically classifying documents using very few words. ICML, 1997 Google ScholarDigital Library
- Y. Labrou and T. Finin. Yahoo! as an ontology: using Yahoo! categories to describe documents. CIKM, 1999 Google ScholarDigital Library
- H. Lieberman. Letizia: An agent that assists Web browsing. IJCAI, 1995.Google Scholar
- W. Meng, W. Wang, H. Sun and C. Yu. Concept Hierarchy Based Text Database Categorization. International Journal on Knowledge and Information Systems, March 2002.Google Scholar
- T. Mitchell. Machine Learning, 1997. Google ScholarDigital Library
- M. Pazzani and D. Billsus. Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning, 1997. Google ScholarDigital Library
- A. L. Powell, J. C. French, J. P. Callan and M. Connell. The impact of database selection on distributed searching. SIGIR, 2000. Google ScholarDigital Library
- A. Pretschner and S. Gauch. Ontology based personalized search. ICTAI, 1999 Google ScholarDigital Library
- J. Rocchio. Relevance feedback in information retrieval. In The smart retrieval system: Experiments in automatic document processing, 1971.Google Scholar
- S. Robertson and I. Soboroff. The TREC-10 Filtering Track Final Report. TREC-10, 2001.Google Scholar
- G. Salton and M. J. McGill. Introduction to Modern Information Retrieval, 1983. Google ScholarDigital Library
- D. H. Widyantoro, T. R. Ioerger and J. Yen. An adaptive algorithm for learning changes in user interests. CIKM, 1999 Google ScholarDigital Library
- T. W. Yan and H. Garcia-Molina. SIFT -- A Tool for Wide-Area Information Dissemination. USENIX Technical Conference, 1995. Google ScholarDigital Library
- Y. Yang and C. G. Chute. An example-based mapping method for text categorization and retrieval. TOIS, 1994 Google ScholarDigital Library
- Y. Yang. Noise Reduction in a Statistical Approach to Text Categorization. SIGIR 1995 Google ScholarDigital Library
- Y. Yang and X. Liu, A re-examination of text categorization methods. SIGIR 1999. Google ScholarDigital Library
- C. Yu, W. Meng, W. Wu and K. Liu. Efficient and Effective Metasearch for Text Databases Incorporating Linkages among Documents. ACM SIGMOD, 2001. Google ScholarDigital Library
Index Terms
- Personalized web search by mapping user queries to categories
Recommendations
Personalized Web Search For Improving Retrieval Effectiveness
Abstract--Current Web search engines are built to serve all users, independent of the special needs of any individual user. Personalization of Web search is to carry out retrieval for each user incorporating his/her interests. We propose a novel ...
Deriving Concept-Based User Profiles from Search Engine Logs
User profiling is a fundamental component of any personalization applications. Most existing user profiling strategies are based on objects that users are interested in (i.e., positive preferences), but not the objects that users dislike (i.e., negative ...
A personalized URL re-ranking methodology using user's browsing behavior
KES-AMSTA'08: Proceedings of the 2nd KES International conference on Agent and multi-agent systems: technologies and applicationsThis paper proposes a personalized re-ranking of URLs returned by a search engine using user's browsing behaviors. Our personalization method constructs an index of the anchor text retrieved from the web pages that the user has clicked during his/her ...
Comments