ABSTRACT
In many cases, a user turns to search engines to find information about real-world situations, namely, political elections, sport competitions, or natural disasters. Such temporal querying behavior can be observed through a significant number of event-related queries generated in web search. In this paper, we study the task of detecting event-related queries, which is the first step for understanding temporal query intent and enabling different temporal search applications, e.g., time-aware query auto-completion, temporal ranking, and result diversification. We propose a two-step approach to detecting events from query logs. We first identify a set of event candidates by considering both implicit and explicit temporal information needs. The next step further classifies the candidates into two main categories, namely, event or non-event. In more detail, we leverage different machine learning techniques for query classification, which are trained using the feature set composed of time series features from signal processing, along with features derived from click-through information, and standard statistical features. In order to evaluate our proposed approach, we conduct an experiment using two real-world query logs with manually annotated relevance assessments for 837 events. To this end, we provide a large set of event-related queries made available for fostering research on this challenging task.
- G. E. P. Box and G. Jenkins. Time Series Analysis, Forecasting and Control. Holden-Day, Incorporated, 1990. Google ScholarDigital Library
- R. Burghartz and K. Berberich. MPI-INF at the NTCIR-11 Temporal Query Classification Task. In Proceedings of the 11th NTCIR Conference, 2014.Google Scholar
- R. Campos, G. Dias, A. Jorge, and C. Nunes. GTE: A distributional second-order co-occurrence approach to improve the identification of top relevant dates in web snippets. In Proceedings of CIKM '12, 2012. Google ScholarDigital Library
- R. B. Cleveland, W. S. Cleveland, J. E. McRae, and I. Terpenning. STL: A seasonal-trend decomposition procedure based on loess (with discussion). Journal of Official Statistics, 6:3--73, 1990.Google Scholar
- Z. Dou, R. Song, and J.-R. Wen. A large-scale evaluation and analysis of personalized search strategies. In Proceedings of WWW '07, 2007. Google ScholarDigital Library
- S. N. Ghoreishi and A. Sun. Predicting event-relatedness of popular queries. In Proceedings of CIKM '13, 2013. Google ScholarDigital Library
- C. C. Holt. Forecasting seasonals and trends by exponentially weighted moving averages. International Journal of Forecasting, 20(1):5--10, 2004.Google ScholarCross Ref
- H. Joho, A. Jatowt, and R. Blanco. NTCIR temporalia: a test collection for temporal information access research. In Proceedings of WWW '14, 2014. Google ScholarDigital Library
- R. Jones and F. Diaz. Temporal profiles of queries. ACM Trans. Inf. Syst., 25, July 2007. Google ScholarDigital Library
- S. R. Kairam, M. R. Morris, J. Teevan, D. J. Liebling, and S. T. Dumais. Towards supporting search over trending events with social media. In Proceedings of ICWSM '13, 2013.Google Scholar
- N. Kanhabua and K. Nørvåg. Determining time of queries for re-ranking search results. In Proceedings of ECDL '10, 2010. Google ScholarDigital Library
- N. Kanhabua, S. Romano, and A. Stewart. Identifying relevant temporal expressions for real-world events. In Proceedings of the SIGIR 2012 Workshop on Time-aware Information Access (TAIA '12), 2012.Google Scholar
- J. Kleinberg. Bursty and hierarchical structure in streams. In Proceedings of SIGKDD '02, 2002. Google ScholarDigital Library
- A. Kulkarni, J. Teevan, K. M. Svore, and S. T. Dumais. Understanding temporal query dynamics. In Proceedings of WSDM '11, 2011. Google ScholarDigital Library
- V. Lavrenko and W. B. Croft. Relevance based language models. In Proceedings of SIGIR '01, 2001. Google ScholarDigital Library
- D. Metzler, R. Jones, F. Peng, and R. Zhang. Improving search relevance for implicitly temporal queries. In Proceedings of SIGIR '09, 2009. Google ScholarDigital Library
- S. Nunes, C. Ribeiro, and G. David. Use of temporal expressions in web search. In Proceedings of ECIR '08, 2008. Google ScholarDigital Library
- K. Radinsky, K. Svore, S. Dumais, J. Teevan, A. Bocharov, and E. Horvitz. Modeling and predicting behavioral dynamics on the web. In Proceedings of WWW '12, 2012. Google ScholarDigital Library
- M. Shokouhi. Detecting seasonal queries by time-series analysis. In Proceeding of SIGIR '11, 2011. Google ScholarDigital Library
- J. Strötgen and M. Gertz. HeidelTime: High quality rule-based extraction and normalization of temporal expressions. In Proceedings of Workshop on Semantic Evaluation, 2010. Google ScholarDigital Library
- J. Strötgen and M. Gertz. Event-centric search and exploration in document collections. In Proceedings of JCDL '12, 2012. Google ScholarDigital Library
- J.-R. Wen, J.-Y. Nie, and H.-J. Zhang. Query clustering using user logs. ACM Trans. Inf. Syst., 20(1):59--81, Jan. 2002. Google ScholarDigital Library
- R. Zhang, Y. Konda, A. Dong, P. Kolari, Y. Chang, and Z. Zheng. Learning recurrent event queries for web search. In Proceedings of EMNLP '10, 2010. Google ScholarDigital Library
- X. Zhu, J. Guo, X. Cheng, P. Du, and H.-W. Shen. A unified framework for recommending diverse and relevant queries. In Proceedings of WWW '11, 2011. Google ScholarDigital Library
Index Terms
- Learning to Detect Event-Related Queries for Web Search
Recommendations
Learning Temporal Ambiguity in Web Search Queries
CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge ManagementTime has strong influence on web search. The temporal intent of the searcher adds an important dimension to the relevance judgments of web queries. However, lack of understanding their temporal requirements increases the ambiguity of the queries, ...
Detecting Intent of Web Queries Using Questions and Answers in CQA Corpus
WI-IAT '11: Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01Detecting intent in Web search activity is important task for finding relevant Web information. However extracting intents from users' queries is difficult as users express their intent by issuing short and often ambiguous queries, yet at the same time ...
Learning recurrent event queries for web search
EMNLP '10: Proceedings of the 2010 Conference on Empirical Methods in Natural Language ProcessingRecurrent event queries (REQ) constitute a special class of search queries occurring at regular, predictable time intervals. The freshness of documents ranked for such queries is generally of critical importance. REQ forms a significant volume, as much ...
Comments