skip to main content
10.1145/1526709.1526774acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Discovering users' specific geo intention in web search

Published:20 April 2009Publication History

ABSTRACT

Discovering users' specific and implicit geographic intention in web search can greatly help satisfy users' information needs. We build a geo intent analysis system that uses minimal supervision to learn a model from large amounts of web-search logs for this discovery. We build a city language model, which is a probabilistic representation of the language surrounding the mention of a city in web queries. We use several features derived from these language models to: (1) identify users' implicit geo intent and pinpoint the city corresponding to this intent, (2) determine whether the geo-intent is localized around the users' current geographic location, (3) predict cities for queries that have a mention of an entity that is located in a specific place. Experimental results demonstrate the effectiveness of using features derived from the city language model. We find that (1) the system has over 90% precision and more than 74% accuracy for the task of detecting users' implicit city level geo intent (2) the system achieves more than 96% accuracy in determining whether implicit geo queries are local geo queries, neighbor region geo queries or none-of-these (3) the city language model can effectively retrieve cities in location-specific queries with high precision (88%) and recall (74%); human evaluation shows that the language model predicts city labels for location-specific queries with high accuracy (84.5%).

References

  1. GeoCLEF workshop -- Evaluation of cross--language geographic information retrieval systems. www.uni--hildesheim.de/geoclef.Google ScholarGoogle Scholar
  2. L. Andrade and M. J. Silva. Relevance ranking for geographic ir. In ACM GIR, 2006.Google ScholarGoogle Scholar
  3. D. Bohning. Multinomial Logistic Regression Algorithm. Annals of the Inst. of Statistical Math., 44:197--200, November 1992.Google ScholarGoogle ScholarCross RefCross Ref
  4. J. Broglio, J. P. Callan, and W. B. Croft. An overview of the INQUERY system as used for the TIPSTER project. Technical report, Amherst, MA, USA, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm, 2001.Google ScholarGoogle Scholar
  6. S. F. Chen and J. Goodman. An empirical study of smoothing techniques for language modeling. In Proceedings of ACL, pages 310--318, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. K. W. Church and P. Hanks. Word association norms, mutual information, and lexicography. In Proceedings of ACL, pages 76--83, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. H. Friedman. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29:1189--1232, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  9. R. Jones, W. V. Zhang, B. Rey, P. Jhala, and E. Stipp. Geographic intention and modification in web search. International Journal of Geographical Information Science (IJGIS), March 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Pasca. Weakly-supervised discovery of named entities using web search queries. In CIKM, pages 683--690, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In ACM SIGIR, pages 275--281, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Purves and C. Jones, editors. ACM GIR. ACM, 2007.Google ScholarGoogle Scholar
  13. J. Qian. Local Search Using Address Completion. US Patent Application 20080065694, March 2008.Google ScholarGoogle Scholar
  14. H. Raghavan, J. Allan, and A. McCallum. An exploration of entity models, collective classification and relation description. In ACM LinkKDD, pages 1--10, 2004.Google ScholarGoogle Scholar
  15. S. Riise, D. Patel, and E. Stipp. Geographical Location Extraction. US Patent Application 20050108213, 2003.Google ScholarGoogle Scholar
  16. M. Sanderson and J. Kohler. Analyzing geographic queries. In ACM GIR, Sheffield, UK, 2004.Google ScholarGoogle Scholar
  17. S. Tong and D. Koller. Support vector machine active learning with applications to text classification. In Proceedings of ICML, pages 999--1006, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. L. Wang, C. Wang, X. Xie, J. Forman, Y. Lu, W.-Y. Ma, and Y. Li. Detecting dominant locations from search queries. In ACM SIGIR, pages 424--431, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. M. J. Welch and J. Cho. Automatically identifying localizable queries. In ACM SIGIR, pages 507--514, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Yu and G. Cai. A query-aware document ranking method for geographic information retrieval. In ACM GIR, pages 49--54, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad-hoc Information Retrieval. In ACM SIGIR, pages 334--342, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Z. Zhuang, C. Brunk, and C. L. Giles. Modeling and visualizing geo-sensitive queries based on user clicks. In ACM LocWeb, pages 73--76, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Discovering users' specific geo intention in web search

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader