skip to main content
10.1145/3003464.3003469acmotherconferencesArticle/Chapter ViewAbstractPublication PagesgirConference Proceedingsconference-collections
research-article

Refining imprecise spatio-temporal events: a network-based approach

Published:31 October 2016Publication History

ABSTRACT

Events as composites of temporal, spatial and actor information are a central object of interest in many information retrieval (IR) scenarios. There are several challenges to such event-centric IR, which range from the detection and extraction of geographic, temporal and actor mentions in documents to the construction of event descriptions as triples of locations, dates, and actors that can support event query scenarios. For the latter challenge, existing approaches fall short when dealing with imprecise event components. For example, if the exact location or date is unknown, existing IR methods are often unaware of different granularity levels and the conceptual proximity of dates or locations.

To address these problems, we present a framework that efficiently answers imprecise event queries, whose geographic or temporal component is given only at a coarse granularity level. Our approach utilizes a network-based event model that includes location, date, and actor components that are extracted from large document collections. Instances of entity and event mentions in the network are weighted based on both their frequency of occurrence and textual distance to reflect semantic relatedness. We demonstrate the utility and flexibility of our approach for evaluating imprecise event queries based on a large collection of events extracted from the English Wikipedia for a ground truth of news events.

References

  1. H. Abdelhaq, M. Gertz, and A. Armiti. Efficient Online Extraction of Keywords for Localized Events in Twitter. GeoInformatica, 2016 (online April 2016).Google ScholarGoogle Scholar
  2. A. Abujabal and K. Berberich. Important Events in the Past, Present, and Future. In TempWeb, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. B. Adams and M. Gahegan. Exploratory Chronotopic Data Analysis. In GIScience, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  4. C. C. Aggarwal and K. Subbian. Event Detection in Social Streams. In SDM, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  5. J. Allan. Topic Detection and Tracking: Event-Based Information Organization, volume 12. Springer Science & Business Media, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Allan, R. Papka, and V. Lavrenko. On-line New Event Detection and Tracking. In SIGIR, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Blanco and C. Lioma. Graph-based Term Weighting for Information Retrieval. Information Retrieval, 15(1), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T. Brants, F. Chen, and A. Farahat. A System for New Event Detection. In SIGIR, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Ceroni, U. Gadiraju, J. Matschke, S. Wingert, and M. Fisichella. Where the Event Lies: Predicting Event Occurrence in Textual Documents. In SIGIR, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. W. Feng, C. Zhang, W. Zhang, J. Han, J. Wang, C. Aggarwal, and J. Huang. STREAMCUBE: Hierarchical Spatio-temporal Hashtag Clustering for Event Exploration Over the Twitter Stream. In ICDE, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  11. J. R. Finkel, T. Grenager, and C. D. Manning. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In ACL, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Foley, M. Bendersky, and V. Josifovski. Learning to Extract Local Events from the Web. In SIGIR, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Geiß, A. Spitz, and M. Gertz. Beyond Friendships and Followers: The Wikipedia Social Network. In ASONAM, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Geiß, A. Spitz, J. Strötgen, and M. Gertz. The Wikipedia Location Network: Overcoming Borders and Oceans. In GIR, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Julinda, C. Boden, and A. Akbik. Extracting a Repository of Events and Event References from News Clusters. In AHA! Workshop on Information Discovery in Text, 2014.Google ScholarGoogle Scholar
  16. N. Kanhabua and W. Nejdl. On the Value of Temporal Anchor Texts in Wikipedia. In TAIA, 2014.Google ScholarGoogle Scholar
  17. A. Marcus, M. S. Bernstein, O. Badar, D. R. Karger, S. Madden, and R. C. Miller. TwitInfo: Aggregating and Visualizing Microblogs for Event Exploration. In CHI, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Mishra and K. Berberich. Event Digest: A Holistic View on Past Events. In SIGIR, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Mishra, D. Milchevski, and K. Berberich. Linking Wikipedia Events to Past News. In TAIA, 2014.Google ScholarGoogle Scholar
  20. S. Nepomnyachiy, B. Gelley, W. Jiang, and T. Minkus. What, Where, and When: Keyword Search With Spatio-temporal Ranges. In GIR, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. F. Rousseau and M. Vazirgiannis. Graph-of-Word and TW-IDF: New Approach to Ad Hoc IR. In CIKM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. In WWW, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. D. Sarma, A. Jain, and C. Yu. Dynamic Relationship and Event Discovery. In WSDM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. Schmidt, J. Hoffart, D. Milchevski, and G. Weikum. Context-Sensitive Auto-Completion for Searching with Entities and Categories. In SIGIR, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. R. W. Sinnott. Virtues of the Haversine. Sky and Telescope, 68(2):158, 1984.Google ScholarGoogle Scholar
  26. A. Spitz, V. Dixit, L. Richter, M. Gertz, and J. Geiß. State of the Union: A Data Consumer's Perspective on Wikidata and Its Properties for the Classification and Resolution of Entities. In Wiki Workshop at ICWSM, 2016.Google ScholarGoogle Scholar
  27. A. Spitz, J. Geiß, and M. Gertz. So Far Away and Yet so Close: Augmenting Toponym Disambiguation and Similarity with Text-based Networks. In GeoRich Workshop at SIGMOD, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Spitz and M. Gertz. Terms over LOAD: Leveraging Named Entities for Cross-Document Extraction and Summarization of Events. In SIGIR, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. Strötgen and M. Gertz. Event-centric Search and Exploration in Document Collections. In JCDL, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. J. Strötgen and M. Gertz. Multilingual and cross-domain temporal tagging. Language Resources and Evaluation, 47(2), 2013.Google ScholarGoogle Scholar
  31. J. Strötgen and M. Gertz. Proximity2-aware Ranking for Textual, Temporal, and Geographic Queries. In CIKM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. D. Vrandečić and M. Krötzsch. Wikidata: A Free Collaborative Knowledgebase. Communications of the ACM, 57(10), 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Refining imprecise spatio-temporal events: a network-based approach

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          GIR '16: Proceedings of the 10th Workshop on Geographic Information Retrieval
          October 2016
          68 pages
          ISBN:9781450345880
          DOI:10.1145/3003464

          Copyright © 2016 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 31 October 2016

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          GIR '16 Paper Acceptance Rate9of12submissions,75%Overall Acceptance Rate46of61submissions,75%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader