skip to main content
10.1145/2505515.2505655acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Estimating document focus time

Published:27 October 2013Publication History

ABSTRACT

Temporality is an important characteristic of text documents. While some documents are clearly atemporal, many have temporal character and can be mapped to certain time periods. In this paper, we introduce the problem of estimating focus time of documents. Document focus time is defined as the time to which the content of a document refers to and is considered as a complementary dimension to its creation time or timestamp. We propose several estimators of focus time by utilizing external knowledge bases such as news article collections which contain explicit temporal references. We then evaluate the effectiveness of our methods on diverse datasets of documents about historical events in five countries.

References

  1. Alonso, O. et al. Temporal Information Retrieval: Challenges and Opportunities. In TWAW 2011, pp. 1--8Google ScholarGoogle Scholar
  2. Arikan, I. Bedathur, S.J. and Berberich, K. Time Will Tell: Leveraging Temporal Expressions in IR. In WSDM 2009Google ScholarGoogle Scholar
  3. Au Yeung, C.-M. and Jatowt, A., Studying how the Past is Remembered: Towards Computational History through Large Scale Text Mining. In CIKM 2011, pp. 1231--1240 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Berberich, K., Bedathur, S.J., Alonso, O. and Weikum, G. A Language Modeling Approach for Temporal Information Needs. In ECIR 2010, pp. 13--25, 2010 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Campos, R., Dias, G., Jorge, A. M., and Nunes, C. GTE: A Distributional Second-Order Co-Occurrence Approach to Improve the Identification of Top Relevant Dates. In CIKM 2012, 2035--2039 Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cavnar, W. B. and Trenkle, J. M. N-Gram-Based Text Categorization. In SDAIR 1994, pp. 161--175Google ScholarGoogle Scholar
  7. Jones, R., and Diaz, F. Temporal Profiles of Queries. In TOIS: ACM Transactions on Information Systems, 25(3), 2007 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Jong de, F.M.G. and Rode, H. and Hiemstra, D. Temporal Language Models for the Disclosure of Historical Text. In AHC'05, pp. 161--168Google ScholarGoogle Scholar
  9. Kanhabua, N., and Nørvåg, K. Determining Time of Queries for Re-ranking Search Results. In ECDL 2010, pp. 261--272, 2010 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kanhabua, N., and Nørvåg, K. Using Temporal Language Models for Document Dating, In MLKDD 2009, pp. 738--741, 2009 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kerr, G. Timeline of World History, Canary Press, 2011Google ScholarGoogle Scholar
  12. Mani, I., and Wilson, G. Robust Temporal Processing of News. In ACL 2000, pp. 69--76, 2000 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Manning, C. and Schütze, H. Foundations of Statistical Natural Language Processing, MIT Press, 1999 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Metzler, D., Jones, R., Peng, F., and Zhang, R. Improving Search Relevance for Implicitly Temporal Queries. In SIGIR 2009, 700--701 Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Michel, J.-B. et al. Quantitative Analysis of Culture Using Millions of Digitized Books. Science, 331(6014), pp. 176--182, 2011Google ScholarGoogle ScholarCross RefCross Ref
  16. Mihalcea, R., and Tarau, P. Textrank: Bringing Order into Text. In EMNLP 2004, pp. 404--411. 2004Google ScholarGoogle Scholar
  17. Ratnikas, A. Timelines of History, 2012 (Kindle edition)Google ScholarGoogle Scholar
  18. Strötgen, J. and Gertz, M. TimeTrails: a system for exploring spatio-temporal information in documents. In VLDB 2010, pp. 1569--1572Google ScholarGoogle Scholar
  19. Strötgen, J. Alonso, O. and Gertz, M. Identification of top relevant temporal expressions in documents. In TempWeb 2012, pp. 33--40 Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Sheather, S.J. Density Estimation. Statistical Science. Vol. 19, Number 4, pp. 588--597, 2004Google ScholarGoogle Scholar

Index Terms

  1. Estimating document focus time

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management
        October 2013
        2612 pages
        ISBN:9781450322638
        DOI:10.1145/2505515

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 October 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        CIKM '13 Paper Acceptance Rate143of848submissions,17%Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader