Abstract
Towards computational journalism, we present FactWatcher, a system that helps journalists identify data-backed, attention-seizing facts which serve as leads to news stories. FactWatcher discovers three types of facts, including situational facts, one-of-the-few facts, and prominent streaks, through a unified suite of data model, algorithm framework, and fact ranking measure. Given an append-only database, upon the arrival of a new tuple, FactWatcher monitors if the tuple triggers any new facts. Its algorithms efficiently search for facts without exhaustively testing all possible ones. Furthermore, FactWatcher provides multiple features in striving for an end-to-end system, including fact ranking, fact-to-statement translation and keyword-based fact search.
- S. Cohen, J. T. Hamilton, and F. Turner. Computational journalism. Commun. ACM, 54(10):66--71, Oct. 2011. Google ScholarDigital Library
- S. Cohen, C. Li, J. Yang, and C. Yu. Computational journalism: A call to arms to database researchers. In CIDR, pages 148--151, 2011.Google Scholar
- X. Jiang, C. Li, P. Luo, M. Wang, and Y. Yu. Prominent streak discovery in sequence data. In KDD, pages 1280--1288, 2011. Google ScholarDigital Library
- A. Sultana, N. Hassan, C. Li, J. Yang, and C. Yu. Incremental discovery of prominent situational facts. In ICDE, pages 112--123, 2014.Google ScholarCross Ref
- Y. Wu, P. K. Agarwal, C. Li, J. Yang, and C. Yu. On one of the few objects. In KDD, pages 1487--1495, 2012. Google ScholarDigital Library
- G. Zhang, X. Jiang, P. Luo, M. Wang, and C. Li. Discovering general prominent streaks in sequence data. ACM TKDD, 8(2): 9:1--9:37, June 2014. Google ScholarDigital Library
Recommendations
Time-aware evidence ranking for fact-checking
AbstractTruth can vary over time. Fact-checking decisions on claim veracity should therefore take into account temporal information of both the claim and supporting or refuting evidence. In this work, we investigate the hypothesis that the ...
Mining query subtopics from search log data
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrievalMost queries in web search are ambiguous and multifaceted. Identifying the major senses and facets of queries from search log data, referred to as query subtopic mining in this paper, is a very important issue in web search. Through search log analysis, ...
Fact checks versus problematic content in search rankings: SEO effects and the question of Google’s content moderation
WEBSCI '24: Proceedings of the 16th ACM Web Science ConferenceThis study investigates the ranking of problematic content and fact-checks of that content in Google Web Search results, examining their competition. The analysis is based on over 825 URLs extracted from Google Search Engine results pages (SERP) using ...
Comments