skip to main content
10.1145/1557019.1557077acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Meme-tracking and the dynamics of the news cycle

Published:28 June 2009Publication History

ABSTRACT

Tracking new topics, ideas, and "memes" across the Web has been an issue of considerable interest. Recent work has developed methods for tracking topic shifts over long time scales, as well as abrupt spikes in the appearance of particular named entities. However, these approaches are less well suited to the identification of content that spreads widely and then fades over time scales on the order of days - the time scale at which we perceive news and events.

We develop a framework for tracking short, distinctive phrases that travel relatively intact through on-line text; developing scalable algorithms for clustering textual variants of such phrases, we identify a broad class of memes that exhibit wide spread and rich variation on a daily basis. As our principal domain of study, we show how such a meme-tracking approach can provide a coherent representation of the news cycle - the daily rhythms in the news media that have long been the subject of qualitative interpretation but have never been captured accurately enough to permit actual quantitative analysis. We tracked 1.6 million mainstream media sites and blogs over a period of three months with the total of 90 million articles and we find a set of novel and persistent temporal patterns in the news cycle. In particular, we observe a typical lag of 2.5 hours between the peaks of attention to a phrase in the news media and in blogs respectively, with divergent behavior around the overall peak and a "heartbeat"-like pattern in the handoff between news and blogs. We also develop and analyze a mathematical model for the kinds of temporal variation that the system exhibits.

Skip Supplemental Material Section

Supplemental Material

p497-leskovec.mp4

mp4

112 MB

References

  1. Supporting website: http://memetracker.orgGoogle ScholarGoogle Scholar
  2. L. Adamic and N. Glance. The political blogosphere and the 2004 U.S. election. Workshop on Link Discovery, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. E. Adar, L. Zhang, L. Adamic, R. Lukose. Implicit structure and dynamics of blogspace. Wks. Weblogging Ecosystem'04.Google ScholarGoogle Scholar
  4. R. Albert and A.-L. Barabási. Statistical mechanics of complex networks. Rev. of Modern Phys., 74:47--97, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  5. J. Allan (ed). Topic Detection and Tracking. Kluwer, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  6. L. Bennett. News: The Politics of Illusion. A. B. Longman (Classics in Political Science), seventh edition, 2006.Google ScholarGoogle Scholar
  7. D. Blei, J. Lafferty. Dynamic topic models. ICML, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, pages 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. Calinescu, H. Karloff, Y. Rabani. An improved approximation algorithm for multiway cut. JCSS 60(2000). Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. E. Dahlhaus, D. S. Johnson, C. H. Papadimitriou, P. D. Seymour, and M. Yannakakis. The complexity of multiterminal cuts. SIAM J. Comput., 23(4):864--894, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. E. Gabrilovich, S. Dumais, and E. Horvitz. Newsjunkie: Providing personalized newsfeeds via analysis ofinformation novelty. In WWW '04, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Gamon, S. Basu, D. Belenko, D. Fisher, M. Hurst, and A. C. Kanig. Blews: Using blogs to provide context for news articles. In ICWSM '08, 2008.Google ScholarGoogle Scholar
  13. N. Godbole, M. Srinivasaiah, and S. Skiena. Large-scale sentiment analysis for news and blogs. In ICWSM '07, 2007.Google ScholarGoogle Scholar
  14. D. Gruhl, D. Liben-Nowell, R. V. Guha, and A. Tomkins. Information diffusion through blogspace. In WWW '04, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Harsin. The rumour bomb: Theorising the convergence of new and old trendsin mediated U.S. politics. Southern Review: Communication, Politics and Culture,39(2006).Google ScholarGoogle Scholar
  16. S. Havre, B. Hetzler, L. Nowell. ThemeRiver: Visualizing theme changes over time. IEEE Symp. Info. Vis. 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Kleinberg. Bursty and hierarchical structure in streams. In KDD '02, pages 91--101, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Kot. Elements of Mathematical Ecology. Cambridge University Press, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  19. B. Kovach and T. Rosenstiel. Warp Speed: America in the Age of Mixed Media. Century Foundation Press, 1999.Google ScholarGoogle Scholar
  20. R. Kumar, J. Novak, P. Raghavan, and A. Tomkins. Structure and evolution of blogspace. CACM, 47(12):35--39, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Lacker and C. Peskin. Control of ovulation number in a model of ovarian follicularmaturation. In AMS Symposium on Mathematical Biology,pages 21--32, 1981.Google ScholarGoogle Scholar
  22. P.F. Lazarsfeld, B. Berelson, and H. Gaudet. The People's Choice. Duell, Sloan, and Pearce, 1944.Google ScholarGoogle Scholar
  23. J. Leskovec, M. McGlohon, C. Faloutsos, N. Glance, M. Hurst. Cascading behavior in large blog graphs. SDM'07.Google ScholarGoogle Scholar
  24. R. D. Malmgren, D. B. Stouffer, A. Motter, and L. A. N. Amaral. A poissonian explanation for heavy tails in e-mail communication. PNAS, to appear, 2008.Google ScholarGoogle Scholar
  25. J. Schmidt. Blogging practices: An analytical framework. Journal of Computer-Mediated Communication, 12(4), 2007.Google ScholarGoogle ScholarCross RefCross Ref
  26. J. Singer. The political j-blogger. Journalism, 6(2005).Google ScholarGoogle Scholar
  27. Spinn3r API. http://www.spinn3r.com. 2008.Google ScholarGoogle Scholar
  28. M. L. Stein, S. Paterno, and R. C. Burnett. Newswriter's Handbook: An Introduction to Journalism. Blackwell, 2006.Google ScholarGoogle Scholar
  29. A. Vazquez, J. G. Oliveira, Z. Deszo, K.-I. Goh, I. Kondor, and A.-L. Barabasi. Modeling bursts and heavy tails in human dynamics. Physical Review E, 73(036127), 2006.Google ScholarGoogle Scholar
  30. X. Wang and A. McCallum. Topics over time: a non-markov continuous-time model of topicaltrends. Proc. KDD, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. X. Wang, C. Zhai, X. Hu, R. Sproat. Mining correlated bursty topic patterns from coordinated textstreams. KDD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. F. Wu and B. Huberman. Novelty and collective attention. Proc. Natl. Acad. Sci. USA, 104, 2007.Google ScholarGoogle Scholar

Index Terms

  1. Meme-tracking and the dynamics of the news cycle

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
      June 2009
      1426 pages
      ISBN:9781605584959
      DOI:10.1145/1557019

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 June 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader