Published in: Social Network Analysis and Mining 1/2015

01.12.2015 | Original Article

Event detection, tracking, and visualization in Twitter: a mention-anomaly-based approach

Authors: Adrien Guille, Cécile Favre


Abstract

The ever-growing number of people using Twitter makes it a valuable source of timely information. However, detecting events in Twitter is difficult, because tweets that report interesting events are drowned out by a large volume of tweets on unrelated topics. Existing methods focus on the textual content of tweets and ignore the social aspect of Twitter. In this paper, we propose mention-anomaly-based event detection (MABED), a novel statistical method that relies solely on tweets and leverages the creation frequency of dynamic links (i.e., mentions) that users insert in tweets to detect significant events and estimate the magnitude of their impact on the crowd. MABED also differs from the literature in that it dynamically estimates the period of time during which each event is discussed, rather than assuming a predefined fixed duration for all events. The experiments we conducted on both English and French Twitter data show that the mention-anomaly-based approach leads to more accurate event detection and improved robustness in the presence of noisy Twitter content. Qualitatively, we find that MABED helps with the interpretation of detected events by providing clear textual descriptions and precise temporal descriptions. We also show how MABED can help in understanding users' interests. Furthermore, we describe three visualizations designed to support efficient exploration of the detected events.
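
To make the two central ideas of the abstract concrete (an anomaly in mention frequency, and an event period estimated from the data rather than fixed in advance), the following is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the abstract gives no formulas, so the anomaly is assumed here to be the observed count of mention-containing tweets with a given word in a time slice minus the count expected if the word were spread proportionally to overall mention volume, and the event period is assumed to be the contiguous run of slices with the largest total anomaly. The function names `mention_anomaly` and `max_sum_interval` and the sample counts are hypothetical.

```python
from typing import List, Tuple


def mention_anomaly(word_mentions: List[int], slice_mentions: List[int]) -> List[float]:
    """Observed minus expected mention-tweet counts for one word, per time slice.

    Assumption (not taken from the abstract): the expected count in a slice is
    the slice's share of all mention-containing tweets times the word's total.
    """
    total_word = sum(word_mentions)
    total_all = sum(slice_mentions)
    if total_all == 0:
        raise ValueError("no mention-containing tweets in the corpus")
    return [
        observed - total_word * n_slice / total_all
        for observed, n_slice in zip(word_mentions, slice_mentions)
    ]


def max_sum_interval(values: List[float]) -> Tuple[int, int, float]:
    """Return (start, end, total) of the contiguous run with the largest sum.

    Single linear scan (Kadane-style); indices are inclusive.
    """
    if not values:
        raise ValueError("values must be non-empty")
    best_sum, best_start, best_end = values[0], 0, 0
    cur_sum, cur_start = values[0], 0
    for i in range(1, len(values)):
        if cur_sum < 0:
            # A negative running total can only hurt: restart at slice i.
            cur_sum, cur_start = values[i], i
        else:
            cur_sum += values[i]
        if cur_sum > best_sum:
            best_sum, best_start, best_end = cur_sum, cur_start, i
    return best_start, best_end, best_sum


# Hypothetical counts for five time slices of equal overall volume.
word_counts = [1, 1, 8, 9, 1]             # mention-tweets containing the word
slice_counts = [100, 100, 100, 100, 100]  # all mention-tweets per slice

anomaly = mention_anomaly(word_counts, slice_counts)
start, end, magnitude = max_sum_interval(anomaly)
print(anomaly, (start, end, magnitude))
```

With these made-up counts the word is expected 4 times per slice, so the anomaly is positive only in the third and fourth slices, and those two slices form the estimated event period. Under the same assumption, the summed anomaly over that period can be read as a rough proxy for the magnitude of impact.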


Metadata
Title
Event detection, tracking, and visualization in Twitter: a mention-anomaly-based approach
Authors
Adrien Guille
Cécile Favre
Publication date
01.12.2015
Publisher
Springer Vienna
Published in
Social Network Analysis and Mining / Issue 1/2015
Print ISSN: 1869-5450
Electronic ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-015-0258-0
