Skip to main content
Top
Published in: GeoInformatica 2/2020

09-01-2020

Enhancing local live tweet stream to detect news

Authors: Hong Wei, Jagan Sankaranarayanan, Hanan Samet

Published in: GeoInformatica | Issue 2/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Twitter captures invaluable information about real-world news, spanning a wide scale from large national/international stories like a presidential election to small local stories such as a local farmers market. Detecting and extracting small news for a local place is a challenging problem and the focus of this work. The main challenge lies in identifying these small stories that correspond to a local area of interest, which are typically harder to detect compared to national stories in the sense that there may be just a handful of tweets about a local story. A system, called Firefly, is proposed that overcomes the data sparsity and captures thousands of local stories per day from a metropolitan area (e.g., Boston). The key idea lies in combining the enhancement of a local live tweet stream in Twitter, the identification of “locality-aware” keywords, and using these keywords to cluster tweets. Experiments show that the proposed system has a significantly higher recall over a set of representative local news agencies, and at the same time, outperforms the baseline approach TwitterStand. More importantly, the results also demonstrate that our system, by utilizing the enhanced local live tweet stream, discovers much more local news than the methods working only on geotagged tweets, i.e., those with embedded GPS coordinate values.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Sankaranarayanan J, Samet H, Teitler BE et al TwitterStand: news in tweets. SIGSPATIAL ’09 Sankaranarayanan J, Samet H, Teitler BE et al TwitterStand: news in tweets. SIGSPATIAL ’09
2.
go back to reference Compton R, Jurgens D, Allen D Geotagging one hundred million Twitter accounts with total variation minimization. volume abs/1404.7152 Compton R, Jurgens D, Allen D Geotagging one hundred million Twitter accounts with total variation minimization. volume abs/1404.7152
3.
go back to reference Kwan E, Hsu P-L, Liang J-H et al Event identification for social streams using keyword-based evolving graph sequences Kwan E, Hsu P-L, Liang J-H et al Event identification for social streams using keyword-based evolving graph sequences
4.
go back to reference Mathioudakis M, Koudas N TwitterMonitor: trend detection over the Twitter stream. SIGMOD ’10 Mathioudakis M, Koudas N TwitterMonitor: trend detection over the Twitter stream. SIGMOD ’10
5.
go back to reference Wei H, Sankaranarayanan J, Samet H Finding and tracking local Twitter users for news detection. SIGSPATIAL ’17 Wei H, Sankaranarayanan J, Samet H Finding and tracking local Twitter users for news detection. SIGSPATIAL ’17
6.
go back to reference Krumm J, Horvitz E Eyewitness: identifying local events via space-time signals in Twitter feeds. SIGSPATIAL ’15 Krumm J, Horvitz E Eyewitness: identifying local events via space-time signals in Twitter feeds. SIGSPATIAL ’15
7.
go back to reference Zhang C, Zhou G, Yuan Q et al GeoBurst: real-time local event detection in geo-tagged tweet streams. SIGIR ’16 Zhang C, Zhou G, Yuan Q et al GeoBurst: real-time local event detection in geo-tagged tweet streams. SIGIR ’16
8.
go back to reference Atefeh F, Khreich W (2015) A survey of techniques for event detection in Twitter. Comput Intell 31(1):132–164CrossRef Atefeh F, Khreich W (2015) A survey of techniques for event detection in Twitter. Comput Intell 31(1):132–164CrossRef
9.
go back to reference Abdelhaq H (2015) Localized events in social media streams: detection, tracking, and recommendation, Heidelberg University, PhD thesis Abdelhaq H (2015) Localized events in social media streams: detection, tracking, and recommendation, Heidelberg University, PhD thesis
10.
go back to reference Li Q, Nourbakhsh A, Shah S et al Real-time novel event detection from social media. ICDE ’17 Li Q, Nourbakhsh A, Shah S et al Real-time novel event detection from social media. ICDE ’17
11.
go back to reference Zhang C, Liu L, Lei D et al TrioVecEvent: embedding-based online local event detection in geo-tagged tweet streams. KDD ’17 Zhang C, Liu L, Lei D et al TrioVecEvent: embedding-based online local event detection in geo-tagged tweet streams. KDD ’17
12.
go back to reference Walther M, Kaisser M Geo-spatial event detection in the Twitter stream. ECIR ’13 Walther M, Kaisser M Geo-spatial event detection in the Twitter stream. ECIR ’13
13.
go back to reference Boettcher A, Lee D EventRadar: a real-time local event detection scheme using Twitter stream. GreenCom ’12 Boettcher A, Lee D EventRadar: a real-time local event detection scheme using Twitter stream. GreenCom ’12
14.
go back to reference Hong L, Ahmed A, Gurumurthy S et al Discovering geographical topics in the Twitter stream. WWW ’12 Hong L, Ahmed A, Gurumurthy S et al Discovering geographical topics in the Twitter stream. WWW ’12
15.
go back to reference Zhou X, Chen L Event detection over Twitter social media streams, vol 23 Zhou X, Chen L Event detection over Twitter social media streams, vol 23
16.
go back to reference Wei W, Joseph K, Lo W et al A Bayesian graphical model to discover latent events from Twitter. ICWSM ’15 Wei W, Joseph K, Lo W et al A Bayesian graphical model to discover latent events from Twitter. ICWSM ’15
17.
go back to reference Skovsgaard A, Sidlauskas D, Jensen CS Scalable top-k spatio-temporal term querying. ICDE ’14 Skovsgaard A, Sidlauskas D, Jensen CS Scalable top-k spatio-temporal term querying. ICDE ’14
18.
go back to reference Abdelhaq H, Sengstock C, Gertz M EvenTweet: online localized event detection from Twitter. Volume 6 of PVLDB ’13 Abdelhaq H, Sengstock C, Gertz M EvenTweet: online localized event detection from Twitter. Volume 6 of PVLDB ’13
19.
go back to reference Kamath KY, Caverlee J, Lee K et al Spatio-temporal dynamics of online memes: a study of geo-tagged tweets. WWW ’13 Kamath KY, Caverlee J, Lee K et al Spatio-temporal dynamics of online memes: a study of geo-tagged tweets. WWW ’13
20.
go back to reference Watanabe K, Ochi M, Okabe M et al Jasmine: a real-time local-event detection system based on geolocation information propagated to microblogs. CIKM ’11 Watanabe K, Ochi M, Okabe M et al Jasmine: a real-time local-event detection system based on geolocation information propagated to microblogs. CIKM ’11
21.
go back to reference Jonathan C, Magdy A, Mokbel MF et al GARNET: a holistic system approach for trending queries in microblogs. ICDE ’16 Jonathan C, Magdy A, Mokbel MF et al GARNET: a holistic system approach for trending queries in microblogs. ICDE ’16
22.
go back to reference Kang W, Tung AKH, Zhao F et al Interactive hierarchical tag clouds for summarizing spatiotemporal social contents. ICDE ’14 Kang W, Tung AKH, Zhao F et al Interactive hierarchical tag clouds for summarizing spatiotemporal social contents. ICDE ’14
23.
go back to reference Magdy A, Mokbel MF, Elnikety S et al Mercury: a memory-constrained spatio-temporal real-time search on microblogs. ICDE ’14 Magdy A, Mokbel MF, Elnikety S et al Mercury: a memory-constrained spatio-temporal real-time search on microblogs. ICDE ’14
24.
go back to reference Magdy A, Aly AM, Mokbel MF et al GeoTrend: spatial trending queries on real-time microblogs. SIGSPATIAL ’16 Magdy A, Aly AM, Mokbel MF et al GeoTrend: spatial trending queries on real-time microblogs. SIGSPATIAL ’16
25.
go back to reference Xu J-M, Bhargava A, Nowak R et al Socioscope: spatio-temporal signal recovery from social media. ECML PKDD ’12 Xu J-M, Bhargava A, Nowak R et al Socioscope: spatio-temporal signal recovery from social media. ECML PKDD ’12
26.
go back to reference Lappas T, Vieira MR, Gunopulos D et al On the spatiotemporal burstiness of terms. PVLDB ’12. VLDB endowment Lappas T, Vieira MR, Gunopulos D et al On the spatiotemporal burstiness of terms. PVLDB ’12. VLDB endowment
27.
go back to reference He Q, Chang K, Lim E-P Analyzing feature trajectories for event detection. SIGIR ’07 He Q, Chang K, Lim E-P Analyzing feature trajectories for event detection. SIGIR ’07
28.
go back to reference Budak C, Georgiou T, Agrawal D et al GeoScope: online detection of geo-correlated information trends in social networks. PVLDB ’13. VLDB endowment Budak C, Georgiou T, Agrawal D et al GeoScope: online detection of geo-correlated information trends in social networks. PVLDB ’13. VLDB endowment
29.
go back to reference Liu Z, Huang Y, Trampier JR LEDS: local event discovery and summarization from tweets. SIGSPATIAL ’16 Liu Z, Huang Y, Trampier JR LEDS: local event discovery and summarization from tweets. SIGSPATIAL ’16
30.
go back to reference Valkanas G, Gunopulos D How the live web feels about events. CIKM ’13 Valkanas G, Gunopulos D How the live web feels about events. CIKM ’13
31.
go back to reference Roller S, Speriosu M, Rallapalli S et al Supervised text-based geolocation using language models on an adaptive grid. EMNLP-CoNLL ’12 Roller S, Speriosu M, Rallapalli S et al Supervised text-based geolocation using language models on an adaptive grid. EMNLP-CoNLL ’12
32.
go back to reference Marcus A, Bernstein MS, Badar O et al Twitinfo: aggregating and visualizing microblogs for event exploration. CHI ’11 Marcus A, Bernstein MS, Badar O et al Twitinfo: aggregating and visualizing microblogs for event exploration. CHI ’11
33.
go back to reference Quezada M, Peña Araya V, Poblete B Location-aware model for news events in social media. SIGIR ’15 Quezada M, Peña Araya V, Poblete B Location-aware model for news events in social media. SIGIR ’15
34.
go back to reference Li R, Lei KH, Khadiwala R et al TEDAS: a Twitter-based event detection and analysis system. ICDE ’12 Li R, Lei KH, Khadiwala R et al TEDAS: a Twitter-based event detection and analysis system. ICDE ’12
35.
go back to reference Yamaguchi Y, Amagasa T, Kitagawa H Landmark-based user location inference in social media. COSN ’13 Yamaguchi Y, Amagasa T, Kitagawa H Landmark-based user location inference in social media. COSN ’13
36.
go back to reference Davis CA Jr, Pappa GL, de Oliveira DRR, et al. (2011) Inferring the location of Twitter messages based on user relationships. Trans GIS 15(6):735–751CrossRef Davis CA Jr, Pappa GL, de Oliveira DRR, et al. (2011) Inferring the location of Twitter messages based on user relationships. Trans GIS 15(6):735–751CrossRef
37.
go back to reference Sadilek A, Kautz H, Bigham JP Finding your friends and following them to where you are. WSDM ’12 Sadilek A, Kautz H, Bigham JP Finding your friends and following them to where you are. WSDM ’12
38.
go back to reference Chen Y, Zhao J, Hu X et al From interest to function: location estimation in social media. AAAI ’13. AAAI Press Chen Y, Zhao J, Hu X et al From interest to function: location estimation in social media. AAAI ’13. AAAI Press
39.
go back to reference Cheng Z, Caverlee J, Lee K You are where you tweet: a content-based approach to geo-locating Twitter users. CIKM ’10 Cheng Z, Caverlee J, Lee K You are where you tweet: a content-based approach to geo-locating Twitter users. CIKM ’10
40.
go back to reference Mahmud J, Nichols J, Drews C (2014) Home location identification of Twitter users. ACM TIST 5(3):47,1–47,21 Mahmud J, Nichols J, Drews C (2014) Home location identification of Twitter users. ACM TIST 5(3):47,1–47,21
41.
go back to reference Han B, Cook P, Baldwin T (2014) Text-based Twitter user geolocation prediction. J AIR 49(1):451–500 Han B, Cook P, Baldwin T (2014) Text-based Twitter user geolocation prediction. J AIR 49(1):451–500
42.
go back to reference Dalvi N, Kumar R, Pang B Object matching in tweets with spatial models. WSDM ’12 Dalvi N, Kumar R, Pang B Object matching in tweets with spatial models. WSDM ’12
43.
go back to reference Li G, Hu J, Feng J et al Effective location identification from microblogs. ICDE ’14 Li G, Hu J, Feng J et al Effective location identification from microblogs. ICDE ’14
44.
go back to reference Sakaki T, Okazaki M, Matsuo Y Earthquake shakes Twitter users: real-time event detection by social sensors Sakaki T, Okazaki M, Matsuo Y Earthquake shakes Twitter users: real-time event detection by social sensors
45.
go back to reference Weng J, Lee B-S Event detection in Twitter. ICWSM ’11 Weng J, Lee B-S Event detection in Twitter. ICWSM ’11
46.
go back to reference Albakour M-D, Macdonald C, Ounis I Identifying local events by using microblogs as social sensors Albakour M-D, Macdonald C, Ounis I Identifying local events by using microblogs as social sensors
47.
go back to reference Takhteyev Y, Gruzd A, Wellman B (2012) Geography of Twitter networks. Social Networks 34(1):73–81. Capturing context: integrating spatial and social network analysesCrossRef Takhteyev Y, Gruzd A, Wellman B (2012) Geography of Twitter networks. Social Networks 34(1):73–81. Capturing context: integrating spatial and social network analysesCrossRef
48.
go back to reference Mok D, Wellman B, Carrasco J (2010) Does distance matter in the age of the internet? Urban Stud 47(13):2747–2783CrossRef Mok D, Wellman B, Carrasco J (2010) Does distance matter in the age of the internet? Urban Stud 47(13):2747–2783CrossRef
49.
go back to reference Vardi Y, Zhang C-H (2000) The multivariate L1-median and associated data depth. Proc NAS 97(4):1423–1426CrossRef Vardi Y, Zhang C-H (2000) The multivariate L1-median and associated data depth. Proc NAS 97(4):1423–1426CrossRef
50.
go back to reference Zaharia M, Chowdhury M, Das T et al Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. NSDI ’12 Zaharia M, Chowdhury M, Das T et al Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. NSDI ’12
51.
go back to reference Dave A IndexedRDD: efficient fine-grained updates for RDDs Dave A IndexedRDD: efficient fine-grained updates for RDDs
52.
go back to reference Zaharia M, Chowdhury M, Franklin MJ et al Spark: cluster computing with working sets. HotCloud ’10 Zaharia M, Chowdhury M, Franklin MJ et al Spark: cluster computing with working sets. HotCloud ’10
53.
go back to reference Zaharia M, Das T, Li H et al Discretized streams: fault-tolerant streaming computation at scale. SOSP ’13 Zaharia M, Das T, Li H et al Discretized streams: fault-tolerant streaming computation at scale. SOSP ’13
54.
go back to reference McMinn AJ, Moshfeghi Y, Jose JM Building a large-scale corpus for evaluating event detection on Twitter McMinn AJ, Moshfeghi Y, Jose JM Building a large-scale corpus for evaluating event detection on Twitter
55.
go back to reference Zheng Y, Zhang H, Yu Y Detecting collective anomalies from multiple spatio-temporal datasets across different domains Zheng Y, Zhang H, Yu Y Detecting collective anomalies from multiple spatio-temporal datasets across different domains
Metadata
Title
Enhancing local live tweet stream to detect news
Authors
Hong Wei
Jagan Sankaranarayanan
Hanan Samet
Publication date
09-01-2020
Publisher
Springer US
Published in
GeoInformatica / Issue 2/2020
Print ISSN: 1384-6175
Electronic ISSN: 1573-7624
DOI
https://doi.org/10.1007/s10707-019-00392-9

Other articles of this Issue 2/2020

GeoInformatica 2/2020 Go to the issue