Skip to main content
Top

2017 | OriginalPaper | Chapter

Event Detection and Summarization Using Phrase Network

Authors : Sara Melvin, Wenchao Yu, Peng Ju, Sean Young, Wei Wang

Published in: Machine Learning and Knowledge Discovery in Databases

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Identifying events in real-time data streams such as Twitter is crucial for many occupations to make timely, actionable decisions. It is however extremely challenging because of the subtle difference between “events” and trending topics, the definitive rarity of these events, and the complexity of modern Internet’s text data. Existing approaches often utilize topic modeling technique and keywords frequency to detect events on Twitter, which have three main limitations: (1) supervised and semi-supervised methods run the risk of missing important, breaking news events; (2) existing topic/event detection models are base on words, while the correlations among phrases are ignored; (3) many previous methods identify trending topics as events. To address these limitations, we propose the model, PhraseNet, an algorithm to detect and summarize events from tweets. To begin, all topics are defined as a clustering of high-frequency phrases extracted from text. All trending topics are then identified based on temporal spikes of the phrase cluster frequencies. PhraseNet thus filters out high-confidence events from other trending topics using number of peaks and variance of peak intensity. We evaluate PhraseNet on a three month duration of Twitter data and show the both the efficiency and the effectiveness of our approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Agarwal, M.K., Ramamritham, K., Bhide, M.: Real time discovery of dense clusters in highly dynamic graphs: identifying real world events in highly dynamic environments. VLDB 5(10), 980–991 (2012) Agarwal, M.K., Ramamritham, K., Bhide, M.: Real time discovery of dense clusters in highly dynamic graphs: identifying real world events in highly dynamic environments. VLDB 5(10), 980–991 (2012)
2.
go back to reference Blondel, V.D., Guillaume, J.-L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech. Theor. Exp. 2008(10), P10008 (2008)CrossRef Blondel, V.D., Guillaume, J.-L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech. Theor. Exp. 2008(10), P10008 (2008)CrossRef
3.
go back to reference Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)CrossRef Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)CrossRef
4.
go back to reference Chua, F.C.T., Asur, S.: Automatic summarization of events from social media. In: ICWSM (2013) Chua, F.C.T., Asur, S.: Automatic summarization of events from social media. In: ICWSM (2013)
5.
go back to reference Du, N., Dai, H., Trivedi, R., Upadhyay, U., Gomez-Rodriguez, M., Song, L.: Recurrent marked temporal point processes: embedding event history to vector. In: KDD, pp. 1555–1564. ACM (2016) Du, N., Dai, H., Trivedi, R., Upadhyay, U., Gomez-Rodriguez, M., Song, L.: Recurrent marked temporal point processes: embedding event history to vector. In: KDD, pp. 1555–1564. ACM (2016)
6.
go back to reference El-Kishky, A., Song, Y., Wang, C., Voss, C.R., Han, J.: Scalable topical phrase mining from text corpora. VLDB 8(3), 305–316 (2014) El-Kishky, A., Song, Y., Wang, C., Voss, C.R., Han, J.: Scalable topical phrase mining from text corpora. VLDB 8(3), 305–316 (2014)
7.
go back to reference Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: SIGMOD, vol. 29, pp. 1–12. ACM (2000) Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: SIGMOD, vol. 29, pp. 1–12. ACM (2000)
8.
go back to reference Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: WWW, pp. 591–600. ACM (2010) Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: WWW, pp. 591–600. ACM (2010)
9.
go back to reference Li, C., Sun, A., Datta, A.: Twevent: segment-based event detection from tweets. In: CIKM, pp. 155–164. ACM (2012) Li, C., Sun, A., Datta, A.: Twevent: segment-based event detection from tweets. In: CIKM, pp. 155–164. ACM (2012)
10.
go back to reference Lin, C.X., Zhao, B., Mei, Q., Han, J.: PET: a statistical model for popular events tracking in social communities. In: KDD, pp. 929–938. ACM (2010) Lin, C.X., Zhao, B., Mei, Q., Han, J.: PET: a statistical model for popular events tracking in social communities. In: KDD, pp. 929–938. ACM (2010)
11.
go back to reference Mathioudakis, M., Koudas, N.: TwitterMonitor: trend detection over the Twitter stream. In: SIGMOD, pp. 1155–1158. ACM (2010) Mathioudakis, M., Koudas, N.: TwitterMonitor: trend detection over the Twitter stream. In: SIGMOD, pp. 1155–1158. ACM (2010)
12.
go back to reference Popescu, A.-M., Pennacchiotti, M.: Detecting controversial events from Twitter. In: CIKM, pp. 1873–1876. ACM (2010) Popescu, A.-M., Pennacchiotti, M.: Detecting controversial events from Twitter. In: CIKM, pp. 1873–1876. ACM (2010)
13.
go back to reference Popescu, A.-M., Pennacchiotti, M., Paranjpe, D.: Extracting events and event descriptions from Twitter. In: WWW, pp. 105–106. ACM (2011) Popescu, A.-M., Pennacchiotti, M., Paranjpe, D.: Extracting events and event descriptions from Twitter. In: WWW, pp. 105–106. ACM (2011)
14.
go back to reference Qin, Y., Zhang, Y., Zhang, M., Zheng, D.: Feature-rich segment-based news event detection on Twitter. In: IJCNLP, pp. 302–310 (2013) Qin, Y., Zhang, Y., Zhang, M., Zheng, D.: Feature-rich segment-based news event detection on Twitter. In: IJCNLP, pp. 302–310 (2013)
15.
go back to reference Ritter, A., Etzioni, O., Clark, S., et al.: Open domain event extraction from Twitter. In: KDD, pp. 1104–1112. ACM (2012) Ritter, A., Etzioni, O., Clark, S., et al.: Open domain event extraction from Twitter. In: KDD, pp. 1104–1112. ACM (2012)
16.
go back to reference Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: WWW, pp. 851–860. ACM (2010) Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: WWW, pp. 851–860. ACM (2010)
17.
go back to reference Saraf, P., Ramakrishnan, N.: EMBERS AutoGSR: automated coding of civil unrest events. In: KDD, pp. 599–608. ACM (2016) Saraf, P., Ramakrishnan, N.: EMBERS AutoGSR: automated coding of civil unrest events. In: KDD, pp. 599–608. ACM (2016)
18.
go back to reference Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment in Twitter events. J. Assoc. Inf. Sci. Technol. 62(2), 406–418 (2011)CrossRef Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment in Twitter events. J. Assoc. Inf. Sci. Technol. 62(2), 406–418 (2011)CrossRef
19.
go back to reference Xie, W., Zhu, F., Jiang, J., Lim, E.-P., Wang, K.: TopicSketch: real-time bursty topic detection from Twitter. TKDE 28(8), 2216–2229 (2016) Xie, W., Zhu, F., Jiang, J., Lim, E.-P., Wang, K.: TopicSketch: real-time bursty topic detection from Twitter. TKDE 28(8), 2216–2229 (2016)
20.
go back to reference Yu, W., Aggarwal, C.C., Wang, W.: Temporally factorized network modeling for evolutionary network analysis. In: WSDM, pp. 455–464. ACM (2017) Yu, W., Aggarwal, C.C., Wang, W.: Temporally factorized network modeling for evolutionary network analysis. In: WSDM, pp. 455–464. ACM (2017)
21.
go back to reference Zhao, L., Ye, J., Chen, F., Lu, C.-T., Ramakrishnan, N.: Hierarchical incomplete multi-source feature learning for spatiotemporal event forecasting. In: KDD, pp. 2085–2094. ACM (2016) Zhao, L., Ye, J., Chen, F., Lu, C.-T., Ramakrishnan, N.: Hierarchical incomplete multi-source feature learning for spatiotemporal event forecasting. In: KDD, pp. 2085–2094. ACM (2016)
Metadata
Title
Event Detection and Summarization Using Phrase Network
Authors
Sara Melvin
Wenchao Yu
Peng Ju
Sean Young
Wei Wang
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-71273-4_8

Premium Partner