Skip to main content
Erschienen in: Knowledge and Information Systems 2/2019

16.05.2018 | Regular Paper

\(\hbox {NE}^2\): named event extraction engine

verfasst von: Swati Gupta, D. Patel

Erschienen in: Knowledge and Information Systems | Ausgabe 2/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Named event discovery using news headlines is an important problem with various applications in story telling, news event exploration, social media information fusion, etc. Named events are short phrases that represent the name of events like 2016 Rio Olympic Games, 2G Case, and Adarsh Society Scam. Existing work has largely focused on discovering events of named events using data mining and text mining techniques. However, the problem of discovering named event has not been addressed yet. In this paper, we present a system \(\hbox {NE}^{2}\) that uses pattern- based method to discover named events using news headlines. Along with named event, we also discover its categories, popular durations, popularity, and type of named events. Named events are categorized into candidate-level and high-level categories using URL information, and popular durations of named events are extracted using temporal information of news headlines. Our system generates 75,689 number of named events by analyzing 6.5 million news headlines. Out of 75,689 named events, 62,950 (82%) are categorized and popular duration are extracted for 73,288 (96.8%) number of named events. Based on performed experiments, our proposed system \(\hbox {NE}^{2}\) has 68% of accuracy for named events, 71.6% for named event’s category, and 78.4% for named event’s popular duration.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
2
Named events which reoccur after a particular time interval.
 
3
Named events which are popular for at least four months consecutively.
 
4
Named events which are neither recurrent or durative.
 
11
Star Wars, World Cup 2015, Bigg Boss 8, Bigg Boss 9, Cubs Game Day, World Cup 2014, Independence Day, Rio Olympics 2016.
 
12
Bihar Assembly Polls, Coal Scam Case, Delhi Election Results 2015, Conversion Row, Kerala Polls, Fathers day.
 
13
Saif Ali Khan Assault Case, Tapas Pal Case, Selfie Row, Bangkok Blast Case, Lok Saba Polls.
 
15
1000 headlines from each month.
 
Literatur
1.
Zurück zum Zitat Steinberger R, Pouliquen B, der Goot EV (2013) An introduction to the Europe Media Monitor family of applications. arXiv:1309.5290 Steinberger R, Pouliquen B, der Goot EV (2013) An introduction to the Europe Media Monitor family of applications. arXiv:​1309.​5290
2.
Zurück zum Zitat Mazumder S, Bishnoi B, Patel D (2014) News headlines: what they can tell us? I-CARE 2014. ACM, New YorkCrossRef Mazumder S, Bishnoi B, Patel D (2014) News headlines: what they can tell us? I-CARE 2014. ACM, New YorkCrossRef
3.
Zurück zum Zitat Keneshloo Y, Cadena J, Korkmaz G, Ramakrishnan N (2014) Detecting and forecasting domestic political crises: a graph-based approach. In: WebSci’14. ACM, New York, Ny, USA Keneshloo Y, Cadena J, Korkmaz G, Ramakrishnan N (2014) Detecting and forecasting domestic political crises: a graph-based approach. In: WebSci’14. ACM, New York, Ny, USA
4.
Zurück zum Zitat Kuzey E, Weikum G (2014) EVIN: building a knowledge base of events. In: WWW’14 companion. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland Kuzey E, Weikum G (2014) EVIN: building a knowledge base of events. In: WWW’14 companion. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland
5.
Zurück zum Zitat Li C, Sun A, Datta A (2012) Twevent: segment-based event detection from tweets. In: CIKM’12. ACM, New York, NY, USA Li C, Sun A, Datta A (2012) Twevent: segment-based event detection from tweets. In: CIKM’12. ACM, New York, NY, USA
6.
Zurück zum Zitat Alfonseca E, Pighin D, Garrido G (2013) HEADY: news headline abstraction through event pattern clustering. In proceedings of Association for Computer Linguistics (ACL- 2013), pp 1243–1253 Alfonseca E, Pighin D, Garrido G (2013) HEADY: news headline abstraction through event pattern clustering. In proceedings of Association for Computer Linguistics (ACL- 2013), pp 1243–1253
7.
Zurück zum Zitat Feng X, Huang L, Tang D, Ji H, Qin B, Liu T (2016) A language-independent neural network for event detection. In: Proceedings of ACL 2016, pp 66–71 Feng X, Huang L, Tang D, Ji H, Qin B, Liu T (2016) A language-independent neural network for event detection. In: Proceedings of ACL 2016, pp 66–71
8.
Zurück zum Zitat Gupta K, Mittal V, Bishnoi B, Maheshwari S, Patel D (2016) AcT: accuracy-aware crawling techniques for cloud-crawler, vol 19. Kluwer, Hingham Gupta K, Mittal V, Bishnoi B, Maheshwari S, Patel D (2016) AcT: accuracy-aware crawling techniques for cloud-crawler, vol 19. Kluwer, Hingham
9.
Zurück zum Zitat Jain N, Gupta S, Patel D (2016) E3: keyphrase based news event exploration engine. In: HT’16. ACM, New York, NY, USA Jain N, Gupta S, Patel D (2016) E3: keyphrase based news event exploration engine. In: HT’16. ACM, New York, NY, USA
10.
Zurück zum Zitat Rusu D, Hodson J, Kimball A (2014) Unsupervised techniques for extracting and clustering complex events in news. Association for Computational Linguistics, BaltimoreCrossRef Rusu D, Hodson J, Kimball A (2014) Unsupervised techniques for extracting and clustering complex events in news. Association for Computational Linguistics, BaltimoreCrossRef
11.
Zurück zum Zitat Strötgen J, Gertz M (2010) HeidelTime: high quality rule-based extraction and normalization of temporal expressions. In: SemEval’10. Association for Computational Linguistics, Stroudsburg, PA, USA Strötgen J, Gertz M (2010) HeidelTime: high quality rule-based extraction and normalization of temporal expressions. In: SemEval’10. Association for Computational Linguistics, Stroudsburg, PA, USA
12.
Zurück zum Zitat Ghoreishi SN, Sun A (2013) Predicting event-relatedness of popular queries. In: CIKM’13. ACM, New York, NY, USA Ghoreishi SN, Sun A (2013) Predicting event-relatedness of popular queries. In: CIKM’13. ACM, New York, NY, USA
13.
Zurück zum Zitat Drakengren T, Jonsson P (1997) Towards a complete classification of tractability in Allen’s algebra. In: IJCAI’97. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA Drakengren T, Jonsson P (1997) Towards a complete classification of tractability in Allen’s algebra. In: IJCAI’97. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
14.
Zurück zum Zitat Liu Z, Li P, Zheng Y, Sun M (2009) Clustering to find exemplar terms for keyphrase extraction. In: EMNLP’09. Association for Computational Linguistics, Stroudsburg, PA, USA Liu Z, Li P, Zheng Y, Sun M (2009) Clustering to find exemplar terms for keyphrase extraction. In: EMNLP’09. Association for Computational Linguistics, Stroudsburg, PA, USA
15.
Zurück zum Zitat Mihalcea R, Tarau P (2004) TextRank: bringing order into texts. Association for Computational Linguistics, Barcelona Mihalcea R, Tarau P (2004) TextRank: bringing order into texts. Association for Computational Linguistics, Barcelona
16.
17.
Zurück zum Zitat Witten IH, Paynter GW, Frank E, Gutwin C, Nevill-Manning CG (1999) KEA: practical automatic keyphrase extraction. In: DL’99. ACM, New York, NY, USA Witten IH, Paynter GW, Frank E, Gutwin C, Nevill-Manning CG (1999) KEA: practical automatic keyphrase extraction. In: DL’99. ACM, New York, NY, USA
18.
Zurück zum Zitat Naughton M, Naughton M, Kushmerick N, Carthy J (2006) Event extraction from heterogeneous news sources. In Proceedings of the AAAI Workshop on Event Extraction and Synthesis, pp 1–6 Naughton M, Naughton M, Kushmerick N, Carthy J (2006) Event extraction from heterogeneous news sources. In Proceedings of the AAAI Workshop on Event Extraction and Synthesis, pp 1–6
19.
Zurück zum Zitat Nguyen T, Phung D, Adams B, Venkatesh S (2013) Event extraction using behaviors of sentiment signals and burst structure in social media. Knowl Inf Syst 37:279–304CrossRef Nguyen T, Phung D, Adams B, Venkatesh S (2013) Event extraction using behaviors of sentiment signals and burst structure in social media. Knowl Inf Syst 37:279–304CrossRef
20.
Zurück zum Zitat Kanhabua N, Ngoc Nguyen T, Nejdl W (2015) Learning to detect event-related queries for web search. In: WWW’15 companion. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland Kanhabua N, Ngoc Nguyen T, Nejdl W (2015) Learning to detect event-related queries for web search. In: WWW’15 companion. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland
21.
Zurück zum Zitat Strötgen J, Gertz M (2012) Event-centric search and exploration in document collections. In: JCDL’12. ACM, New York, NY, USA Strötgen J, Gertz M (2012) Event-centric search and exploration in document collections. In: JCDL’12. ACM, New York, NY, USA
22.
Zurück zum Zitat Foley J, Bendersky M, Josifovski V (2015) Learning to extract local events from the web. In: SIGIR’15. ACM, New York, NY, USA Foley J, Bendersky M, Josifovski V (2015) Learning to extract local events from the web. In: SIGIR’15. ACM, New York, NY, USA
23.
Zurück zum Zitat Radinsky K, Davidovich S, Markovitch S (2011) Learning causality for news events prediction. In Proceedings of WWW Radinsky K, Davidovich S, Markovitch S (2011) Learning causality for news events prediction. In Proceedings of WWW
24.
Zurück zum Zitat Ritter A, Mausam, Etzioni O, Clark S (2012) Open domain event extraction from twitter. In: KDD’12. ACM, New York, NY, USA Ritter A, Mausam, Etzioni O, Clark S (2012) Open domain event extraction from twitter. In: KDD’12. ACM, New York, NY, USA
25.
Zurück zum Zitat Kuzey E, Vreeken J, Weikum G (2014) A fresh look on knowledge bases: distilling named events from news. In: CIKM’14. ACM, New York, NY, USA Kuzey E, Vreeken J, Weikum G (2014) A fresh look on knowledge bases: distilling named events from news. In: CIKM’14. ACM, New York, NY, USA
26.
Zurück zum Zitat Leban G, Fortuna B, Brank J, Grobelnik M (2014) Event registry: learning about world events from news. In: WWW’14 companion. ACM, New York, NY, USA Leban G, Fortuna B, Brank J, Grobelnik M (2014) Event registry: learning about world events from news. In: WWW’14 companion. ACM, New York, NY, USA
Metadaten
Titel
: named event extraction engine
verfasst von
Swati Gupta
D. Patel
Publikationsdatum
16.05.2018
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 2/2019
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-018-1208-8

Weitere Artikel der Ausgabe 2/2019

Knowledge and Information Systems 2/2019 Zur Ausgabe