Skip to main content
Erschienen in: World Wide Web 4/2020

05.05.2020

Yet another approach to understanding news event evolution

verfasst von: Shangwen Lv, Longtao Huang, Liangjun Zang, Wei Zhou, Jizhong Han, Songlin Hu

Erschienen in: World Wide Web | Ausgabe 4/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With information explosion on the Internet, only returning ranked documents by search engines cannot satisfy people’s requirements on news events understanding. A more intelligent news events search engine should not only retrieve all related documents about a specific event, but also provide a global view about how the event originates and evolves. In order to solve this challenge, two tasks, event news retrieval and eventline generation should be processed. For event news retrieval, existing approaches mainly focus on the document-level similarity to retrieve related news documents, while external knowledge is not effectively taken into consideration. To this end, we propose a similarity model named Event-Oriented Similarity combining the document-level with the knowledge-level similarity to retrieve news documents related to the specific event. For eventline generation, in order to outline the event structure more accurately, we construct an Event-Oriented Similarity Graph to represent the relationship among retrieved event news documents and develop a community detection algorithm to segment sub-events which are consequently chained into a cohesive eventline. Experimental results on real-world datasets demonstrate that the proposed approach outperforms existing methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Allan, J., Carbonell, J.G., Doddington, G., Yamron, J., Yang, Y.: Topic detection and tracking pilot study final report (1998) Allan, J., Carbonell, J.G., Doddington, G., Yamron, J., Yang, Y.: Topic detection and tracking pilot study final report (1998)
2.
Zurück zum Zitat Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J Mach Learn Res 3(Jan), 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J Mach Learn Res 3(Jan), 993–1022 (2003)MATH
3.
Zurück zum Zitat Chen, M.: Efficient vector representation for documents through corruption. arXiv:1707.02377 (2017) Chen, M.: Efficient vector representation for documents through corruption. arXiv:1707.​02377 (2017)
4.
Zurück zum Zitat Chen, Z., Zhang, X., Boedihardjo, A.P., Dai, J., Lu, C.: Multimodal storytelling via generative adversarial imitation learning. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, August 19-25, 2017. pp. 3967–3973. Melbourne, Australia. https://doi.org/10.24963/ijcai.2017/554 (2017) Chen, Z., Zhang, X., Boedihardjo, A.P., Dai, J., Lu, C.: Multimodal storytelling via generative adversarial imitation learning. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, August 19-25, 2017. pp. 3967–3973. Melbourne, Australia. https://​doi.​org/​10.​24963/​ijcai.​2017/​554 (2017)
5.
Zurück zum Zitat Daiber, J., Jakob, M., Hokamp, C., Mendes, P.N.: Improving efficiency and accuracy in multilingual entity extraction. In: I-SEMANTICS 2013 - 9Th International Conference on Semantic Systems, ISEM ‘13, September 4-6, 2013. pp. 121–124. Graz, Austria. https://doi.org/10.1145/2506182.2506198 (2013) Daiber, J., Jakob, M., Hokamp, C., Mendes, P.N.: Improving efficiency and accuracy in multilingual entity extraction. In: I-SEMANTICS 2013 - 9Th International Conference on Semantic Systems, ISEM ‘13, September 4-6, 2013. pp. 121–124. Graz, Austria. https://​doi.​org/​10.​1145/​2506182.​2506198 (2013)
6.
Zurück zum Zitat Hossain, M.S., Butler, P., Boedihardjo, A.P., Ramakrishnan, N.: Storytelling in entity networks to support intelligence analysts. In: The 18Th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ‘12, August 12-16, 2012. pp. 1375–1383. Beijing, China. https://doi.org/10.1145/2339530.2339742 (2012) Hossain, M.S., Butler, P., Boedihardjo, A.P., Ramakrishnan, N.: Storytelling in entity networks to support intelligence analysts. In: The 18Th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ‘12, August 12-16, 2012. pp. 1375–1383. Beijing, China. https://​doi.​org/​10.​1145/​2339530.​2339742 (2012)
7.
Zurück zum Zitat Hossain, M.S., Gresock, J., Edmonds, Y., Helm, R., Potts, M., Ramakrishnan, N.: Connecting the dots between pubmed abstracts. PloS One 7(1), e29509 (2012)CrossRef Hossain, M.S., Gresock, J., Edmonds, Y., Helm, R., Potts, M., Ramakrishnan, N.: Connecting the dots between pubmed abstracts. PloS One 7(1), e29509 (2012)CrossRef
8.
Zurück zum Zitat Huang, L.: Optimized event storyline generation based on mixture-event-aspect model. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013, 18-21 October 2013, Grand Hyatt Seattle, Seattle, Washington, USA, A meeting of SIGDAT, a Special Interest Group of the ACL. pp. 726–735. http://aclweb.org/anthology/D/D13/D13-1068.pdf (2013) Huang, L.: Optimized event storyline generation based on mixture-event-aspect model. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013, 18-21 October 2013, Grand Hyatt Seattle, Seattle, Washington, USA, A meeting of SIGDAT, a Special Interest Group of the ACL. pp. 726–735. http://​aclweb.​org/​anthology/​D/​D13/​D13-1068.​pdf (2013)
9.
Zurück zum Zitat Jo, Y., Hopcroft, J.E., Lagoze, C.: The Web of topics: discovering the topology of topic evolution in a corpus. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, March 28 - April 1, 2011. pp. 257–266. Hyderabad, India. https://doi.org/10.1145/1963405.1963444 (2011) Jo, Y., Hopcroft, J.E., Lagoze, C.: The Web of topics: discovering the topology of topic evolution in a corpus. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, March 28 - April 1, 2011. pp. 257–266. Hyderabad, India. https://​doi.​org/​10.​1145/​1963405.​1963444 (2011)
12.
Zurück zum Zitat Kuzey, E., Vreeken, J., Weikum, G.: A fresh look on knowledge bases: Distilling named events from news. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM 2014, November 3-7, 2014. pp. 1689–1698. Shanghai, China. https://doi.org/10.1145/2661829.2661984 (2014) Kuzey, E., Vreeken, J., Weikum, G.: A fresh look on knowledge bases: Distilling named events from news. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM 2014, November 3-7, 2014. pp. 1689–1698. Shanghai, China. https://​doi.​org/​10.​1145/​2661829.​2661984 (2014)
14.
Zurück zum Zitat Lee, P., Lakshmanan, L.V.S., Milios, E.E.: CAST: A context-aware story-teller for streaming social content. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM 2014, Shanghai, China, November 3-7, 2014. pp. 789–798. https://doi.org/10.1145/2661829.2661859 (2014) Lee, P., Lakshmanan, L.V.S., Milios, E.E.: CAST: A context-aware story-teller for streaming social content. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM 2014, Shanghai, China, November 3-7, 2014. pp. 789–798. https://​doi.​org/​10.​1145/​2661829.​2661859 (2014)
15.
Zurück zum Zitat Lin, C., Lin, C., Li, J., Wang, D., Chen, Y., Li, T.: Generating event storylines from microblogs. In: 21St ACM International Conference on Information and Knowledge Management, CIKM’12, Maui, HI, USA, October 29 - November 02, 2012. pp. 175–184. https://doi.org/10.1145/2396761.2396787 (2012) Lin, C., Lin, C., Li, J., Wang, D., Chen, Y., Li, T.: Generating event storylines from microblogs. In: 21St ACM International Conference on Information and Knowledge Management, CIKM’12, Maui, HI, USA, October 29 - November 02, 2012. pp. 175–184. https://​doi.​org/​10.​1145/​2396761.​2396787 (2012)
16.
Zurück zum Zitat Liu, B., Niu, D., Lai, K., Kong, L., Xu, Y.: Growing story forest online from massive breaking news. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM 2017, Singapore, November 06 - 10, 2017. pp. 777–785. https://doi.org/10.1145/3132847.3132852 (2017) Liu, B., Niu, D., Lai, K., Kong, L., Xu, Y.: Growing story forest online from massive breaking news. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM 2017, Singapore, November 06 - 10, 2017. pp. 777–785. https://​doi.​org/​10.​1145/​3132847.​3132852 (2017)
17.
Zurück zum Zitat Liu, Y., Lv, N., Luo, J., Yang, H.: Subtopic based topic evolution analysis. In: International Conference on Web Information Systems and Mining, 2009. WISM 2009. pp. 168–172. IEEE (2009) Liu, Y., Lv, N., Luo, J., Yang, H.: Subtopic based topic evolution analysis. In: International Conference on Web Information Systems and Mining, 2009. WISM 2009. pp. 168–172. IEEE (2009)
20.
Zurück zum Zitat Newman, M.E.: Analysis of weighted networks. Phys. Rev. E 70(5), 056131 (2004)CrossRef Newman, M.E.: Analysis of weighted networks. Phys. Rev. E 70(5), 056131 (2004)CrossRef
21.
Zurück zum Zitat Newman, M.E., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)CrossRef Newman, M.E., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)CrossRef
22.
Zurück zum Zitat Rosenberg, A., Hirschberg, J.: V-measure: a conditional entropy-based external cluster evaluation measure. In: EMNLP-CoNLL 2007, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, pp. 410–420. Prague, Czech Republic. http://www.aclweb.org/anthology/D07-1043 (2007) Rosenberg, A., Hirschberg, J.: V-measure: a conditional entropy-based external cluster evaluation measure. In: EMNLP-CoNLL 2007, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, pp. 410–420. Prague, Czech Republic. http://​www.​aclweb.​org/​anthology/​D07-1043 (2007)
23.
Zurück zum Zitat Rosvall, M., Axelsson, D., Bergstrom, C.T.: The map equation. The European Physical Journal Special Topics 178(1), 13–23 (2009)CrossRef Rosvall, M., Axelsson, D., Bergstrom, C.T.: The map equation. The European Physical Journal Special Topics 178(1), 13–23 (2009)CrossRef
24.
25.
Zurück zum Zitat Shahaf, D., Guestrin, C.: Connecting the dots between news articles. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, July 25-28, 2010. pp. 623–632. Washington, DC, USA. https://doi.org/10.1145/1835804.1835884 (2010) Shahaf, D., Guestrin, C.: Connecting the dots between news articles. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, July 25-28, 2010. pp. 623–632. Washington, DC, USA. https://​doi.​org/​10.​1145/​1835804.​1835884 (2010)
28.
Zurück zum Zitat Yamron, J., Carp, I., Gillick, L., Lowe, S., Van Mulbregt, P.: Event tracking and text segmentation via hidden markov models. In: 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, 1997. Proceedings, pp. 519–526. IEEE (1997) Yamron, J., Carp, I., Gillick, L., Lowe, S., Van Mulbregt, P.: Event tracking and text segmentation via hidden markov models. In: 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, 1997. Proceedings, pp. 519–526. IEEE (1997)
29.
Zurück zum Zitat Yan, R., Wan, X., Otterbacher, J., Kong, L., Li, X., Zhang, Y.: Evolutionary timeline summarization: a balanced optimization framework via iterative substitution. In: Proceeding of the 34Th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2011, July 25-29, 2011. pp. 745–754. Beijing, China. https://doi.org/10.1145/2009916.2010016 (2011) Yan, R., Wan, X., Otterbacher, J., Kong, L., Li, X., Zhang, Y.: Evolutionary timeline summarization: a balanced optimization framework via iterative substitution. In: Proceeding of the 34Th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2011, July 25-29, 2011. pp. 745–754. Beijing, China. https://​doi.​org/​10.​1145/​2009916.​2010016 (2011)
30.
Zurück zum Zitat Zhang, X., Guo, Z., Li, B.: An effective algorithm of news topic tracking. In: WRI Global Congress on Intelligent Systems, 2009. GCIS’09, vol. 3, pp. 510–513. IEEE (2009) Zhang, X., Guo, Z., Li, B.: An effective algorithm of news topic tracking. In: WRI Global Congress on Intelligent Systems, 2009. GCIS’09, vol. 3, pp. 510–513. IEEE (2009)
31.
Zurück zum Zitat Zhou, D., Xu, H., Dai, X., He, Y.: Unsupervised storyline extraction from news articles. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, 9-15 July 2016. pp. 3014–3021. New York, NY, USA. http://www.ijcai.org/Abstract/16/428 (2016) Zhou, D., Xu, H., Dai, X., He, Y.: Unsupervised storyline extraction from news articles. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, 9-15 July 2016. pp. 3014–3021. New York, NY, USA. http://​www.​ijcai.​org/​Abstract/​16/​428 (2016)
Metadaten
Titel
Yet another approach to understanding news event evolution
verfasst von
Shangwen Lv
Longtao Huang
Liangjun Zang
Wei Zhou
Jizhong Han
Songlin Hu
Publikationsdatum
05.05.2020
Verlag
Springer US
Erschienen in
World Wide Web / Ausgabe 4/2020
Print ISSN: 1386-145X
Elektronische ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-020-00818-7

Weitere Artikel der Ausgabe 4/2020

World Wide Web 4/2020 Zur Ausgabe