Skip to main content
Top

2021 | OriginalPaper | Chapter

Summarization of Twitter Events with Deep Neural Network Pre-trained Models

Authors : Kunal Chakma, Amitava Das, Swapan Debbarma

Published in: Information Management and Big Data

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Due to the proliferation of online social media services such as Twitter, there is an upsurge in the volume of user-generated textual content. Such voluminous content is difficult to be consumed by users. Therefore, the development of technological solutions to automatically summarise the voluminous texts are essential. The work presented in this paper reports on the development of automatically generating abstractive summaries from a collection of texts from Twitter. Our proposed approach is a two-stage framework which includes: 1) Event detection by clustering and 2) Summarization of the events. We first generated a contextualized vector representation of the tweets and then applied different clustering techniques on the vectors. We evaluated the generated clusters, and based on the evaluation; we chose the best one found suitable for the summarization task. For the summarization task, we used the pre-trained models of two recently developed state-of-the-art deep neural network architectures and evaluated them on the event clusters. Standard measures of ROUGE scores have been used for evaluating the summaries. We obtained best ROUGE-1 score of 46%, ROUGE-2 score of 30%, ROUGE-L score of 41% and ROUGE-SU score of 23% from our experiments.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Manuel, J., Moreno, T.: Automatic Text Summarization. Wiley (2014) Manuel, J., Moreno, T.: Automatic Text Summarization. Wiley (2014)
2.
go back to reference Hasan, M., Orgun, M.A., Schwitter, R.: A survey on real-time event detection from the Twitter data stream. Inf. Sci. 44(4), 443–463 (2017)CrossRef Hasan, M., Orgun, M.A., Schwitter, R.: A survey on real-time event detection from the Twitter data stream. Inf. Sci. 44(4), 443–463 (2017)CrossRef
3.
go back to reference Nallapati, R. Zhou, B., Santos, C.D., Gulcehre, C., Xiang, B.: Abstractive text summarization using sequence-to-sequence RNNs and beyond. Comput. Nat. Lang. Learn. (2016) Nallapati, R. Zhou, B., Santos, C.D., Gulcehre, C., Xiang, B.: Abstractive text summarization using sequence-to-sequence RNNs and beyond. Comput. Nat. Lang. Learn. (2016)
4.
go back to reference Cheng, J., Lapata, M.: Neural summarization by extracting sentences and words. In: Proceedings of the \(54^{th}\) Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 484–494 (2016) Cheng, J., Lapata, M.: Neural summarization by extracting sentences and words. In: Proceedings of the \(54^{th}\) Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 484–494 (2016)
5.
go back to reference Zhou, Q., Yang, N., Wei, F., Huang, S., Zhou, M., Zhao, T.: Neural document summarization by jointly learning to score and select sentences. In: Proceedings of the \(56^{th}\) Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 654–663 (2018) Zhou, Q., Yang, N., Wei, F., Huang, S., Zhou, M., Zhao, T.: Neural document summarization by jointly learning to score and select sentences. In: Proceedings of the \(56^{th}\) Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 654–663 (2018)
6.
go back to reference See, A., Liu, P.J., Manning, C.D.: Get to the point: summarization with pointer-generator networks. In: Proceedings of the \(55^{th}\) Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1073–1083 (2017) See, A., Liu, P.J., Manning, C.D.: Get to the point: summarization with pointer-generator networks. In: Proceedings of the \(55^{th}\) Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1073–1083 (2017)
7.
go back to reference Vaswani, A., et al.: Attention is all you need. In: CoRR, (abs/1706.03762) (2017) Vaswani, A., et al.: Attention is all you need. In: CoRR, (abs/1706.03762) (2017)
8.
go back to reference Liu, Y., Lapata, M.: Text summarization with pretrained encoders. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 3730–3740 (2019) Liu, Y., Lapata, M.: Text summarization with pretrained encoders. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 3730–3740 (2019)
9.
go back to reference Erkan, G., Radev, D.R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. 22, 457–479 (2004) Erkan, G., Radev, D.R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. 22, 457–479 (2004)
10.
go back to reference Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings \(24^{th}\) Annual International ACM SIGIR Conference Research and Development Information Retrival, September, pp. 19–25 (2001) Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings \(24^{th}\) Annual International ACM SIGIR Conference Research and Development Information Retrival, September, pp. 19–25 (2001)
12.
go back to reference Radev, D.R., Hovy, E., McKeown, K.: Introduction to the special issue on summarization. Comput. Linguis. 28(4), 399–408 (2002)CrossRef Radev, D.R., Hovy, E., McKeown, K.: Introduction to the special issue on summarization. Comput. Linguis. 28(4), 399–408 (2002)CrossRef
13.
go back to reference Nenkova, A., Vanderwende, L.: The impact of frequency on summarization. Microsoft Res., Redmond, Washington, DC, USA, Technical Report. MSR-TR-2005, vol. 101 (2005) Nenkova, A., Vanderwende, L.: The impact of frequency on summarization. Microsoft Res., Redmond, Washington, DC, USA, Technical Report. MSR-TR-2005, vol. 101 (2005)
14.
go back to reference He, Z., et al.: Document summarization based on data reconstruction. In: Proceedings \(26^{th}\) AAAI Conference Artificial Intelligence, July, pp. 620–626 (2012) He, Z., et al.: Document summarization based on data reconstruction. In: Proceedings \(26^{th}\) AAAI Conference Artificial Intelligence, July, pp. 620–626 (2012)
15.
go back to reference Rudra, K., Ghosh, S., Ganguly, N., Goyal, P., Ghosh, S.: Extracting situational information from microblogs during disaster events: a classification-summarization approach. In: Proceedings of \(24^{th}\) ACM International Conference Information Knowledge Management, October, pp. 583–592 (2015) Rudra, K., Ghosh, S., Ganguly, N., Goyal, P., Ghosh, S.: Extracting situational information from microblogs during disaster events: a classification-summarization approach. In: Proceedings of \(24^{th}\) ACM International Conference Information Knowledge Management, October, pp. 583–592 (2015)
16.
go back to reference Inouye, D.: Multiple Post Microblog Summarization, University of Colorado at Colorado Springs (2010) Inouye, D.: Multiple Post Microblog Summarization, University of Colorado at Colorado Springs (2010)
17.
go back to reference Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms, New Orleans, LA, pp. 1027–1035 (2007) Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms, New Orleans, LA, pp. 1027–1035 (2007)
18.
go back to reference Sharifi, B., Hutton, M.-A., Kalita, J.: Summarizing microblogs automatically. In: Human Language Technologies: the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, HLT, pp. 685–688 (2010) Sharifi, B., Hutton, M.-A., Kalita, J.: Summarizing microblogs automatically. In: Human Language Technologies: the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, HLT, pp. 685–688 (2010)
19.
go back to reference Beverungen, G., Kalita, J.: Evaluating methods for summarizing Twitter posts. In: Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM), Hong Kong, China, pp. 1–6 (2011) Beverungen, G., Kalita, J.: Evaluating methods for summarizing Twitter posts. In: Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM), Hong Kong, China, pp. 1–6 (2011)
20.
go back to reference Tibshirani, R., Walther, G., Hastie, T. : Estimating the number of clusters in a data set via the gap statistic. J. Royal Stat. Soc. Ser B (Stat. Methodol.) 63(2), 411–423 (2001) Tibshirani, R., Walther, G., Hastie, T. : Estimating the number of clusters in a data set via the gap statistic. J. Royal Stat. Soc. Ser B (Stat. Methodol.) 63(2), 411–423 (2001)
21.
go back to reference Kaufmann, M.: Syntactic normalization of Twitter messages. In: Proceedings of International Conference on Natural Language Processing (ICON), Kharagpur, India (2010) Kaufmann, M.: Syntactic normalization of Twitter messages. In: Proceedings of International Conference on Natural Language Processing (ICON), Kharagpur, India (2010)
22.
go back to reference Perez-Tellez, F., Pinto, D., Cardiff, J., Rosso, P.: On the difficulty of clustering company tweets. In: Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents, Toronto, Canada, pp. 92–102 (2010) Perez-Tellez, F., Pinto, D., Cardiff, J., Rosso, P.: On the difficulty of clustering company tweets. In: Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents, Toronto, Canada, pp. 92–102 (2010)
23.
go back to reference Shou, L., Wang, Z., Chen, K., Chen, G.: Sumblr: continuous summarization of evolving tweet streams. In: Proceedings \(36^{th}\) Int. ACM SIGIR Conference Research Development Information Retrival, August, pp. 533–542 (2013) Shou, L., Wang, Z., Chen, K., Chen, G.: Sumblr: continuous summarization of evolving tweet streams. In: Proceedings \(36^{th}\) Int. ACM SIGIR Conference Research Development Information Retrival, August, pp. 533–542 (2013)
24.
go back to reference Judd, J., Kalita, J.: Better Twitter summaries? In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Atlanta, GA, pp. 445–449 (2013) Judd, J., Kalita, J.: Better Twitter summaries? In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Atlanta, GA, pp. 445–449 (2013)
25.
go back to reference Nichols, J., Mahmud, J., Drews, C.: Summarizing sporting events using Twitter. In: Proceedings of the ACM International Conference on Intelligent User Interfaces, New York, NY, pp. 189–198 (2012) Nichols, J., Mahmud, J., Drews, C.: Summarizing sporting events using Twitter. In: Proceedings of the ACM International Conference on Intelligent User Interfaces, New York, NY, pp. 189–198 (2012)
26.
go back to reference Harabagiu, S., Hickl, A.: Relevance modeling for microblog summarization. In: Proceedings of the 5th International Conference on Weblogs and Social Media (ICWSM), Barcelona, Spain, pp. 514–517 (2011) Harabagiu, S., Hickl, A.: Relevance modeling for microblog summarization. In: Proceedings of the 5th International Conference on Weblogs and Social Media (ICWSM), Barcelona, Spain, pp. 514–517 (2011)
27.
go back to reference Garg, N., Favre, B., Reidhammer, K., Hakkani-Tur, D.: ClusterRank: a graph based method for meeting summarization. In: Proceedings of \(10^{th}\) Annual Conference of International Speech Communication, pp. 1499–1502 (2009) Garg, N., Favre, B., Reidhammer, K., Hakkani-Tur, D.: ClusterRank: a graph based method for meeting summarization. In: Proceedings of \(10^{th}\) Annual Conference of International Speech Communication, pp. 1499–1502 (2009)
28.
go back to reference Mihalcea, R., Tarau, P.: Textrank: bringing order into texts. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, pp. 404–411 (2004) Mihalcea, R., Tarau, P.: Textrank: bringing order into texts. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, pp. 404–411 (2004)
29.
go back to reference Yang, X., Ghoting, A., Ruan, Y., Parthasarathy, S.: A framework for summarizing and analyzing twitter feeds. In: Proceedings of the \(18^{th}\) ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012, pp. 370–378. ACM, New York (2012) Yang, X., Ghoting, A., Ruan, Y., Parthasarathy, S.: A framework for summarizing and analyzing twitter feeds. In: Proceedings of the \(18^{th}\) ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012, pp. 370–378. ACM, New York (2012)
30.
go back to reference Wang, Z., Shou, L., Chen, K., Chen, G., Mehrotra, S.: On summarization and timeline generation for evolutionary tweet streams. IEEE Trans. Knowl. Data Eng. 27(5), 1301–1315 (2015)CrossRef Wang, Z., Shou, L., Chen, K., Chen, G., Mehrotra, S.: On summarization and timeline generation for evolutionary tweet streams. IEEE Trans. Knowl. Data Eng. 27(5), 1301–1315 (2015)CrossRef
31.
go back to reference Aggarwal, C.C., Han, J., Wang, J., Yu, P.S.: A framework for clustering evolving data streams. In: Proceedings of the 29th International Conference on Very Large Data Bases, Berlin, Germany, pp. 81–92 (2003) Aggarwal, C.C., Han, J., Wang, J., Yu, P.S.: A framework for clustering evolving data streams. In: Proceedings of the 29th International Conference on Very Large Data Bases, Berlin, Germany, pp. 81–92 (2003)
32.
go back to reference Niu, J., Zhao, Q., Chen, H., Atiquzzaman, M., Peng, F. : OnSeS: a novel online short text summarization based on BM25 and neural network. In: IEEE Global Communications Conference (GLOBECOM), Washington, DC, pp. 1–6 (2016) Niu, J., Zhao, Q., Chen, H., Atiquzzaman, M., Peng, F. : OnSeS: a novel online short text summarization based on BM25 and neural network. In: IEEE Global Communications Conference (GLOBECOM), Washington, DC, pp. 1–6 (2016)
33.
go back to reference Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, vol. 26 (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, vol. 26 (2013)
35.
go back to reference \(Do\breve{g}an\), E., Kaya, B.: Text summarization in social networks by using deep learning. In: 1st International Informatics and Software Engineering Conference (UBMYK), Ankara, Turkey. pp. 1–5 (2019) \(Do\breve{g}an\), E., Kaya, B.: Text summarization in social networks by using deep learning. In: 1st International Informatics and Software Engineering Conference (UBMYK), Ankara, Turkey. pp. 1–5 (2019)
37.
go back to reference Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018) Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:​1810.​04805 (2018)
38.
go back to reference Xin, J., Jiawei, H.: K-means clustering. In: Encyclopedia of Machine Learning and Data Mining, pp. 695–697 (2017) Xin, J., Jiawei, H.: K-means clustering. In: Encyclopedia of Machine Learning and Data Mining, pp. 695–697 (2017)
41.
go back to reference Brendan F.J., Delbert, D.: Clustering by passing messages between data points, pp. 972–976 (2007) Brendan F.J., Delbert, D.: Clustering by passing messages between data points, pp. 972–976 (2007)
42.
go back to reference Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Neural Information Processing Systems (2014) Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Neural Information Processing Systems (2014)
43.
go back to reference Sherstinsky, A.: Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. In: CoRR, (abs/1808.03314) (2018) Sherstinsky, A.: Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. In: CoRR, (abs/1808.03314) (2018)
44.
go back to reference Vinyals, O., Fortunato, M., Jaitly, N.: Pointer networks. In: Neural Information Processing Systems (2015) Vinyals, O., Fortunato, M., Jaitly, N.: Pointer networks. In: Neural Information Processing Systems (2015)
45.
go back to reference Tu, Z., Lu, Z., Liu, Y., Liu, X., Li, H.: Modeling coverage for neural machine translation. In: Association for Computational Linguistics (2016) Tu, Z., Lu, Z., Liu, Y., Liu, X., Li, H.: Modeling coverage for neural machine translation. In: Association for Computational Linguistics (2016)
46.
go back to reference Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Proceedings of the ICLR Conference, San Diego, USA. pp. 1–15 (2015) Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Proceedings of the ICLR Conference, San Diego, USA. pp. 1–15 (2015)
47.
go back to reference Lin, C-Y.: ROUGE: a package for automatic evaluation of summaries. In: Proceedings of the Workshop on Text Summarization Branches Out, WAS 2004 Lin, C-Y.: ROUGE: a package for automatic evaluation of summaries. In: Proceedings of the Workshop on Text Summarization Branches Out, WAS 2004
48.
go back to reference Nils, R., Iryna, G.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: Proceedings of Empirical Methods in Natural Language Processing. Association for Computational Linguistics (2019). http://arxiv.org/abs/1908.10084 Nils, R., Iryna, G.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: Proceedings of Empirical Methods in Natural Language Processing. Association for Computational Linguistics (2019). http://​arxiv.​org/​abs/​1908.​10084
Metadata
Title
Summarization of Twitter Events with Deep Neural Network Pre-trained Models
Authors
Kunal Chakma
Amitava Das
Swapan Debbarma
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-76228-5_4

Premium Partner