Skip to main content

2021 | OriginalPaper | Buchkapitel

A Novel Approach to Text Summarisation Using Topic Modelling and Noun Phrase Extraction

verfasst von : Nikhil M. Lal, S. Krishnanunni, Vishnu Vijayakumar, N. Vaishnavi, S. Siji Rani , K. Deepa Raj

Erschienen in: Advances in Computing and Network Communications

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Over the past few years, one of the remarkable developments that happened on the web is the rapid growth of textual data. This substantial increase, however, induces a complication in the retrieval of vital information from the digitized collection of data. The conventional technique used to tackle this problem is Automatic Text Summarisation. This technique extracts the essential words or sentences from the data and summarises it without affecting the semantics. Automatic text summarisation is classified into two, Extractive and Abstractive. The Extractive method summarises a document by selecting the important words or sentences from it, based on some attributes while the Abstractive method attempts to generate its summary from the semantics of the data. In this paper, we propose a novel approach in Extractive text summarisation by using a new sentence scoring parameter. The experimental results show that the proposed sentence scoring parameter improves the performance of the Extractive text summariser, when compared with other summarisation models. To validate our proposed model, we compared it with four commonly used summarisation models on grounds of ROUGE-1 score and F1 score.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat El-Kassas, W.S., Salama, C.R., Rafea, A.A., Mohamed, H.K.: Automatic text summarization: a comprehensive survey. Expert Syst. Appl. 113679 (2020) El-Kassas, W.S., Salama, C.R., Rafea, A.A., Mohamed, H.K.: Automatic text summarization: a comprehensive survey. Expert Syst. Appl. 113679 (2020)
2.
Zurück zum Zitat Deepa Raj K., Sajeev, G.P.: A community based web summarization in near linear time. In: 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 962–968 (2018) Deepa Raj K., Sajeev, G.P.: A community based web summarization in near linear time. In: 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 962–968 (2018)
3.
Zurück zum Zitat Naveen, G.K.R.: Query-based multi-document summarization by clustering of documents (2014) Naveen, G.K.R.: Query-based multi-document summarization by clustering of documents (2014)
4.
Zurück zum Zitat Christian, H., Agus, M.P., Suhartono, D.: Single document automatic text summarization using TF-IDF. ComTech: Comput. Math. Eng. Appl. 7(4), 285–294 (2016) Christian, H., Agus, M.P., Suhartono, D.: Single document automatic text summarization using TF-IDF. ComTech: Comput. Math. Eng. Appl. 7(4), 285–294 (2016)
5.
Zurück zum Zitat Nallapati, R., Zhou, B., Gulcehre, C., Xiang, B., et al.: Abstractive text summarization using sequence-to-sequence RNNs and beyond. arXiv preprint arXiv:1602.06023 (2016) Nallapati, R., Zhou, B., Gulcehre, C., Xiang, B., et al.: Abstractive text summarization using sequence-to-sequence RNNs and beyond. arXiv preprint arXiv:​1602.​06023 (2016)
6.
Zurück zum Zitat Otter, D.W., Medina, J.R., Kalita, J.K.: A survey of the usages of deep learning for natural language processing. IEEE Trans. Neural Netw. Learn. Syst. (2020) Otter, D.W., Medina, J.R., Kalita, J.K.: A survey of the usages of deep learning for natural language processing. IEEE Trans. Neural Netw. Learn. Syst. (2020)
7.
Zurück zum Zitat Bommasani, R., Davis, K., Cardie, C.: Distilling static embeddings from pretrained contextual representations, Bert wears gloves (2019) Bommasani, R., Davis, K., Cardie, C.: Distilling static embeddings from pretrained contextual representations, Bert wears gloves (2019)
8.
Zurück zum Zitat Jacobi, C., Van Atteveldt, W., Welbers, K.: Quantitative analysis of large amounts of journalistic texts using topic modelling. Digit. J. 4(1), 89–106 (2016) Jacobi, C., Van Atteveldt, W., Welbers, K.: Quantitative analysis of large amounts of journalistic texts using topic modelling. Digit. J. 4(1), 89–106 (2016)
9.
Zurück zum Zitat Gunawan, D., Pasaribu, A., Rahmat, R.F., Budiarto, R.: Automatic text summarization for indonesian language using TextTeaser. Proc. Electr. Eng. Comput. Sci. Inform. 3(1), 012048 (2016) Gunawan, D., Pasaribu, A., Rahmat, R.F., Budiarto, R.: Automatic text summarization for indonesian language using TextTeaser. Proc. Electr. Eng. Comput. Sci. Inform. 3(1), 012048 (2016)
10.
Zurück zum Zitat Patel, S.M., Dabhi, V.K., Prajapati, H.B.: Extractive based automatic text summarization. JCP 12(6), 550–563 (2017)CrossRef Patel, S.M., Dabhi, V.K., Prajapati, H.B.: Extractive based automatic text summarization. JCP 12(6), 550–563 (2017)CrossRef
11.
Zurück zum Zitat Patel, V., Tabrizi, N.: An automatic text summarization: a systematic. Int. J. Inf. Eng. Electron. Bus. (2019) Patel, V., Tabrizi, N.: An automatic text summarization: a systematic. Int. J. Inf. Eng. Electron. Bus. (2019)
12.
Zurück zum Zitat Mandal, S., Singh, G.K., Pal, A.: A hybrid text summarization approach. J. Inform. Math. Sci. 9(3), 547–555 (2017) Mandal, S., Singh, G.K., Pal, A.: A hybrid text summarization approach. J. Inform. Math. Sci. 9(3), 547–555 (2017)
13.
Zurück zum Zitat Khan, R., Qian, Y., Naeem, S.: Extractive based text summarization using k-means and TF-IDF. Int. J. Inf. Eng. Electron. Bus. 11(3), 33 (2019) Khan, R., Qian, Y., Naeem, S.: Extractive based text summarization using k-means and TF-IDF. Int. J. Inf. Eng. Electron. Bus. 11(3), 33 (2019)
14.
Zurück zum Zitat El-Refaiy, A., Abas, A.R., Elhenawy, I.: Review of recent techniques for extractive text summarization. J. Theor. Appl. Inf. Technol. 96, 7739–7759 (2018) El-Refaiy, A., Abas, A.R., Elhenawy, I.: Review of recent techniques for extractive text summarization. J. Theor. Appl. Inf. Technol. 96, 7739–7759 (2018)
15.
Zurück zum Zitat Kumar, K.M.S., Soumya, R.: Text summarization using clustering technique. Int. J. Appl. Eng. Res. 10, 25511–25519 (2015) Kumar, K.M.S., Soumya, R.: Text summarization using clustering technique. Int. J. Appl. Eng. Res. 10, 25511–25519 (2015)
16.
Zurück zum Zitat Bewoor, M.S., Patil, S.H.: Empirical analysis of single and multi document summarization using clustering algorithms. Eng. Technol. Appl. Sci. Res. 8(1), 2562–2567 (2018)CrossRef Bewoor, M.S., Patil, S.H.: Empirical analysis of single and multi document summarization using clustering algorithms. Eng. Technol. Appl. Sci. Res. 8(1), 2562–2567 (2018)CrossRef
17.
Zurück zum Zitat Mallick, C., Das, A.K., Dutta, M., Das, A.K., Sarkar, A.: Graph-based text summarization using modified TextRank. In: Soft Computing in Data Analytics, pp. 137–146. Springer (2019) Mallick, C., Das, A.K., Dutta, M., Das, A.K., Sarkar, A.: Graph-based text summarization using modified TextRank. In: Soft Computing in Data Analytics, pp. 137–146. Springer (2019)
20.
Zurück zum Zitat Gensim. Documentation. pypi.org/project/gensim (2020) Gensim. Documentation. pypi.org/project/gensim (2020)
21.
Zurück zum Zitat Google. Colaboratory. colab.research.google.com Google. Colaboratory. colab.research.google.com
24.
Zurück zum Zitat Siji Rani, S., Sreejith, K., Sanker, A.: A hybrid approach for automatic document summarization. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 663–669 (2017) Siji Rani, S., Sreejith, K., Sanker, A.: A hybrid approach for automatic document summarization. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 663–669 (2017)
25.
Zurück zum Zitat Resoomer. Text summariser. resoomer.com/en Resoomer. Text summariser. resoomer.com/en
26.
Zurück zum Zitat Leidner, J.L.: Summarization in the financial and regulatory domain. In: Trends and Applications of Text Summarization Techniques, pp. 187–215. IGI Global (2020) Leidner, J.L.: Summarization in the financial and regulatory domain. In: Trends and Applications of Text Summarization Techniques, pp. 187–215. IGI Global (2020)
27.
Zurück zum Zitat Kaggle. Dataset1: All news dataset. kaggle.com/snapcrack/all-the-news Kaggle. Dataset1: All news dataset. kaggle.com/snapcrack/all-the-news
28.
Zurück zum Zitat Kaggle. Dataset2: News summary. kaggle.com/sunnysai12345/news-summary Kaggle. Dataset2: News summary. kaggle.com/sunnysai12345/news-summary
29.
Zurück zum Zitat Kaggle. Dataset3: Covid19 dataset. kaggle.com/jannalipenkova/covid19-public-media-dataset (2020) Kaggle. Dataset3: Covid19 dataset. kaggle.com/jannalipenkova/covid19-public-media-dataset (2020)
Metadaten
Titel
A Novel Approach to Text Summarisation Using Topic Modelling and Noun Phrase Extraction
verfasst von
Nikhil M. Lal
S. Krishnanunni
Vishnu Vijayakumar
N. Vaishnavi
S. Siji Rani
K. Deepa Raj
Copyright-Jahr
2021
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-33-6987-0_24

Neuer Inhalt