Skip to main content
Top

2023 | OriginalPaper | Chapter

Abstractive Text Summarization of Hindi Corpus Using Transformer Encoder-Decoder Model

Authors : Rashi Bhansali, Anushka Bhave, Gauri Bharat, Vedant Mahajan, Manikrao Laxmanrao Dhore

Published in: International Symposium on Intelligent Informatics

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Text Summarization based on Abstraction is the task of generating a concise summary that captures the principal ideas of the source text. It potentially contains new phrases that do not appear in the original text. Although it is widely studied for languages like English and French, owing to the scarcity of data on regional vernacular languages like Hindi, the research in this area is still in the primitive stages. We propose a novel approach for building an Abstractive Text Summarizer for Hindi corpus using the Transformer encoder-decoder architecture. Firstly, efficient pre-trained word representations are generated using Facebook’s fastText model. Next, the Transformer model is employed to extract contextual dependencies and yield better semantic representations for a morphologically rich language like Hindi, engendering an abstractive summary. On performing an experimental evaluation on the Hindi news dataset to generate news article headlines, we achieve a ROUGE-1 precision and recall score of 0.682 and 0.598, respectively, which outperforms the state-of-the-art techniques.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
3.
go back to reference J.-M. Torres-Moreno, Automatic text summarization (Wiley, 2014) J.-M. Torres-Moreno, Automatic text summarization (Wiley, 2014)
4.
go back to reference S. Chopra, M. Auli, A.M. Rush, Abstractive sentence summarization with attentive recurrent neural networks, in Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2016). https://doi.org/10.18653/v1/N16-1012 S. Chopra, M. Auli, A.M. Rush, Abstractive sentence summarization with attentive recurrent neural networks, in Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2016). https://​doi.​org/​10.​18653/​v1/​N16-1012
12.
15.
go back to reference V. Dalal, L. Malik, Semantic graph based automatic text summarization for hindi documents using particle swarm optimization, in Information and Communication Technology for Intelligent Systems (ICTIS 2017), vol. 2 (Springer International Publishing) V. Dalal, L. Malik, Semantic graph based automatic text summarization for hindi documents using particle swarm optimization, in Information and Communication Technology for Intelligent Systems (ICTIS 2017), vol. 2 (Springer International Publishing)
23.
go back to reference O. Vasilyev, V. Dharnidharka, J. Bohannon, Fill in the BLANC: human-free quality estimation of document summaries (2020) O. Vasilyev, V. Dharnidharka, J. Bohannon, Fill in the BLANC: human-free quality estimation of document summaries (2020)
Metadata
Title
Abstractive Text Summarization of Hindi Corpus Using Transformer Encoder-Decoder Model
Authors
Rashi Bhansali
Anushka Bhave
Gauri Bharat
Vedant Mahajan
Manikrao Laxmanrao Dhore
Copyright Year
2023
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-19-8094-7_13