2020 | OriginalPaper | Chapter

Comparative Analysis of Hindi Text Summarization for Multiple Documents by Padding of Ancillary Features

Authors: Archana N. Gulati, Sudhir D. Sawarkar

Published in: Performance Management of Integrated Systems and its Applications in Software Engineering

Publisher: Springer Singapore

Abstract

The volume of textual material is enormous and grows every day. The data available on the Internet comprises Web pages, news articles, status updates, and blogs, most of which are unstructured. There is a pressing need to reduce this text to shorter, focused summaries that capture the salient details, so that users can navigate it more effectively and check whether a larger document contains the information they are looking for. A text summary is a shorter version of the original text. The need for summarization arises because readers often lack the time to read a document in full. Automatic text summarization methods are therefore needed to cope with the ever-growing amount of text available online, both to help discover relevant information and to consume it faster. To address this time constraint, this work proposes an extractive text summarization technique that selects important sentences from a text document to convey the gist of its content. A fuzzy technique is used to generate extractive summaries from multiple documents using feature sets of eight and eleven features. The eleven-feature set combines the eight existing features (term frequency-inverse sentence frequency, length of the sentence in the document, location of the sentence in the document, similarity between sentences, numerical data, title overlap, subject-object-verb (SOV) qualifier, and lexical similarity) with three ancillary features (proper nouns, Hindi cue phrases, and thematic words). Applying the fuzzy technique with eleven features gave better summarization results than applying it with eight features: precision increased by 3–5%, depending on the dataset. The datasets were Hindi news articles collected from online sources.
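
The feature-based extractive approach described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It covers only four of the listed features (TF-ISF, sentence length, sentence location, title overlap), and it replaces the fuzzy technique, whose membership functions and rules the abstract does not specify, with a plain average of normalized feature values. The whitespace tokenizer, all function names, and the exact feature formulas are assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(sentence):
    # Assumption: whitespace tokenization stands in for a real Hindi tokenizer.
    return sentence.split()


def tf_isf_scores(sentences):
    # Term frequency-inverse sentence frequency, averaged over each sentence's tokens.
    tokenized = [tokenize(s) for s in sentences]
    n = len(sentences)
    sent_freq = Counter(t for toks in tokenized for t in set(toks))
    scores = []
    for toks in tokenized:
        if not toks:
            scores.append(0.0)
            continue
        tf = Counter(toks)
        total = sum(tf[t] * math.log(n / sent_freq[t]) for t in tf)
        scores.append(total / len(toks))
    return scores


def normalize(values):
    # Scale values into [0, 1] by dividing by the maximum.
    top = max(values, default=0.0)
    return [v / top if top > 0 else 0.0 for v in values]


def sentence_features(sentences, title):
    # One tuple per sentence: (TF-ISF, length, location, title overlap).
    title_terms = set(tokenize(title))
    n = len(sentences)
    lengths = [len(tokenize(s)) for s in sentences]
    location = [(n - i) / n for i in range(n)]  # earlier sentences score higher
    overlap = [
        len(set(tokenize(s)) & title_terms) / len(title_terms) if title_terms else 0.0
        for s in sentences
    ]
    return list(zip(normalize(tf_isf_scores(sentences)), normalize(lengths), location, overlap))


def summarize(sentences, title, k):
    # Assumption: a plain average replaces the fuzzy inference step.
    scores = [sum(f) / len(f) for f in sentence_features(sentences, title)]
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]  # keep original document order


def precision(system, reference):
    # Fraction of system-selected sentences that also appear in the reference summary.
    if not system:
        return 0.0
    return len(set(system) & set(reference)) / len(system)
```

Calling `summarize(sentences, title, k)` returns the `k` highest-scoring sentences in their original order, and `precision` compares such a summary with a reference summary. Extending the sketch from eight to eleven features would amount to adding further columns to the tuples built in `sentence_features`.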

DOI: https://doi.org/10.1007/978-981-13-8253-6_22