nach oben

Erschienen in:

2024 | OriginalPaper | Buchkapitel

Analysis of Deep Learning Models for Text Summarization of User Manuals

verfasst von : Mihir Kayastha, Megh Khaire, Malhar Gate, Param Joshi, Sheetal Sonawane

Erschienen in: Big Data, Machine Learning, and Applications

Verlag: Springer Nature Singapore

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

User manuals have an inconsistent structure with the data presented in multiple formats such as tables, images, etc. It makes processing them a challenging task as we need to account for these inconsistencies. In this work, we propose a pipeline for processing user manuals and analyzing abstractive model PEGASUS and extractive models XLNet, BERT, and GPT-2 for summarization of user manuals. To evaluate the models, we have generated extractive and abstractive datasets and used metrics such as hit ratio, overlap, and rouge score to compare the performance of the models. We observed that an abstractive model gives more human-like summaries compared to the extractive models which although have higher rouge scores, suffer in readability. The system utilizes automatic text summarization along with multiple methods to process user manuals and extract required information in a summarized manner.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Implementing Autonomous Navigation on an Omni Wheeled Robot Using 2D LiDAR, Tracking Camera and ROS

Nächstes Kapitel Modelling Seismic Performance of Reinforced Concrete Buildings Within Response Spectrum Framework

Artifex Software I. Pymupdf: A lightweight PDF, XPS, and e-book viewer, renderer, and toolkit. https://github.com/pymupdf/PyMuPDF

Britz D, Goldie A, Luong MT, Le Q (2017) Massive exploration of neural machine translation architectures. arXiv:1703.03906 (2017)

Chaput M (2007) Whoosh: python-based search engine

Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805

Erera S, Shmueli-Scheuer M, Feigenblat G, Nakash OP, Boni O, Roitman H, Cohen D, Weiner B, Mass Y, Rivlin O et al (2019) A summarization system for scientific documents. arXiv:1908.11152

Lin CY (2004) ROUGE: a package for automatic evaluation of summaries. In: Text summarization branches out. Association for Computational Linguistics, Barcelona, Spain, pp 74–81. https://aclanthology.org/W04-1013

Manuals online: free library for manuals. http://www.manualsonline.com/

Manualslib: the ultimate manuals library. https://www.manualslib.com/

Pdf.js: a portable document format (pdf) viewer, built with html5. https://github.com/mozilla/pdf.js

10.

Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I et al (2019) Language models are unsupervised multitask learners. OpenAI Blog 1(8):9

11.

Sonawane S, Kulkarni P, Deshpande C, Athawale B (2019) Extractive summarization using semigraph (ESSG). Evol Syst 10(3):409–424CrossRef

12.

Yang Z, Dai Z, Yang Y, Carbonell J, Salakhutdinov RR, Le QV (2019) Xlnet: generalized autoregressive pretraining for language understanding. Adv Neural Inf Process Syst 32

13.

Zhang J, Zhao Y, Saleh M, Liu P (2020) Pegasus: pre-training with extracted gap-sentences for abstractive summarization. In: International conference on machine learning. PMLR, pp 11328–11339

Titel: Analysis of Deep Learning Models for Text Summarization of User Manuals
verfasst von: Mihir Kayastha
Megh Khaire
Malhar Gate
Param Joshi
Sheetal Sonawane
Verlag: Springer Nature Singapore
Buch: Big Data, Machine Learning, and Applications
Print ISBN: 978-981-9934-80-5

Electronic ISBN: 978-981-9934-81-2

Copyright-Jahr: 2024
DOI: https://doi.org/10.1007/978-981-99-3481-2_35

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner