Skip to main content
Top

2024 | OriginalPaper | Chapter

Enhancing Extractive Summarization in Student Assignments Using BERT and K-Means Clustering

Authors : Mamluatul Hani’ah, Vivi Nur Wijayaningrum, Astrifidha Rahma Amalia

Published in: Proceedings of the 4th International Conference on Electronics, Biomedical Engineering, and Health Informatics

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Evaluation through learning assessments is a fundamental factor in determining student’s success in achieving specified competencies. During the essay evaluation, the lecturer needs to check each assignment individually. However, when dealing with long essay answers, extra attention is necessary to extract the key points effectively. As a result, this process takes a lot of time and may potentially lead to carelessness or boredom during the assignment-checking process. The way to overcome this problem is by using Extractive summarization. The Extractive Summarization method summarizes by extracting key points from the student assignments and creating a summary without making any changes to the original text. Currently, Extractive Summarization widely uses deep learning technology, such as Bidirectional Encoder Representations from Transformers (BERT). BERT can effectively recognize contextual information within sentences. Output from BERT is sentence embeddings that can serve as valuable features for clustering. The summary result generated by applying k-means clustering to group sentences that have similarities and then selecting one primary sentence from each cluster to represent the cluster. This research proposed an approach for selecting sentence candidates for each cluster using TF-IDF weighting. The proposed method achieved the best ROUGE score on ROUGE-1 recall with 0.73003. We compare our results with the previous study’s BERT k-means approach, which selects sentence candidates from the closest sentences to the centroid for summary selection. The experimental results show that the proposed method achieves slightly better ROUGE scores than the previous study. Furthermore, in terms of execution time comparison, the proposed method has a shorter execution time.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Kurniawan A, Febrianti AN, Hardianti T et al (2022) Evaluasi Pembelajaran. PT Global Eksekutif Teknologi, Padang Kurniawan A, Febrianti AN, Hardianti T et al (2022) Evaluasi Pembelajaran. PT Global Eksekutif Teknologi, Padang
2.
go back to reference Hani’ah M, Kurniawan Y, Rozi IF (2021) livE (onLine--java Exercise) java programming language learning system for lab and online test. Matrix: Jurnal Manajemen Teknologi dan Informatika 11:1–10 Hani’ah M, Kurniawan Y, Rozi IF (2021) livE (onLine--java Exercise) java programming language learning system for lab and online test. Matrix: Jurnal Manajemen Teknologi dan Informatika 11:1–10
3.
go back to reference Setiawati W, Asmira O, Ariyana Y et al (2019) Buku penilaian berorientasi higher order thinking skills Setiawati W, Asmira O, Ariyana Y et al (2019) Buku penilaian berorientasi higher order thinking skills
15.
go back to reference Devlin J, Chang MW, Lee K, Toutanova K (2018) BERT: Pre-training of deep bidirectional transformers for language understanding. NAACL HLT 2019. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies. vol 1, pp 4171–4186 Devlin J, Chang MW, Lee K, Toutanova K (2018) BERT: Pre-training of deep bidirectional transformers for language understanding. NAACL HLT 2019. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies. vol 1, pp 4171–4186
16.
go back to reference Vaswani A, Brain G, Shazeer N et al (2017) Attention is all you need. Adv Neural Inf Process Syst 30 Vaswani A, Brain G, Shazeer N et al (2017) Attention is all you need. Adv Neural Inf Process Syst 30
17.
go back to reference Miller D (2019) Leveraging BERT for extractive text summarization on lectures. arXiv preprint arXiv:190604165 Miller D (2019) Leveraging BERT for extractive text summarization on lectures. arXiv preprint arXiv:190604165
19.
21.
go back to reference Juarto B, Yulianto (2023) Indonesian news classification using IndoBert. Int J Intell Syst Appl Eng 11:454–460 Juarto B, Yulianto (2023) Indonesian news classification using IndoBert. Int J Intell Syst Appl Eng 11:454–460
25.
go back to reference Ririd A, Hani’ah M, Putri I (2020) Analisis Pertumbuhan Balita Menggunakan Algoritma Kmeans++ Untuk Mengetahui Resiko Obesitas. SENTIA 2020 12 Ririd A, Hani’ah M, Putri I (2020) Analisis Pertumbuhan Balita Menggunakan Algoritma Kmeans++ Untuk Mengetahui Resiko Obesitas. SENTIA 2020 12
26.
go back to reference Abu Nada AM, Alajrami E, Al-Saqqa AA, Abu-Naser SS (2020) Arabic text summarization using AraBERT model using extractive text summarization approach Abu Nada AM, Alajrami E, Al-Saqqa AA, Abu-Naser SS (2020) Arabic text summarization using AraBERT model using extractive text summarization approach
28.
go back to reference Lin C-Y (2004) Rouge: a package for automatic evaluation of summaries. In: Text summarization branches out. pp 74–81 Lin C-Y (2004) Rouge: a package for automatic evaluation of summaries. In: Text summarization branches out. pp 74–81
29.
Metadata
Title
Enhancing Extractive Summarization in Student Assignments Using BERT and K-Means Clustering
Authors
Mamluatul Hani’ah
Vivi Nur Wijayaningrum
Astrifidha Rahma Amalia
Copyright Year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-1463-6_31