2020 | OriginalPaper | Chapter

Human-Like Summaries from Heterogeneous and Time-Windowed Software Development Artefacts

Authors: Mahfouth Alghamdi, Christoph Treude, Markus Wagner

Published in: Parallel Problem Solving from Nature – PPSN XVI

Publisher: Springer International Publishing

Abstract

Automatic text summarisation has drawn considerable interest in the area of software engineering. It is challenging to summarise the activities related to a software project, (1) because of the volume and heterogeneity of involved software artefacts, and (2) because it is unclear what information a developer seeks in such a multi-document summary. We present the first framework for summarising multi-document software artefacts containing heterogeneous data within a given time frame. To produce human-like summaries, we employ a range of iterative heuristics to minimise the cosine-similarity between texts and high-dimensional feature vectors. A first study shows that users find the automatically generated summaries the most useful when they are generated using word similarity and based on the eight most relevant software artefacts.
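The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; it rests on several assumptions: each artefact is already encoded as a numeric feature vector, a summary's vector is the element-wise sum of its artefacts' vectors, the objective is read as minimising cosine distance (1 minus cosine similarity) to a target vector, and a simple greedy loop stands in for the iterative heuristics. The names `cosine_distance`, `greedy_select`, `vectors`, `target`, and `k` are hypothetical.

```python
import math
from typing import List, Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity; 1.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def greedy_select(
    vectors: List[Sequence[float]], target: Sequence[float], k: int
) -> List[int]:
    """Greedily pick up to k artefact indices.

    Assumption: a summary is represented by the element-wise sum of the
    vectors of its artefacts. At each step, the artefact whose addition
    brings that sum closest (in cosine distance) to the target is added.
    """
    chosen: List[int] = []
    total = [0.0] * len(target)
    for _ in range(min(k, len(vectors))):
        best_index, best_distance = -1, float("inf")
        for i, vec in enumerate(vectors):
            if i in chosen:
                continue
            candidate = [t + v for t, v in zip(total, vec)]
            distance = cosine_distance(candidate, target)
            if distance < best_distance:
                best_index, best_distance = i, distance
        chosen.append(best_index)
        total = [t + v for t, v in zip(total, vectors[best_index])]
    return chosen


if __name__ == "__main__":
    # Toy data: three artefacts with two features each (purely illustrative).
    toy_vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    toy_target = [1.0, 1.0]
    print(greedy_select(toy_vectors, toy_target, k=2))
```

With `k=8`, the same loop would return eight artefacts, matching the summary size that the first study found most useful; how the target vector is obtained is left open here.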

Footnotes
1
https://github.com/nodejs/node, accessed on February 2, 2020.
 
2
Based on a two-sided Mann-Whitney U test, there is no statistically significant difference at p = 0.05 between Greedy and Brute Force.
 
3
Let us recall that Random Selection does not generate only one summary at random; it generates many until the time limit is reached and then returns the best.
 
4
The average running times per algorithm to generate a summary are, from left: 151.92, 0.49, 10.0, 10.0, 10.20, and 6.67 seconds.
 
Metadata
Title: Human-Like Summaries from Heterogeneous and Time-Windowed Software Development Artefacts
Authors: Mahfouth Alghamdi, Christoph Treude, Markus Wagner
Copyright Year: 2020
DOI: https://doi.org/10.1007/978-3-030-58115-2_23