Published in: Empirical Software Engineering 2/2015

1 April 2015

Modelling the ‘hurried’ bug report reading process to summarize bug reports

Authors: Rafael Lotufo, Zeeshan Malik, Krzysztof Czarnecki


Abstract

Although bug reports are frequently consulted project assets, they are communication logs, by-products of bug resolution, and not artifacts created with the intent of being easy to follow. To facilitate bug report digestion, we propose a new, unsupervised bug report summarization approach that estimates the attention a user would hypothetically give to different sentences in a bug report when pressed for time. We pose three hypotheses on what makes a sentence relevant: discussing frequently discussed topics, being evaluated or assessed by other sentences, and keeping focused on the bug report’s title and description. Our results suggest that our hypotheses are valid, since the summaries improve by as much as 12% on standard summarization evaluation metrics compared to the previous approach. Our evaluation also asks developers to assess the quality and usefulness of the summaries created for bug reports they have worked on. Feedback from developers not only shows that the summaries are useful, but also points out important requirements for this and any other bug report summarization approach, and indicates directions for future work.
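To make the idea of estimating reader attention more concrete, the following is a minimal sketch and not the authors' implementation, since the abstract does not specify the model. It assumes a PageRank-style random walk over sentences, with Jaccard word overlap standing in for "discussing frequently discussed topics" (link weights) and for "keeping focused on the title" (restart distribution). The hypothesis about sentences being evaluated by other sentences is left out. The names `tokens`, `jaccard`, `rank_sentences`, and `summarize`, as well as the `damping` and `iterations` parameters, are hypothetical.

```python
import re


def tokens(text):
    """Lowercased word set; a crude stand-in for real preprocessing."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Word-overlap similarity in [0, 1] (an assumed similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_sentences(title, sentences, damping=0.85, iterations=50):
    """Score sentences with a PageRank-style random walk (illustrative only)."""
    n = len(sentences)
    if n == 0:
        return []
    bags = [tokens(s) for s in sentences]
    title_bag = tokens(title)

    # Restart distribution: a hurried reader jumps back to title-related sentences.
    raw = [jaccard(title_bag, bag) + 1e-6 for bag in bags]
    total = sum(raw)
    restart = [r / total for r in raw]

    # Link weights: sentences sharing vocabulary pass attention to each other.
    weights = [
        [jaccard(bags[i], bags[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]

    scores = [1.0 / n] * n
    for _ in range(iterations):
        new = [(1 - damping) * restart[j] for j in range(n)]
        for i in range(n):
            out = sum(weights[i])
            for j in range(n):
                # Sentences with no outgoing weight fall back to the restart distribution.
                share = weights[i][j] / out if out > 0 else restart[j]
                new[j] += damping * scores[i] * share
        scores = new
    return scores


def summarize(title, sentences, k=2):
    """Return the k highest-scoring sentences, kept in their original order."""
    scores = rank_sentences(title, sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(title, comments, k=3)` on a bug report's title and a list of its comment sentences would return three sentences in reading order. Keeping the original order is a design choice made here so that the extract still reads like a conversation; the scores themselves always sum to 1, so they can be read as a share of the reader's attention.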

Metadata
Title
Modelling the ‘hurried’ bug report reading process to summarize bug reports
Authors
Rafael Lotufo
Zeeshan Malik
Krzysztof Czarnecki
Publication date
1 April 2015
Publisher
Springer US
Published in
Empirical Software Engineering / Issue 2/2015
Print ISSN: 1382-3256
Electronic ISSN: 1573-7616
DOI
https://doi.org/10.1007/s10664-014-9311-2
