2019 | OriginalPaper | Book chapter

Recognizing Faults in Software Related Difficult Data

Written by: Michał Choraś, Marek Pawlicki, Rafał Kozik

Published in: Computational Science – ICCS 2019

Publisher: Springer International Publishing

Abstract

In this paper we investigate the use of numerous machine learning algorithms, with emphasis on multilayer artificial neural networks, for software source code fault prediction. The main contribution lies in enhancing the data pre-processing step as a partial solution for handling difficult software-related data. Before feeding the data into an artificial neural network, we apply PCA (Principal Component Analysis) and k-means clustering. The data-clustering step improves the quality of the whole dataset. Using the presented approach, we obtained a 10% increase in fault detection accuracy. To ensure reliable results, we use 10-fold cross-validation in the experiments. We also evaluated a wide range of hyperparameter setups for the network and compared the results to state-of-the-art cost-sensitive approaches: Random Forest, AdaBoost, RepTrees and GBT.
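
The clustering and cross-validation steps mentioned in the abstract can be illustrated with a minimal, dependency-free sketch. The abstract does not say how the k-means output is used, so appending the nearest-cluster index as an extra feature is an assumption made here, not the authors' confirmed method. PCA and the neural network are left out to keep the example short; all function names and the toy data are hypothetical.

```python
import random

def kfold_indices(n_samples, n_folds=10, seed=0):
    """Yield (train_idx, test_idx) pairs for shuffled k-fold cross-validation."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::n_folds] for i in range(n_folds)]
    for i in range(n_folds):
        test = folds[i]
        train = [j for k, fold in enumerate(folds) if k != i for j in fold]
        yield train, test

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(point, centroids):
    """Index of the centroid closest to the given point."""
    return min(range(len(centroids)), key=lambda c: sq_dist(point, centroids[c]))

def kmeans(points, k, n_iter=20, seed=0):
    """Plain Lloyd's algorithm; returns the list of centroids."""
    centroids = [list(p) for p in random.Random(seed).sample(points, k)]
    for _ in range(n_iter):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        for c, members in enumerate(groups):
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def add_cluster_feature(points, centroids):
    """Append the index of the nearest centroid to each feature vector (assumed usage)."""
    return [list(p) + [nearest(p, centroids)] for p in points]

# Toy data standing in for software metrics (hypothetical values).
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(50)] + \
    [[rng.gauss(8, 1), rng.gauss(8, 1)] for _ in range(50)]

for train_idx, test_idx in kfold_indices(len(X), n_folds=10):
    X_train = [X[i] for i in train_idx]
    X_test = [X[i] for i in test_idx]
    centroids = kmeans(X_train, k=2)  # fit on the training fold only
    X_train_aug = add_cluster_feature(X_train, centroids)
    X_test_aug = add_cluster_feature(X_test, centroids)
    # A classifier (for example a neural network) would be trained on
    # X_train_aug and scored on X_test_aug at this point.
```

Fitting the clustering on the training fold only, and then applying the resulting centroids to the held-out fold, keeps information from the test data out of the pre-processing step. Whether the paper follows this protocol is not stated in the abstract.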

Metadata
Title
Recognizing Faults in Software Related Difficult Data
Written by
Michał Choraś
Marek Pawlicki
Rafał Kozik
Copyright year
2019
DOI
https://doi.org/10.1007/978-3-030-22744-9_20