Skip to main content
Top

2019 | OriginalPaper | Chapter

Applying Weighted Particle Swarm Optimization to Imbalanced Data in Software Defect Prediction

Authors : Lucija Brezočnik, Vili Podgorelec

Published in: New Technologies, Development and Application

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Imbalanced data typically refers to class distribution skews and underrepresented data, which affect the performance of learning algorithms. Such data are well-known in real-life situations, such as behavior analysis, cancer malignancy grading, industrial systems’ monitoring and software defect prediction. In this paper, we present a W-PSO method, which comprises weighting of instances in a dataset and the Particle Swarm Optimization algorithm. The presented method was combined with classification methods C4.5 and Naive Bayes, respectively, and tested experimentally on ten freely accessible software defect prediction datasets. Based on the results achieved, the presented W-PSO method creates better classification models than classification methods C4.5 and Naive Bayes in the majority of the cases.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Brezočnik, L.: Feature selection for classification using particle swarm optimization. In: IEEE EUROCON 2017 – 17th International Conference on Smart Technologies, pp. 966–971. IEEE, Ohrid, Macedonia (2017) Brezočnik, L.: Feature selection for classification using particle swarm optimization. In: IEEE EUROCON 2017 – 17th International Conference on Smart Technologies, pp. 966–971. IEEE, Ohrid, Macedonia (2017)
2.
go back to reference Brezočnik, L., Karakatič, S., Podgorelec, V.: Weighted particle swarm optimization for imbalanced data. In: Twenty-sixth International Electrotechnical and Computer Science Conference ERK 2017, pp. 387–390. IEEE Slovenia Section, Portorož, Slovenia (2017) Brezočnik, L., Karakatič, S., Podgorelec, V.: Weighted particle swarm optimization for imbalanced data. In: Twenty-sixth International Electrotechnical and Computer Science Conference ERK 2017, pp. 387–390. IEEE Slovenia Section, Portorož, Slovenia (2017)
3.
go back to reference Fenton, N.E., Ohlsson, N.: Quantitative analysis of faults and failures in a complex software system. IEEE Trans. Softw. Eng. 26(8), 797–814 (2000)CrossRef Fenton, N.E., Ohlsson, N.: Quantitative analysis of faults and failures in a complex software system. IEEE Trans. Softw. Eng. 26(8), 797–814 (2000)CrossRef
5.
go back to reference Karakatič, S., Heričko, M., Podgorelec, V.: Weighting and sampling data for individual classifiers and bagging with genetic algorithms. In: 7th International Joint Conference on Computational Intelligence IJCCI, pp. 180–187. IEEE, Lisbon, Portugal (2015) Karakatič, S., Heričko, M., Podgorelec, V.: Weighting and sampling data for individual classifiers and bagging with genetic algorithms. In: 7th International Joint Conference on Computational Intelligence IJCCI, pp. 180–187. IEEE, Lisbon, Portugal (2015)
6.
go back to reference Kennedy, J., Eberhart, R.: Particle swarm optimization. In: IEEE International Conference on Neural Networks, vol. 4, pp. 1942–1948. IEEE, Perth, Australia (1995) Kennedy, J., Eberhart, R.: Particle swarm optimization. In: IEEE International Conference on Neural Networks, vol. 4, pp. 1942–1948. IEEE, Perth, Australia (1995)
7.
go back to reference Khoshgoftaar, T.M., Allen, E.B., Deng, J.: Using regression trees to classify fault-prone software modules. IEEE Trans. Reliab. 51(4), 455–462 (2002)CrossRef Khoshgoftaar, T.M., Allen, E.B., Deng, J.: Using regression trees to classify fault-prone software modules. IEEE Trans. Reliab. 51(4), 455–462 (2002)CrossRef
8.
go back to reference Menzies, T., Di Stefano, J.S., Chapman, M., McGrill, K.: Metrics that matter. In: 27th Annual NASA Goddard/IEEE Software Engineering Workshop, pp. 51–57. IEEE (2002) Menzies, T., Di Stefano, J.S., Chapman, M., McGrill, K.: Metrics that matter. In: 27th Annual NASA Goddard/IEEE Software Engineering Workshop, pp. 51–57. IEEE (2002)
9.
go back to reference Menzies, T., Greenwald, J., Frank, A.: Data mining static code attributes to learn defect predictors. IEEE Trans. Softw. Eng. 33(1), 2–13 (2007)CrossRef Menzies, T., Greenwald, J., Frank, A.: Data mining static code attributes to learn defect predictors. IEEE Trans. Softw. Eng. 33(1), 2–13 (2007)CrossRef
10.
go back to reference Pighin, M., Podgorelec, V., Kokol, P.: Fault-threshold prediction with linear programming methodologies. Empirical Softw. Eng. 8(2), 117–138 (2003)CrossRef Pighin, M., Podgorelec, V., Kokol, P.: Fault-threshold prediction with linear programming methodologies. Empirical Softw. Eng. 8(2), 117–138 (2003)CrossRef
11.
go back to reference Podgorelec, V.: Improved mining of software complexity data on evolutionary filtered training sets. WSEAS Trans. Inf. Sci. Appl. 6(11), 1751–1760 (2009) Podgorelec, V.: Improved mining of software complexity data on evolutionary filtered training sets. WSEAS Trans. Inf. Sci. Appl. 6(11), 1751–1760 (2009)
12.
go back to reference Podgorelec, V., Karakatič, S.: Predicting software defect-proneness from software repository data – a case of eclipse bug data. In: 18th International Multiconference Information Society – IS 2015, Collaboration, Software and Services in Information Society, pp. 5–8. InstitutJožef Stefan, Ljubljana, Slovenia (2015) Podgorelec, V., Karakatič, S.: Predicting software defect-proneness from software repository data – a case of eclipse bug data. In: 18th International Multiconference Information Society – IS 2015, Collaboration, Software and Services in Information Society, pp. 5–8. InstitutJožef Stefan, Ljubljana, Slovenia (2015)
13.
go back to reference Podgorelec, V., Kokol, P.: Evolutionary induced decision trees for dangerous software modules prediction. Inf. Process. Lett. 82(1), 31–38 (2002)MathSciNetCrossRef Podgorelec, V., Kokol, P.: Evolutionary induced decision trees for dangerous software modules prediction. Inf. Process. Lett. 82(1), 31–38 (2002)MathSciNetCrossRef
14.
go back to reference Porter, A.A., Selby, R.W.: Empirically guided software development using metric-based classification trees. IEEE Softw. 7(2), 46–54 (1990)CrossRef Porter, A.A., Selby, R.W.: Empirically guided software development using metric-based classification trees. IEEE Softw. 7(2), 46–54 (1990)CrossRef
15.
go back to reference Song, Q., Jia, Z., Shepperd, M., Ying, S., Liu, J.: A general software defect-proneness prediction framework. IEEE Trans. Softw. Eng. 37(3), 356–370 (2011)CrossRef Song, Q., Jia, Z., Shepperd, M., Ying, S., Liu, J.: A general software defect-proneness prediction framework. IEEE Trans. Softw. Eng. 37(3), 356–370 (2011)CrossRef
16.
go back to reference Wahono, R.S.: A systematic literature review of software defect prediction: Research trends, datasets, methods and frameworks. J. Softw. Eng. 1(1), 1–16 (2015) Wahono, R.S.: A systematic literature review of software defect prediction: Research trends, datasets, methods and frameworks. J. Softw. Eng. 1(1), 1–16 (2015)
Metadata
Title
Applying Weighted Particle Swarm Optimization to Imbalanced Data in Software Defect Prediction
Authors
Lucija Brezočnik
Vili Podgorelec
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-319-90893-9_35