Skip to main content
Erschienen in: The Journal of Supercomputing 15/2023

06.05.2023

On the value of parameter tuning in stacking ensemble model for software regression test effort estimation

verfasst von: Taher Labidi, Zaineb Sakhrawi

Erschienen in: The Journal of Supercomputing | Ausgabe 15/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A type of software testing, regression testing is often costly and labour-intensive. As such, multiple corporations have intensified efforts to estimate the amount of effort required. However, frequent alterations in software projects impact the precision of software regression test effort estimation (SRTEE), which increases the difficulty of managing software projects. Therefore, machine learning (ML) has increasingly been used to develop more accurate SRTEEs. The estimation process of a software project comprises inputs, the model, and outputs. This present study examines the quality of estimation inputs and the model required to deliver accurate estimation outputs. An SRTEE that uses the stacking ensemble model (StackSRTEE) was developed to increase the precision of SRTEE. It consisted of the three most common ML methods, namely neural networks, support vector regression, and decision tree regression. The grid search (GS) technique was then used to tune the hyperparameters of the StackSRTEE before it was trained and tested using a dataset from the International Software Benchmarking Standards Group (ISBSG) repository. The size of the functional change; specifically, enhancement; was used as the primary independent variable to improve the inputs of the StackSRTEE model. With the appropriate features; such as the functional change size of an enhancement; (1) the proposed StackSRTEE model yielded higher accuracy than the three individual ML methods on their own, (2) using GS to tune and set the individual ML methods increased the precision of the SRTEE outputs, and (3) the StackSRTEE-based GS tuning yielded estimations that were more precise.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Chittimalli P, Harrold M (2008) Test selection on system requirements. In: ISEC, Proceedings of the 1st conference on India software engineering conference, vol 56, no 2, pp 87-96 Chittimalli P, Harrold M (2008) Test selection on system requirements. In: ISEC, Proceedings of the 1st conference on India software engineering conference, vol 56, no 2, pp 87-96
2.
Zurück zum Zitat Minhas NM, Petersen K, Börstler J, Wnuk K (2020) Regression testing for large-scale embedded software development-exploring the state of practice. Inf Softw Technol 120:106254CrossRef Minhas NM, Petersen K, Börstler J, Wnuk K (2020) Regression testing for large-scale embedded software development-exploring the state of practice. Inf Softw Technol 120:106254CrossRef
3.
Zurück zum Zitat Hron M, Obwegeser N (2020) Why and how is Scrum being adapted in practice: a systematic review. J Syst Softw 183:111110CrossRef Hron M, Obwegeser N (2020) Why and how is Scrum being adapted in practice: a systematic review. J Syst Softw 183:111110CrossRef
4.
Zurück zum Zitat Ali SS, Zafar MS, Saeed MT (2020) Effort estimation problems in software maintenance—a survey. In: 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET), pp 1–9 Ali SS, Zafar MS, Saeed MT (2020) Effort estimation problems in software maintenance—a survey. In: 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET), pp 1–9
5.
Zurück zum Zitat Sharma A, Kushwaha DS (2012) Applying requirement based complexity for the estimation of software development and testing effort. ACM SIGSOFT Softw Eng Notes 73(1):1–11 Sharma A, Kushwaha DS (2012) Applying requirement based complexity for the estimation of software development and testing effort. ACM SIGSOFT Softw Eng Notes 73(1):1–11
6.
Zurück zum Zitat Nguyen V, Pham V, Lam V (2013) qEstimation: a process for estimating size and effort of software testing. In: Proceedings of the 2013 International Conference on Software and System Process ICSSP, San Francisco CA, USA, pp 20–28, Nguyen V, Pham V, Lam V (2013) qEstimation: a process for estimating size and effort of software testing. In: Proceedings of the 2013 International Conference on Software and System Process ICSSP, San Francisco CA, USA, pp 20–28,
7.
Zurück zum Zitat Kamaraj K, Lanitha B, Karthic S, Senthil Prakash PN, Mahaveerakannan R (2023) A hybridized artificial neural network for automated software test oracle. Comput Syst Sci Eng 45(2):1837–1850CrossRef Kamaraj K, Lanitha B, Karthic S, Senthil Prakash PN, Mahaveerakannan R (2023) A hybridized artificial neural network for automated software test oracle. Comput Syst Sci Eng 45(2):1837–1850CrossRef
8.
Zurück zum Zitat Kaur A, Kaur K (2019) Investigation on test effort estimation of mobile applications: systematic literature review and survey. Inf Softw Technol 110:56–77CrossRef Kaur A, Kaur K (2019) Investigation on test effort estimation of mobile applications: systematic literature review and survey. Inf Softw Technol 110:56–77CrossRef
9.
Zurück zum Zitat Sakhrawi Z, Sellami A, Bouassida N (2020) Software enhancement effort estimation using machine learning regression methods. Int J Comput Inf Syst Ind Manag Appl 12:412–423 Sakhrawi Z, Sellami A, Bouassida N (2020) Software enhancement effort estimation using machine learning regression methods. Int J Comput Inf Syst Ind Manag Appl 12:412–423
10.
Zurück zum Zitat Wen J, Li S, Lin Z, Hu Y, Huang C (2012) Systematic literature review of machine learning based software development effort estimation models. Inf Softw Technol 54(1):41–59CrossRef Wen J, Li S, Lin Z, Hu Y, Huang C (2012) Systematic literature review of machine learning based software development effort estimation models. Inf Softw Technol 54(1):41–59CrossRef
11.
Zurück zum Zitat Minku LL, Yao X (2011) A principled evaluation of ensembles of learning machines for software effort estimation. In: Proceedings of the 7th International Conference on Predictive Models in Software Engineering, pp 1–10 Minku LL, Yao X (2011) A principled evaluation of ensembles of learning machines for software effort estimation. In: Proceedings of the 7th International Conference on Predictive Models in Software Engineering, pp 1–10
12.
Zurück zum Zitat Lopez-Martin C (2022) Machine learning techniques for software testing effort prediction. Softw Qual J 30(1):65–100CrossRef Lopez-Martin C (2022) Machine learning techniques for software testing effort prediction. Softw Qual J 30(1):65–100CrossRef
13.
Zurück zum Zitat Mantovani RG, Rossi ALD, Vanschoren J, Bischl B, De Carvalho AC (2015) Effectiveness of random search in SVM hyperparameter tuning. In: In 2015 International Joint Conference on Neural Networks (IJCNN). IEEE, pp 1–8 Mantovani RG, Rossi ALD, Vanschoren J, Bischl B, De Carvalho AC (2015) Effectiveness of random search in SVM hyperparameter tuning. In: In 2015 International Joint Conference on Neural Networks (IJCNN). IEEE, pp 1–8
14.
Zurück zum Zitat Ma X, Zhang Y, Wang Y (2015) Performance evaluation of kernel functions based on grid search for support vector regression. In: 7th International Conference on Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics Automation and Mechatronics (RAM), Cambodia, pp 283–288 Ma X, Zhang Y, Wang Y (2015) Performance evaluation of kernel functions based on grid search for support vector regression. In: 7th International Conference on Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics Automation and Mechatronics (RAM), Cambodia, pp 283–288
15.
Zurück zum Zitat Priya Varshini AG, Anitha Kumari K, Varadarajan V (2021) Estimating software development efforts using a random forest-based stacked ensemble approach. Electronics 10(10):1–19 Priya Varshini AG, Anitha Kumari K, Varadarajan V (2021) Estimating software development efforts using a random forest-based stacked ensemble approach. Electronics 10(10):1–19
16.
Zurück zum Zitat Sakhrawi Z, Sellami A, Bouassida N (2022) Support vector regression for enhancement effort prediction of Scrum projects from COSMIC functional size. Innov Syst Softw Eng 18(1):137–153CrossRef Sakhrawi Z, Sellami A, Bouassida N (2022) Support vector regression for enhancement effort prediction of Scrum projects from COSMIC functional size. Innov Syst Softw Eng 18(1):137–153CrossRef
17.
Zurück zum Zitat Idri A, Hosni M, Abran A (2016) Systematic literature review of ensemble effort estimation. J Syst Softw 118:151–175CrossRef Idri A, Hosni M, Abran A (2016) Systematic literature review of ensemble effort estimation. J Syst Softw 118:151–175CrossRef
18.
Zurück zum Zitat Xie R, Li J, Wang J, Dai W, Leier A, Marquez-Lago TT, Akutsu T, Lithgow T, Song J, Zhang Y (2021) DeepVF: a deep learning-based hybrid framework for identifying virulence factors using the stacking strategy. Brief Bioinform 22(3):bbaa125CrossRef Xie R, Li J, Wang J, Dai W, Leier A, Marquez-Lago TT, Akutsu T, Lithgow T, Song J, Zhang Y (2021) DeepVF: a deep learning-based hybrid framework for identifying virulence factors using the stacking strategy. Brief Bioinform 22(3):bbaa125CrossRef
19.
Zurück zum Zitat Abnane I, Idri A, Hosni M, Abran A (2021) Heterogeneous ensemble imputation for software development effort estimation. In: Proceedings PROMISE the 17th International Conference on Predictive Models and Data Analytics in Software Engineering, Athens Greece, pp 1–10 Abnane I, Idri A, Hosni M, Abran A (2021) Heterogeneous ensemble imputation for software development effort estimation. In: Proceedings PROMISE the 17th International Conference on Predictive Models and Data Analytics in Software Engineering, Athens Greece, pp 1–10
20.
Zurück zum Zitat Sampath Kumar P, Venkatesan R (2021) Improving accuracy of software estimation using stacking ensemble method. In: Advances in Machine Learning and Computational Intelligence. Springer, pp 219–227 Sampath Kumar P, Venkatesan R (2021) Improving accuracy of software estimation using stacking ensemble method. In: Advances in Machine Learning and Computational Intelligence. Springer, pp 219–227
21.
Zurück zum Zitat Sakhrawi Z, Sellami A, Bouassida N (2021) Software enhancement effort estimation using correlation-based feature selection and stacking ensemble method. Cluster Comput 25(4):2779–2792CrossRef Sakhrawi Z, Sellami A, Bouassida N (2021) Software enhancement effort estimation using correlation-based feature selection and stacking ensemble method. Cluster Comput 25(4):2779–2792CrossRef
22.
Zurück zum Zitat Sakhrawi Z, Sellami A, Bouassida N (2022) Software enhancement effort estimation using stacking ensemble model within the scrum projects: a proposed web interface. In: Proceedings of the 17th International Conference on Software Technologies ICSOFT, Lisbon, Portugal, pp 91–100 Sakhrawi Z, Sellami A, Bouassida N (2022) Software enhancement effort estimation using stacking ensemble model within the scrum projects: a proposed web interface. In: Proceedings of the 17th International Conference on Software Technologies ICSOFT, Lisbon, Portugal, pp 91–100
23.
Zurück zum Zitat Hosni M, Idri A, Abran A, Bou Nassif A (2018) On the value of parameter tuning in heterogeneous ensembles effort estimation. Soft Comput 22(18):5977–6010CrossRef Hosni M, Idri A, Abran A, Bou Nassif A (2018) On the value of parameter tuning in heterogeneous ensembles effort estimation. Soft Comput 22(18):5977–6010CrossRef
24.
Zurück zum Zitat Mehta S, Patnaik KS (2021) Improved prediction of software defects using ensemble machine learning techniques. Neural Comput Appl 33:10551–10562CrossRef Mehta S, Patnaik KS (2021) Improved prediction of software defects using ensemble machine learning techniques. Neural Comput Appl 33:10551–10562CrossRef
26.
Zurück zum Zitat Fonseca CM, Fleming PJ (1993) Genetic algorithms for multiobjective optimization: formulation discussion and generalization. In: Proceedings of the 5th International Conference on Genetic Algorithms, pp 416–423 Fonseca CM, Fleming PJ (1993) Genetic algorithms for multiobjective optimization: formulation discussion and generalization. In: Proceedings of the 5th International Conference on Genetic Algorithms, pp 416–423
27.
Zurück zum Zitat Geetha U, Sankar S (2022) Multi-objective modified particle swarm optimization for test suite reduction (MOMPSO). Comput Syst Sci Eng 42(3):899–917CrossRef Geetha U, Sankar S (2022) Multi-objective modified particle swarm optimization for test suite reduction (MOMPSO). Comput Syst Sci Eng 42(3):899–917CrossRef
28.
Zurück zum Zitat Bergstra J, Bengio Y (2012) Random search for hyperparameter optimization. Mach Learn Res 13:281–305MathSciNetMATH Bergstra J, Bengio Y (2012) Random search for hyperparameter optimization. Mach Learn Res 13:281–305MathSciNetMATH
29.
Zurück zum Zitat Bacanin N, Stoean C, Zivkovic M, Rakic M, Strulak-Wójcikiewicz R, Stoean R (2023) On the benefits of using metaheuristics in the hyperparameter tuning of deep learning models for energy load forecasting. Energies 16(3):1434CrossRef Bacanin N, Stoean C, Zivkovic M, Rakic M, Strulak-Wójcikiewicz R, Stoean R (2023) On the benefits of using metaheuristics in the hyperparameter tuning of deep learning models for energy load forecasting. Energies 16(3):1434CrossRef
30.
Zurück zum Zitat Song L, Minku L, Yao X (2013) The impact of parameter tuning on software effort estimation using learning machines. In: Proceedings of the 9th International Conference on Predictive Models in Software Engineering. PROMISE, New York, United States, pp 1–10 Song L, Minku L, Yao X (2013) The impact of parameter tuning on software effort estimation using learning machines. In: Proceedings of the 9th International Conference on Predictive Models in Software Engineering. PROMISE, New York, United States, pp 1–10
31.
Zurück zum Zitat Kumar SP, Sekhar BH, Anisha K, Janmenjoy N, Bighnaraj N (2020) Advancement from neural networks to deep learning in software effort estimation: perspective of two decades. Comput Sci Rev 38:100288MathSciNetCrossRef Kumar SP, Sekhar BH, Anisha K, Janmenjoy N, Bighnaraj N (2020) Advancement from neural networks to deep learning in software effort estimation: perspective of two decades. Comput Sci Rev 38:100288MathSciNetCrossRef
33.
34.
Zurück zum Zitat Kantardzic M (2011) Data mining: concepts, models, methods, and algorithms. Wiley, HobokenCrossRefMATH Kantardzic M (2011) Data mining: concepts, models, methods, and algorithms. Wiley, HobokenCrossRefMATH
35.
Zurück zum Zitat Nassif BA, Mohammad A, Fernando CL, Danny H (2013) A comparison between decision trees and decision tree forest models for software development effort estimation. In: 2013 Third International Conference on Communications and Information Technology (ICCIT). IEEE, pp 220–224 Nassif BA, Mohammad A, Fernando CL, Danny H (2013) A comparison between decision trees and decision tree forest models for software development effort estimation. In: 2013 Third International Conference on Communications and Information Technology (ICCIT). IEEE, pp 220–224
36.
Zurück zum Zitat Shepperd M, MacDonell S (2012) Evaluating prediction systems in software project estimation. Inf Softw Technol 54(8):820–827CrossRef Shepperd M, MacDonell S (2012) Evaluating prediction systems in software project estimation. Inf Softw Technol 54(8):820–827CrossRef
37.
Zurück zum Zitat Gautam SS, Singh V (2018) The state-of-the-art in software development effort estimation. J Softw Evol Process 30(12):e1983CrossRef Gautam SS, Singh V (2018) The state-of-the-art in software development effort estimation. J Softw Evol Process 30(12):e1983CrossRef
38.
Zurück zum Zitat Labidi T, Sakhrawi Z, Sellami A, Mtibaa A, Bouassida N (2022) On the use of OLS regression algorithm and Pearson correlation algorithm for improving the SLA establishment process in cloud computing. Innov Syst Softw Eng 18(1):215–229CrossRef Labidi T, Sakhrawi Z, Sellami A, Mtibaa A, Bouassida N (2022) On the use of OLS regression algorithm and Pearson correlation algorithm for improving the SLA establishment process in cloud computing. Innov Syst Softw Eng 18(1):215–229CrossRef
Metadaten
Titel
On the value of parameter tuning in stacking ensemble model for software regression test effort estimation
verfasst von
Taher Labidi
Zaineb Sakhrawi
Publikationsdatum
06.05.2023
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 15/2023
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-023-05334-9

Weitere Artikel der Ausgabe 15/2023

The Journal of Supercomputing 15/2023 Zur Ausgabe

Premium Partner