Published in: The Journal of Supercomputing 15/2023

06-05-2023

On the value of parameter tuning in stacking ensemble model for software regression test effort estimation

Authors: Taher Labidi, Zaineb Sakhrawi

Abstract

Regression testing, a type of software testing, is often costly and labour-intensive. As a result, many corporations have stepped up their attempts to estimate the effort it requires. However, frequent changes in software projects reduce the precision of software regression test effort estimation (SRTEE), which makes software projects harder to manage. Machine learning (ML) has therefore increasingly been used to make SRTEE more accurate. The estimation process of a software project comprises inputs, the model, and outputs. This study examines the quality of the estimation inputs and of the model required to deliver accurate estimation outputs. An SRTEE approach based on a stacking ensemble model (StackSRTEE) was developed to increase the precision of SRTEE. It combines the three most common ML methods, namely neural networks, support vector regression, and decision tree regression. The grid search (GS) technique was used to tune the hyperparameters of StackSRTEE, which was then trained and tested on a dataset from the International Software Benchmarking Standards Group (ISBSG) repository. To improve the inputs of the StackSRTEE model, the size of the functional change (specifically, of an enhancement) was used as the primary independent variable. With appropriate features, such as the functional change size of an enhancement, (1) the proposed StackSRTEE model yielded higher accuracy than any of the three individual ML methods on its own, (2) using GS to tune the individual ML methods increased the precision of the SRTEE outputs, and (3) the GS-tuned StackSRTEE yielded more precise estimates.
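The setup described in the abstract can be sketched with scikit-learn as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data are synthetic stand-ins for the ISBSG records, the single feature plays the role of functional change size, and the linear meta-learner, the hyperparameter grid, the fold counts, and the mean absolute error metric are all choices made here for illustration.

```python
import numpy as np
from sklearn.ensemble import StackingRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in data (assumption): one feature for functional change
# size and a noisy, roughly linear effort target. Not ISBSG data.
rng = np.random.default_rng(0)
size = rng.uniform(10, 200, size=120)
effort = 3.0 * size + 20.0 + rng.normal(0.0, 15.0, size=120)
X = size.reshape(-1, 1)
X_train, X_test, y_train, y_test = train_test_split(
    X, effort, test_size=0.25, random_state=0
)

# Stacking ensemble over the three base learners named in the abstract.
# The neural network and SVR are wrapped in pipelines that scale the input.
stack = StackingRegressor(
    estimators=[
        ("nn", make_pipeline(
            StandardScaler(),
            MLPRegressor(solver="lbfgs", max_iter=500, random_state=0),
        )),
        ("svr", make_pipeline(StandardScaler(), SVR())),
        ("dt", DecisionTreeRegressor(random_state=0)),
    ],
    final_estimator=LinearRegression(),  # meta-learner: an assumption
    cv=3,  # folds used to build the meta-learner's training predictions
)

# Hypothetical grid: two candidate values per base learner (8 combinations).
param_grid = {
    "nn__mlpregressor__hidden_layer_sizes": [(4,), (8,)],
    "svr__svr__C": [1.0, 100.0],
    "dt__max_depth": [2, 4],
}

# Grid search evaluates every combination with cross-validation and
# refits the best one on the full training set.
search = GridSearchCV(
    stack, param_grid, scoring="neg_mean_absolute_error", cv=3
)
search.fit(X_train, y_train)

predictions = search.predict(X_test)
test_mae = mean_absolute_error(y_test, predictions)
print("Best hyperparameters:", search.best_params_)
print("Test MAE:", round(test_mae, 2))
```

Tuning the whole stack in one search, as above, lets the grid pick base-learner settings by how well the combined model performs. Tuning each base learner separately before stacking is an equally plausible reading of the abstract and would use one `GridSearchCV` per learner instead.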

Metadata
Title
On the value of parameter tuning in stacking ensemble model for software regression test effort estimation
Authors
Taher Labidi
Zaineb Sakhrawi
Publication date
06-05-2023
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 15/2023
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-023-05334-9
