Top

The Journal of Supercomputing

Published in:

06-05-2023

On the value of parameter tuning in stacking ensemble model for software regression test effort estimation

Authors: Taher Labidi, Zaineb Sakhrawi

Published in: The Journal of Supercomputing | Issue 15/2023

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

A type of software testing, regression testing is often costly and labour-intensive. As such, multiple corporations have intensified efforts to estimate the amount of effort required. However, frequent alterations in software projects impact the precision of software regression test effort estimation (SRTEE), which increases the difficulty of managing software projects. Therefore, machine learning (ML) has increasingly been used to develop more accurate SRTEEs. The estimation process of a software project comprises inputs, the model, and outputs. This present study examines the quality of estimation inputs and the model required to deliver accurate estimation outputs. An SRTEE that uses the stacking ensemble model (StackSRTEE) was developed to increase the precision of SRTEE. It consisted of the three most common ML methods, namely neural networks, support vector regression, and decision tree regression. The grid search (GS) technique was then used to tune the hyperparameters of the StackSRTEE before it was trained and tested using a dataset from the International Software Benchmarking Standards Group (ISBSG) repository. The size of the functional change; specifically, enhancement; was used as the primary independent variable to improve the inputs of the StackSRTEE model. With the appropriate features; such as the functional change size of an enhancement; (1) the proposed StackSRTEE model yielded higher accuracy than the three individual ML methods on their own, (2) using GS to tune and set the individual ML methods increased the precision of the SRTEE outputs, and (3) the StackSRTEE-based GS tuning yielded estimations that were more precise.

previous article Computation offloading and heterogeneous task caching in MEC-enabled vehicular networks

next article HFBO-KSELM: Hybrid Flash Butterfly Optimization-based Kernel Softplus Extreme Learning Machine for Classification of Chronic Kidney Disease

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

https://scikit-learn.org/.../sklearn.ensemble.StackingRegressor.html.

sklearn.metrics.r2_score.html.

https://www.investopedia.com/terms/p/positive-correlation.asp.

Chittimalli P, Harrold M (2008) Test selection on system requirements. In: ISEC, Proceedings of the 1st conference on India software engineering conference, vol 56, no 2, pp 87-96

Minhas NM, Petersen K, Börstler J, Wnuk K (2020) Regression testing for large-scale embedded software development-exploring the state of practice. Inf Softw Technol 120:106254CrossRef

Hron M, Obwegeser N (2020) Why and how is Scrum being adapted in practice: a systematic review. J Syst Softw 183:111110CrossRef

Ali SS, Zafar MS, Saeed MT (2020) Effort estimation problems in software maintenance—a survey. In: 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET), pp 1–9

Sharma A, Kushwaha DS (2012) Applying requirement based complexity for the estimation of software development and testing effort. ACM SIGSOFT Softw Eng Notes 73(1):1–11

Nguyen V, Pham V, Lam V (2013) qEstimation: a process for estimating size and effort of software testing. In: Proceedings of the 2013 International Conference on Software and System Process ICSSP, San Francisco CA, USA, pp 20–28,

Kamaraj K, Lanitha B, Karthic S, Senthil Prakash PN, Mahaveerakannan R (2023) A hybridized artificial neural network for automated software test oracle. Comput Syst Sci Eng 45(2):1837–1850CrossRef

Kaur A, Kaur K (2019) Investigation on test effort estimation of mobile applications: systematic literature review and survey. Inf Softw Technol 110:56–77CrossRef

Sakhrawi Z, Sellami A, Bouassida N (2020) Software enhancement effort estimation using machine learning regression methods. Int J Comput Inf Syst Ind Manag Appl 12:412–423

10.

Wen J, Li S, Lin Z, Hu Y, Huang C (2012) Systematic literature review of machine learning based software development effort estimation models. Inf Softw Technol 54(1):41–59CrossRef

11.

Minku LL, Yao X (2011) A principled evaluation of ensembles of learning machines for software effort estimation. In: Proceedings of the 7th International Conference on Predictive Models in Software Engineering, pp 1–10

12.

Lopez-Martin C (2022) Machine learning techniques for software testing effort prediction. Softw Qual J 30(1):65–100CrossRef

13.

Mantovani RG, Rossi ALD, Vanschoren J, Bischl B, De Carvalho AC (2015) Effectiveness of random search in SVM hyperparameter tuning. In: In 2015 International Joint Conference on Neural Networks (IJCNN). IEEE, pp 1–8

14.

Ma X, Zhang Y, Wang Y (2015) Performance evaluation of kernel functions based on grid search for support vector regression. In: 7th International Conference on Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics Automation and Mechatronics (RAM), Cambodia, pp 283–288

15.

Priya Varshini AG, Anitha Kumari K, Varadarajan V (2021) Estimating software development efforts using a random forest-based stacked ensemble approach. Electronics 10(10):1–19

16.

Sakhrawi Z, Sellami A, Bouassida N (2022) Support vector regression for enhancement effort prediction of Scrum projects from COSMIC functional size. Innov Syst Softw Eng 18(1):137–153CrossRef

17.

Idri A, Hosni M, Abran A (2016) Systematic literature review of ensemble effort estimation. J Syst Softw 118:151–175CrossRef

18.

Xie R, Li J, Wang J, Dai W, Leier A, Marquez-Lago TT, Akutsu T, Lithgow T, Song J, Zhang Y (2021) DeepVF: a deep learning-based hybrid framework for identifying virulence factors using the stacking strategy. Brief Bioinform 22(3):bbaa125CrossRef

19.

Abnane I, Idri A, Hosni M, Abran A (2021) Heterogeneous ensemble imputation for software development effort estimation. In: Proceedings PROMISE the 17th International Conference on Predictive Models and Data Analytics in Software Engineering, Athens Greece, pp 1–10

20.

Sampath Kumar P, Venkatesan R (2021) Improving accuracy of software estimation using stacking ensemble method. In: Advances in Machine Learning and Computational Intelligence. Springer, pp 219–227

21.

Sakhrawi Z, Sellami A, Bouassida N (2021) Software enhancement effort estimation using correlation-based feature selection and stacking ensemble method. Cluster Comput 25(4):2779–2792CrossRef

22.

Sakhrawi Z, Sellami A, Bouassida N (2022) Software enhancement effort estimation using stacking ensemble model within the scrum projects: a proposed web interface. In: Proceedings of the 17th International Conference on Software Technologies ICSOFT, Lisbon, Portugal, pp 91–100

23.

Hosni M, Idri A, Abran A, Bou Nassif A (2018) On the value of parameter tuning in heterogeneous ensembles effort estimation. Soft Comput 22(18):5977–6010CrossRef

24.

Mehta S, Patnaik KS (2021) Improved prediction of software defects using ensemble machine learning techniques. Neural Comput Appl 33:10551–10562CrossRef

25.

Claesen M, De Moor B (2015) Hyperparameter search in machine learning. arXiv preprint arXiv:1502.02127,

26.

Fonseca CM, Fleming PJ (1993) Genetic algorithms for multiobjective optimization: formulation discussion and generalization. In: Proceedings of the 5th International Conference on Genetic Algorithms, pp 416–423

27.

Geetha U, Sankar S (2022) Multi-objective modified particle swarm optimization for test suite reduction (MOMPSO). Comput Syst Sci Eng 42(3):899–917CrossRef

28.

Bergstra J, Bengio Y (2012) Random search for hyperparameter optimization. Mach Learn Res 13:281–305MathSciNetMATH

29.

Bacanin N, Stoean C, Zivkovic M, Rakic M, Strulak-Wójcikiewicz R, Stoean R (2023) On the benefits of using metaheuristics in the hyperparameter tuning of deep learning models for energy load forecasting. Energies 16(3):1434CrossRef

30.

Song L, Minku L, Yao X (2013) The impact of parameter tuning on software effort estimation using learning machines. In: Proceedings of the 9th International Conference on Predictive Models in Software Engineering. PROMISE, New York, United States, pp 1–10

31.

Kumar SP, Sekhar BH, Anisha K, Janmenjoy N, Bighnaraj N (2020) Advancement from neural networks to deep learning in software effort estimation: perspective of two decades. Comput Sci Rev 38:100288MathSciNetCrossRef

32.

Haykin S (2009) Neural networks and learning machines, 3rd edn. Pearson, [Online]. Available: https://www.amazon.com/Neural-Networks-Learning-Machines-3rd/dp/0131471392

33.

Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297CrossRefMATH

34.

Kantardzic M (2011) Data mining: concepts, models, methods, and algorithms. Wiley, HobokenCrossRefMATH

35.

Nassif BA, Mohammad A, Fernando CL, Danny H (2013) A comparison between decision trees and decision tree forest models for software development effort estimation. In: 2013 Third International Conference on Communications and Information Technology (ICCIT). IEEE, pp 220–224

36.

Shepperd M, MacDonell S (2012) Evaluating prediction systems in software project estimation. Inf Softw Technol 54(8):820–827CrossRef

37.

Gautam SS, Singh V (2018) The state-of-the-art in software development effort estimation. J Softw Evol Process 30(12):e1983CrossRef

38.

Labidi T, Sakhrawi Z, Sellami A, Mtibaa A, Bouassida N (2022) On the use of OLS regression algorithm and Pearson correlation algorithm for improving the SLA establishment process in cloud computing. Innov Syst Softw Eng 18(1):215–229CrossRef

Title: On the value of parameter tuning in stacking ensemble model for software regression test effort estimation
Authors: Taher Labidi
Zaineb Sakhrawi
Publication date: 06-05-2023
Publisher: Springer US
Published in: The Journal of Supercomputing / Issue 15/2023
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI: https://doi.org/10.1007/s11227-023-05334-9

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Other articles of this Issue 15/2023

Mathematical modeling and performance evaluation of BeRAN for 6G wireless networks

AVS_FD_MVITS: an agile IT service design workflow for small data centers

External-attention dual-modality fusion network for RGBT tracking

DewGame: D2D communication enabled dew computing for 5G IoT using coalition formation game

Dynamic resource allocation scheme for mobile edge computing

The short-term prediction of daily traffic volume for rural roads using shallow and deep learning networks: ANN and LSTM

Premium Partner