Skip to main content
Erschienen in: Empirical Software Engineering 1-2/2012

01.02.2012

A permutation test based on regression error characteristic curves for software cost estimation models

verfasst von: Nikolaos Mittas, Lefteris Angelis

Erschienen in: Empirical Software Engineering | Ausgabe 1-2/2012

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Background

Regression Error Characteristic (REC) curves provide a visualization tool, able to characterize graphically the prediction power of alternative predictive models. Due to the benefits of using such a visualization description of the whole distribution of error, REC analysis was recently introduced in software cost estimation to aid the decision of choosing the most appropriate cost estimation model during the management of a forthcoming project.

Aims

Although significant information can be retrieved from a readable graph, REC curves are not able to assess whether the divergences between the alternative error functions can constitute evidence for a statistically significant difference.

Method

In this paper, we propose a graphical procedure that utilizes (a) the process of repetitive permutations and (b) and the maximum vertical deviation between two comparative Regression Error Characteristic curves in order to conduct a hypothesis test for assessing the statistical significance of error functions.

Results

In our case studies, the data used come from software projects and the models compared are cost prediction models. The results clearly showed that the proposed statistical test is necessary in order to assess the significance of the superiority of a prediction model, since it provides an objective criterion for the distances between the REC curves. Moreover, the procedure can be easily applied to any dataset where the objective is the prediction of a response variable of interest and the comparison of alternative prediction techniques in order to select the best strategy.

Conclusions

The proposed hypothesis test, accompanying an informative graphical tool, is more easily interpretable than the conventional parametric and non-parametric statistical procedures. Moreover, it is free from normality assumptions of the error distributions when the samples are small-sized and highly skewed. Finally, the proposed graphical test can be applied to the comparisons of any alternative prediction methods and models and also to any other validation procedure.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Bi J, Bennet KP (2003) Regression Error Characteristics Curves. Proceedings of the AIII 20th International Conference on Machine Learning (ICML’03), August, 43–50 Bi J, Bennet KP (2003) Regression Error Characteristics Curves. Proceedings of the AIII 20th International Conference on Machine Learning (ICML’03), August, 43–50
Zurück zum Zitat Briand L, Langley T, Wieczorek I (2000) A replicated assessment and comparison of common software cost modeling techniques. Proceedings of the IEEE International Conference Software Engineering (ICSE 22), 377–386 Briand L, Langley T, Wieczorek I (2000) A replicated assessment and comparison of common software cost modeling techniques. Proceedings of the IEEE International Conference Software Engineering (ICSE 22), 377–386
Zurück zum Zitat Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH
Zurück zum Zitat Efron B, Tibshirani RJ (1993) An introduction to the bootstrap. Chapman & Hall. Efron B, Tibshirani RJ (1993) An introduction to the bootstrap. Chapman & Hall.
Zurück zum Zitat Egan JP (1975) Signal detection theory and ROC analysis. Series in Cognition and Perception, Academic Press. Egan JP (1975) Signal detection theory and ROC analysis. Series in Cognition and Perception, Academic Press.
Zurück zum Zitat Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27:861–874CrossRef Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27:861–874CrossRef
Zurück zum Zitat Foss T, Stensrud E, Kitchenham B, Myrtveit I (2003) A simulation study of the model evaluation criterion MMRE. IEEE Trans Software Eng 29(11):985–995CrossRef Foss T, Stensrud E, Kitchenham B, Myrtveit I (2003) A simulation study of the model evaluation criterion MMRE. IEEE Trans Software Eng 29(11):985–995CrossRef
Zurück zum Zitat Good IG (2000) Permutation tests: a practical guide to resampling methods for testing hypotheses. Springer Series in Statistics 2nd edition Good IG (2000) Permutation tests: a practical guide to resampling methods for testing hypotheses. Springer Series in Statistics 2nd edition
Zurück zum Zitat Green DM, Swets JM (1966) Signal detection theory and psychophysics. Wiley, New York Green DM, Swets JM (1966) Signal detection theory and psychophysics. Wiley, New York
Zurück zum Zitat Jeffery R, Ruhe M, Wieczorek I (2001) Using public domain metrics to estimate software development effort. In: Proceedings of the IEEE 7th International Software Metrics Symposium (METRICS 2001), April, 16–27 Jeffery R, Ruhe M, Wieczorek I (2001) Using public domain metrics to estimate software development effort. In: Proceedings of the IEEE 7th International Software Metrics Symposium (METRICS 2001), April, 16–27
Zurück zum Zitat Jorgensen M, Shepperd M (2007) A systematic review of software development cost estimation studies. IEEE Trans Software Eng 33(1):33–53CrossRef Jorgensen M, Shepperd M (2007) A systematic review of software development cost estimation studies. IEEE Trans Software Eng 33(1):33–53CrossRef
Zurück zum Zitat Kitchenham B, Mendes T (2004) A comparison of cross-company and within-company effort estimation models for web applications. In: Proceedings of the Empirical Assessment in Software Engineering (EASE), 47–55. Kitchenham B, Mendes T (2004) A comparison of cross-company and within-company effort estimation models for web applications. In: Proceedings of the Empirical Assessment in Software Engineering (EASE), 47–55.
Zurück zum Zitat Kitchenham B, Mendes T (2009) Why comparative effort prediction studies may be invalid. In: Proceedings of the ACM 5th International Conference on Predictor Models in Software Engineering, May Kitchenham B, Mendes T (2009) Why comparative effort prediction studies may be invalid. In: Proceedings of the ACM 5th International Conference on Predictor Models in Software Engineering, May
Zurück zum Zitat Kitchenham B, Pfleeger SL, McColl B, Eagan S (2002) An empirical study of maintenance and development accuracy. J Syst Softw 64:57–77CrossRef Kitchenham B, Pfleeger SL, McColl B, Eagan S (2002) An empirical study of maintenance and development accuracy. J Syst Softw 64:57–77CrossRef
Zurück zum Zitat Kitchenham B, Pickard L, MacDonell S, Shepperd M (2001) What accuracy statistics really measure. IEE Proc Software 148(3):81–85CrossRef Kitchenham B, Pickard L, MacDonell S, Shepperd M (2001) What accuracy statistics really measure. IEE Proc Software 148(3):81–85CrossRef
Zurück zum Zitat Korte M, Port D (2008) Confidence in software cost estimation results based on mmre and pred. In Proceedings of the 4th ACM International Workshop on Predictor Models in Software Engineering, 63–70 Korte M, Port D (2008) Confidence in software cost estimation results based on mmre and pred. In Proceedings of the 4th ACM International Workshop on Predictor Models in Software Engineering, 63–70
Zurück zum Zitat Liebchen G, Shepperd M (2008) Data sets and data quality in software engineering. Proceedings of the 4th ACM International Workshop on Predictor Models in Software Engineering, 39–44 Liebchen G, Shepperd M (2008) Data sets and data quality in software engineering. Proceedings of the 4th ACM International Workshop on Predictor Models in Software Engineering, 39–44
Zurück zum Zitat Lusted LB (1978) General problems in medical decision making with comments on ROC analysis. Semin Nucl Med 8(4):299–306CrossRef Lusted LB (1978) General problems in medical decision making with comments on ROC analysis. Semin Nucl Med 8(4):299–306CrossRef
Zurück zum Zitat Obuchowski NA (2003) Receiver operating characteristic curves and their use in radiology. Radiology 229(1):3–8CrossRef Obuchowski NA (2003) Receiver operating characteristic curves and their use in radiology. Radiology 229(1):3–8CrossRef
Zurück zum Zitat Port D, Korte M (2008) Comparative studies of the model evaluation criterions mmre and pred in software cost estimation research. In Proceedings of the ACM-IEEE 2nd International Symposium on Empirical Software Engineering and Management, 51–60 Port D, Korte M (2008) Comparative studies of the model evaluation criterions mmre and pred in software cost estimation research. In Proceedings of the ACM-IEEE 2nd International Symposium on Empirical Software Engineering and Management, 51–60
Zurück zum Zitat Mair C, Shepperd M (2005) The consistency of empirical comparisons of regression and analogy-based software project cost prediction. In: Proceedings of the International Symposium on Empirical Software Engineering (ISESE’05), November, 509–518 Mair C, Shepperd M (2005) The consistency of empirical comparisons of regression and analogy-based software project cost prediction. In: Proceedings of the International Symposium on Empirical Software Engineering (ISESE’05), November, 509–518
Zurück zum Zitat Mendes E, Di Martino S, Ferrucci F, Gravino C (2008) Cross-company vs. single-company web effort models using the Tukutuku database: an extended study. J Syst Softw 81(5):673–690CrossRef Mendes E, Di Martino S, Ferrucci F, Gravino C (2008) Cross-company vs. single-company web effort models using the Tukutuku database: an extended study. J Syst Softw 81(5):673–690CrossRef
Zurück zum Zitat Metz CE (1978) Basic principles of ROC analysis. Semin Nucl Med 8(4):283–298CrossRef Metz CE (1978) Basic principles of ROC analysis. Semin Nucl Med 8(4):283–298CrossRef
Zurück zum Zitat Mittas N, Angelis L (2008a) Comparing cost prediction models by resampling techniques. J Syst Softw 81(5):616–632CrossRef Mittas N, Angelis L (2008a) Comparing cost prediction models by resampling techniques. J Syst Softw 81(5):616–632CrossRef
Zurück zum Zitat Mittas N, Angelis L, (2008b) Comparing software cost prediction models by a visualization tool. In Proceedings of the IEEE 34th Euromicro Conference on Software Engineering and Advanced Applications (SEAA’08), 433–440 Mittas N, Angelis L, (2008b) Comparing software cost prediction models by a visualization tool. In Proceedings of the IEEE 34th Euromicro Conference on Software Engineering and Advanced Applications (SEAA’08), 433–440
Zurück zum Zitat Mittas N, Angelis L (2010a) Visual comparison of software cost estimation models by regression error characteristic analysis. J Syst Softw 83:621–637CrossRef Mittas N, Angelis L (2010a) Visual comparison of software cost estimation models by regression error characteristic analysis. J Syst Softw 83:621–637CrossRef
Zurück zum Zitat Mittas N, Angelis L (2010b) LSEbA: least squares regression and estimation by analogy in a semi-parametric model for software cost estimation. Mittas N, Angelis L (2010b) LSEbA: least squares regression and estimation by analogy in a semi-parametric model for software cost estimation.
Zurück zum Zitat Mittas N, Athanasiades M, Angelis L (2008) Improving analogy-based software cost estimation by a resampling method. Inform Softw Technol 50(3):221–230CrossRef Mittas N, Athanasiades M, Angelis L (2008) Improving analogy-based software cost estimation by a resampling method. Inform Softw Technol 50(3):221–230CrossRef
Zurück zum Zitat Miyazaki Y, Terakado M, Ozaki K, Nozaki H (1994) Robust regression for developing software estimation models. J Syst Softw 27:3–16CrossRef Miyazaki Y, Terakado M, Ozaki K, Nozaki H (1994) Robust regression for developing software estimation models. J Syst Softw 27:3–16CrossRef
Zurück zum Zitat Moore DS, McCabe GP, Duckworth WD II, Sclove SL (2003) The practice of business statistics. Freeman, New York Moore DS, McCabe GP, Duckworth WD II, Sclove SL (2003) The practice of business statistics. Freeman, New York
Zurück zum Zitat Myrtveit I, Stensrud E, Shepperd M (2005) Reliability and validity in comparative studies of software prediction models. IEEE Trans Software Eng 31(5):380–391CrossRef Myrtveit I, Stensrud E, Shepperd M (2005) Reliability and validity in comparative studies of software prediction models. IEEE Trans Software Eng 31(5):380–391CrossRef
Zurück zum Zitat Sheskin DJ (2004) Handbook of parametric and nonparametric statistical procedures, third ed. Chapman & Hall/CRC Sheskin DJ (2004) Handbook of parametric and nonparametric statistical procedures, third ed. Chapman & Hall/CRC
Zurück zum Zitat Šidàk Z (1967) Rectangular confidence region for the means of multivariate normal distributions. J Am Stat Assoc 62:626–633MATHCrossRef Šidàk Z (1967) Rectangular confidence region for the means of multivariate normal distributions. J Am Stat Assoc 62:626–633MATHCrossRef
Zurück zum Zitat Stensrud E, Myrveit I (1998) Human performance estimating with analogy and regression models: An empirical validation. In: Proceedings of the IEEE 5th International Software Metrics Symposium (Metrics’98), CA, 205–213 Stensrud E, Myrveit I (1998) Human performance estimating with analogy and regression models: An empirical validation. In: Proceedings of the IEEE 5th International Software Metrics Symposium (Metrics’98), CA, 205–213
Zurück zum Zitat Zweig MH, Campbell G (1993) Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem 39(8):561–577 Zweig MH, Campbell G (1993) Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem 39(8):561–577
Metadaten
Titel
A permutation test based on regression error characteristic curves for software cost estimation models
verfasst von
Nikolaos Mittas
Lefteris Angelis
Publikationsdatum
01.02.2012
Verlag
Springer US
Erschienen in
Empirical Software Engineering / Ausgabe 1-2/2012
Print ISSN: 1382-3256
Elektronische ISSN: 1573-7616
DOI
https://doi.org/10.1007/s10664-011-9177-5

Weitere Artikel der Ausgabe 1-2/2012

Empirical Software Engineering 1-2/2012 Zur Ausgabe

Premium Partner