ABSTRACT
This study investigates to what extent Web effort estimation models built using cross-company data sets can provide suitable effort estimates for Web projects belonging to another company, when compared to Web effort estimates obtained using that company's own data on their past projects (single-company data set). It extends a previous study (S3) where these same research questions were investigated using data on 67 Web projects from the Tukutuku database. Since S3 was carried out, data on other 128 Web projects was added to Tukutuku; therefore this study uses the entire set of 195 projects from the Tukutuku database, which now also includes new data from other single-company data sets. Predictions between cross-company and single-company models are compared using Manual Stepwise Regression+Linear Regression and Case-Based Reasoning. In addition, we also investigated to what extent applying a filtering mechanism to cross-company datasets prior to building prediction models can affect the accuracy of the effort estimates they provide. The present study corroborates the conclusions of S3 since the cross-company models provided much worse predictions than the single-company models. Moreover, the use of the filtering mechanism significantly improved the prediction accuracy of cross-company models when estimating single-company projects, making it comparable to that using single-company datasets.
- Conte, S. D., Dunsmore, H. E., Shen, V. Y., 1986. Software Engineering Metrics and Models. Benjamin-Cummins. Google ScholarDigital Library
- Turhan, B., Menzies, T., Bener, A., Di Stefano, J., 2009. On the relative value of cross-company and within-company data for defect prediction. ESE, 14: 540--578. Google ScholarDigital Library
- Cohen, J., 1988. Statistical power analysis for the behavioral sciences. Lawrence Earlbaum Ass., 2nd edition.Google Scholar
- Cook, R. D., 1977. Detection of influential observations in linear regression. Technometrics, 19, 15--18.Google Scholar
- Kitchenham B., 1998. A Procedure for Analyzing Unbalanced Datasets. IEEE TSE, 278--301. Google ScholarDigital Library
- Kitchenham, B. A., Pickard, L., Peeger, S. 1995. Case studies for method and tool evaluation. IEEE Software 12(4): 52--62. Google ScholarDigital Library
- Kitchenham, B., Pickard, L. M., MacDonell, S. G., Shepperd, M. J, 2001. What accuracy statistics really measure. IEE Proceedings Software 148 (3) 81--85.Google ScholarCross Ref
- Kitchenham, B., Mendes, E., 2004. A Comparison of Cross-company and Single-company Effort Estimation Models for Web Applications, in Procs. of EASE, 47--55.Google Scholar
- Kitchenham B. A., Mendes, M., Travassos, G. H., 2007. Cross versus Within-Company Cost Estimation Studies: A Systematic Review. IEEE TSE 33(5): 316--329. Google ScholarDigital Library
- Kocaguneli, E., Gay, G., Menzies, T., Yang, Y., Keung, J. W., 2010. When to use data from other projects for effort estimation. In Procs. of ASE, 321--324 Google ScholarDigital Library
- Kocaguneli, E., Menzies, T., 2011. How to Find Relevant Data for Effort Estimation? In Procs of ESEM, 255--264. Google ScholarDigital Library
- Kocaguneli, E., Menzies, T., Bener, A. B., Keung, J. W., 2012. Exploiting the Essential Assumptions of Analogy-Based Effort Estimation. IEEE TSE 425--438. Google ScholarDigital Library
- Li, Y. F., Xie, M., Goh, T. N., 2009. A Study of Project Selection and Feature Weighting for Analogy based Software Cost Estimation. JSS, 82(2), 241--252. Google ScholarDigital Library
- Maxwell, K., 2002. Applied Statistics for Software Managers. Software Quality Institute Series, Prentice Hall.Google Scholar
- Mendes, E., 2008. Web Cost Estimation and Productivity Benchmarking. ISSSE: 194--222.Google Scholar
- Mendes, E., Mosley, N., Counsell, S., 2002. Comparison of Length, complexity and functionality as size measures for predicting Web design and authoring effort, IEE Procs. Software 149 (3), 86--92.Google ScholarCross Ref
- Mendes, E., Mosley, N., Counsell, S., 2003. Investigating early web size measures for web cost estimation. In Procs. Evaluation and Assessment in Software Engineering, 1--22.Google Scholar
- Mendes, E., Mosley, N. and Counsell, S., 2005. The Need for Web Engineering: An Introduction. Web Engineering, Springer-Verlag, Mendes, E. and Mosley, N. (Eds.)Google Scholar
- Mendes, E., Mosley, N., Counsell, S. 2005. Investigating Web Size Metrics for Early Web Cost Estimation. JSS, 77(2), 157--172. Google ScholarDigital Library
- Mendes, E., Kitchenham B. A., 2004. Further Comparison of Cross-Company and Within Company Effort Estimation Models for Web Applications. In Procs of METRICS, IEEE Computer Society, 348--357. Google ScholarDigital Library
- Mendes, E., Di Martino, S., Ferrucci, F., Gravino, C., 2007. Effort Estimation: How Valuable is it for a Web company to Use a Cross-company Data Set, Compared to Using Its Own Single-company Data Set?. In Procs of WWW07, 963--972. Google ScholarDigital Library
- Mendes, E., Di Martino, S., Ferrucci, F., Gravino, C., 2008. Cross-company vs. single-company web effort models using the Tukutuku database: An extended study. JSS 81, 673--690. Google ScholarDigital Library
- Menzies, T., Butcher, A., Marcus, A., Zimmermann, T., Cok, D. R., 2011. Local vs. global models for effort estimation and defect prediction. In Procs. of ASE, 343--351. Google ScholarDigital Library
- Shepperd, M. J., and G. Kadoda, 2001. Using Simulation to Evaluate Prediction Techniques, in Proceedings IEEE 7th Intl Software Metrics Symposium, London, UK, 349--358. Google ScholarDigital Library
Index Terms
- Web effort estimation: the value of cross-company data set compared to single-company data set
Recommendations
Effort estimation: how valuable is it for a web company to use a cross-company data set, compared to using its own single-company data set?
WWW '07: Proceedings of the 16th international conference on World Wide WebPrevious studies comparing the prediction accuracy of effort models built using Web cross- and single-company data sets have been inconclusive, and as such replicated studies are necessary to determine under what circumstances a company can place ...
A Comparison of Cross-Versus Single-Company Effort Prediction Models for Web Projects
SEAA '14: Proceedings of the 2014 40th EUROMICRO Conference on Software Engineering and Advanced ApplicationsBackground: In order to address the challenges in companies having no or limited effort datasets of their own, cross-company models have been a focus of interest for previous studies. Further, a particular domain of investigation has been Web projects. ...
Further Comparison of Cross-Company and Within-Company Effort Estimation Models for Web Applications
METRICS '04: Proceedings of the Software Metrics, 10th International SymposiumThis paper extends a previous study, using data on 67 Web projects from the Tukutuku database, investigating to what extent a cross-company cost model can be successfully employed to estimate effort for projects that belong to a single company, where no ...
Comments