ABSTRACT
Effort and cost estimation is crucial in software management. Estimation of software size plays a key role in the estimation. Line of Code (LOC) is still a commonly used software size measure. Despite the fact that software sizing is well recognized as an important problem for more than two decades, there is still much problem in existing methods. Conceptual data model is widely used in the requirements analysis for information systems. It is also not difficult to construct conceptual data models in the early stage of developing information systems. Much characteristic of an information system is actually reflected from its conceptual data model. We explore into the use of conceptual data model for estimating LOC. This paper proposes a novel method for estimating LOC for an information system from its conceptual data model through the use of multiple linear regression model. We have validated the method through collecting samples from both the industry and open-source systems.
- Albrecht, A. J., and Gaffney, J. E. Jr. Software function, source lines of code, and development effort prediction: a software science validation. IEEE Trans. Software Eng., vol. SE-9, no. 6, Nov. 1983, 639--648.Google ScholarDigital Library
- Armour, P. Ten unmyths of project estimation: reconsidering some commonly accepted project management practices. Comm. ACM 45,11( Nov. 2002), 15--18. Google ScholarDigital Library
- Belsley, D. A., Kuh, E., and Welsch, R. E. Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. John Wiley, New York, 2004.Google Scholar
- Blaha, M., and Premerlani, W. Object-Oriented Modeling and Design for Database Applications. Prentice Hall, 1998. Google ScholarDigital Library
- Boehm, B. W., and Fairley, R. E. Software estimation perspectives. IEEE Software, Nov./Dec. 2000, 22--26. Google ScholarDigital Library
- Boehm, B. W. et al. Software Cost Estimation with COCOMO II.Prentice Hall, 2000. Google ScholarDigital Library
- Burgess, R. S. Structured Program Design Using JSP. ELBS, 1988.Google Scholar
- Canfora, G., Cerulo, L., and Troiano, L. An experience of fuzzy linear regression applied to effort estimation. In Proc. 16th International Conference on Software Engineering & Knowledge Engineering, 2004, 57--61.Google Scholar
- Chen, P. P. The entity-relationship model - towards a unified view of data. ACM Trans. Database Syst. 1,1 ( Mar. 1976), 9--36. Google ScholarDigital Library
- COSMIC-Full Functions - Release 2.0. September 1999.Google Scholar
- Costagliola, G., Ferrucci, F., Tortora, G. and Vitiello, G. Class point: an approach for the size estimation of object-oriented systems. IEEE Trans. Software Eng., 31, 1(Jan, 2005), 52--74. Google ScholarDigital Library
- Freshmeat. http://freshmeat.net.Google Scholar
- Garmus, D., and Herron, D. Function Point Analysis: Measurement Practices for Successful Software Projects. Addison Wesley, 2000. Google ScholarDigital Library
- Ghezzi, C., Jazayeri, M. and Mandrioli, D. Fundamentals of Software Engineering. 2nd Edition, Prentice, 2003. Google ScholarDigital Library
- Jeffery, D. R., Low, G. C., and Barnes, M. A comparison of function point counting techniques. IEEE Trans. Software Eng., May, 1993, 529--532. Google ScholarDigital Library
- Jeffery, D. R., and Walkerden, F. An empirical study of analogy-based software effort estimation. Empirical Software Engineering, Kluwer Academic Publishers, 4, 2 (June 1999), 135--158. Google ScholarDigital Library
- Kennedy, P. A Guide to Econometrics. Blackwell Publishing, 5th Edition, 2003.Google Scholar
- Lai, R., and Huang, S. J. A model for estimating the size of a formal communication protocol application and its implementation. IEEE Trans. Software Eng., Jan, 2003, 46--62. Google ScholarDigital Library
- Laranjeira, L. A. Software size estimation of object-oriented systems. IEEE Trans. Software Eng., May, 1990, 510--522. Google ScholarDigital Library
- McClave, J. T., and Sincich, T. Statistics. 9th Ed, Prentice Hall, 2003.Google Scholar
- Miranda, E. An evaluation of the paired comparisons method for software sizing. In Proc. Int. Conf. On Software Eng., 2000, 597--604. Google ScholarDigital Library
- Neter, J., Kutner, M. H., Nachtsheim, C. J., and Wasserman, W. Applied Linear Regression Models, IRWIN, 1996.Google Scholar
- Ruthe, M., Jeffery, R., and Wieczorek, I. Cost estimation for web applications. In Proc. Int. Conf. On Software Eng., 2003, 285--294. Google ScholarDigital Library
- SAS/STAT User's Guide. http://www.id.unizh.ch/software/unix/statmath/sas/sasdoc/stat/.Google Scholar
- Smith, J. The estimation of effort based on use cases, Rational Software White Paper.1999.Google Scholar
- SourceForge.net. http://sourceforge.net/.Google Scholar
- Stensrud, E., Foss, T., Kitchenham, B., Myrtveit, I. An empirical validation of the relationship between the magnitude of relative error and project size. In Proc. IEEE Symp. Software Metrics, 2002, 3--12. Google ScholarDigital Library
- Tan, H. B. K., and Zhao, Y. ER-based software sizing for data-intensive systems. In Proc. Int. Conf. on Conceptual Modeling, 2004, 180--190.Google ScholarCross Ref
- Teorey, T. J., Yang, D., and Fry, J. P. A logical design methodology for relational databases using the extended entity-relationship model. ACM Computing Surveys, June, 1986, 197--222. Google ScholarDigital Library
Index Terms
- Estimating LOC for information systems from their conceptual data models
Recommendations
Conceptual data model-based software size estimation for information systems
Size estimation plays a key role in effort estimation that has a crucial impact on software projects in the software industry. Some information required by existing software sizing methods is difficult to predict in the early stage of software ...
Deriving initial data warehouse structures from the conceptual data models of the underlying operational information systems
DOLAP '99: Proceedings of the 2nd ACM international workshop on Data warehousing and OLAPIn recent years the construction of large scale data schemes for operational systems has been the major problem of conceptual data modeling for business needs. Multidimensional data structures used for decision support applications in data warehouses ...
A generic conceptual data model of social media services
EuroPLop '19: Proceedings of the 24th European Conference on Pattern Languages of ProgramsData modelling patterns provide a good guideline for creating a data model of a software or service. However, there is a lack of such a pattern respecting data protection requirements that an online service should satisfy. Especially for social media ...
Comments