Published in: Software Quality Journal 1/2010

1 March 2010

Empirical validation of object-oriented metrics for predicting fault proneness models

Authors: Yogesh Singh, Arvinder Kaur, Ruchika Malhotra

Abstract

Empirical validation of software metrics used to predict software quality attributes is important to ensure their practical relevance in software organizations. The aim of this work is to find the relation between object-oriented (OO) metrics and fault proneness at different severity levels of faults. For this purpose, different prediction models have been developed using regression and machine learning methods. We evaluate and compare the performance of these methods to find which method performs better at different severity levels of faults, and we empirically validate the OO metrics given by Chidamber and Kemerer. The results of the empirical study are based on a public domain NASA data set. The performance of the predicted models was evaluated using Receiver Operating Characteristic (ROC) analysis. The results show that the area under the curve (measured from the ROC analysis) of models predicted using high severity faults is low compared with the area under the curve of the models predicted with respect to medium and low severity faults. However, the number of faults in the classes correctly classified by the predicted models with respect to high severity faults is not low. This study also shows that machine learning methods perform better than the logistic regression method at all severity levels of faults. Based on the results, it is reasonable to claim that models targeted at different severity levels of faults could help in planning and executing testing by focusing resources on fault-prone parts of the design and code that are likely to cause serious failures.
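
As a rough illustration of the ROC-based evaluation mentioned in the abstract, the sketch below computes the area under the ROC curve from class labels and predicted fault-proneness scores. It is not the authors' implementation. The function name `roc_auc` and the toy labels and scores are hypothetical and do not come from the NASA data set.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen faulty class
    receives a higher score than a randomly chosen non-faulty
    class, with ties counted as one half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one faulty and one non-faulty class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = class contained a fault, 0 = no fault.
labels = [1, 0, 1, 0, 0, 1, 0, 0]
# Hypothetical model outputs (predicted probability of being fault-prone).
scores = [0.9, 0.2, 0.6, 0.4, 0.7, 0.8, 0.1, 0.3]

auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

An area of 0.5 corresponds to random ranking and 1.0 to perfect separation of faulty from non-faulty classes. Comparing this value across models built for high, medium, and low severity faults is the kind of comparison the abstract describes.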

References
Afzal, W. (2007). Metrics in software test planning and test design processes. Ph.D. Dissertation.
Aggarwal, K. K., Singh, Y., Kaur, A., & Malhotra, R. (2005). Software reuse metrics for object-oriented systems. In Proceedings of the Third ACIS Int’l Conference On Software Engineering Research, Management and Applications (SERA ‘05), 48–55.
Aggarwal, K. K., Singh, Y., Kaur, A., & Malhotra, R. (2006a). Empirical study of object-oriented metrics. Journal of Object Technology, 5(8), 149–173.
Aggarwal, K. K., Singh, Y., Kaur, A., & Malhotra, R. (2006b). Investigating the effect of coupling metrics on fault proneness in object-oriented systems. Software Quality Professional, 8(4), 4–16.
Aggarwal, K. K., Singh, Y., Kaur, A., & Malhotra, R. (2007). Application of artificial neural network for predicting fault proneness models. International conference on information systems, technology and management (ICISTM 2007), March 12–13, New Delhi, India.
Aggarwal, K. K., Singh, Y., Kaur, A., & Malhotra, R. (2009). Empirical analysis for investigating the effect of object-oriented metrics on fault proneness: A replicated case study. Software Process: Improvement and Practice, 16(1), 39–62. doi:10.1002/spip.389s.
Barnett, V., & Price, T. (1995). Outliers in statistical data. London: Wiley.
Basili, V., Briand, L., & Melo, W. (1996). A validation of object-oriented design metrics as quality indicators. IEEE Transactions on Software Engineering, 22(10), 751–761. doi:10.1109/32.544352.
Belsley, D., Kuh, E., & Welsch, R. (1980). Regression diagnostics: Identifying influential data and sources of collinearity. New York: Wiley.
Bieman, J., & Kang, B. (1995). Cohesion and reuse in an object-oriented system. In Proceedings of the ACM Symposium on Software Reusability (SSR’94), 259–262.
Binkley, A., & Schach, S. (1998). Validation of the coupling dependency metric as a risk predictor. In Proceedings of the International Conference on Software Engineering, 452–455.
Briand, L., Daly, W., & Wust, J. (1999). A unified framework for coupling measurement in object-oriented systems. IEEE Transactions on Software Engineering, 25(1), 91–121. doi:10.1109/32.748920.
Briand, L., Wüst, J., & Lounis, H. (2001). Replicated Case Studies for Investigating Quality Factors in Object-Oriented Designs, Empirical Software Engineering. International Journal (Toronto, Ont.), 6(1), 11–58.
Cartwright, M., & Shepperd, M. (1999). An empirical investigation of an object-oriented software system. IEEE Transactions on Software Engineering, 26(8), 786–796. doi:10.1109/32.879814.
Chidamber, S., Darcy, D., & Kemerer, C. (1998). Managerial use of metrics for object-oriented software: An exploratory analysis. IEEE Transactions on Software Engineering, 24(8), 629–639. doi:10.1109/32.707698.
Chidamber, S., & Kemerer, C. (1991). Towards a metrics suite for object oriented design. In Proceedings of the Conference on Object-Oriented Programming: Systems, Languages and Applications (OOPSLA’91). SIGPLAN Notices, 26(11), 197–211.
Duman, E. (2006). Comparison of decision tree algorithms in identifying bank customers who are likely to buy credit cards. Seventh international Baltic conference on databases and information systems, Kaunas, Lithuania, July 3–6, 2006.
Eftekhar, B., Mohammad, K., Ardebili, H., Ghodsi, M., & Ketabchi, E. (2005). Comparison of artificial neural network and logistic regression models for prediction of mortality in head trauma based on initial clinical data. BMC Medical Informatics and Decision Making, 5(3), 3. doi:10.1186/1472-6947-5-3.
El Emam, K., Benlarbi, S., Goel, N., & Rai, S. (1999). A validation of object-oriented metrics. Technical report ERB-1063, NRC.
El Emam, K., Benlarbi, S., Goel, N., & Rai, S. (2001). The confounding effect of class size on the validity of object-oriented metrics. IEEE Transactions on Software Engineering, 27(7), 630–650. doi:10.1109/32.935855.
Fenton, N., & Neil, M. (1999). A critique of software defect prediction models. IEEE Transactions on Software Engineering, 25(3), 1–15.
Gyimothy, T., Ferenc, R., & Siket, I. (2005). Empirical validation of object-oriented metrics on open source software for fault prediction. IEEE Transactions on Software Engineering, 31(10), 897–910. doi:10.1109/TSE.2005.112.
Hair, J., Anderson, R., & Tatham, W. (2006). Black multivariate data analysis. London: Pearson Education.
Han, J., & Kamber, M. (2001). Data mining: Concepts and techniques. India: Harcourt India Private Limited.
Hanley, J., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic ROC curve. Radiology, 143, 29–36.
Harrison, R., Counsell, S. J., & Nithi, R. V. (1998). An evaluation of MOOD set of object-oriented software metrics. IEEE Transactions on Software Engineering, 24(6), 491–496. doi:10.1109/32.689404.
Henderson-Sellers, B. (1996). Object-oriented metrics, measures of complexity. Englewood Cliffs, NJ: Prentice Hall.
Hitz, M., & Montazeri, B. (1995). Measuring coupling and cohesion in object-oriented systems. In Proceedings of the International Symposium on Applied Corporate Computing, Monterrey, Mexico.
Horch, J. (2003). Practical guide to software quality management (2nd ed.). London: Artech House.
Hosmer, D., & Lemeshow, S. (1989). Applied logistic regression. New York: Wiley.
IEEE Std. 1044-1993. (1994). IEEE standard classification for software anomalies.
Khoshgoftaar, T. M., Allen, E. D., Hudepohl, J. P., & Aud, S. J. (1997). Application of neural networks to software quality modeling of a very large telecommunications system. IEEE Transactions on Neural Networks, 8(4), 902–909. doi:10.1109/72.595888.
Khoshgoftaar, T., Geleyn, E., Nguyen, L., & Bullard, L. (2002). Cost-sensitive boosting in software quality modeling. In Proceedings of 7th IEEE International Symposium on High Assurance Systems Engineering, 51–60.
Laird, L., & Brennan, M. (2006). Software measurement and estimation: A practical approach. NJ: Wiley.
Lake, A., & Cook, C. (1994). Use of factor analysis to develop OOP software complexity metrics. In Proceedings of the 6th Annual Oregon Workshop on Software Metrics, Silver Falls, Oregon.
Lee, Y., Liang, B., Wu, S., & Wang, F. (1995). Measuring the coupling and cohesion of an object-oriented program based on information flow. In Proceedings of the International Conference on Software Quality, Maribor, Slovenia.
Lorenz, M., & Kidd, J. (1994). Object-oriented software metrics. Englewood Cliffs, NJ: Prentice-Hall.
Lovin, C., & Yaptangco, T. (2006). Best practices: Measuring the success of enterprise testing. Dell Power Solutions. pp. 101–103.
Menzies, T., Greenwald, J., & Frank, A. (2007). Data mining static code attributes to learn defect predictors. IEEE Transactions on Software Engineering, 32(11), 1–12.
Olague, H., Etzkorn, L., Gholston, S., & Quattlebaum, S. (2007). Empirical validation of three software metrics suites to predict fault-proneness of object-oriented classes developed using highly iterative or agile software development processes. IEEE Transactions on Software Engineering, 33(8), 402–419. doi:10.1109/TSE.2007.1015.
Phadke, A., & Allen, E. (2005). Predicting risky modules in open-source software for high-performance computing. In Proceedings of Second International Workshop on Software Engineering for High Performance Computing System Applications, 60–64.
Porter, A., & Selby, R. (1990). Empirically guided software development using metric-based classification trees. IEEE Software, 7(2), 46–54. doi:10.1109/52.50773.
Stone, M. (1974). Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society. Series A (General), 36, 111–147.
Tang, M. H., Kao, M. H., & Chen, M. H. (1999). An empirical study on object-oriented metrics. In Proceedings of Metrics, 242–249.
Tian, J. (2005). Software quality engineering: Testing, quality assurance, and quantifiable improvement. NJ: Wiley.
Yu, P., Systa, T., & Muller, H. (2002). Predicting fault-proneness using OO metrics: An industrial case study. In Proceedings of Sixth European Conference on Software Maintenance and Reengineering, Budapest, Hungary, 99–107.
Zhou, Y., & Leung, H. (2006). Empirical analysis of object-oriented design metrics for predicting high severity faults. IEEE Transactions on Software Engineering, 32(10), 771–784. doi:10.1109/TSE.2006.102.
Metadata
Title
Empirical validation of object-oriented metrics for predicting fault proneness models
Authors
Yogesh Singh
Arvinder Kaur
Ruchika Malhotra
Publication date
1 March 2010
Publisher
Springer US
Published in
Software Quality Journal, Issue 1/2010
Print ISSN: 0963-9314
Electronic ISSN: 1573-1367
DOI
https://doi.org/10.1007/s11219-009-9079-6
