Skip to main content

2018 | OriginalPaper | Buchkapitel

An Empirical Analysis for Predicting Source Code File Reusability Using Meta-Classification Algorithms

verfasst von : Loveleen Kaur, Ashutosh Mishra

Erschienen in: Advanced Computational and Communication Paradigms

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Although various quantifiers of software component reusability have been proposed, these metrics have seldom been utilized in existing literature to analyze source code file reusability transpiring within a single product family. Such metrics can be effortlessly employed to develop reuse prediction models which can support the software practitioners in obtaining information regarding the total cost involved in developing a novel version of a prevailing software or upgrading an existing software version by estimating the total reusable code files in advance without being compelled to scrutinize the complete codebase. In view of this, this research work aims to examine the efficacy of seven meta-classification techniques in the development of such reuse prediction models on four software datasets constructed from four successively released versions of software using appropriate reuse metrics. We also evaluate the predictive performance of these meta-classifiers against the statistical technique of logistic regression and rank these techniques using the Friedman statistical test.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
4
The ROC curve is a plot of sensitivity (on the y-axis) and 1-specificity (on the x-axis). Several cutoff points between 0 and 1 are chosen during the creation of a ROC curve. The sensitivity of the model is calibrated as the percentage of the reused files that were predicted accurately. The specificity of the model is calibrated as the percentage of the non-reused files that were predicted accurately. High values are desired for both sensitivity and specificity.
 
5
The results indicated that stacking achieved a sensitivity value of 0 and specificity value of 1 for all the four selected versions, thus indicating that it is not able to classify the reused classes at all and predicts all the classes included in the dataset to be “not reused”.
 
Literatur
1.
Zurück zum Zitat Mojica, I.J., Adams, B., Nagappan, M., Dienst, S., Berger, T., Hassan, A.E.: A large-scale empirical study on software reuse in mobile apps. IEEE Softw. 31(2), 78–86 (2014)CrossRef Mojica, I.J., Adams, B., Nagappan, M., Dienst, S., Berger, T., Hassan, A.E.: A large-scale empirical study on software reuse in mobile apps. IEEE Softw. 31(2), 78–86 (2014)CrossRef
2.
Zurück zum Zitat Mockus, A.: Large-scale code reuse in open source software. In: First International Workshop on Emerging Trends in FLOSS Research and Development, pp. 1–7. IEEE (2007) Mockus, A.: Large-scale code reuse in open source software. In: First International Workshop on Emerging Trends in FLOSS Research and Development, pp. 1–7. IEEE (2007)
3.
Zurück zum Zitat Washizaki, H., Koike, T., Namiki, R., Tanabe, H.: Reusability metrics for program source code written in C language and their evaluation. Product-Focused Software Process Improvement. Springer, 89–103 (2012)CrossRef Washizaki, H., Koike, T., Namiki, R., Tanabe, H.: Reusability metrics for program source code written in C language and their evaluation. Product-Focused Software Process Improvement. Springer, 89–103 (2012)CrossRef
4.
Zurück zum Zitat Witten, I.H., Frank, E., Hall, M.A., Pal, C.J.: Data Mining: Practical machine learning tools and techniques. 4th edn. Morgan Kaufmann (2016) Witten, I.H., Frank, E., Hall, M.A., Pal, C.J.: Data Mining: Practical machine learning tools and techniques. 4th edn. Morgan Kaufmann (2016)
5.
Zurück zum Zitat Kotsiantis, S.B., Zaharakis, I., Pintelas, P.: Supervised machine learning: a review of classification techniques. Informatica 31, 249–268 (2007)MathSciNetMATH Kotsiantis, S.B., Zaharakis, I., Pintelas, P.: Supervised machine learning: a review of classification techniques. Informatica 31, 249–268 (2007)MathSciNetMATH
6.
Zurück zum Zitat Peng, C.Y.J., Lee, K.L., Ingersoll, G.M.: An introduction to logistic regression analysis and reporting. J. Education. Res. 96(1), 3–14 (2002)CrossRef Peng, C.Y.J., Lee, K.L., Ingersoll, G.M.: An introduction to logistic regression analysis and reporting. J. Education. Res. 96(1), 3–14 (2002)CrossRef
7.
Zurück zum Zitat Hall, T., Beecham, S., Bowes, D., Gray, D., Counsell, S.: A systematic literature review on fault prediction performance in software engineering. IEEE Trans. Softw. Eng. 38(6), 1276–1304 (2012)CrossRef Hall, T., Beecham, S., Bowes, D., Gray, D., Counsell, S.: A systematic literature review on fault prediction performance in software engineering. IEEE Trans. Softw. Eng. 38(6), 1276–1304 (2012)CrossRef
8.
Zurück zum Zitat Moser, R., Pedrycz, W., Succi, G.: A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction. In: Proceedings of the 30th International conference on Software engineering. pp. 181–190. ACM (2008) Moser, R., Pedrycz, W., Succi, G.: A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction. In: Proceedings of the 30th International conference on Software engineering. pp. 181–190. ACM (2008)
9.
Zurück zum Zitat Mijač, M., Stapić, Z.: Reusability metrics of software components: survey. In: Proceedings of the 26th Central European Conference on Information and Intelligent Systems. pp. 221–231 (2015) Mijač, M., Stapić, Z.: Reusability metrics of software components: survey. In: Proceedings of the 26th Central European Conference on Information and Intelligent Systems. pp. 221–231 (2015)
10.
Zurück zum Zitat Hudaib, A., Huneiti, A., Othman, I.: Software Reusability classification and predication using self-organizing map (SOM). Commun. Netw. 8, 179–192 (2016)CrossRef Hudaib, A., Huneiti, A., Othman, I.: Software Reusability classification and predication using self-organizing map (SOM). Commun. Netw. 8, 179–192 (2016)CrossRef
11.
Zurück zum Zitat Padhy, N., Satapathy, S., Singh, R.P.: Utility of an Object Oriented Reusability Metrics and Estimation Complexity. Indian J. Sci. Technol. 8(1), 1–9 (2017) Padhy, N., Satapathy, S., Singh, R.P.: Utility of an Object Oriented Reusability Metrics and Estimation Complexity. Indian J. Sci. Technol. 8(1), 1–9 (2017)
12.
Zurück zum Zitat Devine, T., Goseva-Popstojanova, K., Krishnan, S., Lutz, R.R.: Assessment and cross-product prediction of software product line quality: accounting for reuse across products, over multiple releases. Automat. Softw. Eng. 23(2), 253–302 (2016)CrossRef Devine, T., Goseva-Popstojanova, K., Krishnan, S., Lutz, R.R.: Assessment and cross-product prediction of software product line quality: accounting for reuse across products, over multiple releases. Automat. Softw. Eng. 23(2), 253–302 (2016)CrossRef
13.
Zurück zum Zitat Manhas, S., Sandhu, P.S., Chopra, V., Neeru, N.: Identification of reusable software modules in function oriented software systems using neural network based technique. World Acad. Sci. Eng. Technol. 43, 823–827 (2010) Manhas, S., Sandhu, P.S., Chopra, V., Neeru, N.: Identification of reusable software modules in function oriented software systems using neural network based technique. World Acad. Sci. Eng. Technol. 43, 823–827 (2010)
14.
Zurück zum Zitat Sharma, A., Grover, P.S., Kumar, R.: Reusability assessment for software components. ACM SIGSOFT Softw. Eng. Notes 34(2), 1–6 (2009)CrossRef Sharma, A., Grover, P.S., Kumar, R.: Reusability assessment for software components. ACM SIGSOFT Softw. Eng. Notes 34(2), 1–6 (2009)CrossRef
15.
Zurück zum Zitat Shri, A., Sandhu, P.S., Gupta, V., Anand, S.: Prediction of reusability of object oriented software systems using clustering approach. World Acad. Sci. Eng. Technol. 43, 853–856 (2010) Shri, A., Sandhu, P.S., Gupta, V., Anand, S.: Prediction of reusability of object oriented software systems using clustering approach. World Acad. Sci. Eng. Technol. 43, 853–856 (2010)
16.
Zurück zum Zitat Chidamber, S.R., Kemerer, C.F.: A metrics suite for object oriented design. IEEE Trans. Softw. Eng. 20(6), 476–493 (1994)CrossRef Chidamber, S.R., Kemerer, C.F.: A metrics suite for object oriented design. IEEE Trans. Softw. Eng. 20(6), 476–493 (1994)CrossRef
17.
Zurück zum Zitat Martin, R.C.: Agile Software Development: Principles, Patterns, and Practices. Prentice Hall (2002) Martin, R.C.: Agile Software Development: Principles, Patterns, and Practices. Prentice Hall (2002)
18.
Zurück zum Zitat Lorenz, M., Kidd, J.: Object-oriented software metrics: a practical guide. Prentice-Hall, Inc. (1994) Lorenz, M., Kidd, J.: Object-oriented software metrics: a practical guide. Prentice-Hall, Inc. (1994)
22.
Zurück zum Zitat Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. Ijcai 14(2), 1137–1145 (1995) Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. Ijcai 14(2), 1137–1145 (1995)
23.
Zurück zum Zitat Gibbons, J.D., Chakraborti, S.: Nonparametric Statistical Inference. International Encyclopedia of Statistical Science. Springer, Berlin Heidelberg (2011)CrossRef Gibbons, J.D., Chakraborti, S.: Nonparametric Statistical Inference. International Encyclopedia of Statistical Science. Springer, Berlin Heidelberg (2011)CrossRef
24.
Zurück zum Zitat Buchgeher, G., Weinreich, R.: Integrated software architecture management and validation. In: Proceedings of the 3rd International Conference on Software Engineering Advances. pp. 427–436. IEEE (2008) Buchgeher, G., Weinreich, R.: Integrated software architecture management and validation. In: Proceedings of the 3rd International Conference on Software Engineering Advances. pp. 427–436. IEEE (2008)
25.
Zurück zum Zitat Scandariato, R., Walden, J.: Predicting vulnerable classes in an Android application. In: Proceedings of the 4th international workshop on Security measurements and metrics. pp. 11–16. ACM (2012) Scandariato, R., Walden, J.: Predicting vulnerable classes in an Android application. In: Proceedings of the 4th international workshop on Security measurements and metrics. pp. 11–16. ACM (2012)
Metadaten
Titel
An Empirical Analysis for Predicting Source Code File Reusability Using Meta-Classification Algorithms
verfasst von
Loveleen Kaur
Ashutosh Mishra
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8237-5_48

Neuer Inhalt