Skip to main content
Top

2021 | OriginalPaper | Chapter

Software Quality for AI: Where We Are Now?

Authors : Valentina Lenarduzzi, Francesco Lomio, Sergio Moreschini, Davide Taibi, Damian Andrew Tamburri

Published in: Software Quality: Future Perspectives on Software Engineering Quality

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Artificial Intelligence is getting more and more popular, being adopted in a large number of applications and technology we use on a daily basis. However, a large number of Artificial Intelligence applications are produced by developers without proper training on software quality practices or processes, and in general, lack in-depth knowledge regarding software engineering processes. The main reason is due to the fact that the machine-learning engineer profession has been born very recently, and currently there is a very limited number of training or guidelines on issues (such as code quality or testing) for machine learning and applications using machine learning code. In this work, we aim at highlighting the main software quality issues of Artificial Intelligence systems, with a central focus on machine learning code, based on the experience of our four research groups. Moreover, we aim at defining a shared research road map, that we would like to discuss and to follow in collaboration with the workshop participants. As a result, the software quality of AI-enabled systems is often poorly tested and of very low quality.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
2
 
3
The Networked European Software and Services Initiative - NESSI http://​www.​nessi-europe.​com.
 
Literature
6.
go back to reference Avgeriou, P., Kruchten, P., Ozkaya, I., Seaman, C.: Managing technical debt in software engineering (Dagstuhl seminar 16162). Dagstuhl Reports 6 (2016) Avgeriou, P., Kruchten, P., Ozkaya, I., Seaman, C.: Managing technical debt in software engineering (Dagstuhl seminar 16162). Dagstuhl Reports 6 (2016)
7.
go back to reference Avgeriou, P., et al.: An overview and comparison of technical debt measurement tools. IEEE Softw. (2021) Avgeriou, P., et al.: An overview and comparison of technical debt measurement tools. IEEE Softw. (2021)
8.
go back to reference Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn. 30(7), 1145–1159 (1997)CrossRef Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn. 30(7), 1145–1159 (1997)CrossRef
10.
go back to reference Chen, T.Y.: Metamorphic testing: a simple method for alleviating the test oracle problem. In: Proceedings of the 10th International Workshop on Automation of Software Test, AST 2015, pp. 53–54. IEEE Press (2015) Chen, T.Y.: Metamorphic testing: a simple method for alleviating the test oracle problem. In: Proceedings of the 10th International Workshop on Automation of Software Test, AST 2015, pp. 53–54. IEEE Press (2015)
11.
go back to reference Cohen, G., Afshar, S., Tapson, J., Van Schaik, A.: EMNIST: extending MNIST to handwritten letters. In: 2017 International Joint Conference on Neural Networks (IJCNN), pp. 2921–2926. IEEE (2017) Cohen, G., Afshar, S., Tapson, J., Van Schaik, A.: EMNIST: extending MNIST to handwritten letters. In: 2017 International Joint Conference on Neural Networks (IJCNN), pp. 2921–2926. IEEE (2017)
13.
go back to reference Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009) Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
15.
go back to reference Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, IJCAI 1995, vol. 2, pp. 1137–1143. Morgan Kaufmann Publishers Inc., San Francisco (1995) Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, IJCAI 1995, vol. 2, pp. 1137–1143. Morgan Kaufmann Publishers Inc., San Francisco (1995)
16.
go back to reference Larus, J., et al.: When computers decide: European recommendations on machine-learned automated decision making (2018) Larus, J., et al.: When computers decide: European recommendations on machine-learned automated decision making (2018)
17.
go back to reference LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef
19.
22.
go back to reference Murphy, C., Kaiser, G.E., Arias, M.: A framework for quality assurance of machine learning applications. Columbia University Computer Science Technical reports, CUCS-034-06 (2006) Murphy, C., Kaiser, G.E., Arias, M.: A framework for quality assurance of machine learning applications. Columbia University Computer Science Technical reports, CUCS-034-06 (2006)
26.
go back to reference Rushby, J.: Quality measures and assurance for AI (artificial intelligence) software. Technical report (1988) Rushby, J.: Quality measures and assurance for AI (artificial intelligence) software. Technical report (1988)
27.
go back to reference Russell, S.J., Norvig, P.: Artificial Intelligence - A Modern Approach: The Intelligent Agent Book. Prentice Hall Series in Artificial Intelligence. Prentice Hall, Upper Saddle River (1995)MATH Russell, S.J., Norvig, P.: Artificial Intelligence - A Modern Approach: The Intelligent Agent Book. Prentice Hall Series in Artificial Intelligence. Prentice Hall, Upper Saddle River (1995)MATH
28.
go back to reference Sculley, D., et al.: Hidden technical debt in machine learning systems. In: Advances in Neural Information Processing Systems, pp. 2503–2511 (2015) Sculley, D., et al.: Hidden technical debt in machine learning systems. In: Advances in Neural Information Processing Systems, pp. 2503–2511 (2015)
29.
go back to reference Wang, J., Li, L., Zeller, A.: Better code, better sharing: on the need of analyzing Jupyter notebooks (2019) Wang, J., Li, L., Zeller, A.: Better code, better sharing: on the need of analyzing Jupyter notebooks (2019)
30.
go back to reference Zhang, J.M., Harman, M., Ma, L., Liu, Y.: Machine learning testing: survey, landscapes and horizons. IEEE Trans. Softw. Eng. PP, 1 (2020) Zhang, J.M., Harman, M., Ma, L., Liu, Y.: Machine learning testing: survey, landscapes and horizons. IEEE Trans. Softw. Eng. PP, 1 (2020)
31.
go back to reference Ören, T.I.: Quality assurance paradigms for artificial intelligence in modelling and simulation. Simulation 48(4), 149–151 (1987)CrossRef Ören, T.I.: Quality assurance paradigms for artificial intelligence in modelling and simulation. Simulation 48(4), 149–151 (1987)CrossRef
Metadata
Title
Software Quality for AI: Where We Are Now?
Authors
Valentina Lenarduzzi
Francesco Lomio
Sergio Moreschini
Davide Taibi
Damian Andrew Tamburri
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-65854-0_4

Premium Partner