Skip to main content

2021 | OriginalPaper | Buchkapitel

About the Quality of Data and Services in Natural Sciences

verfasst von : Barbara Pernici, Francesca Ratti, Gabriele Scalia

Erschienen in: Next-Gen Digital Services. A Retrospective and Roadmap for Service Computing of the Future

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Managing data related to natural sciences poses new and challenging problems as it is impossible to represent reality on a one-to-one scale, and imprecision has to be taken into account, both in data memorization and in its processing. Machine learning has been a key enabler in the context of information extraction from natural sciences data. However, data-driven results are strongly affected by the volume, the sparsity and different types of imprecision in the available sources. Therefore, it becomes pivotal to associate both to data and to data-driven services information about their quality, in order to effectively interpret the results. Different levels of granularity and multiple data modalities captured from the same processes could coexist, due to technological constraints or other intrinsic limiting factors. In addition, different levels of granularity might be also the result of application requirements, and outcomes at multiple levels of precision needs to be provided. Affinities of quality issues in domains such as chemistry, biology, and geoinformatics are discussed in the paper.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
6.
Zurück zum Zitat Autelitano, A., Pernici, B., Scalia, G.: Spatio-temporal mining of keywords for social media cross-social crawling of emergency events. Geoinformatica 23(3), 425–447 (2019)CrossRef Autelitano, A., Pernici, B., Scalia, G.: Spatio-temporal mining of keywords for social media cross-social crawling of emergency events. Geoinformatica 23(3), 425–447 (2019)CrossRef
8.
Zurück zum Zitat Bertossi, L., Geerts, F.: Data quality and explainable AI. J. Data Inf. Qual. (JDIQ) 12(2), 1–9 (2020)CrossRef Bertossi, L., Geerts, F.: Data quality and explainable AI. J. Data Inf. Qual. (JDIQ) 12(2), 1–9 (2020)CrossRef
10.
Zurück zum Zitat Breck, E., Polyzotis, N., Roy, S., Whang, S., Zinkevich, M.: Data validation for machine learning. In: Talwalkar, A., Smith, V., Zaharia, M. (eds.) Proceedings of Machine Learning and Systems 2019, MLSys 2019, Stanford, CA, USA, 31 March–2 April 2019 (2019). https://proceedings.mlsys.org/book/267.pdf. mlsys.org Breck, E., Polyzotis, N., Roy, S., Whang, S., Zinkevich, M.: Data validation for machine learning. In: Talwalkar, A., Smith, V., Zaharia, M. (eds.) Proceedings of Machine Learning and Systems 2019, MLSys 2019, Stanford, CA, USA, 31 March–2 April 2019 (2019). https://​proceedings.​mlsys.​org/​book/​267.​pdf. mlsys.org
11.
15.
Zurück zum Zitat Ching, T., et al.: Opportunities and obstacles for deep learning in biology and medicine. J. Roy. Soc. Interface 15(141), 20170387 (2018)CrossRef Ching, T., et al.: Opportunities and obstacles for deep learning in biology and medicine. J. Roy. Soc. Interface 15(141), 20170387 (2018)CrossRef
16.
Zurück zum Zitat Consortiu, H., et al.: The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature 574(7777), 187 (2019)CrossRef Consortiu, H., et al.: The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature 574(7777), 187 (2019)CrossRef
18.
Zurück zum Zitat Fox, C.R., Ülkümen, G.: Distinguishing Two Dimensions of Uncertainty, vol. 14, chap. 1. Universitetsforlaget Oslo (2011) Fox, C.R., Ülkümen, G.: Distinguishing Two Dimensions of Uncertainty, vol. 14, chap. 1. Universitetsforlaget Oslo (2011)
19.
Zurück zum Zitat Gala, R., et al.: A coupled autoencoder approach for multi-modal analysis of cell types. In: Advances in Neural Information Processing Systems, pp. 9267–9276 (2019) Gala, R., et al.: A coupled autoencoder approach for multi-modal analysis of cell types. In: Advances in Neural Information Processing Systems, pp. 9267–9276 (2019)
20.
Zurück zum Zitat Gilpin, L.H., Bau, D., Yuan, B.Z., Bajwa, A., Specter, M., Kagal, L.: Explaining explanations: an overview of interpretability of machine learning. In: 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA), pp. 80–89. IEEE (2018) Gilpin, L.H., Bau, D., Yuan, B.Z., Bajwa, A., Specter, M., Kagal, L.: Explaining explanations: an overview of interpretability of machine learning. In: 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA), pp. 80–89. IEEE (2018)
22.
Zurück zum Zitat Grambow, C.A., Li, Y.P., Green, W.H.: Accurate thermochemistry with small data sets: a bond additivity correction and transfer learning approach. J. Phys. Chem. A 123(27), 5826–5835 (2019)CrossRef Grambow, C.A., Li, Y.P., Green, W.H.: Accurate thermochemistry with small data sets: a bond additivity correction and transfer learning approach. J. Phys. Chem. A 123(27), 5826–5835 (2019)CrossRef
23.
Zurück zum Zitat Gu, Z., de Schipper, N.C., Van Deun, K.: Variable selection in the regularized simultaneous component analysis method for multi-source data integration. Scientific Rep. 9(1), 1–21 (2019) Gu, Z., de Schipper, N.C., Van Deun, K.: Variable selection in the regularized simultaneous component analysis method for multi-source data integration. Scientific Rep. 9(1), 1–21 (2019)
24.
Zurück zum Zitat Hansen, N., He, X., Griggs, R., Moshammer, K.: Knowledge generation through data research: new validation targets for the refinement of kinetic mechanisms. In: Proceedings of the Combustion Institute (2018) Hansen, N., He, X., Griggs, R., Moshammer, K.: Knowledge generation through data research: new validation targets for the refinement of kinetic mechanisms. In: Proceedings of the Combustion Institute (2018)
25.
Zurück zum Zitat Havas, C., et al.: E2mC: improving emergency management service practice through social media and crowdsourcing analysis in near real time. Sensors 17(12), 2766 (2017)CrossRef Havas, C., et al.: E2mC: improving emergency management service practice through social media and crowdsourcing analysis in near real time. Sensors 17(12), 2766 (2017)CrossRef
31.
Zurück zum Zitat Li, Y.P., Han, K., Grambow, C.A., Green, W.H.: Self-evolving machine: a continuously improving model for molecular thermochemistry. J. Phys. Chem. A 123(10), 2142–2152 (2019)CrossRef Li, Y.P., Han, K., Grambow, C.A., Green, W.H.: Self-evolving machine: a continuously improving model for molecular thermochemistry. J. Phys. Chem. A 123(10), 2142–2152 (2019)CrossRef
32.
Zurück zum Zitat Metzger, A., Pohl, K., Papazoglou, M.P., Di Nitto, E., Marconi, A., Karastoyanova, D.: Research challenges on adaptive software and services in the future internet: towards an S-cube research roadmap. In: Metzger, A., Pohl, K., Papazoglou, M.P. (eds.) First International Workshop on European Software Services and Systems Research - Results and Challenges, S-Cube 2012, Zurich, Switzerland, 5 June 2012, pp. 1–7. IEEE (2012). https://doi.org/10.1109/S-Cube.2012.6225501 Metzger, A., Pohl, K., Papazoglou, M.P., Di Nitto, E., Marconi, A., Karastoyanova, D.: Research challenges on adaptive software and services in the future internet: towards an S-cube research roadmap. In: Metzger, A., Pohl, K., Papazoglou, M.P. (eds.) First International Workshop on European Software Services and Systems Research - Results and Challenges, S-Cube 2012, Zurich, Switzerland, 5 June 2012, pp. 1–7. IEEE (2012). https://​doi.​org/​10.​1109/​S-Cube.​2012.​6225501
38.
Zurück zum Zitat Scalia, G., Pelucchi, M., Stagni, A., Cuoci, A., Faravelli, T., Pernici, B.: Evaluating scalable uncertainty estimation methods for deep learning-based molecular property prediction. Data Sci. 2(1–2), 245–273 (2019)CrossRef Scalia, G., Pelucchi, M., Stagni, A., Cuoci, A., Faravelli, T., Pernici, B.: Evaluating scalable uncertainty estimation methods for deep learning-based molecular property prediction. Data Sci. 2(1–2), 245–273 (2019)CrossRef
39.
Zurück zum Zitat Squires, S., Ewing, R., Prügel-Bennett, A., Niranjan, M.: A method of integrating spatial proteomics and protein-protein interaction network data. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, E.-S.M. (eds.) ICONIP 2017, Part V. LNCS, vol. 10638, pp. 782–790. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-70139-4_79CrossRef Squires, S., Ewing, R., Prügel-Bennett, A., Niranjan, M.: A method of integrating spatial proteomics and protein-protein interaction network data. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, E.-S.M. (eds.) ICONIP 2017, Part V. LNCS, vol. 10638, pp. 782–790. Springer, Cham (2017). https://​doi.​org/​10.​1007/​978-3-319-70139-4_​79CrossRef
42.
Zurück zum Zitat Wang, J., et al.: Data denoising with transfer learning in single-cell transcriptomics. Nat. Methods 16(9), 875–878 (2019)CrossRef Wang, J., et al.: Data denoising with transfer learning in single-cell transcriptomics. Nat. Methods 16(9), 875–878 (2019)CrossRef
Metadaten
Titel
About the Quality of Data and Services in Natural Sciences
verfasst von
Barbara Pernici
Francesca Ratti
Gabriele Scalia
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-73203-5_18