
2020 | OriginalPaper | Book chapter

OMProv: Provenance Mechanism for Objects in Deep Learning


Abstract

Deep learning is now widely used in industry and academia. Deep learning workflows involve several kinds of objects, including algorithms, models, and labeled datasets. How effectively these objects are organized, and how well the relationships among them are understood, determines the efficiency of development and production. This paper proposes OMProv, a provenance mechanism that records the lineage within each kind of object and the relationships among different kinds of objects in the same execution. It introduces a version graph abstraction based on a weighted directed acyclic graph, together with a version inference algorithm; both are deliberately designed to fit the characteristics of deep learning scenarios. OMProv has been implemented in OMAI, an all-in-one deep learning platform for the cloud. It helps users organize objects effectively and intuitively, and efficiently find the root causes of changes in job results such as performance or accuracy. OMProv can also simplify the management of deep learning lifecycles and related data assets.
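To make the idea of a version graph built on a weighted directed acyclic graph more concrete, the following is a minimal sketch and not the OMProv implementation. The class name `VersionGraph`, its methods, the version labels, and the interpretation of an edge weight as the strength of a derivation are all assumptions made for illustration; the abstract does not specify what the weights mean or how the version inference algorithm works.

```python
from collections import defaultdict


class VersionGraph:
    """Hypothetical weighted DAG of object versions (not OMProv's actual design)."""

    def __init__(self):
        # Maps each child version to {parent version: edge weight}.
        self._parents = defaultdict(dict)

    def derive(self, parent, child, weight=1.0):
        """Record that `child` was derived from `parent`.

        The weight is an assumed measure of how strongly the child
        depends on the parent. Edges that would create a cycle are rejected.
        """
        if parent == child or child in self.ancestors(parent):
            raise ValueError("edge would create a cycle")
        self._parents[child][parent] = weight

    def ancestors(self, version):
        """Return every version reachable by following parent edges."""
        seen = set()
        stack = [version]
        while stack:
            node = stack.pop()
            for parent in self._parents.get(node, {}):
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        return seen

    def strongest_parent(self, version):
        """Return the parent with the highest edge weight, or None for a root."""
        parents = self._parents.get(version)
        if not parents:
            return None
        return max(parents, key=parents.get)


# Example lineage for a model object; labels and weights are made up.
graph = VersionGraph()
graph.derive("model-v1", "model-v2", weight=0.9)
graph.derive("model-v2", "model-v3", weight=0.6)
graph.derive("model-v1", "model-v3", weight=0.3)
print(sorted(graph.ancestors("model-v3")))
print(graph.strongest_parent("model-v3"))
```

Storing parent edges per child keeps lineage queries (walking back from a version to its origins) simple, which is the kind of lookup a root-cause investigation of a changed job result would need. A real system would also link versions of different object kinds that took part in the same execution, which this sketch omits.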


Metadata
Title
OMProv: Provenance Mechanism for Objects in Deep Learning
Written by
Jian Lin
Dongming Xie
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-55814-7_8
