Published in: The Journal of Supercomputing 4/2017

12 September 2016

Constructing data supply chain based on layered PROV

Authors: Peng Li, Tin-Yu Wu, Xin-Ming Li, Hong Luo, Mohammad S. Obaidat

Abstract

The inability to construct data supply chains effectively in distributed environments is becoming one of the top concerns in the big data area. To address this problem, a novel method for constructing a data supply chain based on layered PROV is proposed. First, to describe abstractly the data transfer processes from creation to distribution, the W3C data provenance specification (PROV) is used to standardize the records of data activities within and across data platforms. Then, a distributed PROV data generation algorithm for multiple platforms is designed. Further, we propose tiered storage management of provenance based on summarization technology, which reduces the number of provenance records by compressing intermediate versions so as to realize multi-level management of PROV. In particular, we propose a hierarchical visualization technique based on a layered query mechanism, which allows users to view the data supply chain from the general to the detailed. The experimental results show that the proposed approach can effectively improve the construction performance for data supply chains.
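
The abstract does not spell out how intermediate versions are compressed, so the following is only a minimal sketch of one plausible reading, not the authors' algorithm. It assumes provenance is stored as PROV-style derivation records (in the spirit of wasDerivedFrom), that each entity has at most one source, and that an "intermediate version" is an entity with exactly one source and exactly one derived successor. The names Derivation and summarize_chain are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Derivation:
    """One PROV-style derivation record: `generated` was derived from `used`."""
    generated: str
    used: str


def summarize_chain(records):
    """Collapse intermediate versions in linear derivation chains.

    Assumption (not from the paper): each entity has at most one source,
    and the derivation graph is acyclic.
    """
    # Map each entity to the entity it was derived from.
    parent = {r.generated: r.used for r in records}

    # Count how many entities were derived from each entity.
    child_count = {}
    for r in records:
        child_count[r.used] = child_count.get(r.used, 0) + 1

    def is_mid(entity):
        # An intermediate version has a source and exactly one successor.
        return entity in parent and child_count.get(entity, 0) == 1

    summary = []
    for r in records:
        if is_mid(r.generated):
            # This record ends at an intermediate version; it is absorbed.
            continue
        # Walk back past intermediate versions to the nearest kept ancestor.
        source = r.used
        while is_mid(source):
            source = parent[source]
        summary.append(Derivation(r.generated, source))
    return summary


if __name__ == "__main__":
    chain = [
        Derivation("v1", "v0"),
        Derivation("v2", "v1"),
        Derivation("v3", "v2"),
    ]
    # The linear chain v0 -> v1 -> v2 -> v3 shrinks to a single record.
    print(summarize_chain(chain))
```

In this sketch the summarized records would form the coarse upper layer, while the original records would remain available as the detailed lower layer for drill-down queries. Entities at a branch point (more than one successor) are deliberately kept, so the shape of the supply chain is preserved.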


Metadata
Title
Constructing data supply chain based on layered PROV
Authors
Peng Li
Tin-Yu Wu
Xin-Ming Li
Hong Luo
Mohammad S. Obaidat
Publication date
12 September 2016
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 4/2017
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-016-1838-0