ABSTRACT
As digital objects become increasingly important in people's lives, people may need to understand the provenance, or lineage and history, of an important digital object, to understand how it was produced. This is particularly important for objects created from large, multi-source collections of personal data. As the metadata describing provenance, Provenance Data, is commonly represented as a labelled directed acyclic graph, the challenge is to create effective interfaces onto such graphs so that people can understand the provenance of key digital objects. This unsolved problem is especially challenging for the case of novice and intermittent users and complex provenance graphs. We tackle this by creating an interface based on a clustering approach. This was designed to enable users to view provenance graphs, and to simplify complex graphs by combining several nodes. Our core contribution is the design of a prototype interface that supports clustering and its analytic evaluation in terms of desirable properties of visualisation interfaces.
- J. Abello, F. Van Ham, and N. Krishnan. ASK-GraphView: A large scale graph visualization system. In IEEE Transactions on Visualization and Computer Graphics, volume 12, 669--676, 2006. Google ScholarDigital Library
- N. Balakrishnan, T. Bytheway, R. Sohan, and A. Hopper. OPUS: A Lightweight System for Observational Provenance in User Space. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 8, 2013. Google ScholarDigital Library
- D. Bearman and R. Lytle. The Power of the Principle of Provenance. Archivaria, 21(February 1982):14--27, 1985.Google Scholar
- K. Belhajjame, H. Deus, D. Garijo, G. Klyne, P. Missier, S. Soliand-Reyes, and S. Zednik. PROV Model Primer. In W3C Working Group Note, 2013.Google Scholar
- M. A. Borkin, C. S. Yeh, M. Boyd, P. MacKo, K. Z. Gajos, M. Seltzer, and H. Pfister. Evaluation of filesystem provenance visualization tools. IEEE Transactions on Visualization and Computer Graphics, 19(12):2476--2485, 2013. Google ScholarDigital Library
- J. Cheney, P. Missier, and L. Moreau. Constraints of the Provenance Data Model. Technical report, 2012.Google Scholar
- E. R. Gansner, E. Koutsofios, S. C. North, and K. P. Vo. A Technique for Drawing Directed Graphs. IEEE Transactions on Software Engineering, 19(3):214--230, 1993. Google ScholarDigital Library
- P. Guo and M. Seltzer. BURRITO: Wrapping Your Lab Notebook in Computational Infrastructure. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 4, 2012. Google ScholarDigital Library
- I. Li, Y. Medynskiy, J. Froehlich, and J. E. Larsen. Personal informatics in practice: improving quality of life through data. CHI Extended Abstracts on Human Factors in Computing Systems, 2799--2802, 2012. Google ScholarDigital Library
- P. Macko, M. Chiarini, and M. Seltzer. Collecting Provenance via the Xen Hypervisor. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 2011.Google Scholar
- P. Missier, J. Bryans, C. Gamble, V. Curcin, and R. Danger. Provabs: Model, policy, and tooling for abstracting PROV graphs. In International Provenance & Annotation Workshop (IPAW), 2014.Google Scholar
- D. Schaffer, Z. Zuo, S. Greenberg, L. Bartram, J. Dill, S. Dubs, and M. Roseman. Navigating hierarchically clustered networks through fisheye and full-zoom methods. ACM Transactions on Computer-Human Interaction, 3(2):162--188, 1996. Google ScholarDigital Library
- M. Seltzer and P. Macko. Provenance Map Orbiter: Interactive Exploration of Large Provenance Graphs. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 2011.Google Scholar
- B. Shneiderman. The eyes have it: A task by data type taxonomy for information visualizations. In IEEE Symposium on Visual Languages, 336--343, 1996. Google ScholarDigital Library
Recommendations
Local clustering in provenance graphs
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementSystems that capture and store data provenance, the record of how an object has arrived at its current state, accumulate historical metadata over time, forming a large graph. Local clustering in these graphs, in which we start with a seed vertex and ...
The perm provenance management system in action
SIGMOD '09: Proceedings of the 2009 ACM SIGMOD International Conference on Management of dataIn this demonstration we present the Perm provenance management system (PMS). Perm is capable of computing, storing and querying provenance information for the relational data model. Provenance is computed by using query rewriting techniques to annotate ...
Provenance of publications: a PROV style for latex
TaPP'15: Proceedings of the 7th USENIX Conference on Theory and Practice of ProvenanceIn general, the task of generating provenance is still tedious, and the community still lacks tools to generate provenance easily. In particular, when writing papers, researchers should be able to produce the provenance of their papers, make it ...
Comments