skip to main content
10.1145/2939502.2939508acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Clustering provenance facilitating provenance exploration through data abstraction

Published:26 June 2016Publication History

ABSTRACT

As digital objects become increasingly important in people's lives, people may need to understand the provenance, or lineage and history, of an important digital object, to understand how it was produced. This is particularly important for objects created from large, multi-source collections of personal data. As the metadata describing provenance, Provenance Data, is commonly represented as a labelled directed acyclic graph, the challenge is to create effective interfaces onto such graphs so that people can understand the provenance of key digital objects. This unsolved problem is especially challenging for the case of novice and intermittent users and complex provenance graphs. We tackle this by creating an interface based on a clustering approach. This was designed to enable users to view provenance graphs, and to simplify complex graphs by combining several nodes. Our core contribution is the design of a prototype interface that supports clustering and its analytic evaluation in terms of desirable properties of visualisation interfaces.

References

  1. J. Abello, F. Van Ham, and N. Krishnan. ASK-GraphView: A large scale graph visualization system. In IEEE Transactions on Visualization and Computer Graphics, volume 12, 669--676, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. N. Balakrishnan, T. Bytheway, R. Sohan, and A. Hopper. OPUS: A Lightweight System for Observational Provenance in User Space. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 8, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Bearman and R. Lytle. The Power of the Principle of Provenance. Archivaria, 21(February 1982):14--27, 1985.Google ScholarGoogle Scholar
  4. K. Belhajjame, H. Deus, D. Garijo, G. Klyne, P. Missier, S. Soliand-Reyes, and S. Zednik. PROV Model Primer. In W3C Working Group Note, 2013.Google ScholarGoogle Scholar
  5. M. A. Borkin, C. S. Yeh, M. Boyd, P. MacKo, K. Z. Gajos, M. Seltzer, and H. Pfister. Evaluation of filesystem provenance visualization tools. IEEE Transactions on Visualization and Computer Graphics, 19(12):2476--2485, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Cheney, P. Missier, and L. Moreau. Constraints of the Provenance Data Model. Technical report, 2012.Google ScholarGoogle Scholar
  7. E. R. Gansner, E. Koutsofios, S. C. North, and K. P. Vo. A Technique for Drawing Directed Graphs. IEEE Transactions on Software Engineering, 19(3):214--230, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. Guo and M. Seltzer. BURRITO: Wrapping Your Lab Notebook in Computational Infrastructure. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 4, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Li, Y. Medynskiy, J. Froehlich, and J. E. Larsen. Personal informatics in practice: improving quality of life through data. CHI Extended Abstracts on Human Factors in Computing Systems, 2799--2802, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Macko, M. Chiarini, and M. Seltzer. Collecting Provenance via the Xen Hypervisor. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 2011.Google ScholarGoogle Scholar
  11. P. Missier, J. Bryans, C. Gamble, V. Curcin, and R. Danger. Provabs: Model, policy, and tooling for abstracting PROV graphs. In International Provenance & Annotation Workshop (IPAW), 2014.Google ScholarGoogle Scholar
  12. D. Schaffer, Z. Zuo, S. Greenberg, L. Bartram, J. Dill, S. Dubs, and M. Roseman. Navigating hierarchically clustered networks through fisheye and full-zoom methods. ACM Transactions on Computer-Human Interaction, 3(2):162--188, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Seltzer and P. Macko. Provenance Map Orbiter: Interactive Exploration of Large Provenance Graphs. In USENIX Workshop on the Theory and Practice of Provenance (TaPP), 2011.Google ScholarGoogle Scholar
  14. B. Shneiderman. The eyes have it: A task by data type taxonomy for information visualizations. In IEEE Symposium on Visual Languages, 336--343, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    HILDA '16: Proceedings of the Workshop on Human-In-the-Loop Data Analytics
    June 2016
    93 pages
    ISBN:9781450342070
    DOI:10.1145/2939502

    Copyright © 2016 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 26 June 2016

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    HILDA '16 Paper Acceptance Rate16of32submissions,50%Overall Acceptance Rate28of56submissions,50%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader