skip to main content
research-article

The complexity of causality and responsibility for query answers and non-answers

Published:01 October 2010Publication History
Skip Abstract Section

Abstract

An answer to a query has a well-defined lineage expression (alternatively called how-provenance) that explains how the answer was derived. Recent work has also shown how to compute the lineage of a non-answer to a query. However, the cause of an answer or non-answer is a more subtle notion and consists, in general, of only a fragment of the lineage. In this paper, we adapt Halpern, Pearl, and Chockler's recent definitions of causality and responsibility to define the causes of answers and non-answers to queries, and their degree of responsibility. Responsibility captures the notion of degree of causality and serves to rank potentially many causes by their relative contributions to the effect. Then, we study the complexity of computing causes and responsibilities for conjunctive queries. It is known that computing causes is NP-complete in general. Our first main result shows that all causes to conjunctive queries can be computed by a relational query which may involve negation. Thus, causality can be computed in PTIME, and very efficiently so. Next, we study computing responsibility. Here, we prove that the complexity depends on the conjunctive query and demonstrate a dichotomy between PTIME and NP-complete cases. For the PTIME cases, we give a non-trivial algorithm, consisting of a reduction to the max-flow computation problem. Finally, we prove that, even when it is in PTIME, responsibility is complete for LOGSPACE, implying that, unlike causality, it cannot be computed by a relational query.

References

  1. S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. P. Buneman, S. Khanna, and W. C. Tan. Why and where: A characterization of data provenance. In ICDT, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Chapman and H. V. Jagadish. Why not? In SIGMOD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Cheney, L. Chiticariu, and W. C. Tan. Provenance in databases: Why, how, and where. Foundations and Trends in Databases, 1(4):379--474, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Chockler and J. Y. Halpern. Responsibility and blame: A structural-model approach. J. Artif. Intell. Res. (JAIR), 22:93--115, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Chockler, J. Y. Halpern, and O. Kupferman. What causes a system to satisfy a specification? ACM Trans. Comput. Log., 9(3), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. Cui, J. Widom, and J. L. Wiener. Tracing the lineage of view data in a warehousing environment. ACM Trans. Database Syst., 25(2):179--227, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. N. Dalvi and D. Suciu. Management of probabilistic data: Foundations and challenges. In PODS, pages 1--12, Beijing, China, 2007. (invited talk). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Eiter and T. Lukasiewicz. Complexity results for structure-based causality. Artif. Intell., 142(1):53--89, 2002. (Conference version in IJCAI, 2002). Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. T. Eiter and T. Lukasiewicz. Causes and explanations in the structural-model approach: Tractable cases. Artif. Intell., 170(6--7):542--580, 2006. Google ScholarGoogle ScholarCross RefCross Ref
  11. G. Gottlob, N. Leone, and F. Scarcello. The complexity of acyclic conjunctive queries. J. ACM, 48(3):431--498, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. J. Green, G. Karvounarakis, and V. Tannen. Provenance semirings. In PODS, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Y. Halpern and J. Pearl. Causes and explanations: A structural-model approach. Part I: Causes. Brit. J. Phil. Sci., 56:843--887, 2005. (Conference version in UAI, 2001). Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Herschel, M. A. Hernández, and W. C. Tan. Artemis: A system for analyzing missing answers. PVLDB, 2(2):1550--1553, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Huang, T. Chen, A. Doan, and J. F. Naughton. On the provenance of non-answers to queries over extracted data. PVLDB, 1(1):736--747, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Lewis. Causation. The Journal of Philosophy, 70(17):556--567, 1973.Google ScholarGoogle ScholarCross RefCross Ref
  17. A. Meliou, W. Gatterbauer, J. Halpern, C. Koch, K. F. Moore, and D. Suciu. Causality in databases. IEEE Data Engineering Bulletin, Sept. 2010.Google ScholarGoogle Scholar
  18. A. Meliou, W. Gatterbauer, K. F. Moore, and D. Suciu. The complexity of causality and responsibility for query answers and non-answers. CoRR, abs/1009.2021, 2010.Google ScholarGoogle Scholar
  19. A. Meliou, W. Gatterbauer, K. F. Moore, and D. Suciu. Why so? or Why no? Functional causality for explaining query answers. In MUD, 2010. Full version: CoRR abs/0912.5340 (2009).Google ScholarGoogle Scholar
  20. P. Menzies. Counterfactual theories of causation. Stanford Encylopedia of Philosophy, 2008.Google ScholarGoogle Scholar
  21. D. Olteanu and J. Huang. Secondary-storage confidence computation for conjunctive queries with inequalities. In SIGMOD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. P. Senellart and G. Gottlob. On the complexity of deriving schema mappings from database instances. PODS, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Q. T. Tran and C.-Y. Chan. How to conquer why-not questions. In SIGMOD, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. International multidisciplinary workshop on causality. IRIT, Toulouse, June 2009.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 4, Issue 1
    October 2010
    45 pages

    Publisher

    VLDB Endowment

    Publication History

    • Published: 1 October 2010
    Published in pvldb Volume 4, Issue 1

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader