skip to main content
10.1145/1353343.1353403acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article
Free Access

Data exchange in the presence of arithmetic comparisons

Published:25 March 2008Publication History

ABSTRACT

Data exchange is the problem of transforming data structured under a schema (called source) into data structured under a different schema (called target). The emphasis of data exchange is to materialize a target instance (called solution) that satisfies the relationship between the schemas. Universal solutions were shown to be the most suitable solutions, mainly because they can be used to answer conjunctive queries posed over the target schema. Trying to extend this result to more expressive query languages fails, even if we only add inequalities (≠) to conjunctive queries.

In this work we study data exchange in the presence of general arithmetic comparisons (<, ≤, >, ≥, =, ≠): (a) We consider queries posed over the target schema that belong to the class of unions of conjunctive queries with arithmetic comparisons (in short CQACs). (b) We exploit arithmetic comparisons to define more expressive data exchange settings, called DEAC settings. In particular, DEAC settings consist of constraints that involve arithmetic comparisons. For that, two new classes of dependencies (tgd-ACs and acgds) are introduced, to capture the need of arithmetic comparisons in source-to-target and target constraints.

We show that in DEAC settings the existence of solution problem is in NP. We define a novel chase procedure called AC-chase which is a tree and we prove that it produces a universal solution (appropriately defined to deal with arithmetic comparisons). We show that the new concept of universal solution is the right tool for query answering in the case of unions of CQACs. The complexity of computing certain answers for unions of CQACs is shown to be coNP-complete. Moreover, we identify polynomial cases for a) computing a universal solution and b) computing certain answers. For that, we introduce the succinct AC-chase which is a sequence instead of a tree, but its result is not necessarily a solution. We identify cases where succinct AC-chase returns indeed a universal solution and we investigate the syntactic conditions of the query under which query answering takes polynomial time. We show that the latter is feasible even in cases where the result of chase is not a universal solution.

References

  1. S. Abiteboul and O. M. Duschka. Complexity of answering queries using materialized views. In PODS, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. F. Afrati, C. Li, and P. Mitra. On containment of conjunctive queries with arithmetic comparisons. In EDBT, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  3. F. Afrati, C. Li, and P. Mitra. Rewriting queries using views in the presence of arithmetic comparisons. TCS, 368(1--2):88--123, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Arenas, P. Barcel, R. Fagin, and L. Libkin. Locally consistent transformations and query answering in data exchange. In PODS, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Arenas and L. Libkin. XML data exchange: Consistency and query answering. In PODS, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Baudinet, J. Chomicki, and P. Wolper. Constraint-generating dependencies. J. Comput. Syst. Sci., 59(1):94--115, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. C. Beeri and M. Y. Vardi. Formal systems for tuple and equality generating dependencies. SIAM J. on Computing, 13(1):76--98, 1984.Google ScholarGoogle ScholarCross RefCross Ref
  8. C. Beeri and M. Y. Vardi. A proof procedure for data dependencies. J. ACM, 31(4):718--741, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. P. A. Bernstein. Generic model management: A database infrastructure for schema manipulation. In IDM 2003 Workshop, 2003.Google ScholarGoogle Scholar
  10. P. Bohannon, W. Fan, F. Geerts, X. Jia, and A. Kementsietsidis. Conditional functional dependencies for data cleaning. In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  11. A. K. Chandra and P. M. Merlin. Optimal implementation of conjunctive queries in relational databases. In STOC, 1977. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Deutsch and V. Tannen. Reformulation of XML queries and constraints. In ICDT, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Fagin. Horn clauses and database dependencies. J. ACM, 29(4):952--985, 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa. Data exchange: Semantics and query answering. In ICDT, 2003. Full version: TCS 336(1): 89--124, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Fagin, P. G. Kolaitis, and L. Popa. Data exchange: Getting to the core. In PODS, 2003. Full version: ACM TODS 30(1): 174--210, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Gottlob. Computing cores for data exchange: New algorithms and practical solutions. In PODS, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. G. Gottlob and A. Nash. Data exchange: Computing cores in polynomial time. In PODS, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. Grahne. The problem of incomplete information in relational databases, volume 554. Spring-Verlag, Berlin, Lecture Notes in Computer Science, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Gupta. Partial Information Based Integrity Constraint Checking. PhD thesis, Stanford, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. T. Imielinski and W. Lipski. Incomplete information in relational databases. J. ACM, 31(4):761--791, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. N. Ishakbeyoglu and Z. M. Ozsoyoglu. On the maintenance of implication integrity constraints. In DEXA, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Klug. On conjunctive queries containing inequalities. J. ACM, 35(1):146--160, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. P. G. Kolaitis. Schema Mappings, Data Exchange, and Metadata Management. In PODS, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. P. G. Kolaitis, J. Panttaja, and W.-C. Tan. The complexity of data exchange. In PODS, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Lenzerini. Data integration: A theoretical perspective. In PODS, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. Levy, A. Rajaraman, and J. Ullman. Answering queries using limited external query processors. In PODS, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. L. Libkin. Data exchange and incomplete information. In PODS, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Madry. Data exchange: on complexity of answering queries with inequalities. Information Processing Letters, 94(6):253--257, 2005.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. M. J. Maher. Constrained dependencies. Theoretical Computer Science, 173(1):113--149, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. J. Maher and D. Srivastava. Chasing constrained tuple-generating dependencies. In PODS, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. D. Maier, A. O. Mendelzon, and Y. Sagiv. Testing implications of data dependencies. ACM TODS, 4(4):455--469, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. R. v. d. Meyden. The complexity of querying indefinite data about linearly ordered domains. In PODS, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. R. v. d. Meyden. Logical approaches to incomplete information: A survey. In Logics for Databases and Information Systems, pages 307--356, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. R. J. Miller, L. M. Haas, and M. Hernández. Schema mapping as query discovery. In VLDB, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. L. Popa, Y. Velegrakis, R. J. Miller, M. A. Hernández, and R. Fagin. Translating web data. In VLDB, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. N. C. Shu, B. C. Housel, R. W. Taylor, S. P. Ghosh, and V. Y. Lum. Express: A data EXtraction, processing, amd REStructuring System. ACM TODS, 2(2):134--174, 1977. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. A. Vieilleribiere and M. D. Rougemont. Approximate data exchange. In ICDT, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. J. Wang, R. W. Topor, and M. J. Maher. Reasoning with disjunctive constrained tuple-generating dependencies. In DEXA, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Data exchange in the presence of arithmetic comparisons

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            EDBT '08: Proceedings of the 11th international conference on Extending database technology: Advances in database technology
            March 2008
            762 pages
            ISBN:9781595939265
            DOI:10.1145/1353343

            Copyright © 2008 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 25 March 2008

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate7of10submissions,70%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader