skip to main content
research-article

Querying and repairing inconsistent numerical databases

Published:03 May 2010Publication History
Skip Abstract Section

Abstract

The problem of extracting consistent information from relational databases violating integrity constraints on numerical data is addressed. In particular, aggregate constraints defined as linear inequalities on aggregate-sum queries on input data are considered. The notion of repair as consistent set of updates at attribute-value level is exploited, and the characterization of several data-complexity issues related to repairing data and computing consistent query answers is provided. Moreover, a method for computing “reasonable” repairs of inconsistent numerical databases is provided, for a restricted but expressive class of aggregate constraints. Several experiments are presented which assess the effectiveness of the proposed approach in real-life application scenarios.

Skip Supplemental Material Section

Supplemental Material

References

  1. Afrati, F. N. and Kolaitis, P. G. 2009. Repair checking in inconsistent databases: Algorithms and complexity. In Proceedings of the 12th International Conference on Database Theory (ICDT). 31--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Agarwal, S., Keller, A. M., Wiederhold, G., and Saraswat, K. 1995. Flexible relation: An approach for integrating data from multiple, possibly inconsistent databases. In Proceedings of the 11th International Conference on Data Engineering (ICDE). 495--504. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Arenas, M., Bertossi, L. E., and Chomicki, J. 1999. Consistent query answers in inconsistent databases. In Proceedings of the 18th ACM Symposium on Principles of Database Systems (PODS). 68--79. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Arenas, M., Bertossi, L. E., and Chomicki, J. 2000. Specifying and querying database repairs using logic programs with exceptions. In Proceedings of the 4th International Conference on Flexible Query Answering Systems (FQAS). 27--41.Google ScholarGoogle Scholar
  5. Arenas, M., Bertossi, L. E., and Chomicki, J. 2003a. Answer sets for consistent query answering in inconsistent databases. Theory Pract. Logic Program. 3, 4-5, 393--424. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Arenas, M., Bertossi, L. E., Chomicki, J., He, X., Raghavan, V., and Spinrad, J. 2003b. Scalar aggregation in inconsistent databases. Theor. Comput. Sci. 3, 296, 405--434. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Arenas, M., Bertossi, L. E., and Kifer, M. 2000. Applications of annotated predicate calculus to querying inconsistent databases. In Proceedings of the 1st International Conference on Computing Logic (CL). 926--941. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Barceló, P. and Bertossi, L. E. 2002. Repairing databases with annotated predicate logic. In Proceedings of the 9th International Workshop on Non-Monotonic Reasoning (NMR). 160--170.Google ScholarGoogle Scholar
  9. Barceló, P. and Bertossi, L. E. 2003. Logic programs for querying inconsistent databases. In Proceedings of the 5th International Symposium on Practical Aspects of Declarative Languages (PADL). 208--222. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Bertossi, L. E., Bravo, L., Franconi, E., and Lopatenko, A. 2008. The complexity and approximation of fixing numerical attributes in databases under integrity constraints. Inform. Syst. 33, 4-5, 407--434. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Bohannon, P., Flaster, M., Fan, W., and Rastogi, R. 2005. A cost-based model and effective heuristic for repairing constraints by value modification. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 143--154. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Bry, F. 1997. Query answering in information systems with integrity constraints. In Proceedings of the 1st International Conference on Integrity and Internal Control in Information Systems (IICIS). 113--130. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Calì, A., Lembo, D., and Rosati, R. 2003. On the decidability and complexity of query answering over inconsistent and incomplete databases. In Proceedings of the 22nd ACM Symposium on Principles of Database Systems (PODS). 260--271. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Celle, A. and Bertossi, L. E. 2000. Querying inconsistent databases: Algorithms and implementation. In Proceedings of the 1st International Conference on Computing Logic (CL). 942--956. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Chomicki, J. and Marcinkowski, J. 2005. Minimal-Change integrity maintenance using tuple deletions. Inf. Comput. 197, 1-2, 90--121. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Chomicki, J., Marcinkowski, J., and Staworko, S. 2004. Computing consistent query answers using conflict hypergraphs. In Proceedings of the 13th ACM Conference on Information and Knowledge Management (CIKM). 417--426. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Eiter, T. and Gottlob, G. 1992. On the complexity of propositional knowledge base revision, updates, and counterfactuals. Artif. Intell. 57, 2-3, 227--270. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Fazzinga, B., Flesca, S., Furfaro, F., and Parisi, F. 2006. Dart: A data acquisition and repairing tool. In Proceedings of the International Workshop on Inconsistency and Incompleteness in Databases (IIDB). 297--317. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Flesca, S., Furfaro, F., and Parisi, F. 2005. Consistent query answers on numerical databases under aggregate constraints. In Proceedings of the 10th International Symposium on Database Programming Languages (DPBL). 279--294. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Franconi, E., Palma, A. L., Leone, N., Perri, S., and Scarcello, F. 2001. Census data repair: A challenging application of disjunctive logic programming. In Proceedings of the 8th International Conference on Logic for Programming, Artificial Intelligence, and Reasoning (LPAR). 561--578. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Fuxman, A., Fazli, E., and Miller, R. J. 2005. Conquer: Efficient management of inconsistent databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 155--166. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Fuxman, A. and Miller, R. J. 2007. First-Order query rewriting for inconsistent databases. J. Comput. Syst. Sci. 73, 4, 610--635. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Gärdenfors, P. and Rott, H. 1995. Belief revision. In Handbook of Logic in Artificial Intelligence and Logic Programming (Vol. 4): Epistemic and Temporal Reasoning, D. M. Gabbay, C. J. Hogger, and J. A. Robinson, Eds. Oxford University Press, Oxford, UK, 35--132. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Gass, S. I. 1985. Linear Programming: Methods and Applications (5th ed.). McGraw-Hill, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Greco, G., Greco, S., and Zumpano, E. 2003. A logical framework for querying and repairing inconsistent databases. IEEE Trans. Knowl. Data Engin. 15, 6, 1389--1408. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Imielinski, T. and Lipski, W. 1984. Incomplete information in relational databases. J. ACM 31, 4, 761--791. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Kendall, M. 1963. Ill-Conditioned matrices in linear programming. Metrika 6, 1, 60--64.Google ScholarGoogle ScholarCross RefCross Ref
  28. Kifer, M. and Lozinskii, E. L. 1992. A logic for reasoning with inconsistency. J. Autom. Reason. 9, 2, 179--215. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Lopatenko, A. and Bertossi, L. E. 2007. Complexity of consistent query answering in databases under cardinality-based and incremental repair semantics. In Proceedings of the 11th International Conference on Database Theory (ICDT). 179--193. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Papadimitriou, C. H. 1981. On the complexity of integer programming. J. ACM 28, 4, 765--768. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Papadimitriou, C. H. and Steiglitz, K. 1998. Combinatorial Optimization: Algorithms and Complexity. Dover, Mineola, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Papadimitriou, C. M. 1994. Computational Complexity. Addison-Wesley, Reading, MA.Google ScholarGoogle Scholar
  33. Rahm, E. and Do, H. H. 2000. Data cleaning: Problems and current approaches. IEEE Data Engin. Bull. 23, 4, 3--13.Google ScholarGoogle Scholar
  34. Ross, K. A., Srivastava, D., Stuckey, P. J., and Sudarshan, S. 1998. Foundations of aggregation constraints. Theor. Comput. Sci. 193, 1-2, 149--179. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Wijsen, J. 2003. Condensed representation of database repairs for consistent query answering. In Proceedings of the 9th International Conference on Database Theory (ICDT). 378--393. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Wijsen, J. 2004. Making more out of an inconsistent database. In Proceedings of the 8th East European Conference on Advances in Databases and Information Systems (ADBIS). 291--305.Google ScholarGoogle ScholarCross RefCross Ref
  37. Wijsen, J. 2005. Database repairing using updates. ACM Trans. Database. Syst. 30, 3, 722--768. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Wijsen, J. 2006. A note on database repairing by value modification. In Proceedings of the International Workshop on Inconsistency and Incompleteness in Databases (IIDB). Position paper.Google ScholarGoogle Scholar
  39. Wijsen, J. 2009. Consistent query answering under primary keys: A characterization of tractable queries. In Proceedings of the 12th International Conference on Database Theory (ICDT). 42--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Xie, G., Dang, Z., and Ibarra, O. H. 2003. A solvable class of quadratic diophantine equations with applications to verification of infinite-state systems. In Proceedings of the 30th International Colloquium on Automata, Languages and Programming (ICALP). 668--680. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Querying and repairing inconsistent numerical databases

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Database Systems
          ACM Transactions on Database Systems  Volume 35, Issue 2
          April 2010
          336 pages
          ISSN:0362-5915
          EISSN:1557-4644
          DOI:10.1145/1735886
          Issue’s Table of Contents

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 3 May 2010
          • Accepted: 1 February 2010
          • Revised: 1 September 2009
          • Received: 1 August 2008
          Published in tods Volume 35, Issue 2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader