Abstract
In this paper, we introduce a new type of integrity constraint, which we call a statistical constraint, and discuss its applicability to enhancing database correctness. Statistical constraints manifest embedded relationships among current attribute values in the database and are characterized by their probabilistic nature. They can be used to detect potential errors not easily detected by the conventional constraints. Methods for extracting statistical constraints from a relation and enforcement of such constraints are described. Preliminary performance evaluation of enforcing statistical constraints on a real life database is also presented.
- Agra 89 Agrawal, R., Gehani, N., "Ode (Object Database and Environment) ~ The Language and the Data Model", ACM SIG- MOD Conference, 1989, pp. 36-45. Google ScholarDigital Library
- Agra 93 Agrawal, R., Imielinski, T., Swami, A., "Mining Association Rules between Sets of Items in Large Databases", ACM SIG- MOD Conference, 1993, pp. 207-216. Google ScholarDigital Library
- Beer 91 Beeri, C., Milo, T, "A Model for Active Object Oriented Database", Proc. 17th VLDB Conference, 1991, pp 337-349. Google ScholarDigital Library
- Bran 93 Brant, D., Miranker, D.,"Index Support for Rule Activation", ACM SIGMOD Conference, 1993, pp. 42-48. Google ScholarDigital Library
- Chou 75 Chou, Y-I. "Statistical Analysis", Holt, Rinehart and Winston, 1975.Google Scholar
- Coch 77 Cochran, W. "Sampling Techniques", 3rd Ed., John Wiley & Sons, 1977.Google Scholar
- Codd 70 Codd, E. F., "A Relational Model for Large Shared Data Banks", Communication of the ACM, Vol. 13, No. 6, 1970, pp. 377-387. Google ScholarDigital Library
- Devo 84 Devore, J., "Probability & Statistics for Engineering and the Sciences", Brooks/Cole Publishing, 1984.Google Scholar
- EsCh 75 Eswaran, K., Chamberlin D. "Functional Specifications of a Subsystem for Data Base Integrity", Proc. VLDB 1975, pp. 48-68.Google ScholarDigital Library
- Hans 92 Hanson, E., "Rule Condition Testing and Action Execution in Ariel", ACM SIG- MOD Conference, 1992, pp. 49-58. Google ScholarDigital Library
- HaSa 78 Hammer, M., Sarin, S., "Efficient Monitoring of Database Assertions", Proc. of ACM SIGMOD Conference, 1978. Google ScholarDigital Library
- HoWo 73 Hollander, M., and Wolfe, D., "Nonparametric Statistical Methods", John Wiley, 1973.Google Scholar
- HoOz 93 Hou, W-C., Ozsoyoglu, G., "Processing Real-Time Aggregate Relational Queries in CASE-DB", ACM Transactions on Database Systems Vol. 18, No. 2, June, 1993. Google ScholarDigital Library
- HsIm 85 Hsu, A., Imielinski, T., "Integrity Checking for Multiple Updates", Proc. of ACM SIGMOD Conference, 1985, pp. 152-168. Google ScholarDigital Library
- HZZ 93 Hou, W-C., Zhang, Z., Zhou, N., "Statistical Inference of Unknown Attribute Values in Databases", Proc. CIKM 1993, pp. 21-30. Google ScholarDigital Library
- JoWi 92 Johnson, R. and Wichem, D., "Applied Multivariate Statistical Analysis", 3rd ed. Prentice-Hall, Englewood Cliffs, 1992. Google ScholarDigital Library
- Lohm 91 Lohman, G., etc., "Extension to Starburst : Objects, Types, Functions, and Rules", Comm. ACM, Vol. 34, No. 10, 1991, pp. 94-109. Google ScholarDigital Library
- McCa 89 McCarthy, D., Uayal, U, "The Architecture of An Active Object-Oriented Database System, ACM SIGMOD Conference, 1989, pp. 215-224. Google ScholarDigital Library
- Morg 83 Morgenstem, M. "Active Databases as a Paradigm for Enhanced Computing Environments", Proc. the 9th VLDB Conference, 1983, pp. 34-42. Google ScholarDigital Library
- Piat 91 G. Piatetsky-Shapiro etc., "Knowledge Discovery in Databases", AAA//MIT Press, 1991. Google ScholarDigital Library
- SAS 91 "SAS/STAT User's Guide", Release 6.03 Ed., S AS Institute Inc., North Carolina.Google Scholar
- Sell 88 Sellis, T., Lin, C., Raschid, L., "Implementing Large Production Systems in a DBMS Environment: Concepts and Algorithms", ACM SIGMOD Conference, 1988, pp. 404-412. Google ScholarDigital Library
- Ston 90 Stonebraker, M., etc., "On Rules, Procedures, Caching and Views in Database Systems", ACM SIGMOD Conference, 1990, pp. 281-290. Google ScholarDigital Library
- Suit 85 Suits, D. "Statistics : An Introduction to Quantitative Economic Research", Halyburton Press, 1985.Google Scholar
- Tats 88 Tatsuoka, M., "Multivariate Analysis", Macmillan Publishing, 1988.Google Scholar
Index Terms
- Enhancing database correctness: a statistical approach
Recommendations
Enhancing database correctness: a statistical approach
SIGMOD '95: Proceedings of the 1995 ACM SIGMOD international conference on Management of dataIn this paper, we introduce a new type of integrity constraint, which we call a statistical constraint, and discuss its applicability to enhancing database correctness. Statistical constraints manifest embedded relationships among current attribute ...
Comments