skip to main content
article
Free Access

Association rules over interval data

Authors Info & Claims
Published:01 June 1997Publication History
Skip Abstract Section

Abstract

We consider the problem of mining association rules over interval data (that is, ordered data for which the separation between data points has meaning). We show that the measures of what rules are most important (also called rule interest) that are used for mining nominal and ordinal data do not capture the semantics of interval data. In the presence of interval data, support and confidence are no longer intuitive measures of the interest of a rule. We propose a new definition of interest for association rules that takes into account the semantics of interval data. We developed an algorithm for mining association rules under the new definition and overview our experience using the algorithm on large real-life datasets.

References

  1. AIS93 R. Agrawal, T. Imielinksi, and A. Swami. Mining Association Rules between Sets of Items in Large Databases. In Proc. of the A CM SIGMOD Int'l Conf. on Management o} Data, Washington, DC, May 1993. Google ScholarGoogle Scholar
  2. AS94 R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules in Large Databases. In Proc. of the Int 'l Con}. on Very Large Data Bases (VLDB), Santiago, Chile, September 1994. Google ScholarGoogle Scholar
  3. CNFF96 D. W. Cheung, V. T. Ng, A. W. Fu, and Y. Fu. Efficient Mining of Association Rules in Distributed Databases. IEEE TKDE, 1996. To appear. Google ScholarGoogle Scholar
  4. EKX95 M. Ester, H.-P. Kriegel, and X. Xu. A Database Interface for Clustering in Large Spatial Databases. In Proc. of the Int'l Conf. on Knowledge Discovery ~4 Data Mining, 1995.Google ScholarGoogle Scholar
  5. Eve93 B.S. Everitt. Cluster Analysis. Edward Arnold and Halsted Press, New York- Toronto, 1993.Google ScholarGoogle Scholar
  6. FPSM91 W. J. Frawley, G. Piatetsky-Shapiro, and C. J. Matheus. Knowledge Discovery in Databases: An Overview. In {PSF91}, 1991.Google ScholarGoogle Scholar
  7. HCC93 J. Hail, Y. Cai, and N. Cercone. Data-Driven Discovery of Quantitative Rules in Relational Databases. IEEE Transactions on Knowledge and Data Engineering, 5(1):29-40, 1993. Google ScholarGoogle Scholar
  8. HF95 J. Han and Y. Fu. Discovery of Multiple-level As- Rules from Large Databases. In Proc. the lnt'l Conf. on Very Large Data Bases (VLDB), Zurich, Switzerland, September 1995. Google ScholarGoogle Scholar
  9. HS95 M. Houtsma and A. Swami. Set-oriented Mining of Association Rules. In Proc. of the Int 7 Conf. on Data Engineering, Taipei, Taiwan, March 1995 Google ScholarGoogle Scholar
  10. JD88 h.K. Jain and R. C. Dubes. Algorithms for Clustering Data. Prentice-Hall, Englewood Cliffs, NJ, 1988. Google ScholarGoogle Scholar
  11. KR90 L. Kaufman and P. J. Rousseeuw. Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, inc., NY, 1990.Google ScholarGoogle Scholar
  12. Mil95 G.W. Milligan. Clustering Validation: Results and Implications for Applied Analyses. In P. Arable, L. J. Huber, and G. DeSoete, editors, Clustering and Classification, pages 345-375. World Scientific Publishing, River Edge, NJ, 1995.Google ScholarGoogle Scholar
  13. MTV94 H. Mannila, H. Toivonen, and A. I. Verkamo. Efficient Algorithms for Discovering Association Rules. In Proc. of the AAAI Workshop on Knowledge Discovery in Databases, pages 181- 192, Seattle, WA, july 1994.Google ScholarGoogle Scholar
  14. NH94 R.T. Ng and J. Han. Efficient and Effective ClusteringMethods for Spatila Data Mining. In Proc. of the int'l Conf. on Very Large Data Bases (VLDB), 1994. Google ScholarGoogle Scholar
  15. PCY95 J.S. Park, M.-S. Chen, and P. S. Yu. An Effective Hash Based Algorithm for Mining Association Rules. In Proc. of the A CM SIGMOD Int'l Conf. on Management of Data, San Jose, CA, May 1995. Google ScholarGoogle Scholar
  16. PS91 G. Piatetsky-Shapiro. Discovery, Analysis, and Presentation of Strong Rules. In {PSF91}, pages 229-248, 1991.Google ScholarGoogle Scholar
  17. PSF91 G. Piatetsky-Shapiro and W. J. Frawley. Knowledge Discovery in Databases. AAAI Press/MIT Press, Cambridge, MA, 1991. Google ScholarGoogle Scholar
  18. SA95 R. Srikant and R. Agrawal. Mining Generalized Association Rules. In Proc. of the Int'l Conf. on Very Large Data Bases (VLDB), Zurich, Switzerland, September 1995. Google ScholarGoogle Scholar
  19. SA96 R. Srikant and R. Agrawal. Mining Quantitative Association Rules in Large Relational Tables. In Proc. of the A CM SIGMOD Int'l Conf. on Management of Data, Montreal, Canada, 1996. Google ScholarGoogle Scholar
  20. SON95 A. Savasere, E. Omiecinski, and S. Navathe. An Efficient Algorithm for Mining Association Rules in Large Databases. In Proc. of the Int'l Con}. on Very Large Data Bases (VLDB), Zurich, Switzerland, September 1995. Google ScholarGoogle Scholar
  21. Toi96 H. Toivonen. Sampling Large Databases for Association Rules. In Proc. of the Int'l Conf. on Very Large Data Bases (VLDB), Bombay, India, 1996. Google ScholarGoogle Scholar
  22. WM92 W. H. Wolberg and O. Mangasarian. Wisconsin Breast Cancer Database. In P. M. Murphy and D. W. Aha, editors, UCI Repository of Machine Learning Databases, Irvine, CA, 1992. University of California, Department of Information and Computer Science. http://www.ics, uci. edu / .-mlear n/M L Reposi tory. ht m I.Google ScholarGoogle Scholar
  23. ZRL96 T. Zhang, R. Ramakrishnan, and M. Livny. BIRCH: An Efficient Data Clustering Method for Very Large Databases. In Proc. of the A CM SIG- MOD Int 'l Conf. on Management of Data, Montreal, Canada, 1996. Google ScholarGoogle Scholar
  24. ZRL97 T. Zhang, R. Ramakrishnan, and M. Livny. Data Clustering System BIRCH and Its Applications. Submitted for publication, 1997.Google ScholarGoogle Scholar

Index Terms

  1. Association rules over interval data

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM SIGMOD Record
          ACM SIGMOD Record  Volume 26, Issue 2
          June 1997
          583 pages
          ISSN:0163-5808
          DOI:10.1145/253262
          Issue’s Table of Contents
          • cover image ACM Conferences
            SIGMOD '97: Proceedings of the 1997 ACM SIGMOD international conference on Management of data
            June 1997
            594 pages
            ISBN:0897919114
            DOI:10.1145/253260

          Copyright © 1997 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 June 1997

          Check for updates

          Qualifiers

          • article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader