skip to main content
research-article

Mining multidimensional and multilevel sequential patterns

Published:18 January 2010Publication History
Skip Abstract Section

Abstract

Multidimensional databases have been designed to provide decision makers with the necessary tools to help them understand their data. This framework is different from transactional data as the datasets contain huge volumes of historicized and aggregated data defined over a set of dimensions that can be arranged through multiple levels of granularities. Many tools have been proposed to query the data and navigate through the levels of granularity. However, automatic tools are still missing to mine this type of data in order to discover regular specific patterns. In this article, we present a method for mining sequential patterns from multidimensional databases, at the same time taking advantage of the different dimensions and levels of granularity, which is original compared to existing work. The necessary definitions and algorithms are extended from regular sequential patterns to this particular case. Experiments are reported, showing the significance of this approach.

References

  1. Agrawal, R. and Srikant, R. 1995. Mining sequential patterns. In Proceedings of the International Conference on Data Engineering (ICDE). IEEE Computer Society, 3--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ayres, J., Flannick, J., Gehrke, J., and Yiu, T. 2002. Sequential pattern mining using a bitmap representation. In Proceedings of the International Conference on Knowledge Discovery and Data Mining (KDD). AAAI Press, 429--435. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Beyer, K. and Ramakrishnan, R. 1999. Bottom-up computation of sparse and iceberg cube. In Proceedings of the International Conference on Management of Data (SIGMOD). ACM Press, 359--370. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Bonchi, F. and Lucchese, C. 2006. On condensed representations of constrained frequent patterns. Knowl. Inform. Syst. 9, 2, 180--201.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Boulicaut, J.-F., Bykowski, A., and Rigotti, C. 2003. Free-sets: a condensed representation of Boolean data for the approximation of frequency queries. Data Mining Knowl. Disc. 7, 1, 5--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Burdick, D., Calimlim, M., and Gehrke, J. 2001. MAFIA: a maximal frequent itemset algorithm for transactional databases. In Proceedings of the International Conference on Data Engineering (ICDE). IEEE Computer Society, 443--452. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Calders, T. and Goethals, B. 2002. Mining all non-derivable frequent itemsets. In Proceedings of the European Conference on the Principles and Practice of Knowledge Discovery in Databases (PKDD). Lecture Notes in Computer Science, vol. 2431. Springer Verlag, 74--85. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Calders, T., Rigotti, C., and Boulicaut, J.-F. 2006. A survey on condensed representations for frequent sets. In Proceedings of Constraint-Based Mining and Inductive Databases: European Workshop on Inductive Databases and Constraint Based Mining. Lecture Notes in Computer Science, vol. 3848. Springer Verlag, 64--80. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Chiu, D.-Y., Wu, Y.-H., and Chen, A. L. P. 2004. An efficient algorithm for mining frequent sequences by a new strategy without support counting. In Proceedings of the International Conference on Data Engineering (ICDE). IEEE Computer Society, 375--386. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. de Amo, S., Furtado, D. A., Giacometti, A., and Laurent, D. 2004. An a priori-based approach for first-order temporal pattern mining. In Proceedings of the Simpósio Brasileiro de Bancos de Dados. 48--62.Google ScholarGoogle Scholar
  11. Dietterich, T. and Michalski, R. 1985. Discovering patterns in sequences of events. Artif. Intell. 25, 2, 187--232. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Han, J. and Fu, Y. 1999. Mining multiple-level association rules in large databases. IEEE Trans. Knowl. Data Engin. 11, 5, 798--804. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Inmon, W. 2003. Building the Data Warehouse. John Wiley and Sons. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Mannila, H. and Toivonen, H. 1996. Multiple uses of frequent sets and condensed representations. In Proceedings of the International Conference on Knowledge Discovery and Data Mining (KDD). AAAI Press, 189--194.Google ScholarGoogle Scholar
  15. Mannila, H., Toivonen, H., and Verkamo, A. 1995. Discovering frequent episodes in sequences. In Proceedings of the International Conference on Knowledge Discovery and Data Mining (KDD). AAAI Press, 210--215.Google ScholarGoogle Scholar
  16. Masseglia, F., Cathala, F., and Poncelet, P. 1998. The PSP approach for mining sequential patterns. In Proceedings of the Principles and Practice of Knowledge Discovery in Databases (PKDD). Lecture Notes in Computer Science, vol. 1510. Springer Verlag, 176--184. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Pasquier, N., Bastide, Y., Taouil, R., and Lakhal, L. 1999a. Discovering frequent closed itemsets for association rules. In Proceedings of the International Conference on Database Theory (ICDT). Lecture Notes in Computer Science, vol. 1540. Springer Verlag, 398--416. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Pasquier, N., Bastide, Y., Taouil, R., and Lakhal, L. 1999b. Efficient mining of association rules using closed itemset lattices. Inform. Syst. 24, 1, 25--46. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Pei, J., Han, J., and Mao, R. 2000. Closet: An efficient algorithm for mining frequent closed itemsets. In Proceedings of the SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery. ACM Press, 21--30.Google ScholarGoogle Scholar
  20. Pei, J., Han, J., Mortazavi-Asl, B., Wang, J., Pinto, H., Chen, Q., Dayal, U., and Hsu, M.-C. 2004. Mining sequential patterns by pattern-growth: The prefixspan approach. IEEE Trans. Knowl. Data Engin. 16, 11, 1424--1440. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Pinto, H., Han, J., Pei, J., Wang, K., Chen, Q., and Dayal, U. 2001. Multi-dimensional sequential pattern mining. In Proceedings of the International Conference on Information and Knowledge Management (CIKM). ACM Press, 81--88. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Plantevit, M., Choong, Y. W., Laurent, A., Laurent, D., and Teisseire, M. 2005. M2SP: Mining sequential patterns among several dimensions. In Proceedings of the European Conference on the Principles and Practice of Knowledge Discovery in Databases (PKDD). Lecture Notes in Artificial Intelligence, vol. 3721. Springer Verlag, 205--216.Google ScholarGoogle Scholar
  23. Plantevit, M., Laurent, A., and Teisseire, M. 2006. HYPE: mining hierarchical sequential patterns. In Proceedings of the International Workshop on Data Warehousing and OLAP (DOLAP). ACM Press, 19--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Rashad, S., Kantardzic, M. M., and Kumar, A. 2007. MSP-CACRR: multidimensional sequential patterns-based call admission control and resource reservation for next-generation wireless cellular networks. In Proceedings of the Symposium on Computational Intelligence and Data Mining (CIDM). IEEE Computer Society, 552--559.Google ScholarGoogle Scholar
  25. Srikant, R. and Agrawal, R. 1996. Mining sequential patterns: generalizations and performance improvements. In Proceedings of the International Conference on Extending Data Base Technology (EDBT). Lecture Notes in Computer Science, vol. 1057. Springer Verlag, 3--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Stefanowski, J. 2007. Algorithms for context based sequential pattern mining. Fundamenta Informaticae 76, 4, 495--510. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Stefanowski, J. and Ziembinski, R. 2005. Mining context based sequential patterns. In Proceedings of the Atlantic Web Intelligence Conference (AWIC). Lecture Notes in Computer Science, vol. 3528. Springer Verlag, 401--407. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Yang, Z., Kitsuregawa, M., and Wang, Y. 2006. Paid: mining sequential patterns by passed item deduction in large databases. In Proceedings of the International Database Engineering and Applications Symposium (IDEAS). IEEE Computer Society, 113--120. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Yu, C.-C. and Chen, Y.-L. 2005. Mining sequential patterns from multidimensional sequence data. IEEE Trans. Knowl. Data Engin. 17, 1, 136--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Zaki, M. J. 2001. SPADE: an efficient algorithm for mining frequent sequences. Machine Learn. J. Special Issue on Unsupervised Learning 42, 1/2, 31--60.Google ScholarGoogle Scholar
  31. Zaki, M. J. 2004. Mining non-redundant association rules. Data Mining Knowl. Discov. 9, 223--248. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Zaki, M. J. and Hsiao, C.-J. 2002. CHARM: an efficient algorithm for closed itemset mining. In Proceedings of the SIAM International Conference on Data Mining (SDM). SIAM.Google ScholarGoogle Scholar
  33. Zhang, C., Hu, K., Chen, Z., Chen, L., and Dong, Y. 2007. ApproxMGMSP: a scalable method of mining approximate multidimensional sequential patterns on distributed system. In Proceedings of the International Conference on Fuzzy Systems and Knowledge Discovery (FSKD). Vol. 2. IEEE Computer Society, 730--734. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Mining multidimensional and multilevel sequential patterns

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Knowledge Discovery from Data
      ACM Transactions on Knowledge Discovery from Data  Volume 4, Issue 1
      January 2010
      135 pages
      ISSN:1556-4681
      EISSN:1556-472X
      DOI:10.1145/1644873
      Issue’s Table of Contents

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 January 2010
      • Accepted: 1 May 2009
      • Revised: 1 January 2009
      • Received: 1 August 2008
      Published in tkdd Volume 4, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader