2015 | OriginalPaper | Book chapter

Mining Long Patterns of Least-Support Items in Stream

Written by: Qinhua Huang, Weimin Ouyang

Published in: Intelligent Computing Theories and Methodologies

Publisher: Springer International Publishing

Abstract

The task of mining long sequential patterns has been studied extensively for years. Typical algorithms apply various cascading support-counting methods, including the basic Apriori algorithm, FP-growth, and algorithms derived from them. It is well known that, during mining, items with very high support can cause poor time performance and a very large, useless branch search space, especially when those items are not in fact members of the final long pattern. Conversely, items with the lowest support relative to the user-specified threshold, which nevertheless belong to a long pattern, are easily discarded. The problem becomes more challenging when the data source is a stream, because a data stream is unbounded, time-varying, and cannot be revisited. We consider the role of hidden Markov chain structure and examine how item frequency evolves in the stream mining context. In this paper we present a method for mining long patterns in data stream application scenarios. Our algorithm overcomes these negative effects in stream scenarios.
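
The discarding problem described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm. It only shows plain per-batch support counting with a fixed threshold. The toy batches, the item names, the helper functions `item_supports` and `frequent_items`, and the threshold `MIN_SUPPORT` are all assumptions made for illustration.

```python
from collections import Counter


def item_supports(sequences):
    """Return the fraction of sequences that contain each item."""
    counts = Counter()
    for seq in sequences:
        counts.update(set(seq))  # count an item at most once per sequence
    return {item: c / len(sequences) for item, c in counts.items()}


def frequent_items(sequences, min_support):
    """Items whose support in `sequences` meets the threshold."""
    return {i for i, s in item_supports(sequences).items() if s >= min_support}


# Hypothetical toy stream split into two batches (not data from the paper).
# Item "d" is rare in the first batch and common in the second.
batch1 = [["x", "a", "b", "c", "d"], ["x", "a", "b", "c"],
          ["x", "a", "b"], ["x", "a", "b", "c"]]
batch2 = [["a", "b", "c", "d"], ["x", "a", "b", "c", "d"],
          ["a", "b", "c", "d"], ["x", "a", "b", "c", "d"]]

MIN_SUPPORT = 0.5  # assumed user-specified threshold

# Pruning decided on the first batch alone drops "d" ...
kept_in_batch1 = frequent_items(batch1, MIN_SUPPORT)
# ... although over both batches "d" clears the threshold.
overall = item_supports(batch1 + batch2)

print(sorted(kept_in_batch1))
print(overall["d"])
```

In this toy example, "d" is pruned after the first batch even though its support over both batches is above the threshold. Because a stream cannot be revisited, such an early decision cannot be undone, which is the kind of effect the abstract refers to.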

References
1. Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, pp. 3–14. IEEE Computer Society Press (1995)
2. Srikant, R., Agrawal, R.: Mining sequential patterns: generalizations and performance improvements. In: Apers, P.M., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 3–17. Springer, Heidelberg (1996)
3. Masseglia, F., Cathala, F., Poncelet, P.: The PSP approach for mining sequential patterns. In: Żytkow, J.M. (ed.) PKDD 1998. LNCS, vol. 1510, pp. 176–184. Springer, Heidelberg (1998)
4. Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H.: PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth. In: Proceedings of the 17th International Conference on Data Engineering (ICDE 2001), pp. 215–226 (2001)
5. Han, J., Pei, J., Yin, Y., Mao, R.: Mining frequent patterns without candidate generation. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD 2000), pp. 1–12 (2000)
6. Han, J., Pei, J., Mortazavi-Asl, B., Chen, Q.: FreeSpan: frequent pattern-projected sequential pattern mining. In: Proceedings of the 2000 International Conference on Knowledge Discovery and Data Mining (KDD'00), pp. 355–359. Boston, MA (2000)
7. Zaki, M.J.: SPADE: an efficient algorithm for mining frequent sequences. Mach. Learn. 42, 31–60 (2001)
8. Ayres, J., Gehrke, J., Yiu, T., Flannick, J.: Sequential pattern mining using a bitmap representation. In: SIGKDD 2001, Edmonton, Alberta, Canada (2001)
9. Zhu, F., Yan, X., Han, J., Yu, P.S., Cheng, H.: Mining colossal frequent patterns by core pattern fusion. In: Proceedings of the International Conference on Data Engineering (ICDE) (2007)
11. Panuccio, A., Bicego, M., Murino, V.: A hidden Markov model-based approach to sequential data clustering. In: Caelli, T.M., Amin, A., Duin, R.P., Kamel, M.S., de Ridder, D. (eds.) SPR 2002 and SSPR 2002. LNCS, vol. 2396, pp. 734–743. Springer, Heidelberg (2002)
12. Giannella, C., Han, J., Pei, J., Yan, X., Yu, P.S.: Mining frequent patterns in data streams at multiple time granularities. In: Kargupta, H., et al. (eds.) Data Mining: Next Generation Challenges and Future Directions. MIT Press, Cambridge (2003). Ch. 3
Metadata
Title
Mining Long Patterns of Least-Support Items in Stream
Written by
Qinhua Huang
Weimin Ouyang
Copyright year
2015
DOI
https://doi.org/10.1007/978-3-319-22186-1_27