Skip to main content
Erschienen in: Soft Computing 22/2020

13.05.2020 | Methodologies and Application

Deep learning-based sequential pattern mining for progressive database

verfasst von: Aatif Jamshed, Bhawna Mallick, Pramod Kumar

Erschienen in: Soft Computing | Ausgabe 22/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sequential pattern mining (SPM) is one of the main application areas in the field of online business, e-commerce, bioinformatics, etc. The traditional approaches in SPM are unable to accurately mine the huge volume of data. Therefore, the proposed work employs a sequential mining model based on deep learning to minimize complexity in handling huge data. Application areas such as online retailing, finance, and e-commerce face a dynamic change in data, which results in non-stationary data. Therefore, our proposed work uses discrete wavelet analysis to convert non-stationary data into time series. In the proposed SPM, a reformed hybrid combination of convolutional neural network (CNN) with long short-term memory (LSTM) is designed to find out customer behavior and purchasing patterns in terms of time. CNN is used to find the concerned itemsets (frequent) at the end of the pattern and LSTM for finding the time interval among each pair of successive itemsets. The proposed work mines the sequential pattern from a progressive database that removes the obsolete data. Finally, the accuracy of the proposed work is compared with some traditional algorithms to demonstrate its robustness.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abboud Y, Brun A, Boyer A (2019) C3Ro: an efficient mining algorithm of extended-closed contiguous robust sequential patterns in noisy data. Expert Syst Appl 1(131):172–189CrossRef Abboud Y, Brun A, Boyer A (2019) C3Ro: an efficient mining algorithm of extended-closed contiguous robust sequential patterns in noisy data. Expert Syst Appl 1(131):172–189CrossRef
Zurück zum Zitat Agarwal S (2013) Data mining: data mining concepts and techniques. In: 2013 International conference on machine intelligence and research advancement. IEEE, pp 203–207 Agarwal S (2013) Data mining: data mining concepts and techniques. In: 2013 International conference on machine intelligence and research advancement. IEEE, pp 203–207
Zurück zum Zitat Ahmed CF, Tanbeer SK, Jeong BS, Lee YK, Choi HJ (2012) Single-pass incremental and interactive mining for weighted frequent patterns. Expert Syst Appl 39(9):7976–7994CrossRef Ahmed CF, Tanbeer SK, Jeong BS, Lee YK, Choi HJ (2012) Single-pass incremental and interactive mining for weighted frequent patterns. Expert Syst Appl 39(9):7976–7994CrossRef
Zurück zum Zitat Almasoud AM, Al-Khalifa HS, Al-Salman A (2015) Recent developments in data mining applications and techniques. In: 2015 Tenth international conference on digital information management (ICDIM). IEEE, pp 36–42 Almasoud AM, Al-Khalifa HS, Al-Salman A (2015) Recent developments in data mining applications and techniques. In: 2015 Tenth international conference on digital information management (ICDIM). IEEE, pp 36–42
Zurück zum Zitat Belhadi A, Djenouri Y, Lin JC, Zhang C, Cano A (2020) Exploring pattern mining algorithms for hashtag retrieval problem. IEEE Access 8:10569–10583CrossRef Belhadi A, Djenouri Y, Lin JC, Zhang C, Cano A (2020) Exploring pattern mining algorithms for hashtag retrieval problem. IEEE Access 8:10569–10583CrossRef
Zurück zum Zitat Chen MS, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 8(6):866–883CrossRef Chen MS, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 8(6):866–883CrossRef
Zurück zum Zitat Dam TL, Ramampiaro H, Nørvåg K, Duong QH (2019) Towards efficiently mining closed high utility itemsets from incremental databases. Knowl-Based Syst 165:13–29CrossRef Dam TL, Ramampiaro H, Nørvåg K, Duong QH (2019) Towards efficiently mining closed high utility itemsets from incremental databases. Knowl-Based Syst 165:13–29CrossRef
Zurück zum Zitat Djenouri Y, Belhadi A, Lin JCW, Cano A (2019a) Adapted k-nearest neighbors for detecting anomalies on spatio–temporal traffic flow. IEEE Access 7:10015–10027CrossRef Djenouri Y, Belhadi A, Lin JCW, Cano A (2019a) Adapted k-nearest neighbors for detecting anomalies on spatio–temporal traffic flow. IEEE Access 7:10015–10027CrossRef
Zurück zum Zitat Djenouri Y, Djenouri D, Belhadi A, Cano A (2019b) Exploiting GPU and cluster parallelism in single scan frequent itemset mining. Inf Sci 1(496):363–377CrossRef Djenouri Y, Djenouri D, Belhadi A, Cano A (2019b) Exploiting GPU and cluster parallelism in single scan frequent itemset mining. Inf Sci 1(496):363–377CrossRef
Zurück zum Zitat He Z, Zhang S, Wu J (2019) Significance-based discriminative sequential pattern mining. Expert Syst Appl 15(122):54–64CrossRef He Z, Zhang S, Wu J (2019) Significance-based discriminative sequential pattern mining. Expert Syst Appl 15(122):54–64CrossRef
Zurück zum Zitat Huang JW, Tseng CY, Ou JC, Chen MS (2008) A general model for sequential pattern mining with a progressive database. IEEE Trans Knowl Data Eng 20(9):1153–1167CrossRef Huang JW, Tseng CY, Ou JC, Chen MS (2008) A general model for sequential pattern mining with a progressive database. IEEE Trans Knowl Data Eng 20(9):1153–1167CrossRef
Zurück zum Zitat Huang SC, Chiou CC, Chiang JT, Wu CF (2020) Online sequential pattern mining and association discovery by advanced artificial intelligence and machine learning techniques. Soft Comput 24:8021–8039CrossRef Huang SC, Chiou CC, Chiang JT, Wu CF (2020) Online sequential pattern mining and association discovery by advanced artificial intelligence and machine learning techniques. Soft Comput 24:8021–8039CrossRef
Zurück zum Zitat Huynh B, Trinh C, Huynh H, Van TT, Vo B, Snasel V (2018) An efficient approach for mining sequential patterns using multiple threads on very large databases. Eng Appl Artif Intell 1(74):242–251CrossRef Huynh B, Trinh C, Huynh H, Van TT, Vo B, Snasel V (2018) An efficient approach for mining sequential patterns using multiple threads on very large databases. Eng Appl Artif Intell 1(74):242–251CrossRef
Zurück zum Zitat Huynh HM, Nguyen LTT, Vo B, Nguyen A, Tseng VS (2020a) Efficient methods for mining weighted clickstream patterns. Expert Syst Appl 142:112993CrossRef Huynh HM, Nguyen LTT, Vo B, Nguyen A, Tseng VS (2020a) Efficient methods for mining weighted clickstream patterns. Expert Syst Appl 142:112993CrossRef
Zurück zum Zitat Huynh HM, Nguyen LT, Vo B, Yun U, Oplatková ZK, Hong TP (2020b) Efficient algorithms for mining clickstream patterns using pseudo-IDLists. Future Gener Comput Syst 1(107):18–30CrossRef Huynh HM, Nguyen LT, Vo B, Yun U, Oplatková ZK, Hong TP (2020b) Efficient algorithms for mining clickstream patterns using pseudo-IDLists. Future Gener Comput Syst 1(107):18–30CrossRef
Zurück zum Zitat Karthikeyan L, Kumar DN (2013) Predictability of nonstationary time series using wavelet and EMD based ARMA models. J Hydrol 10(502):103–119CrossRef Karthikeyan L, Kumar DN (2013) Predictability of nonstationary time series using wavelet and EMD based ARMA models. J Hydrol 10(502):103–119CrossRef
Zurück zum Zitat Kim B, Yi G (2019) Location-based parallel sequential pattern mining algorithm. IEEE Access 7:128651–128658CrossRef Kim B, Yi G (2019) Location-based parallel sequential pattern mining algorithm. IEEE Access 7:128651–128658CrossRef
Zurück zum Zitat Lan K, Wang DT, Fong S, Liu LS, Wong KK, Dey N (2018) A survey of data mining and deep learning in bioinformatics. J Med Syst 42(8):139CrossRef Lan K, Wang DT, Fong S, Liu LS, Wong KK, Dey N (2018) A survey of data mining and deep learning in bioinformatics. J Med Syst 42(8):139CrossRef
Zurück zum Zitat Lim HK, Kim Y, Kim MK (2017) Failure prediction using sequential pattern mining in the wire bonding process. IEEE Trans Semicond Manuf 30(3):285–292CrossRef Lim HK, Kim Y, Kim MK (2017) Failure prediction using sequential pattern mining in the wire bonding process. IEEE Trans Semicond Manuf 30(3):285–292CrossRef
Zurück zum Zitat Lin JC, Gan W, Hong TP (2014) Efficiently maintaining the fast updated sequential pattern trees with sequence deletion. IEEE Access 24(2):1374–1383CrossRef Lin JC, Gan W, Hong TP (2014) Efficiently maintaining the fast updated sequential pattern trees with sequence deletion. IEEE Access 24(2):1374–1383CrossRef
Zurück zum Zitat Lin JC, Li T, Pirouz M, Zhang J, Fournier-Viger P (2019) High average-utility sequential pattern mining based on uncertain databases. Knowl Inf Syst 62:1199–1228CrossRef Lin JC, Li T, Pirouz M, Zhang J, Fournier-Viger P (2019) High average-utility sequential pattern mining based on uncertain databases. Knowl Inf Syst 62:1199–1228CrossRef
Zurück zum Zitat Liu X, Huang Z, Tong B (2016) Review on the data mining technology and the applications on financial analysis area. In: 2016 International conference on communication and electronics systems (ICCES). IEEE, pp 1–7 Liu X, Huang Z, Tong B (2016) Review on the data mining technology and the applications on financial analysis area. In: 2016 International conference on communication and electronics systems (ICCES). IEEE, pp 1–7
Zurück zum Zitat Lu Q, Lyu ZJ, Xiang Q, Zhou Y, Bao J (2017) Research on data mining service and its application case in complex industrial process. In: 2017 13th IEEE conference on automation science and engineering (CASE). IEEE, pp 1124–1129 Lu Q, Lyu ZJ, Xiang Q, Zhou Y, Bao J (2017) Research on data mining service and its application case in complex industrial process. In: 2017 13th IEEE conference on automation science and engineering (CASE). IEEE, pp 1124–1129
Zurück zum Zitat Perera D, Kay J, Koprinska I, Yacef K, Zaïane OR (2008) Clustering and sequential pattern mining of online collaborative learning data. IEEE Trans Knowl Data Eng 21(6):759–772CrossRef Perera D, Kay J, Koprinska I, Yacef K, Zaïane OR (2008) Clustering and sequential pattern mining of online collaborative learning data. IEEE Trans Knowl Data Eng 21(6):759–772CrossRef
Zurück zum Zitat Rjeily CB, Badr G, Al Hassani AH, Andres E (2018) Overview on sequential mining algorithms and their extensions. In: Recent trends in computer applications. Springer, Cham, pp 3–16 Rjeily CB, Badr G, Al Hassani AH, Andres E (2018) Overview on sequential mining algorithms and their extensions. In: Recent trends in computer applications. Springer, Cham, pp 3–16
Zurück zum Zitat Saleti S, Subramanyam RB (2019) A novel mapreduce algorithm for distributed mining of sequential patterns using co-occurrence information. Appl Intell 49(1):150–171CrossRef Saleti S, Subramanyam RB (2019) A novel mapreduce algorithm for distributed mining of sequential patterns using co-occurrence information. Appl Intell 49(1):150–171CrossRef
Zurück zum Zitat Shaji SP (2019) Prediction and diagnosis of heart disease patients using data mining technique. In: 2019 International conference on communication and signal processing (ICCSP). IEEE, pp 0848–0852 Shaji SP (2019) Prediction and diagnosis of heart disease patients using data mining technique. In: 2019 International conference on communication and signal processing (ICCSP). IEEE, pp 0848–0852
Zurück zum Zitat Singh Y, Chauhan AS (2009) Neural networks in data mining. J Theor Appl Inf Technol 5(1):37–42 Singh Y, Chauhan AS (2009) Neural networks in data mining. J Theor Appl Inf Technol 5(1):37–42
Zurück zum Zitat Soltani S (2002) On the use of the wavelet decomposition for time series prediction. Neurocomputing 48(1–4):267–277CrossRef Soltani S (2002) On the use of the wavelet decomposition for time series prediction. Neurocomputing 48(1–4):267–277CrossRef
Zurück zum Zitat Stahl F, Jordanov I (2012) An overview of the use of neural networks for data mining tasks. Wiley Interdiscip Rev Data Min Knowl Discov 2(3):193–208CrossRef Stahl F, Jordanov I (2012) An overview of the use of neural networks for data mining tasks. Wiley Interdiscip Rev Data Min Knowl Discov 2(3):193–208CrossRef
Zurück zum Zitat Wu CL, Koh JL, An PY (2005) Improved sequential pattern mining using an extended bitmap representation. In: Int Conf Database Expert Syst Appl. Springer, Berlin, pp 776–785 Wu CL, Koh JL, An PY (2005) Improved sequential pattern mining using an extended bitmap representation. In: Int Conf Database Expert Syst Appl. Springer, Berlin, pp 776–785
Zurück zum Zitat Yang Z, Kitsuregawa M (2005) LAPIN-SPAM: an improved algorithm for mining sequential pattern. In: 21st International conference on data engineering workshops (ICDEW’05). IEEE, pp 1222–1222 Yang Z, Kitsuregawa M (2005) LAPIN-SPAM: an improved algorithm for mining sequential pattern. In: 21st International conference on data engineering workshops (ICDEW’05). IEEE, pp 1222–1222
Zurück zum Zitat Yun CH, Chen MS (2007) Mining mobile sequential patterns in a mobile commerce environment. IEEE Trans Syst Man Cybern Part C (Appl Rev) 37(2):278–295CrossRef Yun CH, Chen MS (2007) Mining mobile sequential patterns in a mobile commerce environment. IEEE Trans Syst Man Cybern Part C (Appl Rev) 37(2):278–295CrossRef
Zurück zum Zitat Yun U, Lee G, Ryu KH (2014) Mining maximal frequent patterns by considering weight conditions over data streams. Knowl-Based Syst 1(55):49–65CrossRef Yun U, Lee G, Ryu KH (2014) Mining maximal frequent patterns by considering weight conditions over data streams. Knowl-Based Syst 1(55):49–65CrossRef
Zurück zum Zitat Zhang H (2011) A short introduction to data mining and its applications. In: 2011 International conference on management and service science. IEEE, pp 1–4 Zhang H (2011) A short introduction to data mining and its applications. In: 2011 International conference on management and service science. IEEE, pp 1–4
Zurück zum Zitat Zhang C, Almpanidis G, Wang W, Liu C (2018) An empirical evaluation of high utility itemset mining algorithms. Expert Syst Appl 101:91–115CrossRef Zhang C, Almpanidis G, Wang W, Liu C (2018) An empirical evaluation of high utility itemset mining algorithms. Expert Syst Appl 101:91–115CrossRef
Metadaten
Titel
Deep learning-based sequential pattern mining for progressive database
verfasst von
Aatif Jamshed
Bhawna Mallick
Pramod Kumar
Publikationsdatum
13.05.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 22/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-020-05015-2

Weitere Artikel der Ausgabe 22/2020

Soft Computing 22/2020 Zur Ausgabe