Skip to main content

2017 | OriginalPaper | Buchkapitel

Reduce Scanning Time Incremental Algorithm (RSTIA) of Association Rules

verfasst von : Iyad Aqra, Muhammad Azani Hasibuan, Tutut Herawan

Erschienen in: Recent Advances on Soft Computing and Data Mining

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the real world where large amounts of data grow steadily, some old association rules can become stale, and new databases may give rise to some implicitly valid patterns or rules. Hence, updating rules or patterns is also important. A simple method for solving the updating problem is to reapply the mining algorithm to the entire database, but this approach is time-consuming. This paper reuses information from old frequent itemsets to improve its performance and addresses the problem of high cost access to incremental databases in which data are very changing by reducing the number of scanning times for the original database. A log file has been used to keep track of database changes whenever, a transaction has been added, deleted or even modified, a new record is added to the log file. This helps identifying the newly changes or updates in incremental databases. A new vertical mining technique has been used to minimize the number of scanning times to the original database. This algorithm has been implemented and developed using C#.net and applied to real data and gave a good result comparing with pure Apriori.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Rec. 22, 207–216 (1993)CrossRef Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Rec. 22, 207–216 (1993)CrossRef
2.
Zurück zum Zitat Bose, I., Mahapatra, R.K.: Business data mining - a machine learning perspective. Inf. Manag. 39(3), 211–225 (2001)CrossRef Bose, I., Mahapatra, R.K.: Business data mining - a machine learning perspective. Inf. Manag. 39(3), 211–225 (2001)CrossRef
3.
Zurück zum Zitat Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery in databases. AI Mag. 17(3), 37 (1996) Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery in databases. AI Mag. 17(3), 37 (1996)
4.
Zurück zum Zitat Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of 20th International Conference Very Large Data Bases, VLDB (1994) Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of 20th International Conference Very Large Data Bases, VLDB (1994)
5.
Zurück zum Zitat Lin, C.-W., et al.: Efficient updating of discovered high-utility itemsets for transaction deletion in dynamic databases. Adv. Eng. Informat. 29(1), 16–27 (2015)CrossRef Lin, C.-W., et al.: Efficient updating of discovered high-utility itemsets for transaction deletion in dynamic databases. Adv. Eng. Informat. 29(1), 16–27 (2015)CrossRef
6.
Zurück zum Zitat Chen, F., et al.: Principal association mining: an efficient classification approach. Knowl. Based Syst. 67, 16–25 (2014)CrossRef Chen, F., et al.: Principal association mining: an efficient classification approach. Knowl. Based Syst. 67, 16–25 (2014)CrossRef
7.
Zurück zum Zitat Zaki, M.J., et al.: New algorithms for fast discovery of association rules. In: KDD (1997) Zaki, M.J., et al.: New algorithms for fast discovery of association rules. In: KDD (1997)
8.
Zurück zum Zitat Park, J.S., Yu, P.S., Chen, M.-S.: Mining association rules with adjustable accuracy. In: Proceedings of the Sixth International Conference on Information and Knowledge Management. ACM (1997) Park, J.S., Yu, P.S., Chen, M.-S.: Mining association rules with adjustable accuracy. In: Proceedings of the Sixth International Conference on Information and Knowledge Management. ACM (1997)
9.
Zurück zum Zitat Li, Z.-C., He, P.-L., Lei, M.: A high efficient AprioriTid algorithm for mining association rule. In: Proceedings of 2005 International Conference on Machine Learning and Cybernetics. IEEE (2005) Li, Z.-C., He, P.-L., Lei, M.: A high efficient AprioriTid algorithm for mining association rule. In: Proceedings of 2005 International Conference on Machine Learning and Cybernetics. IEEE (2005)
10.
Zurück zum Zitat Schlegel, B., et al.: Scalable frequent itemset mining on many-core processors. In: Proceedings of the Ninth International Workshop on Data Management on New Hardware. ACM (2013) Schlegel, B., et al.: Scalable frequent itemset mining on many-core processors. In: Proceedings of the Ninth International Workshop on Data Management on New Hardware. ACM (2013)
11.
Zurück zum Zitat Lee, Y.-C., Hong, T.-P., Lin, W.-Y.: Mining association rules with multiple minimum supports using maximum constraints. Int. J. Approx. Reason. 40(1), 44–54 (2005)CrossRef Lee, Y.-C., Hong, T.-P., Lin, W.-Y.: Mining association rules with multiple minimum supports using maximum constraints. Int. J. Approx. Reason. 40(1), 44–54 (2005)CrossRef
12.
Zurück zum Zitat Li, W., Han, J., Pei, J.: CMAR: accurate and efficient classification based on multiple class-association rules. In: Proceedings IEEE International Conference on Data Mining, ICDM 2001. IEEE (2001) Li, W., Han, J., Pei, J.: CMAR: accurate and efficient classification based on multiple class-association rules. In: Proceedings IEEE International Conference on Data Mining, ICDM 2001. IEEE (2001)
13.
Zurück zum Zitat Zaki, M.J.: Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12(3), 372–390 (2000)CrossRef Zaki, M.J.: Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12(3), 372–390 (2000)CrossRef
14.
Zurück zum Zitat Ibrahim, H.M., Marghny, M., Abdelaziz, N.M.: Fast vertical mining using Boolean algebra. Editor. Pref. 6(1) (2015) Ibrahim, H.M., Marghny, M., Abdelaziz, N.M.: Fast vertical mining using Boolean algebra. Editor. Pref. 6(1) (2015)
15.
Zurück zum Zitat Cheung, D.W., et al.: Maintenance of discovered association rules in large databases: an incremental updating technique. In: Proceedings of the Twelfth International Conference on Data Engineering. IEEE (1996) Cheung, D.W., et al.: Maintenance of discovered association rules in large databases: an incremental updating technique. In: Proceedings of the Twelfth International Conference on Data Engineering. IEEE (1996)
Metadaten
Titel
Reduce Scanning Time Incremental Algorithm (RSTIA) of Association Rules
verfasst von
Iyad Aqra
Muhammad Azani Hasibuan
Tutut Herawan
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-51281-5_49