
2019 | OriginalPaper | Chapter

Discovering Association Rules Using R. A Case Study on Retail’s Database

Authors: Juan Manuel Báez Acuña, Clara Anuncia Paredes Cabañas, Gustavo Sosa-Cabrera, María E. García-Díaz

Published in: Computer Science – CACIC 2018

Publisher: Springer International Publishing


Abstract

Today, strong competition forces retail businesses to seek new strategies to ensure their survival. To this end, organizations have understood that the data held in their transactional databases can serve as raw material for business growth if exploited properly. The main objective of this research is to apply data mining techniques to discover association rules in purely commercial transactional data, covering a 10-year study period at a household appliance and furniture retailer. The data selection and preparation phases are described, along with their cost in man-hours. In the modeling phase, the Apriori and Eclat algorithms implemented in the arules package for R were executed, and both the resulting associations and the execution times were compared. The results show relevant patterns in customers' buying behavior, such as those relating items and accessories' prices.
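
As a rough illustration of what mining association rules involves, the following is a minimal sketch in Python of support, confidence, and a level-wise (Apriori-style) frequent itemset search. It is not the authors' code and does not use the arules package; the transactions, item names, and function names are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy transactions (not data from the study).
transactions = [
    {"tv", "tv_stand", "hdmi_cable"},
    {"tv", "hdmi_cable"},
    {"sofa", "cushion"},
    {"tv", "tv_stand"},
    {"sofa", "tv"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(lhs, rhs, transactions):
    """Confidence of the rule lhs -> rhs: support(lhs and rhs) / support(lhs)."""
    both = set(lhs) | set(rhs)
    return support(both, transactions) / support(lhs, transactions)


def frequent_itemsets(transactions, min_support):
    """Level-wise search: build size-k candidates from frequent (k-1)-itemsets."""
    items = sorted({i for t in transactions for i in t})
    level = [frozenset([i]) for i in items
             if support([i], transactions) >= min_support]
    result = {}
    k = 1
    while level:
        for s in level:
            result[s] = support(s, transactions)
        k += 1
        prev = set(level)
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Keep a candidate only if all of its (k-1)-subsets are frequent
        # and its own support reaches the threshold.
        level = [c for c in candidates
                 if all(frozenset(sub) in prev for sub in combinations(c, k - 1))
                 and support(c, transactions) >= min_support]
    return result


frequent = frequent_itemsets(transactions, min_support=0.4)
rule_conf = confidence({"hdmi_cable"}, {"tv"}, transactions)
```

Eclat searches for the same frequent itemsets but works on a vertical layout, intersecting the lists of transaction identifiers for each item instead of rescanning the transactions at every level.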


Metadata

Title: Discovering Association Rules Using R. A Case Study on Retail’s Database
Authors: Juan Manuel Báez Acuña, Clara Anuncia Paredes Cabañas, Gustavo Sosa-Cabrera, María E. García-Díaz
Copyright Year: 2019
DOI: https://doi.org/10.1007/978-3-030-20787-8_14
