Skip to main content
Top
Published in: Soft Computing 5/2012

01-05-2012 | Focus

Mining fuzzy association rules from low-quality data

Authors: A. M. Palacios, M. J. Gacto, J. Alcalá-Fdez

Published in: Soft Computing | Issue 5/2012

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Data mining is most commonly used in attempts to induce association rules from databases which can help decision-makers easily analyze the data and make good decisions regarding the domains concerned. Different studies have proposed methods for mining association rules from databases with crisp values. However, the data in many real-world applications have a certain degree of imprecision. In this paper we address this problem, and propose a new data-mining algorithm for extracting interesting knowledge from databases with imprecise data. The proposed algorithm integrates imprecise data concepts and the fuzzy apriori mining algorithm to find interesting fuzzy association rules in given databases. Experiments for diagnosing dyslexia in early childhood were made to verify the performance of the proposed algorithm.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: SIGMOD, Washington, D.C., USA, pp 207–216 Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: SIGMOD, Washington, D.C., USA, pp 207–216
go back to reference Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: International conference on very large data bases, Santiago de Chile, pp 487–499 Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: International conference on very large data bases, Santiago de Chile, pp 487–499
go back to reference Ajuriaguerra J (1976) Manual de psiquiatría infantil. Barcelona, Toray-Masson Ajuriaguerra J (1976) Manual de psiquiatría infantil. Barcelona, Toray-Masson
go back to reference Alatas B, Akin E (2006) An efficient genetic algorithm for automated mining of both positive and negative quantitative association rules. Soft Comput Fusion Found Methodol Appl 10(3):230–237 Alatas B, Akin E (2006) An efficient genetic algorithm for automated mining of both positive and negative quantitative association rules. Soft Comput Fusion Found Methodol Appl 10(3):230–237
go back to reference Alcala-Fdez J, Fernandez A, Luego J, Derrac J, Garcia S, Sanchez L, Herrera F (2011) Keel data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J Multiple-Valued Log Soft Comput 17(2–3):255–287 Alcala-Fdez J, Fernandez A, Luego J, Derrac J, Garcia S, Sanchez L, Herrera F (2011) Keel data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J Multiple-Valued Log Soft Comput 17(2–3):255–287
go back to reference Alcala-Fdez J, Flugy-Pape N, Bonarini A, Herrera F (2010) Analysis of the effectiveness of the genetic algorithms based on extraction of association rules. Fundamenta Informaticae 98(1):1–14MathSciNet Alcala-Fdez J, Flugy-Pape N, Bonarini A, Herrera F (2010) Analysis of the effectiveness of the genetic algorithms based on extraction of association rules. Fundamenta Informaticae 98(1):1–14MathSciNet
go back to reference Baudrit C, Dubois D, Perror N (2008) Representing parametric probabilistic models tainted with imprecision. Fuzzy Sets Syst 15(1):1913–1928CrossRef Baudrit C, Dubois D, Perror N (2008) Representing parametric probabilistic models tainted with imprecision. Fuzzy Sets Syst 15(1):1913–1928CrossRef
go back to reference Bertoluzza C, Gil M, Ralescu D (2003) Statistical modeling. Analysis and management of fuzzy data. Springer, Berlin Bertoluzza C, Gil M, Ralescu D (2003) Statistical modeling. Analysis and management of fuzzy data. Springer, Berlin
go back to reference Delgado M, Marín N, Sánchez D, Vila M (2003) Fuzzy association rules: general model and applications. IEEE Trans Fuzzy Syst 11(2):214–225CrossRef Delgado M, Marín N, Sánchez D, Vila M (2003) Fuzzy association rules: general model and applications. IEEE Trans Fuzzy Syst 11(2):214–225CrossRef
go back to reference Dubois D, Hullermeier E, Prade H (2006) A systematic approach to the assessment of fuzzy association rules. Data Min Knowl Disc 13(2):167–192MathSciNetCrossRef Dubois D, Hullermeier E, Prade H (2006) A systematic approach to the assessment of fuzzy association rules. Data Min Knowl Disc 13(2):167–192MathSciNetCrossRef
go back to reference Dubois D, Prade H, Sudamp T (2005) On the representation, measurement, and discovery of fuzzy associations. IEEE Trans Fuzzy Syst 13(2):250–262CrossRef Dubois D, Prade H, Sudamp T (2005) On the representation, measurement, and discovery of fuzzy associations. IEEE Trans Fuzzy Syst 13(2):250–262CrossRef
go back to reference Han J, Kamber M (2006) Data mining: concepts and techniques, 2nd edn. Morgan Kaufmann, San FransiscoMATH Han J, Kamber M (2006) Data mining: concepts and techniques, 2nd edn. Morgan Kaufmann, San FransiscoMATH
go back to reference Han J, Pei J, Yin Y (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min Knowl Dis 8(1):53–87MathSciNetCrossRef Han J, Pei J, Yin Y (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min Knowl Dis 8(1):53–87MathSciNetCrossRef
go back to reference Hong T, Kuo C, Chi S (1999) Mining association rules from quantitative data. Intell Data Anal 3(5):363–376MATHCrossRef Hong T, Kuo C, Chi S (1999) Mining association rules from quantitative data. Intell Data Anal 3(5):363–376MATHCrossRef
go back to reference Hong T, Kuo C, Chi S (2001) Trade-off between time complexity and number of rules for fuzzy mining from quantitative data. Int J Uncertain Fuzziness Knowl Based Syst 9(5):587–604MATH Hong T, Kuo C, Chi S (2001) Trade-off between time complexity and number of rules for fuzzy mining from quantitative data. Int J Uncertain Fuzziness Knowl Based Syst 9(5):587–604MATH
go back to reference Hong T, Lee Y (2008) An overview of mining fuzzy association rules. In: Bustince H, Herrera F, Montero J (eds) Studies in fuzziness and soft computing, vol 220. Springer, Berlin, pp 397–410 Hong T, Lee Y (2008) An overview of mining fuzzy association rules. In: Bustince H, Herrera F, Montero J (eds) Studies in fuzziness and soft computing, vol 220. Springer, Berlin, pp 397–410
go back to reference Hullermeier E, Yi Y (2007) In defense of fuzzy association analysis. IEEE Trans Syst Man Cybern Part B Cybern 37(4):1039–1043CrossRef Hullermeier E, Yi Y (2007) In defense of fuzzy association analysis. IEEE Trans Syst Man Cybern Part B Cybern 37(4):1039–1043CrossRef
go back to reference Kaufmann A, Gupta M (1991) Introduction to fuzzy arithmetic: theory and applications. Van Nostrand Reinhold, New YorkMATH Kaufmann A, Gupta M (1991) Introduction to fuzzy arithmetic: theory and applications. Van Nostrand Reinhold, New YorkMATH
go back to reference Kaya M (2006) Multi-objective genetic algorithm based approaches for mining optimized fuzzy association rules. Soft Comput Fusion Found Methodol Appl 10(7):578–586MathSciNetMATH Kaya M (2006) Multi-objective genetic algorithm based approaches for mining optimized fuzzy association rules. Soft Comput Fusion Found Methodol Appl 10(7):578–586MathSciNetMATH
go back to reference Limbourg P (2005) Multi-objective optimization of problems with epistemic uncertainty. In Proceedings of EMO, pp 413–427 Limbourg P (2005) Multi-objective optimization of problems with epistemic uncertainty. In Proceedings of EMO, pp 413–427
go back to reference Mladenic D, Lavrac N, Bohanec M, Moyle S (2002) Data mining and decision support: integration and collaboration. Kluwer, Norwell Mladenic D, Lavrac N, Bohanec M, Moyle S (2002) Data mining and decision support: integration and collaboration. Kluwer, Norwell
go back to reference Palacios A, Sanchez L, Couso I (2011) Future performance modelling in athletism with low quality data-based GFSs. J Multiple-Valued Log Soft Comput 17(2–3):207–228 Palacios A, Sanchez L, Couso I (2011) Future performance modelling in athletism with low quality data-based GFSs. J Multiple-Valued Log Soft Comput 17(2–3):207–228
go back to reference Sanchez L, Couso I, Casillas J (2007) Modelling vague data with genetic fuzzy systems under a combination of crisp and imprecise criteria. In: IEEE symposium on computational intelligence inmulticriteria decision making, pp 30–37 Sanchez L, Couso I, Casillas J (2007) Modelling vague data with genetic fuzzy systems under a combination of crisp and imprecise criteria. In: IEEE symposium on computational intelligence inmulticriteria decision making, pp 30–37
go back to reference Sanchez L, Suarez M, Villar J, Couso I (2008) Mutual information-based feature selection and partition design in fuzzy rule-based classifiers from vague data. Int J Approx Reason 49:607–622CrossRef Sanchez L, Suarez M, Villar J, Couso I (2008) Mutual information-based feature selection and partition design in fuzzy rule-based classifiers from vague data. Int J Approx Reason 49:607–622CrossRef
go back to reference Sun K, Fengshan B (2008) Mining weighted association rules without preassigned weights. IEEE Trans Knowl Data Eng 20(4):489–495CrossRef Sun K, Fengshan B (2008) Mining weighted association rules without preassigned weights. IEEE Trans Knowl Data Eng 20(4):489–495CrossRef
go back to reference Thomson P, Gilchrist P (1996) Dyslexia: a multidisciplinary approach. Chapman and Hall, London Thomson P, Gilchrist P (1996) Dyslexia: a multidisciplinary approach. Chapman and Hall, London
go back to reference Toro J, Cervera M (1980) TALE Test de Análisis de la lectoescritura. Pablo del Río, Madrid Toro J, Cervera M (1980) TALE Test de Análisis de la lectoescritura. Pablo del Río, Madrid
go back to reference Villar J, Otero A, Otero J, Sanchez L (2009) Taximeter verification using imprecise data from gps and multiobjective algorithms. Eng Appl Artif Intell 22:250–260CrossRef Villar J, Otero A, Otero J, Sanchez L (2009) Taximeter verification using imprecise data from gps and multiobjective algorithms. Eng Appl Artif Intell 22:250–260CrossRef
go back to reference Vinuessa M, Coll J (1984) Tratado de atletismo. Servicio Geográfico del Ejército Vinuessa M, Coll J (1984) Tratado de atletismo. Servicio Geográfico del Ejército
go back to reference Wu B, Sun C (2001) Interval-valued statistics, fuzzy logic, and their use in computational semantics. J Intell Fuzzy Syst 1–2(11):1–7 Wu B, Sun C (2001) Interval-valued statistics, fuzzy logic, and their use in computational semantics. J Intell Fuzzy Syst 1–2(11):1–7
Metadata
Title
Mining fuzzy association rules from low-quality data
Authors
A. M. Palacios
M. J. Gacto
J. Alcalá-Fdez
Publication date
01-05-2012
Publisher
Springer-Verlag
Published in
Soft Computing / Issue 5/2012
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-011-0775-3

Other articles of this Issue 5/2012

Soft Computing 5/2012 Go to the issue

Premium Partner