2014 | OriginalPaper | Book chapter
Complexity of Rule Sets Induced from Incomplete Data Sets Using Global Probabilistic Approximations
Authors: Patrick G. Clark, Jerzy W. Grzymala-Busse
Published in: Information Processing and Management of Uncertainty in Knowledge-Based Systems
Publisher: Springer International Publishing
We consider incomplete data sets under two interpretations of missing attribute values: lost values and “do not care” conditions. In our data mining experiments we use three kinds of global probabilistic approximations: singleton, subset and concept. Validation results for such data, obtained with global probabilistic approximations, were published recently. The novelty of this paper is a study of the complexity of the corresponding rule sets, measured by the number of rules and the number of rule conditions. Our main result is that the simplest rule sets are induced from data sets in which missing attribute values are interpreted as “do not care” conditions and rule sets are induced using subset probabilistic approximations.
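The abstract names the two interpretations of missing values and the three approximations without defining them. The following Python sketch illustrates one way they might be computed. It assumes the definitions commonly used in the rough-set literature on incomplete data: a lost value ("?") keeps a case out of every block of that attribute, a "do not care" value ("*") puts it into every block, and each approximation keeps the cases or characteristic sets K(x) whose conditional probability Pr(X | K(x)) is at least alpha. The function names, the markers "?" and "*", and the four-case toy table are all hypothetical and are not taken from the paper.

```python
from fractions import Fraction

LOST, DONT_CARE = "?", "*"  # assumed markers for the two interpretations


def characteristic_set(data, x):
    """Cases that cannot be told apart from case x using x's specified values."""
    result = set(range(len(data)))
    for a, v in enumerate(data[x]):
        if v in (LOST, DONT_CARE):
            continue  # a missing value of x puts no restriction on attribute a
        # Block of (a, v): cases whose value is v, plus "do not care" cases.
        # Cases with a lost value on a are left out of every block.
        block = {y for y, row in enumerate(data) if row[a] in (v, DONT_CARE)}
        result &= block
    return result


def approximations(data, concept, alpha):
    """Return (singleton, subset, concept) approximations of `concept`."""
    universe = range(len(data))
    k = [characteristic_set(data, x) for x in universe]

    def pr(x):
        # Conditional probability of the concept given K(x).
        return Fraction(len(concept & k[x]), len(k[x]))

    singleton = {x for x in universe if pr(x) >= alpha}
    subset = set().union(*(k[x] for x in universe if pr(x) >= alpha))
    concept_appr = set().union(*(k[x] for x in concept if pr(x) >= alpha))
    return singleton, subset, concept_appr


# Hypothetical toy table: columns are (Temperature, Headache).
DATA = [
    ("high", "yes"),
    ("high", LOST),
    (DONT_CARE, "no"),
    ("normal", "no"),
]

if __name__ == "__main__":
    print(approximations(DATA, {0, 1}, Fraction(1, 2)))
```

With alpha = 1 each of the three sets is a lower approximation, and with any alpha just above 0 each is an upper approximation. Intermediate thresholds give the probabilistic approximations from which rule sets would then be induced.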