1997 | ReviewPaper | Chapter

Discovering all most specific sentences by randomized algorithms extended abstract

Authors : Dimitrios Gunopulos, Heikki Mannila, Sanjeev Saluja

Published in: Database Theory — ICDT '97

Publisher: Springer Berlin Heidelberg

Included in: Professional Book Archive

Data mining can in many instances be viewed as the task of computing a representation of a theory of a model or a database. In this paper we present a randomized algorithm that computes a representation of a theory in terms of its most specific sentences. In addition to randomization, the algorithm uses a generalization of the concept of hypergraph transversal. We apply the general algorithm to discovering maximal frequent sets in 0/1 data and to computing minimal keys in relations. We present some empirical results on the performance of these methods on real data. We also show some complexity-theoretic evidence of the hardness of these problems.
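
To make the notion of a maximal frequent set in 0/1 data concrete, the following is a minimal Python sketch of a randomized greedy expansion. It is an illustration only and not the algorithm of the paper: it covers just the randomized growth of a single set and omits the hypergraph-transversal step the abstract mentions, so repeated sampling is not guaranteed to find every maximal set. The function names, the `min_support` threshold, the `trials` parameter, and the toy data are all assumptions made for this example.

```python
import random


def support(rows, itemset):
    """Count the rows that contain every item of itemset."""
    return sum(1 for row in rows if itemset <= row)


def random_maximal_frequent_set(rows, items, min_support, rng):
    """Grow the empty set into a maximal frequent set, trying items in random order.

    Assumes the empty set is frequent, i.e. min_support <= len(rows).
    One pass is enough: support never increases when a set grows, so an
    item rejected once would also be rejected for any later, larger set.
    """
    current = frozenset()
    order = list(items)
    rng.shuffle(order)
    for item in order:
        candidate = current | {item}
        if support(rows, candidate) >= min_support:
            current = candidate
    return current


def sample_maximal_frequent_sets(rows, min_support, trials=50, seed=0):
    """Collect the distinct maximal frequent sets reached in `trials` random runs."""
    rng = random.Random(seed)
    items = sorted(set().union(*rows))
    return {
        random_maximal_frequent_set(rows, items, min_support, rng)
        for _ in range(trials)
    }


# Hypothetical 0/1 data: each row is the set of columns holding a 1.
rows = [frozenset(r) for r in ({1, 2}, {1, 2}, {2, 3}, {2, 3}, {1, 3})]
for itemset in sample_maximal_frequent_sets(rows, min_support=2):
    print(sorted(itemset))
```

In the toy data, a set counts as frequent when at least two rows contain it. Each run returns a set that cannot be extended by any further item without dropping below the threshold, which is what makes it maximal.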
