Elsevier

Information Sciences

Volume 185, Issue 1, 15 February 2012, Pages 114-127
Information Sciences

Fast algorithm for computing fixpoints of Galois connections induced by object-attribute relational data

https://doi.org/10.1016/j.ins.2011.09.023Get rights and content

Abstract

Fixpoints of Galois connections induced by object-attribute data tables represent important patterns that can be found in relational data. Such patterns are used in several data mining disciplines including formal concept analysis, frequent itemset and association rule mining, and Boolean factor analysis. In this paper we propose efficient algorithm for listing all fixpoints of Galois connections induced by object-attribute data. The algorithm, called FCbO, results as a modification of Kuznetsov’s CbO in which we use more efficient canonicity test. We describe the algorithm, prove its correctness, discuss efficiency issues, and present an experimental evaluation of its performance and comparison with other algorithms.

Section snippets

Introduction and Preliminaries

This paper describes a new algorithm for computing fixpoints of Galois connections. In particular, we focus on Galois connections [5], [12], [26], [33] that appear in formal concept analysis (FCA) – a method of qualitative analysis of object-attribute relational data [10], [33]. In a broader sense, the algorithm belongs to an important family of algorithms for listing combinatorial structures [11] and algorithms for biclustering [3], [29]. The algorithm we propose is a refinement of Kuznetsov’s

Canonicity test and CbO

In this section we recall CbO [19], [21] and the canonicity test. The next section will describe the new algorithm. In the sequel, we assume that X = {0, 1,  , m} and Y = {0, 1,  , n} are finite nonempty sets of objects and attributes, respectively, and I  X × Y. Since I is fixed, the concept-forming operators I and I defined by (5), (6) will be denoted just by and , respectively. The set of all formal concept in I will be denoted by B(X,Y,I).

CbO has been introduced in [19] (a paper in Russian) and

Improved canonicity test and FCbO

In this section, we propose an improvement of the canonicity test used by CbO that reduces the number of formal concepts computed multiple times. In a call tree like that in Fig. 1, such formal concepts are depicted by the black-square nodes. Our new test and the improved algorithm will reduce the number of such nodes in the call tree without altering the rest of the tree. The major problem with the original canonicity test used by CbO is that it is always used after a new formal concept is

Complexity and efficiency issues

It is a well-known fact that the limiting factor of computing all formal concepts is that the corresponding counting problem is #P-complete [18], [20]. Fortunately, if ∣I∣ is considerably small, one can get the set of all formal concepts in reasonable time even if X and Y are large. Therefore, there have been proposed various algorithms for FCA specialized on sparse incidence data. FCbO performs well in case of both sparse and dense data of reasonable size. From the point of view of the

Conclusions

We have introduced an algorithm called FCbO for computing formal concepts in object-attribute data tables. The algorithm results from CbO [19], [21] by introducing a new canonicity test. We have proved correctness of the algorithm and presented an experimental evaluation of its performance compared to the original CbO, Ganter’s NextClosure and also to Andrews’s In-Close, another contemporary derivative of CbO. The experiments have shown that FCbO significantly reduces the number of computed

Acknowledgment

Supported by Grant Nos. P103/10/1056 and P202/10/P360 of the Czech Science Foundation and by Grant No. MSM 6198959214. The algorithm described in this paper has been presented during ICCS 2009 and CLA 2010 [16] conferences. However, in ICCS 2009, the algorithm took part in a performance contest only and was not published in the proceedings, and the CLA 2010 paper contains the pseudocode and a brief summary of the algorithm without further analysis or proofs. The present paper is aimed as a

References (34)

  • J.H. Correia et al.

    Conceptual knowledge discovery – A human-centered approach

    Appl. Artif. Intell.

    (2003)
  • B. Ganter, Two basic algorithms in concept analysis. (Technical Report FB4-Preprint No. 831). TH Darmstadt,...
  • B. Ganter et al.

    Formal Concept Analysis

    (1999)
  • L.A. Goldberg

    Efficient Algorithms for Listing Combinatorial Structures

    (1993)
  • G.A. Gratzer

    General Lattice Theory

    (1998)
  • S. Hettich, S.D. Bay, The UCI KDD Archive University of California, Irvine, School of Information and Computer...
  • P. Krajca et al.

    Parallel algorithm for computing fixpoints of galois connections

    Ann. Math. Artif. Intell.

    (2010)
  • Cited by (94)

    • LCM from FCA point of view: A CbO-style algorithm with speed-up features

      2022, International Journal of Approximate Reasoning
    • LinCbO: Fast algorithm for computation of the Duquenne-Guigues basis

      2021, Information Sciences
      Citation Excerpt :

      Originally, we believed that CbO itself can make the computation faster. This motivation came from the paper by Outrata & Vychodil [24], where CbO is shown to be significantly faster than NextClosure when computing intents (see Table 5). The main reason for the speed-up is the fact that CbO uses set intersection to efficiently obtain extents during the tree descent.

    View all citing articles on Scopus
    View full text