Elsevier

Social Networks

Volume 22, Issue 2, May 2000, Pages 159-172
Social Networks

Determining groups from the clique structure in large social networks

https://doi.org/10.1016/S0378-8733(00)00021-6Get rights and content

Abstract

Ethnographers have traditionally defined human groups as disjoint collections of individuals who are linked to each other by regular interaction, shared perceptions and affective ties. They are internally differentiated: some members occupy a central position in the group, others are on the periphery or somewhere in between. This way of characterising a group is intuitive to seasoned observers but social network analysts seek a more systematic method for determining groups.

Freeman [Freeman, L.C., 1996. Cliques, Galois lattices, and the structure of human social groups, Social Networks 18, 173–187.] describes a formal model based on the concept of overlapping social network cliques. Freeman's analysis produces results that are consistent with ethnographic results but his technique is not easily implemented for very large data sets. This paper describes two algorithms based on Freeman's clique–lattice analysis. The first implements his technique exactly; the second is a modification that exploits the clique structure, thereby enabling the analysis of large complex networks. We apply both algorithms to a real data set and compare the results.

Introduction

In any human organisation in which individuals interact with each other on a daily basis, groups emerge quite naturally and often deliberately. The group structure in a network gives us new insight into the dynamics of the organisation. In particular, it helps us understand how information spreads throughout the organisation. This interest in group structure was motivated by its applications to information diffusion through a social network Rapoport, 1953a, Rapoport, 1953b, Erbe, 1962, Weenig, 1993, Yamaguchi, 1994, Frank, 1996, Buskens, 1998. Our particular interest is in modelling the rate and accuracy of the transmission of information in a stratified population (Bartholomew, 1982, chapter 9).

Ethnographers have traditionally defined human groups as collections of individuals with little or no overlap, linked to each other by regular interaction. Within such groups, members are aware of the existence of the group, they develop shared perceptions and affective ties, and each occupies a particular position in the group: at the core, on the periphery or somewhere in between. An individual is likely to be a member of several groups associated with particular contexts that might be defined for a particular time frame (Freeman, 1992). However, if the groups are well defined in terms of context there should be very little overlap between groups.

The ethnographic description of a group is based on intuitive ideas and does not help us determine how to systematically partition a network into meaningful subnetworks with as little human intervention as possible. As a result, social network researchers have made many attempts to formalise the group concept Frank, 1995, Flache and Macy, 1997, Zeggelink et al., 1997. One definition considers the concept of a clique, which is defined as a maximal subnetwork containing three or more actors all of whom are connected to each other. However, cliques are typically too small to satisfy the properties of an ethnographic group. In addition, an individual may belong to many different cliques, making the segmentation into subnetworks an almost impossible task.

Friedell (1967) introduced the concept of using semilattices to model complex organisational structure. Freeman (1996) extended this concept to Galois lattices to show that the underlying group structure in a social network is revealed in the overlapping patterns of cliques. He demonstrates his technique on two classical data sets and shows that the group composition he obtains using his formal method matches the group structure observed by the ethnographers. Freeman's technique enables a determination of the structure from a representation of the clique lattice. In the case of small networks (of the order of 20 cliques), this is often possible by visual inspection of the lattice but it is not feasible for very large data sets. In his original examples, the data sets consisted of less than 20 actors while the social network we study is at least five times this size. Two algorithms were developed to interpret the information in a formal description of the latticial representation of larger data sets rather than rely on visual interpretation. The first algorithm is an implementation of Freeman's technique. The second is a modification arising out of attempts at solving problems arising when the former was applied to a large complex network. The modified algorithm exploits a property of clique lattices in which the overlap pattern in the lower layers of the lattice mirrors the group structure. Both techniques are described in the following section. Resulting group structures are compared in the final section.

Section snippets

The group structure inherent in a clique lattice

A Galois lattice is an algebraic representation that permits the representation and visualisation of two-mode social network data (see Freeman and White, 1993) as duality extensions/intensions (see Duquenne, 1999). One mode is the set of actors and the other is a set of attributes describing the actors; in our case the attributes are membership of particular cliques in the network. By encoding these two modes in a Galois lattice, we are able to determine the patterning revealed by the clique

Application to a large social network

Let us now consider social network data collected from a military organisation as part of an information flow study. The data we refer to were responses to a questionnaire in which the respondents were given a list of names of all the members of the organisation and asked to rate the importance of information exchanged with every person on the list. The importance rating ranged from 0 to 5, with 0 rating assigned to those individuals with whom they exchanged no information. The 113 subjects

Conclusion

Freeman's bridging clique technique appears to give good results for networks whose clique structure can be represented by three-layered Galois lattices. When the majority of cliques are bridging cliques by Freeman's definition, a large proportion of clique members, including all the core actors, is eliminated from the group structure. This suggests that the definition of a bridging clique is not helpful in this case. The modified technique appears to address this problem by exploiting the

Acknowledgements

The author is grateful to David Matthews, for kindly producing some of the diagrams, and to Tim Pattison for helpful comments on this work.

References (16)

There are more references available in the full text version of this article.

Cited by (21)

  • Detecting large cohesive subgroups with high clustering coefficients in social networks

    2016, Social Networks
    Citation Excerpt :

    Interactions using online social media platforms, mobile phones and email are commonly represented as graphs, whose structural analysis may reveal interesting insights about the underlying social networks. For example, community detection techniques have been extensively used to identify groups of people characterized by a high level of interactions and to understand social communication throughout the network by finding cohesive subgroups (Scott, 2000; Falzon, 2000; Fortunato, 2010; Schaeffer, 2007). A cohesive subgroup is a “tightly knit” subset of actors in a social network, which was originally modeled using the graph-theoretic concept of a clique (Luce and Perry, 1949).

  • Knowledge discovery in social networks by using a logic-based treatment of implications

    2015, Knowledge-Based Systems
    Citation Excerpt :

    There are other methods to retrieve knowledge from the social networks usually based on finding clusters representing communities. The most used methods are based on graph exploration as clique techniques [17], biclustering [23,25], spectral graph partioning [13], etc. The different approaches use linguistic approaches, text mining or machine learning techniques combined with methods of network analysis, such as centrality measures.

  • Familial groups in social networks

    2013, Social Networks
    Citation Excerpt :

    Freeman gave a definition for social community which was more general than a clique by imposing overlap conditions on maximal cliques. Falzon (2000) modified that definition and described algorithms to find such communities. The notion of network robustness captures the property that desirable characteristics of the network still hold under some failures or disconnections.

  • Social and semantic coevolution in knowledge networks

    2010, Social Networks
    Citation Excerpt :

    The identification of this kind of dual structure in networks has been the focus of several qualitative and quantitative studies within the framework of bipartite graphs. ( Breiger, 1974; Wille, 1992; Freeman, 1996; White and Duquenne, 1996; Falzon, 2000; Roth and Bourgine, 2005; Lehmann et al., 2008). Freeman and White (1993) have notably shown how to jointly group agents and events they participate in.

  • On succinct representation of knowledge community taxonomies with formal concept analysis

    2008, International Journal of Foundations of Computer Science
View all citing articles on Scopus
View full text