Determining groups from the clique structure in large social networks

doi:10.1016/S0378-8733(00)00021-6

Social Networks

Volume 22, Issue 2, May 2000, Pages 159-172

https://doi.org/10.1016/S0378-8733(00)00021-6 Get rights and content

Abstract

Ethnographers have traditionally defined human groups as disjoint collections of individuals who are linked to each other by regular interaction, shared perceptions and affective ties. They are internally differentiated: some members occupy a central position in the group, others are on the periphery or somewhere in between. This way of characterising a group is intuitive to seasoned observers but social network analysts seek a more systematic method for determining groups.

Freeman [Freeman, L.C., 1996. Cliques, Galois lattices, and the structure of human social groups, Social Networks 18, 173–187.] describes a formal model based on the concept of overlapping social network cliques. Freeman's analysis produces results that are consistent with ethnographic results but his technique is not easily implemented for very large data sets. This paper describes two algorithms based on Freeman's clique–lattice analysis. The first implements his technique exactly; the second is a modification that exploits the clique structure, thereby enabling the analysis of large complex networks. We apply both algorithms to a real data set and compare the results.

Introduction

In any human organisation in which individuals interact with each other on a daily basis, groups emerge quite naturally and often deliberately. The group structure in a network gives us new insight into the dynamics of the organisation. In particular, it helps us understand how information spreads throughout the organisation. This interest in group structure was motivated by its applications to information diffusion through a social network Rapoport, 1953a, Rapoport, 1953b, Erbe, 1962, Weenig, 1993, Yamaguchi, 1994, Frank, 1996, Buskens, 1998. Our particular interest is in modelling the rate and accuracy of the transmission of information in a stratified population (Bartholomew, 1982, chapter 9).

Ethnographers have traditionally defined human groups as collections of individuals with little or no overlap, linked to each other by regular interaction. Within such groups, members are aware of the existence of the group, they develop shared perceptions and affective ties, and each occupies a particular position in the group: at the core, on the periphery or somewhere in between. An individual is likely to be a member of several groups associated with particular contexts that might be defined for a particular time frame (Freeman, 1992). However, if the groups are well defined in terms of context there should be very little overlap between groups.

The ethnographic description of a group is based on intuitive ideas and does not help us determine how to systematically partition a network into meaningful subnetworks with as little human intervention as possible. As a result, social network researchers have made many attempts to formalise the group concept Frank, 1995, Flache and Macy, 1997, Zeggelink et al., 1997. One definition considers the concept of a clique, which is defined as a maximal subnetwork containing three or more actors all of whom are connected to each other. However, cliques are typically too small to satisfy the properties of an ethnographic group. In addition, an individual may belong to many different cliques, making the segmentation into subnetworks an almost impossible task.

Friedell (1967) introduced the concept of using semilattices to model complex organisational structure. Freeman (1996) extended this concept to Galois lattices to show that the underlying group structure in a social network is revealed in the overlapping patterns of cliques. He demonstrates his technique on two classical data sets and shows that the group composition he obtains using his formal method matches the group structure observed by the ethnographers. Freeman's technique enables a determination of the structure from a representation of the clique lattice. In the case of small networks (of the order of 20 cliques), this is often possible by visual inspection of the lattice but it is not feasible for very large data sets. In his original examples, the data sets consisted of less than 20 actors while the social network we study is at least five times this size. Two algorithms were developed to interpret the information in a formal description of the latticial representation of larger data sets rather than rely on visual interpretation. The first algorithm is an implementation of Freeman's technique. The second is a modification arising out of attempts at solving problems arising when the former was applied to a large complex network. The modified algorithm exploits a property of clique lattices in which the overlap pattern in the lower layers of the lattice mirrors the group structure. Both techniques are described in the following section. Resulting group structures are compared in the final section.

Section snippets

The group structure inherent in a clique lattice

A Galois lattice is an algebraic representation that permits the representation and visualisation of two-mode social network data (see Freeman and White, 1993) as duality extensions/intensions (see Duquenne, 1999). One mode is the set of actors and the other is a set of attributes describing the actors; in our case the attributes are membership of particular cliques in the network. By encoding these two modes in a Galois lattice, we are able to determine the patterning revealed by the clique

Application to a large social network

Let us now consider social network data collected from a military organisation as part of an information flow study. The data we refer to were responses to a questionnaire in which the respondents were given a list of names of all the members of the organisation and asked to rate the importance of information exchanged with every person on the list. The importance rating ranged from 0 to 5, with 0 rating assigned to those individuals with whom they exchanged no information. The 113 subjects

Conclusion

Freeman's bridging clique technique appears to give good results for networks whose clique structure can be represented by three-layered Galois lattices. When the majority of cliques are bridging cliques by Freeman's definition, a large proportion of clique members, including all the core actors, is eliminated from the group structure. This suggests that the definition of a bridging clique is not helpful in this case. The modified technique appears to address this problem by exploiting the

Acknowledgements

The author is grateful to David Matthews, for kindly producing some of the diagrams, and to Tim Pattison for helpful comments on this work.

References (16)

V. Buskens
The social structure of trust
Social Networks
(1998)
V. Duquenne
Latticial structures in data analysis
Theoretical Computer Science
(1999)
K.A. Frank
Identifying cohesive subgroups
Social Networks
(1995)
K.A. Frank
Mapping interactions within and between cohesive subgroups
Social Networks
(1996)
L.C. Freeman
Cliques, Galois lattices, and the structure of human social groups
Social Networks
(1996)
K. Yamaguchi
The flow of information through social networks: diagonal-free measures of inefficiency and the structural determinants of inefficiency
Social Networks
(1994)
D.J. Bartholomew
Stochastic Models for Social Processes
(1982)
W. Erbe
Gregariousness, group membership and the flow of information
American Journal of Sociology
(1962)

There are more references available in the full text version of this article.

Cited by (21)

Detecting large cohesive subgroups with high clustering coefficients in social networks
2016, Social Networks
Citation Excerpt :
Interactions using online social media platforms, mobile phones and email are commonly represented as graphs, whose structural analysis may reveal interesting insights about the underlying social networks. For example, community detection techniques have been extensively used to identify groups of people characterized by a high level of interactions and to understand social communication throughout the network by finding cohesive subgroups (Scott, 2000; Falzon, 2000; Fortunato, 2010; Schaeffer, 2007). A cohesive subgroup is a “tightly knit” subset of actors in a social network, which was originally modeled using the graph-theoretic concept of a clique (Luce and Perry, 1949).
Clique relaxations are used in classical models of cohesive subgroups in social network analysis. Clustering coefficient was introduced more recently as a structural feature characterizing small-world networks. Noting that cohesive subgroups tend to have high clustering coefficients, this paper introduces a new clique relaxation, α-cluster, defined by enforcing a lower bound α on the clustering coefficient in the corresponding induced subgraph. Two variations of the clustering coefficient are considered, namely, the local and global clustering coefficient. Certain structural properties of α-clusters are analyzed and mathematical optimization models for determining α-clusters of the largest size in a network are developed and validated using several real-life social networks. In addition, a network clustering algorithm based on local α-clusters is proposed and successfully tested.
Knowledge discovery in social networks by using a logic-based treatment of implications
2015, Knowledge-Based Systems
Citation Excerpt :
There are other methods to retrieve knowledge from the social networks usually based on finding clusters representing communities. The most used methods are based on graph exploration as clique techniques [17], biclustering [23,25], spectral graph partioning [13], etc. The different approaches use linguistic approaches, text mining or machine learning techniques combined with methods of network analysis, such as centrality measures.
This work can be seen as a contribution to the area of social network analysis. By considering Formal Concept Analysis (FCA) as the underlying formalizing tool, we use logic-based techniques in order to offer novel solutions to identify user’s influence in a social network. We propose the use of the Simplification Logic ${SL}_{FD}$ for attribute implications as the core of an automated method to build a structure containing the complete set of influences among users.
Familial groups in social networks
2013, Social Networks
Citation Excerpt :
Freeman gave a definition for social community which was more general than a clique by imposing overlap conditions on maximal cliques. Falzon (2000) modified that definition and described algorithms to find such communities. The notion of network robustness captures the property that desirable characteristics of the network still hold under some failures or disconnections.
Many structural definitions for social community have been proposed in attempt to characterize and further understand the structure of social relationships. Algorithms using quantitative concepts such as centrality measures, spectral methods and other clustering measures have been used to compute social communities. While these methods have had much success in extracting meaningful subgroups in social and biological (and other) networks, they do not necessarily reveal the defining structure of social attraction.
We propose a new definition here for social community with a very clear and simple graph-theoretic structure which can also be realized as a new clique-relaxation. This structure evolved from Freeman's definition of social community, and this definition is further supported by long-standing sociometric principles such as Granovetter's weak-tie hypothesis or Faust's and others’ studies on how global structure can be inferred from a complete understanding of local structures (although our definition goes beyond dyadic and triadic configurations). We provide computational results that show our simply-stated structural definition reveals communities that correspond almost identically to, and sometimes are better than, the widely used centrality-based methods.
We name these new communities familial groups, inspired by the network structures resulting from inheritance or blood-line relations. These structures form naturally in hierarchical arrangements such as in corporate settings. Using results from graph theory, our structural definition for familial groups also immediately implies a ranking of the individuals within the group, easily identifying leaders and subcommunities.
Social and semantic coevolution in knowledge networks
2010, Social Networks
Citation Excerpt :
The identification of this kind of dual structure in networks has been the focus of several qualitative and quantitative studies within the framework of bipartite graphs. ( Breiger, 1974; Wille, 1992; Freeman, 1996; White and Duquenne, 1996; Falzon, 2000; Roth and Bourgine, 2005; Lehmann et al., 2008). Freeman and White (1993) have notably shown how to jointly group agents and events they participate in.
Socio-semantic networks involve agents creating and processing information: communities of scientists, software developers, wiki contributors and webloggers are, among others, examples of such knowledge networks. We aim at demonstrating that the dynamics of these communities can be adequately described as the coevolution of a social and a socio-semantic network. More precisely, we will first introduce a theoretical framework based on a social network and a socio-semantic network, i.e. an epistemic network featuring agents, concepts and links between agents and between agents and concepts. Adopting a relevant empirical protocol, we will then describe the joint dynamics of social and socio-semantic structures, at both macroscopic and microscopic scales, emphasizing the remarkable stability of these macroscopic properties in spite of a vivid local, agent-based network dynamics.
The conceptual weak hierarchy associated with a dissimilarity measure
2002, Mathematical social sciences
On succinct representation of knowledge community taxonomies with formal concept analysis
2008, International Journal of Foundations of Computer Science

View all citing articles on Scopus

View full text

Determining groups from the clique structure in large social networks

Abstract

Introduction

Section snippets

The group structure inherent in a clique lattice

Application to a large social network

Conclusion

Acknowledgements

Social Networks

Theoretical Computer Science

Social Networks

Social Networks

Social Networks

Social Networks

Stochastic Models for Social Processes

Gregariousness, group membership and the flow of information

American Journal of Sociology