Elsevier

Swarm and Evolutionary Computation

Volume 31, December 2016, Pages 90-109
Swarm and Evolutionary Computation

Regular Paper
A new multi-objective evolutionary framework for community mining in dynamic social networks

https://doi.org/10.1016/j.swevo.2016.09.001Get rights and content

Abstract

Evolutionary clustering – clustering in the presence of dynamic shifts of data's topological structure – has recently drawn remarkable attention wherein several algorithms are developed in the study of complex real networks. Despite the growing interests, all of the algorithms are designed based on seemingly the same principle. The primary principle in these evolutionary clustering frameworks is guided by decomposing the problem into two individual criteria, snapshot quality and temporal smoothness. Snapshot quality should properly cluster individuals of a network into interconnected communities. Temporal smoothness, on the other hand, should capture well the dynamic shift of the interconnected clusters from one time step to another. Thus, in the absence of any dynamic behavior, an evolutionary clustering model should be no more than a community detection one in a static network. Unfortunately, all of the developed algorithms are proposed based on discretion of the snapshot quality as a unified of both intra- and inter- connected community detection model while temporal cost as a community evolution detection model. The contribution of this paper starts by noting the limitation of the existing state-of-the-art algorithms. Despite their performance on dynamic complex networks, their formulations lack complete reflection of sufficient community detection model. Our framework, then, models the evolutionary clustering problem by hypothesizing that it should not depart too much from the community detection problem. To support this claim, an alternate decomposition perspective is proposed by projecting the problem, as a multi-objective optimization problem, in the light of snapshot and temporal of both intra- and inter-community scores. Two snapshot qualities are proposed to individually emphasize the role of intra- and inter- community scores, while temporal cost is proposed to cross-fertilize inter- community score. By applying one of the prominent multi-objective evolutionary algorithms (MOEAs) to solve the proposed multi-objective evolutionary clustering framework and testing it on several synthetic and real-world dynamic networks, we demonstrate the ability of the proposed model to address the problem more accurately than the existing state-of-the-art formulations.

Introduction

Due to their practical significance and ever-increasing applicability in many real world dynamic systems, networks and their topological attributes have very recently drawn growing attention and fueled the desire for solving their problems. Examples include online worlds like technological networks, information networks, and social-communication networks such as the internet, World Wide Web, and Facebook. Other interesting examples are biological networks and ecological niches like protein-protein interaction networks and food webs.

Many algorithms have shown up in literature to analyze the behavior of complex networks in a single and, more importantly, in multi time steps. The study of functional homogeneity of group of members (commonly noted as module, co-cluster, or simply, cluster) in the network is much more involved in social network analysis (SNA). In its context, a module or a community is a set of individuals with more appearance of intra-connection amongst its members than inter-connection with other communities in the network. Moreover, the aspect of a community can account several types of membership drifting over time resulting in continuous changes in interaction signatures. Thus, by identifying network's communities (and their evolution), several functional phenomena can be depicted and predicted from the network structure. Community mining in evolutionary networks has and continues to have growing applications. Examples include trend analysis in social spheres and dynamic link prediction [1], [2], [3], [4], [5].

Capturing the evolution of clusters in dynamic complex networks is first introduced by Chakrabarti et al. [6] and adopted in all state-of-the-art approaches (examples include [7], [8], [9], [10], [11], [12]). The fundamental issue of evolution of temporal data is addressed in these approaches based on seemingly one common ground and principle inspired primary from the detection of the two participants of the problem: the snapshot patterns and evolutionary patterns of the communities. These two sub-problems are formulated as multi-cost optimization problem content mainly with snapshot cost and temporal cost. To specify the characteristic of evolutionary clustering problem in these approaches, three design parameters are used, namely, snapshot intra-cluster quality, snapshot inter-cluster quality, and temporal cost. They proved that the interplay of these parameters plays a vital role in the ability of the adopted evolutionary clustering algorithm.

Although all of the existing state-of-the-art frameworks attempt to involve the above mentioned parameters by maximizing snapshot quality of the network at a current time step and minimizing temporal cost of the network between the current time step and the previous one, it allows (as will be demonstrated in our investigations) a certain degree of cross-competition between snapshot quality and temporal cost that may become an acute problem while eliminating some promising solutions. This cross-competition, however, can't be overlooked in any evolutionary clustering framework and thus can also act against our framework. Nevertheless, our idea is to lessen the impact of this cross-competition by designing a proper cross-fertilization model between the temporal cost and the snapshot inter-cluster connection quality. Once we do that, we can then make a cross-competition between the snapshot intra-cluster connection quality and the designed cross-fertilization model. It is not intended, here, to be an exact evolutionary clustering framework, rather, its purpose is to offer a more successful way to maintain the essential characteristic components of evolutionary clustering problem and to explore their combined impacts on the final performance of the model. The remaining sections of this paper present our alternate perspective to solve evolutionary clustering problem in complex networks. The proposed framework should contribute to each of the following two problem solving aspects:

  • 1.

    How can we characterize the evolutionary clustering problem in dynamic complex networks?

  • 2.

    How can we shift from the de-facto definition of evolutionary clustering problem and define an alternate and efficient framework to cast and state it?

Starting with Section 2, it gives related backgrounds on the network's evolutionary clustering problem while presenting relevant graph's terminology. State-of-the-art works are then reviewed in Section 3. Our framework is stated and formulated in 4 Evolutionary clustering: an alternate trajectory, 5 Methodology of the proposed. Experimental results and corresponding analysis on synthetic and real life social networks are provided in Section 6. Finally, conclusion of the main findings of this paper and further possible ramifications are highlighted in Section 7.

Section snippets

Graph clustering and evolutionary clustering

Mathematically, a network is modeled as graph of pairwise edges between its nodes. Assuming, for example, a friendship graph G modeling a social network N, the pairwise friendship connections between individual entities of N can be modeled by the pair (V,E). The set of n individuals or entities in N is noted as the set of nodes or vertices V={v1,v2,,vn} in G while the friendship connection between any pair of individuals in N is noted as edge (vi,vj) in E, i.e. E={(vi,vj)|1i,jnij}.

Literature review

By definition, capturing evolution of patterns within social networks must exploit time-evolving friendship relations gathered from the nodes and assign the nodes accordingly to different evolving clusters. Kumar et al. [30] proposed community mining model to analyze two large online social networks. They presented two properties that can be extracted from the online network. These are nodes density and membership. Their model discriminates the regions of the network, accordingly, into three

Evolutionary clustering: an alternate trajectory

With all the foregoing heuristic and meta-heuristic methodologies (which are based on one common snapshottemporal costs framework) in our arsenal, it is quite natural to ask whether it is possible to create another principle and elaborate any of the presented methodologies to tackle evolutionary clustering problem more accurately. The answer of this paper is yes. To support this claim, the remaining of this section introduces an alternate perspective of evolutionary clustering framework, noted

Methodology of the proposed MOEC

The methodology of the formulated MOEC is mainly based on the decomposition based multi-objective evolutionary algorithm (MOEA/D) of Zhang and Li [41]. A general review of MOEA/D is presented next.

Experimental results

In this section, we will test the performance of the proposed MOEC framework (based on the multi-objective minimization of an intra-snapshot cost formulated in Eq. (12) and an inter-and-intra-temporal cost formulated in Eq. (14)) against the MOEC traditional framework being projected into four state-of-the-art multi-objective optimization models presented in [12], [40]. These models noted as snapshot-temporal pair (Φ1,Φ2) are: 1) (Q,NMI), 2) (CS,NMI), 3) (CO,NMI), and 4) (NC,NMI). Henceforth,

Conclusions

One of the main important aspects of the proposed multi-objective evolutionary clustering framework is that it realizes the importance of each of the three main participants of the problem, namely snapshot intra-cluster quality, snapshot inter-cluster quality, and temporal smoothness. Unlike state-of-the-art evolutionary clustering models, the detection of communities in our model is more accurate and less susceptible to trapping locally because of the explicit manipulation of the snapshot

References (51)

  • C. Aggarwal et al.

    Evolutionary network analysis: a survey

    ACM Comput. Surv. (CSUR)

    (2014)
  • J. Leskovec et al.

    The dynamics of viral marketing

    ACM Trans. Web (TWEB)

    (2007)
  • L.Yan, J. Wang, J. Han, Y. Wang, A significance-driven framework for characterizing and finding evolving patterns of...
  • Acar, E., Dunlavy, D.M., & Kolda, T.G. (2009, December). Link prediction on evolving data using matrix and tensor...
  • R. Kumar, J. Novak, A. Tomkins, Structure and evolution of online social networks. In Link Mining: Models, Algorithms,...
  • D. Chakrabarti, R. Kumar, & A. Tomkins, Evolutionary clustering, in: Proceedings of the 12th ACM International...
  • L. Tang, H. Liu, J. Zhang, & Z. Nazeri, Community evolution in dynamic multi-mode networks, in: Proceeding of the...
  • Y.R. Lin et al.

    Analyzing communities and their evolutions in dynamic social networks

    ACM Trans. Knowl. Discov. Data (TKDD)

    (2009)
  • Y. Chi et al.

    On evolutionary spectral clustering

    ACM Trans. Knowl. Discov. Data (TKDD)

    (2009)
  • M.S. Kim et al.

    A particle-and-density based evolutionary clustering method for dynamic networks

    Proc. VLDB Endow.

    (2009)
  • F. Folino et al.

    An evolutionary multiobjective approach for community discovery in dynamic networks

    IEEE Trans. Knowl. Data Eng.

    (2014)
  • Folino, F., & Pizzuti, C. (2010, August). A multiobjective and evolutionary clustering method for dynamic networks. in:...
  • M.R. Garey et al.

    Computers and Intractability. A Guide to the Theory of NP-Completeness

    (1979)
  • S. Schaeffer

    Graph clustering

    Comput. Sci. Rev.

    (2007)
  • U. Brandes et al.

    On modularity clustering

    IEEE Trans. Knowl. Data Eng.

    (2008)
  • A. Noack

    Energy models for graph clustering

    J. Graph Algorithms Appl.

    (2007)
  • F. Radicchi et al.

    Defining and identifying communities in networks

    Proc. Natl. Acad. Sci. USA

    (2004)
  • M.E.J. Newman et al.

    Finding and evaluating community structure in networks

    Phys. Rev. E

    (2004)
  • M. Tasgin, A. Bingol, Communities detection in complex networks using genetic algorithms. in: Proceedings of the...
  • A. Clauset et al.

    Finding community structure in very large networks

    Phys. Rev. E

    (2004)
  • J. Reichardt et al.

    Statistical mechanics of community detection

    Phys. Rev. E

    (2006)
  • M.E.J. Newman

    Finding community structure in networks using the eigenvectors of matrices

    Phys. Rev. E

    (2006)
  • M.E.J. Newman

    Modularity and community structure in networks

    Proc. Natl. Acad. Sci.

    (2006)
  • Tasgin, M., Herdagdelen, A., Bingol, A. (2007). Communities detection in complex networks using genetic algorithms,...
  • S. Lehmann et al.

    Deterministic modularity optimization

    Eur. Phys. J. B

    (2007)
  • Cited by (23)

    • A review of heuristics and metaheuristics for community detection in complex networks: Current usage, emerging development and future directions

      2021, Swarm and Evolutionary Computation
      Citation Excerpt :

      They showed plausible behaviour supporting their performance against the state-of-the-art HCD and MCD algorithms. For example, the foundation of the heuristic mutation operators, the so-called migration operators, proposed in [2,20,21,23,24] is to harness the existence of strong intra-connections among the nodes at the expense of weak ones. The proposed migration operators can discover the connections necessary to form strong intra-connections and moving nodes towards such structures.

    • Revealing dynamic communities in networks using genetic algorithm with merge and split operators

      2020, Physica A: Statistical Mechanics and its Applications
      Citation Excerpt :

      Compared with other heuristics, evolutionary approaches have a stronger global search capability originating from the search mechanism based on population, which allows them to escape from local optima and hence to find a better solution. In recent years, evolutionary methods have been successfully applied in community detection for static networks [44–49], and dynamic networks [48,49]. Based on the elite genetic algorithm, we present an evolutionary approach, MSGA, which uses a different quality function combined with ad-hoc merge and split operators to accurately detect dynamic communities.

    • A new evolutionary multi-objective community mining algorithm for signed networks

      2019, Applied Soft Computing Journal
      Citation Excerpt :

      Recently, a thorough cover of the main research studies in multi-objective community detection models (including unsigned, signed, dynamic, and overlapping models) is put forward by Cai et al. [37]. Liu et al. in [34] compared the performance of their multi-objective maximization model against FEC of Yang et al. [24], extension to Blondel et al. optimization method for modularity [33], and Li et al. model [28]. Upon thorough analysis, the effectiveness and efficacy of Liu et al. model against other models are revealed out.

    View all citing articles on Scopus
    View full text