Combining a popularity-productivity stochastic block model with a discriminative-content model for general structure detection

Bian-fang Chai, Jian Yu, Cai-yan Jia, Tian-bao Yang, and Ya-wen Jiang
Phys. Rev. E 88, 012807 – Published 8 July 2013

Abstract

Latent community discovery that combines links and contents of a text-associated network has drawn more attention with the advance of social media. Most of the previous studies aim at detecting densely connected communities and are not able to identify general structures, e.g., bipartite structure. Several variants based on the stochastic block model are more flexible for exploring general structures by introducing link probabilities between communities. However, these variants cannot identify the degree distributions of real networks due to a lack of modeling of the differences among nodes, and they are not suitable for discovering communities in text-associated networks because they ignore the contents of nodes. In this paper, we propose a popularity-productivity stochastic block (PPSB) model by introducing two random variables, popularity and productivity, to model the differences among nodes in receiving links and producing links, respectively. This model has the flexibility of existing stochastic block models in discovering general community structures and inherits the richness of previous models that also exploit popularity and productivity in modeling the real scale-free networks with power law degree distributions. To incorporate the contents in text-associated networks, we propose a combined model which combines the PPSB model with a discriminative model that models the community memberships of nodes by their contents. We then develop expectation-maximization (EM) algorithms to infer the parameters in the two models. Experiments on synthetic and real networks have demonstrated that the proposed models can yield better performances than previous models, especially on networks with general structures.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
1 More
  • Received 27 December 2012

DOI:https://doi.org/10.1103/PhysRevE.88.012807

©2013 American Physical Society

Authors & Affiliations

Bian-fang Chai1,2, Jian Yu1,*, Cai-yan Jia1,†, Tian-bao Yang3, and Ya-wen Jiang1

  • 1Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing 100044, China
  • 2Department of Information Engineering, Shijiazhuang University of Economics, Hebei 050031,China
  • 3GE Global Research, San Ramon, California 94583, USA

  • *jianyu@bjtu.edu.cn
  • cyjia@bjtu.edu.cn

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 88, Iss. 1 — July 2013

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×