skip to main content
research-article

Effective community search for large attributed graphs

Published:01 August 2016Publication History
Skip Abstract Section

Abstract

Given a graph G and a vertex qG, the community search query returns a subgraph of G that contains vertices related to q. Communities, which are prevalent in attributed graphs such as social networks and knowledge bases, can be used in emerging applications such as product advertisement and setting up of social events. In this paper, we investigate the attributed community query (or ACQ), which returns an attributed community (AC) for an attributed graph. The AC is a subgraph of G, which satisfies both structure cohesiveness (i.e., its vertices are tightly connected) and keyword cohesiveness (i.e., its vertices share common keywords). The AC enables a better understanding of how and why a community is formed (e.g., members of an AC have a common interest in music, because they all have the same keyword "music"). An AC can be "personalized"; for example, an ACQ user may specify that an AC returned should be related to some specific keywords like "research" and "sports".

To enable efficient AC search, we develop the CL-tree index structure and three algorithms based on it. We evaluate our solutions on four large graphs, namely Flickr, DBLP, Tencent, and DBpedia. Our results show that ACs are more effective and efficient than existing community retrieval approaches. Moreover, an AC contains more precise and personalized information than that of existing community search and detection methods.

References

  1. https://en.wikipedia.org/wiki/Disjoint-set_data_structure.Google ScholarGoogle Scholar
  2. V. Batagelj and M. Zaversnik. An o(m) algorithm for cores decomposition of networks. arXiv, 2003.Google ScholarGoogle Scholar
  3. G. Bhalotia et al. Keyword searching and browsing in databases using banks. In ICDE, 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. W. Cui, Y. Xiao, H. Wang, Y. Lu, and W. Wang. Online search of overlapping communities. In SIGMOD, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. W. Cui, Y. Xiao, H. Wang, and W. Wang. Local search of communities in large graphs. In SIGMOD, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. B. Ding, J. X. Yu, S. Wang, L. Qin, X. Zhang, and X. Lin. Finding top-k min-cost connected trees in databases. In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  7. S. N. Dorogovtsev et al. K-core organization of complex networks. Physical review letters, 2006.Google ScholarGoogle Scholar
  8. W. Fan, J. Li, S. Ma, N. Tang, Y. Wu, and Y. Wu. Graph pattern matching: from intractable to polynomial time. PVLDB, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. Fan, X. Wang, Y. Wu, and J. Xu. Association rules with graph patterns. PVLDB, 8(12):1502--1513, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y. Fang, H. Zhang, Y. Ye, and X. Li. Detecting hot topics from twitter: A multiview approach. Journal of Information Science, 40(5):578--593, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Fortunato. Community detection in graphs. Physics Reports, 486(3):75--174, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  12. J. Han, M. Kamber, and J. Pei. Data mining: concepts and techniques. Elsevier, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In SIGMOD, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. H. He, H. Wang, J. Yang, and P. S. Yu. Blinks: ranked keyword searches on graphs. In SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. X. Huang, H. Cheng, L. Qin, W. Tian, and J. X. Yu. Querying k-truss community in large and dynamic graphs. In SIGMOD, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. V. Kacholia et al. Bidirectional expansion for keyword search on graph databases. In VLDB, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Kargar and A. An. Keyword search in graphs: Finding r-cliques. PVLDB, 4(10):681--692, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R.-H. Li, L. Qin, J. X. Yu, and R. Mao. Influential community search in large networks. In PVLDB, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R.-H. Li, J. X. Yu, and R. Mao. Efficient core maintenance in large dynamic graphs. TKDE, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  20. Y. Liu, A. Niculescu-Mizil, and W. Gryc. Topic-link lda: joint models of topic and author community. In ICML, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. M. Nallapati, A. Ahmed, E. P. Xing, and W. W. Cohen. Joint latent topic models for text and citations. In KDD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. Newman et al. Finding and evaluating community structure in networks. Physical review E, 2004.Google ScholarGoogle Scholar
  23. Y. Ruan, D. Fuhry, and S. Parthasarathy. Efficient community detection in large networks using content and links. In WWW, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Sachan et al. Using content and interactions for discovering communities in social networks. In WWW, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. B. Seidman. Network structure and minimum degree. Social networks, 5(3):269--287, 1983.Google ScholarGoogle ScholarCross RefCross Ref
  26. M. Sozio and A. Gionis. The community-search problem and how to plan a successful cocktail party. In KDD, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. B. Thomee et al. The new data and new challenges in multimedia research. arXiv:1503.01817, 2015.Google ScholarGoogle Scholar
  28. H. Tong, C. Faloutsos, B. Gallagher, and T. Eliassi-Rad. Fast best-effort pattern matching in large attributed graphs. In KDD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Z. Xu, Y. Ke, Y. Wang, H. Cheng, and J. Cheng. A model-based approach to attributed graph clustering. In SIGMOD, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Y. Fang et al. Effective community search for large attributed graphs (technical report). http://www.cs.hku.hk/research/techreps/document/TR-2016-01.pdf.Google ScholarGoogle Scholar
  31. J. Yang, J. McAuley, and J. Leskovec. Community detection in networks with node attributes. In ICDM, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  32. T. Yang, R. Jin, Y. Chi, and S. Zhu. Combining link and content for community detection: a discriminative approach. In KDD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. J. X. Yu, L. Qin, and L. Chang. Keyword search in databases. Synthesis Lectures on Data Management, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Y. Zhou, H. Cheng, and J. X. Yu. Graph clustering based on structural/attribute similarities. VLDB, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Effective community search for large attributed graphs
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image Proceedings of the VLDB Endowment
      Proceedings of the VLDB Endowment  Volume 9, Issue 12
      August 2016
      345 pages
      ISSN:2150-8097
      Issue’s Table of Contents

      Publisher

      VLDB Endowment

      Publication History

      • Published: 1 August 2016
      Published in pvldb Volume 9, Issue 12

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader