Skip to main content
Top

2020 | OriginalPaper | Chapter

A Comparative Analysis of Community Detection Methods in Massive Datasets

Authors : B. S. A. S. Rajita, Deepa Kumari, Subhrakanta Panda

Published in: Modelling, Simulation and Intelligent Computing

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nowadays there is a boom in social network data streaming from various fields of interest related to finance, engineering, medicine, and general sciences. All these data are modeled as graphs for better analysis. Community detection is one such mechanism for the analysis of such massive data. Many community detection algorithms exist in literature. The existing algorithms are compared by using either real-world or artificial networks (modeled as graphs) but not both. This paper aims to make a comparative study of two popular existing community detection algorithms both on real-world and synthetic data and verify their performance. The approach in this paper makes good use of recent advances in graphical modeling of different social networks. We generated a random graph that represents most of the observed properties of a real-world dataset. The experimental results are tabulated and the computed metrics help in inferring the suitability or scalability of an algorithm for small or massive datasets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Girvan M, Newman MEJ (2002) Community structure in social and biological networks. Proc Nat Acad Sci 99(12):7821–7826 Girvan M, Newman MEJ (2002) Community structure in social and biological networks. Proc Nat Acad Sci 99(12):7821–7826
3.
go back to reference Sharma R, Oliveira S (2017) Community detection algorithm for big social networks using hybrid architecture. Big Data Res 10(2):44–522CrossRef Sharma R, Oliveira S (2017) Community detection algorithm for big social networks using hybrid architecture. Big Data Res 10(2):44–522CrossRef
4.
go back to reference Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Statistical Mech Theory Experim 2008(10):10008 Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Statistical Mech Theory Experim 2008(10):10008
5.
go back to reference Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3):036106 Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3):036106
6.
go back to reference Chakrabarti D, Kumar R, Tomkins A (2006) Evolutionary clustering. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, Aug 2006, pp 554–560 Chakrabarti D, Kumar R, Tomkins A (2006) Evolutionary clustering. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, Aug 2006, pp 554–560
7.
go back to reference Li X, Tan Y, Zhang Z, Tong Q (2016) Community detection in large social networks based on relationship density. In: IEEE 40th Annual computer software and applications conference (COMPSAC 2016), pp 524–533 Li X, Tan Y, Zhang Z, Tong Q (2016) Community detection in large social networks based on relationship density. In: IEEE 40th Annual computer software and applications conference (COMPSAC 2016), pp 524–533
8.
go back to reference Alamuri M, Surampudi BR, Negi A (2014) A survey of distance/similarity measures for categorical data. In: International joint conference on neural networks (IJCNN), vol 9(3), pp 1907–1914 Alamuri M, Surampudi BR, Negi A (2014) A survey of distance/similarity measures for categorical data. In: International joint conference on neural networks (IJCNN), vol 9(3), pp 1907–1914
9.
go back to reference Leskovec J, Kleinberg J, Faloutsos C (2005) Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of the 11th ACM SIGKDD international conference on knowledge discovery in data mining, pp 177–187, 21 Aug 2005 Leskovec J, Kleinberg J, Faloutsos C (2005) Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of the 11th ACM SIGKDD international conference on knowledge discovery in data mining, pp 177–187, 21 Aug 2005
10.
go back to reference Sumith N, Annappa B, Bhattacharya SA (2018) Holistic approach to influence maximization in social networks: STORIE. Appl Soft Comput 66(2):533–547 Sumith N, Annappa B, Bhattacharya SA (2018) Holistic approach to influence maximization in social networks: STORIE. Appl Soft Comput 66(2):533–547
11.
go back to reference Clauset A, Newman ME, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066111 Clauset A, Newman ME, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066111
12.
go back to reference Lancichinetti A, Kivelä M, Saramäki J, Fortunato S (2010) Characterizing the community structure of complex networks. PloS One 5(8):e11976 Lancichinetti A, Kivelä M, Saramäki J, Fortunato S (2010) Characterizing the community structure of complex networks. PloS One 5(8):e11976
13.
go back to reference Leskovec J, Lang KJ, Mahoney M (2010) Empirical comparison of algorithms for network community detection. In: Proceedings of the 19th international conference on world wide web, pp 631–640 Leskovec J, Lang KJ, Mahoney M (2010) Empirical comparison of algorithms for network community detection. In: Proceedings of the 19th international conference on world wide web, pp 631–640
14.
go back to reference Tyler JR, Wilkinson DM, Huberman BA (2005) E-mail as spectroscopy: automated discovery of community structure within organizations. Inform Soc 21(2):143–153CrossRef Tyler JR, Wilkinson DM, Huberman BA (2005) E-mail as spectroscopy: automated discovery of community structure within organizations. Inform Soc 21(2):143–153CrossRef
15.
go back to reference Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(70):793–814 Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(70):793–814
16.
go back to reference Pons P, Latapy M (2005) Computing communities in large networks using random walks. In: International symposium on computer and information sciences, pp 284–293, Oct 2005 Pons P, Latapy M (2005) Computing communities in large networks using random walks. In: International symposium on computer and information sciences, pp 284–293, Oct 2005
17.
go back to reference Wagenseller P, Wang F, Wu W (2018) Size matters: a comparative analysis of community detection algorithms. IEEE Trans Comput Soc Syst 5(4):951–960CrossRef Wagenseller P, Wang F, Wu W (2018) Size matters: a comparative analysis of community detection algorithms. IEEE Trans Comput Soc Syst 5(4):951–960CrossRef
18.
go back to reference Zhao Z, Zheng S, Li C, Sun J, Chang L, Chiclana F (2018) A comparative study on community detection methods in complex networks. J Intell Fuzzy Syst 35(1):1077–1086 Zhao Z, Zheng S, Li C, Sun J, Chang L, Chiclana F (2018) A comparative study on community detection methods in complex networks. J Intell Fuzzy Syst 35(1):1077–1086
Metadata
Title
A Comparative Analysis of Community Detection Methods in Massive Datasets
Authors
B. S. A. S. Rajita
Deepa Kumari
Subhrakanta Panda
Copyright Year
2020
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-4775-1_19

Premium Partner