Skip to main content
Erschienen in: The Journal of Supercomputing 12/2020

06.03.2020

Towards finding the best-fit distribution for OSN data

verfasst von: Subhayan Bhattacharya, Sankhamita Sinha, Sarbani Roy, Amarnath Gupta

Erschienen in: The Journal of Supercomputing | Ausgabe 12/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Currently, all online social networks (OSNs) are considered to follow a power-law distribution. In this paper, the degree distribution for multiple OSNs has been studied. It is seen that the degree distributions of OSNs differ moderately from a power law. Lognormal distributions are an alternative to power-law distributions and have been used as best fit for many complex networks. It is seen that the degree distributions of OSNs differ massively from a lognormal distribution. Thus, for a better fit, a composite distribution combining power-law and lognormal distribution is suggested. This paper proposes an approach to find the most suitable distribution for a given degree distribution out of the six possible combinations of power law and lognormal, namely power law, lognormal, power law–lognormal, lognormal–power law, double power law, and double power law lognormal. The errors in the fitted composite distribution and the original degree distribution of the OSNs are observed. It is seen that a composite distribution fitted using the approach described in this paper is always a better fit than both power-law and lognormal distributions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Fußnoten
1
In a hashtag co-occurrence graph, the nodes are hashtags and the edges represent the fact that two hashtags appear in a tweet. In this study, we ignore the edge weight that represents the count of tweets in which these hashtags co-occur.
 
Literatur
1.
Zurück zum Zitat Kemp S (2019) Digital 2019: Global internet use accelerates. We are Social Kemp S (2019) Digital 2019: Global internet use accelerates. We are Social
2.
Zurück zum Zitat Smith K (2019) 53 incredible facebook statistics and facts. Brandwatch Report Smith K (2019) 53 incredible facebook statistics and facts. Brandwatch Report
3.
4.
Zurück zum Zitat Kumar R, Novak J, Tomkins A (2006) Structure and evolution of online social networks. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining (KDD ’06). Association for Computing Machinery, New York, NY, pp 611–617. https://doi.org/10.1145/1150402.1150476CrossRef Kumar R, Novak J, Tomkins A (2006) Structure and evolution of online social networks. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining (KDD ’06). Association for Computing Machinery, New York, NY, pp 611–617. https://​doi.​org/​10.​1145/​1150402.​1150476CrossRef
5.
Zurück zum Zitat Gephart JA, Pace ML (2015) Structure and evolution of the global seafood trade network. Environ. Res. Lett. 10(12):125,014CrossRef Gephart JA, Pace ML (2015) Structure and evolution of the global seafood trade network. Environ. Res. Lett. 10(12):125,014CrossRef
8.
Zurück zum Zitat Newman ME (2005) Power laws, pareto distributions and zipf’s law. Contemp. Phys. 46(5):323–351CrossRef Newman ME (2005) Power laws, pareto distributions and zipf’s law. Contemp. Phys. 46(5):323–351CrossRef
9.
Zurück zum Zitat Gómez V, Kaltenbrunner A, López V (2008) Statistical analysis of the social network and discussion threads in slashdot. In: Proceedings of the 17th International Conference on World Wide Web (WWW). ACM, pp 645–654 Gómez V, Kaltenbrunner A, López V (2008) Statistical analysis of the social network and discussion threads in slashdot. In: Proceedings of the 17th International Conference on World Wide Web (WWW). ACM, pp 645–654
10.
Zurück zum Zitat Sala A, Gaito S, Rossi GP, Zheng H, Zhao BY (2011) Revisiting degree distribution models for social graph analysis. arXiv:11080027 Sala A, Gaito S, Rossi GP, Zheng H, Zhao BY (2011) Revisiting degree distribution models for social graph analysis. arXiv:​11080027
11.
Zurück zum Zitat Reed WJ, Hughes BD (2003) Power-law distribution from exponential processes: an explanation for the occurrence of long-tailed distributions in biology and elsewhere. Sci Math Jpn 58(2):473–484MathSciNetMATH Reed WJ, Hughes BD (2003) Power-law distribution from exponential processes: an explanation for the occurrence of long-tailed distributions in biology and elsewhere. Sci Math Jpn 58(2):473–484MathSciNetMATH
12.
Zurück zum Zitat Reed WJ, Jorgensen M (2004) The double pareto-lognormal distribution—a new parametric model for size distributions. Commun Stat Theory Methods 33(8):1733–1753MathSciNetCrossRef Reed WJ, Jorgensen M (2004) The double pareto-lognormal distribution—a new parametric model for size distributions. Commun Stat Theory Methods 33(8):1733–1753MathSciNetCrossRef
13.
Zurück zum Zitat Seshadri M, Machiraju S, Sridharan A, Bolot J, Faloutsos C, Leskovek J (2008) Mobile call graphs: beyond power-law and lognormal distributions. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, pp 596–604 Seshadri M, Machiraju S, Sridharan A, Bolot J, Faloutsos C, Leskovek J (2008) Mobile call graphs: beyond power-law and lognormal distributions. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, pp 596–604
14.
Zurück zum Zitat Fang Z, Wang J, Liu B, Gong W (2012) Double Pareto lognormal distributions in complex networks. In: Thai M. Pardalos P (eds) Handbook of optimization in complex networks. Springer Optimization and Its Applications, vol 57. Springer, Boston, MACrossRef Fang Z, Wang J, Liu B, Gong W (2012) Double Pareto lognormal distributions in complex networks. In: Thai M. Pardalos P (eds) Handbook of optimization in complex networks. Springer Optimization and Its Applications, vol 57. Springer, Boston, MACrossRef
15.
Zurück zum Zitat Luckstead J, Devadoss S (2017) Pareto tails and lognormal body of US cities size distribution. Phys A Stat Mech Appl 465:573–578CrossRef Luckstead J, Devadoss S (2017) Pareto tails and lognormal body of US cities size distribution. Phys A Stat Mech Appl 465:573–578CrossRef
16.
Zurück zum Zitat Kwong HS, Nadarajah S (2019) A note on “pareto tails and lognormal body of us cities size distribution”. Phys A Stat Mech Appl 513:55–62CrossRef Kwong HS, Nadarajah S (2019) A note on “pareto tails and lognormal body of us cities size distribution”. Phys A Stat Mech Appl 513:55–62CrossRef
17.
Zurück zum Zitat Montebruno P, Bennett RJ, Van Lieshout C, Smith H (2019) A tale of two tails: Do power law and lognormal models fit firm-size distributions in the mid-victorian era? Phys A Stat Mech Appl 523:858–875CrossRef Montebruno P, Bennett RJ, Van Lieshout C, Smith H (2019) A tale of two tails: Do power law and lognormal models fit firm-size distributions in the mid-victorian era? Phys A Stat Mech Appl 523:858–875CrossRef
18.
Zurück zum Zitat Lu S (2018) Power laws in complex graphs: parsimonious generative models, similarity testing algorithms, and the origins. PhD thesis, University of Massachusetts Amherst Lu S (2018) Power laws in complex graphs: parsimonious generative models, similarity testing algorithms, and the origins. PhD thesis, University of Massachusetts Amherst
19.
Zurück zum Zitat Kong Y, Zang H, Ma X (2016) Quick model fitting using a classifying engine. In: 2016 IEEE International Conference on Big Data (Big Data). IEEE, pp 2728–2733 Kong Y, Zang H, Ma X (2016) Quick model fitting using a classifying engine. In: 2016 IEEE International Conference on Big Data (Big Data). IEEE, pp 2728–2733
20.
Zurück zum Zitat Bee M (2015) Estimation of the lognormal-pareto distribution using probability weighted moments and maximum likelihood. Commun Stat Simul Comput 44(8):2040–2060MathSciNetCrossRef Bee M (2015) Estimation of the lognormal-pareto distribution using probability weighted moments and maximum likelihood. Commun Stat Simul Comput 44(8):2040–2060MathSciNetCrossRef
Metadaten
Titel
Towards finding the best-fit distribution for OSN data
verfasst von
Subhayan Bhattacharya
Sankhamita Sinha
Sarbani Roy
Amarnath Gupta
Publikationsdatum
06.03.2020
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 12/2020
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03232-y

Weitere Artikel der Ausgabe 12/2020

The Journal of Supercomputing 12/2020 Zur Ausgabe