Published in: Quality & Quantity 5/2014

1 September 2014

Inference of the Russian drug community from one of the largest social networks in the Russian Federation

Authors: L. J. Dijkstra, A. V. Yakushev, P. A. C. Duijn, A. V. Boukhanovsky, P. M. A. Sloot

Abstract

The criminal nature of narcotics complicates the direct assessment of a drug community, yet a good understanding of the type of people drawn to or currently using drugs is vital for finding effective intervention strategies. For the Russian Federation this is of immediate concern, given the dramatic increase in drug abuse it has seen since the fall of the Soviet Union in the early nineties. Using unique data from the Russian social network ‘LiveJournal’, which has over 39 million registered users worldwide, we were able for the first time to identify the on-line drug community by context-sensitive text mining of the users’ blogs with a dictionary of known drug-related official and ‘slang’ terminology. By comparing the interests of the users who most actively spread information on narcotics over the network with the interests of individuals outside the on-line drug community, we found that the ‘average’ drug user in the Russian Federation is mostly interested in topics such as Russian rock, non-traditional medicine, UFOs, Buddhism, yoga and the occult. We identify three distinct scale-free sub-networks of users, which can be uniquely classified as ‘infectious’, ‘susceptible’ or ‘immune’.

Footnotes
1
LiveJournal is available at http://www.livejournal.com (English) and http://www.livejournal.ru (Russian).
 
2
Facebook is available at http://www.facebook.com.
 
3
Twitter is available at http://www.twitter.com.
 
4
LiveJournal’s own statistics page can be found at http://www.livejournal.com/stats.bml.
 
5
The homepage of SPb IAC can be found at http://iac.spb.ru (in Russian).
 
6
The full drug-dictionary is freely available and can be downloaded at http://escience.ifmo.ru/?ws=sub48.
 
7
The number of phrases (8,359) is rather high in comparison to the number of words (368) in this dictionary. This is because we treat, for example, a phrase consisting of the words ‘injecting’ and ‘heroin’ and a phrase consisting of the words ‘injection’, ‘heroin’ and ‘needle’ as two separate expressions (where the latter is associated with a higher weight than the former).
 
8
A \(\chi ^2\) test originally designed for \(2 \times 2\) contingency tables by Sir R. A. Fisher (1922).
 
9
Strictly speaking, the expected false discovery rate is only upper bounded when the \(m\) test statistics are independent, which does not hold in this particular case. B. Efron argues in his book Large-Scale Inference (2010) that this independence constraint is not a strong one.
 
10
The governmental statistics agency of the Russian Federation. Its website can be found at http://www.gks.ru (in Russian), with links to its rather extensive database.
 
11
A rank/frequency log–log plot is a plot of the occurrence frequency versus the rank on logarithmically scaled axes. For a more elaborate description of how to construct such a plot, see the Appendix of the paper by Newman (2005).
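
Footnotes 8 and 9 refer to a \(\chi^2\) test on \(2 \times 2\) contingency tables and to control of the false discovery rate over \(m\) tests (Benjamini and Hochberg 1995). The following is a minimal sketch of how these two steps might fit together, assuming Pearson's statistic without continuity correction and the standard Benjamini-Hochberg step-up rule. The function names, the interest labels and all counts are hypothetical and are not taken from the paper, and the exact test variant used by the authors may differ.

```python
import math


def chi2_2x2(a, b, c, d):
    """Pearson chi-squared test (1 degree of freedom, no continuity
    correction) for the 2x2 table [[a, b], [c, d]].

    Returns (statistic, p_value). Assumed layout (hypothetical):
    rows = inside / outside the drug community,
    columns = has the interest / does not have the interest.
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        # A zero margin makes the statistic undefined; report no evidence.
        return 0.0, 1.0
    stat = n * (a * d - b * c) ** 2 / denom
    # Survival function of the chi-squared distribution with 1 d.o.f.
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


def benjamini_hochberg(p_values, q=0.05):
    """Benjamini-Hochberg step-up procedure.

    Returns a list of booleans (same order as p_values) marking the
    hypotheses rejected at false discovery rate level q.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff = 0
    for rank, idx in enumerate(order, start=1):
        # Keep the largest rank whose p-value lies under the BH line.
        if p_values[idx] <= rank / m * q:
            cutoff = rank
    rejected = [False] * m
    for idx in order[:cutoff]:
        rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    tables = {
        "yoga": (30, 70, 100, 900),
        "cooking": (10, 90, 100, 900),
    }
    names = list(tables)
    p_vals = [chi2_2x2(*tables[name])[1] for name in names]
    for name, p, flag in zip(names, p_vals, benjamini_hochberg(p_vals)):
        print(name, p, flag)
```

The step-up rule rejects every hypothesis up to the largest rank that falls under the line \(k q / m\), even if some smaller p-values lie above it. As footnote 9 notes, the bound on the expected false discovery rate strictly requires independent test statistics.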
 
References
Agresti, A.: A survey of exact inference for contingency tables. Stat. Sci. 7(1), 131–177 (1992)
Albert, R., Jeong, H., Barabasi, A.L.: Error and attack tolerance of complex networks. Nature 406, 378–382 (2000)
Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Methodol. 57(1), 289–300 (1995)
Benjamini, Y., Yekutieli, D.: The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29(4), 1165–1188 (2001)
Bernades, D.F., Latapy, M., Tarissan, F.: Relevance of SIR model for real-world spreading phenomena: experiments on a large-scale p2p system. In: Proceedings of the International Conference on Advances in Social Network Analysis and Mining (ASONAM), Istanbul (2012)
Bollobas, B., Riordan, O.: Robustness and vulnerability of scale-free random graphs. Internet Math. 1(1), 1–35 (2004)
Crucitti, P., Latora, V., Marchiori, M., Rapisarda, A.: Efficiency of scale-free networks: error and attack tolerance. Phys. A Stat. Mech. Appl. 320, 622–642 (2003)
Efron, B.: Large-Scale Inference: Empirical Bayes Methods for Estimation, Testing and Prediction. Cambridge University Press, Cambridge (2010)
Everitt, B., Landau, S., Leese, M.: Cluster Analysis. Arnold, London (2001)
Ferri, F., Grifoni, P., Guzzo, T.: New forms of social and professional digital relationships: the case of Facebook. Soc. Netw. Anal. Min. 2(2), 121–137 (2012)
Fisher, R.: On the interpretation \(\chi ^2\) from contingency tables, and the calculation of \(p\). J. R. Stat. Soc. 85(1), 87–94 (1922)
Gallos, L.K., Barttfield, P., Havlin, S., Sigman, M., Makse, H.A.: Collective behavior in the spatial spreading of obesity. Sci. Rep. 2(45), 1–9 (2012)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, New York (2009)
Iribarren, J.B., Moro, E.: Impact of human activity patterns on the dynamics of information diffusion. Phys. Rev. Lett. 103(3), 8–11 (2009)
Kantardzic, M.: Data Mining: Concepts, Models, Methods, and Algorithms. IEEE Press/Wiley-Interscience, Hoboken (2011)
Lämmel, R.: Google’s MapReduce programming model - revisited. Sci. Comput. Program. 70, 1–30 (2007)
Mityagin, S.A.: Modeling the spread of drug-addiction through the population on the basis of complex networks (in Russian: Modelirovanie processov narkotizatsiya nasileniya na osnove kompleksnix cetei). Dissertation, National Research University of Information Technologies, Mechanics and Optics (2012)
Newman, M.: Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 46, 323–351 (2005)
Ochiai, A.: A zoogeographic studies on the solenoid fishes found in Japan and its neighbouring regions. Bull. Jpn. Soc. Fish Sci. 22, 526–530 (1957)
Onnela, J., Saramäki, J., Hyvönen, J., Szabó, G., Lazer, D., Kaski, K., Kertész, J., Barabási, A.L.: Structure and tie strengths in mobile communication networks. Proc. Natl. Acad. Sci. USA 104(8), 7332–7336 (2007)
Pinto, C., Mendes Lopez, A., Machado, J.: A review of power laws in real life phenomena. Commun. Nonlinear Sci. Numer. Simul. 17(9), 3558–3578 (2012)
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Scott, J.: Social network analysis: developments, advances, and prospects. Soc. Netw. Anal. Min. 1(1), 21–26 (2011)
Sunami, A.N.: Drug-conflict management in the context of information warfare (in Russian: Politika upravleniya narkokonfliktom v kontekste informatsionnoi voiny). Saint Petersburg State University, Saint Petersburg (2007)
Wilson, R.E., Gosling, S.D., Graham, L.T.: A review of Facebook research in the social sciences. Perspect. Psychol. Sci. 7(3), 203–220 (2012)
White, T.: Hadoop: The Definitive Guide. O’Reilly Media, Yahoo! Press, New York (2009)
Metadata
Title
Inference of the Russian drug community from one of the largest social networks in the Russian Federation
Authors
L. J. Dijkstra
A. V. Yakushev
P. A. C. Duijn
A. V. Boukhanovsky
P. M. A. Sloot
Publication date
1 September 2014
Publisher
Springer Netherlands
Published in
Quality & Quantity / Issue 5/2014
Print ISSN: 0033-5177
Electronic ISSN: 1573-7845
DOI
https://doi.org/10.1007/s11135-013-9921-6
