Skip to main content
Erschienen in: Social Network Analysis and Mining 4/2013

01.12.2013 | Original Article

Exploiting behaviors of communities of twitter users for link prediction

verfasst von: Jorge Valverde-Rebaza, Alneu de Andrade Lopes

Erschienen in: Social Network Analysis and Mining | Ausgabe 4/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Currently, online social networks and social media have become increasingly popular showing an exponential growth. This fact have attracted increasing research interest and, in turn, facilitating the emergence of new interdisciplinary research directions, such as social network analysis. In this scenario, link prediction is one of the most important tasks since it deals with the problem of the existence of a future relation among members in a social network. Previous techniques for link prediction were based on structural (or topological) information. Nevertheless, structural information is not enough to achieve a good performance in the link prediction task on large-scale social networks. Thus, the use of additional information, such as interests or behaviors that nodes have into their communities, may improve the link prediction performance. In this paper, we analyze the viability of using a set of simple and non-expensive techniques that combine structural with community information for predicting the existence of future links in a large-scale online social network, such as Twitter. Twitter, a microblogging service, has emerged as a useful source of informative data shared by millions of users whose relationships require no reciprocation. Twitter network was chosen because it is not well understood, mainly due to the occurrence of directed and asymmetric links yet. Experiments show that our proposals can be used efficiently to improve unsupervised and supervised link prediction task in a directed and asymmetric large-scale network.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
Ratio of total links per user is the ratio between the total number of links, |E|, and the total number of nodes, |V|. This ratio indicates the average of the size of the neighborhood for each node.
 
Literatur
Zurück zum Zitat Almeida LJ, de Andrade Lopes A (2009) An ultra-fast modularity-based graph clustering algorithm, Aveiro, Portugal 1–9 Almeida LJ, de Andrade Lopes A (2009) An ultra-fast modularity-based graph clustering algorithm, Aveiro, Portugal 1–9
Zurück zum Zitat Barber MJ, Clark JW (2009) Detecting network communities by propagating labels under constraints. Phys Rev E Stat Phys 80(2): 026129 Barber MJ, Clark JW (2009) Detecting network communities by propagating labels under constraints. Phys Rev E Stat Phys 80(2): 026129
Zurück zum Zitat Benchettara N, Kanawati R, Rouveirol C (2010) A supervised machine learning link prediction approach for academic collaboration recommendation. In: Proceedings of RecSys Vol 10, pp 253–256 Benchettara N, Kanawati R, Rouveirol C (2010) A supervised machine learning link prediction approach for academic collaboration recommendation. In: Proceedings of RecSys Vol 10, pp 253–256
Zurück zum Zitat Boutet A, Kim H, Yoneki E (2013) Whats in Twitter, i know what parties are popular and who you are supporting now!. Soc Netw Anal Min Boutet A, Kim H, Yoneki E (2013) Whats in Twitter, i know what parties are popular and who you are supporting now!. Soc Netw Anal Min
Zurück zum Zitat Calderon-Niquin MA, Valverde-Rebaza J (2012) Multiple kernel learning based on local and nonlinear combinations. In: Informatica (CLEI), XXXVIII Conferencia Latinoamericana, pp 1 –7 Calderon-Niquin MA, Valverde-Rebaza J (2012) Multiple kernel learning based on local and nonlinear combinations. In: Informatica (CLEI), XXXVIII Conferencia Latinoamericana, pp 1 –7
Zurück zum Zitat Davis D, Lichtenwalter R, Chawla R (2013) Supervised methods for multi-relational link prediction. Soc Netw Anal Min 3: 127–141CrossRef Davis D, Lichtenwalter R, Chawla R (2013) Supervised methods for multi-relational link prediction. Soc Netw Anal Min 3: 127–141CrossRef
Zurück zum Zitat Esslimani I, Brun A, Boyer A (2011) Densifying a behavioral recommender system by social networks link prediction methods. Soc Netw Anal Min 1:159–172CrossRef Esslimani I, Brun A, Boyer A (2011) Densifying a behavioral recommender system by social networks link prediction methods. Soc Netw Anal Min 1:159–172CrossRef
Zurück zum Zitat Fatourechi M, Ward R, Mason S, Huggins J, Schlogl A, Birch G (2008) Comparison of evaluation metrics in classification applications with imbalanced datasets. In: Machine learning and applications. ICMLA ’08. Seventh International Conference on, pp 777–782 Fatourechi M, Ward R, Mason S, Huggins J, Schlogl A, Birch G (2008) Comparison of evaluation metrics in classification applications with imbalanced datasets. In: Machine learning and applications. ICMLA ’08. Seventh International Conference on, pp 777–782
Zurück zum Zitat Feng X, Zhao J, Xu K (2012) Link prediction in complex networks: a clustering perspective. Eur Phys J B 85(1): 3CrossRef Feng X, Zhao J, Xu K (2012) Link prediction in complex networks: a clustering perspective. Eur Phys J B 85(1): 3CrossRef
Zurück zum Zitat Fire M, Tenenboim L, Lesser O, Puzis R, Rokach L, Elovici Y (2011) Link prediction in social networks using computationally efficient topological features. In: Privacy, security, risk and trust, 2011 IEEE Third International Conference on and 2011 IEEE Third International Conference on Social Computing (SOCIALCOM), pp 73 –80 Fire M, Tenenboim L, Lesser O, Puzis R, Rokach L, Elovici Y (2011) Link prediction in social networks using computationally efficient topological features. In: Privacy, security, risk and trust, 2011 IEEE Third International Conference on and 2011 IEEE Third International Conference on Social Computing (SOCIALCOM), pp 73 –80
Zurück zum Zitat Fortunato S (2010) Community detection in graphs. CoRR abs/0906.0612v2 Fortunato S (2010) Community detection in graphs. CoRR abs/0906.0612v2
Zurück zum Zitat Golder SA, Yardi S (2010) Structural predictors of tie formation in twitter: transitivity and mutuality. In: Proceedings of SOCIALCOM ’10, pp 88–95 Golder SA, Yardi S (2010) Structural predictors of tie formation in twitter: transitivity and mutuality. In: Proceedings of SOCIALCOM ’10, pp 88–95
Zurück zum Zitat Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1): 29–36 Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1): 29–36
Zurück zum Zitat Hasan MA, Chaoji V, Salem S, Zaki M (2006) Link prediction using supervised learning. In: Proceedings of SDM 06 workshop on link analysis, counterterrorism and security Hasan MA, Chaoji V, Salem S, Zaki M (2006) Link prediction using supervised learning. In: Proceedings of SDM 06 workshop on link analysis, counterterrorism and security
Zurück zum Zitat Haykin S (1998) Neural networks: a comprehensive foundation, 2nd ed. Prentice Hall PTR, Upper Saddle River Haykin S (1998) Neural networks: a comprehensive foundation, 2nd ed. Prentice Hall PTR, Upper Saddle River
Zurück zum Zitat Herlocker JL, Konstan JA, Terveen LG, Riedl JT (2004) Evaluating collaborative filtering recommender systems. ACM Trans Inf Syst 22(1):5–53CrossRef Herlocker JL, Konstan JA, Terveen LG, Riedl JT (2004) Evaluating collaborative filtering recommender systems. ACM Trans Inf Syst 22(1):5–53CrossRef
Zurück zum Zitat Hopcroft J, Lou T, Tang J (2011) Who will follow you back?: reciprocal relationship prediction. In: Proceedings of CIKM ’11, pp 1137–1146 Hopcroft J, Lou T, Tang J (2011) Who will follow you back?: reciprocal relationship prediction. In: Proceedings of CIKM ’11, pp 1137–1146
Zurück zum Zitat Hoseini E, SHashemi E, Hamzeh A (2012) Link prediction in social network using co-clustering based approach. In: Proceedings of the 2012 26th international conference on advanced information networking and applications workshops, ser. WAINA ’12. IEEE Computer Society, pp 795–800 Hoseini E, SHashemi E, Hamzeh A (2012) Link prediction in social network using co-clustering based approach. In: Proceedings of the 2012 26th international conference on advanced information networking and applications workshops, ser. WAINA ’12. IEEE Computer Society, pp 795–800
Zurück zum Zitat Itakura KY, Clarke CLA, Geva S, Trotman A, Huang WC (2011) Topical and structural linkage in wikipedia. In: Proceedings of ECIR’11, pp 460–465 Itakura KY, Clarke CLA, Geva S, Trotman A, Huang WC (2011) Topical and structural linkage in wikipedia. In: Proceedings of ECIR’11, pp 460–465
Zurück zum Zitat Kotera M, Yamanishi Y, Moriya Y, Kanehisa M, Goto S (2012) Genies: gene network inference engine based on supervised analysis. Nucleic Acids Res 40: 162–167CrossRef Kotera M, Yamanishi Y, Moriya Y, Kanehisa M, Goto S (2012) Genies: gene network inference engine based on supervised analysis. Nucleic Acids Res 40: 162–167CrossRef
Zurück zum Zitat Kwak H, Lee C, Park H, Moon S (2010) What is twitter, a social network or a news media? In: Proceedings of WWW ’10, pp 591–600 Kwak H, Lee C, Park H, Moon S (2010) What is twitter, a social network or a news media? In: Proceedings of WWW ’10, pp 591–600
Zurück zum Zitat Leung I, Hui P, Lio P, Crowcroft J (2009) Towards real-time community detection in large networks. Phys Rev E 79(6): 066107CrossRef Leung I, Hui P, Lio P, Crowcroft J (2009) Towards real-time community detection in large networks. Phys Rev E 79(6): 066107CrossRef
Zurück zum Zitat Liben-Nowell D, Kleinberg J (2007) The link-prediction problem for social networks. JASIST 58(7): 1019–1031CrossRef Liben-Nowell D, Kleinberg J (2007) The link-prediction problem for social networks. JASIST 58(7): 1019–1031CrossRef
Zurück zum Zitat Lichtenwalter RN, Lussier JT, Chawla NV (2010) New perspectives and methods in link prediction. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, ser. KDD’10. ACM, New York, pp 243–252 Lichtenwalter RN, Lussier JT, Chawla NV (2010) New perspectives and methods in link prediction. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, ser. KDD’10. ACM, New York, pp 243–252
Zurück zum Zitat Liu Z, Zhang Q-M, Lü L, Zhou T (2011) Link prediction in complex networks: a local naïve bayes model. Europhys Lett 96(48007) Liu Z, Zhang Q-M, Lü L, Zhou T (2011) Link prediction in complex networks: a local naïve bayes model. Europhys Lett 96(48007)
Zurück zum Zitat Lü L, Zhou T (2011) Link prediction in complex networks: a survey. Phys A Stat Mech Appl 390(6): 1150–1170CrossRef Lü L, Zhou T (2011) Link prediction in complex networks: a survey. Phys A Stat Mech Appl 390(6): 1150–1170CrossRef
Zurück zum Zitat Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6): 066133CrossRef Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6): 066133CrossRef
Zurück zum Zitat Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2): 026113CrossRef Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2): 026113CrossRef
Zurück zum Zitat Perez-Cervantes E, Mena-Chalco JP, de Oliveira MCF, Cesar-Jr RM (2013) Using link prediction to estimate the collaborative influence of researchers. In: IEEE 9th International Conference on e-Science 2013, Beijing, pp 1–8 Perez-Cervantes E, Mena-Chalco JP, de Oliveira MCF, Cesar-Jr RM (2013) Using link prediction to estimate the collaborative influence of researchers. In: IEEE 9th International Conference on e-Science 2013, Beijing, pp 1–8
Zurück zum Zitat Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc, San Francisco Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc, San Francisco
Zurück zum Zitat Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76: 036106CrossRef Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76: 036106CrossRef
Zurück zum Zitat Romero DM, Kleinberg JM (2010) The directed closure process in hybrid social-information networks, with an analysis of link formation on twitter. In: ICWSM Romero DM, Kleinberg JM (2010) The directed closure process in hybrid social-information networks, with an analysis of link formation on twitter. In: ICWSM
Zurück zum Zitat Soundarajan S, Hopcroft J (2012) Using community information to improve the precision of link prediction methods. In: Proceedings of the 21st international conference companion on World Wide Web, ser. Proceedings of WWW ’12 Companion, pp 607–608 Soundarajan S, Hopcroft J (2012) Using community information to improve the precision of link prediction methods. In: Proceedings of the 21st international conference companion on World Wide Web, ser. Proceedings of WWW ’12 Companion, pp 607–608
Zurück zum Zitat Tang J, Sun J, Wang C, Yang Z (2009) Social influence analysis in large-scale networks. In: Proceedings of KDD ’09, pp 807–816 Tang J, Sun J, Wang C, Yang Z (2009) Social influence analysis in large-scale networks. In: Proceedings of KDD ’09, pp 807–816
Zurück zum Zitat Valverde-Rebaza J, de Andrade Lopes A (2012) Link prediction in complex networks based on cluster information. In: Advances in artificial intelligence, SBIA 2012, 21th Brazilian symposium on artificial intelligence, ser, Vol 7589. Lecture Notes in Computer Science, Springer 92–101 Valverde-Rebaza J, de Andrade Lopes A (2012) Link prediction in complex networks based on cluster information. In: Advances in artificial intelligence, SBIA 2012, 21th Brazilian symposium on artificial intelligence, ser, Vol 7589. Lecture Notes in Computer Science, Springer 92–101
Zurück zum Zitat Valverde-Rebaza J, de Andrade Lopes A (2012) Structural link prediction using community information on twitter. In: Computational aspects of social networks (CASoN), 2012 Fourth International Conference on, Nov 2012, pp 132–137 Valverde-Rebaza J, de Andrade Lopes A (2012) Structural link prediction using community information on twitter. In: Computational aspects of social networks (CASoN), 2012 Fourth International Conference on, Nov 2012, pp 132–137
Zurück zum Zitat Vapnik VN (1995) The nature of statistical learning theory. Springer-Verlag New York, Inc., New York CrossRefMATH Vapnik VN (1995) The nature of statistical learning theory. Springer-Verlag New York, Inc., New York CrossRefMATH
Zurück zum Zitat Wei D, Deng X, Zhang X, Deng Y, Mahadevan S (2013) Identifying influential nodes in weighted networks based on evidence theory. Phys A Stat Mech Appl 392(10): 2564–2575CrossRef Wei D, Deng X, Zhang X, Deng Y, Mahadevan S (2013) Identifying influential nodes in weighted networks based on evidence theory. Phys A Stat Mech Appl 392(10): 2564–2575CrossRef
Zurück zum Zitat Yin D, Hong L, Davison BD (2011) Structural link analysis and prediction in microblogs. In: Proceedings of CIKM ’11, pp 1163–1168 Yin D, Hong L, Davison BD (2011) Structural link analysis and prediction in microblogs. In: Proceedings of CIKM ’11, pp 1163–1168
Zurück zum Zitat Zhang Q-M, Lü L, Wang W-Q, Zhu Y-X, Zhou T (2012) Potential theory for directed networks. CoRR abs/1202.2709 Zhang Q-M, Lü L, Wang W-Q, Zhu Y-X, Zhou T (2012) Potential theory for directed networks. CoRR abs/1202.2709
Zurück zum Zitat Zheleva E, Getoor L, Golbeck J, Kuter U (2008) Using friendship ties and family circles for link prediction. In: Proceedings of the 2nd international conference on advances in social network mining and analysis, ser. SNAKDD’08, pp 97–113 Zheleva E, Getoor L, Golbeck J, Kuter U (2008) Using friendship ties and family circles for link prediction. In: Proceedings of the 2nd international conference on advances in social network mining and analysis, ser. SNAKDD’08, pp 97–113
Zurück zum Zitat Zhou T, Lü L, Zhang Y-C (2009) Predicting missing links via local information. Eur Phys J B 71(4): 623–630CrossRefMATH Zhou T, Lü L, Zhang Y-C (2009) Predicting missing links via local information. Eur Phys J B 71(4): 623–630CrossRefMATH
Metadaten
Titel
Exploiting behaviors of communities of twitter users for link prediction
verfasst von
Jorge Valverde-Rebaza
Alneu de Andrade Lopes
Publikationsdatum
01.12.2013
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 4/2013
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-013-0142-8

Weitere Artikel der Ausgabe 4/2013

Social Network Analysis and Mining 4/2013 Zur Ausgabe