Skip to main content
Top
Published in: Social Network Analysis and Mining 1/2015

01-12-2015 | Original Article

Topic dynamics in Weibo: a comprehensive study

Authors: Rui Fan, Jichang Zhao, Ke Xu

Published in: Social Network Analysis and Mining | Issue 1/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The tremendous development of online social media has changed people’s life fundamentally in recent years. Weibo, a Twitter-like service in China, has attracted more than 500 million users in less than 5 years and produces more than 100 million Chinese tweets everyday. In these massive tweets, different user interests and daily trends are reflected by different topics. To our best knowledge, a systematic investigation of topic dynamics in Weibo is still missing. Aiming at filling this vital gap, we try to comprehensively disclose the topic dynamics from the perspective of time, geography, demographics, emotion, retweeting and correlation. An incremental learning framework is first established to probe more than 200 million streaming tweets and an interaction network constituted by around 90,000 active users. Many interesting patterns are then revealed, which could provide insights for topic-related applications in online social media, such as user profiling, event detection, trend tracking or content recommendation.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Ardon S, Bagchi A, Mahanti A, Ruhela A, Seth A, Tripathy RM, Triukose S (2013) Spatio-temporal and events based analysis of topic popularity in Twitter. In: Proceedings of the 22nd ACM international conference on conference on information & knowledge management (CIKM), San Francisco, CA, ACM, pp 219–228 Ardon S, Bagchi A, Mahanti A, Ruhela A, Seth A, Tripathy RM, Triukose S (2013) Spatio-temporal and events based analysis of topic popularity in Twitter. In: Proceedings of the 22nd ACM international conference on conference on information & knowledge management (CIKM), San Francisco, CA, ACM, pp 219–228
go back to reference Banerjee S, Ramanathan K, Gupta A (2007) Clustering short texts using Wikipedia.In: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 787–788 Banerjee S, Ramanathan K, Gupta A (2007) Clustering short texts using Wikipedia.In: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 787–788
go back to reference Barabasi AL (2005) The origin of bursts and heavy tails in human dynamics. Nature 435(7039):207–211CrossRef Barabasi AL (2005) The origin of bursts and heavy tails in human dynamics. Nature 435(7039):207–211CrossRef
go back to reference Becker H, Naaman M, Gravano L (2011) Beyond trending topics: real-world event identification on Twitter. In: ICWSM Becker H, Naaman M, Gravano L (2011) Beyond trending topics: real-world event identification on Twitter. In: ICWSM
go back to reference Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022MATH Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022MATH
go back to reference Bogdanov P, Busch M, Moehlis J, Singh AK, Szymanski BK (2013) The social media genome: modeling individual topic-specific behavior in social media. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 236–242 Bogdanov P, Busch M, Moehlis J, Singh AK, Szymanski BK (2013) The social media genome: modeling individual topic-specific behavior in social media. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 236–242
go back to reference Boyd D, Golder S, Lotan G (2010) Tweet, tweet, retweet: conversational aspects of retweeting on Twitter. In: System sciences (HICSS), 2010 43rd Hawaii international conference. IEEE, pp 1–10 Boyd D, Golder S, Lotan G (2010) Tweet, tweet, retweet: conversational aspects of retweeting on Twitter. In: System sciences (HICSS), 2010 43rd Hawaii international conference. IEEE, pp 1–10
go back to reference Cataldi M, Di Caro L, Schifanella C (2010) Emerging topic detection on Twitter based on temporal and social terms evaluation. In: Proceedings of the tenth international workshop on multimedia data mining. ACM, p 4 Cataldi M, Di Caro L, Schifanella C (2010) Emerging topic detection on Twitter based on temporal and social terms evaluation. In: Proceedings of the tenth international workshop on multimedia data mining. ACM, p 4
go back to reference Dumais S, Platt J, Heckerman D, Sahami M (1998) Inductive learning algorithms and representations for text categorization. In: Proceedings of the seventh international conference on Information and knowledge management. ACM, pp 148–155 Dumais S, Platt J, Heckerman D, Sahami M (1998) Inductive learning algorithms and representations for text categorization. In: Proceedings of the seventh international conference on Information and knowledge management. ACM, pp 148–155
go back to reference Fan R, Zhao J, Chen Y, Xu K (2014) Anger is more influential than joy: sentiment correlation in Weibo. PLoS One 9:e110, 184 Fan R, Zhao J, Chen Y, Xu K (2014) Anger is more influential than joy: sentiment correlation in Weibo. PLoS One 9:e110, 184
go back to reference Genc Y, Sakamoto Y, Nickerson JV (2011) Discovering context: classifying tweets through a semantic transform based on Wikipedia. In: Foundations of augmented cognition. Directing the future of adaptive systems. Springer, pp 484–492 Genc Y, Sakamoto Y, Nickerson JV (2011) Discovering context: classifying tweets through a semantic transform based on Wikipedia. In: Foundations of augmented cognition. Directing the future of adaptive systems. Springer, pp 484–492
go back to reference Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 50–57 Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 50–57
go back to reference Joachims T (1999) Transductive inference for text classification using support vector machines. In: ICML, vol 99, pp 200–209 Joachims T (1999) Transductive inference for text classification using support vector machines. In: ICML, vol 99, pp 200–209
go back to reference Kinsella S, Passant A, Breslin JG (2011) Topic classification in social media using metadata from hyperlinked objects. In: Advances in information retrieval. Springer, pp 201–206 Kinsella S, Passant A, Breslin JG (2011) Topic classification in social media using metadata from hyperlinked objects. In: Advances in information retrieval. Springer, pp 201–206
go back to reference Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on World Wide Web, WWW ’10. ACM, pp 591–600 Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on World Wide Web, WWW ’10. ACM, pp 591–600
go back to reference Michelson M, Macskassy SA (2010) Discovering users’ topics of interest on Twitter: a first look. In: Proceedings of the fourth workshop on analytics for noisy unstructured text data. ACM, pp 73–80 Michelson M, Macskassy SA (2010) Discovering users’ topics of interest on Twitter: a first look. In: Proceedings of the fourth workshop on analytics for noisy unstructured text data. ACM, pp 73–80
go back to reference Mislove A, Marcon M, Gummadi KP, Druschel P, Bhattacharjee B (2007) Measurement and analysis of online social networks. In: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement. ACM, pp 29–42 Mislove A, Marcon M, Gummadi KP, Druschel P, Bhattacharjee B (2007) Measurement and analysis of online social networks. In: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement. ACM, pp 29–42
go back to reference Novakovic J (2010) The impact of feature selection on the accuracy of naïve Bayes classifier. In: 18th telecommunications forum TELFOR, pp 1113–1116 Novakovic J (2010) The impact of feature selection on the accuracy of naïve Bayes classifier. In: 18th telecommunications forum TELFOR, pp 1113–1116
go back to reference Quercia D, Capra L, Crowcroft J (2012) The social world of Twitter: topics, geography, and emotions. In: ICWSM Quercia D, Capra L, Crowcroft J (2012) The social world of Twitter: topics, geography, and emotions. In: ICWSM
go back to reference Ritter A, Etzioni O, Clark S et al (2012) Open domain event extraction from Twitter. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 1104–1112 Ritter A, Etzioni O, Clark S et al (2012) Open domain event extraction from Twitter. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 1104–1112
go back to reference Romero DM, Meeder B, Kleinberg J (2011) Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on Twitter. In: Proceedings of the 20th international conference on World Wide Web. ACM, pp 695–704 Romero DM, Meeder B, Kleinberg J (2011) Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on Twitter. In: Proceedings of the 20th international conference on World Wide Web. ACM, pp 695–704
go back to reference Rosvall M, Bergstrom CT (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci 105(4):1118–1123CrossRef Rosvall M, Bergstrom CT (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci 105(4):1118–1123CrossRef
go back to reference Sankaranarayanan J, Samet H, Teitler BE, Lieberman MD, Sperling J (2009) Twitterstand: news in tweets. In: Proceedings of the 17th ACM SIGSPATIAL international conference on advances in geographic information systems. ACM, pp 42–51 Sankaranarayanan J, Samet H, Teitler BE, Lieberman MD, Sperling J (2009) Twitterstand: news in tweets. In: Proceedings of the 17th ACM SIGSPATIAL international conference on advances in geographic information systems. ACM, pp 42–51
go back to reference Schönhofen P (2009) Identifying document topics using the Wikipedia category network. Web Intell Agent Syst 7(2):195–207 Schönhofen P (2009) Identifying document topics using the Wikipedia category network. Web Intell Agent Syst 7(2):195–207
go back to reference Song S, Li Q, Bao H (2012) Detecting dynamic association among Twitter topics. In: Proceedings of the 21st international conference companion on World Wide Web. ACM, pp 605–606 Song S, Li Q, Bao H (2012) Detecting dynamic association among Twitter topics. In: Proceedings of the 21st international conference companion on World Wide Web. ACM, pp 605–606
go back to reference Sriram B, Fuhry D, Demir E, Ferhatosmanoglu H, Demirbas M (2010) Short text classification in Twitter to improve information filtering. In: Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval. ACM, pp 841–842 Sriram B, Fuhry D, Demir E, Ferhatosmanoglu H, Demirbas M (2010) Short text classification in Twitter to improve information filtering. In: Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval. ACM, pp 841–842
go back to reference Suh B, Hong L, Pirolli P, Chi EH (2010) Want to be retweeted? large scale analytics on factors impacting retweet in Twitter network. In: Social computing (socialcom), 2010 IEEE second international conference. IEEE, pp. 177–184 Suh B, Hong L, Pirolli P, Chi EH (2010) Want to be retweeted? large scale analytics on factors impacting retweet in Twitter network. In: Social computing (socialcom), 2010 IEEE second international conference. IEEE, pp. 177–184
go back to reference Yamaguchi Y, Amagasa T, Kitagawa H (2011) Tag-based user topic discovery using Twitter lists. In: Advances in social networks analysis and mining (ASONAM), 2011 international conference. IEEE, pp 13–20 Yamaguchi Y, Amagasa T, Kitagawa H (2011) Tag-based user topic discovery using Twitter lists. In: Advances in social networks analysis and mining (ASONAM), 2011 international conference. IEEE, pp 13–20
go back to reference Yang J, Leskovec J (2011) Patterns of temporal variation in online media. In: Proceedings of the fourth ACM international conference on web search and data mining. ACM, pp 177–186 Yang J, Leskovec J (2011) Patterns of temporal variation in online media. In: Proceedings of the fourth ACM international conference on web search and data mining. ACM, pp 177–186
go back to reference Yang T, Lee D, Yan S (2013) Steeler nation, 12th man, and boo birds: classifying Twitter user interests using time series. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 684–691 Yang T, Lee D, Yan S (2013) Steeler nation, 12th man, and boo birds: classifying Twitter user interests using time series. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 684–691
go back to reference Yang Y, Liu X (1999) A re-examination of text categorization methods. In: Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 42–49 Yang Y, Liu X (1999) A re-examination of text categorization methods. In: Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 42–49
go back to reference Yang Y, Pedersen JO (1997) A comparative study on feature selection in text categorization. In: ICML, vol 97, pp 412–420 Yang Y, Pedersen JO (1997) A comparative study on feature selection in text categorization. In: ICML, vol 97, pp 412–420
go back to reference Yang Z, Guo J, Cai K, Tang J, Li J, Zhang L, Su Z (2010) Understanding retweeting behaviors in social networks. In: Proceedings of the 19th ACM international conference on information and knowledge management. ACM, pp 1633–1636 Yang Z, Guo J, Cai K, Tang J, Li J, Zhang L, Su Z (2010) Understanding retweeting behaviors in social networks. In: Proceedings of the 19th ACM international conference on information and knowledge management. ACM, pp 1633–1636
go back to reference Yu L, Asur S, Huberman BA (2011) What trends in Chinese social media. In: The 5th SNA-KDD workshop’11 (SNA-KDD’11), 21 August 2011, San Diego, CA Yu L, Asur S, Huberman BA (2011) What trends in Chinese social media. In: The 5th SNA-KDD workshop’11 (SNA-KDD’11), 21 August 2011, San Diego, CA
go back to reference Zhang T, Oles FJ (2001) Text categorization based on regularized linear classification methods. Inf Retr 4(1):5–31MATHCrossRef Zhang T, Oles FJ (2001) Text categorization based on regularized linear classification methods. Inf Retr 4(1):5–31MATHCrossRef
go back to reference Zhao J, Dong L, Wu J, Xu K (2012) Moodlens: an emoticon-based sentiment analysis system for Chinese tweets. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1528–1531 Zhao J, Dong L, Wu J, Xu K (2012) Moodlens: an emoticon-based sentiment analysis system for Chinese tweets. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1528–1531
go back to reference Zhou T, Han XP, Wang BH (2008) Towards the understanding of human dynamics. In: Science matters: humanities as complex systems, pp 207–233 Zhou T, Han XP, Wang BH (2008) Towards the understanding of human dynamics. In: Science matters: humanities as complex systems, pp 207–233
Metadata
Title
Topic dynamics in Weibo: a comprehensive study
Authors
Rui Fan
Jichang Zhao
Ke Xu
Publication date
01-12-2015
Publisher
Springer Vienna
Published in
Social Network Analysis and Mining / Issue 1/2015
Print ISSN: 1869-5450
Electronic ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-015-0282-0

Other articles of this Issue 1/2015

Social Network Analysis and Mining 1/2015 Go to the issue

Premium Partner