2020 | OriginalPaper | Book chapter

4. Dominating Factors Affecting Individual Retweeting Behavior

Authors: Juan Shi, Kin Keung Lai, Gang Chen

Published in: Individual Retweeting Behavior on Social Networking Sites

Publisher: Springer Singapore

Abstract

In Chap. 3, we identified several factors that have a positive impact on individual retweeting behavior, such as topical relevance, information richness, and social tie strength. One may wonder whether these factors matter only in theory, or whether they remain important when predicting individual retweeting behavior. Furthermore, to the best of our knowledge, virtually no scholarly effort has been made to determine the relative importance of these factors when predicting an individual's retweeting decision. Instead, large numbers of features are introduced into prediction models indiscriminately, without examining their relevance. Redundant features not only increase the cost of data collection but also tend to produce an overfitted model that predicts poorly on future observations not used in training, a problem known as the curse of dimensionality. It is therefore necessary to rank these factors and identify the dominating ones. To tackle these problems, we first pick a specific user to illustrate the feature (also called factor in this monograph) selection process. The results confirm that only a small subset of predictors has an influential impact on individual retweeting behavior. Then, based on a large sample, we identify factors that are important not only in theory, in terms of explaining individual retweeting behavior, but also in practice, in terms of predicting it. Finally, we obtain a subset of dominating factors that not only saves the cost of collecting trivial features but also improves prediction performance to some extent under certain classification algorithms, such as support vector classification (SVC) or logistic regression.
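
The idea of ranking factors and keeping only the dominating ones can be illustrated with a minimal sketch. The abstract does not state which selection procedure the chapter uses, so the filter-style ranking by absolute Pearson correlation below is an assumption, and the feature names, toy values, and the helper names `pearson` and `rank_features` are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / sqrt(var_x * var_y)


def rank_features(features, labels):
    """Return (name, |r|) pairs sorted from most to least relevant."""
    scores = {name: abs(pearson(col, labels)) for name, col in features.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data (not from the chapter): 1 = retweeted, 0 = not retweeted.
labels = [1, 1, 0, 0, 1, 0]
features = {
    "topical_relevance": [0.9, 0.8, 0.2, 0.1, 0.7, 0.3],
    "tie_strength": [0.6, 0.9, 0.4, 0.2, 0.8, 0.1],
    "noise": [0.5, 0.1, 0.5, 0.1, 0.3, 0.3],
}

ranking = rank_features(features, labels)
top_k = [name for name, _ in ranking[:2]]  # keep the two highest-ranked factors
```

In a setup like this, only the features in `top_k` would be collected and passed to a classifier such as SVC or logistic regression; whether that improves held-out performance has to be checked empirically.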

Footnotes
2
0.00–0.19: very weak; 0.20–0.39: weak; 0.40–0.59: moderate; 0.60–0.79: strong; 0.80–1.00: very strong.
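
The bands in this footnote can be written as a small lookup. This is a sketch under two assumptions not stated in the footnote: the bands are applied to the absolute value of a correlation coefficient, and the gaps between the two-decimal bounds (for example between 0.19 and 0.20) are closed by treating each band as a half-open interval. The function name `correlation_strength` is hypothetical.

```python
def correlation_strength(r):
    """Map a correlation coefficient to the verbal label from the footnote."""
    magnitude = abs(r)  # assumption: sign is ignored, only strength is labeled
    if magnitude > 1:
        raise ValueError("a correlation coefficient must lie in [-1, 1]")
    if magnitude < 0.20:
        return "very weak"
    if magnitude < 0.40:
        return "weak"
    if magnitude < 0.60:
        return "moderate"
    if magnitude < 0.80:
        return "strong"
    return "very strong"


label = correlation_strength(-0.45)  # falls in the 0.40-0.59 band
```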
 
Metadata
Title: Dominating Factors Affecting Individual Retweeting Behavior
Authors: Juan Shi, Kin Keung Lai, Gang Chen
Copyright year: 2020
Publisher: Springer Singapore
DOI: https://doi.org/10.1007/978-981-15-7376-7_4