Abstract
In this chapter, we present a feature-driven approach to detect PSM accounts in social media. Inspired by the literature, we set out to assess PSMs from three broad perspectives: (1) user-related information (e.g., user activity, profile characteristics), (2) source-related information (i.e., information linked via URLs shared by users) and (3) content-related information (e.g., tweets characteristics). For the user-related information, we investigate malicious signals using causality analysis (i.e., if user is frequently a cause of viral cascades) and profile characteristics (e.g., number of followers, etc.). For the source-related information, we explore various malicious properties linked to URLs (e.g., URL address, content of the associated website, etc.). Finally, for the content-related information, we examine attributes (e.g., number of hashtags, suspicious hashtags, etc.) from tweets posted by users. Experiments on real-world Twitter data from different countries demonstrate the effectiveness of the proposed approach in identifying PSM users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
In this work we used the following: https://voiceofeurope.com/, https://newsvoice.se/, https://nyadagbladet.se/, https://www.friatider.se/, or the pro-Russian website https://ok.ru/.
- 2.
A codebook is survey research approach to provide a guide for framing categories and coding responses to the categories definitions.
References
H. Alvari, P. Shakarian, Causal inference for early detection of pathogenic social media accounts. Preprint (2018). arXiv:1806.09787
H. Alvari, P. Shakarian, Hawkes process for understanding the influence of pathogenic social media accounts, in 2019 2nd International Conference on Data Intelligence and Security (ICDIS), pp. 36–42 (June 2019)
H. Alvari, E. Shaabani, P. Shakarian, Early identification of pathogenic social media accounts. IEEE Intelligent and Security Informatics (2018). arXiv:1809.09331
H. Alvari, E. Shaabani, S. Sarkar, G. Beigi, P. Shakarian, Less is more: Semi-supervised causal inference for detecting pathogenic users in social media, in Companion Proceedings of The 2019 World Wide Web Conference, WWW ’19 (Association for Computing Machinery, New York, NY, USA, 2019), pp. 154–161
H. Alvari, G. Beigi, S. Sarkar, S. W. Ruston, S. R. Corman, H. Davulcu, P. Shakarian, A feature-driven approach for identifying pathogenic social media accounts. Preprint (2020). arXiv:2001.04624
R. Baly, G. Karadzhov, D. Alexandrov, J. Glass, P. Nakov, Predicting factuality of reporting and bias of news media sources. Preprint (2018). arXiv:1810.01765
D.P. Baron, Persistent media bias. J. Public Econ. 90(1-2), 1–36 (2006)
D.M. Blei, A.Y. Ng, M.I. Jordan, Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
S. DellaVigna, E. Kaplan, The Fox News effect: Media bias and voting. Q. J. Econ. 122(3), 1187–1234 (2007)
R.M. Entman, Framing: Toward clarification of a fractured paradigm. J. Commun. 43(4), 51–58 (1993)
E. Ferrara, O. Varol, C. Davis, F. Menczer, A. Flammini, The rise of social bots. Commun. ACM 59(7), 96–104 (2016)
A. Goyal, F. Bonchi, L.V. Lakshmanan, Learning influence probabilities in social networks, in WSDM (2010)
S. Hochreiter, J. Schmidhuber, Long short-term memory. Neural Computation 9(8), 1735–1780 (1997)
B.D. Horne, S. Adali, This just in: Fake news packs a lot in title, uses simpler, repetitive content in text body, more similar to satire than real news, in Eleventh International AAAI Conference on Web and Social Media (2017)
J.P. Kincaid, R.P. Fishburne Jr., R.L. Rogers, B.S. Chissom, Derivation of new readability formulas (automated readability index, fog count and Flesch reading ease formula) for navy enlisted personnel (1975)
S. Kudugunta, E. Ferrara, Deep neural networks for bot detection. Preprint (2018). arXiv:1802.04289
C. Manning, R. Prabhakar, S. Hinrich, Introduction to Information Retrieval, vol. 1 (Cambridge University Press, Cambridge, 2008)
T. Mikolov, I. Sutskever, K. Chen, G.S. Corrado, J. Dean, Distributed representations of words and phrases and their compositionality, in Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
F. Morstatter, L. Wu, U. Yavanoglu, S.R. Corman, H. Liu, Identifying framing bias in online news. ACM Trans. Soc. Comput. 1(2), 5 (2018)
T.M. Phuong et al., Gender prediction using browsing history, in Knowledge and Systems Engineering (Springer, 2014), pp. 271–283
D.A. Scheufele, D. Tewksbury, Framing, agenda setting, and priming: The evolution of three media effects models. J. Commun. 57(1), 9–20 (2006)
C. Shao, G.L. Ciampaglia, O. Varol, A. Flammini, F. Menczer, The spread of fake news by social bots. Preprint (2017). arXiv:1707.07592
A.B. Soliman, K. Eissa, S.R. El-Beltagy, Aravec: A set of Arabic word embedding models for use in Arabic NLP. Procedia Comput. Sci. 117, 256–265 (2017)
P. Suppes, A probabilistic theory of causality (1970)
O. Varol, E. Ferrara, C.A. Davis, F. Menczer, A. Flammini, Online human-bot interactions: Detection, estimation, and characterization, in ICWSM (2017)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Alvari, H., Shaabani, E., Shakarian, P. (2021). Feature-Driven Method for Identifying Pathogenic Social Media Accounts. In: Identification of Pathogenic Social Media Accounts. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-030-61431-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-61431-7_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61430-0
Online ISBN: 978-3-030-61431-7
eBook Packages: Computer ScienceComputer Science (R0)