Top

Social Network Analysis and Mining

Published in:

01-12-2021 | Original Article

A transformer-based architecture for fake news classification

Authors: Divyam Mehta, Aniket Dwivedi, Arunabha Patra, M. Anand Kumar

Published in: Social Network Analysis and Mining | Issue 1/2021

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

In today’s post-truth world, the proliferation of propaganda and falsified news poses a deadly risk of misinforming the public on a variety of issues, either through traditional media or on social media. Information people acquire through these articles and posts tends to shape their world view and provides reasoning for choices they take in their day to day lives. Thus, fake news can definitely be a malicious force, having massive real-world consequences. In this paper, we focus on classifying fake news using models based on a natural language processing framework, Bidirectional Encoder Representations from Transformers, also known as BERT. We fine-tune BERT for specific domain datasets and also make use of human justification and metadata for added performance in our models. We determine that the deep-contextualizing nature of BERT is effective for this task and obtain significant improvement over binary classification, and minimal yet important improvement in six-label classification in comparison with previously explored models.

previous article Creative social media use for Covid-19 prevention in Bangladesh: a structural equation modeling approach

next article Using targeted betweenness centrality to identify bridges to neglected users in the Twitter conversation on veteran suicide

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Ahmed H, Traore I, Saad S (2017) Detecting opinion spams and fake news using text classification. Secur Priv. https://doi.org/10.1002/spy2.9CrossRef

Alhindi T (2018a) Where is your evidence: improving fact-checking by justification modeling. In: Proceedings of the first workshop on fact extraction and verification (FEVER), Brussels, Belgium, pp 85–90

Alhindi T, Petridis S, Muresan S (2018b) Where is your evidence: improving fact-checking by justification modeling. In: Proceedings of the first workshop on fact extraction

Aphiwongsophon S, Chongstitvatana P (2018). Detecting fake news with machine learning method, pp 528–531. https://doi.org/10.1109/ECTICon.2018.8620051

Balwant MK (2019) Bidirectional LSTM Based on POS tags and CNN architecture for fake news detection. In: 2019 10th international conference on computing, communication and networking technologies (ICCCNT). https://doi.org/10.1109/ICCCNT45670.2019.8944460

Bojanowski P, Grave E, Joulin A, Mikolov T (2017) Enriching word vectors with subword information. Trans Assoc Comput Linguist 5:135–146CrossRef

Bourgonje P, Schneider JM, Rehm G (2017) From clickbait to fake news detection: an approach based on detecting the stance of headlines to articles. In: Proceedings of the 2017 EMNLP workshop: natural language processing meets journalism, pp 84–89

Chen ZF, Cheng Y (2020) Consumer response to fake news about brands on social media: the effects of self-efficacy, media trust, and persuasion knowledge on brand trust. J Prod Brand Manag 29(2):188–198CrossRef

Crawford M, Khoshgoftaar TM, Prusa JD, Richter AN, Al Najada H (2015) Survey of review spam detection using machine learning techniques. J Big Data 2(1):1–24CrossRef

Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805

Eugenio T et al (2017) Some like it hoax: automated fake news detection in social networks. arXiv preprint arXiv:1704.07506

Feng S, Banarjee R, Choi Y (2012) Syntactic stylometry for deception detection. In: ACL’12

Handley L (2018) Nearly 70 percent of people are worried about fake news as a ’weapon,’ survey says. Retrieved from https://www.cnbc.com/2018/01/22/nearly-70-percent-of-peopleare-worried-about-fake-news-as-a-weapon-survey-says.html

Hu X, Tang J, Gao H, Liu H (2014a) Social spammer detection with sentiment information. In: ICDM’14

Hu X, Tang J, Liu H (2014b) Online social spammer detection. In: AAAI’14, pp 59–65

Kaliyar RK, Goswami A, Narang P, Sinha S (2020) FNDNet–a deep convolutional neural network for fake news detection. Cogn Syst Res 61:32–44CrossRef

Kumar S, Asthana R, Upadhyay S, Upreti N, Akbar M (2020) Fake news detection using deep learning models: a novel approach. Trans Emerg Telecommun Technol 31(2):e3767

Lichterman J (2016) Nearly half of US adults get news on Facebook, Pew Says. URL: http://www.niemanlab.org/2016/05/pew-report-44-percent-of-us-adults-get-news-onfacebook

Long Y et al (2017) Fake news detection through multi-perspective speaker profiles. In: Proceedings of the eighth international joint conference on natural language processing, vol 2: short papers, pp 252–256

Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th international conference on neural information processing systems, Lake Tahoe, NV, USA, 5–10, pp 3111–3119

Nandimath JN, Katkar BS, Ghadge VU, Garad AN (2017) Efficiently detecting and analyzing spam reviews using live data feed. Int Res J EngTechnol (IRJET) 4(2):1421–1424

Rapoza K (2017) Can ’fake news’ impact the stock market? RealClearMarkets, Forbes

Resnick B (2018) False news stories travel faster and farther on Twitter than the truth, Vox.(Erişim: 09.09. 2019). https://www.vox.com/science-and-health/2018/3/8/17085928/fake-news-study-mit-science

Silverman C (2016) This analysis shows how viral fake election news stories outperformed real news On Facebook. BuzzFeed News, BuzzFeed News. www.buzzfeednews.com/article/craigsilverman/viral-fake-election-news-outperformed-real-news-on-facebook. Accessed 16 Nov 2016

Shu K, Sliva A, Wang S, Tang J, Liu H (2017) Fake news detection on social media: a data mining perspective. SIGKDD Explor Newslett 19(1):22–36CrossRef

Shu K, Bernard H, Liu H (2018) Studying fake news via network analysis: detection and mitigation

Tang J, Yi C, Huan L (2014) Mining social media with social theories: a survey. ACM SIGKDD Explor Newslett 15(2):20–29CrossRef

Vaswani A et al (2017) Attention is all you need. In: Advances in neural information processing systems, pp 6000–6010

Vis F (2014) 10. The rapid spread of misinformation online. World Economic Forum. Retrieved from http://reports.weforum.org/outlook-14/top-ten-trends-categorypage/10-the-rapid-spread-of-misinformation-online/

Vo N, Lee K (2018) The rise of guardians: fact-checking URL recommendation to combat fake news. In: The 41st international ACM SIGIR conference on research & development in information retrieval, pp 275–284

Volkova S, Shaffer K, Jang JY, Hodas N (2017) Separating facts from fiction: linguistic models to classify suspicious and trusted news posts on Twitter. In: ACL

Wakefield J (2016) Young using social media to access news. BBC News. www.bbc.com/news/uk-36528256. Accessed 15 Jun 2016

Wang WY (2017) “Liar, Liar Pants on Fire”: a new benchmark dataset for fake news detection. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol 2, Short Papers, pp 422–426

Wang C, Mahadevan S (2011) Heterogeneous domain adaptation using manifold alignment. In: Proceedings of the 22nd international joint conference on artificial intelligence, vol 2, pp 541–546

Yang, S et al (2019) Unsupervised fake news detection on social media: a generative approach. In: Proceedings of the AAAI conference on artificial intelligence, vol 33

Zhang S, Wang Y, Tan C (2018) Research on text classification for identifying fake news. In: IEEE 2018 international conference on security, pattern analysis, and cybernetics (SPAC). https://doi.org/10.1109/SPAC46244.2018.8965536

Zhang J et al (2019) FAKEDETECTOR: effective fake news detection with deep diffusive neural network. In: 2019 36th IEEE international conference. https://doi.org/10.1109/ICDE48307.2020.00180

Zhou JT, Tsang IW, Pan SJ, Tan M (2014) Heterogeneous domain adaptation for multiple classes. In: International conference on artificial intelligence and statistics, pp 103–1095

Title: A transformer-based architecture for fake news classification
Authors: Divyam Mehta
Aniket Dwivedi
Arunabha Patra
M. Anand Kumar
Publication date: 01-12-2021
Publisher: Springer Vienna
Published in: Social Network Analysis and Mining / Issue 1/2021
Print ISSN: 1869-5450
Electronic ISSN: 1869-5469
DOI: https://doi.org/10.1007/s13278-021-00738-y

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Other articles of this Issue 1/2021

A systematic evaluation of assumptions in centrality measures by empirical flow data

Studying leaders & their concerns using online social media during the times of crisis - A COVID case study

Misleading information in Spanish: a survey

Making sense of tweets using sentiment analysis on closely related topics

A novel similarity measure for the link prediction in unipartite and bipartite networks

DeepFriend: finding abnormal nodes in online social networks using dynamic deep learning

Premium Partner