Published in: Social Network Analysis and Mining 1/2021

01-12-2021 | Original Article

A transformer-based architecture for fake news classification

Authors: Divyam Mehta, Aniket Dwivedi, Arunabha Patra, M. Anand Kumar

Abstract

In today’s post-truth world, the proliferation of propaganda and falsified news poses a deadly risk of misinforming the public on a variety of issues, whether through traditional media or social media. The information people acquire from these articles and posts tends to shape their world view and informs the choices they make in their day-to-day lives. Fake news can therefore be a malicious force with massive real-world consequences. In this paper, we focus on classifying fake news using models based on Bidirectional Encoder Representations from Transformers (BERT), a natural language processing framework. We fine-tune BERT on domain-specific datasets and also use human justification and metadata to improve model performance. We find that the deep contextual representations of BERT are effective for this task, obtaining a significant improvement in binary classification and a minimal yet important improvement in six-label classification over previously explored models.
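As a rough illustration of the setup the abstract describes, the sketch below shows one way a statement, its human justification, and metadata could be packed into a BERT-style sentence-pair input, and how six-way labels could be collapsed to binary. It is not the authors' implementation: the `Claim` class, the `build_input` and `to_binary` helpers, the LIAR-style label names, the midpoint split, and the metadata fields are all assumptions made for this example.

```python
from dataclasses import dataclass, field

# Assumed six-way label scheme (LIAR-style), ordered from least to most truthful.
# The abstract does not name its labels; these are illustrative only.
SIX_LABELS = ["pants-fire", "false", "barely-true", "half-true", "mostly-true", "true"]


@dataclass
class Claim:
    """Hypothetical container for one example."""
    statement: str
    justification: str = ""
    metadata: dict = field(default_factory=dict)  # e.g. speaker, subject


def to_binary(label: str) -> int:
    """Collapse a six-way label to binary (0 = fake, 1 = real).

    The split at the midpoint of SIX_LABELS is an assumption.
    """
    return int(SIX_LABELS.index(label) >= 3)


def build_input(claim: Claim) -> str:
    """Pack the statement and its supporting text into a sentence pair.

    Segment A is the statement; segment B is the justification followed by
    metadata rendered as "key: value" pairs in sorted key order. The [CLS]
    and [SEP] markers are written out only to make the layout visible; a
    real BERT tokenizer inserts them itself.
    """
    meta = " ".join(f"{k}: {v}" for k, v in sorted(claim.metadata.items()))
    second = " ".join(part for part in (claim.justification, meta) if part)
    text = f"[CLS] {claim.statement} [SEP]"
    if second:
        text += f" {second} [SEP]"
    return text


example = Claim(
    statement="The sky is green.",
    justification="No source supports this.",
    metadata={"speaker": "jane-doe", "subject": "weather"},
)
packed = build_input(example)
binary = to_binary("barely-true")
```

In practice the packed pair would be tokenized and passed to a BERT model with a classification head of two or six outputs. Whether metadata is concatenated as text, as here, or fed through a separate branch is a design choice the abstract does not specify.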

Metadata
Title
A transformer-based architecture for fake news classification
Authors
Divyam Mehta
Aniket Dwivedi
Arunabha Patra
M. Anand Kumar
Publication date
01-12-2021
Publisher
Springer Vienna
Published in
Social Network Analysis and Mining / Issue 1/2021
Print ISSN: 1869-5450
Electronic ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-021-00738-y
