Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2024

01.12.2024 | Original Article

Automatic detection of fake tweets about the COVID-19 Vaccine in Portuguese

verfasst von: Rafael Geurgas, Leandro R. Tessler

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The COVID-19 pandemic induced an unprecedented wave of disinformation in social media in Brazil. In particular, Twitter (currently X) was used to spread fake news about COVID-19 vaccines that helped to induce vaccine hesitation. This article presents a BERT-based neural network for the automatic detection of fake tweets. The optimized architecture relies upon BERTimbau, a BERT implementation pre-trained in Brazilian Portuguese, fine-tuned using three fully connected layers. All 2,857,908 tweets in Portuguese containing the word vacina (vaccine in Portuguese) were collected over 7 months. A random subset of 16,731 tweets was manually classified as real or fake. Of these, 2309 were discarded for not being about non-COVID-19 vaccines and 422 were discarded for containing irony. Of the remaining 14,000 tweets, 1144 were labeled fake and 12,856 were real. To balance the training dataset, the network was fine-tuned using the 1144 curated fake tweets and a random sample of 2000 real tweets. Optimal results were achieved by melting the last four layers of the BERTimbau. The best results obtained were 77.1% F1-score and 76.9% accuracy. These results are already acceptable for practical applications. They can be improved by increasing the size of the training dataset. A weighted 96.3% F1-score was obtained by training the same neural network architecture and hyperparameters with a larger curated balanced English language training dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
2
During the collection period there were news about a cancer vaccine being developed at the University of Oxford (McAuliffe et al. 2021) and consequently a wave of tweets concerning this subject has circulated.
 
3
Reuters Fact Check: COVID-19 vaccines do not cause HIV or AIDS. https://​www.​reuters.​com/​article/​factcheck-vaccines-hiv-idUSL1N2UW10H.
 
7
After the change of Twitter ownership and name, users can pay to have this limitation lifted.
 
Literatur
Zurück zum Zitat Devlin J, Chang M, Lee K, et al (2018) BERT: pre-training of deep bidirectional transformers for language understanding arXiv:1810.04805 Devlin J, Chang M, Lee K, et al (2018) BERT: pre-training of deep bidirectional transformers for language understanding arXiv:​1810.​04805
Zurück zum Zitat Fischer M, Haque R, Stynes P, et al (2022) Identifying fake news in brazilian portuguese. In: Rosso P, Basile V, Martínez R, et al (eds) NLDB 2022: 27th international conference on applications of natural language to information systems. Springer International Publishing, pp 111–118, https://doi.org/10.1007/978-3-031-08473-7 Fischer M, Haque R, Stynes P, et al (2022) Identifying fake news in brazilian portuguese. In: Rosso P, Basile V, Martínez R, et al (eds) NLDB 2022: 27th international conference on applications of natural language to information systems. Springer International Publishing, pp 111–118, https://​doi.​org/​10.​1007/​978-3-031-08473-7
Zurück zum Zitat Geron A (2018) Hands-on machine learning with scikit-learn and tensor flow. O’Reily Media Inc, Sebastopol, CA Geron A (2018) Hands-on machine learning with scikit-learn and tensor flow. O’Reily Media Inc, Sebastopol, CA
Zurück zum Zitat Glaskowa A, Glazkov M, Trifonov T (2021) g2tmn at constraint@aaai2021: Exploiting ct-bert and ensembling learning for covid-19 fake news detections. Combating online hostile posts in regional languages during emergency situation. Springer International Publishing, Berlin, pp 116–127. https://doi.org/10.1007/978-3-030-73696-5_12CrossRef Glaskowa A, Glazkov M, Trifonov T (2021) g2tmn at constraint@aaai2021: Exploiting ct-bert and ensembling learning for covid-19 fake news detections. Combating online hostile posts in regional languages during emergency situation. Springer International Publishing, Berlin, pp 116–127. https://​doi.​org/​10.​1007/​978-3-030-73696-5_​12CrossRef
Zurück zum Zitat Martins ADF, Cabral L, Mourão PJC et al (2021) Detection of misinformation about covid-19 in brazilian portuguese whatsapp messages. In: Métais E, Meziane F, Horacek H et al (eds) NLDB 2021: 26th international conference on applications of natural language to information systems. Springer International Publishing, Berlin, pp 199–206. https://doi.org/10.1007/978-3-030-80599-9CrossRef Martins ADF, Cabral L, Mourão PJC et al (2021) Detection of misinformation about covid-19 in brazilian portuguese whatsapp messages. In: Métais E, Meziane F, Horacek H et al (eds) NLDB 2021: 26th international conference on applications of natural language to information systems. Springer International Publishing, Berlin, pp 199–206. https://​doi.​org/​10.​1007/​978-3-030-80599-9CrossRef
Zurück zum Zitat Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. In: International conference on machine learning Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. In: International conference on machine learning
Zurück zum Zitat Robbins H, Monro S (1951) A stochastic approximation method. Ann Math Stat pp 400–407 Robbins H, Monro S (1951) A stochastic approximation method. Ann Math Stat pp 400–407
Zurück zum Zitat Sun C, Qiu X, Xu Y, et al (2019) How to fine-tune bert for text classification? In: Sun M, Huang X, Ji H, et al (eds) Chinese computational linguistics: 18th China national conference, CCL 2019, Kunming, China, October 18–20, 2019, Proceedings 18. Springer International Publishing, pp 194–206, https://doi.org/10.1007/978-3-030-32381-3_16 Sun C, Qiu X, Xu Y, et al (2019) How to fine-tune bert for text classification? In: Sun M, Huang X, Ji H, et al (eds) Chinese computational linguistics: 18th China national conference, CCL 2019, Kunming, China, October 18–20, 2019, Proceedings 18. Springer International Publishing, pp 194–206, https://​doi.​org/​10.​1007/​978-3-030-32381-3_​16
Metadaten
Titel
Automatic detection of fake tweets about the COVID-19 Vaccine in Portuguese
verfasst von
Rafael Geurgas
Leandro R. Tessler
Publikationsdatum
01.12.2024
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2024
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-024-01216-x

Weitere Artikel der Ausgabe 1/2024

Social Network Analysis and Mining 1/2024 Zur Ausgabe

Premium Partner