Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2022

01.12.2022 | Review Paper

Applications of machine learning for COVID-19 misinformation: a systematic review

verfasst von: A. R. Sanaullah, Anupam Das, Anik Das, Muhammad Ashad Kabir, Kai Shu

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The inflammable growth of misinformation on social media and other platforms during pandemic situations like COVID-19 can cause significant damage to the physical and mental stability of the people. To detect such misinformation, researchers have been applying various machine learning (ML) and deep learning (DL) techniques. The objective of this study is to systematically review, assess, and synthesize state-of-the-art research articles that have used different ML and DL techniques to detect COVID-19 misinformation. A structured literature search was conducted in the relevant bibliographic databases to ensure that the survey was solely centered on reproducible and high-quality research. We reviewed 43 papers that fulfilled our inclusion criteria out of 260 articles found from our keyword search. We have surveyed a complete pipeline of COVID-19 misinformation detection. In particular, we have identified various COVID-19 misinformation datasets and reviewed different data processing, feature extraction, and classification techniques to detect COVID-19 misinformation. In the end, the challenges and limitations in detecting COVID-19 misinformation using ML techniques and the future research directions are discussed.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Akon S, Bhuiyan A (2020) COVID-19: Rumors and Youth Vulnerabilities in Bangladesh Akon S, Bhuiyan A (2020) COVID-19: Rumors and Youth Vulnerabilities in Bangladesh
Zurück zum Zitat Alam F, Dalvi F, Shaar Set al (2021) Fighting the covid-19 infodemic in social media: a holistic perspective and a call to arms. In: ICWSM Alam F, Dalvi F, Shaar Set al (2021) Fighting the covid-19 infodemic in social media: a holistic perspective and a call to arms. In: ICWSM
Zurück zum Zitat Alkhalifa R, Yoong T, Kochkina Eet al (2020) QMUL-SDS at CheckThat! 2020: determining COVID-19 tweet check-worthiness using an enhanced CT-BERT with numeric expressions. arXiv:2008.13160 Alkhalifa R, Yoong T, Kochkina Eet al (2020) QMUL-SDS at CheckThat! 2020: determining COVID-19 tweet check-worthiness using an enhanced CT-BERT with numeric expressions. arXiv:​2008.​13160
Zurück zum Zitat Alsudias L, Rayson P (2020) COVID-19 and Arabic Twitter: how can Arab World Governments and Public Health Organizations Learn from Social Media? In: Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020. Association for Computational Linguistics Alsudias L, Rayson P (2020) COVID-19 and Arabic Twitter: how can Arab World Governments and Public Health Organizations Learn from Social Media? In: Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020. Association for Computational Linguistics
Zurück zum Zitat Antoun W, Baly F, Hajj H (2003) AraBERT, Transformer-based Model for Arabic Language Understanding, p 00104 Antoun W, Baly F, Hajj H (2003) AraBERT, Transformer-based Model for Arabic Language Understanding, p 00104
Zurück zum Zitat Boukouvalas Z, Mallinson C, Crothers Eet al (2020) Independent Component Analysis for Trustworthy Cyberspace during High Impact Events: An Application to Covid-19. arXiv:2006.01284 Boukouvalas Z, Mallinson C, Crothers Eet al (2020) Independent Component Analysis for Trustworthy Cyberspace during High Impact Events: An Application to Covid-19. arXiv:​2006.​01284
Zurück zum Zitat Cheng M, Nazarian S, Bogdan P (2020) VRoC: Variational Autoencoder-Aided Multi-Task Rumor Classifier Based on Text. In: proceedings of The web conference 2020. Association for Computing Machinery, New York, NY, USA, WWW ’20, pp 2892–2898, https://doi.org/10.1145/3366423.3380054 Cheng M, Nazarian S, Bogdan P (2020) VRoC: Variational Autoencoder-Aided Multi-Task Rumor Classifier Based on Text. In: proceedings of The web conference 2020. Association for Computing Machinery, New York, NY, USA, WWW ’20, pp 2892–2898, https://​doi.​org/​10.​1145/​3366423.​3380054
Zurück zum Zitat Chung J, Gulcehre C, Cho Ket al (2014) Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. In: NIPS 2014 Workshop on Deep Learning, December 2014 Chung J, Gulcehre C, Cho Ket al (2014) Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. In: NIPS 2014 Workshop on Deep Learning, December 2014
Zurück zum Zitat Cui L, Wang S, Lee D (2019) Same: sentiment-aware multi-modal embedding for detecting fake news. In: Proceedings of the 2019 IEEE/ACM international conference on advances in social networks analysis and mining, ASONAM 2019, pp 41–48, https://doi.org/10.1145/3341161.3342894 Cui L, Wang S, Lee D (2019) Same: sentiment-aware multi-modal embedding for detecting fake news. In: Proceedings of the 2019 IEEE/ACM international conference on advances in social networks analysis and mining, ASONAM 2019, pp 41–48, https://​doi.​org/​10.​1145/​3341161.​3342894
Zurück zum Zitat Devlin J, Chang MW, Lee Ket al (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, pp 4171–4186, https://doi.org/10.18653/v1/N19-1423 Devlin J, Chang MW, Lee Ket al (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, pp 4171–4186, https://​doi.​org/​10.​18653/​v1/​N19-1423
Zurück zum Zitat Dharawat A, Lourentzou I, Morales Aet al (2020) Drink bleach or do what now? Covid-HeRA: a dataset for risk-informed health decision making in the presence of COVID19 misinformation. arXiv:2010.08743 Dharawat A, Lourentzou I, Morales Aet al (2020) Drink bleach or do what now? Covid-HeRA: a dataset for risk-informed health decision making in the presence of COVID19 misinformation. arXiv:​2010.​08743
Zurück zum Zitat Dimitrov D, Baran E, Fafalios Pet al (2020) TweetsCOV19—A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic. International conference on information and knowledge management, proceedings (June):2991–2998. https://doi.org/10.1145/3340531.3412765 Dimitrov D, Baran E, Fafalios Pet al (2020) TweetsCOV19—A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic. International conference on information and knowledge management, proceedings (June):2991–2998. https://​doi.​org/​10.​1145/​3340531.​3412765
Zurück zum Zitat Fernandez M, Alani H (2018) Online misinformation: challenges and future directions. In: Companion proceedings of the web conference 2018. International world wide web conferences steering committee, Republic and Canton of Geneva, CHE, WWW ’18, p 595-602, https://doi.org/10.1145/3184558.3188730 Fernandez M, Alani H (2018) Online misinformation: challenges and future directions. In: Companion proceedings of the web conference 2018. International world wide web conferences steering committee, Republic and Canton of Geneva, CHE, WWW ’18, p 595-602, https://​doi.​org/​10.​1145/​3184558.​3188730
Zurück zum Zitat Haouari F, Hasanain M, Suwaileh Ret al (2020) ArCOV19-Rumors: Arabic COVID-19 Twitter Dataset for Misinformation Detection. arXiv:2010.08768 Haouari F, Hasanain M, Suwaileh Ret al (2020) ArCOV19-Rumors: Arabic COVID-19 Twitter Dataset for Misinformation Detection. arXiv:​2010.​08768
Zurück zum Zitat Ji Y, Eisenstein J (2014) Representation learning for text-level discourse parsing. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Baltimore, Maryland, pp 13–24, https://doi.org/10.3115/v1/P14-1002 Ji Y, Eisenstein J (2014) Representation learning for text-level discourse parsing. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Baltimore, Maryland, pp 13–24, https://​doi.​org/​10.​3115/​v1/​P14-1002
Zurück zum Zitat Joulin A, Grave E, Bojanowski Pet al (2017) Bag of tricks for efficient text classification. In: proceedings of the 15th conference of the European chapter of the association for computational linguistics: Volume 2, Short Papers. Association for Computational Linguistics, Valencia, Spain, pp 427–431 Joulin A, Grave E, Bojanowski Pet al (2017) Bag of tricks for efficient text classification. In: proceedings of the 15th conference of the European chapter of the association for computational linguistics: Volume 2, Short Papers. Association for Computational Linguistics, Valencia, Spain, pp 427–431
Zurück zum Zitat Kaliyar R, Goswami A, Narang P (2021) A hybrid model for effective fake news detection with a novel COVID-19 dataset. In: ICAART 2021—Proceedings of the 13th international conference on agents and artificial intelligence, pp 1066–1072, https://doi.org/10.5220/0010316010661072 Kaliyar R, Goswami A, Narang P (2021) A hybrid model for effective fake news detection with a novel COVID-19 dataset. In: ICAART 2021—Proceedings of the 13th international conference on agents and artificial intelligence, pp 1066–1072, https://​doi.​org/​10.​5220/​0010316010661072​
Zurück zum Zitat Kar D, Bhardwaj M, Samanta Set al (2020) No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection. arXiv:2010.06906 Kar D, Bhardwaj M, Samanta Set al (2020) No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection. arXiv:​2010.​06906
Zurück zum Zitat Karlova N, Fisher K (2013) A social diffusion model of misinformation and disinformation for understanding human information behaviour. Inf Res 18 Karlova N, Fisher K (2013) A social diffusion model of misinformation and disinformation for understanding human information behaviour. Inf Res 18
Zurück zum Zitat Kim J, Tabibian B, Oh Aet al (2018) Leveraging the Crowd to Detect and Reduce the Spread of Fake News and Misinformation. In: Proceedings of the eleventh ACM international conference on web search and data mining. Association for Computing Machinery, New York, NY, USA, WSDM ’18, pp 324–332, https://doi.org/10.1145/3159652.3159734 Kim J, Tabibian B, Oh Aet al (2018) Leveraging the Crowd to Detect and Reduce the Spread of Fake News and Misinformation. In: Proceedings of the eleventh ACM international conference on web search and data mining. Association for Computing Machinery, New York, NY, USA, WSDM ’18, pp 324–332, https://​doi.​org/​10.​1145/​3159652.​3159734
Zurück zum Zitat Koirala A (2020) Covid-19 fake news classification using deep learning. Master’s thesis, Asian Institute of Technology, Thailand Koirala A (2020) Covid-19 fake news classification using deep learning. Master’s thesis, Asian Institute of Technology, Thailand
Zurück zum Zitat Li Q, Zhang Q, Si Let al (2019) Rumor detection on social media: datasets, methods and opportunities. In: proceedings of the second workshop on natural language processing for internet freedom: censorship, disinformation, and propaganda. Association for Computational Linguistics, Hong Kong, China, pp 66–75, https://doi.org/10.18653/v1/D19-5008 Li Q, Zhang Q, Si Let al (2019) Rumor detection on social media: datasets, methods and opportunities. In: proceedings of the second workshop on natural language processing for internet freedom: censorship, disinformation, and propaganda. Association for Computational Linguistics, Hong Kong, China, pp 66–75, https://​doi.​org/​10.​18653/​v1/​D19-5008
Zurück zum Zitat Lin X, Liao X, Xu Tet al (2019) Rumor Detection with Hierarchical Recurrent Convolutional Neural Network. In: Tang J, Kan MY, Zhao Det al (eds) Natural Language Processing and Chinese Computing. Springer International Publishing, Cham, pp 338–348, https://doi.org/10.1007/978-3-030-32236-6_30 Lin X, Liao X, Xu Tet al (2019) Rumor Detection with Hierarchical Recurrent Convolutional Neural Network. In: Tang J, Kan MY, Zhao Det al (eds) Natural Language Processing and Chinese Computing. Springer International Publishing, Cham, pp 338–348, https://​doi.​org/​10.​1007/​978-3-030-32236-6_​30
Zurück zum Zitat Liu P, Qiu X, Huang X (2016) Recurrent neural network for text classification with multi-task learning. In: proceedings of the twenty-fifth international joint conference on artificial intelligence. AAAI Press, IJCAI’16, pp 2873–2879 Liu P, Qiu X, Huang X (2016) Recurrent neural network for text classification with multi-task learning. In: proceedings of the twenty-fifth international joint conference on artificial intelligence. AAAI Press, IJCAI’16, pp 2873–2879
Zurück zum Zitat Medina Serrano JC, Papakyriakopoulos O, Hegelich S (2020) NLP-based Feature Extraction for the Detection of COVID-19 Misinformation Videos on YouTube. In: proceedings of the 1st workshop on NLP for COVID-19 at ACL 2020. Association for Computational Linguistics Medina Serrano JC, Papakyriakopoulos O, Hegelich S (2020) NLP-based Feature Extraction for the Detection of COVID-19 Misinformation Videos on YouTube. In: proceedings of the 1st workshop on NLP for COVID-19 at ACL 2020. Association for Computational Linguistics
Zurück zum Zitat Micallef N, He B, Kumar Set al (2020) The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic. arXiv:2011.05773 Micallef N, He B, Kumar Set al (2020) The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic. arXiv:​2011.​05773
Zurück zum Zitat Mikolov T, Sutskever I, Chen Ket al (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th international conference on neural information processing systems—Volume 2. Curran Associates Inc., Red Hook, NY, USA, NIPS’13, pp 3111–3119 Mikolov T, Sutskever I, Chen Ket al (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th international conference on neural information processing systems—Volume 2. Curran Associates Inc., Red Hook, NY, USA, NIPS’13, pp 3111–3119
Zurück zum Zitat Müller M, Salathé M, Kummervold PE (2020) COVID-Twitter-BERT: a natural language processing model to analyse COVID-19 Content on Twitter. arXiv:2005.07503 Müller M, Salathé M, Kummervold PE (2020) COVID-Twitter-BERT: a natural language processing model to analyse COVID-19 Content on Twitter. arXiv:​2005.​07503
Zurück zum Zitat Pennebaker JW, Francis ME, Booth RJ (2001) Linguistic inquiry and word count: Liwc 2001. Mahway: Lawrence Erlbaum Associates 71(2001):2001 Pennebaker JW, Francis ME, Booth RJ (2001) Linguistic inquiry and word count: Liwc 2001. Mahway: Lawrence Erlbaum Associates 71(2001):2001
Zurück zum Zitat Pennington J, Socher R, Manning C (2014) GloVe: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Doha, Qatar, pp 1532–1543, https://doi.org/10.3115/v1/D14-1162 Pennington J, Socher R, Manning C (2014) GloVe: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Doha, Qatar, pp 1532–1543, https://​doi.​org/​10.​3115/​v1/​D14-1162
Zurück zum Zitat Peters M, Neumann M, Iyyer Met al (2018) Deep contextualized word representations. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, Volume 1 (Long Papers). Association for Computational Linguistics, New Orleans, Louisiana, pp 2227–2237, https://doi.org/10.18653/v1/N18-1202 Peters M, Neumann M, Iyyer Met al (2018) Deep contextualized word representations. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, Volume 1 (Long Papers). Association for Computational Linguistics, New Orleans, Louisiana, pp 2227–2237, https://​doi.​org/​10.​18653/​v1/​N18-1202
Zurück zum Zitat Reimers N, Gurevych I (2019) Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In: proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, pp 3980–3990, https://doi.org/10.18653/v1/D19-1410 Reimers N, Gurevych I (2019) Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In: proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, pp 3980–3990, https://​doi.​org/​10.​18653/​v1/​D19-1410
Zurück zum Zitat Shu K, Wang S, Liu H (2019b) Beyond news contents: the role of social context for fake news detection. In: proceedings of the twelfth ACM international conference on web search and data mining. Association for Computing Machinery, New York, NY, USA, WSDM ’19, pp 312–320, https://doi.org/10.1145/3289600.3290994 Shu K, Wang S, Liu H (2019b) Beyond news contents: the role of social context for fake news detection. In: proceedings of the twelfth ACM international conference on web search and data mining. Association for Computing Machinery, New York, NY, USA, WSDM ’19, pp 312–320, https://​doi.​org/​10.​1145/​3289600.​3290994
Zurück zum Zitat Song X, Petrak J, Jiang Yet al (2020) Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus. arXiv:2006.03354 Song X, Petrak J, Jiang Yet al (2020) Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus. arXiv:​2006.​03354
Zurück zum Zitat Vaswani A, Shazeer N, Parmar Net al (2017) Attention is all you need. In: Proceedings of the 31st international conference on neural information processing systems. Curran Associates Inc., Red Hook, NY, USA, NIPS’17, pp 6000–6010 Vaswani A, Shazeer N, Parmar Net al (2017) Attention is all you need. In: Proceedings of the 31st international conference on neural information processing systems. Curran Associates Inc., Red Hook, NY, USA, NIPS’17, pp 6000–6010
Zurück zum Zitat Wang H, Gan J, Chen Jet al (2021) Automatic detecting for covid-19-related rumors data on internet. In: 2021 9th international conference on communications and broadband networking. Association for Computing Machinery, New York, NY, USA, ICCBN 2021, pp 22–26, https://doi.org/10.1145/3456415.3456420 Wang H, Gan J, Chen Jet al (2021) Automatic detecting for covid-19-related rumors data on internet. In: 2021 9th international conference on communications and broadband networking. Association for Computing Machinery, New York, NY, USA, ICCBN 2021, pp 22–26, https://​doi.​org/​10.​1145/​3456415.​3456420
Zurück zum Zitat WHO (2020) Novel Coronavirus. Situation Report–205 205(6):1–19 WHO (2020) Novel Coronavirus. Situation Report–205 205(6):1–19
Zurück zum Zitat Yang Z, Yang D, Dyer Cet al (2016) Hierarchical attention networks for document classification. In: proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies. Association for Computational Linguistics, San Diego, California, pp 1480–1489, https://doi.org/10.18653/v1/N16-1174 Yang Z, Yang D, Dyer Cet al (2016) Hierarchical attention networks for document classification. In: proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies. Association for Computational Linguistics, San Diego, California, pp 1480–1489, https://​doi.​org/​10.​18653/​v1/​N16-1174
Zurück zum Zitat Zhang T, Kishore V, Wu Fet al (2020) Bertscore: evaluating text generation with bert. In: international conference on learning representations Zhang T, Kishore V, Wu Fet al (2020) Bertscore: evaluating text generation with bert. In: international conference on learning representations
Zurück zum Zitat Zhou P, Shi W, Tian Jet al (2016) Attention-based bidirectional long short-term memory networks for relation classification. In: proceedings of the 54th annual meeting of the association for computational linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Berlin, Germany, pp 207–212, https://doi.org/10.18653/v1/P16-2034 Zhou P, Shi W, Tian Jet al (2016) Attention-based bidirectional long short-term memory networks for relation classification. In: proceedings of the 54th annual meeting of the association for computational linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Berlin, Germany, pp 207–212, https://​doi.​org/​10.​18653/​v1/​P16-2034
Metadaten
Titel
Applications of machine learning for COVID-19 misinformation: a systematic review
verfasst von
A. R. Sanaullah
Anupam Das
Anik Das
Muhammad Ashad Kabir
Kai Shu
Publikationsdatum
01.12.2022
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2022
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-022-00921-9

Weitere Artikel der Ausgabe 1/2022

Social Network Analysis and Mining 1/2022 Zur Ausgabe

Premium Partner