Skip to main content
Erschienen in: Neural Computing and Applications 2/2023

01.10.2022 | Original Article

Multimodal tweet classification in disaster response systems using transformer-based bidirectional attention model

verfasst von: Rani Koshy, Sivasankar Elango

Erschienen in: Neural Computing and Applications | Ausgabe 2/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The goal of this research is to use social media to gain situational awareness in the wake of a crisis. With the developments in information and communication technologies, social media became the de facto norm for gathering and disseminating information. We present a method for classifying informative tweets from the massive volume of user tweets on social media. Once the informative tweets have been found, emergency responders can use them to gain situational awareness so that recovery actions can be carried out efficiently. The majority of previous research has focused on either text data or images in tweets. A thorough review of the literature illustrates that text and image carry complementary information. The proposed method is a deep learning framework which utilizes multiple input modalities, specifically text and image from a user-generated tweet. We mainly focused to devise an improved multimodal fusion strategy. The proposed system has a transformer-based image and text models. The main building blocks include fine-tuned RoBERTa model for text, Vision Transformer model for image, biLSTM and attention mechanism. We put forward a multiplicative fusion strategy for image and text inputs. Extensive experiments have been done on various network architectures with seven datasets spanning different types of disasters, including wildfire, hurricane, earth-quake and flood. Several state-of-the-art approaches were surpassed by our system. It showed good accuracy in the range of 94–98%. The results showed that identifying the interaction between multiple related modalities will enhance the quality of a deep learning classifier.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alam F, Imran M, Ofli F (2017) Image4act: online social media image processing for disaster response. In: Proceedings of the 2017 IEEE/ACM international conference on advances in social networks analysis and mining 2017, pp 601–604 Alam F, Imran M, Ofli F (2017) Image4act: online social media image processing for disaster response. In: Proceedings of the 2017 IEEE/ACM international conference on advances in social networks analysis and mining 2017, pp 601–604
2.
Zurück zum Zitat Alam F, Ofli F, Imran M (2018) Crisismmd: multimodal twitter datasets from natural disasters. In: Twelfth international AAAI conference on web and social media Alam F, Ofli F, Imran M (2018) Crisismmd: multimodal twitter datasets from natural disasters. In: Twelfth international AAAI conference on web and social media
3.
Zurück zum Zitat Alam F, Ofli F, Imran M (2018) Processing social media images by combining human and machine computing during crises. Int J Hum Comput Interact 34(4):311–327CrossRef Alam F, Ofli F, Imran M (2018) Processing social media images by combining human and machine computing during crises. Int J Hum Comput Interact 34(4):311–327CrossRef
4.
Zurück zum Zitat Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:​1409.​0473
5.
Zurück zum Zitat Balakrishnan V, Shi Z, Law CL et al (2021) A deep learning approach in predicting products’ sentiment ratings: a comparative analysis. J Supercomput 2021:1–21 Balakrishnan V, Shi Z, Law CL et al (2021) A deep learning approach in predicting products’ sentiment ratings: a comparative analysis. J Supercomput 2021:1–21
6.
Zurück zum Zitat Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–1828CrossRef Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–1828CrossRef
7.
Zurück zum Zitat Chaudhuri N, Bose I (2019) Application of image analytics for disaster response in smart cities. In: Proceedings of the 52nd Hawaii international conference on system sciences Chaudhuri N, Bose I (2019) Application of image analytics for disaster response in smart cities. In: Proceedings of the 52nd Hawaii international conference on system sciences
8.
Zurück zum Zitat Devlin J, Chang MW, Lee K et al (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 Devlin J, Chang MW, Lee K et al (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:​1810.​04805
9.
Zurück zum Zitat Dosovitskiy A, Beyer L, Kolesnikov A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 Dosovitskiy A, Beyer L, Kolesnikov A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:​2010.​11929
10.
Zurück zum Zitat Endsley MR (1995) Toward a theory of situation awareness in dynamic systems. Hum Factors 37(1):32–64CrossRef Endsley MR (1995) Toward a theory of situation awareness in dynamic systems. Hum Factors 37(1):32–64CrossRef
11.
Zurück zum Zitat Flin R, O’connor P, Crichton M (2017) Safety at the sharp end: a guide to non-technical skills. CRC Press, Boca RatonCrossRef Flin R, O’connor P, Crichton M (2017) Safety at the sharp end: a guide to non-technical skills. CRC Press, Boca RatonCrossRef
12.
Zurück zum Zitat Gautam AK, Misra L, Kumar A et al (2019) Multimodal analysis of disaster tweets. In: 2019 IEEE fifth international conference on multimedia Big Data (BigMM). IEEE, pp 94–103 Gautam AK, Misra L, Kumar A et al (2019) Multimodal analysis of disaster tweets. In: 2019 IEEE fifth international conference on multimedia Big Data (BigMM). IEEE, pp 94–103
13.
Zurück zum Zitat Ghafarian SH, Yazdi HS (2020) Identifying crisis-related informative tweets using learning on distributions. Inf Process Manag 57(2):102145CrossRef Ghafarian SH, Yazdi HS (2020) Identifying crisis-related informative tweets using learning on distributions. Inf Process Manag 57(2):102145CrossRef
14.
Zurück zum Zitat Gunes H, Piccardi M (2008) Automatic temporal segment detection and affect recognition from face and body display. IEEE Trans Syst Man Cybern Part B (Cybernetics) 39(1):64–84CrossRef Gunes H, Piccardi M (2008) Automatic temporal segment detection and affect recognition from face and body display. IEEE Trans Syst Man Cybern Part B (Cybernetics) 39(1):64–84CrossRef
15.
Zurück zum Zitat Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, AmsterdamMATH Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, AmsterdamMATH
16.
Zurück zum Zitat He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
17.
Zurück zum Zitat Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef
18.
Zurück zum Zitat Huang G, Liu Z, Van Der Maaten L et al (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708 Huang G, Liu Z, Van Der Maaten L et al (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
19.
Zurück zum Zitat Kejriwal M, Zhou P (2020) On detecting urgency in short crisis messages using minimal supervision and transfer learning. Soc Netw Anal Min 10(1):1–12CrossRef Kejriwal M, Zhou P (2020) On detecting urgency in short crisis messages using minimal supervision and transfer learning. Soc Netw Anal Min 10(1):1–12CrossRef
20.
Zurück zum Zitat Kumar A, Singh JP, Dwivedi YK et al (2020) A deep multi-modal neural network for informative twitter content classification during emergencies. Ann Oper Res 2020:1–32 Kumar A, Singh JP, Dwivedi YK et al (2020) A deep multi-modal neural network for informative twitter content classification during emergencies. Ann Oper Res 2020:1–32
21.
Zurück zum Zitat Kyrkou C, Theocharides T (2019) Deep-learning-based aerial image classification for emergency response applications using unmanned aerial vehicles. In: CVPR workshops, pp 517–525 Kyrkou C, Theocharides T (2019) Deep-learning-based aerial image classification for emergency response applications using unmanned aerial vehicles. In: CVPR workshops, pp 517–525
22.
23.
Zurück zum Zitat Madichetty S, Muthukumarasamy S (2020) Detection of situational information from twitter during disaster using deep learning models. Sādhanā 45(1):1–13CrossRef Madichetty S, Muthukumarasamy S (2020) Detection of situational information from twitter during disaster using deep learning models. Sādhanā 45(1):1–13CrossRef
24.
Zurück zum Zitat Madichetty S, Sridevi M (2019) Disaster damage assessment from the tweets using the combination of statistical features and informative words. Soc Netw Anal Min 9(1):1–11CrossRef Madichetty S, Sridevi M (2019) Disaster damage assessment from the tweets using the combination of statistical features and informative words. Soc Netw Anal Min 9(1):1–11CrossRef
25.
Zurück zum Zitat Madichetty S, Sridevi M (2021) A novel method for identifying the damage assessment tweets during disaster. Future Gener Comput Syst 116:440–454CrossRef Madichetty S, Sridevi M (2021) A novel method for identifying the damage assessment tweets during disaster. Future Gener Comput Syst 116:440–454CrossRef
26.
Zurück zum Zitat Madichetty S, Muthukumarasamy S, Jayadev P (2021a) Multi-modal classification of twitter data during disasters for humanitarian response. J Ambient Intell Hum Comput 12(11):10223–10237CrossRef Madichetty S, Muthukumarasamy S, Jayadev P (2021a) Multi-modal classification of twitter data during disasters for humanitarian response. J Ambient Intell Hum Comput 12(11):10223–10237CrossRef
27.
Zurück zum Zitat Madichetty S et al (2020a) Classifying informative and non-informative tweets from the twitter by adapting image features during disaster. Multimed Tools Appl 79(39):28,901-28,923CrossRef Madichetty S et al (2020a) Classifying informative and non-informative tweets from the twitter by adapting image features during disaster. Multimed Tools Appl 79(39):28,901-28,923CrossRef
28.
Zurück zum Zitat Madichetty S et al (2020) Identification of medical resource tweets using majority voting-based ensemble during disaster. Soc Netw Anal Min 10(1):1–18CrossRef Madichetty S et al (2020) Identification of medical resource tweets using majority voting-based ensemble during disaster. Soc Netw Anal Min 10(1):1–18CrossRef
29.
Zurück zum Zitat Madichetty S et al (2021) A stacked convolutional neural network for detecting the resource tweets during a disaster. Multimed Tools Appl 80(3):3927–3949CrossRef Madichetty S et al (2021) A stacked convolutional neural network for detecting the resource tweets during a disaster. Multimed Tools Appl 80(3):3927–3949CrossRef
30.
Zurück zum Zitat Martinez-Rojas M, del Carmen Pardo-Ferreira M, Rubio-Romero JC (2018) Twitter as a tool for the management and analysis of emergency situations: a systematic literature review. Int J Inf Manag 43:196–208CrossRef Martinez-Rojas M, del Carmen Pardo-Ferreira M, Rubio-Romero JC (2018) Twitter as a tool for the management and analysis of emergency situations: a systematic literature review. Int J Inf Manag 43:196–208CrossRef
31.
Zurück zum Zitat Mohanty SD, Biggers B, Sayedahmed S et al (2021) A multi-modal approach towards mining social media data during natural disasters—a case study of hurricane irma. Int J Disaster Risk Reduct 54(102):032 Mohanty SD, Biggers B, Sayedahmed S et al (2021) A multi-modal approach towards mining social media data during natural disasters—a case study of hurricane irma. Int J Disaster Risk Reduct 54(102):032
32.
Zurück zum Zitat Mouzannar H, Rizk Y, Awad M (2018) Damage identification in social media posts using multimodal deep learning. In: ISCRAM Mouzannar H, Rizk Y, Awad M (2018) Damage identification in social media posts using multimodal deep learning. In: ISCRAM
33.
Zurück zum Zitat Nugroho KS, Sukmadewa AY, Wuswilahaken DWH et al (2021) Bert fine-tuning for sentiment analysis on Indonesian mobile apps reviews. In: 6th international conference on sustainable information engineering and technology 2021, pp 258–264 Nugroho KS, Sukmadewa AY, Wuswilahaken DWH et al (2021) Bert fine-tuning for sentiment analysis on Indonesian mobile apps reviews. In: 6th international conference on sustainable information engineering and technology 2021, pp 258–264
34.
Zurück zum Zitat Ofli F, Alam F, Imran M (2020) Analysis of social media data using multimodal deep learning for disaster response. arXiv preprint arXiv:2004.11838 Ofli F, Alam F, Imran M (2020) Analysis of social media data using multimodal deep learning for disaster response. arXiv preprint arXiv:​2004.​11838
35.
Zurück zum Zitat Pajak K, Pajak D (2022) Multilingual fine-tuning for grammatical error correction. Expert Syst Appl 200(116):948 Pajak K, Pajak D (2022) Multilingual fine-tuning for grammatical error correction. Expert Syst Appl 200(116):948
36.
Zurück zum Zitat Palen L, Anderson KM, Mark G et al (2010) A vision for technology-mediated support for public participation & assistance in mass emergencies & disasters. ACM-BCS Vis Comput Sci 2010:1–12 Palen L, Anderson KM, Mark G et al (2010) A vision for technology-mediated support for public participation & assistance in mass emergencies & disasters. ACM-BCS Vis Comput Sci 2010:1–12
37.
Zurück zum Zitat Phengsuwan J, Shah T, Thekkummal NB et al (2021) Use of social media data in disaster management: a survey. Future Internet 13(2):46CrossRef Phengsuwan J, Shah T, Thekkummal NB et al (2021) Use of social media data in disaster management: a survey. Future Internet 13(2):46CrossRef
38.
Zurück zum Zitat Pourebrahim N, Sultana S, Edwards J et al (2019) Understanding communication dynamics on twitter during natural disasters: a case study of hurricane sandy. Int J Disaster Risk Reduct 37(101):176 Pourebrahim N, Sultana S, Edwards J et al (2019) Understanding communication dynamics on twitter during natural disasters: a case study of hurricane sandy. Int J Disaster Risk Reduct 37(101):176
39.
Zurück zum Zitat Rizk Y, Jomaa HS, Awad M et al (2019) A computationally efficient multi-modal classification approach of disaster-related twitter images. In: Proceedings of the 34th ACM/SIGAPP symposium on applied computing, pp 2050–2059 Rizk Y, Jomaa HS, Awad M et al (2019) A computationally efficient multi-modal classification approach of disaster-related twitter images. In: Proceedings of the 34th ACM/SIGAPP symposium on applied computing, pp 2050–2059
40.
Zurück zum Zitat Shah R, Zimmermann R (2017) Multimodal analysis of user-generated multimedia content. Springer, BerlinCrossRef Shah R, Zimmermann R (2017) Multimodal analysis of user-generated multimedia content. Springer, BerlinCrossRef
41.
Zurück zum Zitat Shah RR, Yu Y, Zimmermann R (2014) Advisor: personalized video soundtrack recommendation by late fusion with heuristic rankings. In: Proceedings of the 22nd ACM international conference on multimedia, pp 607–616 Shah RR, Yu Y, Zimmermann R (2014) Advisor: personalized video soundtrack recommendation by late fusion with heuristic rankings. In: Proceedings of the 22nd ACM international conference on multimedia, pp 607–616
42.
Zurück zum Zitat Shah RR, Mahata D, Choudhary V et al (2018) Multimodal semantics and affective computing from multimedia content. In: Intelligent multidimensional data and image processing. IGI Global, pp 359–382 Shah RR, Mahata D, Choudhary V et al (2018) Multimodal semantics and affective computing from multimedia content. In: Intelligent multidimensional data and image processing. IGI Global, pp 359–382
43.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556
44.
Zurück zum Zitat Singh T, Kumari M (2016) Role of text pre-processing in twitter sentiment analysis. Procedia Comput Sci 89:549–554CrossRef Singh T, Kumari M (2016) Role of text pre-processing in twitter sentiment analysis. Procedia Comput Sci 89:549–554CrossRef
45.
Zurück zum Zitat Snyder LS, Lin YS, Karimzadeh M et al (2019) Interactive learning for identifying relevant tweets to support real-time situational awareness. IEEE Trans Vis Comput Graph 26(1):558–568 Snyder LS, Lin YS, Karimzadeh M et al (2019) Interactive learning for identifying relevant tweets to support real-time situational awareness. IEEE Trans Vis Comput Graph 26(1):558–568
46.
Zurück zum Zitat Szegedy C, Liu W, Jia Y et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9 Szegedy C, Liu W, Jia Y et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
47.
Zurück zum Zitat Tan M, Le Q (2019) Efficientnet: rethinking model scaling for convolutional neural networks. In: International conference on machine learning, PMLR, pp 6105–6114 Tan M, Le Q (2019) Efficientnet: rethinking model scaling for convolutional neural networks. In: International conference on machine learning, PMLR, pp 6105–6114
48.
Zurück zum Zitat Tripathy JK, Chakkaravarthy SS, Satapathy SC et al (2020) Albert-based fine-tuning model for cyberbullying analysis. Multimed Syst 2020:1–9 Tripathy JK, Chakkaravarthy SS, Satapathy SC et al (2020) Albert-based fine-tuning model for cyberbullying analysis. Multimed Syst 2020:1–9
49.
Zurück zum Zitat Valdez DB, Godmalin RAG (2021) A deep learning approach of recognizing natural disasters on images using convolutional neural network and transfer learning. In: Proceedings of the international conference on artificial intelligence and its applications, pp 1–7 Valdez DB, Godmalin RAG (2021) A deep learning approach of recognizing natural disasters on images using convolutional neural network and transfer learning. In: Proceedings of the international conference on artificial intelligence and its applications, pp 1–7
50.
Zurück zum Zitat Yu Y, Tang S, Aizawa K et al (2018) Category-based deep cca for fine-grained venue discovery from multimodal data. IEEE Trans Neural Netw Learn Syst 30(4):1250–1258MathSciNetCrossRef Yu Y, Tang S, Aizawa K et al (2018) Category-based deep cca for fine-grained venue discovery from multimodal data. IEEE Trans Neural Netw Learn Syst 30(4):1250–1258MathSciNetCrossRef
51.
Zurück zum Zitat Yu Y, Tang S, Raposo F et al (2019) Deep cross-modal correlation learning for audio and lyrics in music retrieval. ACM Trans Multimed Comput Commun Appl 15(1):1–16CrossRef Yu Y, Tang S, Raposo F et al (2019) Deep cross-modal correlation learning for audio and lyrics in music retrieval. ACM Trans Multimed Comput Commun Appl 15(1):1–16CrossRef
52.
Zurück zum Zitat Zahra K, Imran M, Ostermann FO (2020) Automatic identification of eyewitness messages on twitter during disasters. Inf Process Manag 57(1):102,107CrossRef Zahra K, Imran M, Ostermann FO (2020) Automatic identification of eyewitness messages on twitter during disasters. Inf Process Manag 57(1):102,107CrossRef
Metadaten
Titel
Multimodal tweet classification in disaster response systems using transformer-based bidirectional attention model
verfasst von
Rani Koshy
Sivasankar Elango
Publikationsdatum
01.10.2022
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 2/2023
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-022-07790-5

Weitere Artikel der Ausgabe 2/2023

Neural Computing and Applications 2/2023 Zur Ausgabe

Premium Partner