Skip to main content
Top
Published in: Multimedia Systems 6/2022

07-06-2022 | Regular Paper

BERT-based semi-supervised domain adaptation for disastrous classification

Authors: Jing Wang, Kexin Wang

Published in: Multimedia Systems | Issue 6/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Currently, with the rapid development of social media platforms represented by twitter, multimodal content on social media provides critical information during major disastrous events. Different from traditional text news, major disastrous events posted on Twitter often attach videos or images. Although many studies have shown that both image and text contents are useful for humanitarian aid and disaster response, this research focuses on analyzing multimodal information in social media to effectively analyze sudden disastrous events. We propose an end-to-end model, the BERT-based semi-supervised domain adaptation for multimodal disastrous events classification (BSSDA). BSSDA is composed of two main modules: a classifier with minimax entropy domain adaptation and a multimodal feature extractor. First, we perform multimodal feature extraction on our three modalities (image, text and image description). Here, we generate image description with the NeuralTalk and extract both image description features and text features with BERT, and extract image features with pretrained VGG. Then, the multimodal features are concatenated and fed to the classifier with minimax entropy domain adaptation module. The purpose of adding the domain adaptation module is to map the multimodal features of different disastrous events to the same feature space by using a good amount of unlabeled data of the unforeseen real-time disastrous event. Our experimental results of three different types of disastrous events show that our BSSDA model has significant improvements for disastrous events classification.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Alam, F., Joty, S., Imran, M.: Graph based semi-supervised learning with convolution neural networks to classify crisis related tweets. In: Twelfth International AAAI conference on web and social media (2018) Alam, F., Joty, S., Imran, M.: Graph based semi-supervised learning with convolution neural networks to classify crisis related tweets. In: Twelfth International AAAI conference on web and social media (2018)
2.
go back to reference Alam, F., Ofli, F., Imran, M.: Crisismmd: multimodal twitter datasets from natural disasters. In: Proceedings of the International AAAI conference on web and social media, pp. 465–473 (2018) Alam, F., Ofli, F., Imran, M.: Crisismmd: multimodal twitter datasets from natural disasters. In: Proceedings of the International AAAI conference on web and social media, pp. 465–473 (2018)
3.
go back to reference Bica, M., Palen, L., Bopp, C.: Visual representations of disaster. In: Proceedings of the 2017 ACM Conference on computer supported cooperative work and social computing, pp. 1262–1276 (2017) Bica, M., Palen, L., Bopp, C.: Visual representations of disaster. In: Proceedings of the 2017 ACM Conference on computer supported cooperative work and social computing, pp. 1262–1276 (2017)
4.
go back to reference Castillo, C., Imran, M., Meier, P., et al.: Together we stand-supporting decision in crisis response: artificial intelligence for digital response and micromappers. By OCHA and partners Istanbul: Tudor Rose, World Humanitarian Summit pp. 93–95 (2016) Castillo, C., Imran, M., Meier, P., et al.: Together we stand-supporting decision in crisis response: artificial intelligence for digital response and micromappers. By OCHA and partners Istanbul: Tudor Rose, World Humanitarian Summit pp. 93–95 (2016)
5.
go back to reference Chen, T., Lu, D., Kan, M.Y., et al.: Understanding and classifying image tweets. In: Proceedings of the 21st ACM International Conference on Multimedia, pp. 781–784 (2013) Chen, T., Lu, D., Kan, M.Y., et al.: Understanding and classifying image tweets. In: Proceedings of the 21st ACM International Conference on Multimedia, pp. 781–784 (2013)
7.
go back to reference Daly, S., Thom, J.A.: Mining and classifying image posts on social media to analyse fires. In: ISCRAM, Citeseer, pp. 1–14 (2016) Daly, S., Thom, J.A.: Mining and classifying image posts on social media to analyse fires. In: ISCRAM, Citeseer, pp. 1–14 (2016)
8.
go back to reference Gautam, A.K., Misra, L., Kumar, A., et al.: Multimodal analysis of disaster tweets. In: 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM), IEEE, pp. 94–103 (2019) Gautam, A.K., Misra, L., Kumar, A., et al.: Multimodal analysis of disaster tweets. In: 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM), IEEE, pp. 94–103 (2019)
9.
go back to reference Gidaris, S., Komodakis, N.: Dynamic few-shot visual learning without forgetting. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4367–4375 (2018) Gidaris, S., Komodakis, N.: Dynamic few-shot visual learning without forgetting. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4367–4375 (2018)
10.
go back to reference Imran, M., Castillo, C., Diaz, F., et al.: Processing social media messages in mass emergency: a survey. ACM Comput. Surv. (CSUR) 47(4), 1–38 (2015)CrossRef Imran, M., Castillo, C., Diaz, F., et al.: Processing social media messages in mass emergency: a survey. ACM Comput. Surv. (CSUR) 47(4), 1–38 (2015)CrossRef
11.
go back to reference Jin, Z., Cao, J., Guo, H., et al.: Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In: Proceedings of the 25th ACM International conference on multimedia, pp. 795–816 (2017) Jin, Z., Cao, J., Guo, H., et al.: Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In: Proceedings of the 25th ACM International conference on multimedia, pp. 795–816 (2017)
12.
go back to reference Jin, Z., Cao, J., Zhang, Y., et al.: Novel visual and statistical image features for microblogs news verification. IEEE Trans. Multimedia 19(3), 598–608 (2016)CrossRef Jin, Z., Cao, J., Zhang, Y., et al.: Novel visual and statistical image features for microblogs news verification. IEEE Trans. Multimedia 19(3), 598–608 (2016)CrossRef
14.
go back to reference Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. arXiv preprint. (2013). arXiv:1301.3781 Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. arXiv preprint. (2013). arXiv:​1301.​3781
15.
go back to reference Mouzannar, H., Rizk, Y., Awad, M.: Damage identification in social media posts using multimodal deep learning. In: ISCRAM, pp. 529–543 (2018) Mouzannar, H., Rizk, Y., Awad, M.: Damage identification in social media posts using multimodal deep learning. In: ISCRAM, pp. 529–543 (2018)
16.
go back to reference Nguyen, D., Al Mannai, K.A., Joty, S., et al.: Robust classification of crisis-related data on social networks using convolutional neural networks. In: Proceedings of the International AAAI conference on web and social media, pp. 632–635 (2017) Nguyen, D., Al Mannai, K.A., Joty, S., et al.: Robust classification of crisis-related data on social networks using convolutional neural networks. In: Proceedings of the International AAAI conference on web and social media, pp. 632–635 (2017)
17.
go back to reference Peters, R., de Albuquerque, J.P.: Investigating images as indicators for relevant social media messages in disaster management. In: ISCRAM (2015) Peters, R., de Albuquerque, J.P.: Investigating images as indicators for relevant social media messages in disaster management. In: ISCRAM (2015)
18.
go back to reference Ranjan, R., Castillo, C.D., Chellappa, R.: L2-constrained softmax loss for discriminative face verification. arXiv preprint. (2017). arXiv:1703.09507 Ranjan, R., Castillo, C.D., Chellappa, R.: L2-constrained softmax loss for discriminative face verification. arXiv preprint. (2017). arXiv:​1703.​09507
19.
go back to reference Rosenthal, S., Farra, N., Nakov, P.: Semeval-2017 task 4: sentiment analysis in twitter. In: Proceedings of the 11th International workshop on semantic evaluation (SemEval-2017), pp. 502–518 (2017) Rosenthal, S., Farra, N., Nakov, P.: Semeval-2017 task 4: sentiment analysis in twitter. In: Proceedings of the 11th International workshop on semantic evaluation (SemEval-2017), pp. 502–518 (2017)
20.
go back to reference Saito, K., Kim, D., Sclaroff, S., et al.: Semi-supervised domain adaptation via minimax entropy. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 8050–8058 (2019) Saito, K., Kim, D., Sclaroff, S., et al.: Semi-supervised domain adaptation via minimax entropy. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 8050–8058 (2019)
21.
go back to reference Vieweg, S., Castillo, C., Imran, M.: Integrating social media communications into the rapid assessment of sudden onset disasters. In: International Conference on Social Informatics, Springer, pp. 444–461 (2014) Vieweg, S., Castillo, C., Imran, M.: Integrating social media communications into the rapid assessment of sudden onset disasters. In: International Conference on Social Informatics, Springer, pp. 444–461 (2014)
22.
go back to reference Vinyals, O., Toshev, A., Bengio, S., et al.: Show and tell: a neural image caption generator. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3156–3164 (2015) Vinyals, O., Toshev, A., Bengio, S., et al.: Show and tell: a neural image caption generator. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3156–3164 (2015)
23.
go back to reference Wang, Y., Ma, F., Jin, Z., et al.: Eann: event adversarial neural networks for multi-modal fake news detection. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 849–857 (2018) Wang, Y., Ma, F., Jin, Z., et al.: Eann: event adversarial neural networks for multi-modal fake news detection. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 849–857 (2018)
Metadata
Title
BERT-based semi-supervised domain adaptation for disastrous classification
Authors
Jing Wang
Kexin Wang
Publication date
07-06-2022
Publisher
Springer Berlin Heidelberg
Published in
Multimedia Systems / Issue 6/2022
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-022-00956-0

Other articles of this Issue 6/2022

Multimedia Systems 6/2022 Go to the issue