Top

Published in:

2020 | OriginalPaper | Chapter

Detecting Rumours in Disasters: An Imbalanced Learning Approach

Authors : Amir Ebrahimi Fard, Majid Mohammadi, Bartel van de Walle

Published in: Computational Science – ICCS 2020

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

The online spread of rumours in disasters can create panic and anxiety and disrupt crisis operations. Hence, it is crucial to take measure against such a distressing phenomenon since it can turn into a crisis by itself. In this work, the automatic rumour detection in natural disasters is addressed from an imbalanced learning perspective due to the rumour dearth versus non-rumour abundance in social networks.

We first provide two datasets by collecting and annotating tweets regarding the Hurricane Florence and Kerala flood. We then capture the properties of rumours and non-rumours in those disasters using 83 theory-based and early-available features, 47 of which are proposed for the first time. The proposed features show a high discrimination power that help us distinguish rumours from non-rumours more reliably. Next, We build the rumour identification models using imbalanced learning to address the scarcity of rumours compared to non-rumour. Additionally, to replicate the rumour detection in the real-world situation, we practice cross-incident learning by training the classifier with the samples of one incident and test it with the other one. In the end we measure the impact of imbalanced learning using Bayesian Wilcoxon Signed-rank test and observe a significant improvement in the classifiers performance.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter Syntactic and Semantic Bias Detection and Countermeasures

next chapter Sentiment Analysis for Fake News Detection by Means of Neural Networks

The datasets are publicly available: https://bit.ly/2WxVhY0.

The newly introduced features are explained and their relevance are discussed in the first section of the supplementary materials (available at: https://bit.ly/2PJ3FmR).

The further explanations regarding the issues of the conventional feature selection methods can be found in the second section of the supplementary materials (available at: https://bit.ly/2PJ3FmR).

Benavoli, A., Corani, C., Demšar, J., Zaffalon, M.: Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis. J. Mach. Learn. Res. 18(1), 2653–2688 (2017)MathSciNetMATH

Castillo, C., Mendoza, M., Poblete, B.: Information credibility on Twitter. In: Proceedings of the 20th International Conference on World Wide Web - WWW 2011, p. 675. ACM Press, New York (2011)

Chen, Y.C., Liu, Z.Y., Kao, H.Y.: IKM at SemEval-2017 Task 8: convolutional neural networks for stance detection and rumor verification. In: The 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 465–469 (2017)

DiFonzo, N., Bordia, P.: Rumor psychology: social and organizational approaches. American Psychological Association (2007)

Dunbar, R.I.: Gossip in evolutionary perspective. Rev. Gener. Psychol. 8(2), 100–110 (2004)CrossRef

Fard, A.E., et al.: Rumour as an anomaly: rumour detection with one-class classification. In: 2019 IEEE International Conference on Engineering, Technology and Innovation (ICE/ITMC), pp. 1–9. IEEE (2019)

Forman, G.: An extensive empirical study of feature selection metrics for text classification. J. Mach. Learn. Res. 3(Mar), 1289–1305 (2003)MATH

García, L.M., Lilja, H., Tjörnhammar, E., Karasalo, M.: Mama Edha at SemEval-2017 task 8: stance classification with CNN and rules. In: The 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 481–485 (2017)

He, H., Garcia, E.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284 (2009)CrossRef

10.

Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef

11.

Janecek, A.G.K., Gansterer, W.N., Demel, M.A., Ecker, G.F.: On the relationship between feature selection and classification accuracy. In: Proceedings of the 2008 International Conference on New Challenges for Feature Selection in Data Mining and Knowledge Discovery, pp. 90–105. JMLR.org (2008)

12.

Kwon, S., Cha, M., Jung, K., On, W.C.: Prominent features of rumor propagation in online social media. In: International Conference on Data Mining. IEEE (2013)

13.

Kwon, S., Cha, M., Jung, K.: Rumor detection over varying time windows. PLOS One 12(1), e0168344 (2017)CrossRef

14.

Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data (2001)

15.

Lazer, D.M.J., et al.: The science of fake news. Science (New York, N.Y.) 359(6380), 1094–1096 (2018)CrossRef

16.

Lemaître, G., Nogueira, F., Aridas, C.K.: Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res. 18(1), 559–563 (2017)

17.

Liang, G., He, W., Xu, C., Chen, L., Zeng, J.: Rumor identification in microblogging systems based on users’ behavior. IEEE Trans. Comput. Soc. Syst. 2(3), 99–108 (2015)CrossRef

18.

Ma, J., Gao, W., Mitra, P., Kwon, S., Jansen, B., Wong, K.: Detecting rumors from microblogs with recurrent neural networks. In: IJCAI, pp. 3818–3824 (2016)

19.

Qazvinian, V., Rosengren, E., Radev, D.R., Mei, Q.: Rumor has it: identifying misinformation in microblogs. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1589–1599 (2011)

20.

Sicilia, R., Lo Giudice, S., Pei, Y., Pechenizkiy, M.: Twitter rumour detection in the health domain. Expert Syst. Appl. 110, 33–40 (2018)CrossRef

21.

Starbird, K., Maddock, J., Orand, M., Achterman, P., Mason, R.: Rumors, false flags, and digital vigilantes: misinformation on Twitter after the 2013 Boston marathon bombing. In: IConference (2014)

22.

Turenne, N.: The rumour spectrum. PLOS One 13(1), e0189080 (2018)CrossRef

23.

Varol, O., et al.: Feature engineering for social bot detection. In: Feature Engineering for Social Bot Detection, pp. 311–334. CRC Press, March 2018

24.

Vosoughi, S., Mohsenvand, M.N., Roy, D.: Rumor Gauge. ACM Trans. Knowl. Discov. Data 11(4), 1–36 (2017)CrossRef

25.

Vosoughi, S., Roy, D., Aral, S.: The spread of true and false news online. Science (New York, N.Y.) 359(6380), 1146–1151 (2018)CrossRef

26.

Wang, S., Moise, I., Helbing, D.: Early signals of trending rumor event in streaming social media. In: Computer Software and Applications Conference (COMPSAC), pp. 654–659. IEEE (2017)

27.

Wijeratne, S., et al.: Feature engineering for Twitter-based applications. In: Feature Engineering for Machine Learning and Data Analytics, pp. 359–384 (2017)

28.

Wu, K., Yang, S., Zhu, K.Q.: False rumors detection on Sina Weibo by propagation structures. In: 2015 IEEE 31st International Conference on Data Engineering, pp. 651–662. IEEE, April 2015

29.

Yang, F., Liu, Y., Yu, X., Yang, M.: Automatic detection of rumor on Sina Weibo. In: Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics (2012)

30.

Zhang, Q., Zhang, S., Dong, J., Xiong, J., Cheng, X.: Automatic detection of rumor on social network. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC -2015. LNCS (LNAI), vol. 9362, pp. 113–122. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25207-0_10CrossRef

31.

Zhao, Z., Resnick, P., Mei, Q.: Enquiring minds. In: Proceedings of the 24th International Conference on World Wide Web - WWW 2015, pp. 1395–1405. ACM Press, New York (2015)

32.

Zubiaga, A., Liakata, M., Procter, R.: Exploiting context for rumour detection in social media. In: Ciampaglia, G.L., Mashhadi, A., Yasseri, T. (eds.) SocInfo 2017. LNCS, vol. 10539, pp. 109–123. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67217-5_8CrossRef

33.

Zubiaga, A., Aker, A., Bontcheva, K., Liakata, M., Procter, R.: Detection and resolution of rumours in social media. ACM Comput. Surv. 51(2), 1–36 (2018)CrossRef

34.

Zubiaga, A., Liakata, M., Procter, R.: Learning reporting dynamics during breaking news for rumour detection in social media (2016)

Title: Detecting Rumours in Disasters: An Imbalanced Learning Approach
Authors: Amir Ebrahimi Fard
Majid Mohammadi
Bartel van de Walle
Publisher: Springer International Publishing
Book: Computational Science – ICCS 2020
Print ISBN: 978-3-030-50422-9

Electronic ISBN: 978-3-030-50423-6

Copyright Year: 2020
DOI: https://doi.org/10.1007/978-3-030-50423-6_48

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner