2020 | OriginalPaper | Chapter

Detecting Rumours in Disasters: An Imbalanced Learning Approach

Authors: Amir Ebrahimi Fard, Majid Mohammadi, Bartel van de Walle

Published in: Computational Science – ICCS 2020

Publisher: Springer International Publishing

Abstract

The online spread of rumours during disasters can create panic and anxiety and disrupt crisis operations. It is therefore crucial to take measures against this phenomenon, since it can turn into a crisis in its own right. In this work, automatic rumour detection in natural disasters is addressed from an imbalanced learning perspective, because rumours are scarce and non-rumours abundant in social networks.
We first provide two datasets by collecting and annotating tweets about Hurricane Florence and the Kerala flood. We then capture the properties of rumours and non-rumours in those disasters using 83 theory-based and early-available features, 47 of which are proposed for the first time. The proposed features show high discriminative power, which helps us distinguish rumours from non-rumours more reliably. Next, we build the rumour identification models using imbalanced learning to address the scarcity of rumours compared to non-rumours. Additionally, to replicate rumour detection in a real-world situation, we practise cross-incident learning by training the classifier on the samples of one incident and testing it on the other. Finally, we measure the impact of imbalanced learning using the Bayesian Wilcoxon signed-rank test and observe a significant improvement in the classifiers' performance.
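
The cross-incident, imbalanced-learning setup described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. The abstract does not say which resampling technique or classifier was used, so random oversampling and a k-nearest-neighbour classifier are stand-ins chosen for brevity. The data are synthetic, with two made-up features instead of the 83 used in the chapter, and all function names (make_incident, random_oversample, knn_predict) are hypothetical.

```python
import random
from collections import Counter


def make_incident(n_rumour, n_non_rumour, seed):
    """Synthetic stand-in for one annotated incident (1 = rumour, 0 = non-rumour)."""
    rng = random.Random(seed)
    X = [[rng.gauss(2.0, 1.0), rng.gauss(2.0, 1.0)] for _ in range(n_rumour)]
    X += [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(n_non_rumour)]
    y = [1] * n_rumour + [0] * n_non_rumour
    return X, y


def random_oversample(X, y, seed=0):
    """Duplicate minority-class samples at random until every class is equally frequent."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        pool = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(rng.choice(pool))
            y_out.append(label)
    return X_out, y_out


def knn_predict(X_train, y_train, x, k=5):
    """Placeholder classifier: majority vote among the k nearest training samples."""
    nearest = sorted(
        range(len(X_train)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(x, X_train[i])),
    )[:k]
    return Counter(y_train[i] for i in nearest).most_common(1)[0][0]


# Cross-incident learning: train on one incident, test on the other.
X_train, y_train = make_incident(n_rumour=20, n_non_rumour=400, seed=1)
X_test, y_test = make_incident(n_rumour=15, n_non_rumour=300, seed=2)

# Rebalance the training incident only; the test incident keeps its natural skew.
X_bal, y_bal = random_oversample(X_train, y_train)

preds = [knn_predict(X_bal, y_bal, x) for x in X_test]
rumour_recall = sum(p == 1 and t == 1 for p, t in zip(preds, y_test)) / sum(y_test)

print("training class counts before:", Counter(y_train))
print("training class counts after: ", Counter(y_bal))
print("rumour recall on the unseen incident:", round(rumour_recall, 2))
```

Only the training incident is resampled, so the evaluation still reflects the skewed class distribution a deployed detector would face. Comparing the scores obtained with and without the resampling step, across classifiers, is the kind of paired comparison to which a signed-rank test can then be applied.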

Footnotes
1. The datasets are publicly available: https://bit.ly/2WxVhY0.
2. The newly introduced features are explained and their relevance is discussed in the first section of the supplementary materials (available at: https://bit.ly/2PJ3FmR).
3. Further explanations of the issues with conventional feature selection methods can be found in the second section of the supplementary materials (available at: https://bit.ly/2PJ3FmR).
Metadata
Title
Detecting Rumours in Disasters: An Imbalanced Learning Approach
Authors
Amir Ebrahimi Fard
Majid Mohammadi
Bartel van de Walle
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-50423-6_48
