Skip to main content
Top
Published in: Health and Technology 4/2019

21-03-2019 | Original Paper

Predicting the spread of influenza epidemics by analyzing twitter messages

Authors: Soheila Molaei, Mohammad Khansari, Hadi Veisi, Mostafa Salehi

Published in: Health and Technology | Issue 4/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Seasonal influenza epidemics affect millions of people with respiratory illnesses and cause 250,000 to 500,000 deaths worldwide each year. Rapidly predicting the outbreak of epidemics leads to an earlier detection and control. In this study, we predicted an influenza-like illness (ILI) based on social media data derived from Twitter. Tweets and patients do not always have a linear correlation; therefore, we employed nonlinear methods including autoregressive with exogenous inputs (ARX), autoregressive-moving-average with exogenous inputs (ARMAX), nonlinear autoregressive exogenous (NARX), deep multilayer perceptron (DeepMLP), and a convolutional neural network (CNN). Two new features employed to significantly reduce the prediction errors are products of the tweets and Centers for Disease Control and Prevention (CDC) data and of the tweets and Google data. Furthermore, we introduced a new method based on entropy that decreased the errors as well as time complexity. Among the available methods and features, the best results were obtained with the newly developed features in the deep neural network methods and the entropy-based method that decreased the mean average error by up to 25%. The entropy method also reduced the time complexity. Applying the above-mentioned methods to the Twitter datasets from 2009 to 2010 and 2011–2014 revealed that the ILI outbreak can be predicted 2–4 weeks earlier than by the CDC.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
2.
3.
go back to reference Morens DM, Fauci AS. The 1918 influenza pandemic: insights for the 21st century. J Infect Dis. 2007;195:1018–28.CrossRef Morens DM, Fauci AS. The 1918 influenza pandemic: insights for the 21st century. J Infect Dis. 2007;195:1018–28.CrossRef
5.
go back to reference Chen L, Hossain KSMT, Butler P, Ramakrishnan N, Prakash BA. Flu Gone Viral: Syndromic Surveillance of Flu on Twitter Using Temporal Topic Models. Proc - IEEE Int Conf Data Mining, ICDM. 2015. p. 755–60. Chen L, Hossain KSMT, Butler P, Ramakrishnan N, Prakash BA. Flu Gone Viral: Syndromic Surveillance of Flu on Twitter Using Temporal Topic Models. Proc - IEEE Int Conf Data Mining, ICDM. 2015. p. 755–60.
7.
go back to reference Yih WK, Teates KS, Abrams A, Kleinman K, Kulldorff M, Pinner R, et al. Telephone triage service data for detection of influenza-like illness. PLoS One. 2009;4. Yih WK, Teates KS, Abrams A, Kleinman K, Kulldorff M, Pinner R, et al. Telephone triage service data for detection of influenza-like illness. PLoS One. 2009;4.
8.
go back to reference Liu TY, Sanders JL, Tsui FC, Espino JU, Dato VM, Suyama J. Association of Over-The-Counter Pharmaceutical Sales with Influenza-Like-Illnesses to Patient Volume in an Urgent Care Setting. PLoS One. 2013;8. Liu TY, Sanders JL, Tsui FC, Espino JU, Dato VM, Suyama J. Association of Over-The-Counter Pharmaceutical Sales with Influenza-Like-Illnesses to Patient Volume in an Urgent Care Setting. PLoS One. 2013;8.
10.
15.
go back to reference Achrekar H, Lazarus R, Park WC. Predicting Flu Trends using Twitter Data. IEEE Infocom. 2011;702–7. Achrekar H, Lazarus R, Park WC. Predicting Flu Trends using Twitter Data. IEEE Infocom. 2011;702–7.
16.
go back to reference Lee K. Real-time disease surveillance using twitter data: demonstration on flu and cancer. KDD’13. 2013;1474–7. Lee K. Real-time disease surveillance using twitter data: demonstration on flu and cancer. KDD’13. 2013;1474–7.
17.
go back to reference Lamb A, Paul MJ, Dredze M. Separating fact from fear: tracking flu infections on Twitter. Conf North Am Chapter Assoc Comput Linguist Hum Lang Technol. 2013; Lamb A, Paul MJ, Dredze M. Separating fact from fear: tracking flu infections on Twitter. Conf North Am Chapter Assoc Comput Linguist Hum Lang Technol. 2013;
18.
go back to reference Sadilek A, Kautz H, Silenzio V. Modeling spread of disease from social interactions. Int AAAI Conf Weblogs Soc Media. 2012. Sadilek A, Kautz H, Silenzio V. Modeling spread of disease from social interactions. Int AAAI Conf Weblogs Soc Media. 2012.
20.
go back to reference Bodnar T, Salathé M. Validating Models for Disease Detection Using Twitter Regression on Tweet Count. Proc 22nd Int Conf World Wide Web companion. 2013;699–702. Bodnar T, Salathé M. Validating Models for Disease Detection Using Twitter Regression on Tweet Count. Proc 22nd Int Conf World Wide Web companion. 2013;699–702.
23.
go back to reference Caverlee J, Webb S, Tech G. A Large-Scale Study of MySpace : Observations and Implications for Online Social Networks. Proc from 2nd Int Conf Weblogs Soc Media AAAI. 2008; Caverlee J, Webb S, Tech G. A Large-Scale Study of MySpace : Observations and Implications for Online Social Networks. Proc from 2nd Int Conf Weblogs Soc Media AAAI. 2008;
24.
go back to reference Gauvin W, Ribeiro B, Towsley D, Liu B, Wang J. Measurement and gender-specific analysis of user publishing characteristics on MySpace. IEEE Netw. 2010;24:38–43.CrossRef Gauvin W, Ribeiro B, Towsley D, Liu B, Wang J. Measurement and gender-specific analysis of user publishing characteristics on MySpace. IEEE Netw. 2010;24:38–43.CrossRef
25.
go back to reference Asur S, Huberman BA. Predicting the Future With Social Media. WI-IAT ‘10 Proc 2010 IEEE/WIC/ACM Int Conf Web Intell Intell Agent Technol. 2010;429–99. Asur S, Huberman BA. Predicting the Future With Social Media. WI-IAT ‘10 Proc 2010 IEEE/WIC/ACM Int Conf Web Intell Intell Agent Technol. 2010;429–99.
26.
go back to reference Motoyama M, Voelker GM, Savage S. Measuring Online Service Availability Using Twitter. WOSN’10 Proc 3rd Conf Online Soc networks. 2010;13. Motoyama M, Voelker GM, Savage S. Measuring Online Service Availability Using Twitter. WOSN’10 Proc 3rd Conf Online Soc networks. 2010;13.
27.
go back to reference Mislove A, Lehmann S, Ahn Y-Y, Onnela J-P, Rosenquist JN. pulse of the nation us mood throughout the day inferred from twitter. 2013; Mislove A, Lehmann S, Ahn Y-Y, Onnela J-P, Rosenquist JN. pulse of the nation us mood throughout the day inferred from twitter. 2013;
28.
go back to reference Heaivilin N, Gerbert B, Page J, Gibbs J. Public health surveillance of dental pain via Twitter. J Dent Res. 2011;90:1047–51.CrossRef Heaivilin N, Gerbert B, Page J, Gibbs J. Public health surveillance of dental pain via Twitter. J Dent Res. 2011;90:1047–51.CrossRef
29.
go back to reference Bosley JC, Zhao NW, Hill S, Shofer FS, Asch DA, Becker LB, et al. Decoding twitter: Surveillance and trends for cardiac arrest and resuscitation communication. Resuscitation. 2013;84:206–12.CrossRef Bosley JC, Zhao NW, Hill S, Shofer FS, Asch DA, Becker LB, et al. Decoding twitter: Surveillance and trends for cardiac arrest and resuscitation communication. Resuscitation. 2013;84:206–12.CrossRef
30.
go back to reference Paul MJ, Dredze M. You Are What You Tweet: Analyzing Twitter for Public Health. Fifth Int AAAI Conf Weblogs Soc Media. 2011;265–72. Paul MJ, Dredze M. You Are What You Tweet: Analyzing Twitter for Public Health. Fifth Int AAAI Conf Weblogs Soc Media. 2011;265–72.
31.
go back to reference Gomide J, Veloso A, Meira W, Almeida V, Benevenuto F, Ferraz F, et al. Dengue surveillance based on a computational model of spatio-temporal locality of Twitter. Proc 3rd Int Web Sci Conf - WebSci ‘11. New York, New York, USA, New York, USA: ACM Press; 2011. p. 1–8. Available from: http://dl.acm.org/citation.cfm?doid=2527031.2527049. Accessed 3 March 2019. Gomide J, Veloso A, Meira W, Almeida V, Benevenuto F, Ferraz F, et al. Dengue surveillance based on a computational model of spatio-temporal locality of Twitter. Proc 3rd Int Web Sci Conf - WebSci ‘11. New York, New York, USA, New York, USA: ACM Press; 2011. p. 1–8. Available from: http://​dl.​acm.​org/​citation.​cfm?​doid=​2527031.​2527049. Accessed 3 March 2019.
32.
go back to reference Signorini A, Segre AM, Polgreen PM. The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic. PLoS One. 2011. Signorini A, Segre AM, Polgreen PM. The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic. PLoS One. 2011.
33.
go back to reference Chew C, Eysenbach G. Pandemics in the age of Twitter: Content analysis of tweets during the 2009 H1N1 outbreak. PLoS One. 2010;5:361–7.CrossRef Chew C, Eysenbach G. Pandemics in the age of Twitter: Content analysis of tweets during the 2009 H1N1 outbreak. PLoS One. 2010;5:361–7.CrossRef
34.
go back to reference Lampos V, Cristianini N. Tracking the flu pandemic by monitoring the social web. 2nd Int Work Cogn Inf Process. Ieee; 2010;411–6. Lampos V, Cristianini N. Tracking the flu pandemic by monitoring the social web. 2nd Int Work Cogn Inf Process. Ieee; 2010;411–6.
35.
go back to reference Aramaki E. Twitter Catches The Flu: Detecting Influenza Epidemics using Twitter. Proc 2011 Conf Empir Methods Nat Lang Process. 2011:1568–76. Aramaki E. Twitter Catches The Flu: Detecting Influenza Epidemics using Twitter. Proc 2011 Conf Empir Methods Nat Lang Process. 2011:1568–76.
36.
go back to reference Achrekar H. Social Network Enabled Flu Trends. Achrekar H. Social Network Enabled Flu Trends.
37.
go back to reference Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature. 2009;457:1012–4.CrossRef Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature. 2009;457:1012–4.CrossRef
40.
go back to reference Polgreen PM, Chen Y, Pennock DM, Nelson FD. Using internet searches for influenza surveillance. Clin Infect Dis. 2008;47:1443–8.CrossRef Polgreen PM, Chen Y, Pennock DM, Nelson FD. Using internet searches for influenza surveillance. Clin Infect Dis. 2008;47:1443–8.CrossRef
41.
go back to reference Hulth A, Rydevik G, Linde A. Web queries as a source for syndromic surveillance. PLoS One. 2009;4:e4378.CrossRef Hulth A, Rydevik G, Linde A. Web queries as a source for syndromic surveillance. PLoS One. 2009;4:e4378.CrossRef
44.
go back to reference Balakrishnan V. System identification: theory for the user (second edition). Automatica. 2002;38:375–8.CrossRef Balakrishnan V. System identification: theory for the user (second edition). Automatica. 2002;38:375–8.CrossRef
45.
go back to reference Ramesh K, Aziz N, Shukor A. R. S. Development of NARX Model for Distillation Column and Studies on Effect of Regressors. J Appl Sci. 2008;8:1214–20.CrossRef Ramesh K, Aziz N, Shukor A. R. S. Development of NARX Model for Distillation Column and Studies on Effect of Regressors. J Appl Sci. 2008;8:1214–20.CrossRef
46.
go back to reference Cajueiro E, Kalid R, Schnitman L. Using NARX model with wavelet network to inferring the polished rod position. Int J Math Comput Simul. 2012;6. Cajueiro E, Kalid R, Schnitman L. Using NARX model with wavelet network to inferring the polished rod position. Int J Math Comput Simul. 2012;6.
49.
go back to reference Kendall M. Rank correlation methods. London Griffin. 1970. Kendall M. Rank correlation methods. London Griffin. 1970.
Metadata
Title
Predicting the spread of influenza epidemics by analyzing twitter messages
Authors
Soheila Molaei
Mohammad Khansari
Hadi Veisi
Mostafa Salehi
Publication date
21-03-2019
Publisher
Springer Berlin Heidelberg
Published in
Health and Technology / Issue 4/2019
Print ISSN: 2190-7188
Electronic ISSN: 2190-7196
DOI
https://doi.org/10.1007/s12553-019-00309-4

Other articles of this Issue 4/2019

Health and Technology 4/2019 Go to the issue

Premium Partner