ABSTRACT
Allergies are one of the most common chronic diseases worldwide. One in five Americans suffer from either allergy or asthma symptoms. With the prevalence of social media, people sharing experiences and opinions on personal health symptoms and concerns on social media are increasing. Mining those publicly available health related data potentially provides valuable healthcare insights. In this paper, we propose a real-time allergy surveillance system that first classifies tweets to identify those that mention actual allergy incidents using bag-of-words model and NaiveBayesMultinomial classifier and applies in-depth text and spatiotemporal analysis. Our experimental results show that the proposed system can detect predominant allergy types with high precision and that allergy-related tweet volume is highly correlated to the weather data (daily maximum temperature). We believe that this is the first study that examines a large-scale social media stream for in-depth analysis of allergy activities.
- H. Achrekar, A. Gandhe, R. Lazarus, S.-H. Yu, and B. Liu. Predicting flu trends using twitter data. In Computer Communications Workshops (INFOCOM WKSHPS), 2011 IEEE Conference on, 2011.Google ScholarCross Ref
- E. Aramaki, S. Maskawa, and M. Morita. Twitter catches the flu: Detecting influenza epidemics using twitter. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11, pages 1568--1576, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics. Google ScholarDigital Library
- S. Asur and B. A. Huberman. Predicting the future with social media. In Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, WI-IAT '10, pages 492--499, Washington, DC, USA, 2010. IEEE Computer Society. Google ScholarDigital Library
- D. L. Blackwell, J. W. Lucas, and T. C. Clarke. Summary health statistics for u.s. adults: National health interview survey, 2012. http://www.cdc.gov/nchs/data/series/sr_10/sr10_260.pdf, 2013.Google Scholar
- B. Bloom, L. I. Jones, and G. Freeman. Summary health statistics for u.s. children: National health interview survey, 2012. http://www.cdc.gov/nchs/data/series/sr_10/sr10_258.pdf, 2012.Google Scholar
- J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1):1 -- 8, 2011.Google ScholarCross Ref
- P. Chakraborty, P. Khadivi, B. Lewis, A. Mahendiran, J. Chen, P. Chakraborty, P. Khadivi, B. Lewis, A. Mahendiran, and J. C. and. Forecasting a moving target: Ensemble models for ili case count predictions. In SDM, 2014.Google ScholarCross Ref
- L. Chen, H. Achrekar, B. Liu, and R. Lazarus. Vision: Towards real time epidemic vigilance through online social networks: Introducing sneft -- social network enabled flu trends. In Proceedings of the 1st ACM Workshop on Mobile Cloud Computing & Services: Social Networks and Beyond, MCS '10, pages 4:1--4:5, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- C. Chew and G. Eysenbach. Pandemics in the age of twitter: Content analysis of tweets during the 2009 h1n1 outbreak. PLoS ONE, 5(11):e14118, 11 2010.Google ScholarCross Ref
- A. Choudhary, W. Hendrix, K. Lee, D. Palsetia, and W.-K. Liao. Social media evolution of the egyptian revolution. Commun. ACM, 55(5):74--80, May 2012. Google ScholarDigital Library
- L. de Weger, T. Beerthuizen, P. Hiemstra, and J. Sont. Development and validation of a 5-day-ahead hay fever forecast for patients with grass-pollen-induced allergic rhinitis. International Journal of Biometeorology, 58(6):1047--1055, 2014.Google Scholar
- J. Emberlin, J. Mullins, J. Corden, W. Millington, M. Brooke, M. Savage, and S. Jones. The trend to earlier birch pollen seasons in the uk: a biotic response to changes in weather conditions? Grana, 36(1):29-- 33, 1997.Google ScholarCross Ref
- J. U. Espino, W. R. Hogan, and M. M. Wagner. Telephone triage: a timely data source for surveillance of influenza-like diseases. In AMIA Annual Symposium Proceedings, volume 2003, page 215. American Medical Informatics Association, 2003.Google Scholar
- J. Ginsberg, M. Mohebbi, R. Patel, L. Brammer, M. Smolinski, and L. Brilliant. Detecting influenza epidemics using search engine query data. Nature, 457:1012--1014, 2009. doi:10.1038/nature07634.Google ScholarCross Ref
- M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The weka data mining software: An update. SIGKDD Explor. Newsl., 11(1):10--18, Nov. 2009. Google ScholarDigital Library
- V. Lampos, T. De Bie, and N. Cristianini. Flu detector-tracking epidemics on twitter. In Machine Learning and Knowledge Discovery in Databases, pages 599--602. Springer, 2010. Google ScholarDigital Library
- K. Lee, A. Agrawal, and A. Choudhary. Real-time disease surveillance using twitter data: Demonstration on flu and cancer. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '13, pages 1474--1477, New York, NY, USA, 2013. ACM. Google ScholarDigital Library
- K. Lee, D. Palsetia, R. Narayanan, M. M. A. Patwary, A. Agrawal, and A. Choudhary. Twitter trending topic classification. In Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on, pages 251--258. IEEE, 2011. Google ScholarDigital Library
- J. Li and C. Cardie. Early stage influenza detection from twitter. arXiv preprint arXiv:1309.7340, 2013.Google Scholar
- S. Magruder. Evaluation of over-the-counter pharmaceutical sales as a possible early warning indicator of human disease. Johns Hopkins APL technical digest, 24(4):349--53, 2003.Google Scholar
- C. D. Manning, P. Raghavan, and H. Schtze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008. Google ScholarCross Ref
- A. McCallum and K. Nigam. A comparison of event models for naive bayes text classification. In IN AAAI-98 WORKSHOP ON LEARNING FOR TEXT CATEGORIZATION, pages 41--48. AAAI Press, 1998.Google Scholar
- M. J. Paul and M. Dredze. You are what you tweet: Analyzing twitter for public health. In ICWSM. The AAAI Press, 2011.Google Scholar
- R. Pawankar, G. W. Canonica, S. T. Holgate, and R. F. Lockey. Wao white book on allergy. http://www.worldallergy.org/UserFiles/file/WAO-White-Book-on-Allergy_web.pdf, 2011.Google Scholar
- T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes twitter users: Real-time event detection by social sensors. In In Proceedings of the Nineteenth International WWW Conference (WWW2010). ACM, 2010. Google ScholarDigital Library
- J. Shaman, A. Karspeck, W. Yang, J. Tamerius, and M. Lipsitch. Real-time influenza forecasts during the 2012--2013 season. Nature Communications, 4, Dec. 2013.Google ScholarCross Ref
- A. Signorini, A. M. Segre, and P. M. Polgreen. The use of twitter to track levels of disease activity and public concern in the u.s. during the influenza a h1n1 pandemic. PLoS ONE, 6(5):e19467, 05 2011.Google ScholarCross Ref
- M. Sofean and M. Smith. A real-time architecture for detection of diseases using social networks: Design, implementation and evaluation. In Proceedings of the 23rd ACM Conference on Hypertext and Social Media, HT '12, pages 309--310, New York, NY, USA, 2012. ACM. Google ScholarDigital Library
- S. Tuarob, C. S. Tucker, M. Salathe, and N. Ram. Discovering health-related knowledge in social media using ensembles of heterogeneous features. In Proceedings of the 22Nd ACM International Conference on Conference on Information & Knowledge Management, CIKM '13, pages 1685--1690, New York, NY, USA, 2013. ACM. Google ScholarDigital Library
- M. Wilson, S. Villalba, H. Avila, J. Hahn, and A. Cepeda. Correlation between atmospheric tree pollen levels with three weather variables during 2002-2004 in a tropical urban area. Journal of Allergy and Clinical Immunology, 127(2):AB170, 2011.Google Scholar
- Mining Social Media Streams to Improve Public Health Allergy Surveillance
Recommendations
Text Mining from Social Media for Public Health Applications
DH '17: Proceedings of the 2017 International Conference on Digital HealthPublic Health is crucial to manage and monitor threats to the health of the population. In recent years, Twitter has been successfully applied to monitor diseases through its ability to provide near real-time data and proved to be an asset to the ...
Social media as a measurement tool of depression in populations
WebSci '13: Proceedings of the 5th Annual ACM Web Science ConferenceDepression is a serious and widespread public health challenge. We examine the potential for leveraging social media postings as a new type of lens in understanding depression in populations. Information gleaned from social media bears potential to ...
Role of social media in tackling challenges in mental health
SAM '13: Proceedings of the 2nd international workshop on Socially-aware multimediaMental illness is a serious and widespread health challenge in our society today. Tens of millions of people each year suffer from depression and only a fraction receives adequate treatment. This position paper highlights some recent attempts examining ...
Comments