skip to main content
10.1145/2808797.2808896acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Mining Social Media Streams to Improve Public Health Allergy Surveillance

Authors Info & Claims
Published:25 August 2015Publication History

ABSTRACT

Allergies are one of the most common chronic diseases worldwide. One in five Americans suffer from either allergy or asthma symptoms. With the prevalence of social media, people sharing experiences and opinions on personal health symptoms and concerns on social media are increasing. Mining those publicly available health related data potentially provides valuable healthcare insights. In this paper, we propose a real-time allergy surveillance system that first classifies tweets to identify those that mention actual allergy incidents using bag-of-words model and NaiveBayesMultinomial classifier and applies in-depth text and spatiotemporal analysis. Our experimental results show that the proposed system can detect predominant allergy types with high precision and that allergy-related tweet volume is highly correlated to the weather data (daily maximum temperature). We believe that this is the first study that examines a large-scale social media stream for in-depth analysis of allergy activities.

References

  1. H. Achrekar, A. Gandhe, R. Lazarus, S.-H. Yu, and B. Liu. Predicting flu trends using twitter data. In Computer Communications Workshops (INFOCOM WKSHPS), 2011 IEEE Conference on, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  2. E. Aramaki, S. Maskawa, and M. Morita. Twitter catches the flu: Detecting influenza epidemics using twitter. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11, pages 1568--1576, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Asur and B. A. Huberman. Predicting the future with social media. In Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, WI-IAT '10, pages 492--499, Washington, DC, USA, 2010. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. L. Blackwell, J. W. Lucas, and T. C. Clarke. Summary health statistics for u.s. adults: National health interview survey, 2012. http://www.cdc.gov/nchs/data/series/sr_10/sr10_260.pdf, 2013.Google ScholarGoogle Scholar
  5. B. Bloom, L. I. Jones, and G. Freeman. Summary health statistics for u.s. children: National health interview survey, 2012. http://www.cdc.gov/nchs/data/series/sr_10/sr10_258.pdf, 2012.Google ScholarGoogle Scholar
  6. J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1):1 -- 8, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  7. P. Chakraborty, P. Khadivi, B. Lewis, A. Mahendiran, J. Chen, P. Chakraborty, P. Khadivi, B. Lewis, A. Mahendiran, and J. C. and. Forecasting a moving target: Ensemble models for ili case count predictions. In SDM, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  8. L. Chen, H. Achrekar, B. Liu, and R. Lazarus. Vision: Towards real time epidemic vigilance through online social networks: Introducing sneft -- social network enabled flu trends. In Proceedings of the 1st ACM Workshop on Mobile Cloud Computing & Services: Social Networks and Beyond, MCS '10, pages 4:1--4:5, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. Chew and G. Eysenbach. Pandemics in the age of twitter: Content analysis of tweets during the 2009 h1n1 outbreak. PLoS ONE, 5(11):e14118, 11 2010.Google ScholarGoogle ScholarCross RefCross Ref
  10. A. Choudhary, W. Hendrix, K. Lee, D. Palsetia, and W.-K. Liao. Social media evolution of the egyptian revolution. Commun. ACM, 55(5):74--80, May 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. L. de Weger, T. Beerthuizen, P. Hiemstra, and J. Sont. Development and validation of a 5-day-ahead hay fever forecast for patients with grass-pollen-induced allergic rhinitis. International Journal of Biometeorology, 58(6):1047--1055, 2014.Google ScholarGoogle Scholar
  12. J. Emberlin, J. Mullins, J. Corden, W. Millington, M. Brooke, M. Savage, and S. Jones. The trend to earlier birch pollen seasons in the uk: a biotic response to changes in weather conditions? Grana, 36(1):29-- 33, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  13. J. U. Espino, W. R. Hogan, and M. M. Wagner. Telephone triage: a timely data source for surveillance of influenza-like diseases. In AMIA Annual Symposium Proceedings, volume 2003, page 215. American Medical Informatics Association, 2003.Google ScholarGoogle Scholar
  14. J. Ginsberg, M. Mohebbi, R. Patel, L. Brammer, M. Smolinski, and L. Brilliant. Detecting influenza epidemics using search engine query data. Nature, 457:1012--1014, 2009. doi:10.1038/nature07634.Google ScholarGoogle ScholarCross RefCross Ref
  15. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The weka data mining software: An update. SIGKDD Explor. Newsl., 11(1):10--18, Nov. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. V. Lampos, T. De Bie, and N. Cristianini. Flu detector-tracking epidemics on twitter. In Machine Learning and Knowledge Discovery in Databases, pages 599--602. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. Lee, A. Agrawal, and A. Choudhary. Real-time disease surveillance using twitter data: Demonstration on flu and cancer. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '13, pages 1474--1477, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. K. Lee, D. Palsetia, R. Narayanan, M. M. A. Patwary, A. Agrawal, and A. Choudhary. Twitter trending topic classification. In Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on, pages 251--258. IEEE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Li and C. Cardie. Early stage influenza detection from twitter. arXiv preprint arXiv:1309.7340, 2013.Google ScholarGoogle Scholar
  20. S. Magruder. Evaluation of over-the-counter pharmaceutical sales as a possible early warning indicator of human disease. Johns Hopkins APL technical digest, 24(4):349--53, 2003.Google ScholarGoogle Scholar
  21. C. D. Manning, P. Raghavan, and H. Schtze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008. Google ScholarGoogle ScholarCross RefCross Ref
  22. A. McCallum and K. Nigam. A comparison of event models for naive bayes text classification. In IN AAAI-98 WORKSHOP ON LEARNING FOR TEXT CATEGORIZATION, pages 41--48. AAAI Press, 1998.Google ScholarGoogle Scholar
  23. M. J. Paul and M. Dredze. You are what you tweet: Analyzing twitter for public health. In ICWSM. The AAAI Press, 2011.Google ScholarGoogle Scholar
  24. R. Pawankar, G. W. Canonica, S. T. Holgate, and R. F. Lockey. Wao white book on allergy. http://www.worldallergy.org/UserFiles/file/WAO-White-Book-on-Allergy_web.pdf, 2011.Google ScholarGoogle Scholar
  25. T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes twitter users: Real-time event detection by social sensors. In In Proceedings of the Nineteenth International WWW Conference (WWW2010). ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. J. Shaman, A. Karspeck, W. Yang, J. Tamerius, and M. Lipsitch. Real-time influenza forecasts during the 2012--2013 season. Nature Communications, 4, Dec. 2013.Google ScholarGoogle ScholarCross RefCross Ref
  27. A. Signorini, A. M. Segre, and P. M. Polgreen. The use of twitter to track levels of disease activity and public concern in the u.s. during the influenza a h1n1 pandemic. PLoS ONE, 6(5):e19467, 05 2011.Google ScholarGoogle ScholarCross RefCross Ref
  28. M. Sofean and M. Smith. A real-time architecture for detection of diseases using social networks: Design, implementation and evaluation. In Proceedings of the 23rd ACM Conference on Hypertext and Social Media, HT '12, pages 309--310, New York, NY, USA, 2012. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. S. Tuarob, C. S. Tucker, M. Salathe, and N. Ram. Discovering health-related knowledge in social media using ensembles of heterogeneous features. In Proceedings of the 22Nd ACM International Conference on Conference on Information & Knowledge Management, CIKM '13, pages 1685--1690, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Wilson, S. Villalba, H. Avila, J. Hahn, and A. Cepeda. Correlation between atmospheric tree pollen levels with three weather variables during 2002-2004 in a tropical urban area. Journal of Allergy and Clinical Immunology, 127(2):AB170, 2011.Google ScholarGoogle Scholar
  1. Mining Social Media Streams to Improve Public Health Allergy Surveillance

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            ASONAM '15: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015
            August 2015
            835 pages
            ISBN:9781450338547
            DOI:10.1145/2808797

            Copyright © 2015 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 25 August 2015

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

            Acceptance Rates

            Overall Acceptance Rate116of549submissions,21%

            Upcoming Conference

            KDD '24

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader