ABSTRACT
Social media contributes to enhancing transparency and openness for the purpose of innovating public services and policy-making. In disaster management, social media data can be mined to discover public perceptions and concerns on large disaster events. However, converting large data streams into useful information remains a challenge due to the unstructured nature of textual data. This study proposes an interactive topic modeling method to analyze microblog data for understanding the dynamics of public expressions immediately after a major explosion event. First, we extract topics from microblog message data. In order to test the influence of the number of topics, the topics are detected at multiple levels of granularity by varying the number of topics. Second, these topics are used to detect topical compositions of contents at different time slices and assess the topic evolution over time. The topic evolution patterns are visualized by the streamgraph method to discover informative topics to help to take further actions. Third, since the first-level topics are not informative, we conduct a second-level topic (subtopic) analysis to detect key decision elements by choosing "investigation" from the first-level topics, a hot focus in any man-made disaster. The results improve our understanding of the topic composition evolution around major man-made disasters and have implications on officials deciding what and when to release formal investigation information to the public.
- Farzindar Atefeh and Wael Khreich. 2015. A survey of techniques for event detection in twitter. Computational Intelligence 31, 1 (2015), 132--164. Google ScholarDigital Library
- Hila Becker, Mor Naaman, and Luis Gravano. 2011. Beyond Trending Topics: Real-World Event Identification on Twitter. ICWSM 11, 2011 (2011), 438--441.Google Scholar
- Nicolas Garcia Belmonte. 2014. Extracting and visualizing insights from realtime conversations around public presentations. In Visual Analytics Science and Technology (VAST), 2014 IEEE Conference on. IEEE, 225--226.Google ScholarCross Ref
- David M Blei. 2012. Probabilistic topic models. Commun. ACM 55, 4 (2012), 77--84. Google ScholarDigital Library
- David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022. Google ScholarDigital Library
- Louis Capozzi. 2013. Crisis management in the Age of Social Media. Business Expert Press.Google Scholar
- Cornelia Caragea, Nathan McNeese, Anuj Jaiswal, Greg Traylor, Hyun-Woo Kim, Prasenjit Mitra, Dinghao Wu, Andrea H Tapia, Lee Giles, Bernard J Jansen, and others. 2011. Classifying text messages for the Haiti earthquake. In Proceedings of the 8th international conference on information systems for crisis response and management (ISCRAM2011). Citeseer.Google Scholar
- Junghoon Chae, Dennis Thom, Harald Bosch, Yun Jang, Ross Maciejewski, David S Ebert, and Thomas Ertl. 2012. Spatiotemporal social media analytics for abnormal event detection and examination using seasonal-trend decomposition. In Visual Analytics Science and Technology (VAST), 2012 IEEE Conference on. IEEE, 143--152. Google ScholarDigital Library
- Yuzhong Chen, Wanhua Li, Wenzhong Guo, and Kun Guo. 2015. Popular Topic Detection in Chinese Micro-Blog Based on the Modified LDA Model. In Web Information System and Application Conference (WISA), 2015 12th. IEEE, 37--42. Google ScholarDigital Library
- Dmitry Chernov and Didier Sornette. 2016. Causes of Risk Information Concealment. In Man-made Catastrophes and Risk Information Concealment: Case Studies of Major Disasters and Human Fallibility, Dmitry Chernov and Didier Sornette (Eds.). Springer, Zürich, Switzerland.Google Scholar
- Louise K Comfort, Jennifer Bert, and Jee Eun Song. 2016. Wicked problems in real time: uncertainty, information, and the escalation of Ebola. Information polity 21, 3 (2016), 273--289.Google Scholar
- Kate Crawford and Megan Finn. 2015. The limits of crisis data: analytical and ethical challenges of using social and mobile data to understand disasters. GeoJournal 80, 4 (2015), 491--502.Google ScholarCross Ref
- Weiwei Cui, Shixia Liu, Li Tan, Conglei Shi, Yangqiu Song, Zekai Gao, Huamin Qu, and Xin Tong. 2011. Textflow: Towards better understanding of evolving topics in text. IEEE transactions on visualization and computer graphics 17, 12 (2011), 2412--2421. Google ScholarDigital Library
- Carmen De Maio, Giuseppe Fenza, Vincenzo Loia, and Francesco Orciuoli. 2017. Unfolding social content evolution along time and semantics. Future Generation Computer Systems 66 (2017), 146--159.Google ScholarCross Ref
- Qing Deng, Yi Liu, Hui Zhang, Xiaolong Deng, and Yefeng Ma. 2016. A new crowdsourcing model to assess disaster using microblog data in typhoon Haiyan. Natural Hazards 84, 2 (2016), 1241--1256.Google ScholarCross Ref
- Xian Gao and Jooho Lee. 2017. E-government services and social media adoption: Experience of small local governments in Nebraska state. Government Information Quarterly 34, 4 (2017), 627--634.Google ScholarCross Ref
- Jennifer Golbeck, Justin M Grimes, and Anthony Rogers. 2010. Twitter use by the US Congress. Journal of the Association for Information Science and Technology 61, 8 (2010), 1612--1621. Google ScholarDigital Library
- Alaa Abi Haidar, Bin Yang, and Jean-Gabriel Ganascia. 2016. Visualizing the First World War Using StreamGraphs and Information Extraction. In Information Visualisation (IV), 2016 20th International Conference. IEEE, 290--293.Google ScholarCross Ref
- Haijing Hao and Kunpeng Zhang. 2016. The voice of chinese health consumers: a text mining approach to web-Based physician reviews. Journal of medical Internet research 18, 5 (2016).Google ScholarCross Ref
- Enamul Hoque and Giuseppe Carenini. 2016. Interactive topic modeling for exploring asynchronous online conversations: Design and evaluation of ConVisIT. ACM Transactions on Interactive Intelligent Systems (TiiS) 6, 1 (2016), 7. Google ScholarDigital Library
- Ping Huang and Jingyuan Zhang. 2015. Facts related to August 12, 2015 explosion accident in Tianjin, China. Process Safety Progress 34, 4 (2015), 313--314.Google ScholarCross Ref
- Wei Huang and Songnian Li. 2016. Understanding human activity patterns based on space-time-semantics. ISPRS Journal of Photogrammetry and Remote Sensing 121 (2016), 1--10.Google ScholarCross Ref
- Alan Jackoway, Hanan Samet, and Jagan Sankaranarayanan. 2011. Identification of live news events using Twitter. In Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks. ACM, 25--32. Google ScholarDigital Library
- Tomasz Janowski. 2015. Digital government evolution: From transformation to contextualization. (2015).Google Scholar
- Krishna Y Kamath and James Caverlee. 2011. Discovering trending phrases on information streams. In Proceedings of the 20th ACM international conference on Information and knowledge management. ACM, 2245--2248. Google ScholarDigital Library
- Andrea L Kavanaugh, Edward A Fox, Steven D Sheetz, Seungwon Yang, Lin Tzy Li, Donald J Shoemaker, Apostol Natsev, and Lexing Xie. 2012. Social media use by government: From the routine to the critical. Government Information Quarterly 29, 4 (2012), 480--491.Google ScholarCross Ref
- Dongwoo Kim and Alice Oh. 2011. Topic chains for understanding a news corpus. Computational Linguistics and Intelligent Text Processing (2011), 163--176. Google ScholarDigital Library
- Jae Won Kim, Dongwoo Kim, Brian Keegan, Joon Hee Kim, Suin Kim, and Alice Oh. 2015. Social media dynamics of global co-presence during the 2014 FIFA World Cup. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. ACM, 2623--2632. Google ScholarDigital Library
- Dennis Linders. 2012. From e-government to we-government: Defining a typology for citizen coproduction in the age of social media. Government Information Quarterly 29, 4 (2012), 446--454.Google ScholarCross Ref
- Shuhua Monica Liu. 2016. Chinese Netizens' Collective Deliberation on Bystander Controversies and the Role of Micro-blogging. In Proceedings of the 17th International Digital Government Research Conference on Digital Government Research. ACM, 425--434. Google ScholarDigital Library
- B De Longueville, Gianluca Luraschi, Paul Smits, S Peedell, and T De Groeve. 2010. Citizens as sensors for natural hazards: A VGI integration worklow. Geomatica 64, 1 (2010), 41--59.Google Scholar
- Lianjie Ma, Jongpil Chung, and Stuart Thorson. 2005. E-government in China: Bringing economic development through administrative reform. Government Information Quarterly 22, 1 (2005), 20--37.Google ScholarCross Ref
- Yefeng Ma, Qing Deng, Xinzhi Wang, Jiaqi Liu, and Hui Zhang. 2014. Keyword-Based Semantic Analysis of Microblog for Public Opinion Study in Online Collective Behaviors. In International Conference on Web-Age Information Management. Springer, 44--55.Google Scholar
- Mahshid Marbouti, Irene Mayor, Dianna Yim, and Frank Maurer. 2013. Social Media Analyst Responding Tool: A Visual Analytics Prototype to Identify Relevant Tweets in Emergency Events. (2013).Google Scholar
- David F Merrick and Tom Duffy. 2013. Utilizing community volunteered information to enhance disaster situational awareness.. In ISCRAM.Google Scholar
- Stuart E Middleton, Lee Middleton, and Stefano Modafferi. 2014. Real-time crisis mapping of natural disasters using social media. IEEE Intelligent Systems 29, 2 (2014), 9--17.Google ScholarCross Ref
- Robertus Nugroho, Diego Molla-Aliod, Jian Yang, Youliang Zhong, Cecile Paris, and Surya Nepal. 2015. Incorporating tweet relationships into topic derivation. In International Conference of the Pacific Association for Computational Linguistics. Springer, 177--190.Google Scholar
- Robertus Nugroho, Weiliang Zhao, Jian Yang, Cecile Paris, and Surya Nepal. 2017. Using time-sensitive interactions to improve topic derivation in twitter. World Wide Web 20, 1 (2017), 61--87. Google ScholarDigital Library
- Alexander Pak and Patrick Paroubek. 2010. Twitter as a corpus for sentiment analysis and opinion mining.. In LREc, Vol. 10.Google Scholar
- Panos Panagiotopoulos, Frances Bowen, and Phillip Brooker. 2017. The value of social media data: Integrating crowd capabilities in evidence-based policy. Government Information Quarterly 34, 4 (2017), 601--612.Google ScholarCross Ref
- Kuei-Hsiang Peng, Li-Heng Liou, Cheng-Shang Chang, and Duan-Shin Lee. 2015. Predicting personality traits of Chinese users based on Facebook wall posts. In Wireless and Optical Communication Conference (WOCC), 2015 24th. IEEE, 9--14.Google ScholarCross Ref
- Yan Qu, Chen Huang, Pengyi Zhang, and Jun Zhang. 2011. Microblogging after a major disaster in China: a case study of the 2010 Yushu earthquake. In Proceedings of the ACM 2011 conference on Computer supported cooperative work. ACM, 25--34. Google ScholarDigital Library
- Bernd Resch, Florian Usländer, and Clemens Havas. 2017. Combining machine-learning topic models and spatiotemporal analysis of social media data for disaster footprint and damage assessment. Cartography and Geographic Information Science (2017), 1--15.Google Scholar
- Michael Röder, Andreas Both, and Alexander Hinneburg. 2015. Exploring the space of topic coherence measures. In Proceedings of the eighth ACM international conference on Web search and data mining. ACM, 399--408. Google ScholarDigital Library
- Tatsuhiro Sakai, Keiichi Tamura, and Hajime Kitakami. 2015. Extracting Topic-Related Photos in Density-Based Spatiotemporal Analysis System for Enhancing Situation Awareness. In Advanced Applied Informatics (IIAI-AAI), 2015 IIAI 4th International Congress on. IEEE, 201--206. Google ScholarDigital Library
- Yongzhong Sha, Jinsong Yan, and Guoray Cai. 2014. Detecting public sentiment over PM2. 5 pollution hazards through analysis of Chinese microblog.. In ISCRAM.Google Scholar
- Nabila Shahid, Muhammad Ilyas, Jalal Alowibdi, and Naif Aljohani. 2017. Word-cloud Segmentation for Simplified Exploration of Trending Topics on Twitter. IET Software (2017).Google Scholar
- Enrico Steiger, JoÃo Porto Albuquerque, and Alexander Zipf. 2015. An advanced systematic literature review on spatiotemporal analyses of Twitter data. Transactions in GIS 19, 6 (2015), 809--834.Google ScholarCross Ref
- Enrico Steiger, René Westerholt, Bernd Resch, and Alexander Zipf. 2015. Twitter as an indicator for whereabouts of people? Correlating Twitter with UK census data. Computers, Environment and Urban Systems 54 (2015), 255--265.Google ScholarCross Ref
- J Sun. 2012. 'Jieba' Chinese word segmentation tool. (2012).Google Scholar
- Dennis Thom, Robert Krüger, and Thomas Ertl. 2016. Can Twitter save lives? A broad-scale study on visual social media analytics for public safety. IEEE transactions on visualization and computer graphics 22, 7 (2016), 1816--1829.Google Scholar
- Stephen Wan and Cécile Paris. 2014. Improving government services with social media feedback. In Proceedings of the 19th international conference on Intelligent User Interfaces. ACM, 27--36. Google ScholarDigital Library
- Yandong Wang, Teng Wang, Xinyue Ye, Jianqi Zhu, and Jay Lee. 2015. Using social media for emergency response and urban sustainability: A case study of the 2012 Beijing rainstorm. Sustainability 8, 1 (2015), 25.Google ScholarCross Ref
- Zheye Wang and Xinyue Ye. 2018. Social media analytics for natural disaster management. International Journal of Geographical Information Science 32, 1 (2018), 49--72. Google ScholarDigital Library
- Yingcai Wu, Nan Cao, David Gotz, Yap-Peng Tan, and Daniel A Keim. 2016. A survey on visual analytics of social media data. IEEE Transactions on Multimedia 18, 11 (2016), 2135--2148. Google ScholarDigital Library
- Qianli Yuan and Mila Gasco. 2017. Citizens' Use of Microblogging During Emergency: A Case Study on Water Contamination in Shanghai. In Proceedings of the 18th Annual International Conference on Digital Government Research. ACM, 110--119. Google ScholarDigital Library
- Lizhou Zheng, Peiquan Jin, Jie Zhao, and Lihua Yue. 2014. Multi-dimensional sentiment analysis for large-scale E-commerce reviews. In International Conference on Database and Expert Systems Applications. Springer, 449--463.Google ScholarCross Ref
Index Terms
- Enhancing situation awareness of public safety events by visualizing topic evolution using social media
Recommendations
Hashtag-based topic evolution in social media
The rise of online social media has led to an explosion of metadata-containing user generated content. The tracking of metadata distribution is essential to understand social media. This paper presents two statistical models that detect interpretable ...
Topic evolution based on the probabilistic topic model: a review
Accurately representing the quantity and characteristics of users' interest in certain topics is an important problem facing topic evolution researchers, particularly as it applies to modern online environments. Search engines can provide information ...
Subtopic Based Topic Evolution Analysis
WISM '09: Proceedings of the 2009 International Conference on Web Information Systems and MiningTraditional topic tracking approaches can obtain the relevant stories. However, the relationship between stories occurred during the topic developing process can not be exhibited clearly. By analyzing the evolution of the topic, the concept subtopic is ...
Comments