ABSTRACT
Twitter streams are on overload: active users receive hundreds of items per day, and existing interfaces force us to march through a chronologically-ordered morass to find tweets of interest. We present an approach to organizing a user's own feed into coherently clustered trending topics for more directed exploration. Our Twitter client, called Eddi, groups tweets in a user's feed into topics mentioned explicitly or implicitly, which users can then browse for items of interest. To implement this topic clustering, we have developed a novel algorithm for discovering topics in short status updates powered by linguistic syntactic transformation and callouts to a search engine. An algorithm evaluation reveals that search engine callouts outperform other approaches when they employ simple syntactic transformation and backoff strategies. Active Twitter users evaluated Eddi and found it to be a more efficient and enjoyable way to browse an overwhelming status update feed than the standard chronological interface.
Supplemental Material
- }}Baumer, E. and Fisher, D. Smarter Blogroll: An Exploration of Social Topic Extraction for Manageable Blogrolls. HICSS '08, IEEE Press (2008). Google ScholarDigital Library
- }}Bendersky, M. and Croft, W. B. Discovering key concepts in verbose queries. SIGIR '08, ACM Press (2008). Google ScholarDigital Library
- }}Blei, D. M., Ng, A. Y., and Jordan, M. I. Latent Dirichlet Allocation. Journal of Machine Learning Research 3, 4-5 (2003), 993--1022. Google ScholarDigital Library
- }}Chen, J., Nairn, R., Nelson, L., et al. Short and Tweet: Experiments on Recommending Content from Information Streams. Proc. CHI '10, ACM Press (2010). Google ScholarDigital Library
- }}Dakka, W. and Ipeirotis, P. G. Automatic Extraction of Useful Facet Hierarchies from Text Databases. IEEE Data Engineering '08, IEEE (2008). Google ScholarDigital Library
- }}Dumais, S., Cutrell, E., and Chen, H. Optimizing search by showing results in context. CHI '01, ACM Press (2001). Google ScholarDigital Library
- }}Ehrlich, K. and Shami, N. Microblogging Inside and Outside the Workplace. ICWSM '10, AAAI Press (2010).Google Scholar
- }}Gabrilovich, E. and Markovitch, S. Computing semantic relatedness using wikipedia-based explicit semantic analysis. IJCAI '07, (2007), 6--12. Google ScholarDigital Library
- }}Havre, S., Hetzler, E., Whitney, P., and Nowell, L. ThemeRiver: visualizing thematic changes in large document collections. IEEE Trans. Vis. and Comp. Graphics 8, 1 (2002), 9--20. Google ScholarDigital Library
- }}Hearst, M. A. and Rosner, D. Tag Clouds: Data Analysis Tool or Social Signaller? HICSS '08, (2008), 160--160. Google ScholarDigital Library
- }}Hearst, M. A. Clustering versus faceted categories for information exploration. CACM 49, 4 (2006), 59. Google ScholarDigital Library
- }}Hearst, M. A. Search User Interfaces. Cambridge University Press, 2009. Google ScholarDigital Library
- }}Honeycutt, C. and Herring, S. C. Beyond Microblogging: Conversation and Collaboration via Twitter. HICSS '09, IEEE (2009).Google Scholar
- }}Hulth, A. Improved automatic keyword extraction given more linguistic knowledge. EMNLP '03, ACL (2003), 216--223. Google ScholarDigital Library
- }}Java, A., Song, X., Finin, T., and Tseng, B. Why We Twitter: Understanding Microblogging Usage and Communities. WebKDD '07, ACM Press (2007), 56--65. Google ScholarDigital Library
- }}Kammerer, Y., Nairn, R., Pirolli, P., and Chi, E. H. Signpost from the masses: learning effects in an exploratory social tag search browser. CHI '09, ACM Press (2009), 625--634. Google ScholarDigital Library
- }}Käki, M. Findex: search result categories help users when document ranking fails. CHI '05, ACM Press (2005), 131--140. Google ScholarDigital Library
- }}Leskovec, J., Backstrom, L., and Kleinberg, J. Meme-tracking and the dynamics of the news cycle. KDD '09, ACM Press (2009), 497--506. Google ScholarDigital Library
- }}Naaman, M., Boase, J., and Lai, C. Is it Really About Me? Message Content in Social Awareness Streams. CSCW '10, ACM Press (2010). Google ScholarDigital Library
- }}Paley, W. B. TextArc: Showing Word Frequency and Distribution in Text. Ext. Proc. IEEE InfoViz, IEEE (2002).Google Scholar
- }}Ramage, D., Dumais, S., and Liebling, D. Characterizing Microblogs with Topic Models. ICWSM '10, AAAI Press (2010).Google Scholar
- }}Sahami, M. and Heilman, T. D. A web-based kernel function for measuring the similarity of short text snippets. WWW '06, ACM Press (2006), 377--386. Google ScholarDigital Library
- }}Salton, G. and Buckley, C. Term-weighting approaches in automatic text retrieval. Information Processing and Management 24, 5 (1988), 513--523. Google ScholarDigital Library
- }}Shamma, D. A., Kennedy, L., and Churchill, E. F. Tweet the debates: understanding community annotation of uncollected sources. Multimedia '09, (2009).Google ScholarDigital Library
- }}Wattenberg, M. and Kriss, J. Designing for social data analysis. IEEE Trans. Viz. and Comp. Graphics 12, 4 (2006), 549--57. Google ScholarDigital Library
- }}Yee, K., Swearingen, K., Li, K., and Hearst, M. Faceted metadata for image search and browsing. CHI '03, ACM Press (2003), 401--408. Google ScholarDigital Library
- }}Zhang, J., Qu, Y., Cody, J., and Wu, Y. A case study of micro-blogging in the enterprise: use, value, and related issues. Proc., ACM Press (2010), 243--252.Google ScholarDigital Library
Index Terms
- Eddi: interactive topic-based browsing of social status streams
Recommendations
Disinformation Warfare: Understanding State-Sponsored Trolls on Twitter and Their Influence on the Web
WWW '19: Companion Proceedings of The 2019 World Wide Web ConferenceOver the past couple of years, anecdotal evidence has emerged linking coordinated campaigns by state-sponsored actors with efforts to manipulate public opinion on the Web, often around major political events, through dedicated accounts, or “trolls.” ...
A sentiment analysis of audiences on twitter: who is the positive or negative audience of popular twitterers?
ICHIT'11: Proceedings of the 5th international conference on Convergence and hybrid information technologyMicroblogging is a new informal communication medium of blogging that differs from a traditional blog in which content is much shorter. Microbloggers post about topics that describe their current status. Twitter is a popular microblogging service and ...
Rumor Gauge: Predicting the Veracity of Rumors on Twitter
Special Issue on KDD 2016 and Regular PapersThe spread of malicious or accidental misinformation in social media, especially in time-sensitive situations, such as real-world emergencies, can have harmful effects on individuals and society. In this work, we developed models for automated ...
Comments