ABSTRACT
An increasing fraction of the global discourse is migrating online in the form of blogs, bulletin boards, web pages, wikis, editorials, and a dizzying array of new collaborative technologies. The migration has now proceeded to the point that topics reflecting certain individual products are sufficiently popular to allow targeted online tracking of the ebb and flow of chatter around these topics. Based on an analysis of around half a million sales rank values for 2,340 books over a period of four months, and correlating postings in blogs, media, and web pages, we are able to draw several interesting conclusions.First, carefully hand-crafted queries produce matching postings whose volume predicts sales ranks. Second, these queries can be automatically generated in many cases. And third, even though sales rank motion might be difficult to predict in general, algorithmic predictors can use online postings to successfully predict spikes in sales rank.
- E. Adar, L. Zhang, L. A. Adamic, and R. M. Lukose. Implicit structure and the dynamics of blogspace. Workshop on the Weblogging Ecosystem, 13th International World Wide Web Conference, 2004.Google Scholar
- A. Admati and P eiderer. Disclosing information on the internet: Is it noise or is it news? Technical report, Graduate School of Business, Stanford University, 2001.Google Scholar
- J. Allan, J. Carbonell, G. Doddington, J. Yamron, and Y. Yang. Topic detection and tracking pilot study: Final report. Proc. of the DARPA Broadcast News Transcription and Understanding Workshop, 1998.Google Scholar
- W. Antweiler and M. Z. Frank. Is all that talk just noise? The information content of Internet stock message boards. Journal of Finance, 59(3):1259--1295, 2004.Google ScholarCross Ref
- S. Arbesman. The memespread project: An initial analysis of the contagious nature of information in online networks. http://www.arbesman.net/memespread.pdf, 2004.Google Scholar
- Biz360. Market360 product datasheet. Technical report, Biz360, 2004.Google Scholar
- P. Blackshaw and M. Nazzaro. Consumer-generated media (cgm) 101. Technical report, Intelliseek, 2004.Google Scholar
- G. E. P. Box, G. M. Jenkins, and G. C. Reinsel. Time Series Analysis, Forecasting and Control. Prentice Hall, 1994. Google ScholarDigital Library
- Carma. How doe we gain an understanding of the media environment on our company as our industry comes under scrutiny? Technical report, Carma, 2004.Google Scholar
- C. Chatfield. The Analysis of Time Series. Chapman and Hall, 1984.Google Scholar
- S. Dill, N. Eiron, D. Gibson, D. Gruhl, R. Guha, A. Jhingran, T. Kanungo, S. Rajagopalan, A. Tomkins, J. Tomlin, and J. Y. Zien. Semtag and seeker: Bootstrapping the semantic web via automated semantic annotation. In Proc. of the 12th International World Wide Web Conference, pages 178--186, 2003. Google ScholarDigital Library
- D. Sornette, F. Deschâtres, T. Gilbert, and Y. Ageon. Endogenous versus exogenous shocks in complex networks: An empirical test using book sale rankings. Physical Review Letters, 93(228701), 2004.Google Scholar
- D. Gruhl, L. Chavet, D. Gibson, J. Meyer, P. Pattanayak, A. Tomkins, and J. Zien. How to build a webfountain: An architecture for very large-scale text analytics. IBM Systems Journal, 43(1):64--77, 2004. Google ScholarDigital Library
- D. Gruhl, R. Guha, D. Liben-Nowell, and A. Tomkins. Information diffusion through blogspace. In Proc. of the 13th International World Wide Web Conference, pages 491--501, 2004. Google ScholarDigital Library
- J. Kleinberg. Bursty and hierarchical structure in streams. In Proc. 8th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining, pages 91--101, 2002. Google ScholarDigital Library
- R. Kumar, J. Novak, P. Raghavan, and A. Tomkins. On the bursty evolution of blogspace. In Proc. of the 12th International World Wide Web Conference, pages 568--576, 2003. Google ScholarDigital Library
- R. Kumar, J. Novak, P. Raghavan, and A. Tomkins. Structure and evolution of blogspace. Communications of the ACM, 47(12):35--39, 2004. Google ScholarDigital Library
- J. Lin and A. Halavais. Mapping the blogosphere in america. Workshop on the Weblogging Ecosystem, 13th International World Wide Web Conference, 2004.Google Scholar
- R. Papka. On-line new event detection, clustering, and tracking. Technical Report UM-CS-1999-045, University of Massachusetts, 1999. Google ScholarDigital Library
- D. Smith. Detecting and browsing events in unstructured text. In Proc. of the 25th ACM International Conference on Research and Development in Information Retrieval, pages 73--80, 2002. Google ScholarDigital Library
- R. Tong. Detecting and tracking opinions in on-line discussions. UCB/SIMS Web Mining Workshop, 2001.Google Scholar
- R. Tumarkin and R. F. Whitelaw. News or noise? internet postings and stock prices. Financial Analysts Journal, pages 41--51, 2001.Google ScholarCross Ref
- B. Whitman and S. Lawrence. Inferring descriptions and similarity for music from community metadata. In Proc. of the 2002 International Computer Music Conference, pages 591--598, 2002.Google Scholar
- Y. Yang, T. Pierce, and J. Carbonell. A study on retrospective and on-line event detection. In Proc. of the 21st ACM International Conference on Research and Development in Information Retrieval, pages 28--36, 1998. Google ScholarDigital Library
Index Terms
- The predictive power of online chatter
Recommendations
Analyzing online opinions and influence campaigns on blogs using BlogTracker
ASONAM '21: Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and MiningBlogging has become an essential part of the new print media of the 21st century despite the emergence of social media platforms like Twitter and Facebook, with many news agencies, media outlets, journalists and users using this medium to write without ...
Finding prophets in the blogosphere: bloggers who predicted buzzwords before they become popular
iiWAS '15: Proceedings of the 17th International Conference on Information Integration and Web-based Applications & ServicesIdentifying important users from social media has recently attracted much attention in information and knowledge management community. Although researchers have focused on users' knowledge levels on certain topics or influence degrees on other users in ...
Bloggers and Readers Blogging Together: Collaborative Co-creation of Political Blogs
A significant amount of research has focused on blogs, bloggers, and blogging. However, relatively little work has examined blog readers, their interactions with bloggers, or their impact on blogging. This paper presents a qualitative study focusing ...
Comments