ABSTRACT
We propose a dynamic topic model for monitoring the temporal evolution of market competition by jointly leveraging tweets and their associated images. For a market of interest (e.g., luxury goods), we aim to automatically detect the latent topics (e.g., bags, clothes, luxurious) that are competitively shared by multiple brands (e.g., Burberry, Prada, and Chanel), and to track the temporal evolution of the brands' stakes in those shared topics. One key application of our work is social media monitoring that provides companies with temporal summaries of the topics that overlap most with, or best discriminate them from, their major competitors. We design our model to address three major challenges: multiview representation of text and images, modeling the competitiveness of multiple brands over shared topics, and tracking their temporal evolution. To our knowledge, no previous model addresses all three challenges. For evaluation, we analyze about 10 million tweets and 8 million associated images for 23 brands in two categories, luxury and beer. Through experiments, we show that the proposed approach outperforms other candidate methods at topic modeling of competition. We also quantitatively demonstrate the generalization power of the proposed method on three prediction tasks.
Index Terms
- Dynamic Topic Modeling for Monitoring Market Competition from Online Text and Image Data