Skip to main content

2015 | OriginalPaper | Buchkapitel

Visual Analysis of Topics in Twitter Based on Co-evolution of Terms

verfasst von : Lambert Pépin, Julien Blanchard, Fabrice Guillet, Pascale Kuntz, Philippe Suignard

Erschienen in: Data Science, Learning by Latent Structures, and Knowledge Discovery

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The analysis of Twitter short messages has become a key issue for companies seeking to understand consumer behaviour and expectations. However, automatic algorithms for topic tracking often extract general tendencies at a high granularity level and do not provide added value to experts who are looking for more subtle information. In this paper, we focus on the visualization of the co-evolution of terms in tweets in order to facilitate the analysis of the evolution of topics by a decision-maker. We take advantage of the perceptual quality of heatmaps to display our 3D data (term × time × score) in a 2D space. Furthermore, by computing an appropriate order to display the main terms on the heatmap, our methodology ensures an intuitive visualization of their co-evolution. An experiment was conducted on real-life datasets in collaboration with an expert in customer relationship management working at the French energy company EDF. The first results show three different kinds of co-evolution of terms: bursty features, reoccurring terms and long periods of activity.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Aigner, W., Miksch, S., Schumann, H., & Tominski, C. (2011). Visualization of time-oriented data. London: Springer.CrossRef Aigner, W., Miksch, S., Schumann, H., & Tominski, C. (2011). Visualization of time-oriented data. London: Springer.CrossRef
Zurück zum Zitat Allan, J., Carbonell, J. G., Doddington, G., Yamron, J., & Yang, Y. (1998). Topic detection and tracking. Pilot Study Final Report. Allan, J., Carbonell, J. G., Doddington, G., Yamron, J., & Yang, Y. (1998). Topic detection and tracking. Pilot Study Final Report.
Zurück zum Zitat Blei, D. M., & Lafferty, J. D. (2006). Dynamic topic todels. In Proceedings of the 23rd International Conference on Machine Learning (pp. 113–120). Blei, D. M., & Lafferty, J. D. (2006). Dynamic topic todels. In Proceedings of the 23rd International Conference on Machine Learning (pp. 113–120).
Zurück zum Zitat Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. The Journal of Machine Learning Research, 3, 993–1022.MATH Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. The Journal of Machine Learning Research, 3, 993–1022.MATH
Zurück zum Zitat Caballero, K. L., Barajas, J., & Akella, R. (2012). The generalized Dirichlet distribution in enhanced topic detection. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM) (pp. 773–782) Caballero, K. L., Barajas, J., & Akella, R. (2012). The generalized Dirichlet distribution in enhanced topic detection. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM) (pp. 773–782)
Zurück zum Zitat Deerwester, S. C., Dumais, S. T., Landauer, T. K., Furnas, G. W., & Harshman, R. A. (1990). Indexing by latent semantic analysis. The Journal of the American Society for Information Science, 41(6), 391–407.CrossRef Deerwester, S. C., Dumais, S. T., Landauer, T. K., Furnas, G. W., & Harshman, R. A. (1990). Indexing by latent semantic analysis. The Journal of the American Society for Information Science, 41(6), 391–407.CrossRef
Zurück zum Zitat Gansner, E., Hu, Y., & North, S. (2012). Visualizing streaming text data with dynamic maps. In Proceedings of the 20th International Conference on Graph Drawing (pp. 439–450). Gansner, E., Hu, Y., & North, S. (2012). Visualizing streaming text data with dynamic maps. In Proceedings of the 20th International Conference on Graph Drawing (pp. 439–450).
Zurück zum Zitat Hoffman, M., Bach, F. R., & Blei, D. M. (2010). Online learning for latent Dirichlet allocation. In Advances in Neural Information Processing Systems (pp. 856–864). Hoffman, M., Bach, F. R., & Blei, D. M. (2010). Online learning for latent Dirichlet allocation. In Advances in Neural Information Processing Systems (pp. 856–864).
Zurück zum Zitat Hoffman, T. (1999). Probabilistic latent semantic analysis. In Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence (UAI) (pp. 289–296). San Francisco, CA: Morgan Kaufmann. Hoffman, T. (1999). Probabilistic latent semantic analysis. In Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence (UAI) (pp. 289–296). San Francisco, CA: Morgan Kaufmann.
Zurück zum Zitat Jankun-Kelly, T. J., Ma, K.-L., & Gertz, M. (2007). A model and framework for visualization exploration. IEEE Transactions on Visualization and Computer Graphics, 13(2), 357–369.CrossRef Jankun-Kelly, T. J., Ma, K.-L., & Gertz, M. (2007). A model and framework for visualization exploration. IEEE Transactions on Visualization and Computer Graphics, 13(2), 357–369.CrossRef
Zurück zum Zitat Jo, Y., Hopcroft, J. E., & Lagoze, C. (2011). The web of topics: Discovering the topology of topic evolution in a corpus. In Proceedings of the 20th International Conference on World Wide Web (W3C) (pp. 257–266). Jo, Y., Hopcroft, J. E., & Lagoze, C. (2011). The web of topics: Discovering the topology of topic evolution in a corpus. In Proceedings of the 20th International Conference on World Wide Web (W3C) (pp. 257–266).
Zurück zum Zitat Jones, K. S. (1972). A statistical interpretation of term specificity and its application in retrieval. The Journal of Documentation, 28(1), 11–21.CrossRef Jones, K. S. (1972). A statistical interpretation of term specificity and its application in retrieval. The Journal of Documentation, 28(1), 11–21.CrossRef
Zurück zum Zitat Kasiviswanathan, S. P., Melville, P., Banerjee, A., & Sindhwani, V. (2011). Emerging topic detection using dictionary learning. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management (ICKM) (pp. 745–754). Kasiviswanathan, S. P., Melville, P., Banerjee, A., & Sindhwani, V. (2011). Emerging topic detection using dictionary learning. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management (ICKM) (pp. 745–754).
Zurück zum Zitat Keim, D. A., Mansmann, F., & Thomas, J. (2010). Visual analytics: How much visualization and how much analytics? ACM SIGKDD Explorations Newsletter, 11(2), 5–8.CrossRef Keim, D. A., Mansmann, F., & Thomas, J. (2010). Visual analytics: How much visualization and how much analytics? ACM SIGKDD Explorations Newsletter, 11(2), 5–8.CrossRef
Zurück zum Zitat Kleinberg, J. (2002). Bursty and hierarchical structure in streams. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (pp. 91–101). Kleinberg, J. (2002). Bursty and hierarchical structure in streams. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (pp. 91–101).
Zurück zum Zitat Kwak, H., Lee, C., Park, H., & Moon, S. (2010). What is twitter, a social network or a news media? In Proceedings of the 19th International Conference on World Wide Web (W3C) (pp. 591–600). Kwak, H., Lee, C., Park, H., & Moon, S. (2010). What is twitter, a social network or a news media? In Proceedings of the 19th International Conference on World Wide Web (W3C) (pp. 591–600).
Zurück zum Zitat Leskovec, J., Backstorm, L., & Kleinberg, J. (2009). Meme-tracking and the dynamics of the news cycle. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (pp. 497–506). Leskovec, J., Backstorm, L., & Kleinberg, J. (2009). Meme-tracking and the dynamics of the news cycle. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (pp. 497–506).
Zurück zum Zitat Marcus, A., Bernstein, M. S., Badar, O., Karger, D. R., Madden, S., & Miller, R. C. (2011). Twitinfo: Aggregating and visualizing microblogs for event exploration. In Proceedings of the 2011 Annual Conference on Human Factors in Computing Systems (CHI) (pp. 227–236). Marcus, A., Bernstein, M. S., Badar, O., Karger, D. R., Madden, S., & Miller, R. C. (2011). Twitinfo: Aggregating and visualizing microblogs for event exploration. In Proceedings of the 2011 Annual Conference on Human Factors in Computing Systems (CHI) (pp. 227–236).
Zurück zum Zitat Mei, Q., & Zhai, C. (2005). Discovering evolutionary theme patterns from text: An exploration of temporal text mining. In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining (KDD) (pp. 198–207). Mei, Q., & Zhai, C. (2005). Discovering evolutionary theme patterns from text: An exploration of temporal text mining. In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining (KDD) (pp. 198–207).
Zurück zum Zitat North, C., Rhyne, TM., & Duca, K. (2005). Bioinformatics visualization: Introduction to the special issue. Information Visualization, 4(3), 147–148.CrossRef North, C., Rhyne, TM., & Duca, K. (2005). Bioinformatics visualization: Introduction to the special issue. Information Visualization, 4(3), 147–148.CrossRef
Zurück zum Zitat Robertson, S. E., Walker, S., Beaulieu, M., & Willet, P. (1999). Okapi at Trec-7: Automatic ad hoc, filtering, VLC and interactive track (pp. 253–264). Gaithersburg: Nist Special Publication SP. Robertson, S. E., Walker, S., Beaulieu, M., & Willet, P. (1999). Okapi at Trec-7: Automatic ad hoc, filtering, VLC and interactive track (pp. 253–264). Gaithersburg: Nist Special Publication SP.
Metadaten
Titel
Visual Analysis of Topics in Twitter Based on Co-evolution of Terms
verfasst von
Lambert Pépin
Julien Blanchard
Fabrice Guillet
Pascale Kuntz
Philippe Suignard
Copyright-Jahr
2015
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-44983-7_15

Premium Partner