Skip to main content
Top

2016 | OriginalPaper | Chapter

Topic Tracking in News Streams Using Latent Factor Models

Authors : Jens Meiners, Andreas Lommatzsch

Published in: Innovations for Community Services

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The increasing number of published news articles and messages in social media make it hard for users to find the relevant information and to track interesting topics. Relevant news is hidden in a haystack of irrelevant data. Text-mining techniques have been developed to extract implicit, hidden information. These techniques analyze big datasets and compute “latent” features based on implicit correlations between documents and events. In this paper we develop a system that applies latent factor models on data streams. Our method allows us detecting the dominant topics and tracking the changes in the relevant topics. In addition, we explain how the extracted knowledge is used for computing recommendations based on trending topics and terms. We evaluate our system on a stream of news messages published on the micro-blogging service Twitter. The evaluation shows that our system efficiently extracts topics and provides valuable insights into the continuously changing news stream helping users quickly identifying the most relevant information as well as current trends.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Associated Press: A New Model for News: Studying the Deep Structure of Young-Adult News Consumption, July 2008 Associated Press: A New Model for News: Studying the Deep Structure of Young-Adult News Consumption, July 2008
2.
go back to reference Nordenson, B.: Overload! Columbia J. Rev. 30 (2008) Nordenson, B.: Overload! Columbia J. Rev. 30 (2008)
3.
go back to reference Sprenger, T.O., et al.: Essays on the information content of microblogs and their use as an indicator of real-world events. Dissertation, Technische Universität München, München (2011) Sprenger, T.O., et al.: Essays on the information content of microblogs and their use as an indicator of real-world events. Dissertation, Technische Universität München, München (2011)
4.
go back to reference Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the 19th International Conference on WWW 2010, pp. 591–600. ACM, New York (2010) Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the 19th International Conference on WWW 2010, pp. 591–600. ACM, New York (2010)
5.
go back to reference De Francisci Morales, G., Gionis, A., Lucchese, C.: From chatter to headlines: harnessing the real-time web for personalized news recommendation. In: Proceedings of the 5th ACM International Conference on Web Search and Data Mining, pp. 153–162. ACM (2012) De Francisci Morales, G., Gionis, A., Lucchese, C.: From chatter to headlines: harnessing the real-time web for personalized news recommendation. In: Proceedings of the 5th ACM International Conference on Web Search and Data Mining, pp. 153–162. ACM (2012)
6.
go back to reference Manning, C.D., Raghavan, P., Schütze, H., et al.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)CrossRefMATH Manning, C.D., Raghavan, P., Schütze, H., et al.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)CrossRefMATH
7.
go back to reference Perkio, J., Buntine, W., Perttu, S.: Exploring independent trends in a topic-based search engine. In: Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, pp. 664–668. IEEE Computer Society (2004) Perkio, J., Buntine, W., Perttu, S.: Exploring independent trends in a topic-based search engine. In: Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, pp. 664–668. IEEE Computer Society (2004)
8.
go back to reference Srivastava, A.N., Sahami, M.: Text Mining: Classification, Clustering, and Applications, pp. 121–180. CRC Press (2009) Srivastava, A.N., Sahami, M.: Text Mining: Classification, Clustering, and Applications, pp. 121–180. CRC Press (2009)
9.
go back to reference Mei, Q., Zhai, C.: Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: Proceedings of the 11th International Conference on Knowledge Discovery in Data Mining, pp. 198–207. ACM (2005) Mei, Q., Zhai, C.: Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: Proceedings of the 11th International Conference on Knowledge Discovery in Data Mining, pp. 198–207. ACM (2005)
10.
go back to reference Xu, W., Liu, X., Gong, Y.: Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th International ACM SIGIR Conference on Research and Development in IR, pp. 267–273. ACM (2003) Xu, W., Liu, X., Gong, Y.: Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th International ACM SIGIR Conference on Research and Development in IR, pp. 267–273. ACM (2003)
11.
go back to reference Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. JAsIs 41(6), 391–407 (1990)CrossRef Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. JAsIs 41(6), 391–407 (1990)CrossRef
12.
go back to reference Cao, B., Shen, D., Sun, J.-T., Wang, X., Yang, Q., Chen, Z.: Detect and track latent factors with online nonnegative matrix factorization. In: IJCAI, vol. 7, pp. 2689–2694 (2007) Cao, B., Shen, D., Sun, J.-T., Wang, X., Yang, Q., Chen, Z.: Detect and track latent factors with online nonnegative matrix factorization. In: IJCAI, vol. 7, pp. 2689–2694 (2007)
13.
go back to reference Vaca, C.K., Mantrach, A., Jaimes, A., Saerens, M.: A time-based collective factorization for topic discovery, monitoring in news. In Proceedings of the 23rd International Conference on World Wide Web, pp. 527–538. ACM, New York (2014) Vaca, C.K., Mantrach, A., Jaimes, A., Saerens, M.: A time-based collective factorization for topic discovery, monitoring in news. In Proceedings of the 23rd International Conference on World Wide Web, pp. 527–538. ACM, New York (2014)
14.
go back to reference Brand, M.: Incremental singular value decomposition of uncertain data with missing values. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 707–720. Springer, Heidelberg (2002). doi:10.1007/3-540-47969-4_47 CrossRef Brand, M.: Incremental singular value decomposition of uncertain data with missing values. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 707–720. Springer, Heidelberg (2002). doi:10.​1007/​3-540-47969-4_​47 CrossRef
15.
go back to reference Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet Allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet Allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
16.
go back to reference AlSumait, L., Barbará, D., Domeniconi, C.: On-line LDA: adaptive topic models for mining text streams with applications to topic detection and tracking. In: Proceedings of the 8th IEEE International Conference on Data Mining, ICDM 2008, pp. 3–12. IEEE (2008) AlSumait, L., Barbará, D., Domeniconi, C.: On-line LDA: adaptive topic models for mining text streams with applications to topic detection and tracking. In: Proceedings of the 8th IEEE International Conference on Data Mining, ICDM 2008, pp. 3–12. IEEE (2008)
17.
go back to reference Wang, Y., Agichtein, E., Benzi, M.: TM-LDA: efficient online modeling of latent topic transitions in social media. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 123–131. ACM (2012) Wang, Y., Agichtein, E., Benzi, M.: TM-LDA: efficient online modeling of latent topic transitions in social media. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 123–131. ACM (2012)
18.
go back to reference Balabanović, M., Shoham, Y.: Fab: content-based, collaborative recommendation. Commun. ACM 40(3), 66–72 (1997)CrossRef Balabanović, M., Shoham, Y.: Fab: content-based, collaborative recommendation. Commun. ACM 40(3), 66–72 (1997)CrossRef
19.
go back to reference Liu, J., Dolan, P., Pedersen, E.R.: Personalized news recommendation based on click behavior. In: Proceedings of the 15th International Conference on Intelligent User Interfaces, pp. 31–40. ACM, New York (2010) Liu, J., Dolan, P., Pedersen, E.R.: Personalized news recommendation based on click behavior. In: Proceedings of the 15th International Conference on Intelligent User Interfaces, pp. 31–40. ACM, New York (2010)
20.
go back to reference Li, L., Li, T.: News recommendation via hypergraph learning: encapsulation of user behavior and news content. In: Proceedings of the 6th ACM International Conference on Web Search and Data Mining, pp. 305–314 (2013) Li, L., Li, T.: News recommendation via hypergraph learning: encapsulation of user behavior and news content. In: Proceedings of the 6th ACM International Conference on Web Search and Data Mining, pp. 305–314 (2013)
21.
go back to reference Saha, A., Sindhwani, V.: Learning evolving, emerging topics in social media: a dynamic NMF approach with temporal regularization. In: Proceedings of the 5th ACM International Conference on Web Search and Data Mining, pp. 693–702. ACM (2012) Saha, A., Sindhwani, V.: Learning evolving, emerging topics in social media: a dynamic NMF approach with temporal regularization. In: Proceedings of the 5th ACM International Conference on Web Search and Data Mining, pp. 693–702. ACM (2012)
22.
go back to reference Hennig, L., Ploch, D., Prawdzik, D., Armbruster, B., Düwiger, H., De Luca, E.W., Albayrak, S.: SPIGA - multilingual news aggregator. In: Proceedings of GSCL 2011 (2011) Hennig, L., Ploch, D., Prawdzik, D., Armbruster, B., Düwiger, H., De Luca, E.W., Albayrak, S.: SPIGA - multilingual news aggregator. In: Proceedings of GSCL 2011 (2011)
Metadata
Title
Topic Tracking in News Streams Using Latent Factor Models
Authors
Jens Meiners
Andreas Lommatzsch
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-49466-1_12

Premium Partner