Skip to main content
Top

2017 | OriginalPaper | Chapter

Keeping Priors in Streaming Bayesian Learning

Authors : Anh Nguyen Duc, Ngo Van Linh, Anh Nguyen Kim, Khoat Than

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Exploiting prior knowledge in the Bayesian learning process is one way to improve the quality of Bayesian model. To the best of our knowledge, however, there is no formal research about the influence of prior in streaming environment. In this paper, we address the problem of using prior knowledge in streaming Bayesian learning, and develop a framework for keeping priors in streaming learning (KPS) that maintains knowledge from the prior through each minibatch of streaming data. We demonstrate the performance of our framework in two scenarios: streaming learning for latent Dirichlet allocation and streaming text classification in comparison with methods that do not keep prior.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ahmed, A., Xing, E.P.: Staying informed: supervised and semi-supervised multi-view topical analysis of ideological perspective. In: Empirical Methods in Natural Language Processing, pp. 1140–1150 (2010) Ahmed, A., Xing, E.P.: Staying informed: supervised and semi-supervised multi-view topical analysis of ideological perspective. In: Empirical Methods in Natural Language Processing, pp. 1140–1150 (2010)
2.
go back to reference Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
3.
go back to reference Broderick, T., Boyd, N., Wibisono, A., Wilson, A.C., Jordan, M.I.: Streaming variational Bayes. In: Advances in Neural Information Processing Systems, pp. 1727–1735 (2013) Broderick, T., Boyd, N., Wibisono, A., Wilson, A.C., Jordan, M.I.: Streaming variational Bayes. In: Advances in Neural Information Processing Systems, pp. 1727–1735 (2013)
4.
go back to reference Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: ACM international Conference on Web Search and Data Mining, pp. 815–824 (2011) Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: ACM international Conference on Web Search and Data Mining, pp. 815–824 (2011)
5.
go back to reference Kluckhohn, C.: Human behavior and the principle of least effort. George Kingsley Zipf. Am. Anthropol. 52(2), 268–270 (1950)CrossRef Kluckhohn, C.: Human behavior and the principle of least effort. George Kingsley Zipf. Am. Anthropol. 52(2), 268–270 (1950)CrossRef
6.
go back to reference Le, V., Phung, C., Vu, C., Linh, N.V., Than, K.: Streaming sentiment-aspect analysis. In: RIVF, pp. 181–186 (2016) Le, V., Phung, C., Vu, C., Linh, N.V., Than, K.: Streaming sentiment-aspect analysis. In: RIVF, pp. 181–186 (2016)
7.
go back to reference Lin, C., He, Y., Pedrinaci, C., Domingue, J.: Feature LDA: a supervised topic model for automatic detection of web API documentations from the web. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012. LNCS, vol. 7649, pp. 328–343. Springer, Heidelberg (2012). doi:10.1007/978-3-642-35176-1_21 CrossRef Lin, C., He, Y., Pedrinaci, C., Domingue, J.: Feature LDA: a supervised topic model for automatic detection of web API documentations from the web. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012. LNCS, vol. 7649, pp. 328–343. Springer, Heidelberg (2012). doi:10.​1007/​978-3-642-35176-1_​21 CrossRef
8.
go back to reference McInerney, J., Ranganath, R., Blei, D.M.: The population posterior and Bayesian inference on streams. In: International Conference on Neural Information Processing Systems (2015) McInerney, J., Ranganath, R., Blei, D.M.: The population posterior and Bayesian inference on streams. In: International Conference on Neural Information Processing Systems (2015)
9.
go back to reference Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Empirical Methods in Natural Language Processing, pp. 262–272 (2011) Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Empirical Methods in Natural Language Processing, pp. 262–272 (2011)
10.
go back to reference Newman, M.E.J.: Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)CrossRef Newman, M.E.J.: Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)CrossRef
11.
go back to reference Piantadosi, S.T.: Zipfs word frequency law in natural language: a critical review and future directions. Psychon. Bull. Rev. 21(5), 1112–1130 (2014)CrossRef Piantadosi, S.T.: Zipfs word frequency law in natural language: a critical review and future directions. Psychon. Bull. Rev. 21(5), 1112–1130 (2014)CrossRef
12.
go back to reference Sato, I., Nakagawa, H.: Topic models with power-law using Pitman-Yor process. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 673–682 (2010) Sato, I., Nakagawa, H.: Topic models with power-law using Pitman-Yor process. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 673–682 (2010)
13.
go back to reference Van Linh, N., Anh, N.K., Than, K., Dang, C.N.: An effective and interpretable method for document classification. Knowl. Inf. Syst. 50(3), 763–793 (2016) Van Linh, N., Anh, N.K., Than, K., Dang, C.N.: An effective and interpretable method for document classification. Knowl. Inf. Syst. 50(3), 763–793 (2016)
14.
go back to reference Viet, H., Phung, D., Venkatesh, S.: Streaming variational inference for dirichlet process mixtures. In: Asian Conference on Machine Learning (2015) Viet, H., Phung, D., Venkatesh, S.: Streaming variational inference for dirichlet process mixtures. In: Asian Conference on Machine Learning (2015)
Metadata
Title
Keeping Priors in Streaming Bayesian Learning
Authors
Anh Nguyen Duc
Ngo Van Linh
Anh Nguyen Kim
Khoat Than
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-57529-2_20

Premium Partner