
2019 | OriginalPaper | Book chapter

Integrating Topic Model and Heterogeneous Information Network for Aspect Mining with Rating Bias

Written by: Yugang Ji, Chuan Shi, Fuzhen Zhuang, Philip S. Yu

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer International Publishing


Abstract

Recently, there has been a surge of research on aspect mining, where the goal is to predict aspect ratings of shops from reviews and overall ratings. Traditional methods assume that all aspect ratings in a given review are at the same level, equal to the corresponding overall rating. However, recent research reveals a different phenomenon: there is an obvious rating bias between aspect ratings and overall ratings. Moreover, these methods usually analyze aspect ratings of reviews with topic models at the textual level, while ignoring the structural information among multiple entities (users, shops, reviews), which can be captured by a Heterogeneous Information Network (HIN). In this paper, we present a novel model integrating Topic model and HIN for Aspect Mining with rating bias (called THAM). First, a phrase-level LDA model is designed to extract topic distributions of reviews from textual information. Second, to make full use of structural information, we construct a topic propagation network and propagate topic distributions in this heterogeneous network. Finally, by setting the review as the shared factor, the two parts are integrated into a unified optimization framework. Experimental results on two real datasets demonstrate that THAM achieves significant performance improvement compared to the state of the art.
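The abstract does not give the propagation rule, so the following is only a minimal sketch of what propagating topic distributions over a user-shop-review network could look like; it is not the THAM model. It assumes each review already has a topic distribution from some text model, that user and shop distributions are plain averages of their reviews, and that a hypothetical mixing weight `alpha` blends each review's text-based distribution with its neighbors' average. All function and parameter names are illustrative.

```python
from collections import defaultdict


def normalize(vec):
    """Scale a non-negative vector so it sums to 1 (uniform if all zeros)."""
    total = sum(vec)
    if total == 0:
        return [1.0 / len(vec)] * len(vec)
    return [v / total for v in vec]


def mean_distribution(dists, num_topics):
    """Average a list of topic distributions into one distribution."""
    sums = [0.0] * num_topics
    for dist in dists:
        for k, value in enumerate(dist):
            sums[k] += value
    return normalize(sums)


def propagate_topics(review_topics, review_user, review_shop,
                     alpha=0.5, iterations=10):
    """Smooth review topic distributions over a user-shop-review graph.

    review_topics: review id -> topic distribution from a text model
    review_user / review_shop: review id -> id of its author / its shop
    alpha: assumed weight kept on the original text-based distribution
    """
    num_topics = len(next(iter(review_topics.values())))
    initial = {r: normalize(t) for r, t in review_topics.items()}
    current = dict(initial)
    user_topics, shop_topics = {}, {}
    for _ in range(iterations):
        # Reviews push their distributions to the linked user and shop.
        by_user, by_shop = defaultdict(list), defaultdict(list)
        for r, dist in current.items():
            by_user[review_user[r]].append(dist)
            by_shop[review_shop[r]].append(dist)
        user_topics = {u: mean_distribution(d, num_topics)
                       for u, d in by_user.items()}
        shop_topics = {s: mean_distribution(d, num_topics)
                       for s, d in by_shop.items()}
        # Users and shops push the averaged distributions back to reviews.
        updated = {}
        for r in current:
            u_dist = user_topics[review_user[r]]
            s_dist = shop_topics[review_shop[r]]
            neighbor = [(a + b) / 2 for a, b in zip(u_dist, s_dist)]
            updated[r] = normalize(
                [alpha * t + (1 - alpha) * n
                 for t, n in zip(initial[r], neighbor)])
        current = updated
    return current, user_topics, shop_topics


# Toy example: three reviews, two users, two shops, two topics.
review_topics = {"r1": [1.0, 0.0], "r2": [0.0, 1.0], "r3": [0.5, 0.5]}
review_user = {"r1": "u1", "r2": "u1", "r3": "u2"}
review_shop = {"r1": "s1", "r2": "s2", "r3": "s1"}
smoothed, user_topics, shop_topics = propagate_topics(
    review_topics, review_user, review_shop)
```

In this toy setup, review `r1` starts with all its mass on the first topic and picks up some mass on the second topic from the other reviews by the same user and for the same shop. Setting `alpha=1.0` turns the propagation off and returns the text-based distributions unchanged.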


Metadata
Title
Integrating Topic Model and Heterogeneous Information Network for Aspect Mining with Rating Bias
Written by
Yugang Ji
Chuan Shi
Fuzhen Zhuang
Philip S. Yu
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-16148-4_13