Skip to main content
Top

2017 | OriginalPaper | Chapter

DFDS: A Domain-Independent Framework for Document-Level Sentiment Analysis Based on RST

Authors : Zhenyu Zhao, Guozheng Rao, Zhiyong Feng

Published in: Web and Big Data

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Document-level sentiment analysis is among the most popular research fields of nature language processing in recent years, in which one of major challenges is that discourse structural information can be hardly captured by existing approaches. In this paper, a domain-independent framework for document-level sentiment classification with weighting rules based on Rhetorical Structure Theory is proposed. First, original textual documents are parsed into rhetorical structure trees through a preprocessing pipeline. Next, the sentiment score of elementary discourse units is computed via sentence-level sentiment classification method. Finally, according to the rhetorical relation between neighbor discourse units, we define weighting schema and composing rules based on which scores of elementary discourse units are summed recursively to the whole document. Experiment results show that our approach has better performance on datasets in different domains, compared with state-of-art document-level sentiment analysis systems based on RST, and the best result is 15% higher than baseline.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference O’Connor, B., Balasubramanyan, R., Routledge, B.R., et al.: From tweets to polls: linking text sentiment to public opinion time series. ICWSM 11, 122–129 (2010) O’Connor, B., Balasubramanyan, R., Routledge, B.R., et al.: From tweets to polls: linking text sentiment to public opinion time series. ICWSM 11, 122–129 (2010)
2.
go back to reference Musto, C., Semeraro, G., Lops, P., et al.: CrowdPulse: a framework for real-time semantic analysis of social streams. Inf. Syst. 54, 127–146 (2015)CrossRef Musto, C., Semeraro, G., Lops, P., et al.: CrowdPulse: a framework for real-time semantic analysis of social streams. Inf. Syst. 54, 127–146 (2015)CrossRef
3.
go back to reference Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)CrossRef Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)CrossRef
4.
go back to reference Smailović, J., Grčar, M., Lavrač, N., et al.: Stream-based active learning for sentiment analysis in the financial domain. Inf. Sci. 285(1), 181–203 (2014)CrossRef Smailović, J., Grčar, M., Lavrač, N., et al.: Stream-based active learning for sentiment analysis in the financial domain. Inf. Sci. 285(1), 181–203 (2014)CrossRef
5.
go back to reference Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In: Aggarwal C., Zhai, C. (eds.) Mining Text Data, pp. 415–463. Springer, US (2012) Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In: Aggarwal C., Zhai, C. (eds.) Mining Text Data, pp. 415–463. Springer, US (2012)
6.
go back to reference Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1-2), 1–135 (2008) Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1-2), 1–135 (2008)
7.
go back to reference Bhatia, P., Ji, Y., Eisenstein, J.: Better document-level sentiment analysis from RST discourse parsing. arXiv preprint arXiv:1509.01599 (2015) Bhatia, P., Ji, Y., Eisenstein, J.: Better document-level sentiment analysis from RST discourse parsing. arXiv preprint arXiv:​1509.​01599 (2015)
8.
go back to reference Mann, W.C., Thompson, S.A.: Rhetorical structure theory: description and construction of text structures. In: Kempen, G. (ed.) Natural Language Generation, pp. 85–95. Springer, Netherlands (1987) Mann, W.C., Thompson, S.A.: Rhetorical structure theory: description and construction of text structures. In: Kempen, G. (ed.) Natural Language Generation, pp. 85–95. Springer, Netherlands (1987)
9.
go back to reference Ji, Y., Eisenstein, J.: Representation learning for text-level discourse parsing. In: Meeting of the Association for Computational Linguistics, pp. 13–24, USA (2014) Ji, Y., Eisenstein, J.: Representation learning for text-level discourse parsing. In: Meeting of the Association for Computational Linguistics, pp. 13–24, USA (2014)
10.
go back to reference Corston-Oliver, S.H.: Beyond string matching and cue phrases: improving efficiency and coverage in discourse analysis. In: The AAAI Spring Symposium on Intelligent Text Summarization, pp. 9–15 (1970) Corston-Oliver, S.H.: Beyond string matching and cue phrases: improving efficiency and coverage in discourse analysis. In: The AAAI Spring Symposium on Intelligent Text Summarization, pp. 9–15 (1970)
11.
go back to reference Soricut, R., Marcu, D.: Sentence level discourse parsing using syntactic and lexical information. In: Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 149–156. Association for Computational Linguistics (2004) Soricut, R., Marcu, D.: Sentence level discourse parsing using syntactic and lexical information. In: Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 149–156. Association for Computational Linguistics (2004)
12.
go back to reference Feng, V.W., Hirst, G.: Text-level discourse parsing with rich linguistic features. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1, pp. 60–68. Association for Computational Linguistics (2012) Feng, V.W., Hirst, G.: Text-level discourse parsing with rich linguistic features. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1, pp. 60–68. Association for Computational Linguistics (2012)
13.
go back to reference Li, S., Wang, L., Cao, Z., et al.: Text-level discourse dependency parsing. Meet. Assoc. Comput. Linguist. 1, 25–35 (2014) Li, S., Wang, L., Cao, Z., et al.: Text-level discourse dependency parsing. Meet. Assoc. Comput. Linguist. 1, 25–35 (2014)
14.
go back to reference Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Meeting on Association for Computational Linguistics, p. 271. Association for Computational Linguistics (2004) Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Meeting on Association for Computational Linguistics, p. 271. Association for Computational Linguistics (2004)
15.
go back to reference Sharma, A., Dey, S.: A document-level sentiment analysis approach using artificial neural network and sentiment lexicons. ACM SIGAPP Appl. Comput. Rev. 12(4), 67–75 (2012)CrossRef Sharma, A., Dey, S.: A document-level sentiment analysis approach using artificial neural network and sentiment lexicons. ACM SIGAPP Appl. Comput. Rev. 12(4), 67–75 (2012)CrossRef
16.
go back to reference Tang, D., Qin, B., Liu, T.: Document modeling with gated recurrent neural network for sentiment classification. In: Conference on Empirical Methods in Natural Language Processing, pp. 1422–1432, Portugal (2015) Tang, D., Qin, B., Liu, T.: Document modeling with gated recurrent neural network for sentiment classification. In: Conference on Empirical Methods in Natural Language Processing, pp. 1422–1432, Portugal (2015)
17.
go back to reference Xu, J., Chen, D., Qiu, X., et al.: Cached Long Short-Term Memory Neural Networks for Document-Level Sentiment Classification. arXiv preprint arXiv:1610.04989 (2016) Xu, J., Chen, D., Qiu, X., et al.: Cached Long Short-Term Memory Neural Networks for Document-Level Sentiment Classification. arXiv preprint arXiv:​1610.​04989 (2016)
18.
go back to reference Voll, K., Taboada, M.: Not all words are created equal: extracting semantic orientation as a function of adjective relevance. In: Orgun, M.A., Thornton, J. (eds.) AI 2007. LNCS, vol. 4830, pp. 337–346. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76928-6_35 CrossRef Voll, K., Taboada, M.: Not all words are created equal: extracting semantic orientation as a function of adjective relevance. In: Orgun, M.A., Thornton, J. (eds.) AI 2007. LNCS, vol. 4830, pp. 337–346. Springer, Heidelberg (2007). doi:10.​1007/​978-3-540-76928-6_​35 CrossRef
19.
go back to reference Heerschop, B., Goossen, F., Hogenboom, A., et al.: Polarity analysis of texts using discourse structure. In: ACM Conference on Information and Knowledge Management. DBLP, pp. 1061–1070, Glasgow, United Kingdom (2011) Heerschop, B., Goossen, F., Hogenboom, A., et al.: Polarity analysis of texts using discourse structure. In: ACM Conference on Information and Knowledge Management. DBLP, pp. 1061–1070, Glasgow, United Kingdom (2011)
20.
go back to reference Wang, F., Wu, Y., Qiu, L.: Exploiting discourse relations for sentiment analysis. In: COLING: Posters, pp. 1311–1320 (2012) Wang, F., Wu, Y., Qiu, L.: Exploiting discourse relations for sentiment analysis. In: COLING: Posters, pp. 1311–1320 (2012)
21.
go back to reference Li, J., Zhou, Y., Liu, C., et al.: Sentiment classification of Chinese contrast sentences. In: Zong, C., Nie, JY., Zhao, D., Feng, Y. (eds.) Natural Language Processing and Chinese Computing, vol. 496, pp. 205–216. Springer, Heidelberg (2014) Li, J., Zhou, Y., Liu, C., et al.: Sentiment classification of Chinese contrast sentences. In: Zong, C., Nie, JY., Zhao, D., Feng, Y. (eds.) Natural Language Processing and Chinese Computing, vol. 496, pp. 205–216. Springer, Heidelberg (2014)
22.
go back to reference Hogenboom, A., Frasincar, F., De Jong, F., et al.: Using rhetorical structure in sentiment analysis. Commun. ACM 58(7), 69–77 (2015)CrossRef Hogenboom, A., Frasincar, F., De Jong, F., et al.: Using rhetorical structure in sentiment analysis. Commun. ACM 58(7), 69–77 (2015)CrossRef
Metadata
Title
DFDS: A Domain-Independent Framework for Document-Level Sentiment Analysis Based on RST
Authors
Zhenyu Zhao
Guozheng Rao
Zhiyong Feng
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-63579-8_23

Premium Partner