Skip to main content
Top

2017 | OriginalPaper | Chapter

Mining Worse and Better Opinions

Unsupervised and Agnostic Aggregation of Online Reviews

Authors : Michela Fazzolari, Marinella Petrocchi, Alessandro Tommasi, Cesare Zavattari

Published in: Web Engineering

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we propose a novel approach for aggregating online reviews, according to the opinions they express. Our methodology is unsupervised, due to the fact that it does not rely on pre-labeled reviews, and it is agnostic, since it does not make any assumption about the domain or the language of the review content. We measure the adherence of a review content to the domain terminology extracted from a review set. First, we demonstrate the informativeness of the adherence metric with respect to the score associated with a review. Then, we exploit the metric values to group reviews, according to the opinions they express. Our experimental campaign has been carried out on two large datasets collected from Booking and Amazon, respectively.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Contiguous sequence of n words: “president of the USA” is a 4-gram.
 
Literature
1.
go back to reference Basili, R., et al.: A contrastive approach to term extraction. In: Terminologie et intelligence artificielle, pp. 119–128. Rencontres (2001) Basili, R., et al.: A contrastive approach to term extraction. In: Terminologie et intelligence artificielle, pp. 119–128. Rencontres (2001)
2.
go back to reference Bonin, F., et al.: A contrastive approach to multi-word extraction from domain-specific corpora. In: Language Resources and Evaluation, ELRA (2010) Bonin, F., et al.: A contrastive approach to multi-word extraction from domain-specific corpora. In: Language Resources and Evaluation, ELRA (2010)
3.
go back to reference Bravo-Marquez, F., et al.: Building a twitter opinion lexicon from automatically-annotated tweets. Knowl.-Based Syst. 108, 65–78 (2016)CrossRef Bravo-Marquez, F., et al.: Building a twitter opinion lexicon from automatically-annotated tweets. Knowl.-Based Syst. 108, 65–78 (2016)CrossRef
4.
go back to reference Cambria, E., Hussain, A.: Sentic Computing: A Common-Sense-Based Framework for Concept-Level Sentiment Analysis. Springer, Heidelberg (2015)CrossRef Cambria, E., Hussain, A.: Sentic Computing: A Common-Sense-Based Framework for Concept-Level Sentiment Analysis. Springer, Heidelberg (2015)CrossRef
5.
go back to reference Cambria, E., et al.: SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: 28th AAAI, pp. 1515–1521 (2014) Cambria, E., et al.: SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: 28th AAAI, pp. 1515–1521 (2014)
6.
go back to reference Chung, T.M., Nation, P.: Identifying technical vocabulary. System 32(2), 251–263 (2004)CrossRef Chung, T.M., Nation, P.: Identifying technical vocabulary. System 32(2), 251–263 (2004)CrossRef
7.
go back to reference Del Vigna, F., Petrocchi, M., Tommasi, A., Zavattari, C., Tesconi, M.: Semi-supervised knowledge extraction for detection of drugs and their effects. In: Social Informatics I (2016) Del Vigna, F., Petrocchi, M., Tommasi, A., Zavattari, C., Tesconi, M.: Semi-supervised knowledge extraction for detection of drugs and their effects. In: Social Informatics I (2016)
8.
go back to reference Esuli, A., Sebastiani, F.: SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Language Resources and Evaluation, pp. 417–422 (2006) Esuli, A., Sebastiani, F.: SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Language Resources and Evaluation, pp. 417–422 (2006)
9.
go back to reference Li, G., Liu, F.: Application of a clustering method on sentiment analysis. J. Inf. Sci. 38(2), 127–139 (2012)CrossRef Li, G., Liu, F.: Application of a clustering method on sentiment analysis. J. Inf. Sci. 38(2), 127–139 (2012)CrossRef
10.
go back to reference Ling Lo, S., et al.: A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection. Knowl.-Based Syst. 105, 236–247 (2016)CrossRef Ling Lo, S., et al.: A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection. Knowl.-Based Syst. 105, 236–247 (2016)CrossRef
11.
go back to reference Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael (2012) Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael (2012)
12.
go back to reference Ma, B., Yuan, H., Wu, Y.: Exploring performance of clustering methods on document sentiment analysis. Inf. Sci. 43, 54–74 (2015)CrossRef Ma, B., Yuan, H., Wu, Y.: Exploring performance of clustering methods on document sentiment analysis. Inf. Sci. 43, 54–74 (2015)CrossRef
13.
go back to reference McAuley, J., Pandey, R., Leskovec, J.: Inferring networks of substitutable and complementary products. In: 21th KDD, pp. 785–794. ACM (2015) McAuley, J., Pandey, R., Leskovec, J.: Inferring networks of substitutable and complementary products. In: 21th KDD, pp. 785–794. ACM (2015)
14.
go back to reference McAuley, J., et al.: Image-based recommendations on styles and substitutes. In: 38th Research and Development in Information Retrieval, pp. 43–52. ACM (2015) McAuley, J., et al.: Image-based recommendations on styles and substitutes. In: 38th Research and Development in Information Retrieval, pp. 43–52. ACM (2015)
15.
go back to reference Mellinas, J.P., María-Dolores, S.M.M., García, J.J.B.: Booking.com: the unexpected scoring system. Tourism Manage. 49, 72–74 (2015)CrossRef Mellinas, J.P., María-Dolores, S.M.M., García, J.J.B.: Booking.com: the unexpected scoring system. Tourism Manage. 49, 72–74 (2015)CrossRef
16.
go back to reference Muhammad, A., Wiratunga, N., Lothian, R.: Contextual sentiment analysis for social media genres. Knowl.-Based Syst. 108, 92–101 (2016)CrossRef Muhammad, A., Wiratunga, N., Lothian, R.: Contextual sentiment analysis for social media genres. Knowl.-Based Syst. 108, 92–101 (2016)CrossRef
17.
go back to reference Nagamma, P., et al.: An improved sentiment analysis of online movie reviews based on clustering for box-office prediction. In: Computing, Communication and Automation, pp. 933–937 (2015) Nagamma, P., et al.: An improved sentiment analysis of online movie reviews based on clustering for box-office prediction. In: Computing, Communication and Automation, pp. 933–937 (2015)
18.
go back to reference Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)CrossRef Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)CrossRef
19.
go back to reference Pazienza, M.T., Zanzotto, F.M.: Terminology extraction: an analysis of linguistic and statistical approaches. In: Sirmakessis, S. (ed.) Knowledge Mining. Studies in Fuzziness and Soft Computing, vol. 185, pp. 255–279. Springer, Heidelberg (2005). doi:10.1007/3-540-32394-5_20 CrossRef Pazienza, M.T., Zanzotto, F.M.: Terminology extraction: an analysis of linguistic and statistical approaches. In: Sirmakessis, S. (ed.) Knowledge Mining. Studies in Fuzziness and Soft Computing, vol. 185, pp. 255–279. Springer, Heidelberg (2005). doi:10.​1007/​3-540-32394-5_​20 CrossRef
20.
go back to reference Peñas, A., Verdejo, F., Gonzalo, J.: Corpus-based terminology extraction applied to information access. Corpus Linguist. 13, 458–465 (2001) Peñas, A., Verdejo, F., Gonzalo, J.: Corpus-based terminology extraction applied to information access. Corpus Linguist. 13, 458–465 (2001)
21.
go back to reference Ren, Y., Zhang, Y., Zhang, M., Ji, D.: Context-sensitive Twitter sentiment classification using neural network. In: Artificial Intelligence, pp. 215–221. AAAI (2016) Ren, Y., Zhang, Y., Zhang, M., Ji, D.: Context-sensitive Twitter sentiment classification using neural network. In: Artificial Intelligence, pp. 215–221. AAAI (2016)
23.
go back to reference Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef
24.
go back to reference Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Computational Linguistics Meeting, pp. 417–424. ACL (2002) Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Computational Linguistics Meeting, pp. 417–424. ACL (2002)
25.
go back to reference Wilson, T., et al.: OpinionFinder: a system for subjectivity analysis. In: HLT/EMNLP on Interactive Demonstrations, pp. 34–35. ACL (2005) Wilson, T., et al.: OpinionFinder: a system for subjectivity analysis. In: HLT/EMNLP on Interactive Demonstrations, pp. 34–35. ACL (2005)
26.
go back to reference Wilson, T., et al.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT/EMNLP, pp. 347–354. ACL (2005) Wilson, T., et al.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT/EMNLP, pp. 347–354. ACL (2005)
27.
go back to reference Wilson, T., et al.: Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Comput. Linguist. 35(3), 399–433 (2009)CrossRef Wilson, T., et al.: Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Comput. Linguist. 35(3), 399–433 (2009)CrossRef
Metadata
Title
Mining Worse and Better Opinions
Authors
Michela Fazzolari
Marinella Petrocchi
Alessandro Tommasi
Cesare Zavattari
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-60131-1_35

Premium Partner