Skip to main content

2017 | OriginalPaper | Buchkapitel

Mining Worse and Better Opinions

Unsupervised and Agnostic Aggregation of Online Reviews

verfasst von : Michela Fazzolari, Marinella Petrocchi, Alessandro Tommasi, Cesare Zavattari

Erschienen in: Web Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we propose a novel approach for aggregating online reviews, according to the opinions they express. Our methodology is unsupervised, due to the fact that it does not rely on pre-labeled reviews, and it is agnostic, since it does not make any assumption about the domain or the language of the review content. We measure the adherence of a review content to the domain terminology extracted from a review set. First, we demonstrate the informativeness of the adherence metric with respect to the score associated with a review. Then, we exploit the metric values to group reviews, according to the opinions they express. Our experimental campaign has been carried out on two large datasets collected from Booking and Amazon, respectively.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Contiguous sequence of n words: “president of the USA” is a 4-gram.
 
Literatur
1.
Zurück zum Zitat Basili, R., et al.: A contrastive approach to term extraction. In: Terminologie et intelligence artificielle, pp. 119–128. Rencontres (2001) Basili, R., et al.: A contrastive approach to term extraction. In: Terminologie et intelligence artificielle, pp. 119–128. Rencontres (2001)
2.
Zurück zum Zitat Bonin, F., et al.: A contrastive approach to multi-word extraction from domain-specific corpora. In: Language Resources and Evaluation, ELRA (2010) Bonin, F., et al.: A contrastive approach to multi-word extraction from domain-specific corpora. In: Language Resources and Evaluation, ELRA (2010)
3.
Zurück zum Zitat Bravo-Marquez, F., et al.: Building a twitter opinion lexicon from automatically-annotated tweets. Knowl.-Based Syst. 108, 65–78 (2016)CrossRef Bravo-Marquez, F., et al.: Building a twitter opinion lexicon from automatically-annotated tweets. Knowl.-Based Syst. 108, 65–78 (2016)CrossRef
4.
Zurück zum Zitat Cambria, E., Hussain, A.: Sentic Computing: A Common-Sense-Based Framework for Concept-Level Sentiment Analysis. Springer, Heidelberg (2015)CrossRef Cambria, E., Hussain, A.: Sentic Computing: A Common-Sense-Based Framework for Concept-Level Sentiment Analysis. Springer, Heidelberg (2015)CrossRef
5.
Zurück zum Zitat Cambria, E., et al.: SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: 28th AAAI, pp. 1515–1521 (2014) Cambria, E., et al.: SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: 28th AAAI, pp. 1515–1521 (2014)
6.
Zurück zum Zitat Chung, T.M., Nation, P.: Identifying technical vocabulary. System 32(2), 251–263 (2004)CrossRef Chung, T.M., Nation, P.: Identifying technical vocabulary. System 32(2), 251–263 (2004)CrossRef
7.
Zurück zum Zitat Del Vigna, F., Petrocchi, M., Tommasi, A., Zavattari, C., Tesconi, M.: Semi-supervised knowledge extraction for detection of drugs and their effects. In: Social Informatics I (2016) Del Vigna, F., Petrocchi, M., Tommasi, A., Zavattari, C., Tesconi, M.: Semi-supervised knowledge extraction for detection of drugs and their effects. In: Social Informatics I (2016)
8.
Zurück zum Zitat Esuli, A., Sebastiani, F.: SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Language Resources and Evaluation, pp. 417–422 (2006) Esuli, A., Sebastiani, F.: SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Language Resources and Evaluation, pp. 417–422 (2006)
9.
Zurück zum Zitat Li, G., Liu, F.: Application of a clustering method on sentiment analysis. J. Inf. Sci. 38(2), 127–139 (2012)CrossRef Li, G., Liu, F.: Application of a clustering method on sentiment analysis. J. Inf. Sci. 38(2), 127–139 (2012)CrossRef
10.
Zurück zum Zitat Ling Lo, S., et al.: A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection. Knowl.-Based Syst. 105, 236–247 (2016)CrossRef Ling Lo, S., et al.: A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection. Knowl.-Based Syst. 105, 236–247 (2016)CrossRef
11.
Zurück zum Zitat Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael (2012) Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael (2012)
12.
Zurück zum Zitat Ma, B., Yuan, H., Wu, Y.: Exploring performance of clustering methods on document sentiment analysis. Inf. Sci. 43, 54–74 (2015)CrossRef Ma, B., Yuan, H., Wu, Y.: Exploring performance of clustering methods on document sentiment analysis. Inf. Sci. 43, 54–74 (2015)CrossRef
13.
Zurück zum Zitat McAuley, J., Pandey, R., Leskovec, J.: Inferring networks of substitutable and complementary products. In: 21th KDD, pp. 785–794. ACM (2015) McAuley, J., Pandey, R., Leskovec, J.: Inferring networks of substitutable and complementary products. In: 21th KDD, pp. 785–794. ACM (2015)
14.
Zurück zum Zitat McAuley, J., et al.: Image-based recommendations on styles and substitutes. In: 38th Research and Development in Information Retrieval, pp. 43–52. ACM (2015) McAuley, J., et al.: Image-based recommendations on styles and substitutes. In: 38th Research and Development in Information Retrieval, pp. 43–52. ACM (2015)
15.
Zurück zum Zitat Mellinas, J.P., María-Dolores, S.M.M., García, J.J.B.: Booking.com: the unexpected scoring system. Tourism Manage. 49, 72–74 (2015)CrossRef Mellinas, J.P., María-Dolores, S.M.M., García, J.J.B.: Booking.com: the unexpected scoring system. Tourism Manage. 49, 72–74 (2015)CrossRef
16.
Zurück zum Zitat Muhammad, A., Wiratunga, N., Lothian, R.: Contextual sentiment analysis for social media genres. Knowl.-Based Syst. 108, 92–101 (2016)CrossRef Muhammad, A., Wiratunga, N., Lothian, R.: Contextual sentiment analysis for social media genres. Knowl.-Based Syst. 108, 92–101 (2016)CrossRef
17.
Zurück zum Zitat Nagamma, P., et al.: An improved sentiment analysis of online movie reviews based on clustering for box-office prediction. In: Computing, Communication and Automation, pp. 933–937 (2015) Nagamma, P., et al.: An improved sentiment analysis of online movie reviews based on clustering for box-office prediction. In: Computing, Communication and Automation, pp. 933–937 (2015)
18.
Zurück zum Zitat Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)CrossRef Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)CrossRef
19.
Zurück zum Zitat Pazienza, M.T., Zanzotto, F.M.: Terminology extraction: an analysis of linguistic and statistical approaches. In: Sirmakessis, S. (ed.) Knowledge Mining. Studies in Fuzziness and Soft Computing, vol. 185, pp. 255–279. Springer, Heidelberg (2005). doi:10.1007/3-540-32394-5_20 CrossRef Pazienza, M.T., Zanzotto, F.M.: Terminology extraction: an analysis of linguistic and statistical approaches. In: Sirmakessis, S. (ed.) Knowledge Mining. Studies in Fuzziness and Soft Computing, vol. 185, pp. 255–279. Springer, Heidelberg (2005). doi:10.​1007/​3-540-32394-5_​20 CrossRef
20.
Zurück zum Zitat Peñas, A., Verdejo, F., Gonzalo, J.: Corpus-based terminology extraction applied to information access. Corpus Linguist. 13, 458–465 (2001) Peñas, A., Verdejo, F., Gonzalo, J.: Corpus-based terminology extraction applied to information access. Corpus Linguist. 13, 458–465 (2001)
21.
Zurück zum Zitat Ren, Y., Zhang, Y., Zhang, M., Ji, D.: Context-sensitive Twitter sentiment classification using neural network. In: Artificial Intelligence, pp. 215–221. AAAI (2016) Ren, Y., Zhang, Y., Zhang, M., Ji, D.: Context-sensitive Twitter sentiment classification using neural network. In: Artificial Intelligence, pp. 215–221. AAAI (2016)
23.
Zurück zum Zitat Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef
24.
Zurück zum Zitat Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Computational Linguistics Meeting, pp. 417–424. ACL (2002) Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Computational Linguistics Meeting, pp. 417–424. ACL (2002)
25.
Zurück zum Zitat Wilson, T., et al.: OpinionFinder: a system for subjectivity analysis. In: HLT/EMNLP on Interactive Demonstrations, pp. 34–35. ACL (2005) Wilson, T., et al.: OpinionFinder: a system for subjectivity analysis. In: HLT/EMNLP on Interactive Demonstrations, pp. 34–35. ACL (2005)
26.
Zurück zum Zitat Wilson, T., et al.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT/EMNLP, pp. 347–354. ACL (2005) Wilson, T., et al.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT/EMNLP, pp. 347–354. ACL (2005)
27.
Zurück zum Zitat Wilson, T., et al.: Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Comput. Linguist. 35(3), 399–433 (2009)CrossRef Wilson, T., et al.: Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Comput. Linguist. 35(3), 399–433 (2009)CrossRef
Metadaten
Titel
Mining Worse and Better Opinions
verfasst von
Michela Fazzolari
Marinella Petrocchi
Alessandro Tommasi
Cesare Zavattari
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-60131-1_35