Published in: The Journal of Supercomputing 7/2017

29-07-2016

Development of a Chinese opinion-mining system for application to Internet online forums

Authors: Shih-Jung Wu, Rui-Dong Chiang, Zheng-Hong Ji

Abstract

Articles posted on a forum often contain new Internet words related to opinion elements (feature words and opinion words). Existing Chinese opinion-mining systems may therefore exhibit low recall and precision because they cannot recognize these new words. We propose a simple algorithm that extracts the opinion elements of such articles. Moreover, when an opinion word is combined with a specific word or concatenated with another opinion word, its polarity or meaning may change, which leads to errors in the analysis results of a Chinese opinion-mining system. We designed three context-dependent algorithms to address this problem. In this paper, we develop a semi-automatic Chinese opinion-mining system that uses these algorithms to extract new opinion elements. Each new word the system identifies is then judged manually to determine whether it is a useful opinion element for a specific domain; if so, it is added to the thesaurus. In comparison with semi-automatic annotation methods, our approach can save considerable labor. After a 20-month follow-up analysis, the experimental data indicated that the precision, recall, and F1 of the system reached 84.0 %, 89.4 %, and 0.865, respectively.
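
As a quick sanity check on the reported figures, the sketch below computes F1 as the harmonic mean of precision and recall, which is the standard definition. The function name `f1_score` is chosen here for illustration, and the abstract does not say how the authors aggregated their results over the 20 months, so this is only an approximate cross-check, not their evaluation procedure.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Headline figures from the abstract, expressed as fractions.
precision = 0.840
recall = 0.894

print(round(f1_score(precision, recall), 3))  # prints 0.866
```

The result, about 0.866, is close to the reported 0.865. The small gap is consistent with rounding or with F1 having been averaged across evaluation periods rather than computed from the aggregate precision and recall, though the abstract does not specify which.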

Metadata
Title
Development of a Chinese opinion-mining system for application to Internet online forums
Authors
Shih-Jung Wu
Rui-Dong Chiang
Zheng-Hong Ji
Publication date
29-07-2016
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 7/2017
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-016-1816-6