research-article

Open Access

Chinese EmoBank: Building Valence-Arousal Resources for Dimensional Sentiment Analysis

Authors:
Lung-Hao Lee

National Central University, Taoyuan City, Taiwan

National Central University, Taoyuan City, Taiwan
View Profile

,
Jian-Hong Li

National Central University, Taoyuan City, Taiwan

National Central University, Taoyuan City, Taiwan
View Profile

,
Liang-Chih Yu

National Central University Yuan Ze University, Taoyuan City, Taiwan

National Central University Yuan Ze University, Taoyuan City, Taiwan

0000-0003-1443-4347
View Profile

ACM Transactions on Asian and Low-Resource Language Information Processing Volume 21 Issue 4Article No.: 65pp 1–18https://doi.org/10.1145/3489141

Published:19 January 2022Publication History

ACM Transactions on Asian and Low-Resource Language Information Processing

Abstract

An increasing amount of research has recently focused on dimensional sentiment analysis that represents affective states as continuous numerical values on multiple dimensions, such as valence-arousal (VA) space. Compared to the categorical approach that represents affective states as distinct classes (e.g., positive and negative), the dimensional approach can provide more fine-grained (real-valued) sentiment analysis. However, dimensional sentiment resources with valence-arousal ratings are very rare, especially for the Chinese language. Therefore, this study aims to: (1) Build a Chinese valence-arousal resource called Chinese EmoBank, the first Chinese dimensional sentiment resource featuring various levels of text granularity including 5,512 single words, 2,998 multi-word phrases, 2,582 single sentences, and 2,969 multi-sentence texts. The valence-arousal ratings are annotated by crowdsourcing based on the Self-Assessment Manikin (SAM) rating scale. A corpus cleanup procedure is then performed to improve annotation quality by removing outlier ratings and improper texts. (2) Evaluate the proposed resource using different categories of classifiers such as lexicon-based, regression-based, and neural-network-based methods, and comparing their performance to a similar evaluation of an English dimensional sentiment resource.

1 INTRODUCTION

Sentiment analysis has emerged as a leading technique for the automatic identification of affective information texts [Pang and Lee 2008; Calvo and D'Mello 2010; Liu 2012; Feldman 2013]. In sentiment analysis, representation of affective states is an essential issue and can be generally divided into categorical and dimensional approaches [Calvo and Kim 2013].

The categorical approach represents affective states as several discrete classes such as positive, neutral, and negative, Ekman's six basic emotions (i.e., anger, happiness, fear, sadness, disgust and surprise) [Ekman 1992], and Plutchik's [1991] eight emotions (Ekman's six plus trust and anticipation). The dimensional approach represents affective states as continuous numerical values in multiple dimensions, such as valence-arousal (VA) space [Russell 1980], as shown in Figure 1. The valence represents the degree of pleasant and unpleasant (i.e., positive and negative) feelings, while the arousal represents the degree of excitement and calm. Based on this representation, any sentiment expressions can be represented as a point in the VA coordinate plane by recognizing their valence-arousal ratings. Any affective state can be represented as a point in the VA coordinate plane. Applications can benefit from such representation to provide more fine-grained (real-valued) sentiment analysis. For instance, mood analysis systems can identify high risk Twitter users with different mental illnesses because analysis of Twitter posts suggests that depressive users express lower valence and arousal than those with post-traumatic stress disorder (PTSD), and both are lower than control (normal) subjects [Preoţiuc-Pietro et al. 2015]. Product review systems can prioritize high-arousal positive (or high-arousal negative) reviews because recent marketing research suggests that these reviews are usually of interest and could drive purchasing behavior [Ren and Nickerson 2014].

Fig. 1. Two-dimensional valence-arousal space.

Affective lexicons and corpora with VA ratings are useful resources for the development of sentiment applications. For English, researchers have developed several dimensional lexicons such as the Affective Norms for English Words (ANEW) [Bradley and Lang 1999], Extended ANEW [Warriner et al. 2013], and NRC-VAD [Mohammad 2018b], and corpora such as Affective Norms for English Text (ANET) [Bradley and Lang 2007], Facebook posts [Preoţiuc-Pietro et al. 2016], and EmoBank [Buechel and Hahn 2017]. For Chinese, dimensional sentiment resources are very rare, including only one small lexicon of 162 words [Wei et al. 2011] and no corpora.

Therefore, this study focuses on building a Chinese valence-arousal resource named Chinese EmoBank, the first Chinese dimensional sentiment resource featuring various levels of text granularity including words, phrases, sentences and multi-sentence texts. Chinese EmoBank consists of two lexicons called Chinese valence-arousal words (CVAW) and Chinese valence-arousal phrases (CVAP) and two corpora called Chinese valence-arousal sentences (CVAS) and Chinese valence-arousal texts (CVAT). The CVAW contains 5,512 single words collected from two polarity-based sentiment lexicons, the Chinese LIWC (C-LIWC) [Huang 2012] and NTUSD [Ku and Chen 2007]. The CVAP contains 2,998 multi-word phrases where each phrase is composed of an affective word in the CVAW and a set of modifiers (e.g., negator, degree adverb, and modal) that modify the affective word. The CVAS contains 2,582 single sentences selected from the Twitter microblogging and social networking service. The CVAT contains 2,969 multi-sentence texts extracted from web forums, reviews, and news articles. The annotation of VA ratings is accomplished by crowdsourcing based on the Self-Assessment Manikin (SAM) rating scale [Bradley and Lang 1994]. A corpus cleanup procedure is also used to improve annotation quality by removing outlier ratings and improper texts. To further demonstrate the feasibility of the constructed resource, we evaluate it using different categories of classifiers such as lexicon-based, regression-based, and neural-network-based methods, and compare their performance to a similar evaluation of an English dimensional sentiment resource.

The rest of this paper is organized as follows. Section 2 introduces existing lexicons, corpora, and prediction methods for dimensional sentiment analysis. Section 3 describes the process of building Chinese EmoBank. Section 4 presents the analysis results and feasibility evaluation. Conclusions are finally drawn in Section 5.

2 RELATED WORK

This section presents existing single-dimension and multi-dimensions sentiment lexicons and corpora, followed by a description of automatic methods for dimensional score prediction at the word-, phrase- and sentence-levels.

2.1 Dimensional Sentiment Resources

Table 1 presents the language resources for dimensional sentiment analysis. A number of one-dimensional sentiment lexicons provide sentiment intensity or strength of words, including SentiWordNet [Baccianella et al. 2010], SentiFul [Neviarouskaya et al. 2011], SO-CAL [Taboada et al. 2011], AFINN [Nielsen 2011], SentiStrength [Thelwall et al. 2012], and VADER [Hutto and Gilbert 2014]. Specifically, NRC-EIL provides sentiment intensity for eight emotions [Mohammad 2018a]. The SemEval and WASSA shared tasks also released several datasets for single words, multi-word phrases [Rosenthal et al. 2015; Kiritchenko et al. 2016], and sentences [Cortis et al. 2017; Mohammad and Bravo-Marquez 2017; Mohammad et al. 2018]. Stanford Sentiment Treebank [Socher et al. 2013] provided fully labeled parse trees containing sentiment scores at both the phrase- and sentence-levels.

Table 1.

Lexicon	Granularity	Size	Scale	Dimension
SentiWordNet [Baccianella et al. 2010]	Word	147,306	Continuous [0, 1]	Valence
SentiFul [Neviarouskaya et al. 2011]	Word	12,900	Continuous [0, 1]	Valence
SO-CAL [Taboada et al. 2011]	Word	5,042	Multi-point [−5, 5]	Valence
AFINN [Nielsen 2011]	Word	2,477	Multi-point [−5, 5]	Valence
SentiStrength [Thelwall et al. 2012]	Word	2,609	Multi-point [−4, 4]	Valence
VADER [Hutto and Gilbert 2014]	Word	7,520	Continuous [−4, 4]	Valence
NRC-EIL [Mohammad 2018a]	Word	9,921	Continuous [0, 1]	Valence for Eight emotions
SemEval 2015 Task 10 [Rosenthal et al. 2015]	Word/Phrase	1,515 (subtask E)	Continuous [0, 1]	Valence
SemEval 2016 Task 7 [Kiritchenko et al. 2016]	Word/Phrase	3,207 (subtask 1)	Continuous [−1, 1]	Valence
SST [Socher et al. 2013b]	Sentence	11,855	Continuous [0, 1]	Valence
SemEval-2017 Task 5 [Cortis et al. 2017]	Tweets (subtask 1) Headlines (subtask 2)	2,510 (subtask 1) 1,647 (subtask 2)	Continuous [−1, 1]	Valence
WASSA-2017 [Mohammad and Bravo-Marquez 2017]	Tweets	7,097	Continuous [0, 1]	Valence for four emotions
SemEval-2018 Task 1 [Mohammad et al. 2018]	Tweets	12,634 (EI-reg) 2,567 (V-reg)	Continuous [0, 1]	Valence for four emotions
ANEW [Bradley and Lang 1999]	Word	1,034	Continuous [1,9]	Valence, Arousal, Dominance
Extended ANEW [Warriner et al. 2013]	Word	13,915	Continuous [1,9]	Valence, Arousal, Dominance
NRC-VAD [Mohammad 2018b]	Word	20,007	Continuous [0, 1]	Valence, Arousal, Dominance
ANET [Bradley and Lang 2007]	Text	120	Continuous [1,9]	Valence, Arousal, Dominance
Facebook posts [Preoţiuc-Pietro et al. 2016]	Sentence	2,895	Continuous [1,9]	Valence, Arousal
EmoBank [Buechel and Hahn 2017]	Sentence	10,062	Continuous [1,9]	Valence, Arousal, Dominance

	Number of Instances	Valence		Arousal
Mean	SD	Mean	SD
CVAW	5,512	4.540	0.717	5.023	1.276
CVAP	2,998	4.594	0.451	5.617	0.561
CVAS	2,582	4.637	0.410	4.967	1.035
CVAT	2,969	4.803	0.664	4.845	1.084

Phrase Type	Pattern	Number (ratio%)
2-word phrases	Negator + Word	181 (6.04%)
2-word phrases	Degree Adverb + Word	1,160 (38.69%)
Modal + Word	143 (4.77%)
3-word phrases	Negator + Degree Adverb + Word	373 (12.44%)
	Degree Adverb + Negator + Word	646 (21.55%)
	Modal + Negator + Word	151 (5.05%)
	Modal + Degree Adverb + Word	323 (10.77%)
Degree Adverb + Modal + Word	21 (0.70%)
Total	All	2,998 (100%)

	Category	Num. of Texts (ratio%)	Num. of Sentences	Num. of Words	Avg. Words
CVAS	Twitter	2,582 (100%)	2,582	18,383	7.12
CVAT	Book Review	287 (9.67%)	1,007	6,958	6.91
	Car Forum	253 (8.52%)	859	11,124	12.95
	Hotel Review	299 (10.07%)	1,001	6,101	6.10
	Laptop Review	182 (6.13%)	738	4,538	6.15
	Politics Forum	439 (14.78%)	1,717	13,420	7.82
	News Article	1,509 (50.83%)	6,771	50,096	7.40
Total	2,969 (100%)	12,093	92,237	7.63

Methods	CNN	RNN, LSTM	Attention	XLNet	BERT
Filter Number	60	–	60	–	–
Filters Length	3	–	3	–	–
Pool Length	2	–	2	–	–
Hidden State Dim.	–	120	120	120	120
Layer Number	–	–	–	24 (Cht.)12 (Eng.)	12
Hidden Number	–	–	–	768	768
Head Number	–	–	–	12	12
Optimizer	Adam
Batch Size	32
(Recurrent) Dropout	0.25
Epoch	20

Chinese EmoBank: Building Valence-Arousal Resources for Dimensional Sentiment Analysis

ACM Transactions on Asian and Low-Resource Language Information Processing

Abstract

1 INTRODUCTION

2 RELATED WORK

2.1 Dimensional Sentiment Resources

2.2 Dimension Score Prediction

3 THE CHINESE EMOBANK CONSTRUCTION

3.1 Data Collection

3.2 Annotation Details

3.3 Corpus Cleanup

4 RESULTS AND EVALUATION

4.1 Results of the Chinese EmoBank

4.2 Valence-Arousal Rating Prediction

5 CONCLUSIONS AND FUTURE WORK

Footnotes

REFERENCES

Cited By

Index Terms

Recommendations

Biometric valence and arousal recognition

Locally weighted linear regression for cross-lingual valence-arousal prediction of affective words

Acting emotions: physiological correlates of emotional valence and arousal dynamics in theatre

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media