nach oben

Artificial Intelligence Review

Erschienen in:

Open Access 25.06.2021

Over a decade of social opinion mining: a systematic review

verfasst von: Keith Cortis, Brian Davis

Erschienen in: Artificial Intelligence Review | Ausgabe 7/2021

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Patentsuche

Aus

Abstract

Social media popularity and importance is on the increase due to people using it for various types of social interaction across multiple channels. This systematic review focuses on the evolving research area of Social Opinion Mining, tasked with the identification of multiple opinion dimensions, such as subjectivity, sentiment polarity, emotion, affect, sarcasm and irony, from user-generated content represented across multiple social media platforms and in various media formats, like text, image, video and audio. Through Social Opinion Mining, natural language can be understood in terms of the different opinion dimensions, as expressed by humans. This contributes towards the evolution of Artificial Intelligence which in turn helps the advancement of several real-world use cases, such as customer service and decision making. A thorough systematic review was carried out on Social Opinion Mining research which totals 485 published studies and spans a period of twelve years between 2007 and 2018. The in-depth analysis focuses on the social media platforms, techniques, social datasets, language, modality, tools and technologies, and other aspects derived. Social Opinion Mining can be utilised in many application areas, ranging from marketing, advertising and sales for product/service management, and in multiple domains and industries, such as politics, technology, finance, healthcare, sports and government. The latest developments in Social Opinion Mining beyond 2018 are also presented together with future research directions, with the aim of leaving a wider academic and societal impact in several real-world applications.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

1 Introduction

Social media is increasing in popularity and also in its importance. This is principally due to the large number of people who make use of different social media platforms for various types of social interaction. Kaplan and Haenlein define social media as “a group of Internet-based applications that build on the ideological and technological foundations of Web 2.0, which allows the creation and exchange of user generated content” (Kaplan and Haenlein 2010). This definition fully reflects that social media platforms are essential for online users to submit their views and also read the ones posted by other people about various aspects and/or entities, such as opinions about a political party they are supporting in an upcoming election, recommendations of products to buy, restaurants to eat in and holiday destinations to visit. In particular, people’s social opinions as expressed through various social media platforms can be beneficial in several domains, used in several applications and applied in real-life scenarios. Therefore, mining of people’s opinions, which are usually expressed in various media formats, such as textual (e.g., online posts, newswires), visual (e.g., images, videos) and audio, is a valuable business asset that can be utilised in many ways ranging from marketing strategies to product or service improvement. However as indicated in Ravi and Ravi (2015), dealing with unstructured data, such as video, speech, audio and text, creates crucial research challenges.

This research area is evolving due to the rise of social media platforms, where several work already exists on the analysis of sentiment polarity. Moreover, researchers can gauge widespread opinions from user-generated content and better model and understand human beliefs and their behaviour. Opinion Mining is regarded as a challenging Natural Language Processing (NLP) problem, in particular for social data obtained from social media platforms, such as Twitter¹, and also for transcribed text. Standard linguistic processing tools were built and developed on newswires and review-related data due to such data following more strict grammar rules. These differences should be taken in consideration when performing any kind of analysis (Balazs and Velásquez 2016). Therefore, social data is difficult to analyse due to the short length in text, the non-standard abbreviations used, the high sparse representation of terms and difficulties in finding out the synonyms and any other relations between terms, emoticons and hashtags used, lack of punctuations, use of informal text, slang, non-standard shortcuts and word concatenations. Hence, typical NLP solutions are not likely to work well for Opinion Mining.

Opinion Mining—presently a very popular field of study—is defined by Liu and Zhang as “the computational study of people’s opinions, appraisals, attitudes, and emotions toward entities, individuals, issues, events, topics and their attributes” (Liu and Zhang 2012). Social is defined by the Merriam-Webster Online dictionary² as “of or relating to human society, the interaction of the individual and the group, or the welfare of human beings as members of society”.

In light of this, we define Social Opinion Mining (SOM) as “the study of user-generated content by a selective portion of society be it an individual or group, specifically those who express their opinion about a particular entity, individual, issue, event and/or topic via social media interaction”.

Therefore, the research area of SOM is tasked with the identification of several dimensions of opinion, such as sentiment polarity, emotion, sarcasm, irony and mood, from social data which is represented in structured, semi-structured and/or unstructured data formats. Information fusion is the field tasked with researching about efficient methods for automatically or semi-automatically transforming information from different sources into a single coherent representation, which can be used to guide the fusion process. This is important due to the diversity in data in terms of content, format and volume (Balazs and Velásquez 2016). Sections 1.1 and 1.2 provide information about SOM and its challenges.

In addition, SOM is generally very personal to the individual responsible for expressing an opinion about an object or set of objects, thus making it user-oriented from an opinion point-of-view, e.g., a social post about an event on Twitter, a professional post about a job opening on LinkedIn³ or a review about a hotel on TripAdvisor⁴.

Our SOM research focuses on microposts—i.e. information published on the Web that is small in size and requires minimal effort to publish (Cano et al. 2016)—that are expressed by individuals on a microblogging service, such as Sina Weibo⁵ or Twitter and/or a social network service that has its own microblogging feature, such as Facebook⁶ and LinkedIn.

1.1 Opinion mining versus social opinion mining

In 2008, Pang and Lee had already identified the relevance between the field of “social media monitoring and analysis” and the body of work reviewed in Pang and Lee (2008), which deals with the computational treatment of opinion, sentiment and subjectivity in text. This work is nowadays known as opinion mining, sentiment analysis, and/or subjectivity analysis (Pang and Lee 2008). Other phrases, such as review mining and appraisal extraction have also been used in the same context, whereas some connections have been found to affective computing (where one of its goals is to enable computers in recognising and expressing emotions) (Pang and Lee 2008). Merriam-Webster’s Online Dictionary defines that the terms⁷ “opinion”, “view”, “belief”, “conviction”, “persuasion” and “sentiment” mean a judgement one holds as true. This shows that the distinctions in common usage between these terms can be quite subtle. In light of this, three main three research areas—opinion mining, sentiment analysis and subjectivity analysis—are all related and use multiple techniques taken from NLP, information retrieval, structured and unstructured data mining (Ravi and Ravi 2015). However, even though these three concepts are broadly used as synonyms, thus used interchangeably, it is worth noting that their origins differ. Some authors also consider that each concept presents a different understanding (Serrano-Guerrero et al. 2015), and also have different notions (Tsytsarau and Palpanas 2012). We are in agreement with this, hence we felt that a new terminology is required to properly specify what SOM means, as defined in Sect. 1.

According to Cambria et al., sentiment analysis can be considered as a very restricted NLP problem, where the polarity (negative/positive) of each sentence and/or target entities or topics needs to be understood (Cambria et al. 2013). On the other hand, Liu discusses that “opinions are usually subjective expressions that describe people’s sentiments, appraisals or feelings toward entities, events and their properties” (Liu 2010). He further identifies two sub-topics of sentiment and subjectivity analysis, namely sentiment classification (or document-level sentiment classification) and subjectivity classification. SOM requires such classification methods to determine an opinion dimension, such as objectivity/subjectivity and sentiment polarity. For example, subjectivity classification is required to classify whether user-generated content, such as a product review, is objective or subjective, whereas sentiment classification is performed on subjective content to find the sentiment polarity (positive/negative) as expressed by the author of the opinionated text. In cases where the user-generated content is made up of multiple sentences, sentence-level classification needs to be performed to determine the respective opinion dimension. In addition, sentence-level classification is not suitable for compound sentences, i.e., a sentence that expresses more than one opinion. For such cases, aspect-based opinion mining needs to be performed.

1.2 Issues and challenges

Pang and Lee (2008) had already identified that the writings of Web users can be very challenging in their own way due to numerous factors, such as the quality of written text, discourse structure and the order in which different opinions are presented. The effects of the latter factor can result in a completely opposite overall sentiment polarity, where the order effects can completely overwhelm the frequency effects. This is not the case in traditional text classification, where if a document refers to the term “car” in a frequent manner, the document will probably somewhat be related to cars. Therefore, order dependence manifests itself in a more fine-grained level of analysis.

Liu (2010) mentions that complete sentences (for reviews) are more complex than short phrases and contain a large amount of noise, thus making it more difficult to extract features for feature-based sentiment analysis. Even though we agree that with more text, comes a higher probability of spelling mistakes, etc., we tend to disagree that shorter text, such as microposts, contain less noise.

The process of mining user-generated content posted on the Web is very intricate and challenging due to the nature of short textual content limit (e.g., tweets allowed up to 140 characters till October 2017), which at times forces a user to resort in using short words, such as acronyms and slang, to make a statement. These often lead to further issues in the text, such as misspellings, incomplete content, jargon, incorrect acronyms and/or abbreviations, emoticons and content misinterpretation (Cortis 2013). Other noteworthy challenges include swear words, irony, sarcasm, negatives, conditional statements, grammatical mistakes, use of multiple languages, incorrect language syntax, syntactically inconsistent words, and different discourse structures. In fact, when informal language is used in the user-generated content, the grammar and lexicon varies from the standard language normally used (Dashtipour et al. 2016). Moreover, user-generated text exhibits more language variation due to it being less grammatical than longer posts, where the aforementioned use of emoticons, abbreviations together with hashtags and inconsistent capitalisation, can form an important part of the meaning (Maynard et al. 2012). Maynard et al. (2012) also points out that microposts are in some sense the most challenging type of text for text mining tools especially for opinion mining, since they do not contain a lot of contextual information and assume much implicit knowledge. Another issue is ambiguity, since microposts such as tweets do not follow a conversation thread. Therefore, this isolation from other tweets makes it more difficult to make use of coreference information unlike in blog posts and comments. Due to the short textual content, features can also be sparse to find and use, in terms of text representation (Wang et al. 2014). In addition, the majority of microposts usually contain information about a single topic due to the length limitation, which is not the case in traditional blogs, where they contain information on more than one topic given that they do not face the same length limitations (Giachanou and Crestani 2016).

Big data challenges, such as handling and processing large volumes of streaming data, are also encountered when analysing social data (Bravo-Marquez et al. 2014). Limited availability of labelled data and dealing with the evolving nature of social streams usually results in the target concept changing which would require the learning models to be constantly updated (Guerra et al. 2014).

In light of the above, social networking services bring several issues and challenges with them and the way in how content is generated by their users. Therefore, several Information Extraction tasks, such as Named Entity Recognition (NER) and Coreference Resolution, might be required to carry out multi-dimensional SOM. In fact, several shared evaluation tasks are being organised to try and reach a standard mechanism towards performing IE tasks on noisy text which is very common in user-generated social media content. As already discussed in detail above, such tasks are much harder to solve when they are applied on micro-text like microposts (Ravi and Ravi 2015). This problem presents serious challenges on several levels, such as performance. Examples of such tasks are “Named Entity Recognition in Twitter”⁸.

In terms of content, social media-based studies present only analysis and results from a selective portion of society, since not everyone uses social media. Moreover, several cross-cultural differences and factors determine the social media usage in each country and hence the results of such studies. For example for the Political domain, these services are used predominantly by young and politically active individuals or by ones with strong political views. This could be easily reflected in the Brexit results, where the majority of younger generation (age 18–44) voted to remain in the European Union as opposed to people over age 45. Such a result falls in line with the latest United Kingdom social media statistics, such as for Twitter, where 72% of the users are between the age of 15–44, whilst for Facebook the most popular age group is 25–34 (26% of users) (Hürlimann et al. 2016). However, results of similar studies in other cultures and languages might differ due to different use of social words to reflect a general opinion, sentiment polarity and/or emotion (Lin et al. 2018).

1.3 Systematic review

In light of the above, it is noteworthy that no systematic review within this newly defined domain exists even though there are several good survey papers (Liu and Zhang 2012; Tsytsarau and Palpanas 2012; Medhat et al. 2014; Ravi and Ravi 2015). The research paper by Bukhari et al. (2016) is closest to a systematic review in this domain, whereby the authors performed a search over the ScienceDirect and SpringerLink electronic libraries for the “sentiment analysis”, “sentiment analysis models”, “sentiment analysis of microblogs” terms. As a result, we felt that the SOM domain well and truly deserves a thorough systematic review that captures all of the relevant research conducted over the last decade. This review also identifies the current literature gaps within this popular and constantly evolving research domain.

The structure of this comprehensive systematic review is as follows: Sect. 2 presents the research method adopted to carry out this review, followed by Sect. 3 which provides a thorough review analysis of the main aspects derived from the analysed studies. This is followed by Sect. 4 which focuses on the different dimensions of social opinions as derived from the analysed studies, and Sect. 5 which presents the application areas where SOM is being used. Lastly, Sect. 6 discusses the the latest developments for SOM (beyond the period covered by the systematic review) and future research directions as identified by the authors.

2 Research method

This survey paper about SOM adopts a systematic literature review process. This empirical research process was based on the guidelines and procedures proposed by Kitchenham (2004), Brereton et al. (2007), Dyba et al. (2007) and Attard et al. (2015) which were focused on the software engineering domain. The systematic review process although more time consuming is reproducible, minimising bias and maximising internal and external validity. The procedure undertaken was structured as follows and is explained in detail within the sub-sections below:

Specification of research questions;

Generation of search strategy which includes the identification of electronic sources (libraries) and selection of relevant search terms;

Application of the relevant search;

Choice of primary studies via the utilisation of inclusion and exclusion criteria on the obtained results;

Extraction of required data from primary studies;

Synthesis of data.

2.1 Research questions

A systematic literature review is usually characterised by an appropriate generic “research question, topic area, or phenomenon of interest” (Kitchenham 2004). This question can be expanded into a set of sub-questions that are more clearly defined, whereby all available research relevant to these sub-questions are identified, evaluated and interpreted.

The goal of this systematic review is to identify, analyse and evaluate current opinion mining solutions that make use of social data (data extracted from social media platforms). In light of this, the following generic research question is defined:

What are the existing opinion mining approaches which make use of user-generated content obtained from social media platforms?

The following are specific sub-questions that the generic question above can be sub-divided into:

What are the existing approaches that make use of social data for opinion mining and how can they be classified⁹?

What are the different dimensions/types of social opinion mining?

What are the challenges faced when performing opinion mining on social data?

What techniques, datasets, tools/technologies and resources are used in the current solutions?

What are the application areas of social opinion mining?

2.2 Search strategy

The search strategy for this systematic review is primarily directed via the use of published papers which consist of journals, conference/workshop proceedings, or technical reports. The following electronic libraries were identified for use, due to their wide coverage of relevant publications within our domain: ACM Digital Library¹⁰, IEEE Xplore Digital Library¹¹, ScienceDirect¹², and SpringerLink¹³.

The first three electronic libraries listed were used by three out of the four systematic reviews that our research process was based on (and which made use of a digital source), whereas SpringerLink is one of the most popular sources for publishing work in this domain (as will be seen in Sect. 2.4 below). Moreover, three other electronic libraries were considered for use, two –Web of Science¹⁴ and Ei Compendex¹⁵– which the host university did not have access to and Google Scholar¹⁶ which was not included, since content is obtained from the electronic libraries listed above (and more), thus making the process redundant.

The relevant search terms were identified for answering the research questions defined in Sect. 2.1. In addition, these questions were also used to perform some trial searches before the following list of relevant search terms was determined:

“Social opinion mining”;

“Social sentiment analysis”;

“Opinion mining social media”;

“Sentiment analysis social media”;

“Microblog opinion mining”;

“Microblog sentiment analysis”;

“Social network sentiment”;

“Social network opinion”;

“Social data sentiment analysis”;

10.

“Social data opinion mining”;

11.

“Twitter sentiment analysis”;

12.

“Twitter opinion mining”;

13.

“Social data analysis”.

The following are important justifications behind the search terms selected above:

“opinion mining” and “sentiment analysis”: are both included due to the fact that these key terms are used interchangeably to denote the same field of study (Pang and Lee 2008; Cambria et al. 2013), even though their origins differ and hence do not refer to the same concept (Serrano-Guerrero et al. 2015);
“microblog”, “social network” and “Twitter”: the majority of the opinion mining and/or sentiment analysis research and development efforts target these two kinds of social media platforms, in particular the Twitter microblogging service.

2.3 Search application

The “OR” Boolean operator was chosen to formulate the search string. The search terms were all linked using this operator, making the search query simple and easy to use across multiple electronic libraries. Therefore, a publication only had to include any one of the search terms to be retrieved (Attard et al. 2015). In addition, this operator is more suitable for the defined search terms given that this study is not a general one e.g., about opinion mining in general, but is focused about opinion mining in a social context. Construction of the correct search string (and terms) is very important, since this eliminates noise (i.e. false positives) as much as possible and at the same time still retrieves potential relevant publication which increases recall.

Several other factors had to be taken in consideration during the application of search terms on the electronic libraries. The following is a list of factors relevant to our study, identified in Brereton et al. (2007) and verified during our search application process:

Electronic library search engines have different underlying models, thus not always provide required support for systematic searching;
Same set of search terms cannot be used for multiple engines e.g., complex logical combination not supported by the ACM Digital Library but is by the IEEE Xplore Digital Library;
Boolean search string is dependent on the order of terms, independent of brackets;
Inconsistencies in the order or relevance in search results (e.g., IEEE Xplore Digital Library results are sorted in order of relevance);
Certain electronic libraries treat multiple words as a Boolean term and look for instances of all the words together (e.g., “social opinion mining”). In this case, the use of the “AND” Boolean operator (e.g., “social AND opinion AND mining”) looks for all of the words but not necessary together.

On the above, in our case it was very important to select a search strategy that is more appropriate to the review’s research question which could be applied to the selected electronic libraries.

When applying the relevant search on top of the search strategy defined in Sect. 2.2, another important element was to identify appropriate metadata fields upon which the search string can be executed. Table 1 presents the ones applied in our study.

Table 1

Metadata fields used in search application

Metadata field	ACM	IEEE Xplore	ScienceDirect	SpringerLink
Title	\(\checkmark\)	\(\checkmark\)	\(\checkmark\)	\(\checkmark\)
Abstract	\(\checkmark\)	\(\checkmark\)	\(\checkmark\)	\(\checkmark\)
Keywords	\(\checkmark\)	\(\checkmark\)	\(\checkmark\)

Applying the search on the title metadata field alone would result in several missed and/or incorrect results. Therefore, using the abstract and/or keywords in the search is very important to reduce the number of irrelevant results. In addition, this ensures that significant publications that lack any of the relevant search terms within their title are returned.

A separate search method was applied for each electronic library, since they all offer different functionalities and have different underlying models. Each method is detailed below:

ACM: Separate searches for each metadata field were conducted and results were merged (duplicates removed). Reason being that the metadata field search functionality “ANDs” all metadata fields, whereas manual edition of the search query does not work well when amended.
IEEE: Separate searches for each metadata field were conducted and results were merged (duplicates removed).
ScienceDirect: One search that takes in consideration all the chosen metadata fields.
SpringerLink: By entering a search term or phrase, a search is conducted over the title, abstract and full-text (including authors, affiliations and references) of every article and book chapter. This was noted in the large amount of returned papers (as will be discussed in the next sub-section), which results in a high amount of false positives (and possibly a higher recall).

2.4 Study selection

A manual study selection was performed on the primary studies obtained from the search application defined in Sect. 2.3. This is required to eliminate any studies that might be irrelevant even though the search terms appear in either of the metadata fields defined in Table 1 above. Therefore, inclusion and exclusion criteria (listed below) were defined.

Published papers that meet any of the following inclusion criteria are chosen as primary studies:

I1. A study that targeted at least one social networking service and/or utilised a social dataset besides other social media services, such as blogs, chats and wikis. Please note that only work performed on social data from social networking services is taken in consideration for the purposes of this review;
I2. A study published from the year 2007 onwards. This year was chosen, since the mid-2000s saw the evolution of several social networking services, in particular Facebook’s growth (2007), which currently contains the highest monthly active users;
I3. A study published in the English language.

Published papers that satisfy any of the exclusion criteria from the following list, are removed from the systematic review:

E1. A study published before 2007;
E2. A study that does not focus on performing any sort of opinion mining on social media services, even though it mentions some of the search terms;
E3. A study that focuses on opinion mining or sentiment analysis in general i.e. no reference in a social context;
E4. A study that is only focused on social data sources obtained from online forums, communities, blogs, chats, social news websites (e.g., Slashdot¹⁷), review websites (e.g., IMDb¹⁸);
E5. A study that consists of either a paper’s front cover and/or title page.

Selection of the primary studies for this systematic review was carried out in 2019. Therefore, studies indexed or published from 2019 onwards, are not included in this review.

Table 2

Primary studies selection procedure from the electronic libraries

Primary studies	ACM	IEEE Xplore	ScienceDirect	SpringerLink
Search application	106	242	57	456
False positives	39	83	17	262
Study selection	67	159	40	194
No full paper access	0	0	5	4
Full paper access	67	159	35	190
Total	451

Table 2 shows the results for each electronic library at each step of the procedure used for selecting the final set of primary studies. The results included one proceedings, which was resolved by including all the published papers within the track relevant to this study¹⁹, since the other papers were not relevant thus not included in the initial results. The search application phase resulted in a total of 861 published papers. False positives, which consist of duplicate papers and papers that meet any of the exclusion criteria were removed. This was done through a manual study selection which was performed on all the metadata fields considered i.e. the title, abstract and keywords. In cases where we were still unclear of whether a published paper is valid or not, we went through the full text. This study selection operation left us with 460 published papers, where the number of false positives totalled 401. Out of the final study selection published papers, we did not have full access to 9 published papers, thus reducing the total primary studies to 451.

In addition to the primary studies selected from the electronic libraries, we added a set of relevant studies –34 published papers (excluding survey papers)– for completeness sake which were either published in reputable venues within the Opinion Mining community or were highly cited. Therefore, the final set of primary studies totals 485 published papers.

2.5 Extraction of data

2.5.1 Overall

The main objective of this study is to conduct a systematic analysis of the current literature in the field of SOM. Each published paper in this review was analysed in terms of the following information/parameters: social media platforms, techniques and approaches, social datasets, language, modality, tools and technologies, (other) NLP tasks, application areas and opinion mining dimensions. It is important to note that this information was manually extracted from each published paper. In the sub-sections below we discuss the overall statistics about the relevant primary studies that resulted from the study selection phase of this systematic review.

2.5.2 Study selection: electronic libraries

Figure 1 shows that the first three years of this evaluation period, i.e., 2007–2009, did not return any relevant literature. It is important to note that 2006 and 2007 was the period when opinion mining emerged in Web applications and weblogs within multiple domains, such as politics and marketing (Pang and Lee 2008). However, 2010—which year coincides with the introduction of various social media platforms and the major increase in Facebook and Twitter usage²⁰—resulted in the first relevant literature, which figures kept increasing in the following years. Please note that the final year in evaluation, that is 2018, contains literature that was published or indexed till the 31st December 2018. From the twelve full years evaluated, 2018 produced the highest number of relevant literature. This shows the importance of opinion mining on social data, and therefore the continuous increase in social media usage and popularity, in particular social networking services. Moreover, SOM solutions are on the increase for various real world applications.

2.5.3 Study selection: additional set

The additional set of studies included in this systematic review, were published in the period between the year 2009 and 2014. These ranged from various publishers, namely the four selected for this study (ACM, IEEE Xplore, ScieneDirect and SpringerLink) and other popular ones, such as Association for the Advancement of Artificial Intelligence (AAAI)²¹, Association for Computational Linguistics (ACL)²² and Wiley Online Library²³.

2.6 Synthesis of data

The data synthesis of this detailed analysis is based on the extracted data mentioned in Sect. 2.5.1 above, which is discussed in the subsequent sections.

3 Review analysis

Table 3 provides different high level categories of the primary studies selected for this systematic review, discussed in Sect. 2.4.

Table 3

Categories of primary studies

Categories	ACM	IEEE Xplore	ScienceDirect	SpringerLink	Additional Set
Study selection	67	159	40	194	34
No full paper access	0	0	5	4	0
Surveys	2	5	3	8	0
Work can be applied/used on social data	1	0	0	1	0
Organised tasks	0	0	0	2	0

It must be noted that not all the published papers were considered in the analysis conducted. Therefore, this table is referenced in all of the different aspects of the data synthesised, as presented below. It presents the primary studies returned from each electronic library and the additional ones, together with the ones that do not have full access, survey papers, papers which present work that can be applied/used on social data, and papers originating from organised tasks within the domain.

The in-depth analysis, which focused on the social media platforms, techniques, social datasets, language, modality, tools and technologies, NLP tasks and other aspects used across the published papers, is presented in Sects. 3.1–3.7.

Social data refers to online data generated from any type of social media platform be it from microblogging, social networking, blogging, photo/video sharing and crowdsourcing. Given that this systematic survey focuses on opinion mining approaches that make use of social networking and microblogging services, we identify the social media platforms used in the studies within this review.

In total, 469 studies were evaluated with 66 from ACM, 155 from IEEE Xplore, 32 from ScienceDirect, 182 from SpringerLink and 34 additional ones. Papers which did not provide full access were excluded. Note that 4 survey papers—2 from ACM (Giachanou and Crestani 2016; Zimbra et al. 2018), 1 from IEEE Xplore (Wagh and Punde 2018), 1 from SpringerLink (Abdullah and Hadzikadic 2017)—and 2 SpringerLink organised/shared task papers (Loukachevitch and Rubtsova 2015; Patra et al. 2015) were included, since the former papers focus on Twitter Sentiment Analysis methods whereas the latter papers focus on Sentiment Analysis of tweets (therefore the target social media platform of all evaluated papers is clear in both cases). None of the other 14 survey papers (Rajalakshmi et al. 2017; Yenkar and Sawarkar 2018; Abdelhameed and Muñoz-Hern’andez 2017; Rathan et al. 2017; Liu and Young 2018; Zhang et al. 2018; Ravi and Ravi 2015; Nassirtoussi et al. 2014; Beigi et al. 2016; Lo et al. 2017; Ji et al. 2016; Batrinca and Treleaven 2015; Li et al. 2014; Lin and He 2014) have been included, since various social media platforms were used in the respective studies evaluated. In addition, 2 papers that presented a general approach which can be applied/used on social data (i.e., not on any source) (Min et al. 2013; El Haddaoui et al. 2018) have also not been included.

Out of these studies, 429 made use of 1 social media platform, whereas 32 made use of 2–4 social media platforms, as can be seen in Fig. 2.

With respect to social media platforms, in total 504 were used across all of the studies. These span over the following 18 different ones, which are also listed in Table 4:

Twitter : a microblogging platform that allows publishing of short text updates (“microposts”);

Sina Weibo : a Chinese microblogging platform that is like a hybrid of Twitter and Facebook;

Facebook : a social networking platform that allows users to connect and share content with family and friends online;

YouTube²⁴: a video sharing platform;

Tencent Weibo²⁵: a Chinese microblogging platform;

TripAdvisor : a travel platform that allows people to post their reviews about hotels, restaurants and other travel-related content, besides offering accommodation bookings;

Instagram²⁶: a platform for sharing photos and videos from a smartphone;

Flickr²⁷: an image- and video-hosting platform that is popular for sharing personal photos;

Myspace²⁸: a social networking platform for musicians and bands to show and share their talent and connect with fans;

10.

Digg²⁹: a social bookmarking and news aggregation platform that selects stories to the specific audience;

11.

Foursquare³⁰: formerly a location-based service and nowadays a local search and discovery service mobile application known as Foursquare City Guide;

12.

Stocktwits³¹: a social networking platform for investors and traders to connect with each other;

13.

LinkedIn³²: a professional networking platform that allows users to communicate and share updates with colleagues and potential clients, job searching and recruitment;

14.

Plurk³³: a social networking and microblogging platform;

15.

Weixin³⁴: a Chinese multi-purpose messaging and social media app developed by Tencent;

16.

PatientsLikeMe³⁵: a health information sharing platform for patients;

17.

Apontador³⁶: a Brazilian platform that allows users to share their opinions and photos on social networks and also book hotels and restaurants;

18.

Google+³⁷: formerly a social networking platform (shut down in April 2019) that included features such as posting photos and status updates, group different relationship types into Circles, organise events and location tagging.

Table 4

Social media platforms used in the studies

Social Media Platform	ACM	IEEE Xplore	ScienceDirect	SpringerLink	Additional Set	Total
Twitter	53	130	25	136	27	371
Sina Weibo	4	13	1	26	2	46
Facebook	4	10	3	10	3	30
YouTube	7	1	2	1	1	12
Tencent Weibo	0	1	1	5	1	8
TripAdvisor	0	1	2	4	0	7
Instagram	2	3	0	1	0	6
Flickr	0	2	0	3	0	5
Myspace	2	0	0	0	3	5
Digg	2	0	0	0	1	3
Foursquare	2	0	0	1	0	3
Stocktwits	0	1	1	0	0	2
LinkedIn	1	0	0	0	0	1
Plurk	0	0	0	1	0	1
Weixin	0	0	1	0	0	1
PatientsLikeMe	0	1	0	0	0	1
Apontador	0	0	0	0	1	1
Google+	0	1	0	0	0	1

Overall, Twitter was the most popular with 371 opinion mining studies making use of it, followed by Sina Weibo with 46 and Facebook with 30. Other popular platforms such as YouTube (12), Tencent Weibo (8), TripAdvisor (7), Instagram (6) and Flickr (5) were also used in a few studies. These results show the importance and popularity of microblogging platforms, such as Twitter and Sina Weibo, which are also very frequently used for research and development purposes in this domain. Such microblogging platforms provide researchers the possibility of using an Application Programming Interface (API) to access social data, which plays a crucial role in selecting them for their studies. On the other hand, data retrieval from other social media platforms such as Facebook, is becoming more challenging due to ethical concerns. For example, Facebook access to the Public Feed API³⁸ is restricted and users cannot apply for it.

3.2 Techniques

For this analysis, 465 studies were evaluated: 65 from ACM, 154 from IEEE Xplore, 32 from ScienceDirect, 180 from SpringerLink and 34 additional ones. Studies excluded are the ones with no full access, surveys, and organised task papers. The main aim was to identify the technique/s used for the opinion mining process on social data. Therefore, they were categorised under the following approaches: Lexicon (Lx), Machine Learning (ML), Deep Learning (DL), Statistical (St), Probabilistic (Pr), Fuzziness (Fz), Rule (Rl), Graph (Gr), Ontology (On), Hybrid (Hy) –a combination of more than one technique, Manual (Mn) and Other (Ot). Table 5 provides the yearly statistics for all the respective approaches adopted.

Table 5

Approaches used in the studies analysed

Year	Lx	ML	DL	St	Pr	Fz	Rl	Gr	On	Hy	Mn	Ot
2007	0	0	0	0	0	0	0	0	0	0	0	0
2008	0	0	0	0	0	0	0	0	0	0	0	0
2009	0	1	0	0	0	0	0	0	0	1	0	0
2010	2	2	0	0	0	0	0	0	0	3	0	0
2011	2	3	1	0	0	0	0	0	0	7	1	0
2012	6	5	0	0	0	0	0	1	0	10	0	1
2013	6	14	2	1	1	0	2	0	1	21	0	0
2014	14	20	2	1	3	1	1	0	1	41	0	3
2015	16	15	4	1	1	0	0	1	0	42	0	0
2016	13	21	4	3	0	0	0	0	0	38	2	4
2017	20	22	9	2	1	1	0	0	0	50	2	5
2018	17	18	13	1	0	0	1	2	0	69	1	4
Total	96	121	35	9	6	2	4	4	2	282	6	17

From the studies analysed, 88 developed and used more than 1 technique within their respective studies. These techniques include the ones originally used in their approach and/or ones used for comparison/baseline/experimentation purposes. In particular, from these 88 studies, 65 used 2 techniques each, 17 studies used 3 techniques, 4 studies used 4 techniques, and 2 studies made use of 5 techniques, which totals to 584 techniques used across all studies (including the studies that used 1 technique). The results show that a hybrid approach is the most popular one, with over half of the studies adopting such an approach. This is followed by Machine Learning and Lexicon techniques, which are usually chosen to perform any form of opinion mining. These results are explained in more detail in the sub-sections below.

3.2.1 Lexicon

In total 94 unique studies adopted a lexicon-based approach to perform a form of SOM, which produced a total of 96 different techniques³⁹. The majority of the lexicons used were specifically related to opinions and are well known in this domain, whereas the others that were not can still be used for conducting opinion mining.

Table 6

Lexicon-based studies

Number of lexicons	1	2	3	4	6	7	8	Other/NA
Number of studies	39	19	10	4	1	1	1	21
References of studies	Rathan et al. (2018), Li and Fleyeh (2018), Geetha et al. (2018), Chen (2018), Aoudi and Malik (2018), Poortvliet and Wang (2018), Salari et al. (2018), Hubert et al. (2018), Ray and Chakrabarti (2017), Gupta and Joshi (2017), Zhang et al. (2017), Arslan et al. (2017), Hagge et al. (2017), Ozer et al. (2017), Ishikawa and Sakurai (2017), Gallegos et al. (2016), Polymerou et al. (2014), Paltoglou and Thelwall (2012), Santarcangelo et al. (2015), Jurek et al. (2014), Philander and YunYing (2016), Walha et al. (2016), Pääkkönen (2016), Gao et al. (2016), Munezero et al. (2015), Hridoy et al. (2015), Jiang et al. (2015), Wang and Wu (2015), Hagen et al. (2015), Lu (2015), Lu et al. (2015), Varshney and Gupta (2014), Ou et al. (2014), Mostafa (2013a), Mostafa (2013b), Lek and Poo (2013), Tumasjan et al. (2010), Raja and Swamynathan (2016) and Lau et al. (2014)	Singh et al. (2018), Pollacci et al. (2017), Abdullah and Hadzikadic (2017), Joyce and Deng (2017), Husnain et al. (2017), Shi et al. (2017), Khuc et al. (2012), Porshnev et al. (2013), Li et al. (2017), Mukherjee and Mukherjee (2017), Chou et al. (2017), Frankenstein et al. (2016), Giachanou et al. (2016), Feng et al. (2015), Hagen et al. (2015), Del Bosque and Garza (2014), Zhou et al. (2014), Souza and Vieira (2012) and Asiaee et al. (2012)	Bansal and Srivastava (2018), Ghiassi and Lee (2018), Tasoulis et al. (2018), Wu et al. (2016), Li et al. (2016), Pandarachalil et al. (2015), Molina-González et al. (2014), Andriotis et al. (2014), Javed et al. (2014) and Chen et al. (2012)	Boididou et al. (2018), Su and Wang (2017), Bandhakavi et al. (2016) and Saif et al. (2014)	Gonçalves et al. (2013)	Årup Nielsen (2011)	Erdmann et al. (2014)	Rathan et al. (2018), Ranjan et al. (2018), Nausheen and Begum (2018), Wang et al. (2018), Vo et al. (2017), Yan et al. (2017), Radhika and Sankar (2017), Hu et al. (2017), Kaushik and Dey (2016), Akcora et al. (2010), Gupta and Kohli (2016), Lai et al. (2015), Dasgupta et al. (2015), Sarlan et al. (2014), Parthasarathi et al. (2012), Tian et al. (2015), Song et al. (2015), Choi and Kim (2013), Park et al. (2011), Costa et al. (2014) and Yanmei and Yuda (2015)

Table 6 presents the number of lexicons (first row and columns titled 1–8) used by the lexicon-based studies (second row). The column titled “Other/NA” refers to any other general lexicon, which does not list general lexicons mentioned in the studies such as acronym dictionaries, intensifier words⁴⁰, downtoner words,⁴¹ negation words and internet slang, and/or to studies which do not provide any information on the exact lexicons used.

The majority of the lexicon-based studies used one or two lexicons, where a total of 144 state-of-the-art lexicons (55 unique ones) were used across. The following are the top six lexicons based on use:

SentiWordNet⁴² (Baccianella et al. 2010)—used in 22 studies;

Hu and Liu⁴³ (Hu and Liu 2004)—used in 12 studies;

AFINN⁴⁴ (Årup Nielsen 2011) and SentiStrength⁴⁵ (Thelwall et al. 2012)—used in 9 studies;

MPQA—Subjectivity⁴⁶ (Wilson et al. 2005)—used in 8 studies;

HowNet Sentiment Analysis Word Library (HowNetSenti)⁴⁷—used in 6 studies;

NRC Word-Emotion Association Lexicon (also known as NRC Emotion Lexicon or EmoLex)⁴⁸ (Mohammad and Turney 2010, 2013), WordNet⁴⁹ (Miller 1995) and Wikipedia—list of emoticons⁵⁰—used in 5 studies.

In addition to the lexicons mentioned above, 19 studies used lexicons that they created as part of their work or specifically focused on creating SOM lexicons, such as (Årup Nielsen 2011) who created the AFINN word list for sentiment analysis in microblogs, (Javed et al. 2014) who built a bilingual sentiment lexicon for English and Roman Urdu, (Santarcangelo et al. 2015) the creators of the first Italian sentiment thesaurus, (Wu et al. 2016) for Chinese sentiment analysis and (Bandhakavi et al. 2016) for sentiment analysis on Twitter. These lexicons varied from social media focused lexicons (Tian et al. 2015; Ghiassi and Lee 2018; Pollacci et al. 2017), to sentiment and/or emoticon lexicons (Jurek et al. 2014; Molina-González et al. 2014; Khuc et al. 2012; Ranjan et al. 2018; Vo et al. 2017; Feng et al. 2015; Wang and Wu 2015; Zhou et al. 2014) and extensions of existing state-of-the-art lexicons (Li et al. 2016; Pandarachalil et al. 2015; Andriotis et al. 2014), such as (Li et al. 2016) who extended HowNetSenti with words manually collected from the internet, and (Pandarachalil et al. 2015) who built a sentiment lexicon from SenticNet⁵¹ (Cambria et al. 2020) and SentiWordNet for slang words and acronyms.

3.2.2 Machine learning

A total of 121 studies adopted a machine learning-based approach to perform a form of SOM, where several supervised and unsupervised algorithms were used. Table 7 below presents the number of machine learning algorithms (first row and columns titled 1–7) used by the machine learning-based studies (second row). The column titled “NA” refers to studies who do not provide any information on the exact algorithms used.

Table 7

Machine learning-based studies

Number of machine learning algorithms	1	2	3	4	5	6	7	NA
Number of studies	59	23	18	9	5	2	1	4
References of studies	Fatyanosa et al. (2018), dos Santos et al. (2018), Rout et al. (2018), Liu et al. (2018), Katz et al. (2018), Huang et al. (2018), Halibas et al. (2018), Ignatov and Ignatov (2017), Vo et al. (2017), Omar et al. (2017), Ducange and Fazzolari (2017), Joyce and Deng (2017), Soni et al. (2017), Radhika and Sankar (2017), Wehrmann et al. (2017), Song and Gruzd (2017), Li et al. (2017), Sygkounas et al. (2016), Kumar and Bala (2016), Balaji et al. (2016), Severyn et al. (2016), Singh and Kumari (2016), Abdelrazeq et al. (2016), Wang et al. (2016), Nagiwale and Umale (2015), Smailović et al. (2015), Attigeri et al. (2015), D’Avanzo and Pilato (2015), Kokkinogenis et al. (2015), Seron et al. (2015), Lu (2015), Wagner et al. (2015), Liu et al. (2015), Sluban et al. (2015), Lu et al. (2015), Guerra et al. (2014), Du et al. (2014), Yan et al. (2014), Lau et al. (2014), Batista and Ratté (2014), Rao and Srivastava (2014), Tapia and Velásquez (2014), Molina-González et al. (2014), Abdul-Mageed et al. (2014), Li et al. (2014), Ghiassi et al. (2013), Porshnev et al. (2013), Hoang et al. (2013), Kranjc et al. (2013), Gonçalves et al. (2013), Wunnasri et al. (2013), Yu et al. (2013), Weiss et al. (2013), Xiong et al. (2013), Wang et al. (2012), Xie et al. (2012), Saif et al. (2012), Bifet et al. (2011), Bollen et al. (2011)	Zhang et al. 2018, Moh et al. (2017), Nugroho et al. (2017), Gupta and Singal (2017), Shi et al. (2017), Hao et al. (2017), Wang et al. (2016), Suresh (2016), Peng et al. (2016), Ramteke et al. (2016), Shyamasundar and Rani (2016), de Souza Carvalho et al. (2016), Lu et al. (2016), Zhang et al. (2015), Esiyok and Albayrak (2015), Wang et al. (2014), Filice et al. (2014), Garg and Chatterjee (2014), Ou et al. (2014), Politopoulou and Maragoudakis (2013), Mejova et al. (2013), Wang and Ye (2013) and Li and Li (2013)	Ismail et al. (2018), Baltas et al. (2017), Yan et al. (2017), Vora and Chacko (2017), Sun et al. (2017), Balikas et al. (2017), Anastasia and Budi (2016), Li et al. (2016), Khalil et al. (2015), Anjaria and Guddeti (2014), Zimmermann et al. (2014), Le et al. (2014), Neethu and Rajasree (2013), Paltoglou and Thelwall (2012), Zhang et al. (2011), Bifet and Frank (2010), Pak and Paroubek (2010), Go et al. (2009)	Pant et al. 2018, Wazery et al. (2018), Adibi et al. (2018), Michailidis et al. (2018), Dritsas et al. (2018), Troussas et al. (2016), Krouska et al. (2016), Rexha et al. (2016), Zhang et al. (2014)	Xiaomei et al. 2018, Sánchez-Holgado and Arcila-Calderón (2018), Pavel e al. (2017), Baecchi et al. (2016) and Asiaee et al. (2012)	Çeliktuğ (2018) and Raja and Swamynathan (2016)	Juneja and Ojha (2017)	Rezk et al. (2018), Weiss et al. (2015), Brooks et al. (2014) and Sheth et al. (2014)

In total, 239 machine learning algorithms were used (not distinct) across 117 studies (since 4 studies did not provide any information), with 235 being supervised and 4 unsupervised. It is important to note that this figure does not include any supervised/semi-supervised/unsupervised proposed algorithms by the respective authors, which algorithms shall be discussed below.

Table 8

Supervised machine learning algorithms

Algorithm	Number of studies	References
Naïve Bayes (NB)	75	Lewis (1998)
Support Vector Machine (SVM)	71	Cortes and Vapnik (1995)
Logistic Regression (LoR)	16	McCullagh (1984)
Decision Tree (DT)	15	Quinlan (1986)
Maximum Entropy (MaxEnt)	12	Jaynes (1957)
Random Forest (RF)	9	Breiman (2001)
K-Nearest Neighbors (KNN)	7	Altman (1992)
SentiStrength	5	Thelwall et al. (2012)
Conditional Random Field	4	Lafferty et al. (2001)
Linear Regression (LiR)	4	Cook (1977)
SANT optimization algorithm (SANT)	3	Hu et al. (2013)
Stochastic Gradient Descent (SGD)	3	Bottou (2010)
Passive Aggressive (PA)	2	Crammer et al. (2006)
Bootstrap Aggregating (Bagging)	1	Breiman (1996)
Bayesian Network (BN)	1	Heckerman et al. (1995)
Conjunctive Rule Based (CRB)	1	Clark and Niblett (1989)
Adaptive Boosting (AB)	1	Freund et al. (1999)
Hidden Markov Model (HMM)	1	Baum and Petrie (1966)
Dictionary Learning	1	Ramirez et al. (2010)
SVM with NB features (NBSVM)	1	Wang and Manning (2012)
Multiclass Classifier (MCC)	1	Witten et al. (2016)
Iterative Classifier Optimizer (ICO)	1	Witten et al. (2016)

Table 8 provides breakdown of the 235 supervised machine learning algorithms (not distinct) that were used within these studies. The NB and SVM algorithms are clearly the most popular in this domain, especially for text classification. With respect to the former, it is important to note that 20 out of the 75 studies used the Multinomial NB (MNB), which model is usually utilised for discrete counts i.e., the number of times a given term (word or token) appears in a document. The other 55 studies made use of the Multi-variate Bernoulli NB (MBNB) model, which is based on binary data, where every token in a feature vector of a document is classified with the value of 0 or 1. As for SVM, this method looks at the given data and sorts it in two categories (binary classification). If multi-class classification is required, the Support Vector Classification (SVC)⁵², NuSVC⁵³ or LinearSVC⁵⁴ algorithms are usually applied, where the “one-against-one” approach is implemented for SVC and NuSVC, whereas the “one-vs-the-rest” multi-class strategy is implemented for LinearSVC.

The LoR statistical technique is also widely used in machine learning for binary classification problems. In total, 16 studies from the ones analysed, made use of this algorithm. DT learning has also been very much in use, which model uses a DT for both classification and regression problems. There are various algorithms in how a DT is built, with 2 studies using the C4.5 (Quinlan 1993)—an extension of Quinlan’s Iterative Dichotomiser 3 (ID3) algorithm, used for classification purposes, 3 studies using J48, a simple C4.5 DT for classification (Weka’s implementation⁵⁵), 2 using the Hoeffding Tree (Hulten et al. 2001) and the other 8 using the basic ID3 algorithm.

MaxEnt, used by 12 studies, is a probabilistic classifier that is also used for text classification problems, such as sentiment analysis. More specifically, it is generalisation of LoR for multi-class scenarios (Yu et al. 2011). RF was used in 9 studies, where this supervised learning algorithm –which can be used for both classification and regression tasks– creates a forest (which is an ensemble of DTs) and makes it somehow random. Moreover, 7 studies used the KNN algorithm, one of the simplest classification algorithms where no learning is required, since the model structure is determined from the entire dataset.

The SentiStrength algorithm, utilised by 5 studies (Gonçalves et al. 2013; Lu et al. 2015; Baecchi et al. 2016; Yan et al. 2017; Zhang et al. 2018), can be used in both supervised and unsupervised cases, since the authors developed a version for each learning case. Conditional Random Fields, used by 4 studies (Pak and Paroubek 2010; Zhang et al. 2014; Wang et al. 2016; Hao et al. 2017), are a type of discriminative classifier that model the decision boundary amongst different classes, whereas LiR was also used by 4 studies (Bollen et al. 2011; Pavel e al. 2017; Adibi et al. 2018; Xiaomei et al. 2018). Moreover, 3 studies each used the SANT (Ou et al. 2014; Lu 2015; Xiaomei et al. 2018) and SGD (Bifet and Frank 2010; Juneja and Ojha 2017; Sánchez-Holgado and Arcila-Calderón 2018) algorithms, with the former being mostly used for comparison purposes to the proposed approaches by the respective authors.

In addition, the PA algorithm was used in 2 studies (Li et al. 2014; Filice et al. 2014). In the case of the former (Li et al. 2014), this algorithm was used in a collaborative online learning framework to automatically classify whether a post is emotional or not, thereby overcoming challenges faced by the diversity of microblogging styles which increase the difficulty of classification. The authors in the latter study (Filice et al. 2014) extend the budgeted PA algorithm to enable robust and efficient natural language learning processes based on semantic kernels. The proposed online learning learner was applied to two real world linguistic tasks, one of which was sentiment analysis.

Nine other algorithms were used by 7 different studies, namely: Bagging (Sygkounas et al. 2016), BN (Lu et al. 2016), CRB (Raja and Swamynathan 2016), AB (Raja and Swamynathan 2016), HMM (Zhang et al. 2014), Dictionary Learning (Asiaee et al. 2012), NBSVM (Sun et al. 2017), MCC (Çeliktuğ 2018) and ICO (Çeliktuğ 2018).

In terms of unsupervised machine learning algorithms, 4 were used in 2 of the 80 studies that used a machine learning-based technique. Suresh and Raj S. used the K-Means (KM) (Lloyd 1982) and Expectation Maximization (Dempster et al. 1977) clustering algorithms in Suresh (2016). Both were used for comparison purposes to an unsupervised modified fuzzy clustering algorithm proposed by authors. The proposed algorithm produced accurate results without manual processing, linguistic knowledge or training time, which concepts are required for supervised approaches. Baecchi et al. (Baecchi et al. 2016) used two unsupervised algorithms, namely Continuous Bag-Of-Word (CBOW) (Mikolov et al. 2013) and Denoising Autoencoder (DA) (Vincent et al. 2008) (the SGD and backpropagation algorithms were used for the DA learning process), and supervised ones, namely LoR, SVM and SentiStrength, for constructing their method and comparison purposes. They considered both textual and visual information in their work on sentiment analysis of social network multimedia. Their proposed unified model (CBOW-DA-LoR) works in both an unsupervised and semi-supervised manner, whereby learning text and image representation and also the sentiment polarity classifier for tweets containing images.

Other studies proposed their own algorithms, with some of the already established algorithms discussed above playing an important role in their implementation and/or comparison. Zimmermann et al. proposed a semi-supervised algorithm, the S*3Learner (Zimmermann et al. 2014) which suits changing opinion stream classification environments, where the vector of words evolves over time, with new words appearing and old words disappearing. Severyn et al. (2016) defined a novel and efficient tree kernel function, the Shallow syntactic Tree Kernel, for multi-class supervised sentiment classification of online comments. This study focused on YouTube which is multilingual, multimodal, multidomain and multicultural, with the aim to find whether the polarity of a comment is directed towards the source video, product described in the video or another product. Furthermore, Ignatov and Ignatov (2017) presented a novel DT-based algorithm, a Decision Stream, where Twitter sentiment analysis was one of several common machine learning problems that it was evaluated on. Lastly, Fatyanosa et al. (2018) enhanced the ability of the NB classifier with an optimisation algorithm, the Variable Length Chromosome Genetic Algorithm (VLCGA), thereby proposing VLCGA-NB for Twitter sentiment analysis.

Moreover, the following 13 studies proposed an ensemble method or evaluated ensemble-based classifiers:

Çeliktuğ (2018) used two ensemble learning methods in RF and MCC (amongst other machine learning algorithms), for sentiment classification of Twitter datasets;
Yan et al. (2017) presented two ensemble learners built on four off-the-shelf classifiers, for Twitter sentiment classification;
Zhang et al. (2018), Adibi et al. (2018), Çeliktuğ (2018), Vora and Chacko (2017), Lu et al. (2016), Rexha et al. (2016), Xie et al. (2012) and Zhang et al. (2011) used the RF ensemble learning method in their work;
Troussas et al. (2016) evaluated the most common ensemble methods that can be used for sentiment analysis on Twitter datasets;
Sygkounas et al. (2016) proposed an ensemble system composed on five state-of-the-art sentiment classifiers;
Le et al. (2014) used multiple oblique decision stumps classifiers to form an ensemble of classifiers, which is more accurate than a single one for classifying tweets;
Neethu and Rajasree (2013) used an ensemble classifier (and single algorithm classifiers) for sentiment classification.

Ensembles created usually result in providing more accurate classification answers when compared to individual classifiers, i.e., classic learning approaches. In addition, ensembles reduce the overall risk of choosing a wrong classifier especially when applying it on a new dataset (Da Silva et al. 2014).

3.2.3 Deep learning

Deep learning is a subset of machine learning based on Artificial Neural Networks (ANNs) –algorithms inspired by the human brain– where there are connections, layers and neurons for data to propagate. A total of 35 studies adopted a deep learning-based approach to perform a form of SOM, where supervised and unsupervised algorithms were used. Twenty six (26) of the studies made use of 1 deep learning algorithm, with 5 utilising 2 algorithms and 2 studies each using 3 and 4 algorithms, respectively. Table 9 provides breakdown of the 50 deep learning algorithms (not distinct) used within these studies.

Table 9

Deep learning algorithms

Algorithm	Number of studies	References
Long Short-Term Memory (LSTM)	13	Hinton et al. (2012)
Convolutional Neural Network (CNN)	12	LeCun et al. (1990)
Recurrent Neural Network (RNN)	8	Graves and Schmidhuber (2005)
ANN	5	McCulloch and Pitts (1943)
Recursive Neural Tensor Network (RNTN)	3	Socher et al. (2013)
Bidirectional Long Short-Term Memory (BLSTM)	3	Graves and Schmidhuber (2005)
Multilayer Perceptron (MLP)	2	Hornik et al. (1989)
Autoencoder (AE)	2	Rumelhart et al. (1985)
Gated Recurrent Units (GRU)	1	Greff et al. (2017)
Dynamic Architecture for ANN (DAN2)	1	Ghiassi and Saidane (2005)

LSTM, a prominent variation of the RNN which makes it easier to remember past data in memory, was used in 13 studies (Yan et al. 2018; Sun et al. 2018; Sanyal et al. 2018; Ameur et al. 2018; Wazery et al. 2018; Li et al. 2018; Chen and Wang 2018; Chen et al. 2018; Sun et al. 2017; Hu et al. 2017; Shi et al. 2017; Wang et al. 2016; Yan and Tao 2016), thus making it the most popular deep learning algorithm amongst the evaluated studies. Three further studies (Ameur et al. 2018; Balikas et al. 2017; Wang et al. 2016) used the BLSTM, an extension of the traditional LSTM which can improve model performance on sequence classification problems. In particular, a BLSTM was used in Balikas et al. (2017) to improve the performance of fine-grained sentiment classification, which approach can benefit sentiment expressed in different textual types (e.g., tweets and paragraphs), in different languages and different granularity levels (e.g., binary and ternary). Similarly, Wang et al. (2016) proposed a language-independent method based on BLSTM models for incorporating preceding microblogs for context-aware Chinese sentiment classification.

The CNN algorithm –a variant of ANN– is made up of neurons that have learnable weights and biases, where each neuron receives an input, performs a dot product and optionally follows it with non-linearity. In total, 12 studies (Sun et al. 2018; Ochoa-Luna and Ari 2018; Ameur et al. 2018; Adibi et al. 2018; Chen and Wang 2018; Napitu et al. 2017; Shi et al. 2017; Wehrmann et al. 2017; Zhang et al. 2017; Stojanovski et al. 2015; Wang et al. 2016; Severyn and Moschitti 2015) made use of this algorithm. Notably, Wehrmann et al. (2017) propose a language-agnostic translation-free method for Twitter sentiment analysis.

RNNs, a powerful set of ANNs useful for processing and recognising patterns in sequential data such as natural language, were used in 8 studies (Yan et al. 2018; Ochoa-Luna and Ari 2018; Piñeiro-Chousa et al. 2018; Wazery et al. 2018; Pavel e al. 2017; Shi et al. 2017; Yan and Tao 2016; Wang et al. 2016). One study in particular (Averchenkov et al. 2015), considered a novel approach to aspect-based sentiment analysis of Russian social networks based on RNNs, where the best results were obtained by using a special network modification, the RNTN. Two further studies (Lu et al. 2015; Sygkounas et al. 2016) also used this algorithm (RNTN) in their work.

Five other studies (Arslan et al. 2018; Anjaria and Guddeti 2014; Du et al. 2014; Politopoulou and Maragoudakis 2013; Zhang et al. 2011) used a simple type of ANN, such as the feedforward neural network. Moreover, the MLP, a class of feedforward ANN, was used in 2 studies (Chen and Zheng 2018; Ramadhani and Goo 2017). Similarly, 2 studies (Yan et al. 2018; Ameur et al. 2018) proposed methods based on the AE unsupervised learning algorithm which is used for representation learning. Lastly, one study each made use of the GRU (Wang et al. 2016) and DAN2 (Ghiassi et al. 2013) algorithms.

Some studies used several types of ANNs in their work. Ameur et al. (2018) used multiple methods based on AE, CNN, LSTM and BLSTM for sentiment polarity classification and Wang et al. (2016) used RNN, LSTM, BLSTM and GRUs models. Yan et al. (2018) used learning methods based on RNN, LSTM and AE for comparison with the proposed learning framework for short text classification, and Shi et al. (2017) proposed an improved LSTM which considers user-based and content-based features and used CNN, LSTM and RNN models for comparison purposes. Furthermore, Ochoa-Luna and Ari (2018) made use of CNN and RNN deep learning algorithms for tweet sentiment analysis, Wazery et al. (2018) and Yan and Tao (2016) used the RNN and LSTM, whereas Sun et al. (2018) and Chen and Wang (2018) proposed new models based on CNN and LSTM.

3.2.4 Statistical

A total of 9 studies (Wang et al. 2018; Kitaoka and Hasuike 2017; Arslan et al. 2017; Raja and Swamynathan 2016; Yang et al. 2014; Bukhari et al. 2016; Zhang et al. 2015; Karpowicz et al. 2013; Supriya et al. 2016) adopted a statistical approach to perform a form of SOM. In particular, one of the approaches proposed in Arslan et al. (2017) uses the term frequency-inverse document frequency (tf-idf) (Salton and McGill 1986) numerical statistic to find out the important words within a tweet, to dynamically enrich Twitter specific dictionaries created by the authors. The tf-idf is also one of several statistical-based techniques used in Wang et al. (2018) for comparing the proposed novel feature weighting approach for Twitter sentiment analysis. Moreover, Raja and Swamynathan (2016) focuses on a statistical sentiment score calculation technique based on adjectives, whereas Yang et al. (2014) use a variation of the point-wise mutual information to measure the opinion polarity of an entity and its competitors, which method is different from the traditional opinion mining way.

3.2.5 Probabilistic

A total of 6 studies (Bhattacharya and Banerjee 2017; Baecchi et al. 2016; Ou et al. 2014; Ragavi and Usharani 2014; Yan et al. 2014; Lek and Poo 2013) adopted a probabilistic approach to perform a form of SOM. In particular, Ou et al. (2014) propose a novel probabilistic model in the Content and Link Unsupervised Sentiment Model, where the focus is on microblog sentiment classification incorporating link information, namely behaviour, same user and friend.

3.2.6 Fuzziness

Two studies (D’Asaro et al. 2017; Del Bosque and Garza 2014) adopted a fuzzy-based approach to perform a form of SOM. D’Asaro et al. (2017) present a sentiment evaluation and analysis system based on fuzzy linguistic textual analysis. Del Bosque and Garza (2014) assume that aggressive text detection is a sub-task of sentiment analysis, which is closely related to document polarity detection given that aggressive text can be seen as intrinsically negative. This approach considers the document’s length and the number of swear words as inputs, with the output being an aggressiveness value between 0 and 1.

3.2.7 Rule-based

In total, 4 studies (El Haddaoui et al. 2018; Zhang et al. 2014; Min et al. 2013; Bosco et al. 2013) adopted a rule-based approach to perform a form of SOM. Notably, Bosco et al. (2013) applied an approach for automatic emotion annotation of ironic tweets. This relies on sentiment lexicons (words and expressions) and sentiment grammar expressed by compositional rules.

3.2.8 Graph

Four studies (Dritsas et al. 2018; Vilarinho and Ruiz 2018; Chen et al. 2015; Rabelo et al. 2012) adopted a graph-based approach to perform a form of SOM. The study in Vilarinho and Ruiz (2018) presents a word graph-based method for Twitter sentiment analysis using global centrality metrics over graphs to evaluate sentiment polarity. In Dritsas et al. (2018), a graph-based method is proposed for sentiment classification at a hashtag level. Moreover, the authors in Chen et al. (2015) compare their proposed multimodal hypergraph-based microblog sentiment prediction approach with a combined hypergraph-based method (Huang et al. 2010). Lastly, Rabelo et al. (2012) used link mining techniques to infer the opinions of users.

3.2.9 Ontology

Two studies (Lau et al. 2014; Kontopoulos et al. 2013) adopted an ontology-based approach to perform a form of SOM. In particular, the technique developed in Kontopoulos et al. (2013) performs more fine-grained sentiment analysis of tweets where each subject within the tweets is broken down into a set of aspects, with each one being assigned a sentiment score.

3.2.10 Hybrid

Hybrid approaches are very much in demand for performing different opinion mining tasks, where 244 unique studies (out of 465) adopted this approach and produced a total of 282 different techniques⁵⁶.

Tables 10 and 11 lists these studies, together with the type of techniques used for each. In total, there were 38 different hybrid approaches across the analysed studies.

Table 10

Studies adopting a hybrid approach consisting of two techniques

Lx	ML	DL	St	Pr	Fz	Rl	Gr	On	Total	Studies
✓	✓								114	Zhang et al. (2018), Yan et al. (2018), Pollacci et al. (2017), Jin et al. (2017), Hong and Sinnott (2018), Calvo and Juárez Gambino (2018), Rathan et al. (2018), Saleena (2018), Yan et al. (2017), Katiyar et al. (2018), Gandhe et al. (2018), Al Shammari (2018), Pai and Liu (2018), Goel et al. (2018), Sahni et al. (2017), Ahuja and Dubey (2017), Fatyanosa and Bachtiar (2017), Singh et al. (2018), Abdullah and Zolkepli (2017), Lee and Nerghes (2017), Bouchlaghem et al. (2016), Sharma et al. (2016), Sankaranarayanan et al. (2016), Kanavos et al. (2016), Gallegos et al. (2016), Koto and Adriani (2015), Buscaldi and Hernandez-Farias (2015), Tsytsarau et al. (2014), Yuan et al. (2014), Bravo-Marquez et al. (2013), Zhang et al. (2013), Xu et al. (2012), Jianqiang and Xiaolin (2017), Qaisi and Aljarah (2016), Zimbra et al. (2016), Jianqiang (2016), You and Tunçer (2016), Bravo-Marquez et al. (2016), Zhao et al. (2016), Li et al. (2016), Deshwal and Sharma (2016), Jianqiang and Xueliang (2015), Chen et al. (2015), Li et al. (2016), Fersini et al. (2015), Abdelwahab et al. (2015), Yang et al. (2015), Yang and Zhou (2015), Chen et al. (2015), Kanakaraj and Guddeti (2015), Jianqiang (2015), Koto and Adriani (2015), Wu et al. (2015), Shukri et al. (2015), Sahu et al. (2015), Lewenberg et al. (2015), Cho et al. (2014), Sui et al. (2012), Karyotis et al. (2017), Lim et al. (2017), Pandey et al. (2017), Burnap et al. (2016), Lima et al. (2015), Poria et al. (2014), Bravo-Marquez et al. (2014), Da Silva et al. (2014), Gambino and Calvo (2016), Yan and Tao (2016), Jiang et al. (2015), Nguyen (2016), Aboluwarin et al. (2016), Zainuddin et al. (2016), Flaes et al. (2016), Koto and Adriani (2015), Hagen et al. (2015), Castellucci et al. (2015), Sanborn et al. (2015), Castellucci et al. (2015), Chen et al. (2015), Mansour et al. (2015), Del Bosque and Garza (2014), Han and Kavuluru (2015), Yuan et al. (2015), Buddhitha and Inkpen (2015), Ji et al. (2015), Zhou et al. (2014), Wang et al. (2014), Tsakalidis et al. (2014), Porshnev and Redkin (2014), Su et al. (2014), Yan et al. (2014), Gonçalves et al. (2013), Porshnev et al. (2014), Sun et al. (2014), Pla and Hurtado (2014), Wang and Li (2014), Bao et al. (2014), Zhu et al. (2013), Jiang et al. (2013), Cui et al. (2013), Khuc et al. (2012), Bermingham and Smeaton (2010), Wang et al. (2012), Montejo-Raez et al. (2014), Ortigosa et al. (2014), Rui et al. (2013), Reyes et al. (2013), Kouloumpis et al. (2011), Bakliwal et al. (2013), Vu et al. (2012), Agarwal et al. (2011), Hernandez-Farias et al. (2014), Thelwall et al. (2011) and Thelwall et al. (2010)
✓		✓							12	Jin et al. (2017), Baccouche et al. (2018), Karyotis et al. (2017), Zimbra et al. (2016), Er et al. (2016), Koto and Adriani (2015), Bravo-Marquez et al. (2014), Del Bosque and Garza (2014), Porshnev and Redkin (2014), Porshnev et al. (2014), Tang et al. (2013), Thelwall et al. (2010)
✓			✓						22	Zhang et al. (2018), Wan et al. (2018), Sangameswar et al. (2017), Rout et al. (2018), Rout et al. (2017), Bansal and Srivastava (2018), Satapathy et al. (2017), Tago and Jin (2018), Fatyanosa and Bachtiar (2017), Sachdeva et al. (2018), Zhou et al. (2017), Le et al. (2017), Azzouza et al. (2017), Giachanou and Crestani (2016), Gao et al. (2016), Lu et al. (2016), Orellana-Rodriguez et al. (2015), Tan et al. (2014), Khan et al. (2014), Orellana-Rodriguez et al. (2013), Blenn et al. (2012) and Zhang et al. (2012)
✓				✓					3	Huang et al. (2015), Yang et al. (2013) and Lek and Poo (2013)
✓					✓				4	Ismail et al. (2018), Cotfas et al. (2017), Kao and Huang (2018) and Dragoni (2018)
✓						✓			9	Dambhare and Karale (2017), Chen et al. (2018), Kamyab et al. (2018), Mishra and Diesner (2018), Gambino and Calvo (2016), Wang et al. (2014), Saif et al. (2014), Wang and Ye (2013) and Maynard and Funk (2011)
✓							✓		4	Chen et al. (2015), Rill et al. (2014), Bliss et al. (2012) and Cui et al. (2011)
✓								✓	2	Cotfas et al. (2015) and Delcea et al. (2014)
	✓	✓							7	Yan et al. (2018), Stojanovski et al. (2018), Ameur et al. (2018), Sun et al. (2017), Prusa et al. (2015), Yanmei and Yuda (2015) and Cai and Xia (2015)
	✓		✓						21	Wang et al. (2018), Saidani et al. (2017), Sabuj et al. (2017), Ismail et al. (2018), Hanafy et al. (2018), Elouardighi et al. (2017), Effrosynidis et al. (2017), Ameur et al. (2018), Symeonidis et al. (2018), Rezaei and Jalali (2017), Elzayady et al. (2018), Rinaldi and Musdholifah (2017), Coyne et al. (2017), Setiawan et al. (2018), Dedhia and Ramteke (2017), Alzahrani (2018), Elbagir and Yang (2018), Mishra and Diesner (2018), Ramadhani et al. (2016), Trung et al. (2013) and Taddy (2013)
	✓			✓					3	Vo et al. (2017), Sihwi et al. (2018) and Caschera et al. (2016)
	✓					✓			2	Mumu and Ezeife (2014) and Zhang et al. (2013)
	✓						✓		3	Lu (2015), Ou et al. (2014) and Lek and Poo (2013)
		✓	✓						3	Hanafy et al. (2018), Symeonidis et al. (2018) and Coyne et al. (2017)
		✓		✓					1	Haldenwang et al. (2018)
				✓	✓				1	Mukkamala et al. (2014)
					✓	✓			1	Karyotis et al. (2017)
					✓		✓		1	Mukkamala et al. (2014)

Table 11

Studies adopting a hybrid approach consisting of three and four techniques

Lx	ML	DL	St	Pr	Fz	Rl	Gr	On	Total	Studies
✓	✓	✓							3	Cao et al. (2018), Hassan et al. (2013) and Kalayeh et al. (2015)
✓	✓		✓						21	Vo et al. (2017), Villegas et al. (2018), Konate and Du (2018), Giachanou et al. (2017), Alharbi and DeDoncker (2017), Saleena (2018), Ghiassi and Lee (2018), Tellez et al. (2017), Simões et al. (2017), Lavanya and Deisy (2017), Li and Fleyeh (2018), Permatasari et al. (2018), Fitri et al. (2018), Bouazizi and Ohtsuki (2018), Rai et al. (2018), Wijayanti and Arisal (2017), Xia et al. (2017), Jianqiang et al. (2018), Bouazizi and Ohtsuki (2017), Chen et al. (2015) and Pei et al. (2014)
✓	✓			✓					2	Ortis et al. (2018) and Lek and Poo (2013)
✓	✓					✓			12	Moh et al. (2017), Siddiqua et al. (2016), Liu and Young (2016), Zainuddin et al. (2016), Poria et al. (2016), Souza et al. (2016), Chikersal et al. (2015), Shi et al. (2013), Maeda et al. (2012), Li and Xu (2014), Prabowo and Thelwall (2009) and Thelwall et al. (2012)
✓		✓	✓						7	Villegas et al. (2018), Liu et al. (2018), Konate and Du (2018), Ghosal et al. (2018), Ghiassi and Lee (2018), Li and Fleyeh (2018) and Jianqiang et al. (2018)
✓		✓				✓			2	Saini et al. (2018) and Poria et al. (2016)
✓	✓						✓		2	Weichselbraun et al. (2017) and Jiang et al. (2011)
✓	✓							✓	1	Ji et al. (2016)
✓			✓	✓					1	Zhang et al. (2018)
✓			✓		✓				1	Dragoni (2018)
✓			✓			✓			3	Asghar et al. (2018), Gao et al. (2015) and Unankard et al. (2014)
	✓	✓	✓						2	Hanafy et al. (2018) and Ameur et al. (2018)
	✓		✓	✓					1	Tong et al. (2017)
	✓		✓			✓			1	Samoylov (2014)
	✓		✓				✓		2	Xiaomei et al. (2018) and Tan et al. (2011)
	✓				✓	✓			1	Nivetha et al. (2016)
✓	✓	✓	✓						2	Chen et al. (2017) and Alharbi and DeDoncker (2017)
✓	✓		✓	✓					1	Vo et al. (2017)
✓	✓		✓			✓			3	Zainuddin et al. (2018), Saif et al. (2014) and Korenek and Šimko (2014)
✓	✓					✓	✓		1	Kuo et al. (2016)

The majority of these studies used two different techniques (213 out of 282)—see Table 10—within their hybrid approach, whereas 62 used three and 7 studies used four different techniques –see Table 11.

The Lexicon and Machine Learning-based techniques were mostly used, where they accounted for 40% of the hybrid approaches, followed by Lexicon and Statistical-based (7.8%), Machine Learning and Statistical-based (7.4%), and Lexicon, Machine Learning and Statistical-based (7.4%) techniques.

Moreover, out of the 282 hybrid approaches, 232 used lexicons, 205 used Machine Learning and 39 used Deep Learning. These numbers reflect the importance of these three techniques within the SOM research and development domain. In light of these, a list of lexicons, machine learning and deep learning algorithms used in these studies have been compiled, similar to Sects. 3.2.1, 3.2.2 and 3.2.3 above. The lexicons, machine learning and deep learning algorithms quoted below were either used in the proposed method/s and/or for comparison purposes in the respective studies.

In terms of state-of-the-art lexicons, these total 403 within the studies adopting a hybrid approach. The top ones align with the results obtained from the lexicon-based approaches in Sect. 3.2.1 above. The following are the lexicons used for more than ten times across the hybrid approaches:

SentiWordNet—used in 51 studies;

MPQA—Subjectivity—used in 28 studies;

Hu and Liu—used in 25 studies;

WordNet—used in 24 studies;

AFINN—used in 22 studies;

SentiStrength—used in 21 studies;

HowNetSenti—used in 15 studies;

NRC Word-Emotion Association Lexicon—used in 13 studies;

NRC Hashtag Sentiment Lexicon⁵⁷ (Mohammad et al. 2013)—used in 12 studies;

10.

SenticNet, Sentiment140 Lexicon (also known as NRC Emoticon Lexicon)⁵⁸ (Mohammad et al. 2013), National Taiwan University Sentiment Dictionary (NTUSD) (Ku et al. 2006) and Wikipedia list of emoticons - used 11 studies.

Further to the quoted lexicons, 49 studies used lexicons that they created as part of their work. Some studies composed their lexicons from emoticons/emojis that were extracted from a dataset (Cao et al. 2018; Li and Fleyeh 2018; Azzouza et al. 2017; Zimbra et al. 2016; You and Tunçer 2016; Chen et al. 2015; Porshnev et al. 2014; Cui et al. 2011; Zhang et al. 2012; Vu et al. 2012), combined publicly available emoticon lexicons/lists (Siddiqua et al. 2016) or mapped emoticons to their corresponding polarity (Tellez et al. 2017), and others (Gao et al. 2016; Souza et al. 2016; Su et al. 2014; Yan et al. 2014; Tang et al. 2013; Cui et al. 2011; Zhang et al. 2012; Li and Xu 2014) used seed/feeling/emotional words to establish a microblog typical emotional dictionary. Additionally, some authors constructed or used sentiment lexicons (Zhang et al. 2018; Vo et al. 2017; Rout et al. 2017; Jin et al. 2017; Ismail et al. 2018; Yan et al. 2017; Katiyar et al. 2018; Al Shammari 2018; Abdullah and Zolkepli 2017; Liu and Young 2016; Sahu et al. 2015; Cho et al. 2014; Wang et al. 2014; Chen et al. 2015; Jiang et al. 2013; Cui et al. 2013; Khuc et al. 2012; Montejo-Raez et al. 2014; Rui et al. 2013) some of which are domain or language specific (Konate and Du 2018; Hong and Sinnott 2018; Chen et al. 2017; Zhao et al. 2016; Lu et al. 2016; Zhou et al. 2014; Porshnev and Redkin 2014), others that extend state-of-the-art lexicons (Li et al. 2016, 2016; Koto and Adriani 2015), and some who made them available to the research community (Cotfas et al. 2017; Castellucci et al. 2015) such as the Distributional Polarity Lexicon⁵⁹.

Table 12

Machine learning algorithms used in the studies adopting a hybrid approach

Algorithm	Learning type	Studies #
SVM	Sup	130
NB	Sup	96
LoR	Sup	34
DT	Sup	27
RF	Sup	21
MaxEnt	Sup	15
SentiStrength	Sup/Unsup	13
LiR	Sup	8
KNN	Sup	5
AB	Sup	5
BN	Sup	3
Support Vector Regression (SVR)	Sup	3
SANT	Sup	3
KM	Unsup	2
Repeated Incremental Pruning to Produce Error Reduction (RIPPER)	Sup	2
HMM	Sup	1
Extremely Randomised Trees	Sup	1
Least Median of Squares Regression	Sup	1
Maximum Likelihood Estimation	Sup	1
Hyperpipes	Sup	1
Extreme Learning Machine	Sup	1
Domain Adaptation Machine	Sup	1
Affinity Propagation	Unsup	1
Multinomial inverse regression	Unsup	1
Apriori	Sup/Unsup	1
Distant Supervision	Semi-sup	1
Label Propagation	Semi-sup	1
SGD	Sup	1
NBSVM	Sup	1

Table 12 below presents a list of machine learning algorithms –in total 381 in 197 studies– that were used within the hybrid approaches. The first column indicates the algorithm, the second lists the type of learning algorithm, in terms of Supervised (Sup), Unsupervised (Unsup) and Semi-supervised (Semi-sup), and the last column lists the total number of studies using each respective algorithm. The SVM and NB algorithms were mostly used in supervised learning, which result corresponds to the machine learning-based approaches in Sect. 3.2.2 above. With respect to the latter, 76 studies used the MBNB algorithm, 19 studies the MNB and 1 study the Discriminative MNB. Moreover, the LoR, DT –namely the basic ID3 (10 studies), J48 (5 studies), C4.5 (5 studies), Classification And Regression Tree (3 studies), Reduced Error Pruning (1 study), DT with AB (1 study), McDiarmid Tree (McDiarmid 1989) (1 study) and Hoeffding Tree (1 study) algorithms, RF, MaxEnt and SentiStrength (used in both supervised and unsupervised settings) algorithms were also in various studies. Notably, some additional algorithms from the ones used in the machine learning-based approaches in Sect. 3.2.2 above, were used in a hybrid approach, in particular, SVR (Drucker et al. 1997), Extremely Randomised Trees (Geurts et al. 2006), Least Median of Squares Regression (Rousseeuw 1984), Maximum Likelihood Estimation (Fisher 1925), Hyperpipes (Witten et al. 2016), Extreme Learning Machine (Huang et al. 2006), Domain Adaptation Machine (Duan et al. 2009), RIPPER (Cohen 1995), Affinity Propagation (Frey and Dueck 2007), Multinomial inverse regression (Taddy 2013), Apriori (Agrawal et al. 1994), Distant Supervision (Go et al. 2009) and Label Propagation (Zhu and Ghahramani 2002).

Given that deep learning is a subset of machine learning, the algorithms used within the hybrid approaches are presented below. In total, 36 studies used the following deep learning algorithms:

CNN—used in 16 studies (Yan et al. 2018; Stojanovski et al. 2018; Konate and Du 2018; Hanafy et al. 2018; Haldenwang et al. 2018; Ghosal et al. 2018; Chen et al. 2017; Ameur et al. 2018; Alharbi and DeDoncker 2017; Symeonidis et al. 2018; Saini et al. 2018; Jianqiang et al. 2018; Baccouche et al. 2018; Cai and Xia 2015; Kalayeh et al. 2015; Yanmei and Yuda 2015);
ANN—used in 8 studies (Li and Fleyeh 2018; Karyotis et al. 2017; Poria et al. 2016; Er et al. 2016; Koto and Adriani 2015; Porshnev and Redkin 2014; Porshnev et al. 2014; Hassan et al. 2013);
LSTM—used in 7 studies (Yan et al. 2018; Konate and Du 2018; Hanafy et al. 2018; Ghosal et al. 2018; Ameur et al. 2018; Sun et al. 2017; Baccouche et al. 2018);
MLP—used in 7 studies (Villegas et al. 2018; Ghosal et al. 2018; Coyne et al. 2017; Karyotis et al. 2017; Bravo-Marquez et al. 2014; Del Bosque and Garza 2014; Thelwall et al. 2010);
RNN—used in 4 studies (Yan et al. 2018; Liu et al. 2018; Baccouche et al. 2018; Yanmei and Yuda 2015);
AE—used in 2 studies (Yan et al. 2018; Ameur et al. 2018);
BLSTM—used in 2 studies (Konate and Du 2018; Ameur et al. 2018);
DAN2—used in 2 studies (Ghiassi and Lee 2018; Zimbra et al. 2016);
Deep Belief Network (Hinton and Salakhutdinov 2006), a probabilistic generative model that is composed of multiple layers of stochastic, latent variables—used in 2 studies (Jin et al. 2017; Tang et al. 2013);
GRU—used in 1 study (Cao et al. 2018);
Generative Adversarial Networks (GAN) (Goodfellow et al. 2014), are deep neural net architectures composed of a two networks, a generator and a discriminator, pitting one against the other—used in 1 study (Cao et al. 2018);
Conditional GAN (Mirza and Osindero 2014), a conditional version of GAN that can be constructed by feeding the data that needs to be conditioned on both the generator and discriminator—used in 1 study (Cao et al. 2018);
Hierarchical Attention Network, a neural architecture for document classification (Yang et al. 2016), used in 1 study (Liu et al. 2018).

Further to the quoted algorithms, 22 studies (Hong and Sinnott 2018; Hanafy et al. 2018; Ghosal et al. 2018; Saleena 2018; Yan et al. 2017; Tong et al. 2017; Dedhia and Ramteke 2017; Wijayanti and Arisal 2017; Xia et al. 2017; Jianqiang 2016; Prusa et al. 2015; Fersini et al. 2015; Abdelwahab et al. 2015; Kanakaraj and Guddeti 2015; Hagen et al. 2015; Cai and Xia 2015; Mansour et al. 2015; Wang et al. 2014; Tsakalidis et al. 2014; Da Silva et al. 2014; Hassan et al. 2013; Gonçalves et al. 2013) used ensemble learning methods in their work, where they combined the output of several base machine learning and/or deep learning methods. In particular, Gonçalves et al. (2013) compared eight popular lexicon and machine learning-based sentiment analysis algorithms, and then developed an ensemble that combines them, which in turn provided the best coverage results and competitive agreement. Moreover, Ghosal et al. (2018) proposes an MLP-based ensemble network that combines LSTM, CNN and feature-based MLP models, with each model incorporating character, word and lexicon level information, to predict the degree of intensity for sentiment and emotion. Lastly, as presented in Table 12, the RF ensemble learning method was used in the 21 studies (Da Silva et al. 2014; Porshnev et al. 2014; Samoylov 2014; Yuan et al. 2014; Buddhitha and Inkpen 2015; Kanakaraj and Guddeti 2015; Jianqiang 2015; Bouchlaghem et al. 2016; Deshwal and Sharma 2016; Jianqiang 2016; Yan and Tao 2016; Tong et al. 2017; Jianqiang and Xiaolin 2017; Bouazizi and Ohtsuki 2017; Elouardighi et al. 2017; Bouazizi and Ohtsuki 2018; Li and Fleyeh 2018; Saleena 2018; Villegas et al. 2018; Yan et al. 2018; Zhang et al. 2018).

3.2.11 Other

In total, 23 studies did not adopt any of the previous approaches (discussed in Sects. 3.2.1–3.2.10). This is mainly due to three reasons: no information provided by the authors (13 studies), use of an automated approach (4 studies), or use of a manual approach (6 studies) (Sandoval-Almazan and Valle-Cruz 2018; Fang and Ben-Miled 2017; Song and Gruzd 2017; Zafar et al. 2016; Furini and Montangero 2016; Cvijikj and Michahelles 2011) to perform a form of SOM. Regarding the former, the majority of them (Ayoub and Elgammal 2018; Tiwari et al. 2017; Ouyang et al. 2017; Anggoro et al. 2016; Williamson and Ruming 2016; Agrawal et al. 2014; Pupi et al. 2014; Das et al. 2014) were not specifically focused on SOM (this was secondary), in contrast to the others (Vivanco et al. 2017; Gonzalez-Marron et al. 2017; Chen et al. 2016; Barapatre et al. 2016; Mejova and Srinivasan 2012). As for the automated approaches (Sharma et al. 2018; Pai and Alathur 2018; Ali et al. 2018; Teixeira and Laureano 2017), some of them used cloud services, such as Microsoft Azure Text Analytics⁶⁰ or out-of-the-box functionality provided by existing tools/software libraries, such as the TextBlob⁶¹ Python library.

Numerous datasets were used across the 465 studies evaluated for this systematic review. These consisted of SOM datasets released online for public use –which have been widely used across the studies– and newly collected datasets, some of which were made available for public use or else for private use within the respective studies. In terms of data collection, the majority of them used the respective platform’s API, such as the Twitter Search API⁶², either directly or through a third-party library, e.g., Twitter4J⁶³. Due to the large number of datasets, only the ones mostly used shall be discussed within this section. In addition, only social datasets are mentioned irrespective of whether other non-social datasets (e.g., news, movies, etc.,) were used, given that the main focus of this review is on social data.

The first sub-section (Sect. 3.3.1) presents an overview of the top social datasets used, whereas the second sub-section (Sect. 3.3.2) presents a comparative analysis of the studies that produced the best performance for each respective social dataset.

3.3.1 Overview

The following are the top ten social datasets used across all studies:

Stanford Twitter Sentiment (STS) Go et al. (2009) used in 61 studies: 1,600,000 training tweets collected via the Twitter API, that is made up of 800,000 tweets containing positive emoticons and 800,000 tweets containing negative emoticons. These are based on various topics, such as Nike, Google, China, Obama, Kindle, San Francisco, North Korea and Iran.

Sanders⁶⁴—used in 32 studies: 5513 hand-classified tweets about four topics: Apple, Google, Microsoft, Twitter. These tweets are labelled as follows: 570 positive, 654 negative, 2503 neutral, and 1786 irrelevant.

SemEval 2013—Task 2⁶⁵ Nakov et al. (2013)—used in 28 studies: Training, development and test sets for Twitter and SMS messages were annotated with positive, negative, and objective/neutral labels via the Amazon Mechanical Turk crowdsourcing platform. This was done for 2 subtasks focusing on an expression-level and message-level.

SemEval 2014—Task 9⁶⁶ Rosenthal et al. (2014)—used in 18 studies: Continuation of SemEval 2013—Task 2, where three new test sets from regular and sarcastic tweets, and LiveJournal sentences were introduced.

STS Gold (STS-Gold) Saif et al. (2013)—used in 17 studies: A subset of STS, which was annotated manually at a tweet and entity-level. The tweet labels were either positive, negative, neutral, mixed, or other.

Health care reform (HCR) Speriosu et al. (2011)—used in 17 studies: Dataset contains tweets about the 2010 health care reform in the USA. A subset of these are annotated for polarity with the following labels: positive, negative, neutral, irrelevant. The polarity targets, such as health care reform, conservatives, democrats, liberals, republicans, Obama, Stupak and Tea Party, were also annotated. All were distributed into training, development and test sets.

Obama-McCain Debate (OMD) Shamma et al. (2009)—used in 17 studies: 3,238 tweets about the first presidential debate held in the USA for the 2008 campaign. The sentiment labels of the tweets are acquired by Diakopoulos and Shamma (2010) using Amazon Mechanical Turk, and are rated as either positive, negative, mixed, or other.

SemEval 2015—Task 10⁶⁷ Rosenthal et al. (2015)—used in 15 studies: This continues on datasets number 3 and 4, with three new subtasks. The first two target sentiment about a particular topic in one tweet or collection of tweets, whereas the third targets the degree of prior polarity of a phrase.

SentiStrength Twitter (SS-Twitter) Thelwall et al. (2012)—used in 12 studies: 6 human-coded databases from BBC, Digg, MySpace, Runners World, Twitter and YouTube annotated for sentiment polarity strength i.e., negative between -1 (not negative) and -5 (extremely negative), and positive between 1 (not positive) and 5 (extremely positive).

10.

SemEval 2016—Task 4⁶⁸ Nakov et al. (2016)—used in 9 studies: This is a re-run of dataset 7, with three new subtasks. The first one replaces the standard two-point scale (positive/negative) or three-point scale (positive/negative/neutral) with a five-point scale (very positive/positive/OK/ negative/very negative). The other two subtasks replaced tweet classification with quantification (i.e., estimating the distribution of the classes in a set of unlabelled items) according to a two-point and five-point scale, respectively.

11.

NLPCC 2012⁶⁹—used in 6 studies: Chinese microblog sentiment dataset (sentence level) from Tencent Weibo provided by the First Conference on Natural Language Processing and Chinese Computing (NLP&CC 2012) It consists of a training set of microblogs about two topics, and a test set about 20 topics, where the subjectivity (subjective/objective) and the polarity (positive/negative/neutral) was assigned for each.

12.

NLPCC 2013⁷⁰—used in 6 studies: Dataset from Sina Weibo used for the Chinese Microblog Sentiment Analysis Evaluation (CMSAE) task in the second conference on NLP&CC 2013. The Chinese microblogs were classified into 7 emotion types: anger, disgust, fear, happiness, like, sadness, surprise. Test set contains 10,000 microblogs, where each text is labelled with a primary emotion type ans a secondary one (if possible).

13.

Sentiment Evaluation (SE-Twitter) Narr et al. (2012)—used in 5 studies: Human annotated multilingual dataset of 12,597 tweets from 4 languages, namely English, German, French, Portuguese. Polarity annotations with labels: positive, negative, neutral, and irrelevant, were conducted manually using Amazon Mechanical Turk.

14.

SemEval 2017—Task 4 Rosenthal et al. (2017)—used in 5 studies: This dataset continues with a re-run of dataset 10, where two new changes were introduced; inclusion of the Arabic language for all subtasks and provision of profile information of the Twitter users that posted the target tweets.

All the datasets above are textual, with the majority of them composed of social data from Twitter. From the datasets above, in terms of language, only the SE-Twitter (number 13) social dataset can be considered as multilingual, with the rest targeting English (majority) or Chinese microblogs, whereas SemEval 2017—Task 4 (number 14) introduced a new language in Arabic. An additional dataset is the one produced by Mozetič et al., which contains 15 Twitter sentiment corpora for 15 European languages (Mozetič et al. 2016). Some studies such as Munezero et al. (2015) used one of the English-based datasets above (STS-Gold) for multiple languages, given that they adopted a lexicon-based approach. Moreover, these datasets had different usage within the respective studies, with the most common being used as a training/test set, the final evaluation of the proposed solution/lexicon, or for comparison purposes. Evaluation challenges like SemEval are important to generate social datasets such as the above and Cortis et al. (2017), since these can be used by the Opinion Mining community for further research and development.

3.3.2 Comparative analysis

A comparative analysis of all the studies that used the social datasets presented in the previous sub-section was carried out. The Precision, Recall, F-measure (F1-score), and Accuracy metrics were selected to evaluate the said studies (when available) and identify the best performance for each respective social dataset. It is important to note that for certain datasets, this could not be done, since the experiments conducted were not consistent across all the studies. The top three studies (where possible) obtaining the best results for each of the four evaluation metrics are presented in the tables below.

Tables 13 and 14 provide the best results for the STS and Sanders datasets.

Table 13

Studies obtaining the best performance for the STS (1) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	87.60% (Jianqiang et al. 2018)	91.76% (Siddiqua et al. 2016)	87.50% (Jianqiang et al. 2018)	89.61% (Shyamasundar and Rani 2016)
2	85.00% (Ismail et al. 2018)	87.60% (Bravo-Marquez et al. 2013)	86.08% (Siddiqua et al. 2016)	88.80% (Arslan et al. 2018)
3	84.56% (Zainuddin et al. 2016)	87.45% (Jianqiang et al. 2018)	83.90% (Saif et al. 2012)	88.30% (Lek and Poo 2013)

Table 14

Studies obtaining the best performance for the Sanders (2) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	97.72% (Ameur et al. 2018)	97.41% (Ameur et al. 2018)	98.20% (Bravo-Marquez et al. 2014)	98.10% (Bravo-Marquez et al. 2014)
2	79.00% (Bravo-Marquez et al. 2013)	89.10% (Bravo-Marquez et al. 2013)	97.57% (Ameur et al. 2018)	88.93% (Korenek and Šimko 2014)
3	77.60% (Deshwal and Sharma 2016)	78.70% (Deshwal and Sharma 2016)	84.85% (Da Silva et al. 2014)	88.30% (Çeliktuğ 2018)

Tables 15 and 16 provide the best results for the SemEval 2013—Task 2 and SemEval 2014—Task 9 datasets, specifically for sub-task B, which focused on message polarity classification. Moreover, the results obtained by the participants of this shared task should be reviewed for a more representative comparative evaluation.

Table 15

Studies obtaining the best performance for the SemEval 2013—Task 2 (3) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	80.69% (Chikersal et al. 2015)	83.68% (Chikersal et al. 2015)	93.70% (Bravo-Marquez et al. 2014)	93.70% (Bravo-Marquez et al. 2014)
2	NA	NA	81.90% (Chikersal et al. 2015)	91.16% (Lima et al. 2015)
3	NA	NA	80.30% (Xia et al. 2017)	89.00% (Yan et al. 2018)

Table 16

Studies obtaining the best performance for the SemEval 2014—Task 9 (4) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	83.56% (Jianqiang et al. 2018)	81.48% (Jianqiang et al. 2018)	82.36% (Xia et al. 2017)	85.82% (Jianqiang et al. 2018)
2	80.47% (Jianqiang 2016)	80.98% (Chikersal et al. 2015)	81.99% (Jianqiang et al. 2018)	83.82% (Jianqiang 2016)
3	78.93% (Chikersal et al. 2015)	76.89% (Jianqiang 2016)	79.81% (Chikersal et al. 2015)	83.06% (Jianqiang and Xueliang 2015)

Tables 17, 18 and 19 provide the best results for the STS-Gold, HCR and OMD datasets.

Table 17

Studies obtaining the best performance for the STS-Gold (5) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	82.75% (Jianqiang et al. 2018)	82.61% (Jianqiang et al. 2018)	83.10% (Saif et al. 2014)	92.67% (Krouska et al. 2016)
2	82.20% (Ismail et al. 2018)	82.30% (Ismail et al. 2018)	82.65% (Jianqiang et al. 2018)	89.02% (Troussas et al. 2016)
3	79.26% (Saif et al. 2014)	80.04% (Saif et al. 2014)	79.62% (Saif et al. 2014)	86.37% (Yan and Tao 2016)

Table 18

Studies obtaining the best performance for the HCR (6) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	71.30% (Mishra and Diesner 2018)	67.40% (Saif et al. 2012)	70.28% (Saleena 2018)	91.94% (Krouska et al. 2016)
2	69.15% (Saif et al. 2012)	59.47% (Saif et al. 2014)	69.10% (Saif et al. 2014)	85.10% (Troussas et al. 2016)
3	59.76% (Saif et al. 2014)	58.30% (Mishra and Diesner 2018)	68.00% (Da Silva et al. 2014)	84.50% (Yan et al. 2018)

Table 19

Studies obtaining the best performance for the OMD (7) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	81.36% (Zhang et al. 2018)	79.00% (Saif et al. 2012)	81.34% (Saif et al. 2014)	92.59% (Krouska et al. 2016)
2	78.95% (Saif et al. 2012)	65.76% (Saif et al. 2014)	78.20% (Saif et al. 2012)	87.74% (Troussas et al. 2016)
3	66.51% (Saif et al. 2014)	61.60% (Mishra and Diesner 2018)	74.65% (Da Silva et al. 2014)	82.90% (Saif et al. 2014)

Table 20 provides the best results for the SemEval 2015—Task 10 dataset, specifically for sub-task B, which focused on message polarity classification. Moreover, the results obtained by the participants of this shared task should be reviewed for a more representative comparative evaluation.

Table 20

Studies obtaining the best performance for the SemEval 2015—Task 10 (8) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	NA	NA	76.02% (Xia et al. 2017)	68.77% (Stojanovski et al. 2015)
2	NA	NA	67.39% (Sygkounas et al. 2016)	68.00% (Li et al. 2017)
3	NA	NA	64.88% (Stojanovski et al. 2018)	51.95% (Stojanovski et al. 2018)

Table 21 provides the best results for the SS-Twitter dataset.

Table 21

Studies obtaining the best performance for the SS-Twitter (9) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	80.61% (Jianqiang et al. 2018)	80.86% (Jianqiang et al. 2018)	80.72% (Jianqiang et al. 2018)	89.10% (Yan et al. 2018)
2	67.77% (Zhang et al. 2018)	54.77% (Zhang et al. 2018)	72.27% (Saif et al. 2014)	84.59% (Lima et al. 2015)
3	NA	NA	59.27% (Zhang et al. 2018)	81.56% (Su and Wang 2017)

Table 22 provides the best results for the SemEval 2016—Task 4 dataset, specifically for sub-task A, which focused on message polarity classification. Moreover, the results obtained by the participants of this shared task should be reviewed for a more representative comparative evaluation.

Table 22

Studies obtaining the best performance for the SemEval 2016—Task 4 (10) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	64.10% (Mishra and Diesner 2018)	60.50% (Mishra and Diesner 2018)	77.25% (Xia et al. 2017)	65.60% (Mishra and Diesner 2018)
2	NA	NA	61.40% (Mishra and Diesner 2018)	NA
3	NA	NA	57.10% (Villegas et al. 2018)	NA

Tables 23 and 24 provide the best results for the NLPCC 2012 dataset. Results quoted below are for task 1 which focused on subjectivity classification (see Table 23) and task 2 which focused on sentiment polarity classification (see Table 24). Moreover, the results obtained by the participants of this shared task should be reviewed for a more representative comparative evaluation.

Table 23

Studies obtaining the best performance for the NLPCC 2012 - Task 1 (11) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	72.20% (Cui et al. 2013)	96.70% (Feng et al. 2015)	78.80% (Feng et al. 2015)	NA
2	69.15% (Hao et al. 2017)	96.00% (Shi et al. 2013)	77.00% (Shi et al. 2013)	NA
3	66.90% (Feng et al. 2015)	73.80% (Cui et al. 2013)	72.10% (Cui et al. 2013)	NA

Table 24

Studies obtaining the best performance for the NLPCC 2012 - Task 2 (11) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	80.90% (Shi et al. 2013)	77.80% (Shi et al. 2013)	79.30% (Shi et al. 2013)	NA
2	78.60% (Cui et al. 2013)	74.60% (Feng et al. 2015)	69.14% (Hao et al. 2017)	NA
3	70.83% (Hao et al. 2017)	67.52% (Hao et al. 2017)	67.10% (Cui et al. 2013)	NA

Tables 25 and 26 provide the best results for the NLPCC 2013 and SE-Twitter datasets.

Table 25

Studies obtaining the best performance for the NLPCC 2013 (12) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	NA	NA	83.02% (Xia et al. 2017)	78.80% (Jiang et al. 2015)
2	NA	NA	NA	63.90% (Jiang et al. 2013)
3	NA	NA	NA	NA

Table 26

Studies obtaining the best performance for the SE-Twitter (13) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	88.00% (Jianqiang et al. 2018)	87.32% (Jianqiang et al. 2018)	87.66% (Jianqiang et al. 2018)	87.39% (Jianqiang et al. 2018)
2	86.16% (Jianqiang 2016)	86.15% (Jianqiang 2016)	86.08% (Jianqiang 2016)	86.72% (Jianqiang 2016)
3	NA	NA	NA	85.87% (Jianqiang and Xueliang 2015)

Table 27 provides the best results for the SemEval 2017—Task 4 dataset, specifically for sub-task A, which focused on message polarity classification. Moreover, the results obtained by the participants of this shared task should be reviewed for a more representative comparative evaluation.

Table 27

Studies obtaining the best performance for the SemEval 2017—Task 4 (14) social dataset

Ranking	Precision	Recall	F-measure	Accuracy
1	NA	NA	62.30% (Villegas et al. 2018)	67.30% (Symeonidis et al. 2018)
2	NA	NA	NA	60.70% (Effrosynidis et al. 2017)
3	NA	NA	NA	NA

The following are some comments regarding the social dataset results quoted in the tables above:

In cases where several techniques and/or methods were applied, the highest result obtained in the study for each of the four evaluation metrics, was recorded, even if the technique did not produce the best result for all metrics.
The average Precision, Recall, and F-measure results are quoted (if provided by authors), i.e., average score of the results for each classified level (e.g., the average score of the results obtained for each sentiment polarity classification level - positive, negative and, neutral).
Results for social datasets that were released as a shared evaluation task, such as SemEval, were either only provided in the metrics used by the task organisers or other metrics were chosen by the authors, therefore not quoted.
Certain studies evaluated their techniques based on a subset of the actual dataset. Results quoted are the ones where the entire dataset was used (according to the authors and/our our understanding).
Quoted results are for classification tasks and not aspect-based SOM, which can vary depending on the focus of the study.
Results presented in a graph visualisation were not considered due to the exact values not being clear.

3.4 Language

Multilingual/bilingual SOM is very challenging, since it deals with multi-cultural social data. For example, analysing Chinese and English online posts can bring a mixed sentiment on such posts. Therefore, it is hard for researchers to make a fair judgement in cases where online posts’ results from different languages contradict each other (Yan et al. 2014).

The majority of the studies (354 out of 465) considered for this review analysis support one language in their SOM solutions. A total of 80 studies did not specify whether their proposed solution is language-agnostic or otherwise, or else their modality was not textual-based. Lastly, only 31 studies cater for more than one language, with 18 being bilingual, 1 being trilingual and 12 proposed solutions claiming to be multilingual. Regarding the latter, the majority were tested on a few languages at most, with Castellucci et al. (2015, 2015) on English and Italian, Montejo-Raez et al. (2014) on English and Spanish, Erdmann et al. (2014) on English and Japanese, Radhika and Sankar (2017) on English and Malayalam⁷¹. Baccouche et al. (2018) on English, French and Arabic, Munezero et al. (2015) on keyword sets for different languages (e.g., Spanish, French), Wehrmann et al. (2017) on English, Spanish, Portuguese and German, Cui et al. (2011) on Basic Latin (English) and Extended Latin (Portuguese, Spanish, German), Teixeira and Laureano (2017) on Spanish, Italian, Portuguese, French, English, and Arabic, Zhang et al. (2017) on 8 languages, namely English, German, Portuguese, Spanish, Polish, Slovak, Slovenian, Swedish, and Gao et al. (2016) on 11 languages, namely English, Dutch, French, German, Italian, Polish, Portuguese, Russian, Spanish, Swedish and Turkish.

The list below specifies the languages supported by the 19 bilingual and trilingual studies:

English and Italian (Severyn et al. 2016; D’Avanzo and Pilato 2015; Pupi et al. 2014);
English and German (Abdelrazeq et al. 2016; Tumasjan et al. 2010);
English and Spanish (Giachanou et al. 2017; Cotfas et al. 2015; Delcea et al. 2014);
English and Brazilian Portuguese (Guerra et al. 2014);
English and Chinese (Xia et al. 2017; Yan et al. 2014);
English and Dutch (Flaes et al. 2016);
English and Greek (Politopoulou and Maragoudakis 2013);
English and Hindi (Anjaria and Guddeti 2014);
English and Japanese (Ragavi and Usharani 2014);
English and Roman-Urdu (Javed et al. 2014);
English and Swedish (Li and Fleyeh 2018);
English and Korean (Ramadhani and Goo 2017);
English, German and Spanish (Boididou et al. 2018).

Some studies above (D’Avanzo and Pilato 2015; Anjaria and Guddeti 2014; Tumasjan et al. 2010) translated their input data into an intermediate language, mostly English, to perform SOM.

Moreover, Table 28 provides a list of the non-English languages identified from the 354 studies that support one language. Chou et al. (2017) claim that their method can be easily applied to any ConceptNet⁷² supported language, with Wang et al. (2016) similarly claiming that their method is language independent, whereas the solution by Wang and Wu (2015) is multilingual given that emoticons are used in the majority of languages.

Table 28

Non-English languages supported by studies in this review analysis

Language	Total	Studies
Chinese	53	Cao et al. (2018), Li et al. (2018), Liu et al. (2018), Sun et al. (2018), Wang et al. (2018), Wan et al. (2018), Chou et al. (2017), Hao et al. (2017), Ouyang et al. (2017), Shi et al. (2017), Sun et al. (2017), Zhang et al. (2017), Zhang et al. (2013), Gao et al. (2016), Liu and Young (2016), Zhao et al. (2016), Wang et al. (2016), Wu et al. (2016), Li et al. (2016), Yang and Zhou (2015), Chen et al. (2015), Wang et al. (2014), Sui et al. (2012), Yanmei and Yuda (2015), Liu et al. (2015), Zhang et al. (2015), Wang et al. (2014), Tian et al. (2015), Feng et al. (2015), Song et al. (2015), Jiang et al. (2015), Kuo et al. (2016), Wang et al. (2016), Wang and Wu (2015), Du et al. (2014), Gao et al. (2015), Chen et al. (2015), Wang et al. (2014), Su et al. (2014), Ou et al. (2014), Yan et al. (2014), Pei et al. (2014), Sun et al. (2014), Wang and Li (2014), Xiong et al. (2013), Zhu et al. (2013), Jiang et al. (2013), Zhang et al. (2013), Tang et al. (2013), Cui et al. (2013), Shi et al. (2013), Zhang et al. (2012) and Li and Xu (2014)
Spanish	11	Calvo and Juárez Gambino (2018), Hubert et al. (2018), Ochoa-Luna and Ari (2018), Sánchez-Holgado and Arcila-Calderón (2018), Gonzalez-Marron et al. (2017), Tellez et al. (2017), Gambino and Calvo (2016), Tapia and Velásquez (2014), Molina-González et al. (2014), Pla and Hurtado (2014) and Ortigosa et al. (2014)
Indonesian	8	Fitri et al. (2018), Permatasari et al. (2018), Setiawan et al. (2018), Napitu et al. (2017), Nugroho et al. (2017), Rinaldi and Musdholifah (2017), Wijayanti and Arisal (2017) and Ramadhani et al. (2016)
Italian	5	Buscaldi and Hernandez-Farias (2015), Furini and Montangero (2016), Santarcangelo et al. (2015), Bosco et al. (2013) and Hernandez-Farias et al. (2014)
Arabic	5	Alzahrani (2018), Elouardighi et al. (2017), Bouchlaghem et al. (2016), Khalil et al. (2015) and Abdul-Mageed et al. (2014)
Portuguese	3	Kokkinogenis et al. (2015), Seron et al. (2015) and Souza and Vieira (2012)
Brazilian Portuguese	3	dos Santos et al. (2018), Souza et al. (2016) and Costa et al. (2014)
Japanese	3	Tago and Jin (2018), Ishikawa and Sakurai (2017) and Maeda et al. (2012)
Korean	2	Cho et al. (2014) and Park et al. (2011)
French	2	Ameur et al. (2018) and Lai et al. (2015)
French—Bambara	1	Konate and Du (2018)
Bulgarian	1	Smailović et al. (2015)
German	1	Rill et al. (2014)
Roman Urdu	1	Zafar et al. (2016)
Russian	1	Averchenkov et al. (2015)
Swiss German	1	Cvijikj and Michahelles (2011)
Thai	1	Wunnasri et al. (2013)
Persian	1	Salari et al. (2018)
Bengala	1	Sabuj et al. (2017)
Vietnamese	1	Vo et al. (2017)

3.5 Modality

The majority of the studies in this systematic review and in the state-of-the-art focus on SOM on the textual modality, with only 15 out of 465 studies applying their work on more than one modality. However, other modalities, such as visual (image, video), and audio information is often ignored, even though it contributes greatly towards expressing user emotions (Chen et al. 2015). Moreover, when two or more modalities are considered together for any form of social opinion, such as emotion recognition, they are often complementary, thus increase the system’s performance (Caschera et al. 2016). Table 29 lists the multimodal studies within the review analysis, with the ones catering for two modalities –text and image– being the most popular.

Table 29

Studies adopting a multimodal approach

Text	Image	Video	Audio	Studies
✓	✓			Ortis et al. (2018), Rai et al. (2018), Saini et al. (2018), Chen et al. (2017), Chen et al. (2015), Baecchi et al. (2016), Liu et al. (2015), Zhang et al. (2015), Wang et al. (2014), Flaes et al. (2016), Cai and Xia (2015) and Yuan et al. (2015)
		✓	✓	Song and Gruzd (2017)
✓		✓	✓	Caschera et al. (2016)
✓	✓	✓	✓	Poria et al. (2016)

3.5.1 Datasets

Current available datasets and resources for SOM are restricted to the textual modality only. The following are the non-textual social datasets (not listed in Sect. 3.3) used across the mentioned studies:

YouTube Dataset (Morency et al. 2011) used in Poria et al. (2016): 47 videos targeting various topics, such as politics, electronics and product reviews.
SentiBank Twitter Dataset⁷³ (Borth et al. 2013) used in Baecchi et al. (2016) and Cai and Xia (2015): Image dataset from Twitter annotated for polarity using Amazon Mechanical Turk. Tweets with images related to 21 hashtags (topics) resulted in 470 being positive and 133 being negative.
SentiBank Flickr Dataset (Borth et al. 2013) used in Cai and Xia (2015): 500,000 image posts from Flickr labeled by 1553 adjective noun pairs based on Plutchik’s Wheel of Emotions (psychological theory) (Plutchik 1980).
You Image Dataset (You et al. 2015) used in Cai and Xia (2015): Image dataset from Twitter consisting of 769 positive and 500 negative tweets with images, annotated using Amazon Mechanical Turk.
Katsurai and Sotoh Image Dataset⁷⁴ (Katsurai and Satoh 2016) used in Ortis et al. (2018): Dataset of images from Flickr (90,139) and Instagram (65,439) with their sentiment labels.

3.5.2 Observations

The novel methodology by Poria et al. (2016), is the only mutlimodal sentiment analysis approach which caters for four different modalities, namely text, vision (image and video) and audio. Sentiments are extracted from social Web videos. In Caschera et al. (2016), the authors propose a method whereby machine learning techniques need to be trained on different and heterogeneous features when used on different modalities, such as polarity and intensity of lexicons from text, prosodic features from audio, and postures, gestures and expressions from video. The sentiment of video and audio data in Song and Gruzd (2017) was manually coded, which task is labour intensive and time consuming. The addition of images to the microblogs’ textual data reinforces and clarifies certain feelings (Wang et al. 2014; Baecchi et al. 2016), thus improving the sentiment classifier with the image features (Liu et al. 2015; Zhang et al. 2015; Wang et al. 2014; Cai and Xia 2015). Similarly, Chen et al. (2015) also demonstrates superiority with their multimodal hypergraph method when compared to single modality (in this case textual) methods. Moreover, these results are further supported by the method in Poria et al. (2016)—which caters for more than two modalities, in audio, visual and textual—where it shows that accuracy improves drastically when such modalities are used together.

Flaes et al. (2016) apply their multimodal (text, images) method in a real world application area, which research shows that several relationships exist between city liveability indicators collected by the local government and sentiment that is extracted automatically. For example, a negative linear association of detected sentiment from Flickr data is related with people living on welfare checks. Results in Rai et al. (2018) show that there is a high correlation between sentiment extracted from text-based social data and image-based landscape preferences by humans. In addition, results in Yuan et al. (2015) show some correlation between image and textual tweets. However, the authors mention that more features and robust data is required to determine the exact influence of multimedia content in the social domain. The work in Chen et al. (2017) adopts a bimodal approach to solve the problem of cross-domain image sentiment classification by using textual features and visual features from the target domain and measuring the text/image similarity simultaneously.

Therefore, multimodality in the SOM domain is one of numerous research gaps identified in this systematic review. This provides researchers with an opportunity towards further research, development and innovation in this area.

3.6 Tools and technologies

In this systematic review, we also analysed the tool and technologies that were used across all studies for various opinion mining operations conducted on social data, such as NLP, machine learning, and big data handling. The subsections below provide respective lists for the ones mostly used across the studies for the various operations required.

3.6.1 NLP

The following are the top 5 NLP tools used across all studies for various NLP tasks:

Natural Language Toolkit (NLTK)⁷⁵: a platform that provides lexical resources, text processing libraries for classification, tokenisation, stemming, tagging, parsing, and semantic reasoning, and wrappers for industrial NLP libraries;
TweetNLP⁷⁶: consists of a tokeniser, Part-of-Speech (POS) tagger, hierarchical word clusters, and a dependency parser for tweets, besides annotated corpora and web-based annotation tools;
Stanford NLP⁷⁷: software that provides statistical NLP, deep learning NLP and rule-based NLP tools, such as Stanford CoreNLP, Stanford Parser, Stanford POS Tagger;
NLPIR-ICTCLAS⁷⁸: a Chinese word segmentation system that includes keyword extraction, POS tagging, NER, and microblog analysis, amongst other features;
word2vec⁷⁹: an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words.

3.6.2 Machine learning

The top 5 machine learning tools used across all studies are listed below:

Weka⁸⁰: a collection of machine learning algorithms for data mining tasks, including tools for data preparation, classification, regression, clustering, association rules mining and visualisation;
scikit-learn⁸¹: consists of a set of tools for data mining and analysis, such as classification, regression, clustering, dimensionality reduction, model selection and pre-processing;
LIBSVM⁸²: an integrated software for support vector classification, regression, distribution estimation and multi-class classification;
LIBLINEAR⁸³: a linear classifier for data with millions of instances and features;
SVM-Light⁸⁴: is an implementation of SVMs for pattern recognition, classification, regression and ranking problems.

3.6.3 Opinion mining

Certain studies used opinion mining tools in their research to either conduct their main experiments or for comparison purposes to their proposed solution/s. The following are the top 3 opinion mining tools used:

SentiStrength⁸⁵: a sentiment analysis tool that is able to conduct binary (positive/negative), trinary (positive/neutral/negative), single-scale (-4 very negative to very positive +4), keyword-oriented and domain-oriented classifications;
Sentiment140⁸⁶: a tool that allows you to discover the sentiment of a brand, product, or topic on Twitter;
VADER (Valence Aware Dictionary and sEntiment Reasoner)⁸⁷: a lexicon and rule-based sentiment analysis tool that is specifically focused on sentiments expressed in social media.

3.6.4 Big data

Several big data technologies were used by the analysed studies. The most popular ones are categorised in the list below:

Relational storage

(a)

MySQL⁸⁸

(b)

PostgreSQL⁸⁹

(c)

Amazon Relational Database Service (Amazon RDS)⁹⁰

(d)

Microsoft SQL Server⁹¹

Non-relational storage

(a)

Document-based

MongoDB⁹²

ii.

Apache CouchDB⁹³

(b)

Column-based

Apache HBase⁹⁴

Resource Description Framework Triplestore

Distributed Processing

(a)

Apache Hadoop⁹⁵

(b)

Apache Spark⁹⁶

(c)

IBM InfoSphere Streams⁹⁷

(d)

Apache AsterixDB⁹⁸

(e)

Apache Storm⁹⁹

Data Warehouse

(a)

Apache Hive¹⁰⁰

Data Analytics

(a)

Databricks¹⁰¹

The MySQL relational database management system was the most technology used for storing structured social data, whereas MongoDB was mostly used for processing unstructured social data. On the other hand, the distributed processing technologies were used for processing large scale social real-time and/or historical data. In particular, Hadoop MapReduce was used for parallel processing of large volumes of structured, semi-structured and unstructured social datasets, that are stored in the Hadoop Distributed File System. Spark’s ability to process both batch and streaming data was utilised in cases where velocity is more important than volume.

3.7 Natural language processing tasks

This section presents information about other NLP tasks that were conducted to perform SOM.

3.7.1 Overview

An element of NLP is performed in 283 studies, out of the 465 analysed, either for pre-processing (248 studies), feature extraction (Machine Learning) or one of the processing parts within their SOM solution. The most common and important NLP tasks range from Tokenisation, Segmentation and POS, to NER and Language Detection.

It is important to mention that the NLP tasks mentioned above together with Anaphora Resolution, Parsing, Sarcasm, and Sparsity, are some other challenges faced in the SOM domain (Khan et al. 2014). Moreover, online posts with complicated linguistic patterns are challenging to deal with Li and Xu (2014).

However, Koto and Adriani (2015) showcase the importance and potential of NLP within this domain, where they investigated the pattern or word combination of tweets in subjectivity and polarity by considering their POS sequence. Results reveal that subjective tweets tend to have word combinations consisting of adverb and adjective, whereas objective tweets tend to have a word combination of nouns. Moreover, negative tweets tend to have a word combination of affirmation words which often appear as a negation word.

3.7.2 Pre-processing and negations

The majority (355 out of 465) of the studies performed some sort of pre-processing in their studies. Different methods and resources were used for such a process, such as NLP tasks (e.g., tokenisation, stemming, lemmatisation, NER), and dictionaries for stop words, acronyms for slang words, and others (e.g., noslang.com, noswearing.com, Urban Dictionary, Internet lingo).

Negation handling is one of the most challenging issues faced by SOM solutions. However, 117 studies cater for negations within their approach, Several different methods are used, such as negation replacement, negation transformation, negation dictionaries, textual features based on negation words and negation models.

3.7.3 Emoticons/Emojis

Social media can be seen as a sub-language that uses emoticons/emojis mixed with text to show emotions (Min et al. 2013). Emoticons/emojis are commonly used in tweets irrespective of the language, therefore are sometimes considered as being domain and language independent (Khan et al. 2014), thus useful for multilingual SOM (Cui et al. 2011).

Even though some researchers remove emoticons/emojis as part of their pre-processing stage (depending on what the authors want to achieve), many others have utilised the respective emotional meaning within their SOM process. This has led to emoticons/emojis in playing a very important role within 205 solutions of the analysed studies especially when the focus is on emotion recognition.

Results obtained from the emoticon networks model in Zhang et al. (2013) show that emoticons can help in performing sentiment analysis. This is further supported by Jiang et al. (2015) who found that emoticons are a pure carrier of sentiment. This is further supported by the results obtained by the emoticon polarity-aware method in Li et al. (2018) which show that emoticons can significantly improve the precision for identifying the sentiment polarity. In the case of hybrid (lexicon and machine learning) approaches, emoticon-aided lexicon expansion improve the performance of lexicon-based classifiers (Zhou et al. 2014). From an emotion classification perspective, Porshnev et al. (2014) analysed users’ emoticons on Twitter to improve the accuracy of predictions for the Dow Jones Industrial Average and S&P 500 stock market indices. Other researchers (Cvijikj and Michahelles 2011) were interested in analysing how people express emotions, displayed via adjectives or usage of internet slang i.e., emoticons, interjections and intentional misspelling.

Several emoticon lists were used in these studies, with the Wikipedia and DataGenetics¹⁰² ones commonly used. Moreover, emoticon dictionaries, such as (Agarwal et al. 2011; Aisopos et al. 2012; Becker et al. 2013), consisting of emoticons and their corresponding polarity class were also used in certain studies.

3.7.4 Word embeddings

Word embeddings, a type of word representation which allows words with a similar meaning to have a similar representation, were used by several studies (Severyn and Moschitti 2015; Jiang et al. 2015; Castellucci et al. 2015, 2015; Cai and Xia 2015; Gao et al. 2015; Chen et al. 2015; Stojanovski et al. 2015; Gao et al. 2016; Zhao et al. 2016; Rexha et al. 2016; Hao et al. 2017; Kitaoka and Hasuike 2017; Arslan et al. 2018; Baccouche et al. 2018; Chen et al. 2018; Ghosal et al. 2018; Hanafy et al. 2018; Jianqiang et al. 2018; Stojanovski et al. 2018; Sun et al. 2018; Wan et al. 2018; Yan et al. 2018) adopting a learning-based (Machine Learning, Deep Learning and Statistical) or hybrid approach. These studies used word embedding algorithms, such as word2vec, fastText¹⁰³, and/or GloVe¹⁰⁴. Such a form of learned representation for text is capable of capturing the context of words within a piece of text, syntactic patterns, semantic similarity and relation with other words, amongst other word representations. Therefore, word embeddings are used for different NLP problems, with SOM being one of them.

Sentence-level SOM approaches tend to fail in discovering an opinion dimension, such as sentiment polarity about a particular entity and/or its aspects (Cambria et al. 2013). Therefore, an aspect-level (also referred to as feature/topic-based) (Hu and Liu 2004) approach –where an opinion is made up of targets and their associated opinion dimension (e.g., sentiment polarity)– has been used in some studies to overcome such issues. Certain NLP tasks, such as a parsing, POS tagger, and NER, are usually required to extract the entities or aspects from the respective social data.

From all the studies analysed, 39 performed aspect-based SOM, with 37 (Bansal and Srivastava 2018; Dragoni 2018; Gandhe et al. 2018; Ghiassi and Lee 2018; Kao and Huang 2018; Katz et al. 2018; Liu et al. 2018; Rathan et al. 2018; Wang et al. 2018; Zainuddin et al. 2018; Abdullah and Zolkepli 2017; Dambhare and Karale 2017; Hagge et al. 2017; Ray and Chakrabarti 2017; Rout et al. 2017; Tong et al. 2017; Vo et al. 2017; Zhou et al. 2017; Zimbra et al. 2016; Zainuddin et al. 2016, 2016; Kokkinogenis et al. 2015; Lima et al. 2015; Hridoy et al. 2015; Castellucci et al. 2015; Averchenkov et al. 2015; Tan et al. 2014; Lau et al. 2014; Del Bosque and Garza 2014; Varshney and Gupta 2014; Unankard et al. 2014; Lek and Poo 2013; Wang and Ye 2013; Min et al. 2013; Kontopoulos et al. 2013; Jiang et al. 2011; Prabowo and Thelwall 2009) focusing on aspect-based sentiment analysis, 1 (Aoudi and Malik 2018) on aspect-based sentiment and emotion analysis and 1 (Weichselbraun et al. 2017) on aspect-based affect analysis.

In particular, the Twitter aspect-based sentiment classification process in Lek and Poo (2013) consists of the following main steps: aspect-sentiment extraction, aspect ranking and selection, and aspect classification, whereas Lau et al. (2014) use NER to parse product names to determine their polarity. The aspect-based sentiment analysis approach in Hagge et al. (2017) leveraged POS tagging and dependency parsing. Moreover, Zainuddin et al. (2016) proposed a hybrid approach to analyse aspect-based sentiment of tweets. As the authors claim, it is more important to identify the opinions of tweets rather than finding the overall polarity which might not be useful to organisations. In Zainuddin et al. (2018), the same authors used association rule mining augmented with a heuristic combination of POS patterns to find single and multi-word explicit and implicit aspects. Results in Jiang et al. (2011) show that classifiers incorporating target-dependent features significantly outperform target-independent ones. In contrast to the studies discussed, Weichselbraun et al. (2017) introduced an aspect-based analysis approach that integrates affective (includes sentiment polarity and emotions) and factual knowledge extraction to capture opinions related to certain aspects of brands and companies. The social data analysed is classified in terms of sentiment polarity and emotions, aligned with the “Hourglass of Emotions” (Susanto et al. 2020).

In terms of techniques, the majority of the aspect-based studies used a hybrid approach, where only 5 studies used deep learning for such a task. In particular, the study by Averchenkov et al. (2015) used a deep learning approach based on RNNs for aspect-based sentiment analysis. A comparative review of deep learning for aspect-based sentiment analysis published by Do et al. (2019) discusses current research in this domain. It focuses on deep learning approaches, such as CNN, LSTM and GRU, for extracting both syntactic and semantic features of text without the need for in-depth requirements for feature engineering as required by classical NLP. For future research directions on aspect-based SOM, refer to Sect. 6.2.

4.1 Context

An opinion describes a viewpoint or statement about a subjective matter. In many research problems, authors assume that an opinion is more specific and of a simpler definition. For example, sentiment analysis is considered to be a type of opinion mining even though it is only focused on extracting the sentimental score from a given text. Social data contains a wealth of signals to mine where opinions can be extracted over time. Different types of opinions require different modes of analysis (Agrawal et al. 2014). This leads to opinions being multi-dimensional semantic artefacts. In fact, Troussas et al. specify that “emotions and polarities are mutually influenced by each other, conditioning opinion intensities and emotional strengths”. Moreover, multiple studies applied different approaches, where Bravo-Marquez et al. (2013) showed that a composition of polarity, emotion and strength features, achieve significant improvements over single approaches, whereas (Koto and Adriani 2015) focused on finding the correlation between emotion—which can be differentiated by facial expression, voice intonation and also words—and sentiment in social media. Similar in nature, Buscaldi and Hernandez-Farias (2015) found out that finer-grained negative tweets potentially help in differing between negative feelings, e.g., fear (emotion).

Furthermore, mood, emotions and decision making are closely connected (Porshnev and Redkin 2014). Research on multi-dimensional sentiment analysis shows that human mood is very rich in social media, where a piece of text may contain multiple moods, such as calm and agreement (Huang et al. 2015). On the other hand, there are studies showing that one mood alone is already highly influential in encouraging people to rummage through Twitter feeds for predictive information. For example in Weiss et al. (2015), “calmness” was highly correlated with stock market movement. Different dimenions of opinions are also able to effect different entities, such as events. Results in Zhang et al. (2012) show a strong correlation between emergent events and public moods. In such cases, new events can be identified by monitoring emotional vectors in microblogs. Moreover, work in Thelwall et al. (2011) assessed if popular events are correlated with sentiment strength as it increases, which is likely the case.

All of the above motivates us to pursue further research and development on the identification of different opinion dimensions that are present within social data, such as microblogs, published across heterogeneous social media platforms. A more fine-grained opinion representation and classification of this social data shall lead to a better understanding of the messages conveyed, thus potentially influencing multiple application areas. Section 5 lists the application areas of the analysed studies.

4.2 Different dimensions of social opinions identified in the review analysis

The analysed studies focused on different opinion dimensions, namely: objectivity/subjectivity, sentiment polarity, emotion, affect, irony, sarcasm and mood. These were conducted on different levels, such as, document-level, sentence-level, and/or feature/aspect-based, depending on the study. Same as for the techniques presented in Sect. 3.2, 465 studies were evaluated. The majority focused on one social opinion dimension, with 60 studies focusing on more than one; 58 on two dimensions, 1 on three dimensions, and 1 on four dimensions. In this regard, Table 30 lists the different dimensions and respective studies. Most of the studies focused on sentiment analysis, specifically polarity classification.

Table 30

Studies focusing on two or more social opinion dimensions

Dimensions	Studies
subjectivity and sentiment polarity	Jiang et al. (2011), Blenn et al. (2012), Bravo-Marquez et al. (2013), Zhu et al. (2013), Wang and Ye (2013), Cui et al. (2013), Li and Li (2013), Rui et al. (2013), Bravo-Marquez et al. (2014), Tan et al. (2014), Garg and Chatterjee (2014), Abdul-Mageed et al. (2014), Samoylov (2014), Koto and Adriani (2015), Koto and Adriani (2015), Koto and Adriani (2015), Feng et al. (2015), Mansour et al. (2015), Wu et al. (2016), Zainuddin et al. (2016), Er et al. (2016), Abdullah and Zolkepli (2017), Hao et al. (2017), Ahuja and Dubey (2017), Sahni et al. (2017), Moh et al. (2017), Dritsas et al. (2018), Gandhe et al. (2018) and Nausheen and Begum (2018)
sentiment polarity and emotion	Cvijikj and Michahelles (2011), Orellana-Rodriguez et al. (2013), Sheth et al. (2014), Yuan et al. (2015), Orellana-Rodriguez et al. (2015), Gallegos et al. (2016), Qaisi and Aljarah (2016), Shukri et al. (2015), Munezero et al. (2015), Barapatre et al. (2016), Karyotis et al. (2017), Bouazizi and Ohtsuki (2017), Radhika and Sankar (2017), Abdullah and Hadzikadic (2017), Zhang et al. (2017), Singh et al. (2018), Aoudi and Malik (2018), Pai and Alathur (2018), Ghosal et al. (2018), Rout et al. (2018), dos Santos et al. (2018) and Stojanovski et al. (2018)
sentiment polarity and mood	Bollen et al. (2011)
sentiment polarity and irony	Reyes et al. (2013)
sentiment polarity and sarcasm	Unankard et al. (2014)
sentiment polarity and affect	Weichselbraun et al. (2017)
emotion and anger	Delcea et al. (2014) and Cotfas et al. (2015)
irony and sarcasm	Fersini et al. (2015)
subjectivity, sentiment polarity and emotion	Jiang et al. (2015)
subjectivity, sentiment polarity, emotion and irony	Bosco et al. (2013)

The following sections present the different tasks conducted for each form of opinion mentioned above¹⁰⁵.

4.2.1 Subjectivity

Subjectivity determines whether a sentence expresses an opinion –in terms of personal feelings or beliefs– or not, in which case a sentence expresses objectivity. Objectivity refers to sentences that express some factual information about the world (Liu 2010).

subjectivity classification: 2-level

(a)

objective/subjective

(b)

neutral/subjective

(c)

opinionated/not opinionated

subjectivity classification: 3-level

(a)

objective/positive/negative

(b)

objective/subjective/subjective&objective

subjectivity score

(a)

objective/subjective ranging from 0 (low) to 1 (high)

In this domain, objective statements are usually classified as being neutral (in terms of polarity), whereas subjective statements are non-neutral. In the latter cases, sentiment analysis is performed to determine the polarity classification (more information on this below). However, it is important to clarify that neutrality and objectivity are not the same. Neutrality refers to situations whereby a balanced view is taken, whereas objectivity refers to factual based i.e., true statements/facts that are quantifiable and measurable.

4.2.2 Sentiment

Sentiment determines the polarity (positive/negative/neutral) and strength/intensity (through a numeric rating score e.g., 1–5 stars, or level of depth e.g., low/high/medium) of an expressed opinion (Liu 2010).

polarity classification: 2-level

(a)

positive/negative

(b)

for/against

(c)

pro/against

(d)

positive/not positive (neutral or negative)

polarity classification: 3-level

(a)

positive/negative/neutral

(b)

positive/negative/null

(c)

positive/negative/contradiction

(d)

positive/negative/objective

(e)

positive/negative/other (neutral, irrelevant)

(f)

positive/negative/humorous

(g)

positive/negative/irrelevant

(h)

positive/negative/uncertain

(i)

beneficial (positive)/harmful (negative)/neutral

(j)

personal negative/personal non-negative/non-personal i.e. news

(k)

bullish/bearish/neutral

polarity classification: 4-level

(a)

positive/not so positive/not so negative/negative

(b)

positive/negative/neutral/irrelevant

(c)

positive/negative/neutral/undefined

(d)

positive/negative/neutral/none

(e)

positive/negative/neutral/meaningless

(f)

positive/negative/neutral/not related to target topic

(g)

positive/negative/neutral/unsure

(h)

positive/negative/neutral/uninformative

(i)

subjective/positive/negative/ironic (subjectivity and irony classification is also considered)

(j)

positive/negative/mixed/none

(k)

for/against/mixed/neutral

(l)

positive/negative/neutral/ideology/sarcastic

polarity classification: 5-level

(a)

highly positive/positive/neutral/negative/highly negative

(b)

strong positive/positive/neutral/negative/strong negative

(c)

strongly positive/mildly positive/neutral/mildly negative/strongly negative

(d)

strongly positive/slightly positive/neutral/slightly negative/strongly negative

(e)

very positive/positive/neutral/negative/very negative

(f)

positive/somewhat positive/neutral/somewhat negative/negative

(g)

most positive/positive/neutral/negative/most negative

(h)

extremely positive/positive/neutral/negative/extremely negative

(i)

positive/negative/ironic/positive and negative/objective (subjectivity and irony classification is also considered)

(j)

excellent/good/neutral/bad/worst

(k)

worst/bad/neutral/decent/wonderful

polarity classification: 6-level

(a)

strong positive/steady positive/week positive/week negative/steady negative/strong negative

polarity classification: 8-level

(a)

partially positive/mildly positive/positive/extremely positive/partially negative/mildly negative/negative/extremely negative

(b)

ProCon/AntiCon/ProLab/AntiLab/ProLib/AntiLib/Unknown/Irrelevant (levels are oriented towards the political domain)

polarity classification: 12-level

(a)

future orientation/past orientation/positive emotions/negative emotions/sadness/anxiety/anger/ tentativeness/certainty/work/achievement/money

polarity score

(a)

negative ranging from 0–0.5 and positive ranging from 0.5–1

(b)

negative/neutral/positive ranging from 0 (low) to 0.45 (high)

(c)

negative/positive ranging from -1 (low) to 1 (high)

(d)

negative/positive ranging from -1.5 (low) to 1.5 (high)

(e)

negative/positive ranging from -2 (low) to 2 (high)

(f)

negative/positive ranging from 1 (low) to 5 (high)

(g)

negative ranging from -1 (low) to -5 (high) and positive ranging from 1 (low) to 5 (high)

(h)

strongly negative to strongly positive ranging from -2 (low) to 2 (high)

(i)

normalised values from -100 to 100

(j)

weighted average of polarity scores of the sentiment aspects/topic segments

(k)

score for every aspect/feature of the subject

(l)

score per aspect by calculating the distance between the aspect and sentiment word

(m)

total classification probability close to 1

polarity strength

(a)

-5 (very negative) to 5 (very positive)

(b)

1 (no sentiment) to 5 (very strong positive/negative sentiment)

(c)

low (0) to high (5)

(d)

-4 (most negative) to 4 (most positive)

(e)

weak/strong (relative strength)

(f)

Euclidean distance of positive and negative dimensions

10.

polarity intensity

(a)

normal/excited/passionate

(b)

no emotion/a bit/normal/very/extremely

(c)

-3 (negative) to 3 (positive)

11.

sentiment assignment

(a)

total sentiment is the sum of sentiment of all words divided by total number of words (high to low)

(b)

average mean sentiment score

(c)

sentiment index based on the distribution of positive and negative online posts (Oh and Kumar 2017)

(d)

sum of inverse distance weighted sentiment values (+1, -1) of words in textual interactions

(e)

sentiment for a term is computed as [min, max] of all the positive and negative polarities

(f)

average score of associated messages in a time range and overall sentiment trend encoded by colours

12.

other

(a)

cluster heads from sentimental content

(b)

sentiment change detection

In some studies (Sandoval-Almazan and Valle-Cruz 2018; Bouazizi and Ohtsuki 2017; Chou et al. 2017; Karyotis et al. 2017; Furini and Montangero 2016; Gambino and Calvo 2016; Jiang et al. 2015; Yuan et al. 2015), the sentiment polarity was derived from the emotion classification, such as, joy/love/surprise translated to positive, and anger/sadness/fear translated to negative.

4.2.3 Emotion

Emotion refers to a person’s subjective feelings and thoughts, such as love, joy, surprise, anger, sadness and fear (Liu 2010).

emotion classification: 2-level

(a)

happy/sad

(b)

emotional/non-emotional

emotion classification: 3-level

(a)

happy/sad/neutral

emotion classification: 4-level

(a)

happy/sad/angry/surprise

(b)

happy/sad/angry/calm

(c)

joy/sadness/anger/fear

emotion classification: 5-level

(a)

nervous/sympathetic/ashamed/worried/angry

(b)

happy/sad/angry/laughter/scared

(c)

happy/surprised/sad/angry/neutral

emotion classification: 6-level

(a)

joy/sadness/surprise/anger/fear/disgust

(b)

joy/sadness/fear/anger/disgust/unknown

(c)

joy/sadness/fear/anger/surprise/unknown

(d)

fear/anger/disgust/happiness/sadness/surprise

emotion classification: 7-level

(a)

anger/disgust/fear/happy/neutral/sarcastic/surprise

(b)

pleasure/wondering/confirmation/excitement/laughter/tasty/surprise (emotions based on interjections (Cvijikj and Michahelles 2011))

(c)

love-heart/quality/happiness-smile/sadness/amused/anger/thumbs up (emotions based on sentiment carrying words and/or emoticons (Walha et al. 2016))

(d)

joy/surprise/sadness/fear/anger/disgust/unknown

(e)

like/happiness/sadness/disgust/anger/surprise/fear

(f)

joy/love/anger/sadness/fear/dislike/surprise

(g)

anger/joy/love/fear/surprise/sadness/disgust

(h)

joy/sadness/anger/love/fear/thankfulness/surprise

(i)

happiness/sadness/anger/disgust/fear/surprise/neutral

(j)

joy/surprise/fear/sadness/disgust/anger/neutral

(k)

love/happiness/fun/neutral/hate/sadness/anger

(l)

happiness/goodness/anger/sorrow/fear/evil/amazement

emotion classification: 8-level

(a)

anger/embarrassment/empathy/fear/pride/relief/sadness/other

(b)

flow/excitement/calm/boredom/stress/confusion/frustration/neutral

(c)

joy/sadness/fear/anger/anticipation/surprise/disgust/trust

(d)

anger/anxiety/expect/hate/joy/love/sorrow/surprise

(e)

happy/loving/calm/energetic/fearful/angry/tired/sad

(f)

anger/sadness/love/fear/disgust/shame/joy/surprise

emotion classification: 9-level

(a)

surprise/affection/anger/bravery/disgust/fear/happiness/neutral/sadness

emotion classification: 11-level

(a)

neutral/relax/docile/surprise/joy/contempt/hate/fear/sad/anxiety/anger

(b)

joy/excitement/wink/happiness/love/playfulness/surprise/scepticism/support/sadness/annoyance (emotions based on emoticons (Cvijikj and Michahelles 2011))

10.

emotion classification: 22-level

(a)

hope/fear/joy/distress/pride/shame/admiration/reproach/linking/disliking/gratification/remorse/gratitude/anger/satisfaction/fears-confirmed/relief/disappointment/happy-for/resentment/gloating/pity (emotions based on an Emotion-Cause-OCC model that describe the eliciting conditions of emotions (Gao et al. 2015))

11.

emotion–anger classification: 7-level

(a)

frustration/sulking/fury/hostility/indignation/envy/annoyance

12.

emotion score

(a)

valence/arousal/dominance ranging from 1 (low) to 9 (high)

(b)

prediction/valence/arousal/outcome from 0 (low) to 100 (high)

13.

emotion intensity

(a)

0 (minimum) to 1 (maximum)

(b)

0 (minimum) to 9 (maximum)

(c)

high/medium/low

14.

emotion–happiness measurement

(a)

average happiness score

A study (Munezero et al. 2015) mapped the observed emotions into two broad categories of enduring sentiments: ‘like’ and ‘dislike’. The former includes emotions that have a positive evaluation of the object, i.e., joy, trust and anticipation, and the latter includes emotions that have a negative evaluation of the object, i.e., anger, fear, disgust, and sadness.

It is important to note that some of the emotion categories listed above are based on published theories of emotion, with the most popular ones being Paul Ekman’s six basic emotions (anger, disgust, fear, happiness, sadness and surprise) (Ekman 1992), and Plutchik’s eight primary emotions (anger, fear, sadness, disgust, surprise, anticipation, trust, and joy) (Plutchik 1980). Moreover, other studies have used emotion categories that are influenced from emotional state/psychological models, such as the Pleasure Arousal Dominance (Mehrabian 1996) and the Ortony, Clore and Collins (commonly referred to as OCC) (Ortony et al. 1988).

Several studies (Xu et al. 2012; Furini and Montangero 2016; Walha et al. 2016; Hubert et al. 2018) that targeted emotion classification incorrectly referred to such a task as sentiment analysis. Even though emotions and sentiment are highly related, the former are seen as enablers to the latter, i.e., an emotion/set of emotions affect the sentiment.

4.2.4 Affect

Affect refers to a set of observable manifestations of a subjectively experienced emotion. The basic tasks of affective computing are emotion recognition and polarity detection (Cambria 2016).

affect classification: 4-level

(a)

aptitude/attention/pleasantness/sensitivity (based on the “Hourglass of Emotions”, which was inspired by Plutchik’s studies on human emotions)

When using the affective model mentioned above, sentiment is based on the four independent dimensions mentioned, namely Pleasantness, Attention, Sensitivity, and Aptitude. The different levels of activation of these dimensions constitute the total emotional state of the mind (Hussain and Cambria 2018). The semi-supervised learning model proposed by Hussain and Cambria (2018) based on the merged use of multi-dimensional scaling by means of random projections and biased SVM, has been exploited for the inference of semantics and sentics (conceptual and affective information) that are linked with concepts in a multi-dimensional vector space, in accordance with this affective model. This is used to carry out sentiment polarity detection and emotion recognition in cases when there is a lack of labelled common-sense data.

4.2.5 Irony

Irony is usually used to convey, the opposite meaning of the actual things you say, but its purpose is not intended to hurt the other person.

irony classification: 2-level

(a)

ironic/non-ironic

4.2.6 Sarcasm

Sarcasm holds the “characteristic” of meaning the opposite of what you say, but unlike irony, it is used to hurt the other person.

sarcasm classification: 2-level

(a)

sarcastic/non-sarcastic

(b)

yes/no

4.2.7 Mood

Mood refers to a conscious state of mind or predominant emotional state of person or atmosphere of groups, people or places, at any point in time.

mood classification: 6-level

(a)

composed-anxious/agreeable-hostile/elated-depressed/confident-unsure /energetic-tired/clearheaded-confused (based on the profile of mood states (POMS) Bipolar questionnaire (McNair et al. 1971) which is designed by psychologists to assess human mood states)

(b)

calm/alert/sure/vital/kind/happy (based on GPOMS (Bollen et al. 2011) which expands the POMS Bipolar questionnaire to capture a wider variety of naturally occurring mood terms in tweets)

mood classification: 8-level

(a)

happy/loving/calm/energetic/fearful/angry/tired/sad

4.2.8 Aggressiveness

Del Bosque and Garza (2014) assume that aggressive text detection is a sub-task of sentiment analysis, which is closely related to document polarity detection. Their reasoning is that aggressive text can be seen as intrinsically negative.

Aggressiveness detection

(a)

aggressiveness score ranging from 0 (no aggression) to 10 (strong aggression)

4.2.9 Other

Opinion retrieval

(a)

opinion score from 0 (minimum) to 5 (maximum)

4.3 Impact of sarcasm and irony on social opinions

Sarcasm and irony are often confused and/or misused. This leads to their classification in being very difficult even for humans (Unankard et al. 2014; Buscaldi and Hernandez-Farias 2015), with most users holding negative views on such messages (Unankard et al. 2014). The study by Buscaldi and Hernandez-Farias (Buscaldi and Hernandez-Farias 2015) is a relevant example, whereby a large number of false positives were identified in the tweets classified as ironic. Moreover, such tasks are also very time consuming and labour intensive particularly with the rapid growth in volume of online social data. Therefore, not many studies focused and/or catered for sarcasm and/or irony detection.

4.3.1 Challenges

The majority of the reviewed proposed approaches are not equipped to cater for traditional limitations, such as negation effects or ironic phenomena in text (Castellucci et al. 2015). Such opinion mining tasks face several challenges, with the main ones being:

Different languages and cultures result in various ways of how an opinion is expressed on certain social media platforms. For example, Sina Weibo users prefer to use irony when expressing negative polarity (Wang et al. 2014). Future research is required for the development of cross-lingual/multilingual NLP tools that are able to identify irony and sarcasm (Yan et al. 2014).
Presence of sarcasm and irony in social data, such as tweets, may affect the feature values of certain machine learning algorithms. Therefore, further advancement is required in the techniques used for handling sarcastic and ironic tweets (Pandey et al. 2017). The work in Sarsam et al. (2020) addresses the main challenges faced for sarcasm detection in Twitter and the machine learning algorithms that can be used in this regard.
Classifying/rating a given sentence’s sentiment is very difficult and ambiguous, since people often use negative words to be humorous or sarcastic.
Sarcasm and/or irony annotation is very hard for humans and thus it should be presented to multiple persons for accuracy purposes. This makes it very challenging to collect large datasets that can be used for supervised learning, with the only possible way being to hire people to carry out such annotations (D’Asaro et al. 2017). Moreover, the differentiation between sarcasm and irony by human annotators result in a lack of available datasets and datasets with enough examples of ironic and/or sarcastic annotations. Such datasets are usually needed for “data hungry” computational learning methods (Sykora et al. 2020).

4.3.2 Observations

Table 31 lists the studies within the review analysis that focused on sarcasm and/or irony. These account for only 18 out of 465 reviewed papers. One can clearly note the research gap that exists within these research areas.

Table 31

Studies adopting sarcasm and/or irony

Sarcasm	Irony	Studies
✓		Baccouche et al. (2018), Bouazizi and Ohtsuki (2018), Ghiassi and Lee (2018), Abdullah and Zolkepli (2017), Bouazizi and Ohtsuki (2017), Caschera et al. (2016), Tan et al. (2014), Unankard et al. (2014), Mejova et al. (2013), Bakliwal et al. (2013), Mejova and Srinivasan (2012) and Wang et al. (2012)
	✓	Buscaldi and Hernandez-Farias (2015), Hernandez-Farias et al. (2014), Bosco et al. (2013) and Reyes et al. (2013)
✓	✓	Fersini et al. (2015) and Pandey et al. (2017)

The following is an overview of the studies’ main results and observations:

Bosco et al. (2013): The authors found that irony is normally used together with a positive statement to express a negative statement, but seldomly the other way. Analysis shows that the Senti-TUT¹⁰⁶ corpus can be representative for a wide range of irony in phenomena from bitter sarcasm to genteel irony.
Reyes et al. (2013): The study describes a number of textual features used to identify irony at a linguistic level. These are mostly applicable for short texts, such as tweets. The developed irony detection model is evaluated in terms of representativeness and relevance. Authors also mention that there are overlaps in occurrences of irony, satire, parody and sarcasm, with their main differentiators being tied to usage, tone and obviousness.
Mejova et al. (2013): A multi-stage data-driven political sentiment classifier is proposed in this study. The authors found out “that a humorous tweet is 76.7% likely to also be sarcastic”, whereas “sarcastic tweets are only 26.2% likely to be humorous”. Future work is required on the connection between sarcasm and humour.
Fersini et al. (2015): Addresses the automatic detection of sarcasm and irony by introducing an ensemble approach based on Bayesian Model Averaging, that takes into account several classifiers according to their reliabilities and their marginal probability predictions. Results show that not all the features are equally able to characterise sarcasm and irony, whereby sarcasm is better characterised by POS tags, and ironic statements by pragmatic particles (such as emoticons and emphatic/onomatopoeic expressions, which represent those linguistic elements typically used in social media to convey a particular message).
Jiang et al. (2015): The authors’ model classifies subjectivity, polarity and emotion in microblogs. Results show that emoticons are a pure carrier of sentiment, whereas sentiment words have more complex senses and contexts, such as negations and irony.
Wang et al. (2012): Post-facto analysis of user-generated content, such as tweets, show that political tweets tend to be quite sarcastic.
Ghiassi and Lee (2018): Certain keywords or hash-tagged words (e.g., “thanks”, “#smh”, “ #not”) that follow certain negative or positive sentiment markers in textual social data, usually indicate the presence of sarcasm.

Around half of the studies analysed focused their work on a particular real-world application area (or multiple), where Fig. 3 shows the ones applicable for this systematic review. Note that each circle represents an application area, where the size reflects the number of studies within the particular application area. The smallest circles represent a minimum of two studies that pertain to the respective application area, whereas the biggest circle reflects the most popular application area. Intersecting circles represent application areas that were identified as being related to each other based on the analysis conducted.

The Politics domain is the dominant application area with 45 studies applying SOM on different events, namely elections (Elouardighi et al. 2017; Bansal and Srivastava 2018; Nugroho et al. 2017; Chen 2018; Nausheen and Begum 2018; Abdullah and Hadzikadic 2017; Joyce and Deng 2017; Soni et al. 2017; Salari et al. 2018; Fatyanosa and Bachtiar 2017; Juneja and Ojha 2017; Sandoval-Almazan and Valle-Cruz 2018; Zhou et al. 2017; Le et al. 2017; Yuan et al. 2014; Ramteke et al. 2016; Smailović et al. 2015; Burnap et al. 2016; Rill et al. 2014; Anjaria and Guddeti 2014; Kuo et al. 2016; Batista and Ratté 2014; Mejova et al. 2013; Hoang et al. 2013; Gonçalves et al. 2013; Unankard et al. 2014; Wang et al. 2012; Maynard and Funk 2011; Bosco et al. 2013; Bakliwal et al. 2013; Tumasjan et al. 2010), reforms, such as equality marriage (Lai et al. 2015), debates (Tapia and Velásquez 2014), referendums (Pavel e al. 2017; Fang and Ben-Miled 2017), political parties or politicians (Ozer et al. 2017; Javed et al. 2014; Taddy 2013), and political events, such as terrorism, protests, uprisings and riots (Sachdeva et al. 2018; Kamyab et al. 2018; Bouchlaghem et al. 2016; Mejova and Srinivasan 2012; de Souza Carvalho et al. 2016; Sheth et al. 2014; Weiss et al. 2013).

In terms of Marketing & Advertising & Sales, 29 studies focused on brand/product management and/or awareness (Giachanou et al. 2017; Ayoub and Elgammal 2018; Ghiassi and Lee 2018; Li and Fleyeh 2018; Ducange and Fazzolari 2017; Husnain et al. 2017; Teixeira and Laureano 2017; Halibas et al. 2018; Hu et al. 2017; Abdullah and Zolkepli 2017; Zimbra et al. 2016; Cho et al. 2014; Esiyok and Albayrak 2015; Dasgupta et al. 2015; Ghiassi et al. 2013; Mostafa 2013b; Min et al. 2013; Cvijikj and Michahelles 2011; Li and Li 2013; Gonçalves et al. 2013), products/services in general (Asghar et al. 2018; Kao and Huang 2018; Walha et al. 2016; Polymerou et al. 2014; Li and Li 2013), local marketing (Costa et al. 2014) and online advertising (Adibi et al. 2018; Dragoni 2018; Lewenberg et al. 2015).

The Technology industry-oriented studies (23) focused on either: company perception (Wan et al. 2018; Rout et al. 2018; Lek and Poo 2013; Karpowicz et al. 2013; Jiang et al. 2011), products, such as mobile/smart phones (Rathan et al. 2018; Ray and Chakrabarti 2017; Geetha et al. 2018; Gupta and Joshi 2017; Gandhe et al. 2018; Hridoy et al. 2015; Agrawal et al. 2014; Suresh 2016; Mumu and Ezeife 2014; Erdmann et al. 2014), laptops (Raja and Swamynathan 2016), electronics (Neethu and Rajasree 2013) tablets (Severyn et al. 2016; Jiang et al. 2011), operating systems (Huang et al. 2018), cloud service providers (Qaisi and Aljarah 2016), social media providers (Arslan et al. 2017) and multiple technologies (Vo et al. 2017).

All the 21 studies targeting the Finance domain applied SOM on demonitisation (Gupta and Singal 2017), currencies (Pavel e al. 2017) and the stock market, for risk management (Ishikawa and Sakurai 2017) and predictive analytics (Ghosal et al. 2018; Chen and Zheng 2018; Piñeiro-Chousa et al. 2018; Simões et al. 2017; Tiwari et al. 2017; Sun et al. 2017; Coyne et al. 2017; Zhao et al. 2016; Attigeri et al. 2015; Weiss et al. 2015; Rao and Srivastava 2014; Huang et al. 2015; Porshnev et al. 2013; Porshnev and Redkin 2014; Porshnev et al. 2014; Yu et al. 2013; Bollen et al. 2011; Vu et al. 2012).

Thirteen studies applied SOM on the Film industry for recommendations (Orellana-Rodriguez et al. 2015; Song et al. 2015), box office predictions (Du et al. 2014; Rui et al. 2013) or from a general perspective (Pavel e al. 2017; Sihwi et al. 2018; Permatasari et al. 2018; Orellana-Rodriguez et al. 2013; Yan et al. 2014; Gonçalves et al. 2013; Wang and Ye 2013; Blenn et al. 2012; Chen et al. 2012). Similarly, 13 studies focused on Healthcare, namely on epidemics/infectious diseases (Hong and Sinnott 2018; Lim et al. 2017; Lu et al. 2015; Gonçalves et al. 2013), drugs (Moh et al. 2017; Peng et al. 2016; Wu et al. 2015), hospitals (Gupta and Kohli 2016), vaccines (Song and Gruzd 2017), public health, such as epidemics, clinical science and mental health (Ji et al. 2015, 2016), and in general, such as health-related tweets (Baccouche et al. 2018) and health applications (Pai and Alathur 2018).

In terms of other industries, SOM was applied within the following:

Telecommunications (e.g., telephony, television) on particular service providers (Ghiassi and Lee 2018; Ranjan et al. 2018; Napitu et al. 2017; Fitri et al. 2018; Kumar and Bala 2016; Varshney and Gupta 2014; Wunnasri et al. 2013; Tan et al. 2011; Trung et al. 2013) or complaints (Souza et al. 2016);
Automotive (Vo et al. 2017; Pai and Liu 2018; Fatyanosa et al. 2018; Weichselbraun et al. 2017; Shukri et al. 2015; Bifet et al. 2011; Reyes et al. 2013; Severyn et al. 2016; Erdmann et al. 2014);
Hospitality for restaurant recommendations (Vo et al. 2017; Yang et al. 2013) and hotel/resort perceptions (Rout et al. 2017; Philander and YunYing 2016; Lu et al. 2016; Chen et al. 2015; Molina-González et al. 2014);
Aviation on specific airline services, e.g., customer relationship management (Ghiassi and Lee 2018; Mostafa 2013a; Chen et al. 2016), and air crashes (Gonçalves et al. 2013);
Food either in general (dos Santos et al. 2018; Liu et al. 2015) or on safety (Sun et al. 2014);
Fashion (Mukkamala et al. 2014, 2014).

In terms of domains, the studies focused on:

Sports on football/soccer (Stojanovski et al. 2018; Seron et al. 2015; Guerra et al. 2014), American football (Guerra et al. 2014; Brooks et al. 2014), basketball (Tan et al. 2011; Jiang et al. 2011), cricket (Ahuja and Dubey 2017) and Olympics (Gonçalves et al. 2013);
Government for smart cities (D’Asaro et al. 2017; Anggoro et al. 2016; Li et al. 2016) and e-Government (Hubert et al. 2018; Rezk et al. 2018; Williamson and Ruming 2016);
Environment for policy makers (Sluban et al. 2015), urban mobility (Gallegos et al. 2016), wind energy (Politopoulou and Maragoudakis 2013), green initiatives (Rai et al. 2018) and peatland fires (Gandhe et al. 2018);
E-commerce for product recommendations (Xie et al. 2012; Lau et al. 2014), crisis management (Park et al. 2011), decision making (D’Avanzo and Pilato 2015) and policy making (Omar et al. 2017);
Education for e-learning (Ortigosa et al. 2014; Karyotis et al. 2017) and on universities (Abdelrazeq et al. 2016);
Transportation for ride hailing services and logistics (Anastasia and Budi 2016) and traffic conditions (Cao et al. 2018).

Moreover, other studies applied SOM in the following areas:

Personalities (Ali et al. 2018; Ghiassi and Lee 2018; Arslan et al. 2017; Tasoulis et al. 2018; Goel et al. 2018; Poortvliet and Wang 2018; Wang et al. 2018; Jiang et al. 2011; Tan et al. 2011; Khan et al. 2014; Kranjc et al. 2013);
Natural Disasters on earthquakes (Aoudi and Malik 2018; Ragavi and Usharani 2014; Zhang et al. 2012; Thelwall et al. 2011), flooding (Buscaldi and Hernandez-Farias 2015), explosions (Ouyang et al. 2017) and in general (Sangameswar et al. 2017);
Aggressive Behaviour in relation to crime (Kitaoka and Hasuike 2017; Chen et al. 2015; Zainuddin et al. 2016), cyberbullying (Del Bosque and Garza 2014), bullying (Xu et al. 2012) and violence and disorder (Jurek et al. 2014);
Main/Breaking Events such as Black Friday (Choi and Kim 2013), Oscars, TV shows, product launch, earthquake (Thelwall et al. 2011), accidents e.g., shootings (Singh et al. 2018; Akcora et al. 2010) and in general (Stojanovski et al. 2018);
Liveability in terms of place design to supports local authorities, urban designers and city planners (You and Tunçer 2016), and government services, such as welfare (Flaes et al. 2016);
Digital Forensics (Andriotis et al. 2014; Aboluwarin et al. 2016).

Lastly, 19 further studies –not represented in Fig. 3– focused on the following application areas: Human Development (Zafar et al. 2016), Human Mobility (Kokkinogenis et al. 2015), Public Facilities (Ramadhani et al. 2016), Smart Cities (Li et al. 2017), Web Publishing (Tian et al. 2015), Sponsorships (Kaushik and Dey 2016), Countries (Khan et al. 2014), Industry (Trung et al. 2013), Entertainment (Trung et al. 2013), Refugee/Migrant crisis (Lee and Nerghes 2017), Tourism (Michailidis et al. 2018), Music (Radhika and Sankar 2017), Cryptocurrency (Pant et al. 2018), Economy (Gupta and Singal 2017), Social Issues (Vora and Chacko 2017), Law (Gandhe et al. 2018), Insurance/Social Security (Zhang et al. 2017), Geographic Information (Stojanovski et al. 2018) and Social Interactions (Vivanco et al. 2017).

6 Concluding remarks

This section presents the latest research developments and advancements within the SOM research area (Sect. 6.1) and presents the overall conclusions of this systematic review in terms of target audience and future research and development in (Sect. 6.2).

Given that this systematic review covers studies till 2018, some recent developments and advancements from 2019 till 2021 shall be discussed within this sub-section. This shows the fast research turnaround in SOM which has kept evolving at an incredibly fast rate, thus reiterating its validity and popularity as a research area.

The number of studies using Deep Learning approaches continued to increase (as reflected in Table 5), especially ones using certain deep learning techniques, such as CNNs, RNNs, LSTM, GRU and Deep Belief Networks (Yadav and Vishwakarma 2020; Wadawadagi and Pagi 2020), and with the introduction of new techniques, such as Transfer Learning. This is supported by numerous studies (Carvalho and Plastino 2021; Eke et al. 2020) who have noted that researchers are shifting from using traditional machine learning techniques to deep learning ones. Carvalho and Plastino (2021) focus on sentiment analysis on tweets, Xu et al. (2020) focus on emotion classification on tweets, Akhtar et al. (2020) focus on sentiment and emotion intensity, Cignarella et al. (2020) focus on irony detection of English, Spanish, French and Italian tweets, whereas Eke et al. (2020) focus on sarcasm detection with Twitter also being the social media platform mostly used in this research area.

Transfer learning is a deep learning technique where a model is trained for one or more tasks (source tasks), which learnt knowledge is applied to a related second task (target task) (Pan and Yang 2009). In particular, the Transformer model architecture introduced by Vaswani et al. (2017) in 2017, is based on attention mechanisms and is designed to handle sequential data like natural language for NLP tasks, such as sentiment analysis and text summarisation. This has coincided with the advancement of SOM for different opinion dimensions, such as sentiment polarity (Nguyen et al. 2020; Naseem et al. 2020), emotion (Acheampong et al. 2021), and irony (Nguyen et al. 2020), especially studies focused on adaptation to new domains and/or knowledge transfer from one language to another. The latter application is extremely reliable for cross-lingual adaptation where a labelled dataset is available in one language e.g., English, which is then applied to another language, such as low-resourced languages (Ruder 2017).

With respect to language, more SOM studies supporting languages other than the popular ones (such as English and Chinese) are on the rise. In Rani and Kumar (2019), the authors discuss the growth of research work in the fields of sentiment and emotion analysis for Indian languages. Moreover, Buechel et al. (2020) created emotion lexicons for 91 languages for sentiment and emotion analysis. Other recent studies have focused on languages, such as Urdu for sentiment analysis (Mukhtar and Khan 2019), Maltese for sentiment and emotion analysis and sarcasm/irony detection (Cortis and Davis 2019), Indonesian for sentiment analysis (Koto et al. 2020), Portuguese for sentiment and emotion analysis (Pereira 2021), and Arabic for sentiment and emotion analysis (Alhumoud and Al Wazrah 2021). Studies on code-switched languages is also on the increase, with Bansal et al. (2020) demonstrating how Hindi-English code-switching patterns from tweets can be used to improve sarcasm detection, and Appidi et al. (2020) analysing code-switched Kannada-English from tweets for emotion classification.

In terms of modality, the visual modality is gaining more interest in the SOM research community. In Akhtar et al. (2019), the authors propose a deep multi-task learning framework that carries out sentiment and emotion analysis from the textual, acoustic and visual frames of video data obtained from YouTube. On the other hand, Kumar and Garg (2019) propose a multi-modal sentiment analysis model for Twitter, where the sentiment polarity and strength is extracted from tweets based on their text and images (typographic and/or infographic).

More research has been published on aspect-based SOM, with Jiang et al. (2020) focused on sentiment polarity in both single-aspect and multi-aspect scenarios, whereas Hyun et al. (2020) focused on sentiment polarity in the automotive domain for the English and Korean languages.

In terms of application areas, the ones identified in Sect. 5 are still very popular, with research in new sub-domains emerging. In particular, several studies (Kapočiūtė-Dzikienė et al. 2019; Cresci et al. 2019; Guo and Li 2019; Xing et al. 2020; Chen et al. 2020; Mishev et al. 2020) focus on the Finance domain. Xing et al. (2020) identify common error patterns that result in financial sentiment analysis to fail, namely, irrealis mood, rhetoric, dependent opinion, unspecified aspects, unrecognised words, and external reference. On the other hand, in Mishev et al. (2020) evaluate sentiment analysis studies in the Finance domain by starting from lexicon-based approaches and finishes with the ones that use Transformers, such as the Bidirectional Enconder Representations from Transformers (BERT) (Devlin et al. 2018) and the Robustly optimised BERT approach (RoBERTa) (Liu et al. 2019).

The ongoing coronavirus disease (COVID-19) global pandemic has led to a rise in SOM studies analysing social opinions in terms of different dimensions, such as sentiment polarity. The work in Müller et al. (2020) released a COVID-19 Transformer-based model that was pre-trained on multiple datasets of tweets from Twitter. These datasets contained tweets on various topics, such as vaccine sentiment and maternal vaccine stance, and used other well known datasets, such as SemEval 2016—Task 4 which was previously discussed in Sect. 3.3. This model was pre-trained to carry out sentiment analysis on tweets written in other languages, such as Arabizi—a written form of spoken Arabic that relies on Latin characters and digits (Baert et al. 2020). On the other hand, Kruspe et al. (2020) presented sentiment analysis results of 4.6 million European tweets for the initial period of COVID-19 (December 2019 till April 2020), which results were aggregated by country (Italy, Spain, France, Germany, United Kingdom) and averaged over time. An ANN was trained to carry out sentiment analysis, which model was compared with several pre-trained models, such as BERT which is trained on BookCorpus and English Wikipedia data (Devlin et al. 2018), a multilingual version of BERT trained on COVID-19 tweets (Müller et al. 2020), and the Embeddings from Language Models (ELMO) trained on the 1 Billion Word Benchmark dataset.

In terms of NLP tools, Hugging Face¹⁰⁷ provides a state-of-the-art Transformer library for Pytorch and TensorFlow 2.0¹⁰⁸. Therefore, it provides general-purpose architectures, such as BERT, GPT-2 (Radford et al. 2019), RoBERTa, cross-lingual language model (XLM) (Lample and Conneau 2019), DistilBert (Sanh et al. 2019), and XLNET (Yang et al. 2019) for NLP tasks (like sentiment analysis), where over 32+ pre-trained models are available in 100+ languages. Similarly, TensorFlow Hub¹⁰⁹ provides a repository of trained machine learning models, with a variety of them using the Transformer architecture¹¹⁰, such as BERT.

The carbon footprint for training new deep learning models should always be taken in consideration especially if a large number of Central Processing Units (CPUs), Graphical Processing Units (GPUs), or Tensor Processing Units (TPUs) are needed. This in turn increases the related costs for model training, which is becoming very expensive and is expected to keep increasing in the future. In Strubell et al. (2019), Strubell et al. mention that such costs amount to both the financial aspect in terms of hardware and electricity or cloud compute time, and the environmental aspect in terms of carbon footprint needed to fuel modern tensor processing hardware. Therefore, researchers should report the training time and computational resources needed in their published work, and they should prioritise computationally efficient algorithms and hardware that need less energy.

6.2 Conclusion

The main aim of this systematic review is to provide in-depth analysis and insights on the most prominent technical aspects, dimensions and application areas of SOM. The target audience of this comprehensive review is three fold:

Early-Stage Researchers who are interested in working within this evolving research field of study and/or are looking for an overview of this field;
Experienced Researchers already working in SOM who would like to progress further on the technical side of their work and/or looking for weaknesses in the the field of SOM;
Early-Stage and/or Experienced Researchers who are looking into applying SOM/their SOM work in a real-world application area.

The identification of the current literature gaps within the SOM field of study is one of the main contributions of this systematic review. An overview below provides a pathway to future research and development work:

Social Media Platforms: Most studies focus on data gathered from one social media platform, with Twitter being the most popular followed by Sina Weibo for Chinese targeted studies. It is encouraged to possibly explore multi-source information by using other platforms, thus use data from multiple data sources, subject to any existing API limitations¹¹¹. This shall increase the variety and volume of data (two of the V’s of Big Data) used for evaluation purposes, thus ensuring that results provide more reflective picture of society in terms of opinions. The use of multiple data sources for studies focusing on the same real-world application areas are also beneficial for comparison purposes and identification of any potential common traits, patterns and/or results. Mining opinions from multiple sources of information also presents several advantages, such as higher authenticity, reduced ambiguity and greater availability (Balazs and Velásquez 2016).
Techniques: The use of Deep Learning, Statistical, Probabilistic, Ontology and Graph-based approaches should be further explored both as standalone and/or part of hybrid techniques, due to their potential and accessibility. In particular, Deep Learning capabilities has made several applications feasible, whereas Ontologies and Graph Mining enable fine-grained opinion mining and the identification of relationships between opinions and their enablers (person, organisation, etc.). Moreover, ensemble Machine Learning and Deep Learning methods and fine-tuned Transformed-based models are still under-explored. In such a case, researchers should be attentive to the carbon footprint needed to train neural network models for NLP.
Social Datasets: The majority of available datasets are either English or Chinese specific. This domain needs further social datasets published under a common open license for use by the public domain. These should target any of the following criteria: bi-lingual/multilingual data, and/or annotations of multiple opinion dimensions within the data, e.g., sentiment polarity, emotion, sarcasm, irony, mood, etc. Both requirements are costly in terms of resources (time, funding and personnel), domain knowledge and expertise.
Language: The majority of the studies support one language, with English and Chinese being the most popular. Studies that support two or more languages is one of the major challenges in this domain due to numerous factors, such as cultural differences and lack of language-specific resources, e.g., lexicons, datasets, tools and technologies. This domain also needs more studies that focus on code-switched languages and less-resourced languages, which shall enable the development of certain language resources needed for the respective code-switched and less-resourced languages.
Modality: Bi-/Multi-modal SOM is another sub-domain that requires several research. Several studies cater for the text modality only, with the visual—image modality gaining more popularity. However, the visual—video and audio modalities are still in their early research phases with several aspects still unexplored. This also stems from a lack of available visual, audio and multimodal datasets.
Aspect-based SOM: Research in this sub-domain is increasing and developing, however, it is far from the finished article, especially when applied in certain domains. Further aspect-based research is encouraged on other opinion dimensions other than sentiment polarity, such as emotions and moods, which is still unexplored. Moreover, more research is required on the use of Deep Learning approaches for such a task, which is still at an early stage.
Application areas: Most studies target Politics, Marketing & Advertising & Sales, Technology, Finance, Film and Healthcare. Research into other areas/sub-domains is encouraged to study and show the potential of SOM.
Dimensions of SOM: Most studies focus on subjectivity detection and sentiment analysis. The area of emotion analysis is increasing in popularity, however, sarcasm detection, irony detection and mood analysis are still in their early research phases. Moreover, from the analysis of this systematic review it is evident that there is a lack of research on any possible correlations between the different opinion dimensions, e.g., emotions and sentiment. Lastly, no studies cater for all the different SOM dimensions within their work.

Shared evaluation tasks, such as International Workshop on Semantic Evaluation (SemEval), focused on any one of the current research gaps identified above, are very important and shall contribute to the advancement of the SOM research area. Therefore, researchers are encouraged to engage in these tasks through their participation and/or organisation of new tasks, since these shall advance the SOM research area.

In conclusion, as identified through this systematic review, a fusion of social opinions represented in multiple sources and in various media formats can potentially influence multiple application areas.

Acknowledgements

This work is funded by the ADAPT Centre for Digital Content Technology which is funded under the SFI Research Centres Programme (Grant No. 13/RC/2106).

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Not applicable.

Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Vorheriger Artikel Deep learning for Alzheimer prediction using brain biomarkers

Nächster Artikel Bee-inspired metaheuristics for global optimization: a performance comparison

http://www.twitter.com/.

http://www.merriam-webster.com/.

https://www.linkedin.com/.

https://www.tripadvisor.com/.

http://www.weibo.com.

https://www.facebook.com/.

http://www.merriam-webster.com/dictionary/opinion.

http://noisy-text.github.io/2016/ner-shared-task.html.

Classification in this context refers to the dimension of opinion mining being conducted, e.g., subjectivity detection, sentiment analysis, sarcasm detection, emotion analysis, etc.

https://dl.acm.org/.

http://ieeexplore.ieee.org/Xplore/home.jsp.

https://www.sciencedirect.com/.

https://link.springer.com/.

https://webofknowledge.com/.

https://www.elsevier.com/solutions/engineering-village/content/compendex.

http://scholar.google.com/.

https://slashdot.org/.

https://www.imdb.com/.

Proceedings returned from IEEE Xplore: http://ieeexplore.ieee.org/stamp/stamp.jsp?reload=true&arnumber=7344780 where the “Emotion and Sentiment in Intelligent Systems and Big Social Data Analysis (SentISData)” track was relevant for this study.

https://www.techinasia.com/social-media-timeline-2010.

https://aaai.org/Library/library.php.

http://aclweb.org/anthology/.

https://onlinelibrary.wiley.com/.

https://www.youtube.com/.

http://t.qq.com/.

https://www.instagram.com.

https://www.flickr.com/.

https://myspace.com/.

http://digg.com/.

https://foursquare.com/.

https://stocktwits.com/.

https://www.linkedin.com/.

https://www.plurk.com.

https://weixin.qq.com/.

https://www.patientslikeme.com/.

https://www.apontador.com.br/.

https://plus.google.com/.

https://developers.facebook.com/docs/public_feed/.

Rathan et al. (2018) and Hagen et al. (2015) propose two lexicon based methods, which are included in total techniques.

Adverbs/adverbial phrases that strengthen the meaning of other expressions and show emphasis e.g., very, extremely.

Words/phrases that reduce the force of another word/phrase e.g., hardly, slightly.

https://sentiwordnet.isti.cnr.it/.

https://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html.

https://github.com/fnielsen/afinn.

http://sentistrength.wlv.ac.uk/.

https://mpqa.cs.pitt.edu/lexicons/subj_lexicon/.

http://www.keenage.com/.

https://www.saifmohammad.com/WebPages/NRC-Emotion-Lexicon.htm.

https://wordnet.princeton.edu/.

https://en.wikipedia.org/wiki/List_of_emoticons.

https://www.sentic.net/.

https://scikit-learn.org/stable/modules/generated/sklearn.svm.SVC.html#sklearn.svm.SVC.

https://scikit-learn.org/stable/modules/generated/sklearn.svm.NuSVC.html#sklearn.svm.NuSVC.

https://scikit-learn.org/stable/modules/generated/sklearn.svm.LinearSVC.html#sklearn.svm.LinearSVC.

http://weka.sourceforge.net/doc.dev/weka/classifiers/trees/J48.html.

Several studies contain multiple hybrid methods, which are included in the respective hybrid approach total.

http://saifmohammad.com/WebPages/lexicons.html.

http://sag.art.uniroma2.it/demo-software/distributional-polarity-lexicon/.

https://azure.microsoft.com/en-us/services/cognitive-services/text-analytics/.

https://textblob.readthedocs.io/en/dev/.

https://developer.twitter.com/en/docs/tweets/search/overview.

http://twitter4j.org/en/—a Java library for the Twitter API.

Original dataset points to http://www.sananalytics.com/lab/twitter-sentiment/ which is not online anymore.

https://www.cs.york.ac.uk/semeval-2013/task2/.

http://alt.qcri.org/semeval2014/task9/.

http://alt.qcri.org/semeval2015/task10/.

http://alt.qcri.org/semeval2016/task4/.

http://tcci.ccf.org.cn/conference/2012/.

http://tcci.ccf.org.cn/conference/2013/.

Malayalam is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry by the Malayali people.

http://conceptnet.io/—an open, multilingual knowledge graph.

http://www.ee.columbia.edu/ln/dvmm/vso/download/sentibank.html.

http://mm.doshisha.ac.jp/senti/CrossSentiment.html.

https://www.nltk.org/.

http://www.cs.cmu.edu/~ark/TweetNLP/.

https://nlp.stanford.edu/software/.

http://ictclas.nlpir.org/.

https://code.google.com/archive/p/word2vec/.

https://www.cs.waikato.ac.nz/ml/weka/.

https://scikit-learn.org/.

https://www.csie.ntu.edu.tw/~cjlin/libsvm/.

https://www.csie.ntu.edu.tw/~cjlin/liblinear/.

http://svmlight.joachims.org/.

http://sentistrength.wlv.ac.uk/.

http://www.sentiment140.com/.

https://github.com/cjhutto/vaderSentiment.

https://www.mysql.com/.

https://www.postgresql.org/.

https://aws.amazon.com/rds/.

https://www.microsoft.com/en-us/sql-server/sql-server-downloads.

https://www.mongodb.com/.

http://couchdb.apache.org/.

https://hbase.apache.org/.

https://hadoop.apache.org/.

https://spark.apache.org/.

https://www.ibm.com/developerworks/library/bd-streamsintro/index.html.

https://asterixdb.apache.org/.

https://storm.apache.org/.

100

https://hive.apache.org/.

101

https://databricks.com/.

102

http://www.datagenetics.com/blog/october52012/index.html.

103

https://fasttext.cc/.

104

https://nlp.stanford.edu/projects/glove/.

105

Note that some level categories are dependant on the domain.

106

http://www.di.unito.it/~tutreeb/sentiTUT.html.

107

https://huggingface.co/.

108

https://huggingface.co/transformers/.

109

https://www.tensorflow.org/hub.

110

https://tfhub.dev/google/collections/transformer_encoders_text.

111

Due to GDPR, API coverage in terms of which data can be accessed is being tightened in terms of control, which can be a major issue faced by researchers.

Abdelhameed HJ, Muñoz-Hern’andez S (2017) Emotion and opinion retrieval from social media in Arabic language: survey. In: 2017 joint international conference on information and communication technologies for education and training and international conference on computing in Arabic (ICCA-TICET). IEEE, pp 1–8

Abdelrazeq A, Janßen D, Tummel C, Jeschke S, Richert A (2016) Sentiment analysis of social media for evaluating universities. In: Automation, communication and cybernetics in science and engineering 2015/2016. Springer, pp 233–251

Abdelwahab O, Bahgat M, Lowrance CJ, Elmaghraby A (2015) Effect of training set size on SVM and Naive Bayes for twitter sentiment analysis. In: 2015 IEEE international symposium on signal processing and information technology (ISSPIT). IEEE, pp 46–51

Abdullah M, Hadzikadic M (2017a) Sentiment analysis of twitter data: emotions revealed regarding Donald Trump during the 2015–16 primary debates. In: 2017 IEEE 29th international conference on tools with artificial intelligence (ICTAI). IEEE, pp 760–764

Abdullah M, Hadzikadic M (2017b) Sentiment analysis on Arabic tweets: challenges to dissecting the language. In: International conference on social computing and social media. Springer, pp 191–202

Abdullah NSD, Zolkepli IA (2017) Sentiment analysis of online crowd input towards brand provocation in facebook, twitter, and instagram. In: Proceedings of the international conference on big data and internet of thing. ACM, pp 67–74

Abdul-Mageed M, Diab M, Kübler S (2014) Samar: subjectivity and sentiment analysis for Arabic social media. Comput Speech Lang 28(1):20–37CrossRef

Aboluwarin O, Andriotis P, Takasu A, Tryfonas T (2016) Optimizing short message text sentiment analysis for mobile device forensics. In: IFIP international conference on digital forensics. Springer, pp 69–87

Acheampong FA, Nunoo-Mensah H, Chen W (2021) Transformer models for text-based emotion detection: a review of Bert-based approaches. Artif Intell Rev 2021:1–41

Adibi F, Majidi B, Eshghi M (2018) Personalized advertisement in the video games using deep social network sentiment analysis. In: 2018 2nd national and 1st international digital games research conference: trends, technologies, and applications (DGRC). IEEE, pp 104–108

Agrawal R, Srikant R et al (1994) Fast algorithms for mining association rules. In: Proceedings of 20th international conference on very large data bases, VLDB, vol 1215, pp 487–499

Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau R (2011) Sentiment analysis of twitter data. In: Proceedings of the workshop on languages in social media. Association for Computational Linguistics, pp 30–38

Agrawal D, Budak C, El Abbadi A, Georgiou T, Yan X (2014) Big data in online social networks: User interaction analysis to model user behavior in social networks. DNIS. Springer, pp 1–16

Ahuja S, Dubey G (2017) Clustering and sentiment analysis on twitter data. In: 2017 2nd international conference on telecommunication and networks (TEL-NET). IEEE, pp 1–5

Aisopos F, Papadakis G, Tserpes K, Varvarigou T (2012) Content vs. context for sentiment analysis: a comparative analysis over microblogs. In: Proceedings of the 23rd ACM conference on hypertext and social media. ACM, pp 187–196

Akcora CG, Bayir MA, Demirbas M, Ferhatosmanoglu H (2010) Identifying breakpoints in public opinion. In: Proceedings of the first workshop on social media analytics. ACM, pp 62–66

Akhtar MS, Chauhan DS, Ghosal D, Poria S, Ekbal A, Bhattacharyya P (2019) Multi-task learning for multi-modal emotion recognition and sentiment analysis. arXiv preprint arXiv:1905.05812

Akhtar MS, Ekbal A, Cambria E (2020) How intense are you? Predicting intensities of emotions and sentiments using stacked ensemble [application notes]. IEEE Comput Intell Mag 15(1):64–75CrossRef

Al Shammari AS (2018) Real-time twitter sentiment analysis using 3-way classifier. In: 2018 21st Saudi computer society national computer conference (NCC). IEEE, pp 1–3

Alharbi ASM, DeDoncker E (2017) Enhance a deep neural network model for twitter sentiment analysis by incorporating user behavioral information. In: International conference on intelligent computing. Springer, pp 81–88

Alhumoud SO, Al Wazrah AA (2021) Arabic sentiment analysis using recurrent neural networks: a review. Artif Intell Rev 2021:1–42

Ali K, Hamilton M, Thevathayan C, Zhang X (2018) Big social data as a service: a service composition framework for social information service analysis. In: International conference on web services. Springer, pp 487–503

Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46(3):175–185MathSciNet

Alzahrani SM (2018) Development of iot mining machine for twitter sentiment analysis: mining in the cloud and results on the mirror. In: 2018 15th learning and technology conference (L&T). IEEE, pp 86–95

Ameur H, Jamoussi S, Hamadou AB (2018) A new method for sentiment analysis using contextual auto-encoders. J Comput Sci Technol 33(6):1307–1319CrossRef

Anastasia S, Budi I (2016) Twitter sentiment analysis of online transportation service providers. In: 2016 international conference on advanced computer science and information systems (ICACSIS). IEEE, pp 359–365

Andriotis P, Takasu A, Tryfonas T (2014) Smartphone message sentiment analysis. In: IFIP international conference on digital forensics. Springer, pp 253–265

Anggoro TP, Nainggolan B, Purwandesi E et al (2016) Social media analysis supporting smart city implementation (practical study in bandung district). In: 2016 international conference on ICT for smart society (ICISS). IEEE, pp 80–86

Anjaria M, Guddeti RMR (2014) A novel sentiment analysis of social networks using supervised learning. Soc Netw Anal Min 4(1):181CrossRef

Aoudi S, Malik A (2018) Lexicon based sentiment comparison of iphone and android tweets during the Iran–Iraq earthquake. In: 2018 fifth HCT information technology trends (ITT). IEEE, pp 232–238

Appidi AR, Srirangam VK, Suhas D, Shrivastava M (2020) Creation of corpus and analysis in code-mixed kannada-english twitter data for emotion prediction. In: Proceedings of the 28th international conference on computational linguistics, pp 6703–6709

Arslan Y, Birturk A, Djumabaev B, Küçük D (2017) Real-time lexicon-based sentiment analysis experiments on twitter with a mild (more information, less data) approach. In: 2017 IEEE international conference on big data (Big Data). IEEE, pp 1892–1897

Arslan Y, Küçük D, Birturk A (2018) Twitter sentiment analysis experiments using word embeddings on datasets of various scales. In: International conference on applications of natural language to information systems. Springer, pp 40–47

Årup Nielsen F (2011) A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. In: Making sense of microposts (#MSM2011), pp 93–98. http://ceur-ws.org/Vol-718/paper_16.pdf

Asghar MZ, Khan A, Khan F, Kundi FM (2018) Rift: a rule induction framework for twitter sentiment analysis. Arab J Sci Eng 43(2):857–877CrossRef

Asiaee TA, Tepper M, Banerjee A, Sapiro G (2012) If you are happy and you know it...tweet. In: Proceedings of the 21st ACM international conference on information and knowledge management. ACM, pp 1602–1606

Attard J, Orlandi F, Scerri S, Auer S (2015) A systematic review of open government data initiatives. Gov Inf Q 32(4):399–418CrossRef

Attigeri GV, M. P. MM, Pai RM, Nayak A (2015) Stock market prediction: a big data approach. In: TENCON 2015-2015 IEEE region 10 conference. IEEE, pp 1–5

Averchenkov V, Budylskii D, Podvesovskii A, Averchenkov A, Rytov M, Yakimov A (2015) Hierarchical deep learning: A promising technique for opinion monitoring and sentiment analysis in Russian-language social networks. In: Creativity in intelligent, technologies and data science: first conference, CIT&DS 2015, Volgograd, Russia, September 15–17, 2015, Proceedings. Springer, pp 583–592

Ayoub A, Elgammal A (2018) Utilizing twitter data for identifying and resolving runtime business process disruptions. In: OTM confederated international conferences “On the Move to Meaningful Internet Systems”. Springer, pp 189–206

Azzouza N, Akli-Astouati K, Oussalah A, Bachir SA (2017) A real-time twitter sentiment analysis using an unsupervised method. In: Proceedings of the 7th international conference on web intelligence, mining and semantics. ACM, p 15

Baccianella S, Esuli A, Sebastiani F (2010) Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. Lrec 10:2200–2204

Baccouche A, Garcia-Zapirain B, Elmaghraby A (2018) Annotation technique for health-related tweets sentiment analysis. In: 2018 IEEE international symposium on signal processing and information technology (ISSPIT). IEEE, pp 382–387

Baecchi C, Uricchio T, Bertini M, Del Bimbo A (2016) A multimodal feature learning approach for sentiment analysis of social network multimedia. Multimed Tools Appl 75(5):2507–2525CrossRef

Baert G, Gahbiche S, Gadek G, Pauchet A (2020) Arabizi language models for sentiment analysis. In: Proceedings of the 28th international conference on computational linguistics, pp 592–603

Bakliwal A, Foster J, van der Puil J, O’Brien R, Tounsi L, Hughes M (2013) Sentiment analysis of political tweets: towards an accurate classifier. Association for Computational Linguistics 2013

Balaji H, Govindasamy V, Akila V (2016) Social opinion mining and concise rendition. In: 2016 international conference on advanced communication control and computing technologies (ICACCCT). IEEE, pp 641–645

Balazs JA, Velásquez JD (2016) Opinion mining and information fusion: a survey. Inf Fusion 27(2016):95–110CrossRef

Balikas G, Moura S, Amini M-R (2017) Multitask learning for fine-grained twitter sentiment analysis. In: Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval. ACM, pp 1005–1008

Baltas A, Kanavos A, Tsakalidis AK (2017) An apache spark implementation for sentiment analysis on twitter data. In: International workshop of algorithmic aspects of cloud computing. Springer, pp 15–25

Bandhakavi A, Wiratunga N, Massie S, Deepak P (2016) Emotion-corpus guided lexicons for sentiment analysis on twitter. In: Research and development in intelligent systems XXXIII: incorporating applications and innovations in intelligent systems XXIV 33. Springer, pp 71–85

Bansal B, Srivastava S (2018) On predicting elections with hybrid topic based sentiment analysis of tweets. Procedia Comput Sci 135(2018):346–353CrossRef

Bansal S, Garimella V, Suhane A, Patro J, Mukherjee A (2020) Code-switching patterns can be an effective route to improve performance of downstream NLP applications: a case study of humour, sarcasm and hate speech detection. arXiv preprint arXiv:2005.02295

Bao Y, Quan C, Wang L, Ren F (2014) The role of pre-processing in twitter sentiment analysis. In: International conference on intelligent computing. Springer, pp 615–624

Barapatre D, Meena MJ, Ibrahim SS (2016) Twitter data classification using side information. In: Proceedings of the 3rd international symposium on big data and cloud computing challenges (ISBCC–16’). Springer, pp 363–368

Batista L, Ratté S (2014) Multi-classifier system for sentiment analysis and opinion mining. In: Encyclopedia of social network analysis and mining, pp 989–998

Batrinca B, Treleaven PC (2015) Social media analytics: a survey of techniques, tools and platforms. AI Soc 30(1):89–116CrossRef

Baum LE, Petrie T (1966) Statistical inference for probabilistic functions of finite state Markov chains. Ann Math Stat 37(6):1554–1563MathSciNetMATHCrossRef

Becker L, Erhart G, Skiba D, Matula V (2013) Avaya: sentiment analysis on twitter with self-training and polarity lexicon expansion. In: Second Joint Conference on Lexical and Computational Semantics (* SEM), Volume 2: proceedings of the seventh international workshop on semantic evaluation (SemEval 2013), vol 2, pp 333–340

Beigi G, Hu X, Maciejewski R, Liu H (2016) An overview of sentiment analysis in social media and its applications in disaster relief. In: Sentiment analysis and ontology engineering, pp 313–340

Bermingham A, Smeaton AF (2010) Classifying sentiment in microblogs: is brevity an advantage?. In: Proceedings of the 19th ACM international conference on Information and knowledge management. ACM, pp 1833–1836

Bhattacharya S, Banerjee P (2017) Towards the exploitation of statistical language models for sentiment analysis of twitter posts. In: IFIP international conference on computer information systems and industrial management. Springer, pp 253–263

Bifet A, Frank E (2010) Sentiment knowledge discovery in twitter streaming data. In: International conference on discovery science. Springer, pp 1–15

Bifet A, Holmes G, Pfahringer B (2011) Moa-tweetreader: real-time analysis in twitter streaming data. In: International conference on discovery science. Springer, pp 46–60

Blenn N, Charalampidou K, Doerr C (2012) Context-sensitive sentiment classification of short colloquial text. Networking 2012:97–108

Bliss CA, Kloumann IM, Harris KD, Danforth CM, Dodds PS (2012) Twitter reciprocal reply networks exhibit assortativity with respect to happiness. J Comput Sci 3(5):388–397CrossRef

Boididou C, Papadopoulos S, Zampoglou M, Apostolidis L, Papadopoulou O, Kompatsiaris Y (2018) Detection and visualization of misleading content on twitter. Int J Multimed Inf Retriev 7(1):71–86CrossRef

Bollen J, Mao H, Zeng X (2011) Twitter mood predicts the stock market. J Comput Sci 2(1):1–8CrossRef

Borth D, Ji R, Chen T, Breuel T, Chang S-F (2013) Large-scale visual sentiment ontology and detectors using adjective noun pairs. In: Proceedings of the 21st ACM international conference on multimedia. ACM, pp 223–232

Bosco C, Patti V, Bolioli A (2013) Developing corpora for sentiment analysis: the case of irony and senti-tut. IEEE Intell Syst 28(2):55–63CrossRef

Bottou L (2010) Large-scale machine learning with stochastic gradient descent. In: Proceedings of COMPSTAT’2010. Springer, pp 177–186

Bouazizi M, Ohtsuki T (2017) A pattern-based approach for multi-class sentiment analysis in twitter. IEEE Access 5(2017):20617–20639CrossRef

Bouazizi M, Ohtsuki T (2018) Multi-class sentiment analysis in twitter: What if classification is not the answer. IEEE Access 6(2018):64486–64502CrossRef

Bouchlaghem R, Elkhelifi A, Faiz R (2016) A machine learning approach for classifying sentiments in Arabic tweets. In: WIMS, p 24

Bravo-Marquez F, Mendoza M, Poblete B (2013) Combining strengths, emotions and polarities for boosting twitter sentiment analysis. In: Proceedings of the second international workshop on issues of sentiment discovery and opinion mining. ACM, p 2

Bravo-Marquez F, Mendoza M, Poblete B (2014) Meta-level sentiment models for big social data analysis. Knowl-Based Syst 69(2014):86–99CrossRef

Bravo-Marquez F, Frank E, Pfahringer B (2016) From opinion lexicons to sentiment classification of tweets and vice versa: a transfer learning approach. In: 2016 IEEE/WIC/ACM international conference on web intelligence (WI). IEEE, pp 145–152

Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140MathSciNetMATHCrossRef

Breiman L (2001) Random forests. Mach Learn 45(1):5–32MATHCrossRef

Brereton P, Kitchenham BA, Budgen D, Turner M, Khalil M (2007) Lessons from applying the systematic literature review process within the software engineering domain. J Syst Softw 80(4):571–583CrossRef

Brooks M, Robinson JJ, Torkildson MK, Aragon CR et al (2014) Collaborative visual analysis of sentiment in twitter events. In: International conference on cooperative design, visualization and engineering. Springer, pp 1–8

Buddhitha P, Inkpen D (2015) Topic-based sentiment analysis. In: Annual international symposium on information management and big data. Springer, pp 95–107

Buechel S, Rücker S, Hahn U (2020) Learning and evaluating emotion lexicons for 91 languages. arXiv preprint arXiv:2005.05672

Bukhari A, Qamar U, Ghazia U (2016) Urwf: user reputation based weightage framework for twitter micropost classification. IseB 15(3):623–659CrossRef

Burnap P, Gibson R, Sloan L, Southern R, Williams M (2016) 140 characters to victory?: Using twitter to predict the UK 2015 general election. Electoral Stud 41(2016):230–233CrossRef

Buscaldi D, Hernandez-Farias I (2015) Sentiment analysis on microblogs for natural disasters management: a study on the 2014 genoa floodings. In: Proceedings of the 24th international conference on World Wide Web. ACM, pp 1185–1188

Cai G, Xia B (2015) Convolutional neural networks for multimedia sentiment analysis. In: Natural Language Processing and Chinese Computing. Springer, pp 159–167

Calvo H, Juárez Gambino O (2018) Cascading classifiers for twitter sentiment analysis with emotion lexicons. In: Gelbukh A (ed) Computational linguistics and intelligent text processing. Springer, Cham, pp 270–280CrossRef

Cambria E (2016) Affective computing and sentiment analysis. IEEE Intell Syst 31(2):102–107CrossRef

Cambria E, Schuller B, Xia Y, Havasi C (2013) New avenues in opinion mining and sentiment analysis. IEEE Intell Syst 2:15–21CrossRef

Cambria E, Li Y, Xing FZ, Poria S, Kwok K (2020) Senticnet 6: Ensemble application of symbolic and subsymbolic ai for sentiment analysis. In: Proceedings of the 29th ACM international conference on information & knowledge management, pp 105–114

Cano AE, Preotiuc-Pietro D, Radovanović D, Weller K, Dadzie A-S (2016) # microposts2016: 6th workshop on making sense of microposts: Big things come in small packages. In: Proceedings of the 25th international conference companion on World Wide Web, international World Wide Web conferences Steering Committee, pp 1041–1042

Cao D, Wang S, Lin D (2018) Chinese microblog users’ sentiment-based traffic condition analysis. Soft Comput 22(21):7005–7014CrossRef

Carvalho J, Plastino A (2021) On the evaluation and combination of state-of-the-art features in twitter sentiment analysis. Artif Intell Rev 54(3):1887–1936CrossRef

Caschera MC, Ferri F, Grifoni P (2016) Sentiment analysis from textual to multimodal features in digital environments. In: Proceedings of the 8th international conference on management of digital ecosystems. ACM, pp 137–144

Castellucci G, Croce D, Basili R (2015) Acquiring a large scale polarity lexicon through unsupervised distributional methods. In: International conference on applications of natural language to information systems. Springer, pp 73–86

Castellucci G, Croce D, Basili R (2015) Bootstrapping large scale polarity lexicons through advanced distributional methods. In: Congress of the Italian Association for Artificial Intelligence. Springer, pp 329–342

Çeliktuğ MF (2018) Twitter sentiment analysis, 3-way classification: positive, negative or neutral?. In: 2018 IEEE international conference on big data (Big Data). IEEE, pp 2098–2103

Chen Y (2018) Tagnet: Toward tag-based sentiment analysis of large social media data. In: 2018 IEEE Pacific visualization symposium (PacificVis). IEEE, pp 190–194

Chen C-IP, Zheng J (2018) Improved big data analytics solution using deep learning model and real-time sentiment data analysis approach. In: International Conference on brain inspired cognitive systems. Springer, pp 579–588

Chen N, Wang P (2018) Advanced combined LSTM-CNN model for twitter sentiment analysis. In: 2018 5th IEEE international conference on cloud computing and intelligence systems (CCIS). IEEE, pp 684–687

Chen L, Wang W, Nagarajan M, Wang S, Sheth AP (2012) Extracting diverse sentiment expressions with target-dependent polarity from twitter. ICWSM 2(3):50–57

Chen C, Zhao H, Yang Y (2015) Deceptive opinion spam detection using deep level linguistic features. In: Natural Language Processing and Chinese Computing. Springer, pp 465–474

Chen X, Cho Y, Jang SY (2015) Crime prediction using twitter sentiment and weather. In: Systems and information engineering design symposium (SIEDS). IEEE, pp 63–68

Chen P, Fu X, Teng S, Lin S, Lu J (2015) Research on micro-blog sentiment polarity classification based on SVM. Springer, Cham, pp 392–404. https://doi.org/10.1007/978-3-319-15554-8_32CrossRef

Chen F, Gao Y, Cao D, Ji R (2015) Multimodal hypergraph learning for microblog sentiment prediction. In: 2015 IEEE international conference on multimedia and expo (ICME). IEEE, pp 1–6

Chen S, Huang Y, Huang W (2016) Big data analytics on aviation social media: the case of china southern airlines on Sina Weibo. In: 2016 IEEE second international conference on big data computing service and applications (BigDataService). IEEE, pp 152–155

Chen M, Zhang L-L, Yu X, Liu Y (2017) Weighted co-training for cross-domain image sentiment classification. J Comput Sci Technol 32(4):714–725CrossRef

Chen Y, Yuan J, You Q, Luo J (2018) Twitter sentiment analysis via bi-sense emoji embedding and attention-based LSTM. In: 2018 ACM multimedia conference on multimedia conference. ACM, pp 117–125

Chen C-C, Huang H-H, Chen H-H (2020) Issues and perspectives from 10,000 annotated financial social media data. In: Proceedings of The 12th language resources and evaluation conference, pp 6106–6110

Chikersal P, Poria S, Cambria E, Gelbukh A, Siong CE (2015) Modelling public sentiment in twitter: using linguistic patterns to enhance supervised learning. In: International conference on intelligent text processing and computational linguistics. Springer, pp 49–65

Cho SW, Cha MS, Kim SY, Song JC, Sohn K-A (2014) Investigating temporal and spatial trends of brand images using twitter opinion mining. In: 2014 International conference on information science and applications (ICISA). IEEE, pp 1–4

Choi D, Kim P (2013) Sentiment analysis for tracking breaking events: a case study on twitter. In: Asian conference on intelligent information and database systems. Springer, pp 285–294

Chou P-H, Tsai RT-H, Hsu JY-J (2017) Context-aware sentiment propagation using lDA topic modeling on Chinese conceptnet. Soft Comput 21(11):2911–2921CrossRef

Cignarella AT, Basile V, Sanguinetti M, Bosco C, Rosso P, Benamara F (2020) Multilingual irony detection with dependency syntax and neural models. arXiv preprint arXiv:2011.05706

Clark P, Niblett T (1989) The cn2 induction algorithm. Mach Learn 3(4):261–283CrossRef

Cohen WW (1995) Fast effective rule induction. In: Machine learning proceedings 1995, Elsevier, pp 115–123

Cook RD (1977) Detection of influential observation in linear regression. Technometrics 19(1):15–18MathSciNetMATH

Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297MATHCrossRef

Cortis K (2013) ACE: a concept extraction approach using linked open data. In: Making Sense of Microposts (#MSM2013) Concept Extraction Challenge, pp 31–35. http://ceur-ws.org/Vol-1019/paper_20.pdf

Cortis K, Davis B (2019) A social opinion gold standard for the Malta government budget 2018. In: Proceedings of the 5th workshop on noisy user-generated text (W-NUT 2019), pp 364–369

Cortis K, Freitas A, Daudert T, Huerlimann M, Zarrouk M, Handschuh S, Davis B (2017) Semeval-2017 task 5: fine-grained sentiment analysis on financial microblogs and news. In: Proceedings of the 11th international workshop on semantic evaluation (SemEval-2017), pp 519–535

Costa H, Merschmann LH, Barth F, Benevenuto F (2014) Pollution, bad-mouthing, and local marketing: the underground of location-based social networks. Inf Sci 279(2014):123–137CrossRef

Cotfas LA, Delcea C, Raicu I, Bradea IA, Scarlat E (2017) Grey sentiment analysis using sentiwordnet. In: 2017 international conference on grey systems and intelligent services (GSIS). IEEE, pp 284–288

Cotfas L-A, Delcea C, Roxin I, Paun R (2015) Twitter ontology-driven sentiment analysis. Springer, Cham, pp 131–139. https://doi.org/10.1007/978-3-319-16211-9_14CrossRef

Coyne S, Madiraju P, Coelho J (2017) Forecasting stock prices using social media analysis. In: 2017 IEEE 15th international conference on dependable, autonomic and secure computing, 15th international conference on pervasive intelligence and computing, 3rd international conference on big data intelligence and computing and cyber science and technology congress (DASC/PiCom/DataCom/CyberSciTech). IEEE, pp 1031–1038

Crammer K, Dekel O, Keshet J, Shalev-Shwartz S, Singer Y (2006) Online passive-aggressive algorithms. J Mach Learn Res 7(Mar):551–585MathSciNetMATH

Cresci S, Lillo F, Regoli D, Tardelli S, Tesconi M (2019) Cashtag piggybacking: uncovering spam and bot activity in stock microblogs on twitter. ACM Trans Web (TWEB) 13(2):1–27CrossRef

Cui A, Zhang M, Liu Y, Ma S (2011) Emotion tokens: bridging the gap among multilingual twitter sentiment analysis. Inf Retriev Technol 2011:238–249CrossRef

Cui A, Zhang H, Liu Y, Zhang M, Ma S (2013) Lexicon-based sentiment analysis on topical Chinese microblog messages. In: Semantic web and web science. Springer, pp 333–344

Cvijikj IP, Michahelles F (2011) Understanding social media marketing: a case study on topics, categories and sentiment on a facebook brand page. In: Proceedings of the 15th international academic mindtrek conference: envisioning future media environments. ACM, pp 175–182

Da Silva NF, Hruschka ER, Hruschka ER (2014) Tweet sentiment analysis with classifier ensembles. Decis Support Syst 66(2014):170–179CrossRef

Dambhare SS, Karale S (2017) Smart map for smart city. In: 2017 international conference on innovative mechanisms for industry applications (ICIMIA). IEEE, pp 622–626

D’Asaro FA, Di Gangi MA, Perticone V, Tabacchi ME (2017) Computational intelligence and citizen communication in the smart city. Informatik-Spektrum 40(1):25–34CrossRef

Das A, Gollapudi S, Khan A, Paes Leme R (2014) Role of conformity in opinion dynamics in social networks. In: Proceedings of the second ACM conference on Online social networks. ACM, pp 25–36

Dasgupta SS, Natarajan S, Kaipa KK, Bhattacherjee SK, Viswanathan A (2015) Sentiment analysis of facebook data using hadoop based open source technologies. In: IEEE international conference on data science and advanced analytics (DSAA), 2015. 36678 2015. IEEE, pp 1–3

Dashtipour K, Poria S, Hussain A, Cambria E, Hawalah AY, Gelbukh A, Zhou Q (2016) Multilingual sentiment analysis: state of the art and independent comparison of techniques. Cogn Comput 8(4):757–771CrossRef

D’Avanzo E, Pilato G (2015) Mining social network users opinions’ to aid buyers’ shopping decisions. Comput Hum Behav 51(2015):1284–1294CrossRef

de Souza CC, de França FO, Goya DH, de Camargo Penteado CL (2016) Brazilians divided: political protests as told by twitter. In: Transactions on large-scale data-and knowledge-centered systems XXVII. Springer, pp 1–18

Dedhia C, Ramteke J (2017) Ensemble model for twitter sentiment analysis. In: 2017 international conference on inventive systems and control (ICISC). IEEE, pp 1–5

Del Bosque LP, Garza SE (2014) Aggressive text detection for cyberbullying. In: Mexican international conference on artificial intelligence. Springer, pp 221–232

Delcea C, Cotfas L-A, Paun R (2014) Understanding online social networks’ users—a twitter approach. In: International conference on computational collective intelligence. Springer, pp 145–153

Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B (Methodol) 1977:1–38MathSciNetMATH

Deshwal A, Sharma SK (2016) Twitter sentiment analysis using various classification algorithms. In: 2016 5th international conference on reliability, infocom technologies and optimization (trends and future directions) (ICRITO). IEEE, pp 251–257

Devlin J, Chang M-W, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805

Diakopoulos NA, Shamma DA (2010) Characterizing debate performance via aggregated twitter sentiment. In: Proceedings of the SIGCHI conference on human factors in computing systems. ACM, pp 1195–1198

Do HH, Prasad P, Maag A, Alsadoon A (2019) Deep learning for aspect-based sentiment analysis: a comparative review. Expert Syst Appl 118(2019):272–299CrossRef

dos Santos A, Júnior JDB, de Arruda CH (2018) Annotation of a corpus of tweets for sentiment analysis. In: International conference on computational processing of the Portuguese language. Springer, pp 294–302

Dragoni M (2018) Computational advertising in social networks: an opinion mining-based approach. In: Proceedings of the 33rd annual ACM symposium on applied computing. ACM, pp 1798–1804

Dritsas E, Livieris IE, Giotopoulos K, Theodorakopoulos L (2018) An apache spark implementation for graph-based hashtag sentiment classification on twitter. In: Proceedings of the 22nd Pan-Hellenic conference on informatics. ACM, pp 255–260

Drucker H, Burges CJ, Kaufman L, Smola AJ, Vapnik V (1997) Support vector regression machines. In: Advances in neural information processing systems, pp 155–161

Du J, Xu H, Huang X (2014) Box office prediction based on microblog. Expert Syst Appl 41(4):1680–1689CrossRef

Duan L, Tsang IW, Xu D, Chua T-S (2009) Domain adaptation from multiple sources via auxiliary classifiers. In: Proceedings of the 26th annual international conference on machine learning. ACM, pp 289–296

Ducange P, Fazzolari M (2017) Social sensing and sentiment analysis: Using social media as useful information source. In: 2017 international conference on smart systems and technologies (SST). IEEE, pp 301–306

Dyba T, Dingsoyr T, Hanssen GK (2007) Applying systematic reviews to diverse study types: an experience report. In: First international symposium on empirical software engineering and measurement, 2007. ESEM 2007. IEEE, pp 225–234

Effrosynidis D, Symeonidis S, Arampatzis A (2017) A comparison of pre-processing techniques for twitter sentiment analysis. In: International conference on theory and practice of digital libraries. Springer, pp 394–406

Eke CI, Norman AA, Shuib L, Nweke HF (2020) Sarcasm identification in textual data: systematic review, research challenges and open directions. Artif Intell Rev 53(6):4215–4258CrossRef

Ekman P (1992) An argument for basic emotions. Cogn Emotion 6(3–4):169–200CrossRef

El Haddaoui B, Chiheb R, Faizi R, El Afia A (2018) Toward a sentiment analysis framework for social media. LOPAL 2018:29–1

Elbagir S, Yang J (2018) Sentiment analysis of twitter data using machine learning techniques and scikit-learn. In: Proceedings of the (2018) International Conference on Algorithms. ACM, Computing and Artificial Intelligence, p 57

Elouardighi A, Maghfour M, Hammia H (2017) Collecting and processing arabic facebook comments for sentiment analysis. In: International conference on model and data engineering. Springer, pp 262–274

Elzayady H, Badran KM, Salama GI (2018) Sentiment analysis on twitter data using apache spark framework. In: 2018 13th international conference on computer engineering and systems (ICCES). IEEE, pp 171–176

Er MJ, Liu F, Wang N, Zhang Y, Pratama M (2016) User-level twitter sentiment analysis with a hybrid approach. In: International symposium on neural networks. Springer, pp 426–433

Erdmann M, Ikeda K, Ishizaki H, Hattori G, Takishima Y (2014) Feature based sentiment analysis of tweets in multiple languages. In: International conference on web information systems engineering. Springer, pp 109–124

Esiyok C, Albayrak S (2015) Twitter sentiment tracking for predicting marketing trends. In: Smart information systems. Springer, pp 47–74

Fang A, Ben-Miled Z (2017) Does bad news spread faster?. In: 2017 international conference on computing, networking and communications (ICNC). IEEE, pp 793–797

Fatyanosa TN, Bachtiar FA (2017) Classification method comparison on Indonesian social media sentiment analysis. In: 2017 international conference on sustainable information engineering and technology (SIET). IEEE, pp 310–315

Fatyanosa TN, Bachtiar FA, Data M (2018) Feature selection using variable length chromosome genetic algorithm for sentiment analysis. In: 2018 international conference on sustainable information engineering and technology (SIET). IEEE, pp 27–32

Feng S, Song K, Wang D, Yu G (2015) A word-emoticon mutual reinforcement ranking model for building sentiment lexicon from massive collection of microblogs. World Wide Web 18(4):949CrossRef

Fersini E, Pozzi FA, Messina E (2015) Detecting irony and sarcasm in microblogs: The role of expressive signals and ensemble classifiers. In: IEEE international conference on data science and advanced analytics (DSAA), 2015. 36678 2015. IEEE, pp 1–8

Filice S, Castellucci G, Croce D, Basili R (2014) Effective kernelized online learning in language processing tasks. ECIR. Springer, pp 347–358

Fisher RA (1925) Theory of statistical estimation. In: Mathematical proceedings of the Cambridge Philosophical Society, vol 22. Cambridge University Press, pp 700–725

Fitri FS, Si MNS, Setianingsih C (2018) Sentiment analysis on the level of customer satisfaction to data cellular services using the Naive Bayes classifier algorithm. In: 2018 IEEE international conference on internet of things and intelligence system (IOTAIS). IEEE, pp 201–206

Flaes JB, Rudinac S, Worring M (2016) What multimedia sentiment analysis says about city liveability. In: European conference on information retrieval. Springer, pp 824–829

Frankenstein W, Joseph K, Carley KM (2016) Contextual sentiment analysis. In: Social, cultural, and behavioral modeling: 9th international conference, SBP-BRiMS 2016, Washington, DC, USA, June 28–July 1, 2016, Proceedings 9. Springer, pp 291–300

Freund Y, Schapire R, Abe N (1999) A short introduction to boosting. J Jpn Soc Artif Intell 14(771–780):1612

Frey BJ, Dueck D (2007) Clustering by passing messages between data points. Science 315(5814):972–976MathSciNetMATHCrossRef

Furini M, Montangero M (2016) Tsentiment: on gamifying twitter sentiment analysis. In: 2016 IEEE symposium on computers and communication (ISCC). IEEE, pp 91–96

Gallegos L, Lerman K, Huang A, Garcia D (2016) Geography of emotion: Where in a city are people happier?. In: Proceedings of the 25th international conference companion on World Wide Web, International World Wide Web Conferences Steering Committee, pp 569–574

Gambino OJ, Calvo H (2016) A comparison between two spanish sentiment lexicons in the twitter sentiment analysis task. In: Ibero-American conference on artificial intelligence. Springer, pp 127–138

Gandhe K, Varde AS, Du X (2018) Sentiment analysis of twitter data with hybrid learning for recommender applications. In: 2018 9th IEEE annual ubiquitous computing, electronics & mobile communication conference (UEMCON). IEEE, pp 57–63

Gao B, Berendt B, Vanschoren J (2016) Toward understanding online sentiment expression: an interdisciplinary approach with subgroup comparison and visualization. Soc Netw Anal Min 6(1):68CrossRef

Gao F, Sun X, Wang K, Ren F (2016) Chinese micro-blog sentiment analysis based on semantic features and pad model. In: 2016 IEEE/ACIS 15th international conference on computer and information science (ICIS). IEEE, pp 1–5

Gao K, Xu H, Wang J (2015) Emotion cause detection for Chinese micro-blogs based on ECOCC model. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 3–14

Garg Y, Chatterjee N (2014) Sentiment analysis of twitter feeds. In: International conference on big data analytics. Springer, pp 33–52

Geetha R, Rekha P, Karthika S (2018) Twitter opinion mining and boosting using sentiment analysis. In: 2018 international conference on computer, communication, and signal processing (ICCCSP). IEEE, pp 1–4

Geurts P, Ernst D, Wehenkel L (2006) Extremely randomized trees. Mach Learn 63(1):3–42MATHCrossRef

Ghiassi M, Lee S (2018) A domain transferable lexicon set for twitter sentiment analysis using a supervised machine learning approach. Expert Syst Appl 106(2018):197–216CrossRef

Ghiassi M, Saidane H (2005) A dynamic architecture for artificial neural networks. Neurocomputing 63(2005):397–413CrossRef

Ghiassi M, Skinner J, Zimbra D (2013) Twitter brand sentiment analysis: a hybrid system using n-gram analysis and dynamic artificial neural network. Expert Syst Appl 40(16):6266–6282CrossRef

Ghosal D, Akhtar MS, Ekbal A, Bhattacharyya P (2018) Deep ensemble model with the fusion of character, word and lexicon level information for emotion and sentiment prediction. In: International conference on neural information processing. Springer, pp 162–174

Giachanou A, Crestani F (2016) Like it or not: a survey of twitter sentiment analysis methods. ACM Comput Surv (CSUR) 49(2):28

Giachanou A, Crestani F (2016) Opinion retrieval in twitter: is proximity effective?. In: Proceedings of the 31st annual ACM symposium on applied computing. ACM, pp 1146–1151

Giachanou A, Gonzalo J, Mele I, Crestani F (2017) Sentiment propagation for predicting reputation polarity. In: European conference on information retrieval. Springer, pp 226–238

Giachanou A, Harvey M, Crestani F (2016) Topic-specific stylistic variations for opinion retrieval on twitter. In: European conference on information retrieval. Springer, pp 466–478

Go A, Bhayani R, Huang L (2009) Twitter sentiment classification using distant supervision, CS224N Project Report. Stanford 1(12)

Goel S, Banthia M, Sinha A (2018) Modeling recommendation system for real time analysis of social media dynamics. In: 2018 eleventh international conference on contemporary computing (IC3). IEEE, pp 1–5

Gonçalves P, Araújo M, Benevenuto F, Cha M (2013) Comparing and combining sentiment analysis methods. In: Proceedings of the first ACM conference on Online social networks. ACM, pp 27–38

Gonzalez-Marron D, Mejia-Guzman D, Enciso-Gonzalez A (2017) Exploiting data of the twitter social network using sentiment analysis. In: Applications for future internet. Springer, pp 35–38

Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: Advances in neural information processing systems, pp 2672–2680

Graves A, Schmidhuber J (2005) Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw 18(5–6):602–610CrossRef

Greff K, Srivastava RK, Koutník J, Steunebrink BR, Schmidhuber J (2017) LSTM: a search space odyssey. IEEE Trans Neural Netw Learning Syst 28(10):2222–2232MathSciNetCrossRef

Guerra PC, Meira Jr W, Cardie C (2014) Sentiment analysis on evolving social streams: How self-report imbalances can help. In: Proceedings of the 7th ACM international conference on web search and data mining. ACM, pp 443–452

Guo X, Li J (2019) A novel twitter sentiment analysis model with baseline correlation for financial market prediction with improved efficiency. In: 2019 sixth international conference on social networks analysis, management and security (SNAMS). IEEE, pp 472–477

Gupta VS, Kohli S (2016) Twitter sentiment analysis in healthcare using hadoop and r. In: 2016 3rd international conference on computing for sustainable global development (INDIACom). IEEE, pp 3766–3772

Gupta I, Joshi N (2017) Tweet normalization: a knowledge based approach. In: 2017 international conference on Infocom technologies and unmanned systems (trends and future directions) (ICTUS). IEEE, pp 157–162

Gupta F, Singal S (2017) Sentiment analysis of the demonitization of economy 2016 India, regionwise. In: 2017 7th international conference on cloud computing, data science & engineering-confluence. IEEE, pp 693–696

Hagen M, Potthast M, Büchner M, Stein B (2015) Twitter sentiment detection via ensemble classification using averaged confidence scores. In: European conference on information retrieval. Springer, pp 741–754

Hagge M, von Hoffen M, Betzing JH, Becker J (2017) Design and implementation of a toolkit for the aspect-based sentiment analysis of tweets. In: 2017 IEEE 19th conference on business informatics (CBI), vol 1. IEEE, pp 379–387

Haldenwang N, Ihler K, Kniephoff J, Vornberger O (2018) A comparative study of uncertainty based active learning strategies for general purpose twitter sentiment analysis with deep neural networks. In: International conference of the German Society for Computational Linguistics and Language Technology. Springer, pp 208–215

Halibas AS, Shaffi AS, Mohamed MAKV (2018) Application of text classification and clustering of twitter data for business analytics. In: 2018 Majan international conference (MIC). IEEE, pp 1–7

Hanafy M, Khalil MI, Abbas HM (2018) Combining classical and deep learning methods for twitter sentiment analysis. In: IAPR workshop on artificial neural networks in pattern recognition. Springer, pp 281–292

Han S, Kavuluru R (2015) On assessing the sentiment of general tweets. In: Canadian conference on artificial intelligence. Springer, pp 181–195

Hao Z, Cai R, Yang Y, Wen W, Liang L (2017) A dynamic conditional random field based framework for sentence-level sentiment analysis of Chinese microblog. In: 2017 IEEE international conference on computational science and engineering (CSE) and IEEE international conference on embedded and ubiquitous computing (EUC), vol 1. IEEE, pp 135–142

Hassan A, Abbasi A, Zeng D (2013) Twitter sentiment analysis: a bootstrap ensemble framework. In: 2013 international conference on social computing (SocialCom). IEEE, pp 357–364

Heckerman D, Geiger D, Chickering DM (1995) Learning Bayesian networks: the combination of knowledge and statistical data. Mach Learn 20(3):197–243MATHCrossRef

Hernandez-Farias I, Buscaldi D, Priego-Sánchez B (2014) Iradabe: Adapting english lexicons to the italian sentiment polarity classification task. In: First Italian conference on computational linguistics (CLiC-it 2014) and the fourth international workshop EVALITA2014, pp 75–81

Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580

Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507MathSciNetMATHCrossRef

Hoang T-A, Cohen WW, Lim E-P, Pierce D, Redlawsk DP (2013) Politics, sharing and emotion in microblogs. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 282–289

Hong Y, Sinnott RO (2018) A social media platform for infectious disease analytics. In: International conference on computational science and its applications. Springer, pp 526–540

Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2(5):359–366MATHCrossRef

Hridoy SAA, Ekram MT, Islam MS, Ahmed F, Rahman RM (2015) Localized twitter opinion mining using sentiment analysis. Decis Anal 2(1):8CrossRef

Huang Y-P, Hlongwane N, Kao L-J (2018) Using sentiment analysis to determine users’ likes on twitter. In: 2018 IEEE 16th international conference on dependable, autonomic and secure computing, 16th international conference on pervasive intelligence and computing, 4th international conference on big data intelligence and computing and cyber science and technology congress (DASC/PiCom/DataCom/CyberSciTech). IEEE, pp 1068–1073

Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1–3):489–501CrossRef

Huang Y, Liu Q, Zhang S, Metaxas DN (2010) Image retrieval via probabilistic hypergraph ranking. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 3376–3383

Huang Y, Zhou S, Huang K, Guan J (2015) Boosting financial trend prediction with twitter mood based on selective hidden Markov models. In: International conference on database systems for advanced applications. Springer, pp 435–451

Hubert RB, Estevez E, Maguitman A, Janowski T (2018) Examining government-citizen interactions on twitter using visual and sentiment analysis. In: Proceedings of the 19th annual international conference on digital government research: governance in the data age. ACM, p 55

Hu G, Bhargava P, Fuhrmann S, Ellinger S, Spasojevic N (2017) Analyzing users’ sentiment towards popular consumer industries and brands on twitter. In: 2017 IEEE international conference on data mining workshops (ICDMW). IEEE, pp 381–388

Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 168–177

Hulten G, Spencer L, Domingos P (2001) Mining time-changing data streams. In: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 97–106

Hürlimann M, Davis B, Cortis K, Freitas A, Handschuh S, Fernández S (2016) A twitter sentiment gold standard for the brexit referendum. SEMANTICS 2016:193–196

Husnain A, Din SMU, Hussain G, Ghayor Y (2017) Estimating market trends by clustering social media reviews. In: 2017 13th international conference on emerging technologies (ICET). IEEE, pp 1–6

Hussain A, Cambria E (2018) Semi-supervised learning for big social data analysis. Neurocomputing 275(2018):1662–1673CrossRef

Hu X, Tang L, Tang J, Liu H (2013) Exploiting social relations for sentiment analysis in microblogging. In: Proceedings of the sixth ACM international conference on web search and data mining. ACM, pp 537–546

Hyun D, Cho J, Yu H (2020) Building large-scale English and Korean datasets for aspect-level sentiment analysis in automotive domain. In: Proceedings of the 28th international conference on computational linguistics, pp 961–966

Ignatov D, Ignatov A (2017) Decision stream: Cultivating deep decision trees. In: 2017 IEEE 29th international conference on tools with artificial intelligence (ICTAI). IEEE, pp 905–912

Ishikawa T, Sakurai K (2017) A proposal of event study methodology with twitter sentimental analysis for risk management. In: Proceedings of the 11th international conference on ubiquitous information management and communication. ACM, p 14

Ismail HM, Belkhouche B, Zaki N (2018) Semantic twitter sentiment analysis based on a fuzzy thesaurus. Soft Comput 22(18):6011–6024CrossRef

Javed I, Afzal H, Majeed A, Khan B (2014) Towards creation of linguistic resources for bilingual sentiment analysis of twitter data. In: International conference on applications of natural language to data bases/information systems. Springer, pp 232–236

Jaynes ET (1957) Information theory and statistical mechanics. Phys Rev 106(4):620MathSciNetMATHCrossRef

Ji X, Chun SA, Wei Z, Geller J (2015) Twitter sentiment classification for measuring public health concerns. Soc Netw Anal Min 5(1):13CrossRef

Jiang F, Liu Y-Q, Luan H-B, Sun J-S, Zhu X, Zhang M, Ma S-P (2015) Microblog sentiment analysis with emoticon space model. J Comput Sci Technol 30(5):1120–1129CrossRef

Jiang F, Cui A, Liu Y, Zhang M, Ma S (2013) Every term has sentiment: learning from emoticon evidences for Chinese microblog sentiment analysis. In: Natural Language Processing and Chinese Computing. Springer, pp 224–235

Jiang B, Hou J, Zhou W, Yang C, Wang S, Pang L (2020) Metnet: a mutual enhanced transformation network for aspect-based sentiment analysis. In: Proceedings of the 28th international conference on computational linguistics, pp 162–172

Jiang L, Yu M, Zhou M, Liu X, Zhao T (2011) Target-dependent twitter sentiment classification. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies, vol 1. Association for Computational Linguistics, pp 151–160

Jianqiang Z (2015) Pre-processing boosting twitter sentiment analysis?. In: 2015 IEEE international conference on Smart City/SocialCom/SustainCom (SmartCity). IEEE, pp 748–753

Jianqiang Z (2016) Combing semantic and prior polarity features for boosting twitter sentiment analysis using ensemble learning. In: IEEE International conference on data science in cyberspace (DSC). IEEE, pp 709–714

Jianqiang Z, Xiaolin G (2017) Comparison research on text pre-processing methods on twitter sentiment analysis. IEEE Access 5(2017):2870–2879CrossRef

Jianqiang Z, Xiaolin G, Xuejun Z (2018) Deep convolution neural networks for twitter sentiment analysis. IEEE Access 6(2018):23253–23260CrossRef

Jianqiang Z, Xueliang C (2015) Combining semantic and prior polarity for boosting twitter sentiment analysis. In: 2015 IEEE international conference on Smart City/SocialCom/SustainCom (SmartCity). IEEE, pp 832–837

Ji X, Chun SA, Geller J (2016) Knowledge-based tweet classification for disease sentiment monitoring. In: Sentiment analysis and ontology engineering. Springer, pp 425–454

Jin Y, Zhang H, Du D (2017) Incorporating positional information into deep belief networks for sentiment classification. In: Industrial conference on data mining. Springer, pp 1–15

Joyce B, Deng J (2017) Sentiment analysis of tweets for the 2016 us presidential election. In: 2017 IEEE MIT undergraduate research technology conference (URTC). IEEE, pp 1–4

Juneja P, Ojha U (2017) Casting online votes: to predict offline results using sentiment analysis by machine learning classifiers. In: 2017 8th international conference on computing, communication and networking technologies (ICCCNT). IEEE, pp 1–6

Jurek A, Bi Y, Mulvenna M (2014) Twitter sentiment analysis for security-related information gathering. In: Intelligence and security informatics conference (JISIC), 2014 IEEE Joint. IEEE, pp 48–55

Kalayeh MM, Seifu M, LaLanne W, Shah M (2015) How to take a good selfie?. In: Proceedings of the 23rd ACM international conference on Multimedia. ACM, pp 923–926

Kamyab M, Tao R, Mohammadi MH, Rasool A (2018) Sentiment analysis on twitter: a text mining approach to the Afghanistan status reviews. In: Proceedings of the 2018 international conference on artificial intelligence and virtual reality. ACM, pp 14–19

Kanakaraj M, Guddeti RMR (2015) Performance analysis of ensemble methods on twitter sentiment analysis using NLP techniques. In: 2015 IEEE international conference on semantic computing (ICSC). IEEE, pp 169–170

Kanavos A, Metaxas S, Tsakalidis A (2016) Weighting public mood via microblogging analysis. In: Proceedings of the 20th Pan-Hellenic conference on informatics. ACM, 2016, p 45

Kao L-J, Huang Y-P (2018) An effective social network sentiment mining model for healthcare product sales analysis. In: 2018 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 2152–2157

Kaplan AM, Haenlein M (2010) Users of the world, unite! the challenges and opportunities of social media. Bus Horiz 53(1):59–68CrossRef

Kapočiūtė-Dzikienė J, Damaševičius R, Woźniak M (2019) Sentiment analysis of lithuanian texts using traditional and deep learning approaches. Computers 8(1):4CrossRef

Karyotis C, Doctor F, Iqbal R, James A, Chang V (2017) A fuzzy computational model of emotion for cloud based sentiment analysis. Inf Sci

Katiyar H, Kumar P, Sharma A et al (2018) Twitter sentiment analysis using dynamic vocabulary. In: 2018 conference on information and communication technology (CICT). IEEE, pp 1–4

Katsurai M, Satoh S (2016) Image sentiment analysis using latent correlations among visual, textual, and sentiment views. In: 2016 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 2837–2841

Katz G, Heap B, Wobcke W, Bain M, Kannangara S (2018) Analysing tv audience engagement via twitter: Incremental segment-level opinion mining of second screen tweets. In: Pacific Rim international conference on artificial intelligence. Springer, pp 300–308

Kaushik K, Dey S (2016) Impact of event reputation on the sponsor’s sentiment. In: Proceedings of the international conference on research in adaptive and convergent systems. ACM, pp 18–21

Khalil T, Halaby A, Hammad M, El-Beltagy SR (2015) Which configuration works best? an experimental study on supervised arabic twitter sentiment analysis. In: 2015 first international conference on Arabic Computational Linguistics (ACLing). IEEE, pp 86–93

Khan FH, Bashir S, Qamar U (2014) Tom: Twitter opinion mining framework using hybrid classification scheme. Decis Support Syst 57(2014):245–257CrossRef

Khuc VN, Shivade C, Ramnath R, Ramanathan J (2012) Towards building large-scale distributed systems for twitter sentiment analysis. In: Proceedings of the 27th annual ACM symposium on applied computing. ACM, pp 459–464

Kitaoka S, Hasuike T (2017) Where is safe: Analyzing the relationship between the area and emotion using twitter data. In: 2017 IEEE symposium series on computational intelligence (SSCI). IEEE, pp 1–8

Kitchenham B (2004) Procedures for performing systematic reviews, vol 33. Keele University, Keele, pp 1–26

Kokkinogenis Z, Filguieras J, Carvalho S, Sarmento L, Rossetti RJ (2015) Mobility network evaluation in the user perspective: Real-time sensing of traffic information in twitter messages. In: Advances in artificial transportation systems and simulation, pp 219–234

Konate A, Du R (2018) Sentiment analysis of code-mixed Bambara–French social media text using deep learning techniques. Wuhan Univ J Nat Sci 23(3):237–243CrossRef

Kontopoulos E, Berberidis C, Dergiades T, Bassiliades N (2013) Ontology-based sentiment analysis of twitter posts. Expert Syst Appl 40(10):4065–4074CrossRef

Korenek P, Šimko M (2014) Sentiment analysis on microblog utilizing appraisal theory. World Wide Web 17(4):847–867CrossRef

Koto F, Adriani M (2015) A comparative study on twitter sentiment analysis: Which features are good? In: International conference on applications of natural language to information systems. Springer, pp 453–457

Koto F, Adriani M (2015) HBE: Hashtag-based emotion lexicons for twitter sentiment analysis. In: Proceedings of the 7th forum for information retrieval evaluation. ACM, pp 31–34

Koto F, Adriani M (2015) The use of pos sequence for analyzing sentence pattern in twitter sentiment analysis. In: 2015 IEEE 29th international conference on advanced information networking and applications workshops (WAINA). IEEE, pp 547–551

Koto F, Rahimi A, Lau JH, Baldwin T (2020) Indolem and indobert: a benchmark dataset and pre-trained language model for Indonesian nlp. arXiv preprint arXiv:2011.00677

Kouloumpis E, Wilson T, Moore JD (2011) Twitter sentiment analysis: the good the bad and the omg! ICWSM 11(538–541):164

Kranjc J, Podpecan V, Lavrac N (2013) Real-time data analysis in clowdflows. In: 2013 IEEE international conference on big data. IEEE, pp 15–22

Krouska A, Troussas C, Virvou M (2016) The effect of preprocessing techniques on twitter sentiment analysis. In: 2016 7th international conference on information, intelligence, systems & applications (IISA). IEEE, pp 1–5

Kruspe A, Häberle M, Kuhn I, Zhu XX (2020) Cross-language sentiment analysis of European twitter messages during the covid-19 pandemic. arXiv preprint arXiv:2008.12172

Ku L-W, Liang Y-T, Chen H-H (2006) Opinion extraction, summarization and tracking in news and blog corpora. Proc AAAI 2006:100–107

Kumar A, Garg G (2019) Sentiment analysis of multimodal twitter data. Multimedia Tools Appl 78(17):24103–24119CrossRef

Kumar M, Bala A (2016) Analyzing twitter sentiments through big data. In: 2016 3rd international conference on computing for sustainable global development (INDIACom). IEEE, pp 2628–2631

Kuo Y-H, Fu M-H, Tsai W-H, Lee K-R, Chen L-Y (2016) Integrated microblog sentiment analysis from users’ social interaction patterns and textual opinions. Appl Intell 44(2):399–413CrossRef

Lafferty JD, McCallum A, Pereira FCN (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the eighteenth international conference on machine learning, ICML ’01, Morgan Kaufmann Publishers Inc., San Francisco, pp 282–289. http://dl.acm.org/citation.cfm?id=645530.655813

Lai M, Bosco C, Patti V, Virone D (2015) Debate on political reforms in twitter: A hashtag-driven analysis of political polarization. In: IEEE international conference on data science and advanced analytics (DSAA), 2015. 36678 2015. IEEE, pp 1–9

Lample G, Conneau A (2019) Cross-lingual language model pretraining. arXiv preprint arXiv:1901.07291

Lau RY, Li C, Liao SS (2014) Social analytics: learning fuzzy product ontologies for aspect-oriented sentiment analysis. Decis Support Syst 65(2014):80–94CrossRef

Lavanya K, Deisy C (2017) Twitter sentiment analysis using multi-class SVM. In: 2017 international conference on intelligent computing and control (I2C2). IEEE, pp 1–6

Le VT, Tran-Nguyen TM, Pham KN, Do NT (2014) Forests of oblique decision stumps for classifying very large number of tweets. In: Future data and security engineering. Springer, pp 16–28

Le H, Boynton G, Mejova Y, Shafiq Z, Srinivasan P (2017) Bumps and bruises: mining presidential campaign announcements on twitter. In: Proceedings of the 28th ACM conference on hypertext and social media. ACM, pp 215–224

LeCun Y, Boser BE, Denker JS, Henderson D, Howard RE, Hubbard WE, Jackel LD (1990) Handwritten digit recognition with a back-propagation network. In: Advances in neural information processing systems, pp 396–404

Lee J-S, Nerghes A (2017) Labels and sentiment in social media: On the role of perceived agency in online discussions of the refugee crisis. In: Proceedings of the 8th international conference on social media & society. ACM, p 14

Lek HH, Poo DC (2013) Aspect-based twitter sentiment classification. In: 2013 IEEE 25th international conference on tools with artificial intelligence (ICTAI). IEEE, pp 366–373

Lewenberg Y, Bachrach Y, Volkova S (2015) Using emotions to predict user interest areas in online social networks. In: IEEE international conference on data science and advanced analytics (DSAA), 2015. 36678 2015. IEEE, pp 1–10

Lewis DD (1998) Naive (Bayes) at forty: the independence assumption in information retrieval. In: European conference on machine learning. Springer, pp 4–15

Li C-T, Hsieh H-P, Kuo T-T, Lin S-D (2014) Opinion diffusion and analysis on social networks. In: Encyclopedia of social network analysis and mining. Springer, pp 1212–1218

Li Y-M, Li T-Y (2013) Deriving market intelligence from microblogs. Decis Support Syst 55(1):206–217CrossRef

Li W, Xu H (2014) Text-based emotion classification using emotion cause extraction. Expert Syst Appl 41(4):1742–1749CrossRef

Li Z, Zhu S, Hong H, Li Y, El Saddik A (2017) City digital pulse: a cloud based heterogeneous data analysis platform. Multimed Tools Appl 76(8):10893–10916CrossRef

Li G, Chang K, Hoi SC (2014) Twitter microblog sentiment analysis. In: Encyclopedia of social network analysis and mining. Springer, pp 2253–2259

Li M, Ch’ng E, Chong A, See S (2016) The new eye of smart city: novel citizen sentiment analysis in twitter. In: 2016 international conference on audio, language and image processing (ICALIP). IEEE, pp 557–562

Li Y, Fleyeh H (2018) Twitter sentiment analysis of new ikea stores using machine learning. In: 2018 international conference on computer and applications (ICCA). IEEE, pp 4–11

Li W, Li Y, Wang Y (2016) Chinese microblog sentiment analysis based on sentiment features. In: Asia-Pacific web conference. Springer, pp 385–388

Lim S, Tucker CS, Kumara S (2017) An unsupervised machine learning model for discovering latent infectious diseases using social media data. J Biomed Inform 66(2017):82–94CrossRef

Lima ACE, de Castro LN, Corchado JM (2015) A polarity analysis framework for twitter messages. Appl Math Comput 270(2015):756–767MATH

Lin BY, Xu FF, Zhu K, Hwang S-W (2018) Mining cross-cultural differences and similarities in social media. In: Proceedings of the 56th annual meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 709–719

Lin C, He Y (2014) Sentiment analysis in social media. Springer, New York, pp 1688–1699. https://doi.org/10.1007/978-1-4614-6170-8_120CrossRef

Li D, Rzepka R, Ptaszynski M, Araki K (2018) Emoticon-aware recurrent neural network model for Chinese sentiment analysis. In: 2018 9th international conference on awareness science and technology (iCAST). IEEE, pp 161–166

Liu B (2010) Sentiment analysis and subjectivity. In: Handbook of natural language processing, 2nd ed. Taylor and Francis Group, Boca Raton

Liu S, Young SD (2018) A survey of social media data analysis for physical activity surveillance. J Forensic Leg Med 57(2018):33–36CrossRef

Liu Q, Hu Y, Lei Y, Wei X, Liu G, Bi W (2018) Topic-based microblog polarity classification based on cascaded model. In: International conference on computational science. Springer, pp 206–220

Liu T, Jiang F, Liu Y, Zhang M, Ma S (2015) Do photos help express our feelings: incorporating multimodal features into microblog sentiment analysis. In: Chinese national conference on social media processing. Springer, pp 63–73

Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized Bert pretraining approach. arXiv preprint arXiv:1907.11692

Liu S, Young SD (2016) A survey of social media data analysis for physical activity surveillance. J Forensic Leg Med

Liu B, Zhang L (2012) Mining text data. In: Ch. A survey of opinion mining and sentiment analysis. Springer, Boston, pp 415–463. https://doi.org/10.1007/978-1-4614-3223-4_13

Lloyd S (1982) Least squares quantization in PCM. IEEE Trans Inf Theory 28(2):129–137MathSciNetMATHCrossRef

Lo SL, Cambria E, Chiong R, Cornforth D (2017) Multilingual sentiment analysis: from formal to informal and scarce resource languages. Artif Intell Rev 48(4):499–527CrossRef

Loukachevitch N, Rubtsova Y (2015) Entity-oriented sentiment analysis of tweets: results and problems. In: International conference on text, speech, and dialogue. Springer, pp 551–559

Lu T-J (2015) Semi-supervised microblog sentiment analysis using social relation and text similarity. In: 2015 international conference on big data and smart computing (BigComp). IEEE, pp 194–201

Lu Y, Dong R, Smyth B (2016) Context-aware sentiment detection from ratings. In: Research and development in intelligent systems XXXIII: incorporating applications and innovations in intelligent systems XXIV 33. Springer, pp 87–101

Lu Y, Hu X, Wang F, Kumar S, Liu H, Maciejewski R (2015) Visualizing social media sentiment in disaster scenarios. In: Proceedings of the 24th international conference on World Wide Web. ACM, pp 1211–1215

Maeda H, Shimada K, Endo T (2012) Twitter sentiment analysis based on writing style. In: Advances in natural language processing. Springer, pp 278–288

Mansour R, Hady MFA, Hosam E, Amr H, Ashour A (2015) Feature selection for twitter sentiment analysis: An experimental study. In: International conference on intelligent text processing and computational linguistics. Springer, pp 92–103

Maynard D, Bontcheva K, Rout D (2012) Challenges in developing opinion mining tools for social media. In: Proceedings of the@ NLP can u tag# usergeneratedcontent, pp 15–22

Maynard D, Funk A (2011) Automatic detection of political opinions in tweets. In: Extended semantic web conference. Springer, pp 88–99

McCullagh P (1984) Generalized linear models. Eur J Oper Res 16(3):285–292MathSciNetMATHCrossRef

McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys 5(4):115–133MathSciNetMATHCrossRef

McDiarmid C (1989) On the method of bounded differences. Surv Combin 141(1):148–188MathSciNetMATH

McNair LM, D.M., Droppleman L (1971) Profile of mood states. Educational & Industrial testing service

Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J 5(4):1093–1113CrossRef

Mehrabian A (1996) Pleasure–arousal–dominance: a general framework for describing and measuring individual differences in temperament. Curr Psychol 14(4):261–292MathSciNetCrossRef

Mejova Y, Srinivasan P (2012) Political speech in social media streams: Youtube comments and twitter posts. In: Proceedings of the 4th annual ACM web science conference. ACM, pp 205–208

Mejova Y, Srinivasan P, Boynton B (2013) Gop primary season on twitter: popular political sentiment in social media. In: Proceedings of the sixth ACM international conference on Web search and data mining. ACM, pp 517–526

Michailidis D, Stylianou N, Vlahavas I (2018) Real time location based sentiment analysis on twitter: the airsent system. In: Proceedings of the 10th Hellenic conference on artificial intelligence. ACM, p 21

Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781

Miller GA (1995) Wordnet: a lexical database for English. Commun ACM 38(11):39–41CrossRef

Min M, Lee T, Hsu R (2013) Role of emoticons in sentence-level sentiment classification. In: Chinese computational linguistics and natural language processing based on naturally annotated big data. Springer, pp 203–213

Mirza M, Osindero S (2014) Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784

Mishev K, Gjorgjevikj A, Vodenska I, Chitkushev LT, Trajanov D (2020) Evaluation of sentiment analysis in finance: from lexicons to transformers. IEEE Access 8(2020):131662–131682CrossRef

Mishra S, Diesner J (2018) Detecting the correlation between sentiment and user-level as well as text-level meta-data from benchmark corpora. In: Proceedings of the 29th on hypertext and social media. ACM, pp 2–10

Moh M, Moh T-S, Peng Y, Wu L (2017) On adverse drug event extractions using twitter sentiment analysis. Netw Model Anal Health Inf Bioinform 6(1):18CrossRef

Mohammad SM, Kiritchenko S, Zhu X (2013) Nrc-canada: building the state-of-the-art in sentiment analysis of tweets. In: Proceedings of the seventh international workshop on Semantic Evaluation Exercises (SemEval-2013), Atlanta, Georgia, USA

Mohammad SM, Turney PD (2010) Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon. In: Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text, Association for Computational Linguistics, pp 26–34

Mohammad SM, Turney PD (2013) Crowdsourcing a word-emotion association lexicon. Comput Intell 29(3):436–465MathSciNetCrossRef

Molina-González MD, Martínez-Cámara E, Martín-Valdivia MT, Urena-López LA (2014) Cross-domain sentiment analysis using Spanish opinionated words. In: International conference on applications of natural language to data bases/information systems. Springer, pp 214–219

Montejo-Raez A, Díaz-Galiano MC, Martinez-Santiago F, Ureña-López L (2014) Crowd explicit sentiment analysis. Knowl-Based Syst 69(2014):134–139CrossRef

Morency L-P, Mihalcea R, Doshi P (2011) Towards multimodal sentiment analysis: harvesting opinions from the web. In: Proceedings of the 13th international conference on multimodal interfaces. ACM, pp 169–176

Mostafa MM (2013) An emotional polarity analysis of consumers’ airline service tweets. Soc Netw Anal Min 3(3):635–649CrossRef

Mostafa MM (2013) More than words: social networks’ text mining for consumer brand sentiments. Expert Syst Appl 40(10):4241–4251CrossRef

Mozetič I, Grčar M, Smailović J (2016) Twitter sentiment for 15 european languages, slovenian language resource repository CLARIN.SI. http://hdl.handle.net/11356/1054

Mukherjee A, Mukherjee S (2017) Collision theory based sentiment detection of twitter using discourse relations. In: Proceedings of the international conference on signal, networks, computing, and systems. Springer, pp 235–242

Mukhtar N, Khan MA (2019) Effective lexicon-based approach for Urdu sentiment analysis. Artif Intell Rev 2019:1–28

Mukkamala RR, Hussain A, Vatrapu R (2014) Fuzzy-set based sentiment analysis of big social data. In: 2014 IEEE 18th international enterprise distributed object computing conference (EDOC). IEEE, pp 71–80

Mukkamala RR, Hussain A, Vatrapu R (2014) Towards a set theoretical approach to big data analytics. In: 2014 IEEE international congress on big data (BigData Congress). IEEE, pp 629–636

Müller M, Salathé M, Kummervold PE (2020) Covid-twitter-bert: a natural language processing model to analyse covid-19 content on twitter. arXiv preprint arXiv:2005.07503

Mumu TS, Ezeife CI (2014) Discovering community preference influence network by social network opinion posts mining. In: International conference on data warehousing and knowledge discovery. Springer, pp 136–145

Munezero M, Montero CS, Mozgovoy M, Sutinen E (2015) Emotwitter—a fine-grained visualization system for identifying enduring sentiments in tweets. CICLing 2(9042):78–91

Nagiwale AN, Umale MR (2015) Design of self-adjusting algorithm for data-intensive mapreduce applications. In: 2015 international conference on energy systems and applications. IEEE, pp 506–510

Nakov P, Kozareva Z, Ritter A, Rosenthal S, Stoyanov V, Wilson T (2013) Semeval-2013 task 2: sentiment analysis in twitter. In: Proceedings of the 7th international workshop on semantic evaluation (SemEval-2013), 2013

Nakov P, Ritter A, Rosenthal S, Stoyanov V, Sebastiani F (June 2016) SemEval-2016 task 4: sentiment analysis in Twitter. In: Proceedings of the 10th international workshop on semantic evaluation, SemEval ’16. Association for Computational Linguistics, San Diego, California

Napitu F, Bijaksana MA, Trisetyarso A, Heryadi Y (2017) Twitter opinion mining predicts broadband internet’s customer churn rate. In: 2017 IEEE international conference on cybernetics and computational intelligence (CyberneticsCom). IEEE, pp 141–146

Narr S, Hulfenhaus M, Albayrak S (2012) Language-independent twitter sentiment analysis, Knowledge discovery and machine learning (KDML). LWA 2012:12–14

Naseem U, Razzak I, Musial K, Imran M (2020) Transformer based deep intelligent contextual embedding for twitter sentiment analysis. Future Gener Comput Syst 113(2020):58–69CrossRef

Nassirtoussi AK, Aghabozorgi S, Wah TY, Ngo DCL (2014) Text mining for market prediction: a systematic review. Expert Syst Appl 41(16):7653–7670CrossRef

Nausheen F, Begum SH (2018) Sentiment analysis to predict election results using python. In: 2018 2nd international conference on inventive systems and control (ICISC). IEEE, pp 1259–1262

Neethu M, Rajasree R (2013) Sentiment analysis in twitter using machine learning techniques. In: 2013 fourth international conference on computing, communications and networking technologies (ICCCNT). IEEE, pp 1–5

Nguyen DQ, Vu T, Nguyen AT (2020) Bertweet: a pre-trained language model for English tweets. arXiv preprint arXiv:2005.10200

Nguyen ML (2016) Leveraging emotional consistency for semi-supervised sentiment classification. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 369–381

Nivetha K, Ram GR, Ajitha P (2016) Opinion mining from social media using fuzzy inference system (FIS). In: 2016 international conference on communication and signal processing (ICCSP). IEEE, pp 2171–2175

Nugroho AS, Doewes A et al (2017) Twitter sentiment analysis of DKI Jakarta’s gubernatorial election 2017 with predictive and descriptive approaches. In: 2017 international conference on computer, control, informatics and its applications (IC3INA). IEEE, pp 89–94

Ochoa-Luna J, Ari D (2018) Deep neural network approaches for Spanish sentiment analysis of short texts. In: Ibero-American conference on artificial intelligence. Springer, pp 430–441

Oh C, Kumar S (2017) How Trump won: the role of social media sentiment in political elections. In: PACIS, p 48

Omar MS, Njeru A, Yi S (2017) The influence of twitter on education policy making. In: 2017 18th IEEE/ACIS international conference on software engineering, artificial intelligence, networking and parallel/distributed computing (SNPD). IEEE, pp 133–136

Orellana-Rodriguez C, Diaz-Aviles E, Nejdl W (2013) Mining emotions in short films: user comments or crowdsourcing? In: Proceedings of the 22nd international conference on World Wide Web. ACM, pp 69–70

Orellana-Rodriguez C, Diaz-Aviles E, Nejdl W (2015) Mining affective context in short films for emotion-aware recommendation. In: Proceedings of the 26th ACM conference on hypertext & social media. ACM, pp 185–194

Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in facebook and its application to e-learning. Comput Hum Behav 31(2014):527–541CrossRef

Ortis A, Farinella GM, Torrisi G, Battiato S (2018) Visual sentiment analysis based on on objective text description of images. In: 2018 international conference on content-based multimedia indexing (CBMI). IEEE, pp 1–6

Ortony A, Clore G, Collins A (1988) The cognitive structure of emotions. Cambridge University Press, CambridgeCrossRef

Ou G, Chen W, Li B, Wang T, Yang D, Wong K-F (2014) Clusm: an unsupervised model for microblog sentiment analysis incorporating link information. In: International conference on database systems for advanced applications. Springer, pp 481–494

Ouyang Y, Guo B, Zhang J, Yu Z, Zhou X (2017) Sentistory: multi-grained sentiment analysis and event summarization with crowdsourced social media data. Pers Ubiquit Comput 21(1):97–111CrossRef

Ozer M, Yildirim MY, Davulcu H (2017) Negative link prediction and its applications in online political networks. In: Proceedings of the 28th ACM conference on hypertext and social media. ACM, pp 125–134

Pääkkönen P (2016) Feasibility analysis of Asterixdb and spark streaming with Cassandra for stream-based processing. J Big Data 3(1):6CrossRef

Pai RR, Alathur S (2018) Assessing mobile health applications with twitter analytics. Int J Med Inform 113(2018):72–84CrossRef

Pai P-F, Liu C-H (2018) Predicting vehicle sales by sentiment analysis of twitter data and stock market values. IEEE Access 6(2018):57655–57662CrossRef

Pak A, Paroubek P (2010) Twitter as a corpus for sentiment analysis and opinion mining. In: Chair NCC, Choukri K, Maegaard B, Mariani J, Odijk J, Piperidis S, Rosner M, Tapias D (eds) Proceedings of the seventh international conference on language resources and evaluation (LREC’10). European Language Resources Association (ELRA), Valletta, Malta

Paltoglou G, Thelwall M (2012) Twitter, myspace, digg: unsupervised sentiment analysis in social media. ACM Trans Intell Syst Technol (TIST) 3(4):66

Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359CrossRef

Pandarachalil R, Sendhilkumar S, Mahalakshmi G (2015) Twitter sentiment analysis for large-scale data: an unsupervised approach. Cogn Comput 7(2):254–262CrossRef

Pandey AC, Rajpoot DS, Saraswat M (2017) Twitter sentiment analysis using hybrid cuckoo search method. Inf Process Manag 53(4):764–779CrossRef

Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retrieval 2(1–2):1–135. https://doi.org/10.1561/1500000011CrossRef

Pant DR, Neupane P, Poudel A, Pokhrel AK, Lama BK (2018) Recurrent neural network based bitcoin price prediction by twitter sentiment analysis. In: 2018 IEEE 3rd international conference on computing, communication and security (ICCCS). IEEE, pp 128–132

Park J, Kim H, Cha M, Jeong J (2011) Ceo’s apology in twitter: a case study of the fake beef labeling incident by e-mart. Soc Inform 2011:300–303CrossRef

Parthasarathi J, Sundararaman K, Rao GSV (2012) Perisikan: An intelligent framework for social network data analysis. In: 2012 international conference on communications and information technology (ICCIT). IEEE, pp 13–16

Patra BG, Das D, Das A, Prasath R (2015) Shared task on sentiment analysis in Indian languages (sail) tweets-an overview. In: International conference on mining intelligence and knowledge exploration. Springer, pp 650–655

Pavel A, Palade V, Iqbal R, Hintea D (2017) Using short urls in tweets to improve twitter opinion mining. In: 2017 16th IEEE international conference on machine learning and applications (ICMLA). IEEE, pp 965–970

Pei S, Zhang L, Li A (2014) Microblog sentiment analysis model based on emoticons. In: Asia-Pacific web conference. Springer, pp 127–135

Peng Y, Moh M, Moh T-S (2016) Efficient adverse drug event extraction using twitter sentiment analysis. In: 2016 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM). IEEE, pp 1011–1018

Pereira DA (2021) A survey of sentiment analysis in the Portuguese language. Artif Intell Rev 54(2):1087–1115CrossRef

Permatasari RI, Fauzi MA, Adikara PP, Sari EDL (2018) Twitter sentiment analysis of movie reviews using ensemble features based Naïve Bayes. In: 2018 international conference on sustainable information engineering and technology (SIET). IEEE, pp 92–95

Petz G, Karpowicz M, Fürschuß H, Auinger A, Stříteskỳ V, Holzinger A (2013) Opinion mining on the web 2.0-characteristics of user generated content and their impacts. In: Human-computer interaction and knowledge discovery in complex, unstructured, big data. Springer, pp 35–46

Philander K, YunYing Z et al (2016) Twitter sentiment analysis: capturing sentiment from integrated resort tweets. Int J Hosp Manag 55:16–24CrossRef

Piñeiro-Chousa J, López-Cabarcos MÁ, Pérez-Pico AM, Ribeiro-Navarrete B (2018) Does social network sentiment influence the relationship between the s&p 500 and gold returns? Int Rev Financ Anal 57(2018):57–64CrossRef

Pla F, Hurtado L-F (2014) Sentiment analysis in twitter for Spanish. In: International conference on applications of natural language to data bases/information systems. Springer, pp 208–213

Plutchik R (1980) Chapter 1—a general psychoevolutionary theory of emotion. In: Plutchik R, Kellerman H (eds) Theories of emotion. Academic press, pp 3—33. https://doi.org/10.1016/B978-0-12-558701-3.50007-7. http://www.sciencedirect.com/science/article/pii/B9780125587013500077

Politopoulou V, Maragoudakis M (2013) On mining opinions from social media. In: International conference on engineering applications of neural networks. Springer, pp 474–484

Pollacci L, Sîrbu A, Giannotti F, Pedreschi D, Lucchese C, Muntean CI (2017) Sentiment spreading: an epidemic model for lexicon-based sentiment analysis on twitter. In: Conference of the Italian Association for Artificial Intelligence. Springer, pp 114–127

Polymerou E, Chatzakou D, Vakali A (2014) Emotube: A sentiment analysis integrated environment for social web content. In: Proceedings of the 4th international conference on web intelligence, mining and semantics (WIMS14). ACM, p 20

Poortvliet D, Wang X (2018) Intelligence retrieval from a centralized IoT network. In: 2018 IEEE international conference on big data (Big Data). IEEE, pp 5176–5180

Poria S, Gelbukh A, Cambria E, Hussain A, Huang G-B (2014) Emosenticspace: a novel framework for affective common-sense reasoning. Knowl-Based Syst 69(2014):108–123CrossRef

Poria S, Cambria E, Howard N, Huang G-B, Hussain A (2016) Fusing audio, visual and textual clues for sentiment analysis from multimodal content. Neurocomputing 174(2016):50–59CrossRef

Porshnev A, Redkin I (2014) Analysis of twitter users’ mood for prediction of gold and silver prices in the stock market. In: International conference on analysis of images, social networks and texts. Springer, pp 190–197

Porshnev A, Redkin I, Karpov N (2014) Modelling movement of stock market indexes with data from emoticons of twitter users. In: Russian Summer school in information retrieval. Springer, pp 297–306

Porshnev A, Redkin I, Shevchenko A (2013) Machine learning in prediction of stock market indicators based on historical data and data from twitter sentiment analysis. In: 2013 IEEE 13th international conference on data mining workshops (ICDMW). IEEE, pp 440–444

Prabowo R, Thelwall M (2009) Sentiment analysis: a combined approach. J Informetr 3(2):143–157CrossRef

Prusa J, Khoshgoftaar TM, Napolitano A (2015) Utilizing ensemble, data sampling and feature selection techniques for improving classification performance on tweet sentiment data. In: 2015 IEEE 14th international conference on machine learning and applications (ICMLA). IEEE, pp 535–542

Pupi S, Di Pietro G, Aliprandi C (2014) Ent-it-up. In: International conference on human–computer interaction. Springer, pp 3–8

Qaisi LM, Aljarah I (2016) A twitter sentiment analysis for cloud providers: a case study of AZURE vs. AWS. In: 2016 7th international conference on computer science and information technology (CSIT). IEEE, pp 1–6

Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106CrossRef

Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers, San Mateo

Rabelo JC, Prudêncio RB, Barros FA (2012) Using link structure to infer opinions in social networks. In: 2012 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 681–685

Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I (2019) Language models are unsupervised multitask learners. OpenAI blog 1(8):9

Radhika N, Sankar S (2017) Personalized language-independent music recommendation system. In: 2017 international conference on intelligent computing and control (I2C2). IEEE, pp 1–6

Ragavi MD, Usharani S (2014) Social data analysis for predicting next event. In: 2014 international conference on information communication and embedded systems (ICICES). IEEE, pp 1–5

Rai A, Minsker B, Diesner J, Karahalios K, Sun Y (2018) Identification of landscape preferences by using social media analysis. In: 2018 international workshop on social sensing (SocialSens). IEEE, pp 44–49

Rajalakshmi S, Asha S, Pazhaniraja N (2017) A comprehensive survey on sentiment analysis. In: 2017 fourth international conference on signal processing, communication and networking (ICSCN). IEEE, pp 1–5

Raja M, Swamynathan S (2016) Tweet sentiment analyzer: sentiment score estimation method for assessing the value of opinions in tweets. In: Proceedings of the international conference on advances in information communication technology & computing. ACM, p 83

Ramadhani AM, Goo HS (2017) Twitter sentiment analysis using deep learning methods. In: 2017 7th international annual engineering seminar (InAES). IEEE, pp 1–4

Ramadhani RA, Indriani F, Nugrahadi DT (2016) Comparison of Naive Bayes smoothing methods for twitter sentiment analysis, pp 287–292

Ramirez I, Sprechmann P, Sapiro G (2010) Classification and clustering via dictionary learning with structured incoherence and shared features. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 3501–3508

Ramteke J, Shah S, Godhia D, Shaikh A (2016) Election result prediction using twitter sentiment analysis. In: International conference on inventive computation technologies (ICICT), vol 1. IEEE, pp 1–5

Rani S, Kumar P (2019) A journey of Indian languages over sentiment analysis: a systematic review. Artif Intell Rev 52(2):1415–1462CrossRef

Ranjan S, Sood S, Verma V (2018) Twitter sentiment analysis of real-time customer experience feedback for predicting growth of Indian telecom companies. In: 2018 4th international conference on computing sciences (ICCS). IEEE, pp 166–174

Rao T, Srivastava S (2014) Twitter sentiment analysis: how to hedge your bets in the stock markets. In: State of the art applications of social network analysis. Springer, pp 227–247

Rathan M, Hulipalled VR, Venugopal K, Patnaik L (2018) Consumer insight mining: aspect based twitter opinion mining of mobile phone reviews. Appl Soft Comput 68(2018):765–773

Rathan M, Hulipalled VR, Murugeshwari P, Sushmitha H (2017) Every post matters: a survey on applications of sentiment analysis in social media. In: 2017 international conference on smart technologies for smart nation (SmartTechCon). IEEE, pp 709–714

Ravi K, Ravi V (2015) A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl-Based Syst 89(2015):14–46CrossRef

Ray P, Chakrabarti A (2017) Twitter sentiment analysis for product review using lexicon method. In: 2017 international conference on data management, analytics and innovation (ICDMAI). IEEE, pp 211–216

Rexha A, Kröll M, Dragoni M, Kern R (2016) Polarity classification for target phrases in tweets: a word2vec approach. International semantic web conference. Springer, pp 217–223

Reyes A, Rosso P, Veale T (2013) A multidimensional approach for detecting irony in twitter. Lang Resour Eval 47(1):239–268CrossRef

Rezaei Z, Jalali M (2017) Sentiment analysis on twitter using mcdiarmid tree algorithm. In: 2017 7th international conference on computer and knowledge engineering (ICCKE). IEEE, pp 33–36

Rezk MA, Ojo A, El Khayat GA, Hussein S (2018) A predictive government decision based on citizen opinions: tools & results. In: Proceedings of the 11th international conference on theory and practice of electronic governance. ACM, pp 712–714

Rill S, Reinel D, Scheidt J, Zicari RV (2014) Politwi: early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis. Knowl-Based Syst 69(2014):24–33CrossRef

Rinaldi E, Musdholifah A (2017) FVEC-SVM for opinion mining on Indonesian comments of youtube video. In: 2017 international conference on data and software engineering (ICoDSE). IEEE, pp 1–5

Rosenthal S, Farra N, Nakov P (Aug. 2017) SemEval-2017 task 4: Sentiment analysis in twitter. In: Proceedings of the 11th international workshop on semantic evaluation (SemEval-2017). Association for Computational Linguistics, Vancouver, pp 502–518. https://doi.org/10.18653/v1/S17-2088. https://www.aclweb.org/anthology/S17-2088

Rosenthal S, Nakov P, Kiritchenko S, Mohammad S, Ritter A, Stoyanov V (2015) Semeval-2015 task 10: sentiment analysis in twitter. In: Proceedings of the 9th international workshop on semantic evaluation (SemEval 2015), pp 451–463

Rosenthal S, Ritter A, Nakov P, Stoyanov V (2014) Semeval-2014 task 9: sentiment analysis in twitter. In: Proceedings of the 8th international workshop on semantic evaluation (SemEval-2014)

Rousseeuw PJ (1984) Least median of squares regression. J Am Stat Assoc 79(388):871–880MathSciNetMATHCrossRef

Rout JK, Singh S, Jena SK, Bakshi S (2017) Deceptive review detection using labeled and unlabeled data. Multimed Tools Appl 76(3):3187–3211CrossRef

Rout JK, Choo K-KR, Dash AK, Bakshi S, Jena SK, Williams KL (2018) A model for sentiment and emotion analysis of unstructured social media text. Electron Comm Res 18(1):181–199CrossRef

Ruder S (2017) Transfer learning—machine learning’s next frontier. http://ruder.io/transfer-learning/

Rui H, Liu Y, Whinston A (2013) Whose and what chatter matters? The effect of tweets on movie sales. Decis Support Syst 55(4):863–870CrossRef

Rumelhart DE, Hinton GE, Williams RJ (1985) Learning internal representations by error propagation. California Univ San Diego La Jolla Inst for Cognitive Science, Tech. rep

Sabuj MS, Afrin Z, Hasan KA (2017) Opinion mining using support vector machine with web based diverse data. In: International conference on pattern recognition and machine intelligence. Springer, pp 673–678

Sachdeva A, Kapoor R, Sharma A, Mishra A (2018) An approach towards identification and prevention of riots by analysis of social media posts in real-time. In: 2018 8th international conference on cloud computing, data science & engineering (Confluence). IEEE, pp 14–15

Sahni T, Chandak C, Chedeti NR, Singh M (2017) Efficient twitter sentiment classification using subjective distant supervision. In: 2017 9th international conference on communication systems and networks (COMSNETS). IEEE, pp 548–553

Sahu S, Rout SK, Mohanty D (2015) Twitter sentiment analysis-a more enhanced way of classification and scoring. In: 2015 IEEE international symposium on nanoelectronic and information systems (iNIS). IEEE, pp 67–72

Saidani FR, Hadjali A, Rassoul I, Belkasmi D (2017) Skyline-based feature selection for polarity classification in social networks. In: International conference on database and expert systems applications. Springer, pp 381–394

Saif H, Fernandez M, He Y, Alani H (2013) Evaluation datasets for twitter sentiment analysis: a survey and a new dataset, the sts-gold

Saif H, Fernandez M, He Y, Alani H (2014) Senticircles for contextual and conceptual semantic sentiment analysis of twitter. In: European semantic web conference. Springer, pp 83–98

Saif H, He Y, Alani H (2012) Semantic sentiment analysis of twitter. In: The Semantic Web-ISWC 2012, pp 508–524

Saif H, He Y, Fernandez M, Alani H (2014) Adapting sentiment lexicons using contextual semantics for sentiment analysis of twitter. European semantic web conference. Springer, pp 54–63

Saif H, He Y, Fernandez M, Alani H (2014) Semantic patterns for sentiment analysis of twitter. In: International semantic web conference. Springer, pp 324–340

Saini S, Rao R, Vaichole V, Rane A, Abin D (2018) Emotion recognition using multimodal approach. In: 2018 fourth international conference on computing communication control and automation (ICCUBEA). IEEE, pp 1–4

Salari S, Sedighpour N, Vaezinia V, Momtazi S (2018) Estimation of 2017 Iran’s presidential election using sentiment analysis on social media. In: 2018 4th Iranian conference on signal processing and intelligent systems (ICSPIS). IEEE, pp 77–82

Saleena N et al (2018) An ensemble classification system for twitter sentiment analysis. Procedia Comput Sci 132(2018):937–946

Salton G, McGill MJ (1986) Introduction to modern information retrieval

Samoylov AB (2014) Evaluation of the delta TF-IDF features for sentiment analysis. In: International conference on analysis of images, social networks and texts. Springer, pp 207–212

Sanborn R, Farmer M, Banerjee S (2015) Assigning geo-relevance of sentiments mined from location-based social media posts. In: International symposium on intelligent data analysis. Springer, pp 253–263

Sánchez-Holgado P, Arcila-Calderón C (2018) Towards the study of sentiment in the public opinion of science in Spanish. In: Proceedings of the sixth international conference on technological ecosystems for enhancing multiculturality. ACM, pp 963–970

Sandoval-Almazan R, Valle-Cruz D (2018) Facebook impact and sentiment analysis on political campaigns. In: Proceedings of the 19th annual international conference on digital government research: governance in the data age. ACM, p 56

Sangameswar M, Rao MN, Satyanarayana S (2017) An algorithm for identification of natural disaster affected area. J Big Data 4(1):39CrossRef

Sanh V, Debut L, Chaumond J, Wolf T (2019) Distilbert, a distilled version of Bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108

Sankaranarayanan S, Ingale D, Bhambhu R, Chandrasekaran K (2016) Towards sentiment orientation data set enrichment. In: Proceedings of the second international conference on information and communication technology for competitive strategies. ACM, p 69

Santarcangelo V, Oddo G, Pilato M, Valenti F, Fornaro C (2015) Social opinion mining: an approach for Italian language. In: 2015 3rd international conference on future Internet of Things and Cloud (FiCloud). IEEE, pp 693–697

Sanyal A, Kumar P, Kar P, Chawla S, Sebastiani F (2018) Optimizing non-decomposable measures with deep networks. Mach Learn 107(8–10):1597–1620MathSciNetMATHCrossRef

Sarlan A, Nadam C, Basri S (2014) Twitter sentiment analysis. In: 2014 international conference on information technology and multimedia (ICIMU). IEEE, pp 212–216

Sarsam SM, Al-Samarraie H, Alzahrani AI, Wright B (2020) Sarcasm detection using machine learning algorithms in twitter: a systematic review. Int J Market Res 62(5):578–598CrossRef

Satapathy R, Guerreiro C, Chaturvedi I, Cambria E (2017) Phonetic-based microtext normalization for twitter sentiment analysis. In: 2017 IEEE international conference on data mining workshops (ICDMW). IEEE, pp 407–413

Seron W, Zorzal E, Quiles MG, Basgalupp MP, Breve FA (2015) # worldcup2014 on twitter. In: International conference on computational science and its applications. Springer, pp 447–458

Serrano-Guerrero J, Olivas JA, Romero FP, Herrera-Viedma E (2015) Sentiment analysis: a review and comparative analysis of web services. Inf Sci 311(2015):18–38CrossRef

Setiawan EB, Widyantoro DH, Surendro K (2018) Feature expansion for sentiment analysis in twitter. Proc Electr Eng Comput Sci Inform 5(5):509–513

Severyn A, Moschitti A, Uryupina O, Plank B, Filippova K (2016) Multi-lingual opinion mining on youtube. Inf Process Manag 52(1):46–60CrossRef

Severyn A, Moschitti A (2015) Twitter sentiment analysis with deep convolutional neural networks. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval. ACM, pp 959–962

Shamma DA, Kennedy L, Churchill EF (2009) Tweet the debates: understanding community annotation of uncollected sources. In: Proceedings of the first SIGMM workshop on Social media. ACM, pp 3–10

Sharma N, Pabreja R, Yaqub U, Atluri V, Chun S, Vaidya J (2018) Web-based application for sentiment analysis of live tweets. In: Proceedings of the 19th annual international conference on digital government research: governance in the data age. ACM, p 120

Sharma P, Singh UK, Sharma TV, Das D (2016) Algorithm for prediction of links using sentiment analysis in social networks. In: Proceedings of the 7th international conference on computing communication and networking technologies. ACM, p 29

Sheth A, Jadhav A, Kapanipathi P, Lu C, Purohit H, Smith GA, Wang W (2014) Twitris: a system for collective social intelligence. In: Encyclopedia of social network analysis and mining. Springer, pp 2240–2253

Shi H, Chen W, Li X (2013) Opinion sentence extraction and sentiment analysis for Chinese microblogs. In: Natural Language Processing and Chinese Computing. Springer, pp 417–423

Shi S, Zhao M, Guan J, Li Y, Huang H (2017) A hierarchical LSTM model with multiple features for sentiment analysis of Sina Weibo texts. In: 2017 international conference on Asian Language Processing (IALP). IEEE, pp 379–382

Shukri SE, Yaghi RI, Aljarah I, Alsawalqah H (2015) Twitter sentiment analysis: a case study in the automotive industry. In: 2015 IEEE Jordan conference on applied electrical engineering and computing technologies (AEECT). IEEE, pp 1–5

Shyamasundar L, Rani PJ (2016) Twitter sentiment analysis with different feature extractors and dimensionality reduction using supervised learning algorithms. In: India conference (INDICON), 2016 IEEE annual. IEEE, pp 1–6

Siddiqua UA, Ahsan T, Chy AN (2016) Combining a rule-based classifier with weakly supervised learning for twitter sentiment analysis. In: International conference on innovations in science, engineering and technology (ICISET). IEEE, pp 1–4

Sihwi SW, Jati IP, Anggrainingsih R (2018) Twitter sentiment analysis of movie reviews using information gain and naïve bayes classifier. In: 2018 international seminar on application for technology of information and communication. IEEE, pp 190–195

Simões C, Neves R, Horta N (2017) Using sentiment from twitter optimized by genetic algorithms to predict the stock market. In: 2017 IEEE congress on evolutionary computation (CEC). IEEE, pp 1303–1310

Singh T, Kumari M (2016) Role of text pre-processing in twitter sentiment analysis. Procedia Comput Sci 89(2016):549–554CrossRef

Singh N, Roy N, Gangopadhyay A (2018) Analyzing the sentiment of crowd for improving the emergency response services. In: 2018 IEEE international conference on smart computing (SMARTCOMP). IEEE, pp 1–8

Sluban B, Smailović J, Battiston S, Mozetič I (2015) Sentiment leaning of influential communities in social networks. Comput Soc Netw 2(1):1–21CrossRef

Smailović J, Kranjc J, Grčar M, Žnidaršič M, Mozetič I (2015) Monitoring the twitter sentiment during the Bulgarian elections. In: IEEE international conference on data science and advanced analytics (DSAA), 2015. 36678 2015. IEEE, pp 1–10

Socher R, Perelygin A, Wu J, Chuang J, Manning CD, Ng A, Potts C (2013) Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the 2013 conference on empirical methods in natural language processing, pp 1631–1642

Song MY-J, Gruzd A (2017) Examining sentiments and popularity of pro-and anti-vaccination videos on youtube. In: Proceedings of the 8th international conference on social media & society. ACM, p 17

Song Y, Yang P, Zhang C, Ji Y (2015) Implicit feedback mining for recommendation. In: International conference on big data computing and communications. Springer, pp 373–385

Soni D, Sharma M, Khatri SK (2017) Political opinion mining using e-social network data. In: 2017 international conference on infocom technologies and unmanned systems (trends and future directions)(ICTUS). IEEE, pp 163–165

Souza E, Alves T, Teles I, Oliveira AL, Gusmão C (2016) Topie: An open-source opinion mining pipeline to analyze consumers’ sentiment in brazilian portuguese. In: International conference on computational processing of the Portuguese language. Springer, pp 95–105

Souza M, Vieira R (2012) Sentiment analysis on twitter data for Portuguese language. In: Computational processing of the Portuguese language, pp 241–247

Speriosu M, Sudan N, Upadhyay S, Baldridge J (2011) Twitter polarity classification with label propagation over lexical links and the follower graph. In: Proceedings of the first workshop on unsupervised learning in NLP. Association for Computational Linguistics, pp 53–63

Stojanovski D, Strezoski G, Madjarov G, Dimitrovski I, Chorbev I (2018) Deep neural network architecture for sentiment analysis and emotion identification of twitter messages. Multimed Tools Appl 77(24):32213–32242CrossRef

Stojanovski D, Strezoski G, Madjarov G, Dimitrovski I (2015) Twitter sentiment analysis using deep convolutional neural network. In: International conference on hybrid artificial intelligence systems. Springer, pp 726–737

Strubell E, Ganesh A, McCallum A (2019) Energy and policy considerations for deep learning in NLP. arXiv preprint arXiv:1906.02243

Sui H, Jianping Y, Hongxian Z, Wei Z (2012) Sentiment analysis of Chinese micro-blog using semantic sentiment space model. In: 2012 2nd international conference on computer science and network technology (ICCSNT). IEEE, pp 1443–1447

Sun X, Zhang C, Ding S, Quan C (2018) Detecting anomalous emotion through big data from social networks based on a deep learning method. Multimed Tools Appl 2018:1–22

Sun T, Wang J, Zhang P, Cao Y, Liu B, Wang D (2017) Predicting stock price returns using microblog sentiment for Chinese stock market. In: 2017 3rd international conference on big data computing and communications (BIGCOM). IEEE, pp 87–96

Sun X, Ye J-Q, Ren F-J (2014) Multi-strategy based sina microblog data acquisition for opinion mining. In: International conference on intelligent computing. Springer, pp 551–560

Supriya B, Kallimani V, Prakash S, Akki C (2016) Twitter sentiment analysis using binary classification technique. In: International conference on nature of computation and communication. Springer, pp 391–396

Suresh H (2016) An unsupervised fuzzy clustering method for twitter sentiment analysis. In: 2016 international conference on computation system and information technology for sustainable solutions (CSITSS). IEEE, pp 80–85. https://doi.org/10.1109/CSITSS.2016.7779444

Susanto Y, Livingstone AG, Ng BC, Cambria E (2020) The hourglass model revisited. IEEE Intell Syst 35(5):96–102CrossRef

Su Y, Wang H (2017) Lingosent-a platform for linguistic aware sentiment analysis for social media messages. In: International conference on multimedia modeling. Springer, pp 452–464

Su Z, Zhou B, Li A, Han Y (2014) Analysis on Chinese microblog sentiment based on syntax parsing and support vector machine. In: Asia-Pacific web conference. Springer, pp 104–114

Sygkounas E, Rizzo G, Troncy R (2016) A replication study of the top performing systems in semeval twitter sentiment analysis. International Semantic Web Conference. Springer, pp 204–219

Sykora M, Elayan S, Jackson TW (2020) A qualitative analysis of sarcasm, irony and related# hashtags on twitter. Big Data Society 7(2):2053951720972735CrossRef

Symeonidis S, Effrosynidis D, Arampatzis A (2018) A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Expert Syst Appl 110(2018):298–310CrossRef

Taddy M (2013) Measuring political sentiment on twitter: factor optimal design for multinomial inverse regression. Technometrics 55(4):415–425MathSciNetCrossRef

Tago K, Jin Q (2018) Influence analysis of emotional behaviors and user relationships based on twitter data. Tsinghua Sci Technol 23(1):104–113CrossRef

Tan SS, Soon L-K, Lim TY, Tang EK, Loo CK (2014) Learning the mapping rules for sentiment analysis. In: Proceedings of the 5th international workshop on web-scale knowledge representation retrieval & reasoning. ACM, pp 19–22

Tang D, Qin B, Liu T, Li Z (2013) Learning sentence representation for emotion classification on microblogs. In: Natural Language Processing and Chinese Computing. Springer, pp 212–223

Tan C, Lee L, Tang J, Jiang L, Zhou M, Li P (2011) User-level sentiment analysis incorporating social networks. In: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 1397–1405

Tapia PA, Velásquez JD (2014) Twitter sentiment polarity analysis: a novel approach for improving the automated labeling in a text corpora. In: International conference on active media technology. Springer, pp 274–285

Tasoulis SK, Vrahatis AG, Georgakopoulos SV, Plagianakos VP (July 2018) Real time sentiment change detection of twitter data streams. In: 2018 innovations in intelligent systems and applications (INISTA), pp 1–6. https://doi.org/10.1109/INISTA.2018.8466326

Teixeira A, Laureano RM (2017) Data extraction and preparation to perform a sentiment analysis using open source tools: The example of a facebook fashion brand page. In: 2017 12th Iberian conference on information systems and technologies (CISTI). IEEE, pp 1–6

Tellez ES, Miranda-Jiménez S, Graff M, Moctezuma D, Siordia OS, Villaseñor EA (2017) A case study of Spanish text transformations for twitter sentiment analysis. Expert Syst Appl 81(2017):457–471CrossRef

Thelwall M, Buckley K, Paltoglou G, Cai D, Kappas A (2010) Sentiment strength detection in short informal text. J Am Soc Inform Sci Technol 61(12):2544–2558CrossRef

Thelwall M, Buckley K, Paltoglou G (2011) Sentiment in twitter events. J Am Soc Inform Sci Technol 62(2):406–418CrossRef

Thelwall M, Buckley K, Paltoglou G (2012) Sentiment strength detection for the social web. J Am Soc Inform Sci Technol 63(1):163–173CrossRef

Tian P, Zhu Z, Xiong L, Xu F (2015) A recommendation mechanism for web publishing based on sentiment analysis of microblog. Wuhan Univ J Nat Sci 20(2):146–152CrossRef

Tiwari S, Bharadwaj A, Gupta S (2017) Stock price prediction using data analytics. In: 2017 international conference on advances in computing, communication and control (ICAC3). IEEE, pp 1–5

Tong Y, Zhou B, Huang J (2017) Topic-adaptive sentiment analysis on tweets via learning from multi-sources data. In: 2017 10th international symposium on computational intelligence and design (ISCID), vol 1. IEEE, pp 241–246

Troussas C, Krouska A, Virvou M (2016) Evaluation of ensemble-based sentiment classifiers for twitter data. In: 2016 7th international conference on information, intelligence, systems & applications (IISA). IEEE, pp 1–6

Trung DN, Nguyen TT, Jung JJ, Choi D (2013) Understanding effect of sentiment content toward information diffusion pattern in online social networks: a case study on tweetscope. In: International conference on context-aware systems and applications. Springer, pp 349–358

Tsakalidis A, Papadopoulos S, Kompatsiaris I (2014) An ensemble model for cross-domain polarity classification on twitter. In: International conference on web information systems engineering. Springer, pp 168–177

Tsytsarau M, Palpanas T (2012) Survey on mining subjective data on the web. Data Min Knowl Disc 24(3):478–514MATHCrossRef

Tsytsarau M, Palpanas T, Castellanos M (2014) Dynamics of news events and social media reaction. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 901–910

Tumasjan A, Sprenger TO, Sandner PG, Welpe IM (2010) Predicting elections with twitter: What 140 characters reveal about political sentiment. ICWSM 10(1):178–185CrossRef

Unankard S, Li X, Sharaf M, Zhong J, Li X (2014) Predicting elections from social networks based on sub-event detection and sentiment analysis. In: International conference on web information systems engineering. Springer, pp 1–16

Varshney N, Gupta S (2014) Mining churning factors in Indian telecommunication sector using social media analytics. In: International conference on data warehousing and knowledge discovery. Springer, pp 405–413

Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. arXiv preprint arXiv:1706.03762

Vilarinho G, Ruiz E (2018) Global centrality measures in word graphs for twitter sentiment analysis. In: 2018 7th Brazilian conference on intelligent systems (BRACIS). IEEE, pp 55–60

Villegas J, Cobos C, Mendoza M, Herrera-Viedma E (2018) Feature selection using sampling with replacement, covering arrays and rule-induction techniques to aid polarity detection in twitter sentiment analysis. In: Ibero-American conference on artificial intelligence. Springer, pp 467–480

Vincent P, Larochelle H, Bengio Y, Manzagol P-A (2008) Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on machine learning. ACM, pp 1096–1103

Vivanco E, Palanca J, del Val E, Rebollo M, Botti V (2017) Using geo-tagged sentiment to better understand social interactions. In: International conference on practical applications of agents and multi-agent systems. Springer, pp 369–372

Vo TH, Nguyen TT, Pham HA, Van Le T (2017) An efficient hybrid model for vietnamese sentiment analysis. In: Asian conference on intelligent information and database systems. Springer, pp 227–237

Vora J, Chacko AM (2017) Sentiment analysis of tweets to identify the correlated factors that influence an issue of interest. In: 2017 2nd international conference on telecommunication and networks (TEL-NET). IEEE, pp 1–6

Vu T-T, Chang S, Ha QT, Collier N (2012) An experiment in integrating sentiment features for tech stock prediction in twitter

Wadawadagi R, Pagi V (2020) Sentiment analysis with deep neural networks: comparative study and performance assessment. Artif Intell Rev 53(2020):6155–6195CrossRef

Wagh R, Punde P (2018) Survey on sentiment analysis using twitter dataset. In: 2018 second international conference on electronics, communication and aerospace technology (ICECA). IEEE, pp 208–211

Wagner S, Zimmermann M, Ntoutsi E, Spiliopoulou M (2015) Ageing-based multinomial Naive Bayes classifiers over opinionated data streams. In: Joint European conference on machine learning and knowledge discovery in databases. Springer, pp 401–416

Walha A, Ghozzi F, Gargouri F (2016) ETL design toward social network opinion analysis. In: Computer and information science 2015. Springer, pp 235–249

Wang J-H, Ye T-W (2013) Unsupervised opinion targets expansion and modification relation identification for microblog sentiment analysis. In: International conference on social informatics. Springer, pp 255–267

Wang Y, Kim K, Lee B, Youn HY (2018) Word clustering based on POS feature for efficient twitter sentiment analysis. Hum Centric Comput Inf Sci 8(1):17CrossRef

Wang H, Can D, Kazemzadeh A, Bar F, Narayanan S (2012) A system for real-time twitter sentiment analysis of 2012 us presidential election cycle. In: Proceedings of the ACL 2012 system demonstrations. Association for Computational Linguistics, pp 115–120

Wang M, Cao D, Li L, Li S, Ji R (2014) Microblog sentiment analysis based on cross-media bag-of-words model. In: Proceedings of international conference on internet multimedia computing and service. ACM, p 76

Wang W, Chen L, Thirunarayan K, Sheth AP (2012) Harnessing twitter“ big data” for automatic emotion identification. In: Privacy, security, risk and trust (PASSAT), 2012 international conference on and 2012 international conference on social computing (SocialCom). IEEE, pp 587–592

Wang Y, Feng S, Wang D, Yu G, Zhang Y (2016) Multi-label Chinese microblog emotion classification via convolutional neural network. In: Asia-Pacific web conference. Springer, pp 567–580

Wang Y, Feng S, Wang D, Zhang Y, Yu G (2016) Context-aware Chinese microblog sentiment classification with bidirectional LSTM. In: Asia-Pacific web conference. Springer, pp 594–606

Wang D, Li F (2014) Sentiment analysis of Chinese microblogs based on layered features. In: International conference on neural information processing. Springer, pp 361–368

Wang S, Manning CD (2012) Baselines and bigrams: Simple, good sentiment and topic classification. In: Proceedings of the 50th annual meeting of the association for computational linguistics: Short papers-volume 2, Association for Computational Linguistics, pp 90–94

Wang F, Wu Y (2015) Sentiment-bearing new words mining: Exploiting emoticons and latent polarities. In: International conference on intelligent text processing and computational linguistics. Springer, pp 166–179

Wang M, Liu M, Feng S, Wang D, Zhang Y (2014) A novel calibrated label ranking based method for multiple emotions detection in Chinese microblogs. In: Natural Language Processing and Chinese Computing. Springer, pp 238–250

Wang Z, Yu Z, Chen L, Guo B (2014) Sentiment detection and visualization of Chinese micro-blog. In: 2014 international conference on data science and advanced analytics (DSAA). IEEE, pp 251–257

Wang H, Zhang F, Hou M, Xie X, Guo M, Liu Q (2018) Shine: signed heterogeneous information network embedding for sentiment link prediction. In: Proceedings of the eleventh ACM international conference on web search and data mining. ACM, pp 592–600

Wan S, Li B, Zhang A, Wang K, Li X (2018) Vertical and sequential sentiment analysis of micro-blog topic. In: International conference on advanced data mining and applications. Springer, pp 353–363

Wazery YM, Mohammed HS, Houssein EH (2018) Twitter sentiment analysis using deep neural network. In: 2018 14th international computer engineering conference (ICENCO). IEEE, pp 177–182

Wehrmann J, Becker W, Cagnini HE, Barros RC (2017) A character-based convolutional neural network for language-agnostic twitter sentiment analysis. In: 2017 international joint conference on neural networks (IJCNN). IEEE, pp 2384–2391

Weichselbraun A, Gindl S, Fischer F, Vakulenko S, Scharl A (2017) Aspect-based extraction and analysis of affective knowledge from social media streams. IEEE Intell Syst 32(3):80–88CrossRef

Weiss L, Briscoe E, Hayes H, Kemenova O, Harbert S, Li F, Lebanon G, Stewart C, Steiger DM, Foy D (2013) A comparative study of social media and traditional polling in the Egyptian uprising of 2011. SBP. Springer, pp 303–310

Weiss SM, Indurkhya N, Zhang T (2015) Data sources for prediction: databases, hybrid data and the web. In: Fundamentals of predictive text mining. Springer, pp 147–164

Wijayanti R, Arisal A (2017) Ensemble approach for sentiment polarity analysis in user-generated Indonesian text. In: 2017 International conference on computer, control, informatics and its applications (IC3INA). IEEE, pp 158–163

Williamson W, Ruming K (2016) Social media adoption and use by Australian capital city local governments. In: Social media and local governments. Springer, pp 113–132

Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the conference on human language technology and empirical methods in natural language processing. Association for Computational Linguistics, pp 347–354

Witten IH, Frank E, Hall MA, Pal CJ (2016) Data Mining: practical machine learning tools and techniques. Morgan Kaufmann, Burlington

Wu L, Moh T-S, Khuri N (2015) Twitter opinion mining for adverse drug reactions. In: 2015 IEEE international conference on big data (Big Data). IEEE, pp 1570–1574

Wu F, Huang Y, Song Y, Liu S (2016) Towards building a high-quality microblog-specific Chinese sentiment lexicon. Decis Support Syst 87(2016):39–49CrossRef

Wunnasri W, Theeramunkong T, Haruechaiyasak C (2013) Solving unbalanced data for thai sentiment analysis. In: 2013 10th international joint conference on computer science and software engineering (JCSSE). IEEE, pp 200–205

Xia R, Jiang J, He H (2017) Distantly supervised lifelong learning for large-scale social media sentiment analysis. IEEE Trans Affect Comput 8(4):480–491CrossRef

Xiaomei Z, Jing Y, Jianpei Z, Hongyu H (2018) Microblog sentiment analysis with weak dependency connections. Knowl-Based Syst 142(2018):170–180CrossRef

Xie Y, Cheng Y, Honbo D, Zhang K, Agrawal A, Choudhary A (2012) Crowdsourcing recommendations from social sentiment. In: Proceedings of the first international workshop on issues of sentiment discovery and opinion mining. ACM, p 9

Xing F, Malandri L, Zhang Y, Cambria E (2020) Financial sentiment analysis: an investigation into common mistakes and silver bullets. In: Proceedings of the 28th international conference on computational linguistics, pp 978–987

Xiong X, Zhou G, Huang Y, Chen H, Xu K (2013) Dynamic evolution of collective emotions in social networks: a case study of Sina Weibo. Sci China Inf Sci 56(7):1–18MathSciNetCrossRef

Xu J-M, Zhu X, Bellmore A (2012) Fast learning for sentiment analysis on bullying. In: Proceedings of the first international workshop on issues of sentiment discovery and opinion mining. ACM, p 10

Xu J, Masuda K, Nishizaki H, Fukumoto F, Suzuki Y (2020) Semi-automatic construction and refinement of an annotated corpus for a deep learning framework for emotion classification. In: Proceedings of the 12th language resources and evaluation conference, pp 1611–1617

Yadav A, Vishwakarma DK (2020) Sentiment analysis using deep learning architectures: a review. Artif Intell Rev 53(6):4335–4385CrossRef

Yan G, He W, Shen J, Tang C (2014) A bilingual approach for conducting Chinese and English social media sentiment analysis. Comput Netw 75(2014):491–503CrossRef

Yan L, Zheng Y, Cao J (2018) Few-shot learning for short text classification. Multimed Tools Appl 77(22):29799–29810CrossRef

Yang Y, Zhou F (2015) Microblog sentiment analysis algorithm research and implementation based on classification. In: 2015 14th international symposium on distributed computing and applications for business engineering and science (DCABES). IEEE, pp 288–291

Yang D, Zhang D, Yu Z, Wang Z (2013) A sentiment-enhanced personalized location recommendation system. In: Proceedings of the 24th ACM conference on hypertext and social media. ACM, pp 119–128

Yang C-H, Chen J-D, Kao H-Y (2014) Competition component identification on twitter. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 584–595

Yang A, Zhang J, Pan L, Xiang Y (2015) Enhanced twitter sentiment analysis by using feature selection and combination. In: 2015 international symposium on security and privacy in social networks and big data (SocialSec). IEEE, pp 52–57

Yang Z, Yang D, Dyer C, He X, Smola A, Hovy E (2016) Hierarchical attention networks for document classification. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1480–1489

Yang Z, Dai Z, Yang Y, Carbonell J, Salakhutdinov R, Le QV (2019) Xlnet: generalized autoregressive pretraining for language understanding. arXiv preprint arXiv:1906.08237

Yanmei L, Yuda C (2015) Research on Chinese micro-blog sentiment analysis based on deep learning. In: 2015 8th international symposium on computational intelligence and design (ISCID), vol 1. IEEE, pp 358–361

Yan L, Tao H (2016) An empirical study and comparison for tweet sentiment analysis. In: International conference on cloud computing and security. Springer, pp 623–632

Yan B, Zhang B, Su H, Zheng H (2014) Comments-attached Chinese microblog sentiment classification based on machine learning technology. In: International conference on intelligent computing. Springer, pp 173–184

Yan Y, Yang H, Wang H-M (2017) Two simple and effective ensemble classifiers for twitter sentiment analysis. In: 2017 computing conference. IEEE, pp 1386–1393

Yenkar P, Sawarkar S (2018) A survey on social media analytics for smart city. In: 2018 2nd international conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC) I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC). IEEE, pp 87–93

You L, Tunçer B (2016) Exploring public sentiments for livable places based on a crowd-calibrated sentiment analysis mechanism. In: 2016 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM). IEEE, pp 693–700

You Q, Luo J, Jin H, Yang J (2015) Robust image sentiment analysis using progressively trained and domain transferred deep networks. AAAI 2015:381–388

Yu H-F, Huang F-L, Lin C-J (2011) Dual coordinate descent methods for logistic regression and maximum entropy models. Mach Learn 85(1–2):41–75MathSciNetMATHCrossRef

Yu Y, Duan W, Cao Q (2013) The impact of social and conventional media on firm equity value: a sentiment analysis approach. Decis Support Syst 55(4):919–926CrossRef

Yuan G, Murukannaiah PK, Zhang Z, Singh MP (2014) Exploiting sentiment homophily for link prediction. In: Proceedings of the 8th ACM conference on recommender systems. ACM, pp 17–24

Yuan J, You Q, Luo J (2015) Sentiment analysis using social multimedia. In: Multimedia data mining and analytics. Springer, pp 31–59

Zafar S, Sarwar U, Gilani Z, Qadir J (2016) Sentiment analysis of controversial topics on Pakistan’s twitter user-base. In: Proceedings of the 7th annual symposium on computing for development. ACM, p 35

Zainuddin N, Selamat A, Ibrahim R (2016a) Improving twitter aspect-based sentiment analysis using hybrid approach. In: Asian conference on intelligent information and database systems. Springer, pp 151–160

Zainuddin N, Selamat A, Ibrahim R (2016b) Twitter feature selection and classification using support vector machine for aspect-based sentiment analysis. In: International conference on industrial, engineering and other applications of applied intelligent systems. Springer, pp 269–279

Zainuddin N, Selamat A, Ibrahim R (2018) Hybrid sentiment classification on twitter aspect-based sentiment analysis. Appl Intell 48(5):1218–1232

Zhang K, Cheng Y, Xie Y, Honbo D, Agrawal A, Palsetia D, Lee K, Liao W-K, Choudhary A (2011) SES: sentiment elicitation system for social media data. In: 2011 IEEE 11th international conference on data mining workshops (ICDMW). IEEE, pp 129–136

Zhang L, Jia Y, Zhou B, Han Y (2012) Microblogging sentiment analysis using emotional vector. In: 2012 second international conference on cloud and green computing (CGC). IEEE, pp 430–433

Zhang H, Liu Y, Zhang M, Ma S (2013a) Grammatical phrase-level opinion target extraction on Chinese microblog messages. In: Natural Language Processing and Chinese Computing. Springer, pp 432–439

Zhang L, Pei S, Deng L, Han Y, Zhao J, Hong F (2013b) Microblog sentiment analysis based on emoticon networks model. In: Proceedings of the fifth international conference on internet multimedia computing and service. ACM, pp 134–138

Zhang K, Xie Y, Yang Y, Sun A, Liu H, Choudhary A (2014) Incorporating conditional random fields and active learning to improve sentiment identification. Neural Netw 58(2014):60–67CrossRef

Zhang Y, Shang L, Jia X (2015) Sentiment analysis on microblogging by integrating text and image features. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 52–63

Zhang S, Zhang X, Chan J (2017) A word-character convolutional neural network for language-agnostic twitter sentiment analysis. In: Proceedings of the 22nd Australasian document computing symposium. ACM, p 12

Zhang Y, Song D, Li X, Zhang P (2018) Unsupervised sentiment analysis of twitter posts using density matrix representation. In: European conference on information retrieval. Springer, pp 316–329

Zhang W, Xu M, Jiang Q (2018) Opinion mining and sentiment analysis in social media: Challenges and applications. In: International conference on HCI in business, government, and organizations. Springer, pp 536–548

Zhao B, He Y, Yuan C, Huang Y (2016) Stock market prediction exploiting microblog sentiment analysis. In: 2016 international joint conference on neural networks (IJCNN). IEEE, pp 4482–4488

Zhou Z, Zhang X, Sanderson M (2014) Sentiment analysis on twitter through topic-based lexicon expansion. In: Australasian database conference. Springer, pp 98–109

Zhou X, Tao X, Rahman MM, Zhang J (2017) Coupling topic modelling in opinion mining for social media analysis. In: Proceedings of the international conference on web intelligence. ACM, pp 533–540

Zhu X, Ghahramani Z (2002) Learning from labeled and unlabeled data with label propagation

Zhu S, Xu B, Zheng D, Zhao T (2013) Chinese microblog sentiment analysis based on semi-supervised learning. In: Semantic web and web science. springer, pp 325–331

Zimbra D, Ghiassi M, Lee S (2016) Brand-related twitter sentiment analysis using feature engineering and the dynamic architecture for artificial neural networks. In: 2016 49th Hawaii international conference on system sciences (HICSS). IEEE, pp 1930–1938

Zimbra D, Abbasi A, Zeng D, Chen H (2018) The state-of-the-art in twitter sentiment analysis: a review and benchmark evaluation. ACM Trans Manag Inf Syst (TMIS) 9(2):5

Zimmermann M, Ntoutsi E, Spiliopoulou M (2014) A semi-supervised self-adaptive classifier over opinionated streams. In: 2014 IEEE international conference on data mining workshop (ICDMW). IEEE, pp 425–432

Titel: Over a decade of social opinion mining: a systematic review
verfasst von: Keith Cortis
Brian Davis
Publikationsdatum: 25.06.2021
Verlag: Springer Netherlands
Erschienen in: Artificial Intelligence Review / Ausgabe 7/2021
Print ISSN: 0269-2821
Elektronische ISSN: 1573-7462
DOI: https://doi.org/10.1007/s10462-021-10030-2

Springer Professional

Abstract

Publisher's Note

1 Introduction

1.1 Opinion mining versus social opinion mining

1.2 Issues and challenges

1.3 Systematic review

2 Research method

2.1 Research questions

2.2 Search strategy

2.3 Search application

2.4 Study selection

2.5 Extraction of data

2.5.1 Overall

2.5.2 Study selection: electronic libraries

2.5.3 Study selection: additional set

2.6 Synthesis of data

3 Review analysis

3.1 Social media platforms

3.2 Techniques

3.2.1 Lexicon

3.2.2 Machine learning

3.2.3 Deep learning

3.2.4 Statistical

3.2.5 Probabilistic

3.2.6 Fuzziness

3.2.7 Rule-based

3.2.8 Graph

3.2.9 Ontology

3.2.10 Hybrid

3.2.11 Other

3.3 Social datasets

3.3.1 Overview

3.3.2 Comparative analysis

3.4 Language

3.5 Modality

3.5.1 Datasets

3.5.2 Observations

3.6 Tools and technologies

3.6.1 NLP

3.6.2 Machine learning

3.6.3 Opinion mining

3.6.4 Big data

3.7 Natural language processing tasks

3.7.1 Overview

3.7.2 Pre-processing and negations

3.7.3 Emoticons/Emojis

3.7.4 Word embeddings

3.7.5 Aspect-based social opinion mining

4 Dimensions of social opinion mining

4.1 Context

4.2 Different dimensions of social opinions identified in the review analysis

4.2.1 Subjectivity

4.2.2 Sentiment

4.2.3 Emotion

4.2.4 Affect

4.2.5 Irony

4.2.6 Sarcasm

4.2.7 Mood

4.2.8 Aggressiveness

4.2.9 Other

4.3 Impact of sarcasm and irony on social opinions

4.3.1 Challenges

4.3.2 Observations

5 Application areas of social opinion mining

6 Concluding remarks

6.1 Latest research of social opinion mining

6.2 Conclusion

Acknowledgements

Declarations

Conflict of interest

Ethical approval

Informed consent

Publisher's Note

Weitere Artikel der Ausgabe 7/2021

Deep learning for Alzheimer prediction using brain biomarkers

Parameter reduction analysis under interval-valued m-polar fuzzy soft information

Improved q-rung orthopair fuzzy line integral aggregation operators and their applications for multiple attribute decision making

Adaptive constraint propagation in constraint satisfaction: review and evaluation

A review and experimental analysis of active learning over crowdsourced data