Skip to main content
Erschienen in: Neural Computing and Applications 12/2019

21.05.2019 | Original Article

Hybrid N-gram model using Naïve Bayes for classification of political sentiments on Twitter

verfasst von: Jamilu Awwalu, Azuraliza Abu Bakar, Mohd Ridzwan Yaakub

Erschienen in: Neural Computing and Applications | Ausgabe 12/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Twitter, an online micro-blogging and social networking service, provides registered users the ability to write in 140 characters anything they wish and hence providing them the opportunity to express their opinions and sentiments on events taking place. Politically sentimental tweets are top-trending tweets; whenever election is near, users tweet about their favorite candidates or political parties and at times give their reasons for that. In this study, we hybridize two n-gram [two n-gram models used in this study are unigram and n-gram. Therefore, in this study, where unigram is mentioned that refers to a least-order n-gram (unigram) and where n-gram is mentioned that refers to the highest-order (full sentence or tweet level) n-gram] models and applied Laplace smoothing to Naïve Bayesian classifier and Katz back-off on the model. This was done in order to smoothen and address the limitation of accuracy in terms of precision and recall of n-gram models caused by the ‘zero count problem.’ Result from our baseline model shows an increase of 6.05% in average F-Harmonic accuracy in comparison with the n-gram model and 1.75% increase in comparison with the semantic-topic model proposed from a previous study on the same dataset, i.e., Obama–McCain dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Kharde VA, Sonawane SS (2016) Sentiment analysis of Twitter data: a survey of techniques. Int J Comput Appl 139(11):5–15 Kharde VA, Sonawane SS (2016) Sentiment analysis of Twitter data: a survey of techniques. Int J Comput Appl 139(11):5–15
3.
Zurück zum Zitat Cambria E (2016) Affective computing and sentiment analysis. IEEE Intell Syst 16(March/April):102–107CrossRef Cambria E (2016) Affective computing and sentiment analysis. IEEE Intell Syst 16(March/April):102–107CrossRef
4.
Zurück zum Zitat Perikos I, Hatzilygeroudis I (2016) A classifier ensemble approach to detect emotions polarity in social media. In: Proceedings of the 12th international conference on web information systems and technologies (WEBIST 2016), vol 1, pp 363–370 Perikos I, Hatzilygeroudis I (2016) A classifier ensemble approach to detect emotions polarity in social media. In: Proceedings of the 12th international conference on web information systems and technologies (WEBIST 2016), vol 1, pp 363–370
7.
Zurück zum Zitat Bravo-marquez F, Frank E, Pfahringer B (2016) From opinion lexicons to sentiment classification of Tweets and vice versa: a transfer learning approach. In: The 2016 IEEE/WIC/ACM international conference on web intelligence at Omaha, Nebraska Bravo-marquez F, Frank E, Pfahringer B (2016) From opinion lexicons to sentiment classification of Tweets and vice versa: a transfer learning approach. In: The 2016 IEEE/WIC/ACM international conference on web intelligence at Omaha, Nebraska
8.
Zurück zum Zitat Boroş T, Ştefănescu D, Ion R (2013) Handling two difficult challenges for text-to-speech synthesis systems: out-of-vocabulary words and prosody: a case study in Romanian. In: Neustein A, Markowitz J (eds) Where humans meet machines. Springer, New York Boroş T, Ştefănescu D, Ion R (2013) Handling two difficult challenges for text-to-speech synthesis systems: out-of-vocabulary words and prosody: a case study in Romanian. In: Neustein A, Markowitz J (eds) Where humans meet machines. Springer, New York
9.
Zurück zum Zitat Zhu L, Smith ML, Lerman K, Kozareva Z (2013) The role of social media in the discussion of controversial topics. Accessed on 10 Jan 2014 Zhu L, Smith ML, Lerman K, Kozareva Z (2013) The role of social media in the discussion of controversial topics. Accessed on 10 Jan 2014
10.
Zurück zum Zitat Mukund D, Avik S (2010) BI and sentiment analysis. Bus Intell J 15(2):41–43 Mukund D, Avik S (2010) BI and sentiment analysis. Bus Intell J 15(2):41–43
11.
Zurück zum Zitat Cambria E, Schuller B, Xia Y, Havasi C (2013) New avenues in opinion mining and sentiment analysis. IEEE Intell Syst 28(2):15–21CrossRef Cambria E, Schuller B, Xia Y, Havasi C (2013) New avenues in opinion mining and sentiment analysis. IEEE Intell Syst 28(2):15–21CrossRef
12.
Zurück zum Zitat Jagtap VS, Pawar K (2013) Analysis of different approaches to sentence-level sentiment classification. Int J Sci Eng Technol 2(3):164–170 Jagtap VS, Pawar K (2013) Analysis of different approaches to sentence-level sentiment classification. Int J Sci Eng Technol 2(3):164–170
13.
Zurück zum Zitat Saif H, He Y, Alani H (2012) Semantic sentiment analysis of Twitter. In: 11th international semantic Web conference, Boston, MA, USA, November 11–15, 2012, proceedings, part I, pp 508–524 Saif H, He Y, Alani H (2012) Semantic sentiment analysis of Twitter. In: 11th international semantic Web conference, Boston, MA, USA, November 11–15, 2012, proceedings, part I, pp 508–524
14.
Zurück zum Zitat Tobias G, Johansson R (2013) Sentiment analysis of microblogs. Dissertation, University of Gothenberg Tobias G, Johansson R (2013) Sentiment analysis of microblogs. Dissertation, University of Gothenberg
15.
Zurück zum Zitat Wang S, Manning CD (1998) Baselines and bigrams: simple, good sentiment and topic classification, vol 94305(1) Wang S, Manning CD (1998) Baselines and bigrams: simple, good sentiment and topic classification, vol 94305(1)
16.
Zurück zum Zitat Tan S et al (2009) Adapting Naive Bayes to domain adaptation for sentiment analysis. In: Proceedings of the 31th European conference on IR research on advances in information retrieval, pp 337–349 Tan S et al (2009) Adapting Naive Bayes to domain adaptation for sentiment analysis. In: Proceedings of the 31th European conference on IR research on advances in information retrieval, pp 337–349
17.
Zurück zum Zitat Rennie JDM, Shih L, Teevan J, Karger DR (2003) Tackling the poor assumptions of naive bayes text classifiers. In: Proceedings of the twentieth international conference on machine learning (ICML-2003). AAAI Press, Washington, DC, pp 616–623 Rennie JDM, Shih L, Teevan J, Karger DR (2003) Tackling the poor assumptions of naive bayes text classifiers. In: Proceedings of the twentieth international conference on machine learning (ICML-2003). AAAI Press, Washington, DC, pp 616–623
18.
Zurück zum Zitat Rish I (2001) An empirical study of the Naive Bayes classifier. IBM Research Report. IBM Research Division, pp 41–46 Rish I (2001) An empirical study of the Naive Bayes classifier. IBM Research Report. IBM Research Division, pp 41–46
20.
Zurück zum Zitat Mensikova A, Mattmann CA (2018) Ensemble sentiment analysis to identify human trafficking in web data. In: Proceedings of ACM workshop on graph techniques for adversarial activity analytics (GTA32018), pp 0–5 Mensikova A, Mattmann CA (2018) Ensemble sentiment analysis to identify human trafficking in web data. In: Proceedings of ACM workshop on graph techniques for adversarial activity analytics (GTA32018), pp 0–5
22.
Zurück zum Zitat Ekman P (1999) Basic emotions. In: Handbook of cognition and emotion, pp 45–60CrossRef Ekman P (1999) Basic emotions. In: Handbook of cognition and emotion, pp 45–60CrossRef
24.
Zurück zum Zitat Narayanan V, Arora I, Bhatia A (2013) Fast and accurate sentiment classification using an enhanced Naive Bayes model. In: Proceedings of 14th international conference, IDEAL 2013, Hefei, China, October 20–23, 2013, pp 1–8CrossRef Narayanan V, Arora I, Bhatia A (2013) Fast and accurate sentiment classification using an enhanced Naive Bayes model. In: Proceedings of 14th international conference, IDEAL 2013, Hefei, China, October 20–23, 2013, pp 1–8CrossRef
25.
Zurück zum Zitat Jurafsky D, Martin JH (2007) Speech and language processing. In: An introduction to speech recognition, computational linguistics and natural language processing, p 42 Jurafsky D, Martin JH (2007) Speech and language processing. In: An introduction to speech recognition, computational linguistics and natural language processing, p 42
26.
Zurück zum Zitat Xiang B (2014) Improving Twitter sentiment analysis with topic-based mixture modeling and semi-supervised training. In: Proceedings of the 52nd annual meeting of the association for computational linguistics, pp 434–439 Xiang B (2014) Improving Twitter sentiment analysis with topic-based mixture modeling and semi-supervised training. In: Proceedings of the 52nd annual meeting of the association for computational linguistics, pp 434–439
27.
Zurück zum Zitat Jurafsky D, Martin JH (2008) Speech and language processing. N-grams, 2nd edn. Prentice Hall, Englewood Cliffs, pp 83–121 Jurafsky D, Martin JH (2008) Speech and language processing. N-grams, 2nd edn. Prentice Hall, Englewood Cliffs, pp 83–121
29.
Zurück zum Zitat Bo P, Lillian L (2005) Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Proceedings of the ACL Bo P, Lillian L (2005) Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Proceedings of the ACL
30.
Zurück zum Zitat Riaz M (2018) Fuzzy parameterized fuzzy soft compact spaces with decision-making. J Math 50(2):131–145MathSciNet Riaz M (2018) Fuzzy parameterized fuzzy soft compact spaces with decision-making. J Math 50(2):131–145MathSciNet
32.
Zurück zum Zitat Sa M, Riaz M, Hashmi MR (2016) Fuzzy parameterized fuzzy soft topology with applications. Ann Fuzzy Math Inf 13(5):593–613MathSciNetMATH Sa M, Riaz M, Hashmi MR (2016) Fuzzy parameterized fuzzy soft topology with applications. Ann Fuzzy Math Inf 13(5):593–613MathSciNetMATH
33.
Zurück zum Zitat Saif H, Fernandez M, He Y, Alani H (2013) Evaluation datasets for Twitter sentiment analysis: a survey and a new dataset, the STS-Gold. In: 1st international workshop on emotion and sentiment in social and expressive media: approaches and perspectives from AI (ESSEM 2013) Saif H, Fernandez M, He Y, Alani H (2013) Evaluation datasets for Twitter sentiment analysis: a survey and a new dataset, the STS-Gold. In: 1st international workshop on emotion and sentiment in social and expressive media: approaches and perspectives from AI (ESSEM 2013)
Metadaten
Titel
Hybrid N-gram model using Naïve Bayes for classification of political sentiments on Twitter
verfasst von
Jamilu Awwalu
Azuraliza Abu Bakar
Mohd Ridzwan Yaakub
Publikationsdatum
21.05.2019
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 12/2019
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-019-04248-z

Weitere Artikel der Ausgabe 12/2019

Neural Computing and Applications 12/2019 Zur Ausgabe

Machine Learning - Applications & Techniques in Cyber Intelligence

A non-intrusive load decomposition algorithm for residents

Premium Partner