Published in: The Journal of Supercomputing, Issue 6/2020

7 May 2018

A big data approach to sentiment analysis using greedy feature selection with cat swarm optimization-based long short-term memory neural networks

Authors: Abdulaziz Alarifi, Amr Tolba, Zafer Al-Makhadmeh, Wael Said

Abstract

Sentiment analysis is central to applications such as opinion mining and prediction. Considerable research has applied machine learning techniques to sentiment analysis, but the high error rates reported in these studies can reduce the efficiency of the overall system. To overcome this problem, we introduce a novel big data and machine learning technique for sentiment analysis. The data are collected from a large volume of datasets, which supports effective analysis. Noise in the data is removed in a preprocessing step based on data mining concepts. From the cleaned sentiment data, a greedy approach selects the most effective features, which are then processed by a classifier called the cat swarm optimization-based long short-term memory neural network (CSO-LSTMNN). The classifier analyzes sentiment-related features using an optimization scheme modeled on cat behavior, minimizing the error rate while examining the features. The efficiency of the technique is evaluated experimentally in terms of error rate, precision, recall, and accuracy. The results of greedy feature selection with CSO-LSTMNN are compared with those of the particle swarm optimization (PSO) algorithm; CSO-LSTMNN outperforms PSO, achieving higher accuracy and a lower error rate.
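The abstract does not state the exact greedy criterion or how the four evaluation measures are computed, so the following is only a minimal sketch under stated assumptions. It shows generic greedy forward feature selection (add the single feature that most improves a score, stop when nothing helps) and the standard binary definitions of accuracy, error rate, precision, and recall. The names `greedy_forward_selection`, `binary_metrics`, `toy_score`, and the toy gain values are hypothetical and are not taken from the paper; in practice the score would come from a validation run of a classifier such as CSO-LSTMNN.

```python
from typing import Callable, Dict, List, Sequence


def greedy_forward_selection(
    n_features: int,
    score: Callable[[Sequence[int]], float],
    max_features: int,
) -> List[int]:
    """Repeatedly add the feature that most improves `score`.

    Stops when no remaining feature improves the best score so far
    or when `max_features` features have been selected.
    """
    selected: List[int] = []
    best_score = float("-inf")
    while len(selected) < max_features:
        best_candidate = None
        for f in range(n_features):
            if f in selected:
                continue
            s = score(selected + [f])
            if s > best_score:
                best_score, best_candidate = s, f
        if best_candidate is None:  # no feature improved the score
            break
        selected.append(best_candidate)
    return selected


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, error rate, precision, and recall for labels in {0, 1}."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    return {
        "accuracy": accuracy,
        "error_rate": 1.0 - accuracy,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical stand-in for a validation score: each feature contributes a
# fixed gain, and every selected feature costs a small penalty.
TOY_GAINS = {0: 0.30, 1: 0.05, 2: 0.20, 3: -0.10}


def toy_score(subset: Sequence[int]) -> float:
    return sum(TOY_GAINS[f] for f in subset) - 0.06 * len(subset)


if __name__ == "__main__":
    print(greedy_forward_selection(4, toy_score, max_features=4))
    print(binary_metrics([1, 1, 0, 0, 1], [1, 0, 0, 1, 1]))
```

Passing the score as a callable keeps the selection loop independent of the classifier, so the same sketch applies whether the score comes from an LSTM, a PSO-tuned model, or any other evaluator.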

Metadata
Title
A big data approach to sentiment analysis using greedy feature selection with cat swarm optimization-based long short-term memory neural networks
Authors
Abdulaziz Alarifi
Amr Tolba
Zafer Al-Makhadmeh
Wael Said
Publication date
7 May 2018
Publisher
Springer US
Published in
The Journal of Supercomputing, Issue 6/2020
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-018-2398-2
