Skip to main content
Erschienen in: Cluster Computing 4/2020

06.07.2020

Multi-level particle swarm optimisation and its parallel version for parameter optimisation of ensemble models: a case of sentiment polarity prediction

verfasst von: Gregorius Satia Budhi, Raymond Chiong, Sandeep Dhakal

Erschienen in: Cluster Computing | Ausgabe 4/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Ensemble learning is increasingly used in sentiment analysis. Determining the parameter settings of ensemble models, however, is not easy. Besides its own parameters, an ensemble model has base-predictors that have their individual parameters. Some ensemble models use a specific base-predictor and could be optimised using standard metaheuristics such as the Particle Swarm Optimisation (PSO) approach. Optimising ensemble models with multiple base-predictor candidates is more complicated and challenging, as there are multiple options to choose from. We therefore propose Multi-Level PSO (ML-PSO) and Parallel ML-PSO (PML-PSO) to optimise the parameters of ensemble models, especially those with multiple base-predictors, for sentiment analysis. The idea is to utilise multiple PSOs as particles of the main PSO. The main PSO optimises ensemble-model parameters and determines the best base-predictor, whereas PSOs within it optimise the corresponding base-predictor’s parameters. Experimental results using Bagging Predictors as the underlying ensemble model show that ML-PSO can improve prediction accuracy, while PML-PSO is able to speed up the processing time and further improve the accuracy.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Huang, J., Xue, Y., Hu, X., Jin, H., Lu, X., Liu, Z.: Sentiment analysis of Chinese online reviews using ensemble learning framework. Clust. Comput. 22, 3043–3058 (2019) Huang, J., Xue, Y., Hu, X., Jin, H., Lu, X., Liu, Z.: Sentiment analysis of Chinese online reviews using ensemble learning framework. Clust. Comput. 22, 3043–3058 (2019)
4.
Zurück zum Zitat Budhi, G.S., Chiong, R., Pranata, I., Hu, Z.: Predicting rating polarity through automatic classification of review texts. In: Proceedings of the 2017 IEEE Conference on Big Data and Analytics (ICBDA), pp. 19-24. Kuching, Malaysia, 16–17 November (2017) Budhi, G.S., Chiong, R., Pranata, I., Hu, Z.: Predicting rating polarity through automatic classification of review texts. In: Proceedings of the 2017 IEEE Conference on Big Data and Analytics (ICBDA), pp. 19-24. Kuching, Malaysia, 16–17 November (2017)
5.
Zurück zum Zitat Chiong, R., Fan, Z., Hu, Z., Adam, M.T.P., Lutz, B., Neumann, D.: A sentiment analysis-based machine learning approach for financial market prediction via news disclosures. In: Proceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO '18 Companion), pp. 278–279. Kyoto, Japan, 15–19 July (2018) Chiong, R., Fan, Z., Hu, Z., Adam, M.T.P., Lutz, B., Neumann, D.: A sentiment analysis-based machine learning approach for financial market prediction via news disclosures. In: Proceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO '18 Companion), pp. 278–279. Kyoto, Japan, 15–19 July (2018)
6.
Zurück zum Zitat Bansal, P., Kaur, R.: Twitter sentiment analysis using machine learning and optimization techniques. Int. J. Comput. Appl. 179(19), 5–8 (2018) Bansal, P., Kaur, R.: Twitter sentiment analysis using machine learning and optimization techniques. Int. J. Comput. Appl. 179(19), 5–8 (2018)
16.
Zurück zum Zitat Weise, T., Zapf, M., Chiong, R., Nebro, A.J.: Why is optimization difficult? In: Chiong, R. (ed.) Nature-Inspired Algorithms for Optimisation, pp. 1–50. Springer, Berlin (2009) Weise, T., Zapf, M., Chiong, R., Nebro, A.J.: Why is optimization difficult? In: Chiong, R. (ed.) Nature-Inspired Algorithms for Optimisation, pp. 1–50. Springer, Berlin (2009)
17.
Zurück zum Zitat Zolghadr-Asli, B., Bozorg-Haddad, O., Chu, X.: Introduction. In: Bozorg-Haddad, O. (ed.) Advanced Optimization by Nature-Inspired Algorithms. Springer, Singapore (2018) Zolghadr-Asli, B., Bozorg-Haddad, O., Chu, X.: Introduction. In: Bozorg-Haddad, O. (ed.) Advanced Optimization by Nature-Inspired Algorithms. Springer, Singapore (2018)
18.
Zurück zum Zitat Moser, I., Chiong, R.: Dynamic function optimization: The moving peaks benchmark. In: Alba, E., Nakib, A., Siarry, P. (eds.) Metaheuristics for Dynamic Optimization, pp. 35–59. Springer, Berlin (2013)CrossRef Moser, I., Chiong, R.: Dynamic function optimization: The moving peaks benchmark. In: Alba, E., Nakib, A., Siarry, P. (eds.) Metaheuristics for Dynamic Optimization, pp. 35–59. Springer, Berlin (2013)CrossRef
21.
Zurück zum Zitat Abedi, M., Chiong, R., Noman, N., Zhang, R.: A hybrid particle swarm optimisation approach for energy-efficient single machine scheduling with cumulative deterioration and multiple maintenances. In: Proceedings of 2017 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 2930–2937. Honolulu, Hawaii, USA, 27 November–1 December (2017) Abedi, M., Chiong, R., Noman, N., Zhang, R.: A hybrid particle swarm optimisation approach for energy-efficient single machine scheduling with cumulative deterioration and multiple maintenances. In: Proceedings of 2017 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 2930–2937. Honolulu, Hawaii, USA, 27 November–1 December (2017)
22.
Zurück zum Zitat Hu, Z., Chiong, R., Pranata, I., Susilo, W., Bao, Y.: Identifying malicious web domains using machine learning techniques with online credibility and performance data. In: Proceedings of IEEE Congress on Evolutionary Computation (CEC), pp. 5186–5194. Vancouver, BC, Canada, 24-29 July (2016) Hu, Z., Chiong, R., Pranata, I., Susilo, W., Bao, Y.: Identifying malicious web domains using machine learning techniques with online credibility and performance data. In: Proceedings of IEEE Congress on Evolutionary Computation (CEC), pp. 5186–5194. Vancouver, BC, Canada, 24-29 July (2016)
25.
Zurück zum Zitat Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: LIBLINEAR: A library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)MATH Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: LIBLINEAR: A library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)MATH
26.
Zurück zum Zitat Budhi, G.S., Chiong, R., Hu, Z., Pranata, I., Dhakal, S.: Multi-PSO based classifier selection and parameter optimisation for sentiment polarity prediction. In: Proceedings of IEEE Conference on Big Data and Analytics (ICBDA), pp. 68–73. Langkawi Island, Malaysia, 21-22 November (2018) Budhi, G.S., Chiong, R., Hu, Z., Pranata, I., Dhakal, S.: Multi-PSO based classifier selection and parameter optimisation for sentiment polarity prediction. In: Proceedings of IEEE Conference on Big Data and Analytics (ICBDA), pp. 68–73. Langkawi Island, Malaysia, 21-22 November (2018)
29.
Zurück zum Zitat Szwed, P., Chmiel, W.: Multi-swarm PSO algorithm for the quadratic assignment problem: a massive parallel implementation on the OpenCL platform. In: arXiv:1504.05158. (2015) Szwed, P., Chmiel, W.: Multi-swarm PSO algorithm for the quadratic assignment problem: a massive parallel implementation on the OpenCL platform. In: arXiv:1504.05158. (2015)
32.
Zurück zum Zitat Menard, S.: Logistic Regression: From Introductory to Advanced Concepts and Applications. SAGE, Los Angeles (2010)CrossRef Menard, S.: Logistic Regression: From Introductory to Advanced Concepts and Applications. SAGE, Los Angeles (2010)CrossRef
33.
Zurück zum Zitat Campbell, C., Ying, Y.: Learning with Support Vector Machines. Morgan & Claypool, San Rafael (2011)CrossRef Campbell, C., Ying, Y.: Learning with Support Vector Machines. Morgan & Claypool, San Rafael (2011)CrossRef
34.
Zurück zum Zitat Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation. Parallel Distributed Processing: Explorations in the Microstructure of Cognition vol 1, pp. 318–362. MIT Press, Cambridge (1986)CrossRef Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation. Parallel Distributed Processing: Explorations in the Microstructure of Cognition vol 1, pp. 318–362. MIT Press, Cambridge (1986)CrossRef
36.
Zurück zum Zitat Kennedy, J., Eberhart, R.C.: Particle Swarm Optimization. In: Proceedings of the IEEE International Conference on Neural Networks, pp. 1942–1948. Perth, Australia, 27 November–1 December (1995) Kennedy, J., Eberhart, R.C.: Particle Swarm Optimization. In: Proceedings of the IEEE International Conference on Neural Networks, pp. 1942–1948. Perth, Australia, 27 November–1 December (1995)
37.
Zurück zum Zitat Kennedy, J., Eberhart, R.C.: A discrete binary version of the particle swarm algorithm. In: Proceedings of the IEEE International Conference on Systems, Man & Cybernetics Computational Cybernetics & Simulation, (5), pp. 4104–4108. Orlando, FL, USA, 12–15 October (1997) Kennedy, J., Eberhart, R.C.: A discrete binary version of the particle swarm algorithm. In: Proceedings of the IEEE International Conference on Systems, Man & Cybernetics Computational Cybernetics & Simulation, (5), pp. 4104–4108. Orlando, FL, USA, 12–15 October (1997)
38.
Zurück zum Zitat Yang, X.S., Deb, S., Fong, S.: Accelerated Particle Swarm Optimization and Support Vector Machine for business optimization and applications. Networked Digital Technologies (NDT2011). Commun. Comput. Inf. Sci. 136, 53–66 (2011) Yang, X.S., Deb, S., Fong, S.: Accelerated Particle Swarm Optimization and Support Vector Machine for business optimization and applications. Networked Digital Technologies (NDT2011). Commun. Comput. Inf. Sci. 136, 53–66 (2011)
39.
Zurück zum Zitat Tan, Y., Zhang, J.: Magnifier particle swarm optimization. In: Chiong, R. (ed.) Nature-Inspired Algorithms for Optimisation, pp. 279–298. Springer, Berlin (2009)CrossRef Tan, Y., Zhang, J.: Magnifier particle swarm optimization. In: Chiong, R. (ed.) Nature-Inspired Algorithms for Optimisation, pp. 279–298. Springer, Berlin (2009)CrossRef
41.
Zurück zum Zitat Wahyudi, M., Kristiyanti, D.A.: Sentiment analysis of smartphone product review using support vector machine algorithm-based particle swarm optimization. J. Theor. Appl. Inf. Technol. 91(1), 189–201 (2016) Wahyudi, M., Kristiyanti, D.A.: Sentiment analysis of smartphone product review using support vector machine algorithm-based particle swarm optimization. J. Theor. Appl. Inf. Technol. 91(1), 189–201 (2016)
47.
Zurück zum Zitat Hastie, T., Tibshirani, R.: Generalized Additive Models. Chapman and Hall/CRC, Boca Raton (1990)MATH Hastie, T., Tibshirani, R.: Generalized Additive Models. Chapman and Hall/CRC, Boca Raton (1990)MATH
48.
Zurück zum Zitat Dunteman, G.H., Ho, M.H.R.: Generalized linear models. In: Dobson, A.J., Barnett, A.G. (eds.) An Introduction to Generalized Linear Models, pp. 2–6. SAGE Publications, Thousand Oaks (2011) Dunteman, G.H., Ho, M.H.R.: Generalized linear models. In: Dobson, A.J., Barnett, A.G. (eds.) An Introduction to Generalized Linear Models, pp. 2–6. SAGE Publications, Thousand Oaks (2011)
49.
Zurück zum Zitat Dobson, A.J., Barnett, A.G.: An Introduction to Generalized Linear Models, 3rd edn. CRC Press, Boca Raton (2008)MATH Dobson, A.J., Barnett, A.G.: An Introduction to Generalized Linear Models, 3rd edn. CRC Press, Boca Raton (2008)MATH
51.
Zurück zum Zitat Shah, Y.S., Hernandez-Garcia, L., Jahanian, H., Peltier, S.J.: Support vector machine classification of arterial volume-weighted arterial spin tagging images. Brain Behav. 6, 1–8 (2016)CrossRef Shah, Y.S., Hernandez-Garcia, L., Jahanian, H., Peltier, S.J.: Support vector machine classification of arterial volume-weighted arterial spin tagging images. Brain Behav. 6, 1–8 (2016)CrossRef
52.
Zurück zum Zitat Sun, J., Fujita, H., Chen, P., Li, H.: Dynamic financial distress prediction with concept drift based on time weighting combined with Adaboost support vector machine ensemble. Knowl.-Based Syst. 120, 4–14 (2017)CrossRef Sun, J., Fujita, H., Chen, P., Li, H.: Dynamic financial distress prediction with concept drift based on time weighting combined with Adaboost support vector machine ensemble. Knowl.-Based Syst. 120, 4–14 (2017)CrossRef
53.
Zurück zum Zitat Chinniyan, K., Gangadharan, S., Sabanaikam, K.: Semantic similarity based web document classification using support vector machine. Int. Arab J. Inf. Technol. 14(3), 285–293 (2017) Chinniyan, K., Gangadharan, S., Sabanaikam, K.: Semantic similarity based web document classification using support vector machine. Int. Arab J. Inf. Technol. 14(3), 285–293 (2017)
54.
Zurück zum Zitat Lo, S.L., Chiong, R., Cornforth, D.: Using support vector machine ensembles for target audience classification on Twitter. PLoS ONE 10(4), e0122855 (2015)CrossRef Lo, S.L., Chiong, R., Cornforth, D.: Using support vector machine ensembles for target audience classification on Twitter. PLoS ONE 10(4), e0122855 (2015)CrossRef
55.
Zurück zum Zitat Lo, S.L., Cornforth, D., Chiong, R.: Identifying the high-value social audience from Twitter through text-mining methods. In: Proceedings of the 18th Asia Pacific Symposium on Intelligent and Evolutionary Systems, pp. 325–339. Singapore, 10–12 November (2014) Lo, S.L., Cornforth, D., Chiong, R.: Identifying the high-value social audience from Twitter through text-mining methods. In: Proceedings of the 18th Asia Pacific Symposium on Intelligent and Evolutionary Systems, pp. 325–339. Singapore, 10–12 November (2014)
57.
Zurück zum Zitat Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the 13th International Conferenceon Artificial Intelligence and Statistics, pp. 249–256. Sardinia, Italy, 13–15 May (2010) Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the 13th International Conferenceon Artificial Intelligence and Statistics, pp. 249–256. Sardinia, Italy, 13–15 May (2010)
58.
Zurück zum Zitat Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. CoRR abs/1412.6980 (2014) Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. CoRR abs/1412.6980 (2014)
59.
Zurück zum Zitat Adipranata, R., Budhi, G.S., Setiahadi, B.: Automatic classification of sunspot groups for space weather analysis. Int. J. Multimed. Ubiquitous Eng. 8(3), 41–54 (2013) Adipranata, R., Budhi, G.S., Setiahadi, B.: Automatic classification of sunspot groups for space weather analysis. Int. J. Multimed. Ubiquitous Eng. 8(3), 41–54 (2013)
61.
Zurück zum Zitat Budhi, G.S., Adipranata, R.: Java characters recognition using evolutionary neural network and combination of Chi2 and backpropagation neural network. Int. J. Appl. Eng. Res. 9(22), 18025–18036 (2014) Budhi, G.S., Adipranata, R.: Java characters recognition using evolutionary neural network and combination of Chi2 and backpropagation neural network. Int. J. Appl. Eng. Res. 9(22), 18025–18036 (2014)
62.
Zurück zum Zitat Sangjae, L., Joon, Y.C.: Predicting the helpfulness of online reviews using multilayer perceptron neural networks. Expert Syst. Appl. 41(6), 3041–3046 (2014)CrossRef Sangjae, L., Joon, Y.C.: Predicting the helpfulness of online reviews using multilayer perceptron neural networks. Expert Syst. Appl. 41(6), 3041–3046 (2014)CrossRef
64.
Zurück zum Zitat Hunt, E.B., Marin, J., Stone, P.J.: Experiments in Induction. Academic Press, New York (1966) Hunt, E.B., Marin, J., Stone, P.J.: Experiments in Induction. Academic Press, New York (1966)
65.
Zurück zum Zitat Rokach, L., Maimon, O.: Data Mining with Decision Trees: Theory and Applications. World Scientific Publishing Company, Singapore (2007)CrossRef Rokach, L., Maimon, O.: Data Mining with Decision Trees: Theory and Applications. World Scientific Publishing Company, Singapore (2007)CrossRef
67.
Zurück zum Zitat Xu, Z., Li, P., Wang, Y.: Text classifier based on an improved SVM decision tree. Phys. Procedia 33, 1986–1991 (2012)CrossRef Xu, Z., Li, P., Wang, Y.: Text classifier based on an improved SVM decision tree. Phys. Procedia 33, 1986–1991 (2012)CrossRef
68.
Zurück zum Zitat Abhishek, S., Sugumaran, V., Devasenapati, S.B.: Misfire detection in an IC engine using vibration signal and decision tree algorithms. Measurement 50, 370–380 (2014)CrossRef Abhishek, S., Sugumaran, V., Devasenapati, S.B.: Misfire detection in an IC engine using vibration signal and decision tree algorithms. Measurement 50, 370–380 (2014)CrossRef
69.
Zurück zum Zitat Izydorczyk, B., Wojciechowski, B.: Differential diagnosis of eating disorders with the use of classification trees (decision algorithm). Arch. Psychiatry Psychother. 18(4), 53–62 (2016)CrossRef Izydorczyk, B., Wojciechowski, B.: Differential diagnosis of eating disorders with the use of classification trees (decision algorithm). Arch. Psychiatry Psychother. 18(4), 53–62 (2016)CrossRef
71.
Zurück zum Zitat Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63(1), 3–42 (2006)CrossRef Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63(1), 3–42 (2006)CrossRef
72.
Zurück zum Zitat Bramer, M.: Nearest neighbour classification. In: Principles of Data Mining. pp. 31–38. Springer, London (2007) Bramer, M.: Nearest neighbour classification. In: Principles of Data Mining. pp. 31–38. Springer, London (2007)
73.
Zurück zum Zitat Pan, Z., Wang, Y., Ku, W.: A new general nearest neighbor classification based on the mutual neighborhood information. Knowl.-Based Syst. 121, 142–152 (2017)CrossRef Pan, Z., Wang, Y., Ku, W.: A new general nearest neighbor classification based on the mutual neighborhood information. Knowl.-Based Syst. 121, 142–152 (2017)CrossRef
74.
Zurück zum Zitat Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)MathSciNetCrossRef Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)MathSciNetCrossRef
75.
Metadaten
Titel
Multi-level particle swarm optimisation and its parallel version for parameter optimisation of ensemble models: a case of sentiment polarity prediction
verfasst von
Gregorius Satia Budhi
Raymond Chiong
Sandeep Dhakal
Publikationsdatum
06.07.2020
Verlag
Springer US
Erschienen in
Cluster Computing / Ausgabe 4/2020
Print ISSN: 1386-7857
Elektronische ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-020-03093-3

Weitere Artikel der Ausgabe 4/2020

Cluster Computing 4/2020 Zur Ausgabe

Premium Partner