Skip to main content
Erschienen in: Soft Computing 23/2020

10.06.2020 | Methodologies and Application

Global-best optimization of ANN trained by PSO using the non-extensive cross-entropy with Gaussian gain

verfasst von: Seba Susan, Rohit Ranjan, Udyant Taluja, Shivang Rai, Pranav Agarwal

Erschienen in: Soft Computing | Ausgabe 23/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

An efficient optimization of network weights has been the primary goal of the artificial neural network (ANN) research community since decades. The aim of every optimization problem is to minimize the network cost which is some form of error function between the desired and the actual network outputs, during the training phase. The conventional gradient-based optimization algorithms like backpropagation are likely to get trapped in local minima and are sensitive to choices of initial weights. The evolutionary algorithms have proved their usefulness in introducing randomness into the optimization procedure, since they work on a global search strategy and induce a globally minimum solution for the network weights. In this paper, we particularly focus on ANN trained by Particle Swarm Optimization (ANN-PSO), in which the local-best and global-best particle positions represent possible solutions to the set of network weights. The global-best position of the swarm, which corresponds to the minimum cost function over time, is determined in our work by minimizing a new non-extensive cross-entropy error cost function. The non-extensive cross-entropy is derived from the non-extensive entropy with Gaussian gain that has proven to give minimum values for regular textures containing periodic information represented by uneven probability distributions. The new cross-entropy is defined, and its utility for optimizing the network weights to a globally minimum solution is analyzed in this paper. Extensive experimentation on two different versions: the baseline ANN-PSO and one of its most recent variants IOPSO-BPA, on benchmark datasets from the UCI repository, with comparisons to the state of the art, validates the efficacy of our method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Anand A, Suganthi L (2017) Forecasting of electricity demand by hybrid ANN-PSO models. Int J Energy Optim Eng 6(4):66–83 Anand A, Suganthi L (2017) Forecasting of electricity demand by hybrid ANN-PSO models. Int J Energy Optim Eng 6(4):66–83
Zurück zum Zitat Brescia M, Cavuoti S, D'Angelo G, D'Abrusco R, Deniskina N, Garofalo M, Skordovski B (2008) The VO-Neural project: recent developments and some applications. arXiv preprint. arXiv:0806.1006 Brescia M, Cavuoti S, D'Angelo G, D'Abrusco R, Deniskina N, Garofalo M, Skordovski B (2008) The VO-Neural project: recent developments and some applications. arXiv preprint. arXiv:0806.1006
Zurück zum Zitat Chen D, Cao X, Wen F, Sun J (2013) Blessing of dimensionality: high-dimensional feature and its efficient compression for face verification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3025–3032 Chen D, Cao X, Wen F, Sun J (2013) Blessing of dimensionality: high-dimensional feature and its efficient compression for face verification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3025–3032
Zurück zum Zitat Chen Z, Zhan Z, Shi W, Chen W, Zhang J (2016) When neural network computation meets evolutionary computation: a survey. In: International symposium on neural networks, Springer, Cham, pp 603–612 Chen Z, Zhan Z, Shi W, Chen W, Zhang J (2016) When neural network computation meets evolutionary computation: a survey. In: International symposium on neural networks, Springer, Cham, pp 603–612
Zurück zum Zitat Chen XL, Fu JP, Yao JL, Gan JF (2018) Prediction of shear strength for squat RC walls using a hybrid ANN–PSO model. Eng Comput 34(2):367–383CrossRef Chen XL, Fu JP, Yao JL, Gan JF (2018) Prediction of shear strength for squat RC walls using a hybrid ANN–PSO model. Eng Comput 34(2):367–383CrossRef
Zurück zum Zitat Dorigo M, Birattari M, Stutzle T (2006) Ant colony optimization. IEEE Comput Intell Mag 1(4):28–39CrossRef Dorigo M, Birattari M, Stutzle T (2006) Ant colony optimization. IEEE Comput Intell Mag 1(4):28–39CrossRef
Zurück zum Zitat Dunne RA, Campbell NA (1997) On the pairing of the softmax activation and cross-entropy penalty functions and the derivation of the softmax activation function. In Proceedings of the 8th Australian conference on the neural networks, Melbourne, vol 181, p 185 Dunne RA, Campbell NA (1997) On the pairing of the softmax activation and cross-entropy penalty functions and the derivation of the softmax activation function. In Proceedings of the 8th Australian conference on the neural networks, Melbourne, vol 181, p 185
Zurück zum Zitat Gharghan SK, Nordin R, Ismail M, Ali JA (2016) Accurate wireless sensor localization technique based on hybrid PSO-ANN algorithm for indoor and outdoor track cycling. IEEE Sens J 16(2):529–541CrossRef Gharghan SK, Nordin R, Ismail M, Ali JA (2016) Accurate wireless sensor localization technique based on hybrid PSO-ANN algorithm for indoor and outdoor track cycling. IEEE Sens J 16(2):529–541CrossRef
Zurück zum Zitat Goldberg DE, Holland JH (1988) Genetic algorithms and machine learning. Mach Learn 3(2):95–99CrossRef Goldberg DE, Holland JH (1988) Genetic algorithms and machine learning. Mach Learn 3(2):95–99CrossRef
Zurück zum Zitat Hajihassani M, Kalatehjari R, Marto A, Mohamad H, Khosrotash M (2020) 3D prediction of tunneling-induced ground movements based on a hybrid ANN and empirical methods. Eng Comput 36(1):251–269CrossRef Hajihassani M, Kalatehjari R, Marto A, Mohamad H, Khosrotash M (2020) 3D prediction of tunneling-induced ground movements based on a hybrid ANN and empirical methods. Eng Comput 36(1):251–269CrossRef
Zurück zum Zitat Kennedy J, Eberhart R (1995) Particle swarm optimization (PSO). In Proceedings of IEEE international conference on neural networks, Perth, Australia, pp 1942–1948 Kennedy J, Eberhart R (1995) Particle swarm optimization (PSO). In Proceedings of IEEE international conference on neural networks, Perth, Australia, pp 1942–1948
Zurück zum Zitat Khan JA, Raja MAZ, Rashidi MM, Syam MI, Wazwaz AM (2015) Nature-inspired computing approach for solving non-linear singular Emden-Fowler problem arising in electromagnetic theory. Connect Sci 27(4):377–396CrossRef Khan JA, Raja MAZ, Rashidi MM, Syam MI, Wazwaz AM (2015) Nature-inspired computing approach for solving non-linear singular Emden-Fowler problem arising in electromagnetic theory. Connect Sci 27(4):377–396CrossRef
Zurück zum Zitat Kiranyaz S, Ince T, Yildirim A, Gabbouj M (2009) Evolutionary artificial neural networks by multi-dimensional particle swarm optimization. Neural Netw 22(10):1448–1462CrossRef Kiranyaz S, Ince T, Yildirim A, Gabbouj M (2009) Evolutionary artificial neural networks by multi-dimensional particle swarm optimization. Neural Netw 22(10):1448–1462CrossRef
Zurück zum Zitat Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE (2017) A survey of deep neural network architectures and their applications. Neurocomputing 234:11–26CrossRef Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE (2017) A survey of deep neural network architectures and their applications. Neurocomputing 234:11–26CrossRef
Zurück zum Zitat Mendes R, Cortez P, Rocha M, Neves J (2002) Particle swarms for feedforward neural network training. In: Proceedings of the 2002 international joint conference on neural networks. IJCNN'02 (Cat. No. 02CH37290), vol 2, pp 1895–1899 Mendes R, Cortez P, Rocha M, Neves J (2002) Particle swarms for feedforward neural network training. In: Proceedings of the 2002 international joint conference on neural networks. IJCNN'02 (Cat. No. 02CH37290), vol 2, pp 1895–1899
Zurück zum Zitat Montana DJ, Davis L (1989) Training feedforward neural networks using genetic algorithms. IJCAI 89:762–767MATH Montana DJ, Davis L (1989) Training feedforward neural networks using genetic algorithms. IJCAI 89:762–767MATH
Zurück zum Zitat Nielsen MA (2015) Neural networks and deep learning, vol 25. Determination press, San Francisco Nielsen MA (2015) Neural networks and deep learning, vol 25. Determination press, San Francisco
Zurück zum Zitat Raja MAZ, Shah FH, Alaidarous ES, Syam MI (2017) Design of bio-inspired heuristic technique integrated with interior-point algorithm to analyze the dynamics of heartbeat model. Appl Soft Comput 52:605–629CrossRef Raja MAZ, Shah FH, Alaidarous ES, Syam MI (2017) Design of bio-inspired heuristic technique integrated with interior-point algorithm to analyze the dynamics of heartbeat model. Appl Soft Comput 52:605–629CrossRef
Zurück zum Zitat Raja MAZ, Shah AA, Mehmood A, Chaudhary NI, Aslam MS (2018) Bio-inspired computational heuristics for parameter estimation of nonlinear Hammerstein controlled autoregressive system. Neural Comput Appl 29(12):1455–1474CrossRef Raja MAZ, Shah AA, Mehmood A, Chaudhary NI, Aslam MS (2018) Bio-inspired computational heuristics for parameter estimation of nonlinear Hammerstein controlled autoregressive system. Neural Comput Appl 29(12):1455–1474CrossRef
Zurück zum Zitat Raza S, Mokhlis H, Arof H, Naidu K, Laghari JA, Khairuddin ASM (2016) Minimum-features-based ANN-PSO approach for islanding detection in distribution system. IET Renew Power Gener 10(9):1255–1263CrossRef Raza S, Mokhlis H, Arof H, Naidu K, Laghari JA, Khairuddin ASM (2016) Minimum-features-based ANN-PSO approach for islanding detection in distribution system. IET Renew Power Gener 10(9):1255–1263CrossRef
Zurück zum Zitat Rumelhart DE, Hinton GE, Williams RJ (1985) Learning internal representations by error propagation (No. ICS-8506). California University, San DiegoCrossRef Rumelhart DE, Hinton GE, Williams RJ (1985) Learning internal representations by error propagation (No. ICS-8506). California University, San DiegoCrossRef
Zurück zum Zitat Socha K, Blum C (2007) An ant colony optimization algorithm for continuous optimization: application to feed-forward neural network training. Neural Comput Appl 16(3):235–247CrossRef Socha K, Blum C (2007) An ant colony optimization algorithm for continuous optimization: application to feed-forward neural network training. Neural Comput Appl 16(3):235–247CrossRef
Zurück zum Zitat Susan S, Dwivedi M (2014) Dynamic growth of hidden-layer neurons using the non-extensive entropy. In: 2014 Fourth international conference on communication systems and network technologies IEEE, pp 491–495 Susan S, Dwivedi M (2014) Dynamic growth of hidden-layer neurons using the non-extensive entropy. In: 2014 Fourth international conference on communication systems and network technologies IEEE, pp 491–495
Zurück zum Zitat Susan S, Hanmandlu M (2013) A non-extensive entropy feature and its application to texture classification. Neurocomputing 120:214–225CrossRef Susan S, Hanmandlu M (2013) A non-extensive entropy feature and its application to texture classification. Neurocomputing 120:214–225CrossRef
Zurück zum Zitat Susan S, Hanmandlu M (2015) Unsupervised detection of nonlinearity in motion using weighted average of non-extensive entropies. SIViP 9(3):511–525CrossRef Susan S, Hanmandlu M (2015) Unsupervised detection of nonlinearity in motion using weighted average of non-extensive entropies. SIViP 9(3):511–525CrossRef
Zurück zum Zitat Susan S, Hanmandlu M (2018) Color texture recognition by color information fusion using the non-extensive entropy. Multidimension Syst Signal Process 29(4):1269–1284CrossRef Susan S, Hanmandlu M (2018) Color texture recognition by color information fusion using the non-extensive entropy. Multidimension Syst Signal Process 29(4):1269–1284CrossRef
Zurück zum Zitat Susan S, Sharma M (2017) Automatic texture defect detection using Gaussian mixture entropy modeling. Neurocomputing 239:232–237CrossRef Susan S, Sharma M (2017) Automatic texture defect detection using Gaussian mixture entropy modeling. Neurocomputing 239:232–237CrossRef
Zurück zum Zitat Susan S, Singh V (2011) On the discriminative power of different feature subsets for handwritten numeral recognition using the box-partitioning method. In: 2011 annual IEEE India conference IEEE, pp 1–5 Susan S, Singh V (2011) On the discriminative power of different feature subsets for handwritten numeral recognition using the box-partitioning method. In: 2011 annual IEEE India conference IEEE, pp 1–5
Zurück zum Zitat Susan S, Sharawat P, Singh S, Meena R, Verma A, Kumar M (2015) Fuzzy C-means with non-extensive entropy regularization. In: 2015 IEEE international conference on signal processing, informatics, communication and energy systems (SPICES) IEEE, pp 1–5 Susan S, Sharawat P, Singh S, Meena R, Verma A, Kumar M (2015) Fuzzy C-means with non-extensive entropy regularization. In: 2015 IEEE international conference on signal processing, informatics, communication and energy systems (SPICES) IEEE, pp 1–5
Zurück zum Zitat Susan S, Ranjan R, Taluja U, Rai S, Agarwal P (2019) Neural net optimization by weight-entropy monitoring. Computational intelligence: theories, applications and future directions-volume II. Springer, Singapore, pp 201–213 Susan S, Ranjan R, Taluja U, Rai S, Agarwal P (2019) Neural net optimization by weight-entropy monitoring. Computational intelligence: theories, applications and future directions-volume II. Springer, Singapore, pp 201–213
Zurück zum Zitat Tizhoosh HR (2005) Opposition-based learning: a new scheme for machine intelligence. In: Computational intelligence for modelling, control and automation, 2005 and international conference on intelligent agents, web technologies and internet commerce, international conference on IEEE, vol 1, pp 695–701 Tizhoosh HR (2005) Opposition-based learning: a new scheme for machine intelligence. In: Computational intelligence for modelling, control and automation, 2005 and international conference on intelligent agents, web technologies and internet commerce, international conference on IEEE, vol 1, pp 695–701
Zurück zum Zitat Yaghini M, Khoshraftar MM, Fallahi M (2011) HIOPGA: a new hybrid metaheuristic algorithm to train feedforward neural networks for Prediction. In: Proceedings of the international conference on data mining, pp 18–21 Yaghini M, Khoshraftar MM, Fallahi M (2011) HIOPGA: a new hybrid metaheuristic algorithm to train feedforward neural networks for Prediction. In: Proceedings of the international conference on data mining, pp 18–21
Zurück zum Zitat Yaghini M, Khoshraftar MM, Fallahi M (2013) A hybrid algorithm for artificial neural network training. Eng Appl Artif Intell 26(1):293–301CrossRef Yaghini M, Khoshraftar MM, Fallahi M (2013) A hybrid algorithm for artificial neural network training. Eng Appl Artif Intell 26(1):293–301CrossRef
Zurück zum Zitat Zameer A, Arshad J, Khan A, Raja MAZ (2017) Intelligent and robust prediction of short term wind power using genetic programming based ensemble of neural networks. Energy Convers Manag 134:361–372CrossRef Zameer A, Arshad J, Khan A, Raja MAZ (2017) Intelligent and robust prediction of short term wind power using genetic programming based ensemble of neural networks. Energy Convers Manag 134:361–372CrossRef
Zurück zum Zitat Zhang GP (2000) Neural networks for classification: a survey. IEEE Trans Syst Man Cybern Part C Appl Rev 30(4):451–462CrossRef Zhang GP (2000) Neural networks for classification: a survey. IEEE Trans Syst Man Cybern Part C Appl Rev 30(4):451–462CrossRef
Zurück zum Zitat Zhang L, Suganthan PN (2016) A survey of randomized algorithms for training neural networks. Inf Sci 364:146–155CrossRef Zhang L, Suganthan PN (2016) A survey of randomized algorithms for training neural networks. Inf Sci 364:146–155CrossRef
Metadaten
Titel
Global-best optimization of ANN trained by PSO using the non-extensive cross-entropy with Gaussian gain
verfasst von
Seba Susan
Rohit Ranjan
Udyant Taluja
Shivang Rai
Pranav Agarwal
Publikationsdatum
10.06.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 23/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-020-05080-7

Weitere Artikel der Ausgabe 23/2020

Soft Computing 23/2020 Zur Ausgabe