Published in: Neural Computing and Applications 3-4/2014

01.09.2014 | Original Article

Advanced learning methods and exponent regularization applied to a high order neural network

Authors: Islam El-Nabarawy, Ashraf M. Abdelbar

Abstract

High order neural networks (HONNs) are neural networks whose neurons combine their inputs non-linearly. The high order network with exponential synaptic links (HONEST) is a HONN that uses neurons with product units and adaptable exponents. This study examines the use of several advanced learning methods to train the HONEST network: resilient propagation, conjugate gradient, scaled conjugate gradient (SCG), and the Levenberg–Marquardt method. Using a collection of 32 widely used benchmark datasets, we compare the mean squared error (MSE) performance of the HONEST network across the four algorithms, in addition to backpropagation, and find that the SCG method produces the best performance to a statistically significant extent. Additionally, we investigate the use of a regularization term in the error function to smooth the magnitudes of the network exponents and nudge the network towards smaller exponents. We find that the use of regularization reduces exponent magnitudes without compromising test set MSE performance.
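To make the setup concrete, the sketch below illustrates, under our own assumptions rather than the paper's exact formulation, a HONEST-style forward pass in which each hidden unit raises its inputs to trainable exponents and multiplies the results, together with an error function that adds an exponent-magnitude penalty to the MSE. The layer sizes, the sigmoid output unit, and the regularization strength lam are illustrative choices, not values taken from the article.

import numpy as np

# Illustrative sketch only: a HONEST-style hidden layer where hidden unit j
# computes prod_i x_i ** P[j, i] with trainable exponents P, followed by a
# weighted sigmoid output, and an error that adds an exponent penalty to MSE.

rng = np.random.default_rng(0)
n_inputs, n_hidden, n_samples = 4, 3, 32

X = rng.uniform(0.1, 1.0, size=(n_samples, n_inputs))  # positive inputs keep x ** p real-valued
t = rng.uniform(size=(n_samples, 1))                    # placeholder targets

P = rng.normal(scale=0.5, size=(n_hidden, n_inputs))    # adaptable exponents
w = rng.normal(scale=0.5, size=(n_hidden, 1))           # hidden-to-output weights
lam = 1e-3                                              # hypothetical regularization strength


def forward(X, P, w):
    # prod_i x_i ** P[j, i] computed as exp(log(X) @ P.T) for convenience
    hidden = np.exp(np.log(X) @ P.T)                    # shape: (n_samples, n_hidden)
    return 1.0 / (1.0 + np.exp(-(hidden @ w)))          # sigmoid output unit


def regularized_error(X, t, P, w, lam):
    mse = np.mean((forward(X, P, w) - t) ** 2)
    return mse + lam * np.sum(P ** 2)                   # penalty nudges exponents towards smaller magnitudes


print(regularized_error(X, t, P, w, lam))

Any gradient-based trainer of the kind compared in the paper (Rprop, conjugate gradient, SCG, Levenberg–Marquardt) could in principle minimize such a regularized error; with this squared-magnitude penalty, its gradient with respect to P is simply 2 * lam * P.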

Metadata
Title
Advanced learning methods and exponent regularization applied to a high order neural network
Authors
Islam El-Nabarawy
Ashraf M. Abdelbar
Publication date
01.09.2014
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 3-4/2014
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-014-1563-7
