Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Samuelides, M. (2005). Closed-Loop Control Learning. In: Neural Networks. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-28847-3_5
DOI: https://doi.org/10.1007/3-540-28847-3_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22980-3
Online ISBN: 978-3-540-28847-3
eBook Packages: Physics and Astronomy (R0)