Skip to main content

Closed-Loop Control Learning

  • Chapter
Neural Networks
  • 5442 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Anderson B.D.O., Moore J.B. [1979], Optimal Filtering, Prentice Hall

    Google Scholar 

  2. Azencott R., Dacunha-Castelle D. [1984], Séries d’observations irrégulières. Modélisation et prévision, Masson

    Google Scholar 

  3. Barto A.G., Sutton R.S., Anderson C.W. [1983], Neuron-like elements than can solve difficult learning control problemes, IEEE Trans. On Systems, Man and Cybernetics, 13, pp 835–846

    Google Scholar 

  4. Benveniste A., Métivier M., Priouret P. [1987], Algorithmes adaptatifs et approximations stochastiques. Théorie et application à l’identification, au traitement du signal et à la reconnaissance des formes, Masson

    Google Scholar 

  5. Bengio Y., Simard P., Frasconi F. [1994], Learning long term dependencies with gradient descent is difficult, IEEE Trans. on Neural Networks, 5, pp 157–166

    Google Scholar 

  6. Bertsekas D.P., Tsitsiklis J.N. [1996], Neuro-dynamic programming, Athena Scientific, Belmont, MA

    Google Scholar 

  7. Chatfield C. [1994], The Analysis of Time series, an Introduction, Chapman & Hall

    Google Scholar 

  8. Demailly J.-P. [1991], Analyse numérique et équations différentielles, Presses universitaires de Grenoble

    Google Scholar 

  9. Doya K. [2000], Reinforcement learning in continuous time and space, Neural computation, pp 219–244

    Google Scholar 

  10. Duflo M. [1996], Algorithmes stochastiques, Springer

    Google Scholar 

  11. Dutech A. [1999], Apprentissage d’environnements: approches cognitive et comportementale, thèse de doctorat de l’École nationale supérieure de l’aéronautique et de l’espace

    Google Scholar 

  12. Duvaut P. [1994], Traitement du signal: concepts et applications, Hermès

    Google Scholar 

  13. Elman J.L. [1990], Finding structure in time, Cognitive Science, 14, pp 1179–211

    Google Scholar 

  14. Grondin B. [1994], Les réseaux de neurones pour la modélisation et la conduite des réacteurs chimiques: simulations et expérimentations, thèse de doctorat de l’Université de Bordeaux I

    Google Scholar 

  15. Haykin S. [1996], Adaptive Filter Theory, Prentice Hall

    Google Scholar 

  16. Haykin S. [1999], Neural Networks: a comprehensive foundation, Prentice Hall

    Google Scholar 

  17. Hopfield J.J. [1982], Neural networks and physical systems with emergent collective computational abilities, Proceedings of the National Academy of Sciences, États-Unis, 79, pp 2554–2558

    Google Scholar 

  18. Isermann R., Lachmann K.H., Matko D. [1992], Adaptive Control Systems, Prentice Hall

    Google Scholar 

  19. Jazwinsky A.H. [1970], Stochastic Processes and Filtering Theory, Academic Press

    Google Scholar 

  20. Kirkpatrick S., Gelatt C.D., Vecchi M.P. [1983], Optimization by simulated annealing, Science, 220, pp 671–680

    Google Scholar 

  21. Kushner K.H.J., Clark D.S. [1978] Stochastic Approximation Method for constrained and unconstrained Systems, Applied Mathematical Sciences, 26, Springer-Verlag

    Google Scholar 

  22. Kwakernaak H., Sivan R. [1972], Linear Optimal Control Systems, Wiley

    Google Scholar 

  23. Gouriéroux C., Monfort A. [1995], Séries temporelles et modèles dynamiques, Economica

    Google Scholar 

  24. Landau I.D., Dugard L. [1986], Commande adaptative, aspects pratiques et théoriques, Masson

    Google Scholar 

  25. Landau I.D. [1993], Identification et commande des systèmes, Hermès

    Google Scholar 

  26. Levin A.U., Narendra K.S. [1993], Control of nonlinear dynamical systems using neural networks, IEEE Transactions on neural networks, 4.2, pp 192–207

    Google Scholar 

  27. Levin A.U., Narendra K.S. [1997], Identification of nonlinear dynamical systems using neural networks in Neural Systems for Control, O. Omivar, D.L. Elliott, éd., Academic Press, pp 129–160

    Google Scholar 

  28. Lion M. [2000], Filtrage adaptatif par réseaux neuronaux, application à la trajectographie, thèse de doctorat de l’École nationale supérieure de l’aéronautique et de l’espace

    Google Scholar 

  29. Ljung L., Söderstrom T. [1983], Theory and Practice of Recursive Identification, MIT Press

    Google Scholar 

  30. Ljung L., Sjoberg J., Hjalmarsson H. [1996], On neural network model structures in system identification, in Identification, Adaptation, Learning. The science of learning models from data, S. Bittanti, G. Pici, é d., NATO ASI Series, Springer

    Google Scholar 

  31. Nerrand O., Roussel-Ragot P., Personnaz L., Dreyfus G. [1993], Neural networks and nonlinear adaptive filtering: unifying concepts and new algorithms, Neural Computation, 5, pp 165–199

    Google Scholar 

  32. Nerrand O., Roussel-Ragot P., Urbani D., Personnaz L., Dreyfus G. [1994], Training recurrent neural networks: why and how? An illustration in dynamical processes modeling, IEEE Transactions on neural networks, 5.2, pp 178–184

    Google Scholar 

  33. Norgaard M., Ravn O., Poulsen N.K., Hansen L.K. [2000], Neural Networks for Modelling and Control of Dynamical Systems, Springer

    Google Scholar 

  34. Puskorius G.V., Feldkamp L.A. [1994], Neurocontrol of nonlinear dynamical systems with Kalman filter-trained recurrent networks, IEEE Transactions on Neural Networks, vol. 5, pp 279–297

    Google Scholar 

  35. Rivals I. [1995], Modélisation et commande de processus par réseaux de neurones; application au pilotage d’un véhicule autonome, thèse de doctorat de l’Université Pierre et Marie-Curie, Paris VI

    Google Scholar 

  36. Rivals I., Personnaz L. [2000], Nonlinear Internal Model Control Using Neural Networks, IEEE Transactions on Neural Networks, vol. 11, pp 80–90

    Google Scholar 

  37. Singh S.P., Jaakkola T., Jordan M. [1995], Learning without state estimation in a partially observable Markov decision problems, Proceedings of the 11th Machine Learning conference

    Google Scholar 

  38. Slotine J.J.E., Li W. [1991], Applied Nonlinear Control, Prentice Hall

    Google Scholar 

  39. Slotine J.J.E., Sanner R.M. [1993], Neural Networks for Adaptive Control and Recursive Identification: A Theoretical Framework, in Essays on Control, H.L. Trentelman, J.C. Willems, éd., Birkhauser, pp 381–435

    Google Scholar 

  40. Sontag E.D. [1990], Mathematic Control Theory. Deterministic finite dimensional systems, Springer Verlag

    Google Scholar 

  41. Sontag E.D. [1996], Recurrent Neural Networks: Some Systems-Theoretic Aspects, Dept. of Mathematics, Rutgers University, NB, États-Unis

    Google Scholar 

  42. Sutton R.S. [1988], Learning to predict by the method of temporal differences, Machine Learning, 3, pp 9–44

    Google Scholar 

  43. Thrun S.B. [1992], The role of exploration in learning control, in Handbook of intelligent control, D.A. White, D.A. Sofge, éd., pp 527–559, Van Nostrand

    Google Scholar 

  44. Tong H. [1995], Nonlinear Time Series, a dynamical system approach, Clarendon Press

    Google Scholar 

  45. Urbani D., Roussel-Ragot P., Personnaz L., Dreyfus G. [1993], The selection of nonlinear dynamical systems by statistical tests, Neural Networks for Signal Processing, 4, pp 229–237

    Google Scholar 

  46. Watkins C.J.C.H., Dayan P. [1992] Q-learning, Machine Learning, 8, pp 279–292

    Google Scholar 

  47. Williams, R.J., Zipser, D. [1989], “A learning algorithm for continully runnig fully recurrent neural networks”, Neural Computation, pp. 270–280

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Samuelides, M. (2005). Closed-Loop Control Learning. In: Neural Networks. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-28847-3_5

Download citation

Publish with us

Policies and ethics