Skip to main content
Erschienen in: Autonomous Robots 2/2019

19.07.2018

Robust direct trajectory optimization using approximate invariant funnels

verfasst von: Zachary Manchester, Scott Kuindersma

Erschienen in: Autonomous Robots | Ausgabe 2/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Many critical robotics applications require robustness to disturbances arising from unplanned forces, state uncertainty, and model errors. Motion planning algorithms that explicitly reason about robustness require a coupling of trajectory optimization and feedback design, where the system’s closed-loop response to disturbances is optimized. Due to the often-heavy computational demands of solving such problems, the practical application of robust trajectory optimization in robotics has so far been limited. Motivated by recent work on sums-of-squares verification methods for nonlinear systems, we derive a scalable robust trajectory optimization algorithm that optimizes approximate invariant funnels along the trajectory while planning. For the case of ellipsoidal disturbance sets and LQR feedback controllers, the state and input deviations along a nominal trajectory can be computed locally in closed form, permitting fast evaluation of robust cost and constraint functions and their derivatives. The resulting algorithm is a scalable extension of classical direct transcription that demonstrably improves tracking performance over non-robust formulations while incurring only a modest increase in computational cost. We evaluate the algorithm in several simulated robot control tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Betts, J. T. (1998). Survey of numerical methods for trajectory optimization. Journal of Guidance, Control, and Dynamics, 21(2), 193–207.CrossRefMATH Betts, J. T. (1998). Survey of numerical methods for trajectory optimization. Journal of Guidance, Control, and Dynamics, 21(2), 193–207.CrossRefMATH
Zurück zum Zitat Betts, J. T. (2001). Practical methods for optimal control using nonlinear programming, volume 3 of advances in design and control. Philadelphia, PA: Society for Industrial and Applied Mathematics (SIAM). Betts, J. T. (2001). Practical methods for optimal control using nonlinear programming, volume 3 of advances in design and control. Philadelphia, PA: Society for Industrial and Applied Mathematics (SIAM).
Zurück zum Zitat Dai, H., & Tedrake, R. (2012). Optimizing robust limit cycles for legged locomotion on unknown terrain (pp. 1207–1213). Dai, H., & Tedrake, R. (2012). Optimizing robust limit cycles for legged locomotion on unknown terrain (pp. 1207–1213).
Zurück zum Zitat Dai, H., & Tedrake, R. (2013). L2-gain optimization for robust bipedal walking on unknown terrain. In Proceedings of the IEEE international conference on robotics and automation (ICRA). Dai, H., & Tedrake, R. (2013). L2-gain optimization for robust bipedal walking on unknown terrain. In Proceedings of the IEEE international conference on robotics and automation (ICRA).
Zurück zum Zitat Deisenroth, M. P., & Rasmussen, C. E. (2011). PILCO: A model-based and data-efficient approach to policy search. In Proceedings of the 28th international conference on machine learning, Bellevue, WA. Deisenroth, M. P., & Rasmussen, C. E. (2011). PILCO: A model-based and data-efficient approach to policy search. In Proceedings of the 28th international conference on machine learning, Bellevue, WA.
Zurück zum Zitat Farshidian, F., & Buchli, J. (2015). Risk sensitive, nonlinear optimal control: Iterative linear exponential-quadratic optimal control with Gaussian noise. arXiv:1512.07173 [cs]. Farshidian, F., & Buchli, J. (2015). Risk sensitive, nonlinear optimal control: Iterative linear exponential-quadratic optimal control with Gaussian noise. arXiv:​1512.​07173 [cs].
Zurück zum Zitat Gill, P. E., Murray, W., & Saunders, M. A. (2005). SNOPT: An SQP algorithm for large-scale constrained optimization. SIAM Review, 47(1), 99–131.MathSciNetCrossRefMATH Gill, P. E., Murray, W., & Saunders, M. A. (2005). SNOPT: An SQP algorithm for large-scale constrained optimization. SIAM Review, 47(1), 99–131.MathSciNetCrossRefMATH
Zurück zum Zitat Hargraves, C. R., & Paris, S. W. (1987). Direct trajectory optimization using nonlinear programming and collocation. Journal of Guidance, 10(4), 338–342.CrossRefMATH Hargraves, C. R., & Paris, S. W. (1987). Direct trajectory optimization using nonlinear programming and collocation. Journal of Guidance, 10(4), 338–342.CrossRefMATH
Zurück zum Zitat Jacobson, D. H., & Mayne, D. Q. (1970). Differential dynamic programming. Amsterdam: Elsevier.MATH Jacobson, D. H., & Mayne, D. Q. (1970). Differential dynamic programming. Amsterdam: Elsevier.MATH
Zurück zum Zitat Julier, S. J., & Uhlmann, J. K. (2004). Unscented filtering and nonlinear estimation. Proceedings of the IEEE, 92(3), 401–422.CrossRef Julier, S. J., & Uhlmann, J. K. (2004). Unscented filtering and nonlinear estimation. Proceedings of the IEEE, 92(3), 401–422.CrossRef
Zurück zum Zitat Julius, A. A., & Pappas, G. J. (2009). Trajectory based verification using local finite-time invariance. In Hybrid systems: Computation and control, lecture notes in computer science (pp. 223–236). Berlin: Springer. https://doi.org/10.1007/978-3-642-00602-9_16. ISBN 978-3-642-00601-2, 978-3-642-00602-9. Julius, A. A., & Pappas, G. J. (2009). Trajectory based verification using local finite-time invariance. In Hybrid systems: Computation and control, lecture notes in computer science (pp. 223–236). Berlin: Springer. https://​doi.​org/​10.​1007/​978-3-642-00602-9_​16. ISBN 978-3-642-00601-2, 978-3-642-00602-9.
Zurück zum Zitat Kavraki, L. E., Svestka, P., Latombe, J. C., & Overmars, M. H. (1996). Probabilistic roadmaps for path planning in high-dimensional configuration spaces. IEEE Transactions on Robotics and Automation, 12(4), 566–580.CrossRef Kavraki, L. E., Svestka, P., Latombe, J. C., & Overmars, M. H. (1996). Probabilistic roadmaps for path planning in high-dimensional configuration spaces. IEEE Transactions on Robotics and Automation, 12(4), 566–580.CrossRef
Zurück zum Zitat Kuffner Jr, J. J., & LaValle, S. M. (2000). RRT-connect: An efficient approach to single-query path planning. In Proceedings of the IEEE international conference on robotics and automation. Kuffner Jr, J. J., & LaValle, S. M. (2000). RRT-connect: An efficient approach to single-query path planning. In Proceedings of the IEEE international conference on robotics and automation.
Zurück zum Zitat Kuindersma, S., Grupen, R., & Barto, A. (2013). Variable risk control via stochastic optimization. International Journal of Robotics Research, 32(7), 806–825.CrossRef Kuindersma, S., Grupen, R., & Barto, A. (2013). Variable risk control via stochastic optimization. International Journal of Robotics Research, 32(7), 806–825.CrossRef
Zurück zum Zitat Lou, J., & Hauser, K. (2015). Robust trajectory optimization under frictional contact with iterative learning. In Robotics science and systems (RSS). Lou, J., & Hauser, K. (2015). Robust trajectory optimization under frictional contact with iterative learning. In Robotics science and systems (RSS).
Zurück zum Zitat Magni, L., De Nicolao, G., Scattolini, R., & Allgöwer, F. (2003). Robust model predictive control for nonlinear discrete-time systems. International Journal of Robust and Nonlinear Control, 13(3–4), 229–246. https://doi.org/10.1002/rnc.815. ISSN 1099-1239. Magni, L., De Nicolao, G., Scattolini, R., & Allgöwer, F. (2003). Robust model predictive control for nonlinear discrete-time systems. International Journal of Robust and Nonlinear Control, 13(3–4), 229–246. https://​doi.​org/​10.​1002/​rnc.​815. ISSN 1099-1239.
Zurück zum Zitat Majumdar, A., & Tedrake, R. (2013). Robust online motion planning with regions of finite time invariance. In Algorithmic foundations of robotics X (pp. 543–558). Berlin: Springer. Majumdar, A., & Tedrake, R. (2013). Robust online motion planning with regions of finite time invariance. In Algorithmic foundations of robotics X (pp. 543–558). Berlin: Springer.
Zurück zum Zitat Manchester, Z., & Kuindersma, S. (2016). Derivative-free trajectory optimization with unscented dynamic programming. In Proceedings of the 55th conference on decision and control (CDC), Las Vegas, NV. Manchester, Z., & Kuindersma, S. (2016). Derivative-free trajectory optimization with unscented dynamic programming. In Proceedings of the 55th conference on decision and control (CDC), Las Vegas, NV.
Zurück zum Zitat Mayne, D. Q., & Kerrigan, E. C. (2007). Tube-based robust nonlinear model predictive control. In Proceedings of the 7th IFAC symposium on nonlinear control systems, Pretoria (pp. 110–115). Mayne, D. Q., & Kerrigan, E. C. (2007). Tube-based robust nonlinear model predictive control. In Proceedings of the 7th IFAC symposium on nonlinear control systems, Pretoria (pp. 110–115).
Zurück zum Zitat Moore, J., & Tedrake, R. (2014). Adaptive control design for underactuated systems using sums-of-squares optimization. In Proceedings of the 2014 American control conference (ACC). Moore, J., & Tedrake, R. (2014). Adaptive control design for underactuated systems using sums-of-squares optimization. In Proceedings of the 2014 American control conference (ACC).
Zurück zum Zitat Mordatch, I., Lowrey, K., & Todorov, E. (2015). Ensemble-CIO: Full-body dynamic motion planning that transfers to physical humanoids. In Proceedings of the international conference on robotics and automation (ICRA). Mordatch, I., Lowrey, K., & Todorov, E. (2015). Ensemble-CIO: Full-body dynamic motion planning that transfers to physical humanoids. In Proceedings of the international conference on robotics and automation (ICRA).
Zurück zum Zitat Morimoto, J., Zeglin, G., & Atkeson, C. G. (2003). Minimax differential dynamic programming: Application to a biped walking robot. In Proceedings of the 2003 IEEE/RSJ international conference on intelligent robots and systems. Morimoto, J., Zeglin, G., & Atkeson, C. G. (2003). Minimax differential dynamic programming: Application to a biped walking robot. In Proceedings of the 2003 IEEE/RSJ international conference on intelligent robots and systems.
Zurück zum Zitat Nocedal, J., & Wright, S. J. (2006). Numerical optimization (2nd ed.). Berlin: Springer.MATH Nocedal, J., & Wright, S. J. (2006). Numerical optimization (2nd ed.). Berlin: Springer.MATH
Zurück zum Zitat Pan, Y., Theodorou, E., & Bakshi, K. (2015). Robust trajectory optimization: A cooperative stochastic game theoretic approach. In Proceedings of robotics: Science and systems, Rome. Pan, Y., Theodorou, E., & Bakshi, K. (2015). Robust trajectory optimization: A cooperative stochastic game theoretic approach. In Proceedings of robotics: Science and systems, Rome.
Zurück zum Zitat Parillo, P. (2000). Structured semidefinite programs and semialgebraic geometry methods in robustness and optimization. Ph.D. thesis, California Institute of Technology. Parillo, P. (2000). Structured semidefinite programs and semialgebraic geometry methods in robustness and optimization. Ph.D. thesis, California Institute of Technology.
Zurück zum Zitat Plancher, B., Manchester, Z., & Kuindersma, S. (2017). Constrained unscented dynamic programming. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS), Vancouver, BC. Plancher, B., Manchester, Z., & Kuindersma, S. (2017). Constrained unscented dynamic programming. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS), Vancouver, BC.
Zurück zum Zitat Platt, R., Kaelbling, L., Lozano-Perez, T., & Tedrake, R. (2012). Non-Gaussian belief space planning: Correctness and complexity. In Proceedings of the international conference on robotics and automation (ICRA), St. Paul, MN. Platt, R., Kaelbling, L., Lozano-Perez, T., & Tedrake, R. (2012). Non-Gaussian belief space planning: Correctness and complexity. In Proceedings of the international conference on robotics and automation (ICRA), St. Paul, MN.
Zurück zum Zitat Posa, M., Cantu, C., & Tedrake, R. (2014). A direct method for trajectory optimization of rigid bodies through contact. International Journal of Robotics Research, 33(1), 69–81.CrossRef Posa, M., Cantu, C., & Tedrake, R. (2014). A direct method for trajectory optimization of rigid bodies through contact. International Journal of Robotics Research, 33(1), 69–81.CrossRef
Zurück zum Zitat Posa, M., Kuindersma, S., & Tedrake, R. (2016). Optimization and stabilization of trajectories for constrained dynamical systems. In Proceedings of the international conference on robotics and automation (ICRA) (pp. 1366–1373), Stockholm. IEEE. Posa, M., Kuindersma, S., & Tedrake, R. (2016). Optimization and stabilization of trajectories for constrained dynamical systems. In Proceedings of the international conference on robotics and automation (ICRA) (pp. 1366–1373), Stockholm. IEEE.
Zurück zum Zitat Ratliff, N., Zucker, M., Bagnell, J. A., & Srinivasa, S. (2009). CHOMP: Gradient optimization techniques for efficient motion planning. In Proceedings of the international conference on robotics and automation (ICRA). Ratliff, N., Zucker, M., Bagnell, J. A., & Srinivasa, S. (2009). CHOMP: Gradient optimization techniques for efficient motion planning. In Proceedings of the international conference on robotics and automation (ICRA).
Zurück zum Zitat Schulman, J., Duan, Y., Ho, J., Lee, A., Awwal, I., Bradlow, H., et al. (2014). Motion planning with sequential convex optimization and convex collision checking. The International Journal of Robotics Research, 33(9), 1251–1270. https://doi.org/10.1177/0278364914528132. ISSN 0278-3649, 1741-3176. Schulman, J., Duan, Y., Ho, J., Lee, A., Awwal, I., Bradlow, H., et al. (2014). Motion planning with sequential convex optimization and convex collision checking. The International Journal of Robotics Research, 33(9), 1251–1270. https://​doi.​org/​10.​1177/​0278364914528132​. ISSN 0278-3649, 1741-3176.
Zurück zum Zitat Tassa, Y., Erez, T., & Todorov, E. (2012). Synthesis and stabilization of complex behaviors through online trajectory optimization. In IEEE/RSJ international conference on intelligent robots and systems. Tassa, Y., Erez, T., & Todorov, E. (2012). Synthesis and stabilization of complex behaviors through online trajectory optimization. In IEEE/RSJ international conference on intelligent robots and systems.
Zurück zum Zitat Tedrake, R., Manchester, I. R., Tobenkin, M. M., & Roberts, J. W. (2010). LQR-Trees: Feedback motion planning via sums of squares verification. International Journal of Robotics Research, 29, 1038–1052.CrossRef Tedrake, R., Manchester, I. R., Tobenkin, M. M., & Roberts, J. W. (2010). LQR-Trees: Feedback motion planning via sums of squares verification. International Journal of Robotics Research, 29, 1038–1052.CrossRef
Zurück zum Zitat Tobenkin, M., Manchester, I., & Tedrake, R. (2011). Invariant funnels around trajectories using sums-of-squares programming. In Proceedings of the 18th IFAC World Congress, Milan. Tobenkin, M., Manchester, I., & Tedrake, R. (2011). Invariant funnels around trajectories using sums-of-squares programming. In Proceedings of the 18th IFAC World Congress, Milan.
Zurück zum Zitat van den Berg, J., Abbeel, P., & Goldberg, K. (2011). LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information. The International Journal of Robotics Research, 30(7), 895–913. https://doi.org/10.1177/0278364911406562. ISSN 0278-3649. van den Berg, J., Abbeel, P., & Goldberg, K. (2011). LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information. The International Journal of Robotics Research, 30(7), 895–913. https://​doi.​org/​10.​1177/​0278364911406562​. ISSN 0278-3649.
Zurück zum Zitat van den Broek, B., Wiegerinck, W., & Kappen, B. (2010). Risk sensitive path integral control. In Proceedings of the 26th conference on uncertainty in artificial intelligence (UAI) (pp. 615–622). van den Broek, B., Wiegerinck, W., & Kappen, B. (2010). Risk sensitive path integral control. In Proceedings of the 26th conference on uncertainty in artificial intelligence (UAI) (pp. 615–622).
Zurück zum Zitat Zhou, K. (1996). Robust and optimal control. Upper Saddle River, NJ: Prentice Hall. ISBN 978-0-13-456567-5. Zhou, K. (1996). Robust and optimal control. Upper Saddle River, NJ: Prentice Hall. ISBN 978-0-13-456567-5.
Metadaten
Titel
Robust direct trajectory optimization using approximate invariant funnels
verfasst von
Zachary Manchester
Scott Kuindersma
Publikationsdatum
19.07.2018
Verlag
Springer US
Erschienen in
Autonomous Robots / Ausgabe 2/2019
Print ISSN: 0929-5593
Elektronische ISSN: 1573-7527
DOI
https://doi.org/10.1007/s10514-018-9779-5

Weitere Artikel der Ausgabe 2/2019

Autonomous Robots 2/2019 Zur Ausgabe

Neuer Inhalt