Skip to main navigation Skip to search Skip to main content

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

  • Wesley A. Suttle
  • , Vipul K. Sharma
  • , Krishna C. Kosaraju
  • , S. Sivaranjani
  • , Ji Liu
  • , Vijay Gupta
  • , Brian M. Sadler
  • U.S. Army Research Laboratory
  • Purdue University
  • Clemson University

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

We develop provably safe and convergent reinforcement learning (RL) algorithms for control of nonlinear dynamical systems, bridging the gap between the hard safety guarantees of control theory and the convergence guarantees of RL theory. Recent advances at the intersection of control and RL follow a two-stage, safety filter approach to enforcing hard safety constraints: model-free RL is used to learn a potentially unsafe controller, whose actions are projected onto safe sets prescribed, for example, by a control barrier function. Though safe, such approaches lose any convergence guarantees enjoyed by the underlying RL methods. In this paper, we develop a single-stage, sampling-based approach to hard constraint satisfaction that learns RL controllers enjoying classical convergence guarantees while satisfying hard safety constraints throughout training and deployment. We validate the efficacy of our approach in simulation, including safe control of a quadcopter in a challenging obstacle avoidance problem, and demonstrate that it outperforms existing benchmarks.

Original languageEnglish
Pages (from-to)4420-4428
Number of pages9
JournalProceedings of Machine Learning Research
Volume238
StatePublished - 2024
Event27th International Conference on Artificial Intelligence and Statistics, AISTATS 2024 - Valencia, Spain
Duration: May 2 2024May 4 2024

Fingerprint

Dive into the research topics of 'Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems'. Together they form a unique fingerprint.

Cite this