Safe Pontryagin Differentiable Programming

06/12/2021

Safe Pontryagin Differentiable Programming

Wanxin Jin, Shaoshuai Mou, George J. Pappas

Keywords: optimization, reinforcement learning and planning

Abstract Paper Similar Papers

Abstract: We propose a Safe Pontryagin Differentiable Programming (Safe PDP) methodology, which establishes a theoretical and algorithmic framework to solve a broad class of safety-critical learning and control tasks---problems that require the guarantee of safety constraint satisfaction at any stage of the learning and control progress. In the spirit of interior-point methods, Safe PDP handles different types of system constraints on states and inputs by incorporating them into the cost or loss through barrier functions. We prove three fundamentals of the proposed Safe PDP: first, both the solution and its gradient in the backward pass can be approximated by solving their more efficient unconstrained counterparts; second, the approximation for both the solution and its gradient can be controlled for arbitrary accuracy by a barrier parameter; and third, importantly, all intermediate results throughout the approximation and optimization strictly respect the constraints, thus guaranteeing safety throughout the entire learning and control process. We demonstrate the capabilities of Safe PDP in solving various safety-critical tasks, including safe policy optimization, safe motion planning, and learning MPCs from demonstrations, on different challenging systems such as 6-DoF maneuvering quadrotor and 6-DoF rocket powered landing.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates

Wenhao Luo, Wen Sun, Ashish Kapoor

Keywords Paper

Algorithms -> Clustering; Algorithms -> Semi-Supervised Learning; Theory -> Learning Theory, Algorithms -> Active Learning

0

0

0

0

3:20

03/05/2021

Conservative Safety Critics for Exploration

Homanga Bharadhwaj, Aviral Kumar, Nicholas Rhinehart and
Sergey Levine, Florian Shkurti, Animesh Garg

Keywords Paper

Safe exploration, Reinforcement Learning

0

0

0

0

5:14

26/08/2020

Adaptive Discretization for Evaluation of Probabilistic Cost Functions

Christoph Zimmer, Danny Driess, Mona Meister, Nguyen-Tuong Duy

Keywords Paper

0

0

0

0

14:13

06/12/2021

Safe Policy Optimization with Local Generalized Linear Function Approximations

Akifumi Wachi, Yunyue Wei, Yanan Sui

Keywords Paper

theory, optimization, reinforcement learning and planning

0

0

0

0

9:58

16/11/2020

Guaranteeing Safety of Learned Perception Modules via Measurement-Robust Control Barrier Functions

Sarah Dean, Andrew Taylor, Ryan Cosner and
Benjamin Recht, Aaron Ames

Keywords Paper

1

0

0

0

5:14

18/07/2021

Gaussian Process-Based Real-Time Learning for Safety Critical Applications

Armin Lederer, Alejandro Ordóñez Conejo, Korbinian Maier and
Wenxin Xiao, Jonas Umlauft, Sandra Hirche

Keywords Paper

Probabilistic Methods, Gaussian Processes and Bayesian non-parametrics

0

0

0

0

4:59

19/08/2021

Model-Based Reinforcement Learning for Infinite-Horizon Discounted Constrained Markov Decision Processes

Aria HasanzadeZonuzy, Dileep Kalathil, Srinivas Shakkottai

Keywords Paper

Machine Learning, Reinforcement Learning, Markov Decisions Processes

0

0

0

0

13:26

06/12/2021

Safe Reinforcement Learning with Natural Language Constraints

Tsung-Yen Yang, Michael Y Hu, Yinlam Chow and
Peter J Ramadge, Karthik Narasimhan

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:32

02/02/2021

Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models

Tong Che, Xiaofeng Liu, Site Li and
Yubin Ge, Ruixiang Zhang, Caiming Xiong, Yoshua Bengio

Keywords Paper

0

0

0

0

19:22

02/02/2021

Scalable and Safe Multi-Agent Motion Planning with Nonlinear Dynamics and Bounded Disturbances

Jingkai Chen, Jiaoyang Li, Chuchu Fan, Brian C. Williams

Keywords Paper

0

0

0

0

17:06

13/04/2021

Online robust control of nonlinear systems with large uncertainty

Dimitar Ho, Hoang Le, John Doyle, Yisong Yue

Keywords Paper

0

0

0

0

3:02

13/04/2021

Provably safe PAC-MDP exploration using analogies

Melrose Roderick, Vaishnavh Nagarajan, Zico Kolter

Keywords Paper

0

0

0

0

2:51

19/01/2020

The High-Level Benefits of Low-Level Sandboxing

Michael Sammler, Deepak Garg, Derek Dreyer, Tadeusz Litak

Keywords Paper

logical relations, type systems, Sandboxing, language-based security, robust safety, Iris

0

0

0

0

20:34

18/07/2021

Safe Reinforcement Learning with Linear Function Approximation

Sanae Amani, Christos Thrampoulidis, Lin Yang

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:03

19/01/2020

Virtual Timeline: A Formal Abstraction for Verifying Preemptive Schedulers with Temporal Isolation

Mengqi Liu, Lionel Rieg, Zhong Shao and
Ronghui Gu, David Costanzo, Jung-Eun Kim, Man-Ki Yoon

Keywords Paper

partitioned scheduling, preemptive scheduler, temporal isolation, mechanized proof, fixed-priority scheduling, formal verification

0

0

0

0

20:02

19/08/2021

Finite-Trace and Generalized-Reactivity Specifications in Temporal Synthesis

Giuseppe De Giacomo, Antonio Di Stasio, Lucas M. Tabajara and
Moshe Vardi, Shufang Zhu

Keywords Paper

Knowledge Representation and Reasoning, Action, Change and Causality, Theoretical Foundations of Planning, Formal Verification, Validation and Synthesis

0

0

0

0

13:42

02/02/2021

Addressing Action Oscillations through Learning Policy Inertia

Chen Chen, Hongyao Tang, Jianye Hao and
Wulong Liu, Zhaopeng Meng

Keywords Paper

0

0

0

0

14:57

26/04/2020

Infinite-Horizon Differentiable Model Predictive Control

Sebastian East, Marco Gallieri, Jonathan Masci and
Jan Koutnik, Mark Cannon

Keywords Paper

Model Predictive Control, Riccati Equation, Imitation Learning, Safe Learning

0

0

0

0

4:56

26/04/2020

GenDICE: Generalized Offline Estimation of Stationary Values

Ruiyi Zhang, Bo Dai, Lihong Li, Dale Schuurmans

Keywords Paper

Off-policy Policy Evaluation, Reinforcement Learning, Stationary Distribution Correction Estimation, Fenchel Dual

0

0

0

0

15:37

13/04/2021

Deep probabilistic accelerated evaluation: A robust certifiable rare-event simulation methodology for black-box safety-critical systems

Mansur Arief, Zhiyuan Huang, Guru Koushik Senthil Kumar and
Yuanlu Bai, Shengyi He, Wenhao Ding, Henry Lam, Ding Zhao

Keywords Paper

0

0

0

0

3:03

16/11/2020

Safe Policy Learning for Continuous Control

Yinlam Chow, Ofir Nachum, Aleksandra Faust and
Edgar Dueñez-Guzman, Mohammad Ghavamzadeh

Keywords Paper

0

0

0

0

5:20

18/07/2021

Safe Reinforcement Learning Using Advantage-Based Intervention

Nolan Wagener, Byron Boots, Ching-An Cheng

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

5:09

06/12/2021

Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs

harsh satija, Philip S. Thomas, Joelle Pineau, Romain Laroche

Keywords Paper

reinforcement learning and planning

0

0

0

0

12:27

06/12/2021

Adversarially Robust 3D Point Cloud Recognition Using Self-Supervisions

Jiachen Sun, Yulong Cao, Christopher B Choy and
Zhiding Yu, Anima Anandkumar, Zhuoqing Morley Mao, Chaowei Xiao

Keywords Paper

deep learning, robustness, adversarial robustness and security, self-supervised learning, transformers

0

0

0

0

13:15

18/07/2021

Density Constrained Reinforcement Learning

Zengyi Qin, Yuxiao Chen, Chuchu Fan

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

4:50

12/07/2020

Constrained Markov Decision Processes via Backward Value Functions

Harsh Satija, Philip Amortila, Joelle Pineau

Keywords Paper

Reinforcement Learning - General

0

0

0

0

10:40

02/02/2021

WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning

Qisong Yang, Thiago D. Simão, Simon H Tindemans, Matthijs T. J. Spaan

Keywords Paper

0

0

0

0

17:28

03/05/2021

Enforcing robust control guarantees within neural network policies

Priya Donti, Melrose Roderick, Mahyar Fazlyab, Zico Kolter

Keywords Paper

reinforcement learning, differentiable optimization, robust control

0

0

0

1

5:09

06/12/2020

First Order Constrained Optimization in Policy Space

Yiming Zhang, Quan Vuong, Keith Ross

Keywords Paper

0

0

0

0

3:15

03/05/2021

PAC Confidence Predictions for Deep Neural Network Classifiers

Sangdon Park, Shuo Li, Insup Lee, Osbert Bastani

Keywords Paper

classification, fast DNN inference, probably approximated correct guarantee, calibration, safe planning

0

0

0

0

5:11

02/02/2021

Sample-Specific Output Constraints for Neural Networks

Mathis Brosowsky, Florian Keck, Olaf Dünkel, Marius Zöllner

Keywords Paper

0

0

0

0

15:51

20/07/2020

On the stable recovery of deep structured linear networks under sparsity constraints

François Malgouyres

Keywords Paper

0

0

0

0

18:44

06/12/2020

Almost Surely Stable Deep Dynamics

Nathan Lawrence, Philip Loewen, Michael Forbes and
Johan Backstrom, Bhushan Gopaluni

Keywords Paper

0

0

0

0

3:25

15/06/2020

Reactive probabilistic programming

Guillaume Baudart, Louis Mandel, Eric Atkinson and
Benjamin Sherman, Marc Pouzet, Michael Carbin

Keywords Paper

Streaming inference, Compilation, Semantics, Reactive programming, Probabilistic programming

0

0

0

0

16:16

06/12/2021

Nonsmooth Implicit Differentiation for Machine-Learning and Optimization

Jérôme Bolte, Tam Le, Edouard Pauwels, Tony Silveti-Falls

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

12:32

03/05/2021

Learning Safe Multi-agent Control with Decentralized Neural Barrier Certificates

Zengyi Qin, Kaiqing Zhang, chenyx Chen and
Jingkai Chen, Chuchu Fan

Keywords Paper

reinforcement learning, control barrier function, safe, Multi-agent

0

0

0

0

5:45

26/04/2020

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control

Nir Levine, Yinlam Chow, Rui Shu and
Ang Li, Mohammad Ghavamzadeh, Hung Bui

Keywords Paper

Embed-to-Control, Representation Learning, Stochastic Optimal Control, VAE, iLQR

0

0

0

0

5:06

16/11/2020

Learning to Improve Multi-Robot Hallway Navigation

Jin Soo Park, Brian Tsang, Harel Yedidsion and
Garrett Warnell, Daehyun Kyoung, Peter Stone

Keywords Paper

0

0

0

0

5:06

02/02/2021

Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs

Aria HasanzadeZonuzy, Archana Bura, Dileep Kalathil, Srinivas Shakkottai

Keywords Paper

0

0

0

0

17:18

13/04/2021

Provably efficient safe exploration via primal-dual policy optimization

Dongsheng Ding, Xiaohan Wei, Zhuoran Yang and
Zhaoran Wang, Mihailo Jovanovic

Keywords Paper

0

0

0

0

3:07