Enforcing robust control guarantees within neural network policies

03/05/2021

Priya Donti, Melrose Roderick, Mahyar Fazlyab, Zico Kolter

Keywords: reinforcement learning, differentiable optimization, robust control

Abstract: When designing controllers for safety-critical systems, practitioners often face a challenging tradeoff between robustness and performance. While robust control methods provide rigorous guarantees on system stability under certain worst-case disturbances, they often yield simple controllers that perform poorly in the average (non-worst) case. In contrast, nonlinear control methods trained using deep learning have achieved state-of-the-art performance on many control tasks, but often lack robustness guarantees. In this paper, we propose a technique that combines the strengths of these two approaches: constructing a generic nonlinear control policy class, parameterized by neural networks, that nonetheless enforces the same provable robustness criteria as robust control. Specifically, our approach entails integrating custom convex-optimization-based projection layers into a neural network-based policy. We demonstrate the power of this approach on several domains, improving in average-case performance over existing robust control methods and in worst-case stability over (non-robust) deep RL methods.
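As a rough illustration of what a projection layer inside a policy can look like, the sketch below passes the output of a stand-in network through a Euclidean projection onto a convex set. Everything here is an assumption made for illustration: the half-space constraint, the single tanh layer, and the names `project_onto_halfspace`, `nominal_action`, and `projected_policy` are hypothetical and are not taken from the paper, which describes its layers only as custom convex-optimization-based projections enforcing robust control criteria.

```python
import math


def dot(p, q):
    """Inner product of two equal-length sequences."""
    return sum(pi * qi for pi, qi in zip(p, q))


def project_onto_halfspace(u, a, b):
    """Euclidean projection of u onto the convex set {v : a . v <= b}.

    The half-space is an assumed stand-in for a robustness constraint;
    it is chosen because its projection has a closed form.
    """
    violation = dot(a, u) - b
    if violation <= 0.0:
        # u already satisfies the constraint, so it is its own projection.
        return list(u)
    scale = violation / dot(a, a)
    # Move u along -a just far enough to land on the boundary a . v = b.
    return [ui - scale * ai for ui, ai in zip(u, a)]


def nominal_action(x, W):
    """Hypothetical one-layer 'network': tanh applied to W x."""
    return [math.tanh(dot(row, x)) for row in W]


def projected_policy(x, W, a, b):
    """Nominal action followed by the projection layer.

    Whatever the weights W are, the returned action satisfies a . u <= b.
    """
    return project_onto_halfspace(nominal_action(x, W), a, b)


if __name__ == "__main__":
    W = [[1.0, 0.0], [0.0, 1.0]]
    action = projected_policy([10.0, 0.0], W, a=[1.0, 0.0], b=0.5)
    print(action)
```

The point of the design is that the constraint holds by construction for any weights, so training can change the nominal action freely without leaving the feasible set. A general convex set would need a numerical solver in place of the closed-form step used here.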

The talk and the corresponding paper were published at the ICLR 2021 virtual conference.

Similar Papers

03/05/2021

Towards Robust Neural Networks via Close-loop Control

Zhuotong Chen, Qianxiao Li, Zheng Zhang

Keywords: dynamical system, neural network robustness, optimal control

4:47

26/08/2020

Non-Parametric Calibration for Classification

Jonathan Wenger, Hedvig Kjellström, Rudolph Triebel

15:29

13/04/2021

Online robust control of nonlinear systems with large uncertainty

Dimitar Ho, Hoang Le, John Doyle, Yisong Yue

3:02

02/02/2021

Efficient Certification of Spatial Robustness

Anian Ruoss, Maximilian Baader, Mislav Balunović, Martin Vechev

16:30

06/12/2021

Adversarial Robustness with Semi-Infinite Constrained Learning

Alexander Robey, Luiz Chamon, George J. Pappas, Hamed Hassani, Alejandro Ribeiro

Keywords: theory, deep learning, optimization, robustness, adversarial robustness and security

14:48

13/04/2021

Deep probabilistic accelerated evaluation: A robust certifiable rare-event simulation methodology for black-box safety-critical systems

Mansur Arief, Zhiyuan Huang, Guru Koushik Senthil Kumar, Yuanlu Bai, Shengyi He, Wenhao Ding, Henry Lam, Ding Zhao

3:03

18/11/2020

Towards understanding and improving the transferability of adversarial examples in deep neural networks

Lei Wu, Zhanxing Zhu

10:28

06/12/2021

Locally Valid and Discriminative Prediction Intervals for Deep Learning Models

Zhen Lin, Shubhendu Trivedi, Jimeng Sun

Keywords: deep learning

12:05

06/12/2021

Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity

Kaiqing Zhang, Xiangyuan Zhang, Bin Hu, Tamer Basar

Keywords: theory, optimization, reinforcement learning and planning

15:57

18/07/2021

Offline Contextual Bandits with Overparameterized Models

David Brandfonbrener, Will Whitney, Rajesh Ranganath, Joan Bruna

Keywords: Optimization, Non-Convex Optimization, Reinforcement Learning and Planning, Stochastic Optimization

6:07

18/07/2021

Meta-Cal: Well-controlled Post-hoc Calibration by Ranking

Xingchen Ma, Matthew B Blaschko

Keywords: Algorithms, Supervised Learning

4:28

03/05/2021

Shapley explainability on the data manifold

Christopher Frye, Damien De Mijolla, Tom Begley, Laurence Cowton, Megan Stanley, Ilya Feige

5:14

12/07/2020

Confidence-Aware Learning for Deep Neural Networks

Sangheum Hwang, Jooyoung Moon, Jihyo Kim, Younghak Shin

Keywords: Deep Learning - Algorithms

14:05

12/07/2020

Adversarial Neural Pruning with Latent Vulnerability Suppression

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords: Adversarial Examples

14:43

02/02/2021

A Theory of Independent Mechanisms for Extrapolation in Generative Models

Michel Besserve, Remy Sun, Dominik Janzing, Bernhard Schölkopf

18:37

03/05/2021

Uncertainty in Gradient Boosting via Ensembles

Andrey Malinin, Liudmila Prokhorenkova, Aleksei Ustimenko

Keywords: uncertainty, knowledge uncertainty, decision trees, gradient boosting, ensembles

5:30

26/04/2020

Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning

Ali Mousavi, Lihong Li, Qiang Liu, Denny Zhou

Keywords: reinforcement learning, off-policy estimation, importance sampling, propensity score

5:25

03/05/2021

Influence Functions in Deep Learning Are Fragile

Samyadeep Basu, Phil Pope, Soheil Feizi

Keywords: Influence Functions, Interpretability

6:15

06/12/2020

Improving model calibration with accuracy versus uncertainty optimization

Ranganath Krishnan, Omesh Tickoo

3:25

12/07/2020

Constrained Markov Decision Processes via Backward Value Functions

Harsh Satija, Philip Amortila, Joelle Pineau

Keywords: Reinforcement Learning - General

10:40

06/12/2021

Integrated Latent Heterogeneity and Invariance Learning in Kernel Space

Jiashuo Liu, Zheyuan Hu, Peng Cui, Bo Li, Zheyan Shen

Keywords: deep learning, reinforcement learning and planning, machine learning

11:11

02/02/2021

Addressing Action Oscillations through Learning Policy Inertia

Chen Chen, Hongyao Tang, Jianye Hao, Wulong Liu, Zhaopeng Meng

14:57

06/12/2021

Residual Pathway Priors for Soft Equivariance Constraints

Marc Finzi, Gregory Benton, Andrew Wilson

Keywords: deep learning, reinforcement learning and planning, machine learning

16:02

19/08/2021

Masked Contrastive Learning for Anomaly Detection

Hyunsoo Cho, Jinseok Seol, Sang-goo Lee

Keywords: Data Mining, Anomaly/Outlier Detection, Clustering

14:12

18/07/2021

Nondeterminism and Instability in Neural Network Optimization

Cecilia Summers, Michael J Dinneen

Keywords: Deep Learning, Optimization for Deep Networks

5:12

06/12/2020

Adversarial Robustness of Supervised Sparse Coding

Jeremias Sulam, Ramchandran Muthukumar, Raman Arora

3:08

18/07/2021

Discovering symbolic policies with deep reinforcement learning

Mikel Landajuela Larma, Brenden Petersen, Sookyung Kim, Claudio Santiago, Ruben Glatt, Nathan Mundhenk, Jacob Pettit, Daniel Faissol

Keywords: Reinforcement Learning and Planning, Deep RL

4:55

06/12/2020

The Advantage of Conditional Meta-Learning for Biased Regularization and Fine Tuning

Giulia Denevi, Massimiliano Pontil, Carlo Ciliberto

3:24

26/04/2020

A Constructive Prediction of the Generalization Error Across Scales

Jonathan S. Rosenfeld, Amir Rosenfeld, Yonatan Belinkov, Nir Shavit

Keywords: neural networks, deep learning, generalization error, scaling, scalability, vision, language

4:59

14/06/2020

Robust Design of Deep Neural Networks Against Adversarial Attacks Based on Lyapunov Theory

Arash Rahnama, Andre T. Nguyen, Edward Raff

Keywords: adversarial machine learning, robustness, control theory, Lyapunov theory, spectral norm regularization, stability and robustness analysis of DNNs, dissipativity and passivity theory, adversarial attacks, learning theory, mathematical analysis of DNNs

1:00

06/12/2021

Adversarial Robustness with Non-uniform Perturbations

Ecenaz Erdemir, Jeffrey Bickford, Luca Melis, Sergul Aydore

Keywords: deep learning, optimization, machine learning, robustness, adversarial robustness and security

15:05

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords: reinforcement learning and planning, robustness, representation learning

12:24

06/12/2020

HYDRA: Pruning Adversarially Robust Neural Networks

Vikash Sehwag, Shiqi Wang, Prateek Mittal, Suman Jana

3:14

02/02/2021

Practical and Rigorous Uncertainty Bounds for Gaussian Process Regression

Christian Fiedler, Carsten W. Scherer, Sebastian Trimpe

18:49

06/12/2020

Almost Surely Stable Deep Dynamics

Nathan Lawrence, Philip Loewen, Michael Forbes, Johan Backstrom, Bhushan Gopaluni

3:25

06/12/2021

A Little Robustness Goes a Long Way: Leveraging Robust Features for Targeted Transfer Attacks

Jacob Springer, Melanie Mitchell, Garrett Kenyon

Keywords: deep learning, optimization, robustness, adversarial robustness and security, transformers

9:29

13/04/2021

Simultaneously reconciled quantile forecasting of hierarchically related time series

Xing Han, Sambarta Dasgupta, Joydeep Ghosh

3:04

30/11/2020

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

10:07

26/08/2020

Adaptive, Distribution-Free Prediction Intervals for Deep Networks

Danijel Kivaranovic, Kory D. Johnson, Hannes Leeb

16:48

06/12/2021

Truncated Marginal Neural Ratio Estimation

Benjamin K Miller, Alex Cole, Patrick Forré, Gilles Louppe, Christoph Weniger

Keywords: robustness

10:11