FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis

12/07/2020

FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis

Aman Sinha, Matthew O'Kelly, Hongrui Zheng, Rahul Mangharam, John Duchi, Russ Tedrake

Keywords: Reinforcement Learning - General

Abstract Paper Similar Papers

Abstract: Balancing performance and safety is crucial to deploying autonomous vehicles in multi-agent environments. In particular, autonomous racing is a domain that penalizes safe but conservative policies, highlighting the need for robust, adaptive strategies. Current approaches either make simplifying assumptions about other agents or lack robust mechanisms for online adaptation. This work makes algorithmic contributions to both challenges. First, to generate a realistic, diverse set of opponents, we develop a novel method for self-play based on replica-exchange Markov chain Monte Carlo. Second, we propose a distributionally robust bandit optimization procedure that adaptively adjusts risk aversion relative to uncertainty in beliefs about opponents’ behaviors. We rigorously quantify the tradeoffs in performance and robustness when approximating these computations in real-time motion-planning, and we demonstrate our methods experimentally on autonomous vehicles that achieve scaled speeds comparable to Formula One racecars.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Addressing Action Oscillations through Learning Policy Inertia

Chen Chen, Hongyao Tang, Jianye Hao and
Wulong Liu, Zhaopeng Meng

Keywords Paper

0

0

0

0

14:57

19/08/2021

Objective-aware Traffic Simulation via Inverse Reinforcement Learning

Guanjie Zheng, Hanyang Liu, Kai Xu, Zhenhui Li

Keywords Paper

Multidisciplinary Topics and Applications, Transportation, Applications of Reinforcement Learning, Mining Spatial, Temporal Data

0

0

0

0

12:52

16/11/2020

Sim2Real Transfer for Deep Reinforcement Learning with Stochastic State Transition Delays

Sandeep Singh Sandha, Luis Garcia, Bharathan Balaji and
Fatima Anwar, Mani Srivastava

Keywords Paper

0

0

0

0

4:59

12/08/2020

Stealthy Tracking of Autonomous Vehicles with Cache Side Channels

Mulong Luo, Andrew C. Myers, G. Edward Suh

Keywords Paper

0

0

0

0

11:23

06/12/2020

PRANK: motion Prediction based on RANKing

Yuriy Biktairov, Maxim Stebelev, Irina Rudenko and
Oleh Shliazhko, Boris Yangel

Keywords Paper

0

0

0

0

3:22

06/12/2020

Neural Bridge Sampling for Evaluating Safety-Critical Autonomous Systems

Aman Sinha, Matthew O'Kelly, Russ Tedrake, John Duchi

Keywords Paper

0

0

0

0

3:21

13/04/2021

Deep probabilistic accelerated evaluation: A robust certifiable rare-event simulation methodology for black-box safety-critical systems

Mansur Arief, Zhiyuan Huang, Guru Koushik Senthil Kumar and
Yuanlu Bai, Shengyi He, Wenhao Ding, Henry Lam, Ding Zhao

Keywords Paper

0

0

0

0

3:03

16/11/2020

Learning a Decision Module by Imitating Driver’s Control Behaviors

Junning Huang, Sirui Xie, Jiankai Sun and
Qiurui Ma, Chunxiao Liu, Dahua Lin, Bolei Zhou

Keywords Paper

0

0

0

0

5:04

03/05/2021

HalentNet: Multimodal Trajectory Forecasting with Hallucinative Intents

Deyao Zhu, Mohamed Zahran, Li Erran Li, Mohamed Elhoseiny

Keywords Paper

0

0

0

0

5:18

14/06/2020

QEBA: Query-Efficient Boundary-Based Blackbox Attack

Huichen Li, Xiaojun Xu, Xiaolu Zhang and
Shuang Yang, Bo Li

Keywords Paper

adversarial machine learning, black-box attack, boundary-based attack, attacking public api

0

0

0

0

1:01

06/12/2021

Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization

Zhenghao Peng, Quanyi Li, Ka Ming Hui and
Chunxiao Liu, Bolei Zhou

Keywords Paper

optimization, reinforcement learning and planning

0

0

0

0

12:08

03/05/2021

Enforcing robust control guarantees within neural network policies

Priya Donti, Melrose Roderick, Mahyar Fazlyab, Zico Kolter

Keywords Paper

reinforcement learning, differentiable optimization, robust control

0

0

0

1

5:09

12/07/2020

Can autonomous vehicles identify, recover from, and adapt to distribution shifts?

Angelos Filos, Panagiotis Tigas, Rowan McAllister and
Nicholas Rhinehart, Sergey Levine, Yarin Gal

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

14:49

26/08/2020

Mixed Strategies for Robust Optimization of Unknown Objectives

Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

Keywords Paper

0

0

0

0

14:13

02/02/2021

Robustness Guarantees for Mode Estimation with an Application to Bandits

Aldo Pacchiano, Heinrich Jiang, Michael I. Jordan

Keywords Paper

0

0

0

0

17:04

02/02/2021

Sequential Attacks on Kalman Filter-based Forward Collision Warning Systems

Yuzhe Ma, Jon A Sharp, Ruizhe Wang and
Earlence Fernandes, Xiaojin Zhu

Keywords Paper

0

0

0

0

18:00

14/06/2020

Exploring Data Aggregation in Policy Learning for Vision-Based Urban Autonomous Driving

Aditya Prakash, Aseem Behl, Eshed Ohn-Bar and
Kashyap Chitta, Andreas Geiger

Keywords Paper

autonomous driving, imitation learning, dagger, carla, adaptive sampling, online learning, mixture of distributions, curriculum learning

0

0

0

0

1:01

06/12/2021

The Many Faces of Adversarial Risk

Muni Sreenivas Pydi, Varun Jog

Keywords Paper

theory, machine learning, robustness, adversarial robustness and security, optimal transport

0

0

0

0

10:33

15/11/2020

Differentially-Private Software Frequency Profiling under Linear Constraints

Hailong Zhang, Yu Hao, Sufian Latif and
Raef Bassily, Atanas Rountev

Keywords Paper

program analysis, frequency profiling, differential privacy

0

0

0

0

14:14

06/12/2020

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates

Wenhao Luo, Wen Sun, Ashish Kapoor

Keywords Paper

Algorithms -> Clustering; Algorithms -> Semi-Supervised Learning; Theory -> Learning Theory, Algorithms -> Active Learning

0

0

0

0

3:20

26/08/2020

Learning with minibatch Wasserstein : asymptotic and gradient properties

Kilian Fatras, Younès Zine, Rémi Flamary and
Remi Gribonval, Nicolas Courty

Keywords Paper

0

0

0

1

12:59

06/12/2020

Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation

Yogesh Balaji, Rama Chellappa, Soheil Feizi

Keywords Paper

0

0

0

0

3:24

02/02/2021

Stable Adversarial Learning under Distributional Shifts

Jiashuo Liu, Zheyan Shen, Peng Cui and
Linjun Zhou, Kun Kuang, Bo Li, Yishi Lin

Keywords Paper

0

0

0

0

14:30

03/05/2021

Emergent Road Rules In Multi-Agent Driving Environments

Avik Pal, Jonah Philion, Andrew Liao, Sanja Fidler

Keywords Paper

0

0

0

0

4:48

03/05/2021

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

Keywords Paper

reinforcement learning, model-predictive control

0

0

0

0

5:09

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng , Ben Letham, Hongzi Mao, Eytan Bakshy

Keywords Paper

0

0

0

0

3:29

14/09/2020

Learning to Simulate on Sparse Trajectory Data

Hua Wei, Chacha Chen, Chang Liu and
Guanjie Zheng, Zhenhui Li

Keywords Paper

imitation learning, data sparsity, interpolation

0

0

0

0

16:07

26/04/2020

Deep Imitative Models for Flexible Inference, Planning, and Control

Nicholas Rhinehart, Rowan McAllister, Sergey Levine

Keywords Paper

imitation learning, planning, autonomous driving

0

0

0

0

5:02

16/11/2020

BayesRace: Learning to race autonomously using prior experience

Achin Jain, Matthew O’Kelly, Pratik Chaudhari, Manfred Morari

Keywords Paper

0

0

0

0

7:44

06/12/2021

Towards Optimal Strategies for Training Self-Driving Perception Models in Simulation

David Acuna, Jonah Philion, Sanja Fidler

Keywords Paper

theory, vision, domain adaptation, transfer learning

0

0

0

0

11:00

02/02/2021

Solution Concepts in Hierarchical Games Under Bounded Rationality With Applications to Autonomous Driving

Atrisha Sarkar, Krzysztof Czarnecki

Keywords Paper

0

0

0

0

19:57

06/12/2020

Stage-wise Conservative Linear Bandits

Ahmadreza Moradipari, Christos Thrampoulidis, Mahnoosh Alizadeh

Keywords Paper

0

0

0

0

3:18

18/07/2021

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras, Thibault Séjourné, Rémi Flamary, Nicolas Courty

Keywords Paper

Algorithms, Optimal Transport

0

0

0

2

4:57

03/05/2021

Trajectory Prediction using Equivariant Continuous Convolution

Robin Walters, Jinxi Li, Rose Yu

Keywords Paper

argoverse, continuous convolution, trajectory prediction, equivariant, symmetry

0

0

0

0

5:09

16/11/2020

Action-based Representation Learning for Autonomous Driving

Yi Xiao CVC &, UAB, Felipe Codevilla and
Christopher Pal, Antonio Lopez CVC &, UAB

Keywords Paper

0

0

0

0

4:48

06/12/2021

Adversarially Robust 3D Point Cloud Recognition Using Self-Supervisions

Jiachen Sun, Yulong Cao, Christopher B Choy and
Zhiding Yu, Anima Anandkumar, Zhuoqing Morley Mao, Chaowei Xiao

Keywords Paper

deep learning, robustness, adversarial robustness and security, self-supervised learning, transformers

0

0

0

0

13:15

22/11/2021

Multi-Stream Attention Learning for Monocular Vehicle Velocity and Inter-Vehicle Distance Estimation

Kuan-Chih Huang, Yu-Kai Huang, Winston H. Hsu

Keywords Paper

velocity estimation, distance estimation, relative constraint, autonomous driving, autonomous vehicle, perception, ADAS

0

0

0

0

2:32

12/07/2020

Invariant Risk Minimization Games

Kartik Ahuja, Karthikeyan Shanmugam, Kush Varshney, Amit Dhurandhar

Keywords Paper

Causality

0

0

0

0

14:57

12/07/2020

Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences

Daniel Brown, Scott Niekum, Russell Coleman, Ravi Srinivasan

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:11

12/07/2020

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

Jesse Zhang, Brian Cheung, Chelsea Finn and
Sergey Levine, Dinesh Jayaraman

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:54