Balancing Constraints and Rewards with Meta-Gradient D4PG

Abstract: Deploying Reinforcement Learning (RL) agents in real-world applications often requires satisfying complex system constraints. The constraint thresholds are often set incorrectly, either because the system is complex or because the thresholds cannot be verified offline (e.g., no simulator or reasonable offline evaluation procedure exists). The result is a problem in which the task cannot be solved without violating the constraints. In many real-world cases, however, constraint violations are undesirable but not catastrophic, which motivates soft-constrained RL approaches. We present two soft-constrained RL approaches that use meta-gradients to find a good trade-off between maximizing expected return and minimizing constraint violations. We demonstrate the effectiveness of these approaches by showing that they consistently outperform the baselines across four different MuJoCo domains.

12/07/2020

Dan A. Calian, Daniel J Mankowitz, Tom Zahavy, Zhongwen Xu, Junhyuk Oh, Nir Levine, Timothy A Mann

Similar Papers

Constrained Markov Decision Processes via Backward Value Functions

Harsh Satija, Philip Amortila, Joelle Pineau

Reinforcement Learning - General

Safe Reinforcement Learning with Natural Language Constraints

Tsung-Yen Yang, Michael Y Hu, Yinlam Chow, Peter J Ramadge, Karthik Narasimhan

reinforcement learning and planning

Residual Pathway Priors for Soft Equivariance Constraints

Marc Finzi, Gregory Benton, Andrew Wilson

deep learning, reinforcement learning and planning, machine learning

Adversarial Robustness with Semi-Infinite Constrained Learning

Alexander Robey, Luiz Chamon, George J. Pappas, Hamed Hassani, Alejandro Ribeiro

theory, deep learning, optimization, robustness, adversarial robustness and security

Policy Learning with Constraints in Model-free Reinforcement Learning: A Survey

Yongshuai Liu, Avishai Halev, Xin Liu

Machine learning, General, General, General

A Theory of Independent Mechanisms for Extrapolation in Generative Models

Michel Besserve, Remy Sun, Dominik Janzing, Bernhard Schölkopf

Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity

Kaiqing Zhang, Xiangyuan Zhang, Bin Hu, Tamer Basar

theory, optimization, reinforcement learning and planning

When Is Generalizable Reinforcement Learning Tractable?

Dhruv Malik, Yuanzhi Li, Pradeep Ravikumar

reinforcement learning and planning, generative model, representation learning

Inverse Constrained Reinforcement Learning

Shehryar Malik, Usman Anwar, Alireza Aghasi, Ali Ahmed

Reinforcement Learning and Planning, Deep RL

An Integer Linear Programming Framework for Mining Constraints from Data

Tao Meng, Kai-Wei Chang

Algorithms, Structured Prediction

Density Constrained Reinforcement Learning

Zengyi Qin, Yuxiao Chen, Chuchu Fan

Reinforcement Learning and Planning

Teaching the Old Dog New Tricks: Supervised Learning with Constraints

Fabrizio Detassis, Michele Lombardi, Michela Milano

Gradually Vanishing Bridge for Adversarial Domain Adaptation

Shuhao Cui, Shuhui Wang, Junbao Zhuo, Chi Su, Qingming Huang, Qi Tian

bridge, domain adaptation, adversarial learning

Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning

Hiroki Furuta, Tatsuya Matsushima, Tadashi Kozuno, Yutaka Matsuo, Sergey Levine, Ofir Nachum, Shixiang Gu

Reinforcement Learning and Planning, Deep RL

Sample-Specific Output Constraints for Neural Networks

Mathis Brosowsky, Florian Keck, Olaf Dünkel, Marius Zöllner

Offline Contextual Bandits with Overparameterized Models

David Brandfonbrener, Will Whitney, Rajesh Ranganath, Joan Bruna

Optimization, Non-Convex Optimization, Reinforcement Learning and Planning, Optimization, Stochastic Optimization

Multi-Label Learning with Pairwise Relevance Ordering

Ming-Kun Xie, Sheng-Jun Huang

machine learning

Adversarial Robustness through Disentangled Representations

Shuo Yang, Tianyu Guo, Yunhe Wang, Chang Xu

Understanding the Limitations of Conditional Generative Models

Ethan Fetaya, Joern-Henrik Jacobsen, Will Grathwohl, Richard Zemel

Conditional Generative Models, Generative Classifiers, Robustness, Adversarial Examples

On hyperparameter tuning in general clustering problems

Xinjie Fan, Yuguang Yue, Purnamrita Sarkar, Y. X. Rachel Wang

Unsupervised and Semi-Supervised Learning
