A Distributional View on Multi-Objective Policy Optimization

12/07/2020

Abbas Abdolmaleki, Sandy Huang, Leonard Hasenclever, Michael Neunert, Martina Zambelli, Murilo Martins, Francis Song, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Keywords: Reinforcement Learning - Deep RL

Abstract: Many real-world problems require trading off multiple competing objectives. However, these objectives are often in different units with different scales, which can make it challenging for practitioners to express numerical preferences over objectives in their native units. In this paper we propose a novel algorithm for multi-objective reinforcement learning that enables setting desired preferences for objectives in a scale-invariant way. We propose to learn a target local policy for each objective, and we use supervised learning to fit a parametric policy to a combination of these distributions. We demonstrate the effectiveness of our approach on challenging high-dimensional real and simulated robotics tasks, and show that setting different preferences in our framework allows us to trace out the space of nondominated solutions.
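The abstract does not spell out how the per-objective target distributions are computed or combined, so the following is only a minimal sketch of one way such a scheme could look. It assumes an MPO-style temperature dual, in which each objective k gets a KL budget `epsilon` instead of a reward weight, and it assumes a categorical policy over a fixed set of sampled actions so that the supervised fit has a closed form. All function names and the example Q-values are hypothetical. The point it illustrates is scale invariance: multiplying an objective's Q-values by a constant rescales the solved temperature by the same constant and leaves the resulting action weights unchanged.

```python
import math

def dual(eta, q_values, epsilon):
    # Assumed MPO-style dual: g(eta) = eta*epsilon + eta*log(mean(exp(q/eta))),
    # written with the max subtracted for numerical stability.
    m = max(q_values)
    mean_exp = sum(math.exp((q - m) / eta) for q in q_values) / len(q_values)
    return eta * epsilon + m + eta * math.log(mean_exp)

def solve_temperature(q_values, epsilon, iters=100):
    # Minimise the dual over eta > 0 with golden-section search in log(eta).
    spread = max(q_values) - min(q_values)
    if spread == 0.0:
        return 1.0  # all actions tie, so any temperature yields uniform weights
    lo, hi = math.log(spread) - 7.0, math.log(spread) + 7.0
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(iters):
        a = hi - ratio * (hi - lo)
        b = lo + ratio * (hi - lo)
        if dual(math.exp(a), q_values, epsilon) < dual(math.exp(b), q_values, epsilon):
            hi = b
        else:
            lo = a
    return math.exp(0.5 * (lo + hi))

def improved_weights(q_values, epsilon):
    # Per-objective target distribution over the sampled actions.
    eta = solve_temperature(q_values, epsilon)
    m = max(q_values)
    unnorm = [math.exp((q - m) / eta) for q in q_values]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def combine(per_objective_weights):
    # For a categorical policy, weighted maximum likelihood on the summed
    # per-objective weights is solved exactly by their normalised sum.
    n = len(per_objective_weights[0])
    summed = [sum(w[i] for w in per_objective_weights) for i in range(n)]
    total = sum(summed)
    return [s / total for s in summed]

# Hypothetical Q-values for four sampled actions under two objectives
# that live on very different scales.
q_task = [1.0, 2.0, 0.5, 3.0]
q_cost = [-400.0, -900.0, -100.0, -2500.0]
target = combine([improved_weights(q_task, 0.1), improved_weights(q_cost, 0.01)])
print(target)
```

In this sketch a larger `epsilon` lets an objective pull its target distribution further from uniform, so preferences are expressed through the KL budgets and not through the units of the rewards. A real implementation would fit a neural policy with a trust-region constraint in place of the closed-form categorical fit used here.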

The talk and the accompanying paper were published at the ICML 2020 virtual conference.

Similar Papers

06/12/2020

Multi-Task Reinforcement Learning with Soft Modularization

Ruihan Yang, Huazhe Xu, YI WU, Xiaolong Wang

3:18

12/07/2020

Flexible and Efficient Long-Range Planning Through Curious Exploration

Aidan Curtis, Minjian Xin, Dilip Arumugam, Kevin Feigelis, Daniel Yamins

Keywords: Reinforcement Learning - General

16:25

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng, Ben Letham, Hongzi Mao, Eytan Bakshy

3:29

18/07/2021

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

Keywords: Reinforcement Learning and Planning, Deep RL

4:57

16/11/2020

Learning Dexterous Manipulation from Suboptimal Experts

Rae Jeong, Jost Tobias Springenberg, Jackie Kay, Dan Zheng, Alexandre Galashov, Nicolas Heess, Francesco Nori

5:03

06/12/2021

Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics

Ingmar Schubert, Danny Driess, Ozgur S. Oguz, Marc Toussaint

Keywords: reinforcement learning and planning

8:36

06/12/2021

Robust Generalization despite Distribution Shift via Minimum Discriminating Information

Tobias Sutter, Andreas Krause, Daniel Kuhn

Keywords: optimization, machine learning

15:05

06/12/2020

Learning Linear Programs from Optimal Decisions

Yingcong Tan, Daria Terekhov, Andrew Delong

Keywords: Applications -> Privacy, Anonymity, and Security

3:21

26/08/2020

Finite-Time Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation

Jun Sun, Gang Wang, Georgios B. Giannakis, Qinmin Yang, Zaiyue Yang

17:07

06/12/2020

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL

Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn

3:24

12/07/2020

Skew-Fit: State-Covering Self-Supervised Reinforcement Learning

Vitchyr Pong, Murtaza Dalal, Steven Lin, Ashvin Nair, Shikhar Bahl, Sergey Levine

Keywords: Reinforcement Learning - Deep RL

15:13

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords: Reinforcement Learning and Planning, Deep RL

5:16

13/04/2021

Abstract value iteration for hierarchical reinforcement learning

Kishor Jothimurugan, Osbert Bastani, Rajeev Alur

2:57

12/07/2020

Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, Wojciech Matusik

Keywords: Reinforcement Learning - Deep RL

15:15

06/12/2020

PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Henry Charlesworth, Giovanni Montana

3:20

06/12/2021

Design of Experiments for Stochastic Contextual Linear Bandits

Andrea Zanette, Kefan Dong, Jonathan N Lee, Emma Brunskill

Keywords: reinforcement learning and planning, bandits

13:58

26/04/2020

Learning to Balance: Bayesian Meta-Learning for Imbalanced and Out-of-distribution Tasks

Hae Beom Lee, Hayeon Lee, Donghyun Na, Saehoon Kim, Minseop Park, Eunho Yang, Sung Ju Hwang

Keywords: meta-learning, few-shot learning, Bayesian neural network, variational inference, learning to learn, imbalanced and out-of-distribution tasks for few-shot learning

13:46

06/12/2021

An Information-theoretic Approach to Distribution Shifts

Marco Federici, Ryota Tomioka, Patrick Forré

Keywords: theory, deep learning, machine learning, graph learning, domain adaptation, representation learning

9:50

06/12/2021

Learning Collaborative Policies to Solve NP-hard Routing Problems

Minsu Kim, Jinkyoo Park, Joungho Kim

Keywords: reinforcement learning and planning

15:03

06/12/2020

Modeling and Optimization Trade-off in Meta-learning

Katelyn Gao, Ozan Sener

3:21

03/05/2021

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner, Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords: reinforcement learning, goal reaching, Q-learning

4:49

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords: Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

5:05

06/12/2020

Approximate Heavily-Constrained Learning with Lagrange Multiplier Models

Harikrishna Narasimhan, Andy Cotter, Yichen Zhou, Serena Wang, Wenshuo Guo

3:21

06/12/2021

Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

Tim Seyde, Igor Gilitschenski, Wilko Schwarting, Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

Keywords: reinforcement learning and planning

6:48

12/07/2020

Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems

Kaixuan Wei, Angelica I Aviles-Rivero, Jingwei Liang, Ying Fu, Carola-Bibiane Schönlieb, Hua Huang

Keywords: Deep Learning - Algorithms

11:48

26/04/2020

Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

Yuping Luo, Huazhe Xu, Tengyu Ma

Keywords: imitation learning, model-based imitation learning, model-based RL, behavior cloning, covariate shift

4:38

26/04/2020

Intrinsically Motivated Discovery of Diverse Patterns in Self-Organizing Systems

Chris Reinke, Mayalen Etcheverry, Pierre-Yves Oudeyer

Keywords: deep learning, unsupervised learning, self-organization, game-of-life

14:57

26/04/2020

Automated curriculum generation through setter-solver interactions

Sebastien Racaniere, Andrew Lampinen, Adam Santoro, David Reichert, Vlad Firoiu, Timothy Lillicrap

Keywords: Deep Reinforcement Learning, Automatic Curriculum

3:55

12/07/2020

Ready Policy One: World Building Through Active Learning

Philip Ball, Jack Parker-Holder, Aldo Pacchiano, Krzysztof Choromanski, Stephen Roberts

Keywords: Reinforcement Learning - Deep RL

15:31

19/08/2021

Two-Sided Wasserstein Procrustes Analysis

Kun Jin, Chaoyue Liu, Cathy Xia

Keywords: Machine Learning Applications, Applications of Unsupervised Learning, Transfer, Adaptation, Multi-task Learning, Bio/Medicine

15:43

23/08/2020

Diverse rule sets

Guangyi Zhang, Aristides Gionis

Keywords: sampling, classifier, pattern mining, rule learning, diversification, rule sets

9:41

19/08/2021

Cross-Domain Few-Shot Classification via Adversarial Task Augmentation

Haoqing Wang, Zhi-Hong Deng

Keywords: Computer Vision, Recognition, Adversarial Machine Learning, Deep Learning

10:39

14/06/2020

End-to-End 3D Point Cloud Instance Segmentation Without Detection

Haiyong Jiang, Feilong Yan, Jianfei Cai, Jianmin Zheng, Jun Xiao

Keywords: 3d instance segmentation, stable matching, point cloud, label assignment

1:01

03/05/2021

Theoretical bounds on estimation error for meta-learning

James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI, Toniann Pitassi, Richard Zemel

Keywords: meta learning, minimax risk, few-shot, lower bounds, learning theory

4:46

14/09/2020

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Arjun Manoharan, Rahul Ramesh, Balaraman Ravindran

Keywords: hierarchical reinforcement learning, policy distillation

13:49

30/11/2020

Large-Scale Cross-Domain Few-Shot Learning

Jiechao Guan, Manli Zhang, Zhiwu Lu

7:26

12/07/2020

Meta Variance Transfer: Learning to Augment from the Others

Seong-Jin Park, Seungju Han, Ji-won Baek, Insoo Kim, Juhwan Song, Hae Beom Lee, Jae-Joon Han, Sung Ju Hwang

Keywords: Transfer, Multitask and Meta-learning

14:59

26/10/2020

PDDLStream: Integrating Symbolic Planners and Blackbox Samplers via Optimistic Adaptive Planning

Caelan Reed Garrett, Tomás Lozano-Pérez, Leslie Pack Kaelbling

Keywords: Task and Motion Planning, Robotics, Sampling-Based Planning, Domain-Independent Planning, Hybrid Planning

9:58

03/05/2021

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Keywords: reinforcement learning, imitation learning, reward learning

5:09

06/12/2021

Local policy search with Bayesian optimization

Sarah Müller, Alexander von Rohr, Sebastian Trimpe

Keywords: theory, optimization, reinforcement learning and planning, active learning

11:42