Adaptive Trade-Offs in Off-Policy Learning

26/08/2020

Adaptive Trade-Offs in Off-Policy Learning

Mark Rowland, Will Dabney, Remi Munos

Keywords:

Abstract Paper Similar Papers

Abstract: A great variety of off-policy learning algorithms exist in the literature, and new breakthroughs in this area continue to be made, improving theoretical understanding and yielding state-of-the-art reinforcement learning algorithms. In this paper, we take a unifying view of this space of algorithms, and consider their trade-offs of three fundamental quantities: update variance, fixed-point bias, and contraction rate. This leads to new perspectives on existing methods, and also naturally yields novel algorithms for off-policy evaluation and control. We develop one such algorithm, C-trace, demonstrating that it is able to more efficiently make these trade-offs than existing methods in use, and that it can be scaled to yield state-of-the-art performance in large-scale environments.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton

Risheng Liu, Pan Mu, Xiaoming Yuan and
Shangzhi Zeng, Jin Zhang

Keywords Paper

Optimization - Non-convex

0

0

0

0

13:51

14/06/2020

Mnemonics Training: Multi-Class Incremental Learning Without Forgetting

Yaoyao Liu, Yuting Su, An-An Liu and
Bernt Schiele, Qianru Sun

Keywords Paper

incremental learning, continual learning, classification, recognition, transfer learning, representation learning, bilevel optimization, online learning, imagenet, cifar-100

0

0

0

0

5:01

12/07/2020

Evolving Machine Learning Algorithms From Scratch

Esteban Real, Chen Liang, David So, Quoc Le

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

15:01

15/11/2020

A Modular Cost Analysis for Probabilistic Programs

Martin Avanzini, Georg Moser, Michael Schaper

Keywords Paper

probabilistic programs, automation, average complexity, modularity

0

0

0

0

14:58

06/12/2020

BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization

Max Balandat, Brian Karrer, Daniel Jiang and
Samuel Daulton, Ben Letham, Andrew Wilson, Eytan Bakshy

Keywords Paper

Reinforcement Learning and Planning -> Model-Based RL; Reinforcement Learning and Planning -> Reinforcement Learning, Reinforcement Learning and Planning -> Multi-Agent RL

0

0

0

0

3:21

06/12/2021

Heuristic-Guided Reinforcement Learning

Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

8:39

22/11/2021

Meta-learning the Learning Trends Shared Across Tasks

Jathushan Rajasegaran, Salman Khan, Munawar Hayat and
Fahad Shahbaz Khan, Mubarak Shah

Keywords Paper

Meta-learning, Few-shot learning

0

0

0

0

2:38

06/12/2020

Joint Contrastive Learning with Infinite Possibilities

Qi Cai, Yu Wang, Yingwei Pan and
Ting Yao, Tao Mei

Keywords Paper

0

0

0

0

3:06

06/12/2021

On Calibration and Out-of-Domain Generalization

Yoav Wald, Amir Feder, Daniel Greenfeld, Uri Shalit

Keywords Paper

machine learning, domain adaptation, causality

0

0

0

0

11:00

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

06/12/2020

Classification with Valid and Adaptive Coverage

Yaniv Romano, Matteo Sesia, Emmanuel Candes

Keywords Paper

0

0

0

0

3:14

12/07/2020

Provably Efficient Model-based Policy Adaptation

Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:49

06/12/2021

Noether Networks: meta-learning useful conserved quantities

Ferran Alet, Dylan Doblar, Allan Zhou and
Josh Tenenbaum, Kenji Kawaguchi, Chelsea Finn

Keywords Paper

machine learning, vision, meta learning

0

0

0

0

11:18

02/02/2021

Generate Your Counterfactuals: Towards Controlled Counterfactual Generation for Text

Nishtha Madaan, Inkit Padhi, Naveen Panwar, Diptikalyan Saha

Keywords Paper

0

0

0

0

20:15

06/12/2021

GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training

Chen Zhu, Renkun Ni, Zheng Xu and
Kezhi Kong, W. Ronny Huang, Tom Goldstein

Keywords Paper

deep learning, transformers, vision

0

0

0

0

13:17

06/12/2021

Gone Fishing: Neural Active Learning with Fisher Embeddings

Jordan Ash, Surbhi Goel, Akshay Krishnamurthy, Sham Kakade

Keywords Paper

deep learning, machine learning, active learning

0

0

0

0

6:55

19/08/2021

The Successful Ingredients of Policy Gradient Algorithms

Sven Gronauer, Martin Gottwald, Klaus Diepold

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Reproducibility, Validation and Verification

0

0

0

0

14:10

06/12/2021

Differentiable Synthesis of Program Architectures

Guofeng Cui, He Zhu

Keywords Paper

optimization, machine learning, interpretability

0

0

0

0

13:31

02/02/2021

Active Bayesian Assessment of Black-Box Classifiers

Disi Ji, Robert L. Logan, Padhraic Smyth, Mark Steyvers

Keywords Paper

0

0

0

0

14:47

19/08/2021

Fine-grained Generalization Analysis of Structured Output Prediction

Waleed Mustafa, Yunwen Lei, Antoine Ledent, Marius Kloft

Keywords Paper

Machine Learning, Learning Theory, Structured Prediction

0

0

0

0

15:46

03/05/2021

Generalized Multimodal ELBO

Thomas Sutter, Imant Daunhawer, Julia E Vogt

Keywords Paper

self-supervised, generative learning, ELBO, VAE, Multimodal

0

0

0

0

5:15

03/05/2021

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner and
Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords Paper

reinforcement learning, goal reaching, Q-learning

0

0

0

0

4:49

18/07/2021

Learning Routines for Effective Off-Policy Reinforcement Learning

Edoardo Cetin, Oya Celiktutan

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:17

12/07/2020

Why Are Learned Indexes So Effective?

Paolo Ferragina, Fabrizio Lillo, Giorgio Vinciguerra

Keywords Paper

Applications - Other

0

0

0

0

13:22

12/07/2020

Randomized Block-Diagonal Preconditioning for Parallel Learning

Celestine Mendler-Dünner, Aurelien Lucchi

Keywords Paper

Optimization - Large Scale, Parallel and Distributed

0

0

0

0

12:57

02/02/2021

Meta-Learning Framework with Applications to Zero-Shot Time-Series Forecasting

Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio

Keywords Paper

0

0

0

0

17:41

12/07/2020

Handling the Positive-Definite Constraint in the Bayesian Learning Rule

Wu Lin, Mark Schmidt, Mohammad Emtiyaz Khan

Keywords Paper

Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

0

0

0

0

14:51

06/12/2021

Flexible Option Learning

Martin Klissarov, Doina Precup

Keywords Paper

reinforcement learning and planning

1

0

0

0

15:47

06/12/2021

Effective Meta-Regularization by Kernelized Proximal Regularization

Weisen Jiang, James Kwok, Yu Zhang

Keywords Paper

meta learning

0

0

0

0

7:32

26/04/2020

Unbiased Contrastive Divergence Algorithm for Training Energy-Based Latent Variable Models

Yixuan Qiu, Lingsong Zhang, Xiao Wang

Keywords Paper

energy model, restricted Boltzmann machine, contrastive divergence, unbiased Markov chain Monte Carlo, distribution coupling

0

0

0

0

4:34

18/07/2021

f-Domain Adversarial Learning: Theory and Algorithms

David Acuna, Guojun Zhang, Marc Law, Sanja Fidler

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:17

05/01/2021

MPRNet: Multi-Path Residual Network for Lightweight Image Super Resolution

Armin Mehri, Parichehr B. Ardakani, Angel D. Sappa

Keywords Paper

0

0

0

0

4:57

03/05/2021

Meta-Learning with Neural Tangent Kernels

Yufan Zhou, Zhenyi Wang, Jiayi Xian and
Changyou Chen, Jinhui Xu

Keywords Paper

neural tangent kernel, meta-learning

0

0

0

0

3:54

26/04/2020

Domain Adaptive Multibranch Networks

Róger Bermúdez-Chacón, Mathieu Salzmann, Pascal Fua

Keywords Paper

Domain Adaptation, Computer Vision

0

0

0

0

5:26

06/12/2021

Information-theoretic generalization bounds for black-box learning algorithms

Hrayr Harutyunyan, Maxim Raginsky, Greg Ver Steeg, Aram Galstyan

Keywords Paper

theory, deep learning

0

0

0

0

13:59

06/12/2021

Global-aware Beam Search for Neural Abstractive Summarization

Ye Ma, Zixun Lan, Lu Zong, Kaizhu Huang

Keywords Paper

0

0

0

0

10:22

06/12/2021

Meta-learning to Improve Pre-training

Aniruddh Raghu, Jonathan Lorraine, Simon Kornblith and
Matthew McDermott, David Duvenaud

Keywords Paper

deep learning, optimization, graph learning, meta learning

0

0

0

0

12:57

18/07/2021

SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data

Maud Lemercier, Cristopher Salvi, Thomas Cass and
Edwin V Bonilla, Theo Damoulas, Terry Lyons

Keywords Paper

Probabilistic Methods, Gaussian Processes and Bayesian non-parametrics

0

0

0

0

4:42

06/12/2020

Understanding Deep Architecture with Reasoning Layer

Xinshi Chen, Yufei Zhang, Christoph Reisinger, Le Song

Keywords Paper

0

0

0

0

3:28

06/12/2021

Differentiable Spline Approximations

Minsu Cho, Aditya Balu, Ameya Joshi and
Anjana Deva Prasad, Biswajit Khara, Soumik Sarkar, Baskar Ganapathysubramanian, Adarsh Krishnamurthy, Chinmay Hegde

Keywords Paper

optimization, machine learning

0

0

0

0

7:18