12/07/2020

Striving for simplicity and performance in off-policy DRL: Output Normalization and Non-Uniform Sampling

Che Wang, Yanqiu Wu, Quan Vuong, Keith Ross

Keywords: Reinforcement Learning - Deep RL

Abstract: We aim to develop off-policy DRL algorithms that not only exceed state-of-the-art performance but are also simple and minimalistic. For standard continuous control benchmarks, Soft Actor Critic (SAC), which employs entropy maximization, currently provides state-of-the-art performance. We first demonstrate that the entropy term in SAC addresses action saturation due to the bounded nature of the action spaces. With this insight, we propose a streamlined algorithm with a simple normalization scheme or with inverted gradients. We show that both approaches can match SAC's sample-efficiency performance without the need for entropy maximization. We then propose a simple non-uniform sampling method for selecting transitions from the replay buffer during training. Extensive experimental results demonstrate that our proposed sampling scheme leads to state-of-the-art sample efficiency on challenging continuous control tasks. We combine all of our findings into one simple algorithm, which we call Streamlined Off Policy with Emphasizing Recent Experience, for which we provide robust public-domain code.
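
The abstract mentions two mechanisms: a simple normalization of the policy output (to avoid action saturation without an entropy bonus) and non-uniform replay sampling that emphasizes recent experience. Below is a minimal Python sketch of how such components could look; the function names, the normalization rule, and the constants (eta, c_min, batch_size) are illustrative assumptions for this page, not the authors' reference implementation.

import numpy as np

def normalize_policy_output(mu, threshold=1.0):
    """Rescale the pre-squashing policy outputs when their average magnitude
    grows large, so tanh saturation is avoided without an entropy term.
    The exact rule used in the paper may differ (assumption)."""
    g = np.mean(np.abs(mu))
    return mu / g if g > threshold else mu

def recent_experience_indices(buffer_size, num_updates, update_idx,
                              eta=0.996, c_min=5000, batch_size=256,
                              rng=None):
    """Non-uniform replay sampling that emphasizes recent transitions:
    the k-th update samples uniformly from only the most recent c_k
    transitions, with c_k shrinking geometrically toward c_min.
    Constants are illustrative defaults, not taken from the paper."""
    rng = rng or np.random.default_rng()
    c_k = max(int(buffer_size * eta ** (update_idx * 1000 / num_updates)), c_min)
    c_k = min(c_k, buffer_size)
    # Indices 0 .. buffer_size-1, with buffer_size-1 the newest transition.
    return rng.integers(buffer_size - c_k, buffer_size, size=batch_size)

In a training loop, one would call recent_experience_indices once per gradient update and fetch the corresponding transitions from the replay buffer, so early updates in a phase see the whole buffer while later updates concentrate on recent data.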
