Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

12/07/2020

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun

Keywords: Reinforcement Learning - Deep RL

Abstract Paper Similar Papers

Abstract: Increasing the scale of reinforcement learning experiments has allowed researchers to achieve unprecedented results in both training sophisticated agents for video games, and in sim-to-real transfer for robotics. Typically such experiments rely on large distributed systems and require expensive hardware setups, limiting wider access to this exciting area of research. In this work we aim to solve this problem by optimizing the efficiency and resource utilization of reinforcement learning algorithms instead of relying on distributed computation. We present the "Sample Factory", a high-throughput training system optimized for a single-machine setting. Our architecture combines a highly efficient, asynchronous, GPU-based sampler with off-policy correction techniques, allowing us to achieve throughput higher than $10^5$ environment frames/second on non-trivial control problems in 3D without sacrificing sample efficiency. We extend Sample Factory to support self-play and population-based training and apply these techniques to train highly capable agents for a multiplayer first-person shooter game. Github: https://github.com/alex-petrenko/sample-factory

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

05/04/2021

Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models

Shang Wang, Peiming Yang, Yuxuan Zheng and
Xin Li, Gennady Pekhimenko

Keywords Paper

Theory -> Statistical Physics of Learning, Optimization -> Non-Convex Optimization

0

0

0

0

20:09

05/04/2021

Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models

Shang Wang, Peiming Yang, Yuxuan Zheng and
Xin Li, Gennady Pekhimenko

Keywords Paper

Theory -> Statistical Physics of Learning, Optimization -> Non-Convex Optimization

0

0

0

0

4:46

05/04/2021

Accelerating SLIDE Deep Learning on Modern CPUs: Vectorization, Quantizations, Memory Optimizations, and More

Shabnam Daghaghi, Nicholas Meisburger, Mengnan Zhao, Anshumali Shrivastava

Keywords Paper

0

0

0

0

5:31

05/04/2021

Accelerating SLIDE Deep Learning on Modern CPUs: Vectorization, Quantizations, Memory Optimizations, and More

Shabnam Daghaghi, Nicholas Meisburger, Mengnan Zhao, Anshumali Shrivastava

Keywords Paper

0

0

0

0

20:29

26/04/2020

Jelly Bean World: A Testbed for Never-Ending Learning

Emmanouil Antonios Platanios, Abulhair Saparov, Tom Mitchell

Keywords Paper

0

0

0

0

5:02

16/11/2020

Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control

Guangzhi Tang, Neelesh Kumar, Raymond Yoo, Konstantinos Michmizos

Keywords Paper

0

0

0

0

5:21

06/12/2020

Efficient Algorithms for Device Placement of DNN Graph Operators

Jakub Tarnawski, Amar Phanishayee, Nikhil Devanur and
Divya Mahajan, Fanny Nina Paravecino

Keywords Paper

0

0

1

0

3:20

26/04/2020

Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation

Byung Hoon Ahn, Prannoy Pilligundla, Amir Yazdanbakhsh, Hadi Esmaeilzadeh

Keywords Paper

Reinforcement Learning, Learning to Optimize, Combinatorial Optimization, Compilers, Code Optimization, Neural Networks, ML for Systems, Learning for Systems

0

0

0

0

4:55

06/12/2020

Kernel Methods Through the Roof: Handling Billions of Points Efficiently

Giacomo Meanti, Luigi Carratino, Lorenzo Rosasco, Alessandro Rudi

Keywords Paper

0

0

0

0

3:28

16/11/2020

Reproducible and Efficient Benchmarks for Hyperparameter Optimization of Neural Machine Translation Systems

Xuan Zhang, Kevin Duh

Keywords Paper

hyperparameter selection, neural systems, automatic optimization, nmt

0

0

0

0

11:38

06/12/2021

Differentiable Optimization of Generalized Nondecomposable Functions using Linear Programs

Zihang Meng, Lopamudra Mukherjee, Yichao Wu and
Vikas Singh, Sathya Narayanan Ravi

Keywords Paper

deep learning, optimization

0

0

0

0

13:21

06/12/2021

BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer

Haoping Bai, Meng Cao, Ping Huang, Jiulong Shan

Keywords Paper

deep learning, optimization

0

0

0

0

4:12

04/11/2020

A Tensor Compiler for Unified Machine Learning Prediction Serving

Supun Nakandala, Karla Saur, Gyeong-In Yu and
Konstantinos Karanasos, Carlo Curino, Markus Weimer, Matteo Interlandi

Keywords Paper

0

0

0

0

19:56

06/12/2021

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

Keywords Paper

theory, machine learning, bandits, kernel methods, active learning

0

0

0

0

7:41

15/11/2020

Dynamic Dispatch of Context-Sensitive Optimizations

Gabriel Poesia, Fernando Magno Quintão Pereira

Keywords Paper

Dynamic dispatch, Compiler, Context-sensitive optimization

0

0

0

0

9:10

16/11/2020

SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation

Xingyu Lin, Yufei Wang, Jake Olkin, David Held

Keywords Paper

0

0

0

0

5:06

13/04/2021

Faster & more reliable tuning of neural networks: Bayesian optimization with importance sampling

Setareh Ariafar, Zelda Mariet, Dana Brooks and
Jennifer Dy, Jasper Snoek

Keywords Paper

0

0

0

0

3:01

18/07/2021

Learn2Hop: Learned Optimization on Rough Landscapes

Amil Merchant, Luke Metz, Samuel Schoenholz, Ekin Cubuk

Keywords Paper

Applications, Others

0

0

0

0

5:19

05/04/2021

Pipelined Backpropagation at Scale: Training Large Models without Batches

Atli Kosson, Vitaliy Chiley, Abhi Venigalla and
Joel Hestness, Urs Koster

Keywords Paper

0

0

0

0

18:00

05/04/2021

Pipelined Backpropagation at Scale: Training Large Models without Batches

Atli Kosson, Vitaliy Chiley, Abhi Venigalla and
Joel Hestness, Urs Koster

Keywords Paper

0

0

0

0

4:14

18/07/2021

Megaverse: Simulating Embodied Agents at One Million Experiences per Second

Aleksei Petrenko, Erik Wijmans, Brennan Shacklett, Vladlen Koltun

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:09

11/08/2020

A computational approach to packet classification

Alon Rashelbach, Ori Rottenstreich, Mark Silberstein

Keywords Paper

Neural Networks, Virtual Switches, Packet Classification

0

0

0

0

16:56

06/12/2021

Distributed Deep Learning In Open Collaborations

Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin and
Lucile Saulnier, quentin lhoest, Anton Sinitsin, Dmitry Popov, Dmitry V. Pyrkin, Maxim Kashirin, Alexander Borzunov, Albert Villanova del Moral, Denis Mazur, Ilia Kobelev, Yacine Jernite, Thomas Wolf, Gennady Pekhimenko

Keywords Paper

deep learning, machine learning, generative model, transfer learning

0

0

0

0

8:48

16/11/2020

Learning Dexterous Manipulation from Suboptimal Experts

Rae Jeong, Jost Tobias Springenberg, Jackie Kay and
Dan Zheng, Alexandre Galashov, Nicolas Heess, Francesco Nori

Keywords Paper

0

0

0

0

5:03

04/11/2020

Rammer: Enabling Holistic Deep Learning Compiler Optimizations with rTasks

Lingxiao Ma, Zhiqiang Xie, Zhi Yang and
Jilong Xue, Youshan Miao, Wei Cui, Wenxiang Hu, Fan Yang, Lintao Zhang, Lidong Zhou

Keywords Paper

0

0

0

0

16:30

02/02/2021

Scalable Graph Networks for Particle Simulations

Karolis Martinkus, Aurelien Lucchi, Nathanaël Perraudin

Keywords Paper

0

0

0

0

18:07

03/05/2021

Practical Massively Parallel Monte-Carlo Tree Search Applied to Molecular Design

Xiufeng Yang, Tanuj Aasawat, Kazuki Yoshizoe

Keywords Paper

molecular design, Upper Confidence bound applied to Trees (UCT), parallel Monte Carlo Tree Search (MCTS)

0

0

0

0

4:59

06/12/2020

Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search

Houwen Peng, Hao Du, Hongyuan Yu and
QI LI, Jing Liao, Jianlong Fu

Keywords Paper

0

0

0

0

3:12

12/07/2020

Multi-Precision Policy Enforced Training (MuPPET) : A Precision-Switching Strategy for Quantised Fixed-Point Training of CNNs

Aditya Rajagopal, Diederik Vink, Stylianos Venieris, Christos-Savvas Bouganis

Keywords Paper

Applications - Other

0

0

0

0

15:30

05/04/2021

Wavelet: Efficient DNN Training with Tick-Tock Scheduling

Guanhua Wang, Kehan Wang, Kenan Jiang and
XIANGJUN LI, Ion Stoica

Keywords Paper

0

0

0

0

17:49

05/04/2021

Wavelet: Efficient DNN Training with Tick-Tock Scheduling

Guanhua Wang, Kehan Wang, Kenan Jiang and
XIANGJUN LI, Ion Stoica

Keywords Paper

0

0

0

0

5:22

16/11/2020

Learning a Decentralized Multi-Arm Motion Planner

Huy Ha, Jingxi Xu, Shuran Song

Keywords Paper

0

0

0

0

3:41

06/12/2020

Probabilistic Active Meta-Learning

Jean Kaddour, Steindor Saemundsson, Marc Deisenroth

Keywords Paper

0

0

0

0

3:17

18/07/2021

Large-Scale Multi-Agent Deep FBSDEs

Tianrong Chen, Ziyi Wang, Ioannis Exarchos, Evangelos Theodorou

Keywords Paper

Theory, Game Theory and Computational Economics

0

0

0

0

5:09

03/05/2021

Large Batch Simulation for Deep Reinforcement Learning

Brennan Shacklett, Erik Wijmans, Aleksei Petrenko and
Manolis Savva, Dhruv Batra, Vladlen Koltun, Kayvon Fatahalian

Keywords Paper

reinforcement learning, simulation

0

0

0

0

5:29

19/08/2021

Hardware-Aware Neural Architecture Search: Survey and Taxonomy

Hadjer Benmeziane, Kaoutar El Maghraoui, Hamza Ouarnoughi and
Smail Niar, Martin Wistuba, Naigang Wang

Keywords Paper

Machine learning, General, General, General

0

0

0

0

14:12

02/02/2021

CMAX++ : Leveraging Experience in Planning and Execution using Inaccurate Models

Anirudh Vemula, J. Andrew Bagnell, Maxim Likhachev

Keywords Paper

0

0

0

0

15:11

06/12/2021

L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization

Jiaqi Gu, Hanqing Zhu, Chenghao Feng and
Zixuan Jiang, Ray Chen, David Pan

Keywords Paper

deep learning, optimization

0

0

0

0

12:07

06/12/2021

On Joint Learning for Solving Placement and Routing in Chip Design

Ruoyu Cheng, Junchi Yan

Keywords Paper

optimization, reinforcement learning and planning, machine learning, graph learning

0

0

0

0

12:47

14/06/2020

RoboTHOR: An Open Simulation-to-Real Embodied AI Platform

Matt Deitke, Winson Han, Alvaro Herrasti and
Aniruddha Kembhavi, Eric Kolve, Roozbeh Mottaghi, Jordi Salvador, Dustin Schwenk, Eli VanderBilt, Matthew Wallingford, Luca Weihs, Mark Yatskar, Ali Farhadi

Keywords Paper

embodied ai, visual navigation, sim to real transfer, reinforcement learning

0

0

0

0

1:01