RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem

06/12/2021

Eric Liang, Zhanghao Wu, Michael Luo, Sven Mika, Joseph Gonzalez, Ion Stoica

Keywords: reinforcement learning and planning

Abstract: Researchers and practitioners in the field of reinforcement learning (RL) frequently leverage parallel computation, which has led to a plethora of new algorithms and systems in the last few years. In this paper, we re-examine the challenges posed by distributed RL and try to view it through the lens of an old idea: distributed dataflow. We show that viewing RL as a dataflow problem leads to highly composable and performant implementations. We propose RLlib Flow, a hybrid actor-dataflow programming model for distributed RL, and validate its practicality by porting the full suite of algorithms in RLlib, a widely adopted distributed RL library. Concretely, RLlib Flow provides 2-9$\times$ code savings in real production code and enables the composition of multi-agent algorithms not possible by end users before. The open-source code is available as part of RLlib at https://github.com/ray-project/ray/tree/master/rllib.
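
To make the dataflow view in the abstract concrete, the following is a minimal sketch in plain Python generators. It is not the RLlib Flow API: every name in it (`rollouts`, `concat_batches`, `for_each`, `train_one_step`, `build_plan`) is hypothetical, the "workers" are simulated with random numbers, and the "training step" only reports batch statistics. It only illustrates how a training loop might be written as a chain of composable stream operators.

```python
import random
from typing import Callable, Dict, Iterable, Iterator, List

# Toy stand-in for a sample batch: a list of per-step rewards (assumption).
Batch = List[float]


def rollouts(num_workers: int, batch_size: int, seed: int = 0) -> Iterator[Batch]:
    """Source operator: round-robin over simulated rollout workers (hypothetical)."""
    rngs = [random.Random(seed + i) for i in range(num_workers)]
    while True:
        for rng in rngs:
            yield [rng.random() for _ in range(batch_size)]


def concat_batches(stream: Iterable[Batch], min_size: int) -> Iterator[Batch]:
    """Transform operator: buffer small batches until at least min_size items exist."""
    buffer: Batch = []
    for batch in stream:
        buffer.extend(batch)
        if len(buffer) >= min_size:
            yield buffer
            buffer = []


def for_each(stream: Iterable, fn: Callable) -> Iterator:
    """Transform operator: apply fn to every item flowing through the stream."""
    for item in stream:
        yield fn(item)


def train_one_step(batch: Batch) -> Dict[str, float]:
    """Stand-in for a learner update; it only summarizes the batch it received."""
    return {"batch_size": len(batch), "mean_reward": sum(batch) / len(batch)}


def build_plan(num_workers: int = 2, batch_size: int = 4, train_batch_size: int = 16):
    """Compose the operators into one lazy dataflow; nothing runs until iterated."""
    samples = rollouts(num_workers, batch_size)
    train_batches = concat_batches(samples, train_batch_size)
    return for_each(train_batches, train_one_step)


if __name__ == "__main__":
    plan = build_plan()
    # Pull three training results through the pipeline.
    for _, result in zip(range(3), plan):
        print(result)
```

Because each operator only consumes and produces an iterator, swapping the batching step or adding a second consumer changes one line of `build_plan` rather than the loop body, which is the kind of composability the abstract refers to.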

The talk and the accompanying paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

06/12/2021

PLUR: A Unifying, Graph-Based View of Program Learning, Understanding, and Repair

Zimin Chen, Vincent J Hellendoorn, Pascal Lamblin, Petros Maniatis, Pierre-Antoine Manzagol, Daniel Tarlow, Subhodeep Moitra

Keywords: deep learning, machine learning, transformers, graph learning

5:59

06/12/2021

Learning to Combine Per-Example Solutions for Neural Program Synthesis

Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

Keywords: deep learning

7:54

06/12/2020

Joint Contrastive Learning with Infinite Possibilities

Qi Cai, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei

3:06

02/02/2021

Parameterizing Branch-and-Bound Search Trees to Learn Branching Policies

Giulia Zarpellon, Jason Jo, Andrea Lodi, Yoshua Bengio

17:58

15/06/2020

AutoSys: The Design and Operation of Learning-Augmented Systems

Chieh-Jan Mike Liang, Hui Xue, Mao Yang, Lidong Zhou, Lifei Zhu, Zhao Lucis Li, Zibo Wang, Qi Chen, Quanlu Zhang, Chuanjie Liu, Wenjun Dai

22:48

12/07/2020

Evolving Machine Learning Algorithms From Scratch

Esteban Real, Chen Liang, David So, Quoc Le

Keywords: Transfer, Multitask and Meta-learning

15:01

06/12/2021

Integrating Tree Path in Transformer for Code Representation

Han Peng, Ge Li, Wenhan Wang, YunFei Zhao, Zhi Jin

Keywords: machine learning, transformers

4:42

23/06/2021

DreamCoder: Bootstrapping Inductive Program Synthesis with Wake-Sleep Library Learning

Kevin Ellis, Catherine Wong, Maxwell Nye, Mathias Sablé-Meyer, Lucas Morales, Luke Hewitt, Luc Cary, Armando Solar-Lezama, Joshua B. Tenenbaum

Keywords: synthesis, neural, learning, refactoring

25:24

06/12/2020

Fast geometric learning with symbolic matrices

Jean Feydy, Joan Glaunès, Benjamin Charlier, Michael Bronstein

3:16

06/12/2020

BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization

Max Balandat, Brian Karrer, Daniel Jiang, Samuel Daulton, Ben Letham, Andrew Wilson, Eytan Bakshy

Keywords: Reinforcement Learning and Planning -> Model-Based RL; Reinforcement Learning and Planning -> Reinforcement Learning; Reinforcement Learning and Planning -> Multi-Agent RL

3:21

19/10/2020

MetaTPOT: Enhancing a tree-based pipeline optimization tool using meta-learning

Doron Laadan, Roman Vainshtein, Yarden Curiel, Gilad Katz, Lior Rokach

Keywords: tpot, meta-learning, genetic programming (gp), automl

6:41

18/07/2021

TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models

Zhuohan Li, Siyuan Zhuang, Shiyuan Guo, Danyang Zhuo, Hao Zhang, Dawn Song, Ion Stoica

Keywords: Algorithms, Large Scale Learning

5:12

06/12/2020

A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees

Haoran Zhu, Pavankumar Murali, Dzung Phan, Lam Nguyen, Jayant Kalagnanam

3:12

19/01/2020

Disentanglement in Nested-Parallel Programs

Sam Westrick, Rohan Yadav, Matthew Fluet, Umut A. Acar

Keywords: memory management, disentanglement, parallel computing, functional programming, data race

21:33

03/05/2021

Scalable Learning and MAP Inference for Nonsymmetric Determinantal Point Processes

Mike Gartrell, Insu Han, Elvis Dohmatob, Jennifer Gillenwater, Victor-Emmanuel Brunel

Keywords: submodular optimization, determinantal point processes, unsupervised learning, representation learning

15:15

16/11/2020

Two are Better than One: Joint Entity and Relation Extraction with Table-Sequence Encoders

Jue Wang, Wei Lu

Keywords: named recognition, relation extraction, table-filling problem, representation process

11:33

11/08/2020

A computational approach to packet classification

Alon Rashelbach, Ori Rottenstreich, Mark Silberstein

Keywords: Neural Networks, Virtual Switches, Packet Classification

16:56

18/07/2021

Leveraging Language to Learn Program Abstractions and Search Heuristics

Catherine Wong, Kevin Ellis, Josh Tenenbaum, Jacob Andreas

Keywords: Algorithms, Multimodal Learning

5:18

13/04/2021

Tensor networks for probabilistic sequence modeling

Jacob Miller, Guillaume Rabusseau, John Terilla

3:01

16/11/2020

AxCell: Automatic Extraction of Results from Machine Learning Papers

Marcin Kardas, Piotr Czapla, Pontus Stenetorp, Sebastian Ruder, Sebastian Riedel, Ross Taylor, Robert Stojnic

Keywords: machine learning, table subtask, extraction, results extraction

11:52

15/06/2020

Verifying concurrent search structure templates

Siddharth Krishna, Nisarg Patel, Dennis Shasha, Thomas Wies

Keywords: separation logic, concurrent data structures, flow framework, template-based verification

14:56

04/07/2020

Active Learning for Coreference Resolution using Discrete Annotation

Belinda Z. Li, Gabriel Stanovsky, Luke Zettlemoyer

Keywords: Coreference Resolution, active resolution, Active Learning, Discrete Annotation

6:36

18/07/2021

AutoAttend: Automated Attention Representation Search

Chaoyu Guan, Xin Wang, Wenwu Zhu

Keywords: Algorithms, AutoML

4:49

06/12/2021

Effective Meta-Regularization by Kernelized Proximal Regularization

Weisen Jiang, James Kwok, Yu Zhang

Keywords: meta learning

7:32

04/07/2020

Torch-Struct: Deep Structured Prediction Library

Alexander Rush

Keywords: structured prediction, NLP, Torch-Struct, Deep Library

11:00

23/06/2021

Satisfiability Modulo Ordering Consistency Theory for Multi-threaded Program Verification

Fei He, Zhihang Sun, Hongyu Fan

Keywords: Program verification, satisfiability modulo theory, memory consistency model, concurrency

16:32

03/05/2021

Generalized Multimodal ELBO

Thomas Sutter, Imant Daunhawer, Julia E Vogt

Keywords: self-supervised, generative learning, ELBO, VAE, Multimodal

5:15

06/12/2021

Row-clustering of a Point Process-valued Matrix

Lihao Yin, Ganggang Xu, Huiyan Sang, Yongtao Guan

Keywords: machine learning, clustering

14:30

26/08/2020

Adaptive Trade-Offs in Off-Policy Learning

Mark Rowland, Will Dabney, Remi Munos

11:29

15/11/2020

A Modular Cost Analysis for Probabilistic Programs

Martin Avanzini, Georg Moser, Michael Schaper

Keywords: probabilistic programs, automation, average complexity, modularity

14:58

12/08/2020

Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning

Ahmed Salem, Apratim Bhattacharya, Michael Backes, Mario Fritz, Yang Zhang

13:05

17/08/2020

Code replicability in computer graphics

Nicolas Bonneel, David Coeurjolly, Julie Digne, Nicolas Mellado

Keywords: replicability, open source, siggraph, code review, reproducibility

17:30

12/07/2020

Nested Subspace Arrangement for Representation of Relational Data

Nozomi Hata, Shizuo Kaji, Akihiro Yoshida, Katsuki Fujisawa

Keywords: Representation Learning

13:09

03/05/2021

BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration

Augustus Odena, Kensen Shi, David Bieber, Rishabh Singh, Charles Sutton, Hanjun Dai

Keywords: Program Synthesis

10:26

06/12/2020

Accelerating Reinforcement Learning through GPU Atari Emulation

Steven Dalton, Iuri Frosio

3:12

06/12/2021

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

Keywords: theory, machine learning, bandits, kernel methods, active learning

7:41

26/04/2020

Scalable Neural Methods for Reasoning With a Symbolic Knowledge Base

William W. Cohen, Haitian Sun, R. Alex Hofer, Matthew Siegler

Keywords: question-answering, knowledge base completion, neuro-symbolic reasoning, multihop reasoning

5:05

06/12/2021

Symbolic Regression via Deep Reinforcement Learning Enhanced Genetic Programming Seeding

Terrell Mundhenk, Mikel Landajuela, Ruben Glatt, Claudio P Santiago, Daniel Faissol, Brenden K Petersen

Keywords: optimization, reinforcement learning and planning

14:50

06/12/2020

Efficient Algorithms for Device Placement of DNN Graph Operators

Jakub Tarnawski, Amar Phanishayee, Nikhil Devanur, Divya Mahajan, Fanny Nina Paravecino

3:20

03/05/2021

Reducing the Computational Cost of Deep Generative Models with Binary Neural Networks

Thomas Bird, Friso Kingma, David Barber

Keywords: generative, binary, optimization, compression

5:14