Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning

18/07/2021

Arrasy Rahman, Niklas Höpner, Filippos Christianos, Stefano V. Albrecht

Keywords: Theory, Learning Theory, Applications, Privacy, Anonymity, and Security, Reinforcement Learning and Planning

Abstract: Ad hoc teamwork is the challenging problem of designing an autonomous agent which can adapt quickly to collaborate with teammates without prior coordination mechanisms, including joint training. Prior work in this area has focused on closed teams in which the number of agents is fixed. In this work, we consider open teams by allowing agents with different fixed policies to enter and leave the environment without prior notification. Our solution builds on graph neural networks to learn agent models and joint-action value models under varying team compositions. We contribute a novel action-value computation that integrates the agent model and joint-action value model to produce action-value estimates. We empirically demonstrate that our approach successfully models the effects other agents have on the learner, leading to policies that robustly adapt to dynamic team compositions and significantly outperform several alternative methods.
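
The abstract describes combining an agent model (which predicts teammates' actions) with a joint-action value model to obtain action values for the learner. One simple way to read that idea is as an expectation: weight each joint-action value by the predicted probability of the teammates' actions and sum the teammates out. The sketch below illustrates only that marginalization on small tables, assuming independent teammate predictions; the function name, the table layout, and the use of plain NumPy arrays in place of graph neural networks are assumptions for illustration, not the authors' implementation.

```python
import numpy as np


def learner_action_values(joint_q, teammate_probs):
    """Marginalize joint-action values over predicted teammate actions.

    joint_q: array with one axis per agent; axis 0 indexes the learner's
        action and each later axis indexes one teammate's action
        (hypothetical layout chosen for this sketch).
    teammate_probs: one probability vector per teammate, in axis order.
    Returns a 1-D array with one expected value per learner action.
    """
    q = np.asarray(joint_q, dtype=float)
    if len(teammate_probs) != q.ndim - 1:
        raise ValueError("need exactly one probability vector per teammate axis")
    # Sum out teammates from the last axis to the first, so the remaining
    # axes keep their meaning. Any number of teammates works, which is the
    # open-team setting: the team size may differ from call to call.
    for probs in reversed(teammate_probs):
        p = np.asarray(probs, dtype=float)
        q = np.tensordot(q, p, axes=([q.ndim - 1], [0]))
    return q


# One teammate: rows are learner actions, columns are teammate actions.
joint_q = [[1.0, 0.0],
           [0.0, 2.0]]
predicted = [[0.25, 0.75]]  # agent-model output for the single teammate
values = learner_action_values(joint_q, predicted)
print(values)                  # [0.25 1.5 ]
print(int(np.argmax(values)))  # 1
```

Because the loop only depends on the number of axes, the same function handles a team that grows or shrinks between time steps, provided the joint-action table is rebuilt for the current team.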

The talk and the respective paper are published at the ICML 2021 virtual conference.

Similar Papers

16/11/2020

The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning

Jan Blumenkamp, Amanda Prorok

4:51

26/08/2020

Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions

Giorgia Ramponi, Amarildo Likmeta, Alberto Maria Metelli, Andrea Tirinzoni, Marcello Restelli

9:41

03/05/2021

Learning Safe Multi-agent Control with Decentralized Neural Barrier Certificates

Zengyi Qin, Kaiqing Zhang, Yuxiao Chen, Jingkai Chen, Chuchu Fan

Keywords: reinforcement learning, control barrier function, safe, Multi-agent

5:45

04/07/2020

Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks

Yiping Song, Zequn Liu, Wei Bi, Rui Yan, Ming Zhang

Keywords: Few-shot Tasks, open-domain systems, generative models, meta-learning framework

11:43

06/12/2020

Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

Julien Roy, Paul Barde, Félix G Harvey, Derek Nowrouzezahrai, Chris Pal

3:21

26/04/2020

Strategies for Pre-training Graph Neural Networks

Weihua Hu, Bowen Liu, Joseph Gomes, Marinka Zitnik, Percy Liang, Vijay Pande, Jure Leskovec

Keywords: Pre-training, Transfer learning, Graph Neural Networks

4:56

16/11/2020

Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous

Rose Wang, J. Chase Kew, Dennis Lee, Tsang-Wei Lee, Tingnan Zhang, Brian Ichter, Jie Tan, Aleksandra Faust

4:29

12/07/2020

Multi-Agent Routing Value Iteration Network

Quinlan Sykora, Mengye Ren, Raquel Urtasun

Keywords: Deep Learning - General

15:33

06/12/2021

Towards Open-World Feature Extrapolation: An Inductive Graph Learning Approach

Qitian Wu, Chenxiao Yang, Junchi Yan

Keywords: deep learning, machine learning, graph learning, representation learning

12:51

06/12/2021

Meta-Learning via Learning with Distributed Memory

Sudarshan Babu, Pedro Savarese, Michael Maire

Keywords: deep learning, optimization, machine learning, vision, meta learning, online learning

15:04

06/12/2021

Graph Adversarial Self-Supervised Learning

Longqi Yang, Liangliang Zhang, Wenjing Yang

Keywords: machine learning, adversarial robustness and security, self-supervised learning, graph learning, representation learning

4:34

18/07/2021

Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

Filippos Christianos, Georgios Papoudakis, Arrasy Rahman, Stefano V. Albrecht

Keywords: Reinforcement Learning and Planning

4:44

03/05/2021

Model-Based Visual Planning with Self-Supervised Functional Distances

Stephen Tian, Suraj Nair, Frederik Ebert, Sudeep Dasari, Ben Eysenbach, Chelsea Finn, Sergey Levine

Keywords: reinforcement learning, distance learning, model learning, robotics, planning

9:11

25/07/2020

TAGNN: Target attentive graph neural networks for session-based recommendation

Feng Yu, Yanqiao Zhu, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

Keywords: session-based recommendation, target attention, graph neural networks

7:31

19/10/2020

Learning to profile: User meta-profile network for few-shot learning

Hao Gong, Qifang Zhao, Tianyu Li, Derek Cho, DuyKhuong Nguyen

Keywords: multi-task learning, multi-modal model, representation learning, meta-learning

12:10

06/12/2020

Learning Dynamic Belief Graphs to Generalize on Text-Based Games

Ashutosh Adhikari, Eric Yuan, Marc-Alexandre Côté, Mikuláš Zelinka, Marc-Antoine Rondeau, Romain Laroche, Pascal Poupart, Jian Tang, Adam Trischler, Will Hamilton

3:03

12/07/2020

Hallucinative Topological Memory for Zero-Shot Visual Planning

Thanard Kurutach, Kara Liu, Aviv Tamar, Pieter Abbeel, Christine Tung

Keywords: Planning, Control, and Multiagent Learning

14:54

03/05/2021

HyperDynamics: Meta-Learning Object and Agent Dynamics with Hypernetworks

Zhou Xian, Shamit Lal, Hsiao-Yu Tung, Anthony Platanios, Katerina Fragkiadaki

5:46

18/07/2021

AutoAttend: Automated Attention Representation Search

Chaoyu Guan, Xin Wang, Wenwu Zhu

Keywords: Algorithms, AutoML

4:49

14/06/2020

Weakly Supervised Visual Semantic Parsing

Alireza Zareian, Svebor Karaman, Shih-Fu Chang

Keywords: scene understanding, scene graph generation, weakly supervised learning, semantic parsing, graph neural networks, visual reasoning

5:00

03/05/2021

Model-Based Offline Planning

Arthur Argenson, Gabe Dulac-Arnold

Keywords: reinforcement learning, model predictive control, model-based control, off-line reinforcement learning, robotics, model-based reinforcement learning

5:15

22/09/2020

Contextual meta-bandit for recommender systems selection

Marlesson R. O. Santana, Luckeciano C. Melo, Fernando H. F. Camargo, Bruno Brandão, Anderson Soares, Renan M. Oliveira, Sandor Caetano

Keywords: contextual bandits, hierarchical recommender systems, options framework, reinforcement learning

1:48

06/12/2020

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL

Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn

3:24

06/12/2021

Emergent Discrete Communication in Semantic Spaces

Mycal Tucker, Huao Li, Siddharth Agrawal, Dana Hughes, Katia Sycara, Michael Lewis, Julie A Shah

Keywords: reinforcement learning and planning, language

14:56

02/02/2021

User Driven Model Adjustment via Boolean Rule Explanations

Elizabeth M. Daly, Massimiliano Mattetti, Öznur Alkan, Rahul Nair

20:38

26/04/2020

Continual Learning with Adaptive Weights (CLAW)

Tameem Adel, Han Zhao, Richard E. Turner

Keywords: Continual learning

4:58

22/11/2021

Self-Supervised Learning in Multi-Task Graphs through Iterative Consensus Shift

Emanuela Haller, Elena Burceanu, Marius Leordeanu

Keywords: multi-task graph, self-supervised, consensus, multi-task agreement, selection ensemble, domain adaptation, domain generalization, distribution shift, experts

3:03

12/07/2020

Goal-Aware Prediction: Learning to Model What Matters

Suraj Nair, Silvio Savarese, Chelsea Finn

Keywords: Reinforcement Learning - Deep RL

11:16

02/02/2021

Semi-supervised Sequence Classification through Change Point Detection

Nauman Ahad, Mark A. Davenport

14:21

06/12/2021

The Sensory Neuron as a Transformer: Permutation-Invariant Neural Networks for Reinforcement Learning

Yujin Tang, David Ha

Keywords: deep learning, optimization, reinforcement learning and planning, robustness, transformers, generative model, meta learning

13:19

25/07/2020

Incorporating scenario knowledge into a unified fine-tuning architecture for event representation

Jianming Zheng, Fei Cai, Honghui Chen

Keywords: scenario knowledge, pre-training, fine-tuning, event representation

15:37

16/11/2020

CLOUD: Contrastive Learning of Unsupervised Dynamics

Jianren Wang, Yujie Lu, Hang Zhao

5:05

06/12/2021

Environment Generation for Zero-Shot Compositional Reinforcement Learning

Izzeddin Gur, Natasha Jaques, Yingjie Miao, Jongwook Choi, Manoj Tiwari, Honglak Lee, Aleksandra Faust

Keywords: reinforcement learning and planning, robustness, graph learning

8:40

06/12/2020

Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes

Mengdi Xu, Wenhao Ding, Jiacheng Zhu, Zuxin Liu, Baiming Chen, Ding Zhao

3:21

06/12/2021

Dynamic population-based meta-learning for multi-agent communication with natural language

Abhinav Gupta, Marc Lanctot, Angeliki Lazaridou

Keywords: reinforcement learning and planning, robustness, meta learning

14:43

18/07/2021

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Dong Ki Kim, Miao Liu, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan How

Keywords: Reinforcement Learning and Planning, Multi-Agent RL, Algorithms, Representation Learning, Algorithms, Relational Learning

5:20

02/02/2021

Meta-Learning Framework with Applications to Zero-Shot Time-Series Forecasting

Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio

17:41

06/12/2020

Mitigating Forgetting in Online Continual Learning via Instance-Aware Parameterization

Hung-Jen Chen, An-Chieh Cheng, Da-Cheng Juan, Wei Wei, Min Sun

3:23

03/05/2021

FedBE: Making Bayesian Model Ensemble Applicable to Federated Learning

Hong-You Chen, Wei-Lun Chao

5:06

06/12/2021

Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization

Zhenghao Peng, Quanyi Li, Ka Ming Hui, Chunxiao Liu, Bolei Zhou

Keywords: optimization, reinforcement learning and planning

12:08