26/10/2020

Reinforcement Learning for Zone Based Multiagent Pathfinding under Uncertainty

Jiajing Ling, Tarun Gupta, Akshat Kumar

Keywords: multiagent path finding, multiagent reinforcement learning, decision making under uncertainty, decentralized partially observable MDPs

Abstract: We present a new framework for the problem of multiple agents finding paths from their respective source nodes to destination nodes in a graph (also called MAPF). Most existing approaches assume that all agents move at a fixed speed and that a single node accommodates only a single agent. Motivated by emerging applications of autonomous vehicles such as drone traffic management, we present zone-based path finding (or ZBPF), where agents move among zones (e.g., geofenced airblocks for drones) and moving between zones takes an uncertain amount of travel time. Furthermore, each zone can accommodate multiple agents (up to its capacity). We also develop a 3D simulator for ZBPF in the ML-Agents toolkit of the Unity3D game engine, which provides a clean interface between the simulation environment and learning algorithms. We develop a novel formulation of the ZBPF problem using difference-of-convex functions (DC) programming. The resulting approach uses samples from the simulator to optimize agent policies. We also present a multiagent credit assignment scheme that helps our learning approach converge faster. Empirical results on a number of 2D and 3D instances show that our approach effectively minimizes congestion in zones while ensuring agents reach their final destinations.
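To make the ZBPF setting described in the abstract concrete, the sketch below shows one possible way to represent a problem instance: zones as graph nodes with capacities, per-agent source and destination zones, and stochastic travel times between zones. This is a minimal illustration only; all names (`Zone`, `ZBPFInstance`, `sample_travel_time`) and the travel-time distribution are hypothetical assumptions, not the authors' implementation or the Unity simulator interface.

```python
# Illustrative sketch of a ZBPF instance: zones with capacities, agent
# source/destination zones, and uncertain travel times. Hypothetical names.
import random
from dataclasses import dataclass, field

@dataclass
class Zone:
    zone_id: int
    capacity: int                                   # max agents the zone can hold
    neighbors: list = field(default_factory=list)   # ids of adjacent zones

@dataclass
class ZBPFInstance:
    zones: dict          # zone_id -> Zone
    sources: dict        # agent_id -> start zone id
    destinations: dict   # agent_id -> goal zone id

    def sample_travel_time(self, src: int, dst: int) -> int:
        # Travel time between zones is uncertain; a simple geometric-style
        # distribution is assumed here purely for illustration.
        t = 1
        while random.random() < 0.3:
            t += 1
        return t

# Tiny example: two agents cross through a shared middle zone of capacity 2.
zones = {
    0: Zone(0, capacity=1, neighbors=[1]),
    1: Zone(1, capacity=2, neighbors=[0, 2]),
    2: Zone(2, capacity=1, neighbors=[1]),
}
instance = ZBPFInstance(zones=zones,
                        sources={"a1": 0, "a2": 2},
                        destinations={"a1": 2, "a2": 0})
print(instance.sample_travel_time(0, 1))
```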

The talk and the corresponding paper were presented at the ICAPS 2020 virtual conference.

