16/11/2020

SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving

Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, IMAN FADAKAR, Zheng Chen, Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Zhengbang Zhu, Yihan Ni, Nhat Nguyen, Mohamed Elsayed, Haitham Ammar, Alexander Cowen-Rivers Huawei R&amp, D UK, Sanjeevan Ahilan, Zheng Tian, Daniel Palenicek, Kasra Rezaee, Peyman Yadmellat, Kun Shao, dong chen, Baokuan Zhang, Hongbo Zhang, Jianye Hao, Wulong Liu, Jun Wang

Keywords:

Abstract: Interaction is fundamental in autonomous driving (AD). Despite more than a decade of intensive R&D in AD, how to dynamically interact with diverse road users in various contexts still remains unsolved. Multi-agent learning has recently seen big breakthroughs and has much to offer towards solving realistic interaction in AD. However, to realize this potential we need multi-agent AD simulation of realistic interaction. To break this apparent chicken-and-egg circularity, we built an AD simulation platform called SMARTS (Scalable Multi-Agent Rl Training School), which is designed to accumulate behavior models of road users towards increasingly realistic and diverse interaction that in turn enables deeper and broader multi-agent research on interaction. In this paper, we describe the design goals of SMARTS, explain its key architectural ideas, illustrate its use for multi-agent research through experiments on concrete interaction scenarios, and introduce a set of benchmarks and metrics. As an open-source, industrial-strength platform, the future of SMARTS lies in its growth along with the multi-agent research it enables in the years to come.

 0
 0
 0
 0
This is an embedded video. Talk and the respective paper are published at CoRL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment
no comments yet
code of conduct: tbd Characters remaining: 140

Similar Papers