High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards

Abstract: Robots that can learn in the physical world will be important to enable robots to escape their stiff and pre-programmed movements. For dynamic high-acceleration tasks, such as juggling, learning in the real-world is particularly challenging as one must push the limits of the robot and its actuation without harming the system, amplifying the necessity of sample efficiency and safety for robot learning algorithms. In contrast to prior work which mainly focuses on the learning algorithm, we propose a learning system, that directly incorporates these requirements in the design of the policy representation, initialization, and optimization. We demonstrate that this system enables the high-speed Barrett WAM manipulator to learn juggling two balls from 56 minutes of experience with a binary reward signal and finally juggles continuously for up to 33 minutes or about 4500 repeated catches. The videos documenting the learning process and the evaluation can be found at https://sites.google.com/view/jugglingbot .

03/05/2021

ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing

Roland Hafner, Tim Hertweck, Philipp Kloeppner and
Michael Bloesch, Michael Neunert, Markus Wulfmeier, Saran Tunyasuvunakool, Nicolas Heess, Martin Riedmiller

Yevgen Chebotar, Karol Hausman, Yao Lu and
Ted Xiao, Dmitry Kalashnikov, Jacob Varley, Alex Irpan, Benjamin Eysenbach, Ryan C Julian, Chelsea Finn, Sergey Levine

Peter Anderson, Ayush Shrivastava, Joanne Truong and
Arjun Majumdar, Devi Parikh Georgia Tech &, Facebook AI Research, Dhruv Batra Georgia Tech &, Facebook AI Research, Stefan Lee

policy learning, zero-order optimization, reinforcement learning, model predictive control, robotics, model-based learning

5:09

03/05/2021

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

Ossama Ahmed, Frederik Träuble, Anirudh Goyal and
Alexander Neitz, Manuel Wuthrich, Yoshua Bengio, Bernhard Schoelkopf, Stefan Bauer

Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization

Manuel Wuthrich, Felix Widmaier, Felix Grimminger and
Shruti Joshi, Vaibhav Agrawal, Bilal Hammoud, Majid Khadiv, Miroslav Bogdanovic, Vincent Berenz, Julian Viereck, Maximilien Naveau, Ludovic Righetti, Bernhard Schölkopf, Stefan Bauer

Machine Learning, Transfer, Adaptation, Multi-task Learning, Reinforcement Learning, Incremental Learning, Learning in Robotics

11:02

05/01/2021

Applications, Applications, Computer Vision; Deep Learning, Deep Autoencoders; Deep Learning, Generative Models; Probabilistic Methods , Reinforcement Learning and Planning, Deep RL

5:13

16/11/2020

cars in uncommon states (cus), fine-grained car parsing in ad, 3d part guided image editing, part level object understanding, cus dataset

1:01

06/12/2021