Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

16/11/2020

Tianjian Chen, Zhanpeng He, Matei Ciocarlie

Abstract: Deep Reinforcement Learning (RL) has shown great success in learning complex control policies for a variety of applications in robotics. However, in most such cases, the hardware of the robot has been considered immutable, modeled as part of the environment. In this study, we explore the problem of learning hardware and control parameters together in a unified RL framework. To achieve this, we propose to model the robot body as a “hardware policy”, analogous to and optimized jointly with its computational counterpart. We show that, by modeling such hardware policies as auto-differentiable computational graphs, the ensuing optimization problem can be solved efficiently by gradient-based algorithms from the Policy Optimization family. We present two such design examples: a toy mass-spring problem, and a real-world problem of designing an underactuated hand. We compare our method against traditional co-optimization approaches, and also demonstrate its effectiveness by building a physical prototype based on the learned hardware parameters. Videos and more details are available at https://roamlab.github.io/hwasp/.
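The idea of treating hardware parameters as differentiable quantities optimized together with the controller can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single damped mass on a spring, a proportional controller, and plain gradient descent through a deterministic rollout instead of a Policy Optimization algorithm. The `Dual` class is a tiny forward-mode autodiff helper, and all names and constants (`rollout_loss`, `co_optimize`, the damping and learning rate) are hypothetical choices made for illustration.

```python
class Dual:
    """Forward-mode autodiff number: a value plus its derivative w.r.t. one input."""

    def __init__(self, val, dot=0.0):
        self.val, self.dot = float(val), float(dot)

    @staticmethod
    def lift(x):
        return x if isinstance(x, Dual) else Dual(x)

    def __add__(self, other):
        other = Dual.lift(other)
        return Dual(self.val + other.val, self.dot + other.dot)

    __radd__ = __add__

    def __sub__(self, other):
        other = Dual.lift(other)
        return Dual(self.val - other.val, self.dot - other.dot)

    def __rsub__(self, other):
        return Dual.lift(other) - self

    def __mul__(self, other):
        other = Dual.lift(other)
        # Product rule.
        return Dual(self.val * other.val,
                    self.dot * other.val + self.val * other.dot)

    __rmul__ = __mul__


def rollout_loss(gain, stiffness, target=1.0, steps=50, dt=0.05,
                 mass=1.0, damping=2.0):
    """Simulate the mass and return final squared error plus an effort penalty."""
    x, v, effort = Dual(0.0), Dual(0.0), Dual(0.0)
    for _ in range(steps):
        u = gain * (target - x)                  # computational policy (assumed P-controller)
        force = u - stiffness * x - damping * v  # "hardware policy": spring and damper
        v = v + (dt / mass) * force              # semi-implicit Euler integration
        x = x + dt * v
        effort = effort + dt * u * u
    err = target - x
    return err * err + 0.01 * effort


def loss_gradients(gain, stiffness):
    """Differentiate the same rollout w.r.t. the control and hardware parameter."""
    d_gain = rollout_loss(Dual(gain, 1.0), Dual(stiffness, 0.0)).dot
    d_stiffness = rollout_loss(Dual(gain, 0.0), Dual(stiffness, 1.0)).dot
    return d_gain, d_stiffness


def co_optimize(gain=1.0, stiffness=5.0, lr=0.05, iters=200):
    """Jointly update controller gain and spring stiffness by gradient descent."""
    history = []
    for _ in range(iters):
        history.append(rollout_loss(gain, stiffness).val)
        d_gain, d_stiffness = loss_gradients(gain, stiffness)
        gain -= lr * d_gain
        # Keep the stiffness physically meaningful (strictly positive).
        stiffness = max(0.1, stiffness - lr * d_stiffness)
    history.append(rollout_loss(gain, stiffness).val)
    return gain, stiffness, history


if __name__ == "__main__":
    g, k, hist = co_optimize()
    print(f"gain={g:.3f} stiffness={k:.3f} loss {hist[0]:.4f} -> {hist[-1]:.4f}")
```

Because the spring term sits inside the same differentiable rollout as the controller, one gradient computation covers both kinds of parameter. In the paper's setting, the same structure would be optimized by a policy-gradient method rather than by this deterministic descent.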

The talk and the corresponding paper were published at the CoRL 2020 virtual conference.

Similar Papers

16/11/2020

Multi-Level Structure vs. End-to-End-Learning in High-Performance Tactile Robotic Manipulation

Florian Voigt, Lars Johannsmeier, Sami Haddadin

5:13

06/12/2020

Deep Imitation Learning for Bimanual Robotic Manipulation

Fan Xie, Alexander Chowdhury, Clara De Paolis Kaluza, Linfeng Zhao, Lawson Wong, Rose Yu

3:12

16/11/2020

SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation

Xingyu Lin, Yufei Wang, Jake Olkin, David Held

5:06

16/11/2020

ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing

Mete Akbulut, Erhan Oztop, Muhammet Yunus Seker, Hh X, Ahmet Tekden, Emre Ugur

5:03

16/11/2020

Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion

Roland Hafner, Tim Hertweck, Philipp Kloeppner, Michael Bloesch, Michael Neunert, Markus Wulfmeier, Saran Tunyasuvunakool, Nicolas Heess, Martin Riedmiller

5:24

06/12/2021

Evolution Gym: A Large-Scale Benchmark for Evolving Soft Robots

Jagdeep Bhatia, Holly Jackson, Yunsheng Tian, Jie Xu, Wojciech Matusik

Keywords: optimization, reinforcement learning and planning, machine learning

13:48

12/07/2020

Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, Wojciech Matusik

Keywords: Reinforcement Learning - Deep RL

15:15

03/05/2021

Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency

Qiang Zhang, Tete Xiao, Alyosha Efros, Lerrel Pinto, Xiaolong Wang

Keywords: self-supervised learning, robotics

14:38

03/05/2021

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

Ossama Ahmed, Frederik Träuble, Anirudh Goyal, Alexander Neitz, Manuel Wuthrich, Yoshua Bengio, Bernhard Schoelkopf, Stefan Bauer

Keywords: reinforcement learning, transfer learning, robotics, domain adaptation, generalization, causality, sim2real transfer

5:03

16/11/2020

Learning Certified Control Using Contraction Metric

Dawei Sun, Susmit Jha, Chuchu Fan

5:02

16/11/2020

Chaining Behaviors from Data with Model-Free Reinforcement Learning

Avi Singh, Albert Yu, Jonathan Yang, Jesse Zhang, Aviral Kumar, Sergey Levine

5:01

06/12/2021

Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics

Ingmar Schubert, Danny Driess, Ozgur S. Oguz, Marc Toussaint

Keywords: reinforcement learning and planning

8:36

06/12/2021

Continual World: A Robotic Benchmark For Continual Reinforcement Learning

Maciej Wołczyk, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś

Keywords: reinforcement learning and planning, continual learning

8:13

06/12/2021

Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives

Murtaza Dalal, Deepak Pathak, Russ Salakhutdinov

Keywords: optimization, reinforcement learning and planning

10:01

03/05/2021

Model-Based Visual Planning with Self-Supervised Functional Distances

Stephen Tian, Suraj Nair, Frederik Ebert, Sudeep Dasari, Ben Eysenbach, Chelsea Finn, Sergey Levine

Keywords: reinforcement learning, distance learning, model learning, robotics, planning

9:11

26/04/2020

Intrinsically Motivated Discovery of Diverse Patterns in Self-Organizing Systems

Chris Reinke, Mayalen Etcheverry, Pierre-Yves Oudeyer

Keywords: deep learning, unsupervised learning, self-organization, game-of-life

14:57

16/11/2020

Robot Action Selection Learning via Layered Dimension Informed Program Synthesis

Jarrett Holtz, Arjun Guha, Joydeep Biswas

5:05

16/11/2020

Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems

Sushmita Bhattacharya, Siva Kailas, Sahil Badyal, Stephanie Gil, Dimitri Bertsekas

5:04

16/11/2020

Learning Dexterous Manipulation from Suboptimal Experts

Rae Jeong, Jost Tobias Springenberg, Jackie Kay, Dan Zheng, Alexandre Galashov, Nicolas Heess, Francesco Nori

5:03

26/04/2020

Learning Compositional Koopman Operators for Model-Based Control

Yunzhu Li, Hao He, Jiajun Wu, Dina Katabi, Antonio Torralba

Keywords: Koopman operators, graph neural networks, compositionality

4:55

06/12/2021

Amortized Synthesis of Constrained Configurations Using a Differentiable Surrogate

Xingyuan Sun, Tianju Xue, Szymon Rusinkiewicz, Ryan Adams

Keywords: deep learning, optimization

12:41

26/10/2020

PDDLStream: Integrating Symbolic Planners and Blackbox Samplers via Optimistic Adaptive Planning

Caelan Reed Garrett, Tomás Lozano-Pérez, Leslie Pack Kaelbling

Keywords: Task and Motion Planning, Robotics, Sampling-Based Planning, Domain-Independent Planning, Hybrid Planning

9:58

16/11/2020

Learning a Decentralized Multi-Arm Motion Planner

Huy Ha, Jingxi Xu, Shuran Song

3:41

16/11/2020

Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization

Zhenjie Zhao, Evangelos Papalexakis, Xiaojuan Ma

Keywords: human-robot interaction, physical learning, natural processing, model generalization

6:42

16/11/2020

Learning Equality Constraints for Motion Planning on Manifolds

Giovanni Sutanto, Isabel Rayas Fernández, Peter Englert, Ragesh Kumar Ramachandran, Gaurav Sukhatme

4:59

16/11/2020

Transporter Networks: Rearranging the Visual World for Robotic Manipulation

Andy Zeng, Pete Florence, Jonathan Tompson, Stefan Welker, Jonathan Chien, Maria Attarian, Travis Armstrong, Ivan Krasin, Dan Duong, Vikas Sindhwani, Johnny Lee

5:01

16/11/2020

CoT-AMFlow: Adaptive Modulation Network with Co-Teaching Strategy for Unsupervised Optical Flow Estimation

Hengli Wang, Rui Fan, Ming Liu

4:57

19/08/2021

Inter-Task Similarity for Lifelong Reinforcement Learning in Heterogeneous Tasks

Sergio A. Serrano

Keywords: Machine Learning, Transfer, Adaptation, Multi-task Learning, Reinforcement Learning, Incremental Learning, Learning in Robotics

11:02

15/11/2020

Koord: A Language for Programming and Verifying Distributed Robotics Application

Ritwika Ghosh, Chiao Hsieh, Sasa Misailovic, Sayan Mitra

Keywords: Programming Language for Robotics, Distributed Robotics

15:17

18/07/2021

Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills

Yevgen Chebotar, Karol Hausman, Yao Lu, Ted Xiao, Dmitry Kalashnikov, Jacob Varley, Alex Irpan, Benjamin Eysenbach, Ryan C Julian, Chelsea Finn, Sergey Levine

Keywords: Applications, Robotics

5:20

16/11/2020

Fit2Form: 3D Generative Model for Robot Gripper Form Design

Huy Ha, Shubham Agrawal, Shuran Song

5:06

16/11/2020

Model-Based Inverse Reinforcement Learning from Visual Demonstrations

Neha Das, Sarah Bechtle, Todor Davchev, Dinesh Jayaraman, Akshara Rai, Franziska Meier

5:03

16/11/2020

Towards Robotic Assembly by Predicting Robust, Precise and Task-oriented Grasps

Jialiang Zhao, Daniel Troniak, Oliver Kroemer

5:02

06/12/2020

Multi-Task Reinforcement Learning with Soft Modularization

Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang

3:18

02/02/2021

CMAX++: Leveraging Experience in Planning and Execution using Inaccurate Models

Anirudh Vemula, J. Andrew Bagnell, Maxim Likhachev

15:11

02/02/2021

BT Expansion: a Sound and Complete Algorithm for Behavior Planning of Intelligent Robots with Behavior Trees

Zhongxuan Cai, Minglong Li, Wanrong Huang, Wenjing Yang

15:49

16/11/2020

Multi-Modal Anomaly Detection for Unstructured and Uncertain Environments

Tianchen Ji, Sri Theja Vuppala, Girish Chowdhary, Katherine Driggs-Campbell

5:04

19/08/2021

Deep Reinforcement Learning for Multi-contact Motion Planning of Hexapod Robots

Huiqiao Fu, Kaiqiang Tang, Peng Li, Wenqi Zhang, Xinpeng Wang, Guizhou Deng, Tao Wang, Chunlin Chen

Keywords: Machine Learning, Deep Reinforcement Learning, Learning in Robotics, Motion and Path Planning

10:54

03/05/2021

Extracting Strong Policies for Robotics Tasks from Zero-Order Trajectory Optimizers

Cristina Pinneri, Shambhuraj Sawant, Sebastian Blaes, Georg Martius

Keywords: policy learning, zero-order optimization, reinforcement learning, model predictive control, robotics, model-based learning

5:09

06/12/2020

MATE: Plugging in Model Awareness to Task Embedding for Meta Learning

Xiaohan Chen, Zhangyang Wang, Siyu Tang, Krikamol Muandet

3:19