Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

Abstract: This paper develops a Pontryagin differentiable programming (PDP) methodology, which establishes a unified framework for solving a broad class of learning and control tasks. The PDP is distinguished from existing methods by two novel techniques. First, we differentiate through Pontryagin's Maximum Principle, which yields the analytical derivative of a trajectory with respect to tunable parameters within an optimal control system and enables end-to-end learning of dynamics, policies, and/or control objective functions. Second, we propose an auxiliary control system in the backward pass of the PDP framework; the output of this auxiliary control system is the analytical derivative of the original system's trajectory with respect to the parameters, and it can be solved iteratively using standard control tools. We investigate three learning modes of the PDP: inverse reinforcement learning, system identification, and control/planning. We demonstrate the capability of the PDP in each learning mode on different high-dimensional systems, including a multilink robot arm, a 6-DoF maneuvering UAV, and 6-DoF rocket powered landing.
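The two techniques named in the abstract can be illustrated on a toy problem. The following is a minimal sketch, not the authors' implementation: it assumes a hypothetical scalar linear-quadratic system x_{t+1} = theta*x_t + B*u_t in which theta is the only tunable parameter, and all constants and function names (`forward_pass`, `backward_pass`, `finite_difference`) are made up for illustration. The forward pass solves the optimal control problem and recovers the costates from Pontryagin's Maximum Principle. The backward pass differentiates those conditions with respect to theta, which gives a second linear-quadratic problem (playing the role of the auxiliary control system) whose solution is the trajectory derivative.

```python
# Hypothetical scalar example; every constant and name here is an assumption.
B, Q, R, QT = 1.0, 1.0, 0.5, 1.0   # input gain, stage and terminal cost weights
T, X0 = 10, 1.0                    # horizon and initial state (independent of theta)


def forward_pass(theta):
    """Minimize sum_t 0.5*(Q*x_t^2 + R*u_t^2) + 0.5*QT*x_T^2
    subject to x_{t+1} = theta*x_t + B*u_t, using a Riccati recursion."""
    P = [0.0] * (T + 1)
    P[T] = QT
    K = [0.0] * T
    for t in reversed(range(T)):
        D = R + B * B * P[t + 1]
        K[t] = B * P[t + 1] * theta / D
        P[t] = Q + theta * P[t + 1] * (theta - B * K[t])
    x, u = [X0], []
    for t in range(T):
        u.append(-K[t] * x[t])
        x.append(theta * x[t] + B * u[t])
    lam = [P[t] * x[t] for t in range(T + 1)]  # costates: lam_t = P_t * x_t
    return x, u, lam, P


def backward_pass(theta):
    """Solve the differentiated PMP conditions for X_t = dx_t/dtheta, U_t = du_t/dtheta:
         X_{t+1} = theta*X_t + B*U_t + x_t,        X_0 = 0
         L_t     = Q*X_t + theta*L_{t+1} + lam_{t+1}
         0       = R*U_t + B*L_{t+1},               L_T = QT*X_T
    using the affine ansatz L_t = P_t*X_t + W_t."""
    x, u, lam, P = forward_pass(theta)
    W = [0.0] * (T + 1)  # W_T = 0 because L_T = QT*X_T
    for t in reversed(range(T)):
        D = R + B * B * P[t + 1]
        W[t] = theta * R * (P[t + 1] * x[t] + W[t + 1]) / D + lam[t + 1]
    X, U = [0.0], []
    for t in range(T):
        D = R + B * B * P[t + 1]
        U.append(-B * (P[t + 1] * (theta * X[t] + x[t]) + W[t + 1]) / D)
        X.append(theta * X[t] + B * U[t] + x[t])
    return X, U


def finite_difference(theta, eps=1e-6):
    """Central-difference estimate of dx_t/dtheta, used only as a sanity check."""
    x_plus = forward_pass(theta + eps)[0]
    x_minus = forward_pass(theta - eps)[0]
    return [(p - m) / (2 * eps) for p, m in zip(x_plus, x_minus)]
```

Calling `backward_pass(0.9)` returns the analytical state and control derivatives along the optimal trajectory, and `finite_difference(0.9)` gives a numerical estimate to compare against. In a learning loop, these derivatives would be chained with the gradient of a loss defined on the trajectory to update theta.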

26/04/2020

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

Similar Papers

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control

Nir Levine, Yinlam Chow, Rui Shu, Ang Li, Mohammad Ghavamzadeh, Hung Bui

Embed-to-Control, Representation Learning, Stochastic Optimal Control, VAE, iLQR

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Gu

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

Infinite-Horizon Differentiable Model Predictive Control

Sebastian East, Marco Gallieri, Jonathan Masci, Jan Koutnik, Mark Cannon

Model Predictive Control, Riccati Equation, Imitation Learning, Safe Learning

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach

Fan Yang, Kai He, Linxiao Yang, Hongxia Du, Jingbang Yang, Bo Yang, Liang Sun

optimization

Differentiable Segmentation of Sequences

Erik Scharwächter, Jonathan Lennartz, Emmanuel Müller

warping functions, concept drift, change point detection, segmented models, segmentation, gradient descent

Learning to Control PDEs with Differentiable Physics

Philipp Holl, Nils Thuerey, Vladlen Koltun

Differentiable physics, Optimal control, Deep learning

Learn to expect the unexpected: Probably approximately correct domain generalization

Vikas Garg, Adam Tauman Kalai, Katrina Ligett, Steven Wu

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

machine learning, online learning

Multi-Agent Determinantal Q-Learning

Yaodong Yang, Ying Wen, Jun Wang, Liheng Chen, Kun Shao, David Mguni, Weinan Zhang

Planning, Control, and Multiagent Learning

Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

Kenneth Borup, Lars N Andersen

theory, deep learning, optimization

A Mathematical Framework for Quantifying Transferability in Multi-source Transfer Learning

Xinyi Tong, Xiangxiang Xu, Shao-Lun Huang, Lizhong Zheng

theory, deep learning, machine learning, vision, transfer learning

Learning Structured Latent Factors from Dependent Data: A Generative Model Framework from Information-Theoretic Perspective

Ruixiang ZHANG, Katsuhiko Ishiguro, Masanori Koyama

Learning Theory

IEPT: Instance-Level and Episode-Level Pretext Tasks for Few-Shot Learning

Manli Zhang, Jianhong Zhang, Zhiwu Lu, Tao Xiang, Mingyu Ding, Songfang Huang

self-supervised learning, few-shot learning, episode-level pretext task

Learning Similarity Metrics for Numerical Simulations

Georg Kohl, Kiwon Um, Nils Thuerey

General Machine Learning Techniques

Towards Verified Stochastic Variational Inference for Probabilistic Programs

Wonyeol Lee, Hangyeol Yu, Xavier Rival, Hongseok Yang

semantics, correctness, Probabilistic programming, static analysis

On the Modularity of Hypernetworks

Tomer Galanti, Lior Wolf

Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems

Kaixuan Wei, Angelica I Aviles-Rivero, Jingwei Liang, Ying Fu, Carola-Bibiane Schönlieb, Hua Huang

Deep Learning - Algorithms

Feature Importance Ranking for Deep Learning

Maksymilian Wojtas, Ke Chen

An Information-Theoretic Framework for Unifying Active Learning Problems

Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet
