The Differentiable Cross-Entropy Method

12/07/2020

The Differentiable Cross-Entropy Method

Brandon Amos, Denis Yarats

Keywords: Optimization - Non-convex

Abstract Paper Similar Papers

Abstract: We study the Cross-Entropy Method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable variant that enables us to differentiate the output of CEM with respect to the objective function's parameters. In the machine learning setting this brings CEM inside of the end-to-end learning pipeline where this has otherwise been impossible. We show applications in a synthetic energy-based structured prediction task and in non-convex continuous control. In the control setting we show how to embed optimal action sequences into a lower-dimensional space. This enables us to use policy optimization to fine-tune modeling components by differentiating through the CEM-based controller.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

A Free-Energy Principle for Representation Learning

Yansong Gao, Pratik Chaudhari

Keywords Paper

Representation Learning

0

0

0

0

15:32

06/12/2021

Parameter Inference with Bifurcation Diagrams

Gregory Szep, Neil Dalchau, Attila Csikász-Nagy

Keywords Paper

theory, generative model

0

0

0

0

15:07

06/12/2020

Multi-task Causal Learning with Gaussian Processes

Virginia Aglietti, Theo Damoulas, Mauricio Álvarez, Javier González

Keywords Paper

0

0

0

0

3:14

18/07/2021

Dataset Dynamics via Gradient Flows in Probability Space

David Alvarez-Melis, Nicolo Fusi

Keywords Paper

Algorithms, Optimal Transport

0

0

0

1

5:11

26/04/2020

Empirical Bayes Transductive Meta-Learning with Synthetic Gradients

Shell Xu Hu, Pablo Moreno, Yang Xiao and
Xi Shen, Guillaume Obozinski, Neil Lawrence, Andreas Damianou

Keywords Paper

Meta-learning, Empirical Bayes, Synthetic Gradient, Information Bottleneck

0

0

0

0

4:47

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords Paper

machine learning, online learning

0

0

0

0

14:00

06/12/2020

Domain Adaptation as a Problem of Inference on Graphical Models

Kun Zhang, Mingming Gong, Petar Stojanov and
Biwei Huang, QINGSONG LIU, Clark Glymour

Keywords Paper

Probabilistic Methods -> Graphical Models, Theory -> Learning Theory

0

0

0

0

3:27

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng , Ben Letham, Hongzi Mao, Eytan Bakshy

Keywords Paper

0

0

0

0

3:29

12/07/2020

On Contrastive Learning for Likelihood-free Inference

Conor Durkan, Iain Murray, George Papamakarios

Keywords Paper

Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

0

0

0

0

14:45

06/12/2021

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

Julius von Kügelgen, Yash Sharma, Luigi Gresele and
Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

Keywords Paper

theory, self-supervised learning, representation learning

0

0

0

0

16:02

26/04/2020

A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms

Yoshua Bengio, Tristan Deleu, Nasim Rahaman and
Nan Rosemary Ke, Sebastien Lachapelle, Olexa Bilaniuk, Anirudh Goyal, Christopher Pal

Keywords Paper

meta-learning, transfer learning, structure learning, modularity, causality

0

0

0

0

5:25

26/04/2020

The Shape of Data: Intrinsic Distance for Data Distributions

Anton Tsitsulin, Marina Munkhoeva, Davide Mottin and
Panagiotis Karras, Alex Bronstein, Ivan Oseledets, Emmanuel Mueller

Keywords Paper

Deep Learning, Generative Models, Nonlinear Dimensionality Reduction, Manifold Learning, Similarity and Distance Learning, Spectral Methods

0

0

0

0

4:49

19/08/2021

Variational Model-based Policy Optimization

Yinlam Chow, Brandon Cui, Moonkyung Ryu, Mohammad Ghavamzadeh

Keywords Paper

Machine Learning, Reinforcement Learning

0

0

0

0

15:31

06/12/2021

Scalable Intervention Target Estimation in Linear Models

Burak Varici, Karthikeyan Shanmugam, Prasanna Sattigeri, Ali Tajer

Keywords Paper

theory, graph learning, causality

0

0

0

0

15:16

03/05/2021

NOVAS: Non-convex Optimization via Adaptive Stochastic Search for End-to-end Learning and Control

Ioannis Exarchos, Marcus A Pereira, Ziyi Wang, Evangelos Theodorou

Keywords Paper

deep neural networks, deep FBSDEs, stochastic control, nested optimization

0

0

0

0

5:35

13/04/2021

Variational autoencoder with learned latent structure

Marissa Connor, Gregory Canal, Christopher Rozell

Keywords Paper

0

0

0

0

3:00

06/12/2020

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Masha Itkina, Boris Ivanovic, Ransalu Senanayake and
Mykel J Kochenderfer, Marco Pavone

Keywords Paper

0

0

0

0

3:39

06/12/2021

Out-of-Distribution Generalization in Kernel Regression

Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

Keywords Paper

theory, deep learning, machine learning

0

0

0

0

15:07

26/08/2020

Causal Bayesian Optimization

Virginia Aglietti, Xiaoyu Lu, Andrei Paleyes, Javier Gonzalez

Keywords Paper

0

0

0

0

13:56

18/07/2021

Conjugate Energy-Based Models

Hao Wu, Babak Esmaeili, Michael Wick and
Jean-Baptiste Tristan, Jan-Willem van de Meent

Keywords Paper

Deep Learning, Generative Models

0

0

0

0

5:22

06/12/2021

Deep Explicit Duration Switching Models for Time Series

Abdul Fatir Ansari, Konstantinos Benidis, Richard Kurle and
Ali Caner Turkmen, Harold Soh, Alexander J Smola, Bernie Wang, Tim Januschowski

Keywords Paper

0

0

0

0

10:12

02/02/2021

Measuring Dependence with Matrix-based Entropy Functional

Shujian Yu, Francesco Alesiani, Xi Yu and
Robert Jenssen, Jose Principe

Keywords Paper

0

0

0

0

19:33

14/09/2020

Learning Representations from Dendrograms

Morteza Haghir Chehreghani, Mostafa Haghir Chehreghani

Keywords Paper

0

0

0

0

14:44

02/02/2021

Learning Cycle-Consistent Cooperative Networks via Alternating MCMC Teaching for Unsupervised Cross-Domain Translation

Jianwen Xie, Zilong Zheng, Xiaolin Fang and
Song-Chun Zhu, Ying Nian Wu

Keywords Paper

0

0

0

0

14:48

13/04/2021

Influence decompositions for neural network attribution

Kyle Reing, Greg Ver Steeg, Aram Galstyan

Keywords Paper

0

0

0

0

2:52

14/06/2020

Enhanced Transport Distance for Unsupervised Domain Adaptation

Mengxue Li, Yi-Ming Zhai, You-Wei Luo and
Peng-Fei Ge, Chuan-Xian Ren

Keywords Paper

uda, optimal transport, neural networks, attention mechanism, kantorovich potential

0

0

0

0

0:58

18/07/2021

Learning Randomly Perturbed Structured Predictors for Direct Loss Minimization

Hedda Cohen Indelman, Tamir Hazan

Keywords Paper

Algorithms, Structured Prediction, Algorithms, Collaborative Filtering, Applications, Recommender Systems

0

0

0

0

5:11

18/07/2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee and
Sergey Levine, Shixiang Gu

Keywords Paper

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

0

0

0

0

5:16

18/07/2021

Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction

Kenta Niwa, Guoqiang Zhang, W. Bastiaan Kleijn and
Noboru Harada, Hiroshi Sawada, Akinori Fujino

Keywords Paper

Optimization, Distributed and Parallel Optimization

0

0

0

0

5:41

26/08/2020

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification

Han Bao, Masashi Sugiyama

Keywords Paper

0

0

0

0

15:01

06/12/2021

A Mathematical Framework for Quantifying Transferability in Multi-source Transfer Learning

Xinyi Tong, Xiangxiang Xu, Shao-Lun Huang, Lizhong Zheng

Keywords Paper

theory, deep learning, machine learning, vision, transfer learning

2

1

0

0

13:27

18/07/2021

Explaining Time Series Predictions with Dynamic Masks

Jonathan Crabbé, Mihaela van der Schaar

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

5:17

02/02/2021

Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization

Kentaro Kanamori, Takuya Takagi, Ken Kobayashi and
Yuichi Ike, Kento Uemura, Hiroki Arimura

Keywords Paper

0

0

0

0

16:47

18/07/2021

Active Learning of Continuous-time Bayesian Networks through Interventions

Dominik Linzner, Heinz Koeppl

Keywords Paper

Probabilistic Methods, Graphical Models

0

0

0

0

5:07

06/12/2020

A Statistical Mechanics Framework for Task-Agnostic Sample Design in Machine Learning

Bhavya Kailkhura, Jayaraman Thiagarajan, Qunwei Li and
Jize Zhang, Yi Zhou, Timo Bremer

Keywords Paper

0

0

0

0

3:21

06/12/2020

Deep Rao-Blackwellised Particle Filters for Time Series Forecasting

Richard Kurle, Syama Sundar Rangapuram, Emmanuel de Bézenac and
Stephan Günnemann, Jan Gasthaus

Keywords Paper

0

0

0

0

3:14

16/11/2020

Generative adversarial training of product of policies for robust and adaptive movement primitives

Emmanuel Pignat, Hakan Girgin, Sylvain Calinon

Keywords Paper

0

0

0

0

4:26

16/11/2020

Learning to Compose Hierarchical Object-Centric Controllers for Robotic Manipulation

Mohit Sharma, Jacky Liang, Jialiang Zhao and
Alex Lagrassa, Oliver Kroemer

Keywords Paper

0

0

0

0

4:51

05/01/2021

Cross-Domain Latent Modulation for Variational Transfer Learning

Jinyong Hou, Jeremiah D. Deng, Stephen Cranefield, Xuejie Ding

Keywords Paper

0

0

0

0

4:52

03/08/2020

Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Tárik S. Salem, Helge Langseth, Heri Ramampiaro

Keywords Paper

0

0

0

0

7:45