Gradient Estimation with Stochastic Softmax Tricks

Abstract: The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding. Working within the perturbation model framework, we introduce stochastic softmax tricks, which generalize the Gumbel-Softmax trick to combinatorial spaces. Our framework is a unified perspective on existing relaxed estimators for perturbation models, and it contains many novel relaxations. We design structured relaxations for subset selection, spanning trees, arborescences, and others. When compared to less structured baselines, we find that stochastic softmax tricks can be used to train latent variable models that perform better and discover more latent structure.

06/12/2021

Machine Learning, Learning Generative Models, Time-series; Data Streams, Unsupervised Learning, Approximate Probabilistic Inference

13:39

06/12/2021

Gradient Estimation with Stochastic Softmax Tricks

Max Paulus, Dami Choi, Daniel Tarlow, Andreas Krause, Chris J. Maddison

Comments

Similar Papers

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and Danil Karpushkin, Dmitry Vetrov

Keywords Abstract Paper

deep learning, optimization

Responsive Safety in Reinforcement Learning

Adam Stooke, Joshua Achiam, Pieter Abbeel

Keywords Abstract Paper

Reinforcement Learning - Deep RL

Invertible Gaussian Reparameterization: Revisiting the Gumbel-Softmax

Andres Potapczynski, Gabriel Loaiza-Ganem, John Cunningham

Keywords Abstract Paper

Repulsive Deep Ensembles are Bayesian

Francesco D'Angelo, Vincent Fortuin

Keywords Abstract Paper

deep learning, optimization

A Graduated Filter Method for Large Scale Robust Estimation

Huu Le, Christopher Zach

Keywords Abstract Paper

robust fitting, bundle adjustment, non-convex, poor local minima, non-linear least squares, graduated non-convexity.

Powerpropagation: A sparsity inducing weight reparameterisation

Jonathan Schwarz, Siddhant M Jayakumar, Razvan Pascanu and Peter E Latham, Yee Teh

Keywords Abstract Paper

deep learning, optimization, continual learning

Beyond perturbation stability: LP recovery guarantees for MAP inference on noisy stable instances

Hunter Lang, Aravind Reddy, David Sontag, Aravindan Vijayaraghavan

Keywords Abstract Paper

Monte Carlo Filtering Objectives

Shuangshuang Chen, Sihao Ding, Yiannis Karayiannidis, Mårten Björkman

Keywords Abstract Paper

Machine Learning, Learning Generative Models, Time-series; Data Streams, Unsupervised Learning, Approximate Probabilistic Inference

Learning with Algorithmic Supervision via Continuous Relaxations

Felix Petersen, Christian Borgelt, Hilde Kuehne, Oliver Deussen

Keywords Abstract Paper

deep learning

The continuous categorical: a novel simplex-valued exponential family

Elliott Gordon-Rodriguez, Gabriel Loaiza-Ganem, John Cunningham

Keywords Abstract Paper

Probabilistic Inference - Models and Probabilistic Programming

Sharpness-aware Minimization for Efficiently Improving Generalization

Pierre Foret, Ariel Kleiner, Hossein Mobahi, Behnam Neyshabur

Keywords Abstract Paper

Generalization, Deep Learning, Training Method, Regularization, Sharpness Minimization

DisARM: An Antithetic Gradient Estimator for Binary Latent Variables

Zhe Dong, Andriy Mnih, George Tucker

Keywords Abstract Paper

Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity

Jingzhao Zhang, Tianxing He, Suvrit Sra, Ali Jadbabaie

Keywords Abstract Paper

Adaptive methods, optimization, deep learning

Structure Adaptive Algorithms for Stochastic Bandits

Rémy Degenne, Han Shao, Wouter Koolen

Keywords Abstract Paper

Online Learning, Active Learning, and Bandits

Bayesian Attention Modules

Xinjie Fan, Shujian Zhang, Bo Chen, Mingyuan Zhou

Keywords Abstract Paper

Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing

Asish Ghoshal, Xilun Chen, Sonal Gupta and Luke Zettlemoyer, Yashar Mehdad

Keywords Abstract Paper

calibration, semantic parsing, structured prediction, label smoothing

Coupled Gradient Estimators for Discrete Latent Variables

Zhe Dong, Andriy Mnih, George Tucker

Keywords Abstract Paper

Maximum Roaming Multi-Task Learning

Lucas Pascal, Pietro Michiardi, Xavier Bost and Benoit Huet, Maria A. Zuluaga

Keywords Abstract Paper

The Convex Relaxation Barrier, Revisited: Tightened Single-Neuron Relaxations for Neural Network Verification

Christian Tjandraatmadja, Ross Anderson, Joey Huchette and Will Ma, KRUNAL KISHOR PATEL, Juan Pablo Vielma

Keywords Abstract Paper

SLURP: Side Learning Uncertainty for Regression Problems

Xuanlong Yu, Gianni Franchi, Emanuel Aldea

Keywords Abstract Paper

Uncertainty estimation, Confidence estimation, Auxiliary model, Monocular depth, Optical flow

Domain Knowledge Empowered Structured Neural Net for End-to-End Event Temporal Relation Extraction

Rujun Han, Yichao Zhou, Nanyun Peng

Keywords Abstract Paper

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and
Danil Karpushkin, Dmitry Vetrov

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jonathan Schwarz, Siddhant M Jayakumar, Razvan Pascanu and
Peter E Latham, Yee Teh

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Asish Ghoshal, Xilun Chen, Sonal Gupta and
Luke Zettlemoyer, Yashar Mehdad

Keywords Paper

Keywords Paper

Lucas Pascal, Pietro Michiardi, Xavier Bost and
Benoit Huet, Maria A. Zuluaga

Keywords Paper

Christian Tjandraatmadja, Ross Anderson, Joey Huchette and
Will Ma, KRUNAL KISHOR PATEL, Juan Pablo Vielma

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Mingyang Yi, LU HOU, Lifeng Shang and
Xin Jiang, Qun Liu, Zhi-Ming Ma

Keywords Paper

Yanwei Fu, Chen Liu, Donghao Li and
Xinwei Sun, Jinshan ZENG, Yuan Yao

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper