Oops I Took A Gradient: Scalable Sampling for Discrete Distributions

Abstract: We propose a general and scalable approximate sampling strategy for probabilistic models with discrete variables. Our approach uses gradients of the likelihood function with respect to its discrete inputs to propose updates in a Metropolis-Hastings sampler. We show empirically that this approach outperforms generic samplers in a number of difficult settings including Ising models, Potts models, restricted Boltzmann machines, and factorial hidden Markov models. We also demonstrate our improved sampler for training deep energy-based models on high dimensional discrete image data. This approach outperforms variational auto-encoders and existing energy-based models. Finally, we give bounds showing that our approach is near-optimal in the class of samplers which propose local updates.

06/12/2020

Oops I Took A Gradient: Scalable Sampling for Discrete Distributions

Will Grathwohl, Kevin Swersky, Milad Hashemi, David Duvenaud, Chris Maddison

Comments

Similar Papers

Stochastic Normalizing Flows

Hao Wu, Jonas Köhler, Frank Noe

Keywords Abstract Paper

Gaussianization Flows

Chenlin Meng, Yang Song, Jiaming Song, Stefano Ermon

Keywords Abstract Paper

A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions

Wei Deng, Guang Lin, Faming Liang

Keywords Abstract Paper

Amortized variance reduction for doubly stochastic objective

Ayman Boustati, Sattar Vakili, James Hensman, ST John

Keywords Abstract Paper

Instance-Optimal Compressed Sensing via Posterior Sampling

Ajil Jalal, Sushrut Karmalkar, Alex Dimakis, Eric Price

Keywords Abstract Paper

Algorithms, Sparsity and Compressed Sensing

Time-independent Generalization Bounds for SGLD in Non-convex Settings

Tyler Farghly, Patrick Rebeschini

Keywords Abstract Paper

Slice Sampling Reparameterization Gradients

David M Zoltowski, Diana Cai, Ryan Adams

Keywords Abstract Paper

optimization, machine learning, generative model

Distributionally Robust Federated Averaging

Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

Keywords Abstract Paper

Localizing and amortizing: Efficient inference for gaussian processes

Linfeng Liu, Liping Liu

Keywords Abstract Paper

Batch Stationary Distribution Estimation

Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans

Keywords Abstract Paper

Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

Conformal Bayesian Computation

Edwin Fong, Chris C Holmes

Keywords Abstract Paper

Near Input Sparsity Time Kernel Embeddings via Adaptive Sampling

Amir Zandieh, David Woodruff

Keywords Abstract Paper

Distributionally Robust Parametric Maximum Likelihood Estimation

Viet Anh Nguyen, Xuhui Zhang, Jose Blanchet, Angelos Georghiou

Keywords Abstract Paper

Run-Sort-ReRun: Escaping Batch Size Limitations in Sliced Wasserstein Generative Models

José Lezama, Wei Chen, Qiang Qiu

Keywords Abstract Paper

Score-Based Generative Modeling through Stochastic Differential Equations

Yang Song, Jascha Sohl-Dickstein, Durk Kingma and Abhishek Kumar, Stefano Ermon, Ben Poole

Keywords Abstract Paper

score matching, stochastic differential equations, score-based generative models, diffusion, generative models

Continuous Latent Process Flows

Ruizhi Deng, Marcus Brubaker, Greg Mori, Andreas M Lehrmann

Keywords Abstract Paper

Spatio-Temporal Variational Gaussian Processes

Oliver Hamelijnck, William Wilkinson, Niki Loppi and Arno Solin, Theodoros Damoulas

Keywords Abstract Paper

generative model, kernel methods

Kernel-convoluted Deep Neural Networks with Data Augmentation

Minjin Kim, Young-geun Kim, Dongha Kim and Yongdai Kim, Myunghee Cho Paik

Keywords Abstract Paper

Regularized Autoencoders via Relaxed Injective Probability Flow

Abhishek Kumar, Ben Poole, Kevin Murphy

Keywords Abstract Paper

Faster Wasserstein Distance Estimation with the Sinkhorn Divergence

Lénaïc Chizat, Pierre Roussillon, Flavien Léger and François-Xavier Vialard, Gabriel Peyré

Keywords Abstract Paper

Autoregressive Score Matching

Chenlin Meng, Lantao Yu, Yang Song and Jiaming Song, Stefano Ermon

Keywords Abstract Paper

On learning continuous pairwise markov random fields

Abhin Shah, Devavrat Shah, Gregory Wornell

Keywords Abstract Paper

Generalized Doubly Reparameterized Gradient Estimators

Matthias Bauer, Andriy Mnih

Keywords Abstract Paper

Probabilistic Methods, Approximate Inference

On Estimation in Latent Variable Models

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yang Song, Jascha Sohl-Dickstein, Durk Kingma and
Abhishek Kumar, Stefano Ermon, Ben Poole

Keywords Paper

Keywords Paper

Oliver Hamelijnck, William Wilkinson, Niki Loppi and
Arno Solin, Theodoros Damoulas

Keywords Paper

Minjin Kim, Young-geun Kim, Dongha Kim and
Yongdai Kim, Myunghee Cho Paik

Keywords Paper

Keywords Paper

Lénaïc Chizat, Pierre Roussillon, Flavien Léger and
François-Xavier Vialard, Gabriel Peyré

Keywords Paper

Chenlin Meng, Lantao Yu, Yang Song and
Jiaming Song, Stefano Ermon

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Hadrien Hendrikx, Lin Xiao, Sebastien Bubeck and
Francis Bach, Laurent Massoulié

Keywords Paper