Estimating Gradients for Discrete Random Variables by Sampling without Replacement

26/04/2020

Estimating Gradients for Discrete Random Variables by Sampling without Replacement

Wouter Kool, Herke van Hoof, Max Welling

Keywords: gradient, estimator, discrete, categorical, sampling, without replacement, reinforce, baseline, variance, gumbel, vae, structured prediction

Abstract Paper Code Similar Papers

Abstract: We derive an unbiased estimator for expectations over discrete random variables based on sampling without replacement, which reduces variance as it avoids duplicate samples. We show that our estimator can be derived as the Rao-Blackwellization of three different estimators. Combining our estimator with REINFORCE, we obtain a policy gradient estimator and we reduce its variance using a built-in control variate which is obtained without additional model evaluations. The resulting estimator is closely related to other gradient estimators. Experiments with a toy problem, a categorical Variational Auto-Encoder and a structured prediction problem show that our estimator is the only estimator that is consistently among the best estimators in both high and low entropy settings.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/08/2021

Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness

Dazhong Shen, Chuan Qin, Chao Wang and
Hengshu Zhu, Enhong Chen, Hui Xiong

Keywords Paper

Machine Learning, Bayesian Learning, Probabilistic Machine Learning, Unsupervised Learning

0

0

0

0

13:04

18/07/2021

Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data

Sung Woo Park, Junseok Kwon

Keywords Paper

Deep Learning, Generative Models, Algorithms, Representation Learning; Optimization, Submodular Optimization, Probabilistic Methods, Robust statistics

0

0

0

0

5:20

19/08/2021

Federated Model Distillation with Noise-Free Differential Privacy

Lichao Sun, Lingjuan Lyu

Keywords Paper

Data Mining, Federated Learning, Privacy Preserving Data Mining, Multi-agent Learning, Trustable Learning

0

0

0

0

14:30

06/12/2021

Realistic evaluation of transductive few-shot learning

Olivier Veilleux, Malik Boudiaf, Pablo Piantanida, Ismail Ben Ayed

Keywords Paper

optimization, machine learning, few shot learning

0

0

0

0

10:21

03/05/2021

Combining Ensembles and Data Augmentation Can Harm Your Calibration

Yeming Wen, Ghassen Jerfel, Rafael Müller and
Michael W Dusenberry, Jasper Snoek, Balaji Lakshminarayanan, Dustin Tran

Keywords Paper

Uncertainty estimates, Ensembles, Calibration

0

0

0

0

6:10

18/07/2021

Valid Causal Inference with (Some) Invalid Instruments

Jason Hartford, Victor Veitch, Dhanya Sridhar, Kevin Leyton-Brown

Keywords Paper

Probabilistic Methods, Causal Inference

0

0

0

0

4:34

12/07/2020

Doubly robust off-policy evaluation with shrinkage

Yi Su, Maria Dimakopoulou, Akshay Krishnamurthy, Miroslav Dudik

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

15:08

18/07/2021

Provably End-to-end Label-noise Learning without Anchor Points

Xuefeng Li, Tongliang Liu, Bo Han and
Gang Niu, Masashi Sugiyama

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

5:16

06/12/2021

The Skellam Mechanism for Differentially Private Federated Learning

Naman Agarwal, Peter Kairouz, Ziyu Liu

Keywords Paper

machine learning, privacy, federated learning

0

0

0

0

15:37

03/05/2021

Gradient Origin Networks

Sam Bond-Taylor, Chris G Willcocks

Keywords Paper

Implicit Representation, Generative Models, Deep Learning

0

0

0

0

5:01

03/08/2020

Amortized variance reduction for doubly stochastic objective

Ayman Boustati, Sattar Vakili, James Hensman, ST John

Keywords Paper

0

0

0

0

5:02

06/12/2020

Distribution-free binary classification: prediction sets, confidence intervals and calibration

Chirag Gupta, Aleksandr Podkopaev, Aaditya Ramdas

Keywords Paper

0

0

0

0

3:21

18/07/2021

State Relevance for Off-Policy Evaluation

Simon Shen, Jason Ma, Omer Gottesman, Finale Doshi-Velez

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:02

02/02/2021

Practical and Rigorous Uncertainty Bounds for Gaussian Process Regression

Christian Fiedler, Carsten W. Scherer, Sebastian Trimpe

Keywords Paper

0

0

0

0

18:49

03/05/2021

Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator

Max B Paulus, Chris Maddison, Andreas Krause

Keywords Paper

softmax, gumbel, rao-blackwell, rao, straightthrough, straight-through, gumbel-softmax

0

0

0

0

13:25

06/12/2021

Nonuniform Negative Sampling and Log Odds Correction with Rare Events Data

HaiYing Wang, Aonan Zhang, Chong Wang

Keywords Paper

0

0

0

0

14:58

06/12/2021

What’s a good imputation to predict with missing values?

Marine Le Morvan, Julie Josse, Erwan Scornet, Gael Varoquaux

Keywords Paper

deep learning

0

0

0

0

14:30

26/08/2020

Adaptive, Distribution-Free Prediction Intervals for Deep Networks

Danijel Kivaranovic, Kory D. Johnson, Hannes Leeb

Keywords Paper

0

0

0

0

16:48

04/07/2020

A Batch Normalized Inference Network Keeps the KL Vanishing Away

Qile Zhu, Wei Bi, Xiaojiang Liu and
Xiyao Ma, Xiaolin Li, Dapeng Wu

Keywords Paper

amortized inference, language modeling, text classification, dialogue generation

0

0

0

0

11:16

06/12/2020

Optimal Variance Control of the Score-Function Gradient Estimator for Importance-Weighted Bounds

Valentin Liévin, Andrea Dittadi, Anders Christensen, Ole Winther

Keywords Paper

0

0

0

0

3:06

18/07/2021

Demystifying Inductive Biases for (Beta-)VAE Based Architectures

Dominik Zietlow, Michal Rolinek, Georg Martius

Keywords Paper

Deep Learning, Adversarial Networks, Applications, Body Pose, Face, and Gesture Analysis; Applications, Computer Vision; Deep Learning, Generative Models, Deep Learning, Embedding and Representation learning

0

0

0

0

4:51

03/05/2021

Contextual Dropout: An Efficient Sample-Dependent Dropout Module

XINJIE FAN, Shujian Zhang, Korawat Tanwisuth and
Xiaoning Qian, Mingyuan Zhou

Keywords Paper

Supervised Deep Networks, Probabilistic Methods, Efficient Inference Methods

0

0

0

0

4:30

06/12/2020

Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

Benjamin Aubin, Florent Krzakala, Yue Lu, Lenka Zdeborová

Keywords Paper

0

0

0

0

3:08

06/12/2020

Learning discrete distributions with infinite support

Doron Cohen, Aryeh Kontorovich, Geoﬀrey Wolfer

Keywords Paper

0

0

0

0

2:59

06/12/2021

Implicit Generative Copulas

Tim Janke, Mohamed Ghanmi, Florian Steinke

Keywords Paper

deep learning, generative model

0

0

0

0

5:46

06/12/2021

Consistency Regularization for Variational Auto-Encoders

Samarth Sinha, Adji Bousso Dieng

Keywords Paper

deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

0

0

0

0

10:52

26/04/2020

Extreme Classification via Adversarial Softmax Approximation

Robert Bamler, Stephan Mandt

Keywords Paper

Extreme classification, negative sampling

0

0

0

0

5:04

19/08/2021

Independence-aware Advantage Estimation

Pushi Zhang, Li Zhao, Guoqing Liu and
Jiang Bian, Minlie Huang, Tao Qin, Tie-Yan Liu

Keywords Paper

Machine Learning, Reinforcement Learning, Deep Reinforcement Learning

0

0

0

0

14:58

26/04/2020

Probability Calibration for Knowledge Graph Embedding Models

Pedro Tabacof, Luca Costabello

Keywords Paper

knowledge graph embeddings, probability calibration, calibration, graph representation learning, knowledge graphs

0

0

0

0

5:36

26/04/2020

SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models

Yucen Luo, Alex Beatson, Mohammad Norouzi and
Jun Zhu, David Duvenaud, Ryan P. Adams, Ricky T. Q. Chen

Keywords Paper

0

0

0

0

5:14

12/07/2020

The continuous categorical: a novel simplex-valued exponential family

Elliott Gordon-Rodriguez, Gabriel Loaiza-Ganem, John Cunningham

Keywords Paper

Probabilistic Inference - Models and Probabilistic Programming

0

0

0

0

14:59

02/02/2021

Variance Penalized On-Policy and Off-Policy Actor-Critic

Arushi Jain, Gandharv Patil, Ayush Jain and
Khimya Khetarpal, Doina Precup

Keywords Paper

0

0

0

0

17:58

18/07/2021

Tighter Bounds on the Log Marginal Likelihood of Gaussian Process Regression Using Conjugate Gradients

Artem Artemev, David Burt, Mark van der Wilk

Keywords Paper

Probabilistic Methods, Gaussian Processes and Bayesian non-parametrics

0

0

0

0

17:13

06/12/2021

Double Machine Learning Density Estimation for Local Treatment Effects with Instruments

Yonghan Jung, Jin Tian, Elias Bareinboim

Keywords Paper

machine learning, causality

0

0

0

0

14:24

12/07/2020

Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords Paper

Deep Learning - General

0

0

0

0

15:02

13/04/2021

Automatic differentiation variational inference with mixtures

Warren Morningstar, Sharad Vikram, Cusuh Ham and
Andrew Gallagher, Joshua Dillon

Keywords Paper

0

0

0

0

3:05

06/12/2020

Correlation Robust Influence Maximization

Louis Chen, Divya Padmanabhan, Chee Chin Lim, Karthik Natarajan

Keywords Paper

0

0

0

0

3:19

26/04/2020

Identifying through Flows for Recovering Latent Representations

Shen Li, Bryan Hooi, Gim Hee Lee

Keywords Paper

Representation learning, identifiable generative models, nonlinear-ICA

0

0

0

0

5:11

03/05/2021

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang and
Michael Pfeiffer, Dan Zhang

Keywords Paper

deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

0

0

0

0

5:13

06/12/2020

Consistency Regularization for Certified Robustness of Smoothed Classifiers

Jongheon Jeong, Jinwoo Shin

Keywords Paper

0

0

0

0

3:16