Your classifier is secretly an energy based model and you should treat it like one

26/04/2020

Your classifier is secretly an energy based model and you should treat it like one

Will Grathwohl, Kuan-Chieh Wang, Joern-Henrik Jacobsen, David Duvenaud, Mohammad Norouzi, Kevin Swersky

Keywords: energy based models, adversarial robustness, generative models, out of distribution detection, outlier detection, hybrid models, robustness, calibration

Abstract Paper Code Similar Papers

Abstract: We propose to reinterpret a standard discriminative classifier of p(y|x) as an energy based model for the joint distribution p(x, y). In this setting, the standard class probabilities can be easily computed as well as unnormalized values of p(x) and p(x|y). Within this framework, standard discriminative architectures may be used and the model can also be trained on unlabeled data. We demonstrate that energy based training of the joint distribution improves calibration, robustness, and out-of-distribution detection while also enabling our models to generate samples rivaling the quality of recent GAN approaches. We improve upon recently proposed techniques for scaling up the training of energy based models and present an approach which adds little overhead compared to standard classification training. Our approach is the first to achieve performance rivaling the state-of-the-art in both generative and discriminative learning within one hybrid model.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Flow Contrastive Estimation of Energy-Based Models

Ruiqi Gao, Erik Nijkamp, Diederik P. Kingma and
Zhen Xu, Andrew M. Dai, Ying Nian Wu

Keywords Paper

energy-based model, flow-based model, generative model, glow, noise contrastive estimation, semi-supervised learning, gan, probabilistic model.

0

0

0

0

5:01

14/06/2020

APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

Tianzhe Wang, Kuan Wang, Han Cai and
Ji Lin, Zhijian Liu, Hanrui Wang, Yujun Lin, Song Han

Keywords Paper

efficiency, model compression, joint design, neural architecture search, channel pruning, mixed-precision quantization

0

0

0

0

1:00

18/07/2021

Improved Denoising Diffusion Probabilistic Models

Alexander Nichol, Prafulla Dhariwal

Keywords Paper

Deep Learning, Generative Models, Theory, Game Theory and Computational Economics, Reinforcement Learning and Planning, Multi-Agent RL

0

0

0

0

4:25

06/12/2020

Revisiting the Sample Complexity of Sparse Spectrum Approximation of Gaussian Processes

Minh Hoang, Nghia Hoang, Hai Pham, David Woodruff

Keywords Paper

, Deep Learning

0

0

0

0

3:25

18/07/2021

Learning Neural Network Subspaces

Mitchell Wortsman, Maxwell Horton, Carlos Guestrin and
Ali Farhadi, Mohammad Rastegari

Keywords Paper

Deep Learning, Applications, Dialog- or Communication-Based Learning, Algorithms, Representation Learning

0

0

0

0

5:07

03/05/2021

No MCMC for me: Amortized sampling for fast and stable training of energy-based models

Will Grathwohl, Jacob Kelly, Milad Hashemi and
Mohammad Norouzi, Kevin Swersky, David Duvenaud

Keywords Paper

EBM, Generative Models, semi-supervised learning, Energy-Based Models, JEM, Energy Based Models

0

0

0

0

5:37

18/07/2021

A Discriminative Technique for Multiple-Source Adaptation

Corinna Cortes, Mehryar Mohri, Ananda Theertha Suresh, Ningshan Zhang

Keywords Paper

Applications, , Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

1

4:49

03/05/2021

Learning Energy-Based Generative Models via Coarse-to-Fine Expanding and Sampling

Yang Zhao, Jianwen Xie, Ping Li

Keywords Paper

generative model, image translation, Energy-based model

0

0

0

0

5:57

12/07/2020

Enhancing Simple Models by Exploiting What They Already Know

Amit Dhurandhar, Karthikeyan Shanmugam, Ronny Luss

Keywords Paper

Supervised Learning

0

0

0

0

13:57

26/04/2020

Training binary neural networks with real-to-binary convolutions

Brais Martinez, Jing Yang, Adrian Bulat, Georgios Tzimiropoulos

Keywords Paper

binary networks

0

0

0

0

4:41

26/08/2020

Practical Nonisotropic Monte Carlo Sampling in High Dimensions via Determinantal Point Processes

Krzysztof Choromanski, Aldo Pacchiano, Jack Parker-Holder, Yunhao Tang

Keywords Paper

0

0

0

0

12:42

03/05/2021

Adversarial score matching and improved sampling for image generation

Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Ioannis Mitliagkas, Remi Combes

Keywords Paper

score matching, adversarial, generative model, GAN, Langevin dynamics

0

0

0

0

4:56

06/12/2020

Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration

Hanjun Dai, Rishabh Singh, Bo Dai and
Charles Sutton, Dale Schuurmans

Keywords Paper

0

0

0

0

3:23

12/07/2020

Conditional Augmentation for Generative Modeling

Heewoo Jun, Rewon Child, Mark Chen and
John Schulman, Aditya Ramesh, Alec Radford, Ilya Sutskever

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

16:07

12/07/2020

Learning Deep Kernels for Non-Parametric Two-Sample Tests

Feng Liu, Wenkai Xu, Jie Lu and
Guangquan Zhang, Arthur Gretton, D.J. Sutherland

Keywords Paper

General Machine Learning Techniques

0

0

0

0

15:00

26/08/2020

Gaussianization Flows

Chenlin Meng, Yang Song, Jiaming Song, Stefano Ermon

Keywords Paper

0

0

0

0

11:21

02/02/2021

Augmented Partial Mutual Learning with Frame Masking for Video Captioning

Ke Lin, Zhuoxin Gan, Liwei Wang

Keywords Paper

0

0

0

0

16:57

12/07/2020

Bridging the Gap Between f-GANs and Wasserstein GANs

Jiaming Song, Stefano Ermon

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

12:06

26/08/2020

Kernel Conditional Density Operators

Ingmar Schuster, Mattes Mollenhauer, Stefan Klus, Krikamol Muandet

Keywords Paper

0

0

0

0

14:59

02/02/2021

Harmonized Dense Knowledge Distillation Training for Multi-Exit Architectures

Xinglu Wang, Yingming Li

Keywords Paper

0

0

0

0

15:12

06/12/2020

Analytical Probability Distributions and Exact Expectation-Maximization for Deep Generative Networks

Randall Balestriero, Sebastien PARIS, Richard Baraniuk

Keywords Paper

0

0

0

0

3:17

06/12/2020

ICE-BeeM: Identifiable Conditional Energy-Based Deep Models Based on Nonlinear ICA

Ilyes Khemakhem, Ricardo Monti, Diederik P. Kingma, Aapo Hyvarinen

Keywords Paper

0

0

0

0

3:02

12/07/2020

Model Fusion with Kullback--Leibler Divergence

Sebastian Claici, Mikhail Yurochkin, Soumya Ghosh, Justin Solomon

Keywords Paper

Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

0

0

0

0

9:58

06/12/2021

Score-based Generative Modeling in Latent Space

Arash Vahdat, Karsten Kreis, Jan Kautz

Keywords Paper

generative model

0

0

0

0

14:53

04/07/2020

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

Fynn Schröder, Chris Biemann

Keywords Paper

multi-task tasks, MTL, TL, MTL setups

0

0

0

0

12:02

05/01/2021

Domain Impression: A Source Data Free Domain Adaptation Method

Vinod K. Kurmi, Venkatesh K. Subramanian, Vinay P. Namboodiri

Keywords Paper

0

0

0

0

5:06

30/11/2020

MatchGAN: A Self-Supervised Semi-Supervised Conditional Generative Adversarial Network

Jiaze Sun, Binod Bhattarai, Tae-Kyun Kim

Keywords Paper

0

0

0

0

8:00

06/12/2021

Bridging Explicit and Implicit Deep Generative Models via Neural Stein Estimators

Qitian Wu, Rui Gao, Hongyuan Zha

Keywords Paper

generative model

0

0

0

0

12:51

03/05/2021

Generalized Energy Based Models

Michael Arbel, Liang Zhou, Arthur Gretton

Keywords Paper

Generative Models, Optimization, Density estimation, Adversarial training, MCMC, Sampling

0

0

0

0

4:42

26/04/2020

Single Episode Policy Transfer in Reinforcement Learning

Jiachen Yang, Brenden Petersen, Hongyuan Zha, Daniel Faissol

Keywords Paper

transfer learning, reinforcement learning

0

0

0

0

4:52

07/09/2020

Image Harmonization with Attention-based Deep Feature Modulation

Guoqing Hao, Satoshi Iizuka, Kazuhiro Fukui

Keywords Paper

image harmonization, feature map modulation, attention

0

0

0

0

5:03

14/06/2020

Diverse Image Generation via Self-Conditioned GANs

Steven Liu, Tongzhou Wang, David Bau and
Jun-Yan Zhu, Antonio Torralba

Keywords Paper

generative adversarial networks, image synthesis, mode collapse, clustering, unsupervised learning

0

0

0

0

1:00

03/05/2021

FairBatch: Batch Selection for Model Fairness

Yuji Roh, Kangwook Lee, Steven Whang, Changho Suh

Keywords Paper

bilevel optimization, batch selection, model fairness

0

0

0

0

5:04

14/06/2020

Exemplar Normalization for Learning Deep Representation

Ruimao Zhang, Zhanglin Peng, Lingyun Wu and
Zhen Li, Ping Luo

Keywords Paper

normalization, learning to normalize, sample-adaptive, deep learning, image classification, semantic segmentation

0

0

0

0

1:00

13/04/2021

High-dimensional multi-task averaging and application to kernel mean embedding

Hannah Marienwald, Jean-Baptiste Fermanian, Gilles Blanchard

Keywords Paper

0

0

0

0

3:01

02/02/2021

SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation

Jiabin Zhang, Zheng Zhu, Jiwen Lu and
Junjie Huang, Guan Huang, Jie Zhou

Keywords Paper

0

0

0

0

15:17

05/01/2021

Learning Fast Converging, Effective Conditional Generative Adversarial Networks With a Mirrored Auxiliary Classifier

Zi Wang

Keywords Paper

0

0

0

0

4:59

03/08/2020

C-MI-GAN : Estimation of Conditional Mutual Information using MinMax formulation

Arnab Mondal, Arnab Bhattacharjee, Sudipto Mukherjee and
Himanshu Asnani, Sreeram Kannan, Prathosh A P

Keywords Paper

0

0

0

0

7:56

03/05/2021

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

Amy Zhang, Shagun Sodhani, Khimya Khetarpal, Joelle Pineau

Keywords Paper

bisimulation, block mdp, hidden-parameter mdp, multi-task reinforcement learning

0

0

0

0

4:17

06/12/2020

A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions

Wei Deng, Guang Lin, Faming Liang

Keywords Paper

0

0

0

0

3:26