Learning Optimal Representations with the Decodable Information Bottleneck

06/12/2020

Yann Dubois, Douwe Kiela, David Schwab, Ramakrishna Vedantam

Abstract: We address the question of characterizing and finding optimal representations for supervised learning. Traditionally, this question has been tackled using the Information Bottleneck, which compresses the inputs while retaining information about the targets, in a decoder-agnostic fashion. In machine learning, however, our goal is not compression but rather generalization, which is intimately linked to the predictive family or decoder of interest (e.g. linear classifier). We propose the Decodable Information Bottleneck (DIB) that considers information retention and compression from the perspective of the desired predictive family. As a result, DIB gives rise to representations that are optimal in terms of expected test performance and can be estimated with guarantees. Empirically, we show that the framework can be used to enforce a small generalization gap on downstream classifiers and to predict the generalization ability of neural networks.

The talk and the corresponding paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

02/02/2021

Progressive Multi-task Learning with Controlled Information Flow for Joint Entity and Relation Extraction

Kai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu

13:45

06/12/2021

Gradient Starvation: A Learning Proclivity in Neural Networks

Mohammad Pezeshki, Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup, Guillaume Lajoie

Keywords: theory, deep learning, optimization, robustness

10:52

12/07/2020

MetaFun: Meta-Learning with Iterative Functional Updates

Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam Kosiorek, Yee Whye Teh

Keywords: Transfer, Multitask and Meta-learning

13:51

18/07/2021

The Lipschitz Constant of Self-Attention

Hyunjik Kim, George Papamakarios, Andriy Mnih

Keywords: Theory, Deep learning Theory

5:22

26/04/2020

Weakly Supervised Disentanglement with Guarantees

Rui Shu, Yining Chen, Abhishek Kumar, Stefano Ermon, Ben Poole

Keywords: disentanglement, theory of disentanglement, representation learning, generative models

4:42

26/04/2020

Functional Regularisation for Continual Learning with Gaussian Processes

Michalis K. Titsias, Jonathan Schwarz, Alexander G. de G. Matthews, Razvan Pascanu, Yee Whye Teh

Keywords: Continual Learning, Gaussian Processes, Lifelong learning, Incremental Learning

4:31

18/07/2021

On Recovering from Modeling Errors Using Testing Bayesian Networks

Haiying Huang, Adnan Darwiche

Keywords: Probabilistic Methods, Graphical Models

5:09

06/12/2020

Almost Surely Stable Deep Dynamics

Nathan Lawrence, Philip Loewen, Michael Forbes, Johan Backstrom, Bhushan Gopaluni

3:25

06/12/2021

Meta-Learning for Relative Density-Ratio Estimation

Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

Keywords: deep learning, machine learning, meta learning

8:56

18/07/2021

Sparsifying Networks via Subdifferential Inclusion

Sagar Verma, Jean-Christophe Pesquet

Keywords: Optimization, Convex Optimization

5:10

06/12/2021

Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

Kenneth Borup, Lars N Andersen

Keywords: theory, deep learning, optimization

6:00

18/07/2021

Towards Understanding Learning in Neural Networks with Linear Teachers

Roei Sarussi, Alon Brutzkus, Amir Globerson

Keywords: Probabilistic Methods, Theory, Probabilistic Methods, MCMC

5:22

13/04/2021

Learning with gradient descent and weakly convex losses

Dominic Richards, Mike Rabbat

3:20

06/12/2020

Learning with Operator-valued Kernels in Reproducing Kernel Krein Spaces

Akash Saha, Balamurugan Palaniappan

3:22

06/12/2021

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin, Danil Karpushkin, Dmitry Vetrov

Keywords: deep learning, optimization

14:26

06/12/2020

Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang

3:02

03/05/2021

Learning explanations that are hard to vary

Giambattista Parascandolo, Alexander Neitz, Antonio Orvieto, Luigi Gresele, Bernhard Schoelkopf

Keywords: invariances, gradient alignment, consistency

5:16

06/12/2021

Robust Implicit Networks via Non-Euclidean Contractions

Saber Jafarpour, Alexander Davydov, Anton Proskurnikov, Francesco Bullo

Keywords: theory, deep learning, machine learning, robustness, vision

14:59

18/07/2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Gu

Keywords: Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

5:16

06/12/2021

Towards Sample-efficient Overparameterized Meta-learning

Yue Sun, Adhyyan Narang, Ibrahim Gulluk, Samet Oymak, Maryam Fazel

Keywords: theory, machine learning, meta learning, representation learning, few shot learning

13:54

12/07/2020

Optimistic bounds for multi-output learning

Henry Reeve, Ata Kaban

Keywords: Supervised Learning

14:41

06/12/2020

Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks

Kenta Oono, Taiji Suzuki

3:22

12/07/2020

Supervised learning: no loss no cry

Richard Nock, Aditya Menon

Keywords: Learning Theory

15:18

03/05/2021

Optimal Rates for Averaged Stochastic Gradient Descent under Neural Tangent Kernel Regime

Atsushi Nitanda, Taiji Suzuki

Keywords: stochastic gradient descent, neural tangent kernel, over-parameterization, two-layer neural network

18:48

06/12/2021

A Mathematical Framework for Quantifying Transferability in Multi-source Transfer Learning

Xinyi Tong, Xiangxiang Xu, Shao-Lun Huang, Lizhong Zheng

Keywords: theory, deep learning, machine learning, vision, transfer learning

13:27

03/05/2021

NOVAS: Non-convex Optimization via Adaptive Stochastic Search for End-to-end Learning and Control

Ioannis Exarchos, Marcus A Pereira, Ziyi Wang, Evangelos Theodorou

Keywords: deep neural networks, deep FBSDEs, stochastic control, nested optimization

5:35

20/07/2020

A type of generalization error induced by initialization in deep neural networks

Yaoyu Zhang, Zhi-Qin John Xu, Tao Luo, Zheng Ma

17:33

12/07/2020

Scalable Differential Privacy with Certified Robustness in Adversarial Learning

Hai Phan, My T. Thai, Han Hu, Ruoming Jin, Tong Sun, Dejing Dou

Keywords: Privacy-preserving Statistics and Machine Learning

14:47

13/04/2021

Alternating direction method of multipliers for quantization

Tianjian Huang, Prajwal Singhania, Maziar Sanjabi, Pabitra Mitra, Meisam Razaviyayn

2:43

06/12/2021

Fractal Structure and Generalization Properties of Stochastic Optimization Algorithms

Alexander Camuto, George Deligiannidis, Murat Erdogdu, Mert Gurbuzbalaban, Umut Simsekli, Lingjiong Zhu

Keywords: theory, deep learning, optimization

14:36

14/06/2020

Semi-Supervised Semantic Segmentation With Cross-Consistency Training

Yassine Ouali, Céline Hudelot, Myriam Tami

Keywords: semantic segmentation, semi-supervised learning, consistency training, semi-supervised semantic segmentation

1:01

06/12/2021

Integrated Latent Heterogeneity and Invariance Learning in Kernel Space

Jiashuo Liu, Zheyuan Hu, Peng Cui, Bo Li, Zheyan Shen

Keywords: deep learning, reinforcement learning and planning, machine learning

11:11

06/12/2020

Early-Learning Regularization Prevents Memorization of Noisy Labels

Sheng Liu, Jonathan Niles-Weed, Narges Razavian, Carlos Fernandez-Granda

3:06

06/12/2021

Noether Networks: meta-learning useful conserved quantities

Ferran Alet, Dylan Doblar, Allan Zhou, Josh Tenenbaum, Kenji Kawaguchi, Chelsea Finn

Keywords: machine learning, vision, meta learning

11:18

18/07/2021

Stability and Generalization of Stochastic Gradient Methods for Minimax Problems

Yunwen Lei, Zhenhuan Yang, Tianbao Yang, Yiming Ying

Keywords: Theory, Statistical Learning Theory

16:24

06/12/2020

Self-Supervised Relational Reasoning for Representation Learning

Massimiliano Patacchiola, Amos Storkey

2:55

03/05/2021

Estimating informativeness of samples with Smooth Unique Information

Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini, Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

Keywords: dataset summarization, ntk, stability theory, sample information, information theory

6:05

18/07/2021

An Identifiable Double VAE For Disentangled Representations

Graziano Mita, Maurizio Filippone, Pietro Michiardi

Keywords: Deep Learning, Adversarial Networks, Deep Learning, Generative Models

4:51

26/04/2020

Target-Embedding Autoencoders for Supervised Representation Learning

Daniel Jarrett, Mihaela van der Schaar

Keywords: autoencoders, supervised learning, representation learning, target-embedding, label-embedding

10:47

06/12/2021

Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay

Ruosi Wan, Zhanxing Zhu, Xiangyu Zhang, Jian Sun

Keywords: deep learning, optimization, vision

14:08