Usable Information and Evolution of Optimal Representations During Training

03/05/2021

Usable Information and Evolution of Optimal Representations During Training

Michael Kleinman, Alessandro Achille, Daksh Idnani, Jonathan Kao

Keywords: Representation Learning, SGD, Learning Dynamics, Usable Information, Initialization

Abstract Paper Similar Papers

Abstract: We introduce a notion of usable information contained in the representation learned by a deep network, and use it to study how optimal representations for the task emerge during training. We show that the implicit regularization coming from training with Stochastic Gradient Descent with a high learning-rate and small batch size plays an important role in learning minimal sufficient representations for the task. In the process of arriving at a minimal sufficient representation, we find that the content of the representation changes dynamically during training. In particular, we find that semantically meaningful but ultimately irrelevant information is encoded in the early transient dynamics of training, before being later discarded. In addition, we evaluate how perturbing the initial part of training impacts the learning dynamics and the resulting representations. We show these effects on both perceptual decision-making tasks inspired by neuroscience literature, as well as on standard image classification tasks.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Function Contrastive Learning of Transferable Meta-Representations

Waleed Gondal, Shruti Joshi, Nasim Rahaman and
Stefan Bauer, Manuel Wuthrich, Bernhard Schölkopf

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:46

06/12/2020

Information-theoretic Task Selection for Meta-Reinforcement Learning

Ricardo Luna Gutierrez, Matteo Leonetti

Keywords Paper

0

0

0

0

2:57

06/12/2021

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Jiayao Zhang, Hua Wang, Weijie Su

Keywords Paper

deep learning, optimization

0

0

0

0

13:45

06/12/2021

What training reveals about neural network complexity

Andreas Loukas, Marinos Poiitis, Stefanie Jegelka

Keywords Paper

deep learning

0

0

0

0

8:29

01/07/2020

Multi-Action Dialog Policy Learning with Interactive Human Teaching

Megha Jhunjhunwala, Caleb Bryant, Pararth Shah

Keywords Paper

0

0

0

0

7:09

03/05/2021

On the Dynamics of Training Attention Models

Haoye Lu, Yongyi Mao, Amiya Nayak

Keywords Paper

0

0

0

0

5:09

26/04/2020

The Early Phase of Neural Network Training

Jonathan Frankle, David J. Schwab, Ari S. Morcos

Keywords Paper

empirical, learning dynamics, lottery tickets, critical periods, early

0

0

0

0

5:07

06/12/2020

Self-Distillation Amplifies Regularization in Hilbert Space

Hossein Mobahi, Mehrdad Farajtabar, Peter Bartlett

Keywords Paper

0

0

0

0

3:18

06/12/2020

Identifying Learning Rules From Neural Network Observables

Aran Nayebi, Sanjana Srivastava, Surya Ganguli, Daniel Yamins

Keywords Paper

0

0

0

0

3:12

18/07/2021

Zero-Shot Knowledge Distillation from a Decision-Based Black-Box Model

Zi Wang

Keywords Paper

Deep Learning

0

0

0

0

5:08

13/04/2021

Learning to defend by learning to attack

Haoming Jiang, Zhehui Chen, Yuyang Shi and
Bo Dai, Tuo Zhao

Keywords Paper

0

0

0

0

2:58

06/12/2020

Neural Complexity Measures

Yoonho Lee, Juho Lee, Sung Ju Hwang and
Eunho Yang, Seungjin Choi

Keywords Paper

0

0

0

0

3:22

06/12/2020

Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the Neural Tangent Kernel

Stanislav Fort, Gintare Karolina Dziugaite, Mansheej Paul and
Sepideh Kharaghani, Dan Roy, Surya Ganguli

Keywords Paper

0

0

0

0

3:26

06/12/2021

MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps

Awais Muhammad, Fengwei Zhou, Chuanlong Xie and
Jiawei Li, Sung-Ho Bae, Zhenguo Li

Keywords Paper

deep learning, optimization, robustness, adversarial robustness and security

0

0

0

0

12:51

14/09/2020

Partial Label Learning via Self-Paced Curriculum Strategy

Gengyu Lyu, Songhe Feng, Yi Jin, Yidong Li

Keywords Paper

partial-label learning, self-paced learning strategy, curriculum learning strategy, instructor-student-collaborative

0

0

0

0

6:46

18/07/2021

Bayesian Structural Adaptation for Continual Learning

Abhishek Kumar, Sunabha Chatterjee, Piyush Rai

Keywords Paper

Probabilistic Methods, Bayesian Methods

0

0

0

0

7:39

06/12/2020

Early-Learning Regularization Prevents Memorization of Noisy Labels

Sheng Liu, Jonathan Niles-Weed, Narges Razavian, Carlos Fernandez-Granda

Keywords Paper

0

0

0

0

3:06

06/12/2021

Meta Internal Learning

Raphael Bensadoun, Shir Gur, Tomer Galanti, Lior Wolf

Keywords Paper

vision, generative model, meta learning

0

0

0

0

7:41

04/11/2020

Retiarii: A Deep Learning Exploratory-Training Framework

Quanlu Zhang, Zhenhua Han, Fan Yang and
Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou

Keywords Paper

0

0

0

0

20:05

05/12/2020

Systematic generalization on gSCAN with language conditioned embedding

Tong Gao, Qi Huang, Raymond Mooney

Keywords Paper

0

0

0

0

14:19

01/07/2020

Self-Training for Unsupervised Parsing with PRPN

Anhad Mohananey, Katharina Kann, Samuel R. Bowman

Keywords Paper

0

0

0

0

7:39

06/12/2021

Meta-learning with an Adaptive Task Scheduler

Huaxiu Yao, Yu Wang, Ying Wei and
Peilin Zhao, Mehrdad Mahdavi, Defu Lian, Chelsea Finn

Keywords Paper

optimization, meta learning

0

0

0

0

15:12

03/05/2021

Dataset Meta-Learning from Kernel Ridge-Regression

Timothy Nguyen, Zhourong Chen, Jaehoon Lee

Keywords Paper

dataset corruption, infinite-width networks, neural kernels, kernel-ridge regression, dataset compression, dataset distillation, meta-learning

0

0

0

0

4:59

02/02/2021

Progressive Multi-task Learning with Controlled Information Flow for Joint Entity and Relation Extraction

Kai Sun, Richong Zhang, Samuel Mensah and
Yongyi Mao, Xudong Liu

Keywords Paper

0

0

0

0

13:45

19/08/2021

Regularising Knowledge Transfer by Meta Functional Learning

Pan Li, Yanwei Fu, Shaogang Gong

Keywords Paper

Machine Learning, Classification, Transfer, Adaptation, Multi-task Learning, Weakly Supervised Learning

0

0

0

0

13:41

07/09/2020

Zero-Shot Domain Generalization

Udit Maniyar, Joseph K J, Aniket Anand Deshmukh and
Urun Dogan, Vineeth N Balasubramanian

Keywords Paper

Domain Generalization, zero-shot learning, semantic space, multi task learning, Learning with limited data, representation learning, classification

0

0

0

0

9:59

06/12/2020

Learning Invariances in Neural Networks from Training Data

Greg Benton, Marc Finzi, Pavel Izmailov, Andrew Wilson

Keywords Paper

0

0

0

0

3:03

04/07/2020

Empowering Active Learning to Jointly Optimize System and User Demands

Ji-Ung Lee, Christian M. Meyer, Iryna Gurevych

Keywords Paper

educational application, Active Learning, end-user application, active approach

0

0

0

0

12:00

06/12/2020

Black-Box Ripper: Copying black-box models using generative evolutionary algorithms

Antonio Barbalau, Adrian Cosma, Radu Tudor Ionescu, Marius Popescu

Keywords Paper

0

0

0

0

3:18

26/04/2020

Sliced Cramer Synaptic Consolidation for Preserving Deeply Learned Representations

Soheil Kolouri, Nicholas A. Ketz, Andrea Soltoggio, Praveen K. Pilly

Keywords Paper

selective plasticity, catastrophic forgetting, intransigence

0

0

0

0

3:59

08/12/2020

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

Chen Xu, Bojie Hu, Yufan Jiang and
Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

Keywords Paper

0

0

0

0

13:28

14/06/2020

Self2Self With Dropout: Learning Self-Supervised Denoising From Single Image

Yuhui Quan, Mingqin Chen, Tongyao Pang, Hui Ji

Keywords Paper

image denoising, deep learning, unsupervised learning, self-supervised learning, single-image learning

0

0

0

0

1:01

04/07/2020

Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions

Xiaochuang Han, Byron C. Wallace, Yulia Tsvetkov

Keywords Paper

NLP, model prediction, model decisions, natural inference

0

0

0

0

11:57

26/04/2020

Adversarially robust transfer learning

Ali Shafahi, Parsa Saadatpanah, Chen Zhu and
Amin Ghiasi, Christoph Studer, David Jacobs, Tom Goldstein

Keywords Paper

0

0

0

0

4:58

16/11/2020

Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing

Xilun Chen, Asish Ghoshal, Yashar Mehdad and
Luke Zettlemoyer, Sonal Gupta

Keywords Paper

task-oriented parsing, low-resource adaptation, generalization, virtual assistants

0

0

0

0

11:11

02/02/2021

DeepCollaboration: Collaborative Generative and Discriminative Models for Class Incremental Learning

Bo Cui, Guyue Hu, Shan Yu

Keywords Paper

0

0

0

0

15:13

18/07/2021

Reinforcement Learning with Prototypical Representations

Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:15

20/07/2020

SelectNet: Learning to Sample from the Wild for Imbalanced Data Training

Yunru Liu, Tingran Gao, Haizhao Yang

Keywords Paper

0

0

0

0

20:50

26/04/2020

The Break-Even Point on Optimization Trajectories of Deep Neural Networks

Stanislaw Jastrzebski, Maciej Szymczak, Stanislav Fort and
Devansh Arpit, Jacek Tabor, Kyunghyun Cho, Krzysztof Geras

Keywords Paper

generalization, sgd, learning rate, batch size, hessian, curvature, trajectory, optimization

0

0

0

0

4:42

02/02/2021

Self-Supervised Self-Supervision by Combining Deep Learning and Probabilistic Logic

Hunter Lang, Hoifung Poon

Keywords Paper

0

0

0

0

18:09