06/12/2021

Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction

Konstantin Schürholt, Dimche Kostadinov, Damian Borth

Keywords: deep learning, self-supervised learning, representation learning

Abstract: Self-Supervised Learning (SSL) has been shown to learn useful and information-preserving representations. Neural Networks (NNs) are widely applied, yet their weight space is still not fully understood. We therefore propose to use SSL to learn neural representations of the weights of populations of NNs. To that end, we introduce domain-specific data augmentations and an adapted attention architecture. Our empirical evaluation demonstrates that self-supervised representation learning in this domain recovers diverse NN model characteristics. Further, we show that the learned representations outperform prior work at predicting hyperparameters, test accuracy, and generalization gap, and that they transfer to out-of-distribution settings.
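
To make the pipeline the abstract describes more concrete, below is a minimal sketch of contrastive self-supervised learning on NN weights. This is not the authors' code: the toy two-layer MLP population, the chunk-based tokenization, the noise scale, and the NT-Xent loss are illustrative assumptions. What it does reflect from the abstract is the core recipe: treat each model's weights as a data point, apply weight-space augmentations (here, hidden-neuron permutation, which preserves the network's function, plus small additive noise), and encode the tokenized weight vector with an attention architecture.

import torch
import torch.nn as nn
import torch.nn.functional as F

def permute_mlp_weights(w1, b1, w2):
    # Weight-space augmentation: permuting the hidden neurons of an MLP
    # (rows of w1/b1 and the matching columns of w2) preserves its function.
    perm = torch.randperm(w1.shape[0])
    return w1[perm], b1[perm], w2[:, perm]

def tokenize(w1, b1, w2, token_dim=16):
    # Flatten all weights into one vector and split it into fixed-size tokens.
    flat = torch.cat([w1.flatten(), b1, w2.flatten()])
    return flat.view(-1, token_dim)

class WeightEncoder(nn.Module):
    # Attention-based encoder mapping a tokenized weight vector to a representation.
    def __init__(self, num_tokens, token_dim, d_model=64, rep_dim=32):
        super().__init__()
        self.embed = nn.Linear(token_dim, d_model)
        self.pos = nn.Parameter(torch.zeros(1, num_tokens, d_model))
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, rep_dim)

    def forward(self, tokens):                # (batch, num_tokens, token_dim)
        h = self.encoder(self.embed(tokens) + self.pos)
        return self.head(h.mean(dim=1))       # one representation per model

def nt_xent(z1, z2, tau=0.1):
    # Standard NT-Xent contrastive loss between two augmented views.
    n = z1.shape[0]
    z = F.normalize(torch.cat([z1, z2]), dim=1)
    sim = (z @ z.t() / tau).masked_fill(torch.eye(2 * n, dtype=torch.bool), float('-inf'))
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(n)])
    return F.cross_entropy(sim, targets)

# Toy "model zoo" of 8 random 2-layer MLPs (8 -> 16 -> 4); 208 params -> 13 tokens.
population = [(torch.randn(16, 8), torch.randn(16), torch.randn(4, 16)) for _ in range(8)]
views = []
for _ in range(2):
    toks = torch.stack([tokenize(*permute_mlp_weights(*m)) for m in population])
    views.append(toks + 0.01 * torch.randn_like(toks))  # small additive noise
encoder = WeightEncoder(num_tokens=13, token_dim=16)
loss = nt_xent(encoder(views[0]), encoder(views[1]))
loss.backward()

Once such an encoder is trained on a model population, the downstream evaluation the abstract reports (predicting hyperparameters, test accuracy, or generalization gap) amounts to fitting a simple probe, e.g. a linear regressor, on the frozen representations.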

Talk and paper published at the NeurIPS 2021 virtual conference.
