Learning compositional functions via multiplicative weight updates

06/12/2020

Learning compositional functions via multiplicative weight updates

Jeremy Bernstein, Jiawei Zhao, Markus Meister, Ming-Yu Liu, Anima Anandkumar, Yisong Yue

Keywords:

Abstract Paper Similar Papers

Abstract: Compositionality is a basic structural feature of both biological and artificial neural networks. Learning compositional functions via gradient descent incurs well known problems like vanishing and exploding gradients, making careful learning rate tuning essential for real-world applications. This paper proves that multiplicative weight updates satisfy a descent lemma tailored to compositional functions. Based on this lemma, we derive Madam---a multiplicative version of the Adam optimiser---and show that it can train state of the art neural network architectures without learning rate tuning. We further show that Madam is easily adapted to train natively compressed neural networks by representing their weights in a logarithmic number system. We conclude by drawing connections between multiplicative weight updates and recent findings about synapses in biology.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Linear Transformers Are Secretly Fast Weight Programmers

Imanol Schlag, Kazuki Irie, Jürgen Schmidhuber

Keywords Paper

Deep Learning

0

0

0

0

5:18

06/12/2020

Characterizing emergent representations in a space of candidate learning rules for deep networks

Yinan Cao, Christopher Summerfield, Andrew Saxe

Keywords Paper

0

0

0

0

3:16

06/12/2020

Learning to Learn with Feedback and Local Plasticity

Jack Lindsey, Ashok Litwin-Kumar

Keywords Paper

0

0

0

0

3:01

06/12/2021

Scalable Rule-Based Representation Learning for Interpretable Classification

Zhuo Wang, Wei Zhang, Ning Liu, Jianyong Wang

Keywords Paper

optimization, machine learning, representation learning, interpretability

0

0

0

0

14:52

06/12/2021

Training Feedback Spiking Neural Networks by Implicit Differentiation on the Equilibrium State

Mingqing Xiao, Qingyan Meng, Zongpeng Zhang and
Yisen Wang, Zhouchen Lin

Keywords Paper

deep learning

0

0

0

0

12:22

06/12/2020

Kernelized information bottleneck leads to biologically plausible 3-factor Hebbian learning in deep networks

Roman Pogodin, Peter E Latham

Keywords Paper

Deep Learning -> Adversarial Networks, Algorithms -> Semi-Supervised Learning

0

0

0

0

2:30

06/12/2021

Differentiable Optimization of Generalized Nondecomposable Functions using Linear Programs

Zihang Meng, Lopamudra Mukherjee, Yichao Wu and
Vikas Singh, Sathya Narayanan Ravi

Keywords Paper

deep learning, optimization

0

0

0

0

13:21

06/12/2020

A meta-learning approach to (re)discover plasticity rules that carve a desired function into a neural network

Basile Confavreux, Friedemann Zenke, Everton Agnes and
Timothy Lillicrap, Tim Vogels

Keywords Paper

0

0

0

0

3:25

06/12/2020

MMA Regularization: Decorrelating Weights of Neural Networks by Maximizing the Minimal Angles

Zhennan Wang, Canqun Xiang, Wenbin Zou, Chen Xu

Keywords Paper

0

0

0

0

3:23

12/07/2020

Improving Molecular Design by Stochastic Iterative Target Augmentation

Kevin Yang, Wengong Jin, Kyle Swanson and
Regina Barzilay, Tommi Jaakkola

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

15:14

06/12/2021

Convergence and Alignment of Gradient Descent with Random Backpropagation Weights

Ganlin Song, Ruitu Xu, John Lafferty

Keywords Paper

deep learning, optimization

0

0

0

0

5:13

18/07/2021

Align, then memorise: the dynamics of learning with feedback alignment

Maria Refinetti, Stéphane d'Ascoli, Ruben Ohana, Sebastian Goldt

Keywords Paper

Theory, Models of Learning and Generalization

0

0

0

0

4:38

18/07/2021

LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning

Yuhuai Wu, Markus Rabe, Wenda Li and
Jimmy Ba, Roger Grosse, Christian Szegedy

Keywords Paper

Deep Learning

0

0

0

0

6:18

06/12/2021

Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks

Yuhang Li, Yufei Guo, Shanghang Zhang and
Shikuang Deng, Yongqing Hai, Shi Gu

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

6:19

23/08/2020

AutoFIS: Automatic feature interaction selection in factorization models for click-through rate prediction

Bin Liu, Chenxu Zhu, Guilin Li and
Weinan Zhang, Jincai Lai, Ruiming Tang, Xiuqiang He, Zhenguo Li, Yong Yu

Keywords Paper

feature selection, neural architecture search, recommendation, factorization machine

0

0

0

0

19:23

26/04/2020

Learning to Learn by Zeroth-Order Oracle

Yangjun Ruan, Yuanhao Xiong, Sashank Reddi and
Sanjiv Kumar, Cho-Jui Hsieh

Keywords Paper

learning to learn, zeroth-order optimization, black-box adversarial attack

0

0

0

0

4:48

03/05/2021

Optimal Conversion of Conventional Artificial Neural Networks to Spiking Neural Networks

Shikuang Deng, Shi Gu

Keywords Paper

second-order approximation, weight balance, spiking neural network

0

0

0

0

5:24

18/07/2021

Learning by Turning: Neural Architecture Aware Optimisation

Yang Liu, Jeremy Bernstein, Markus Meister, Yisong Yue

Keywords Paper

Deep Learning, Optimization for Deep Networks

0

0

0

0

4:42

26/04/2020

Differentiable learning of numerical rules in knowledge graphs

Po-Wei Wang, Daria Stepanova, Csaba Domokos, J. Zico Kolter

Keywords Paper

knowledge graphs, rule learning, differentiable neural logic

0

0

0

0

4:49

06/12/2021

Rethinking Neural Operations for Diverse Tasks

Nicholas Roberts, Mikhail Khodak, Tri Dao and
Liam Li, Christopher Ré, Ameet S Talwalkar

Keywords Paper

deep learning, machine learning

0

0

0

0

10:26

26/04/2020

Locally Constant Networks

Guang-He Lee, Tommi S. Jaakkola

Keywords Paper

0

0

0

0

4:44

19/08/2021

Pruning of Deep Spiking Neural Networks through Gradient Rewiring

Yanqi Chen, Zhaofei Yu, Wei Fang and
Tiejun Huang, Yonghong Tian

Keywords Paper

Humans and AI, Brain Sciences, Cognitive Modeling, Classification

0

0

0

0

12:58

02/02/2021

GENSYNTH: Synthesizing Datalog Programs without Language Bias

Jonathan Mendelson, Aaditya Naik, Mukund Raghothaman, Mayur Naik

Keywords Paper

0

0

0

0

20:03

12/07/2020

Deep Molecular Programming: A Natural Implementation of Binary-Weight ReLU Neural Networks

Marko Vasic, Cameron Chalk, Sarfraz Khurshid, David Soloveichik

Keywords Paper

Deep Learning - Theory

0

0

0

0

14:25

16/11/2020

Unsupervised Parsing with S-DIORA: Single Tree Encoding for Deep Inside-Outside Recursive Autoencoders

Andrew Drozdov, Subendhu Rongali, Yi-Pei Chen and
Tim O'Gorman, Mohit Iyyer, Andrew McCallum

Keywords Paper

fine-tuning, unsupervised parsing, deep autoencoder, diora

0

0

0

0

11:39

13/04/2021

LassoNet: Neural networks with feature sparsity

Ismael Lemhadri, Feng Ruan, Rob Tibshirani

Keywords Paper

0

0

0

0

3:13

06/12/2020

Meta-Learning through Hebbian Plasticity in Random Networks

Elias Najarro, Sebastian Risi

Keywords Paper

0

0

0

0

3:21

14/09/2020

A Generic and Model-Agnostic Exemplar Synthetization Framework for Explainable AI

Antonio Barbalau, Adrian Cosma, Radu Tudor Ionescu, Marius Popescu

Keywords Paper

explainable ai, black-box, generative modelling, evolutionary algorithm, prototype synthetization, exemplar generation

0

0

0

0

10:08

11/08/2020

A computational approach to packet classification

Alon Rashelbach, Ori Rottenstreich, Mark Silberstein

Keywords Paper

Neural Networks, Virtual Switches, Packet Classification

0

0

0

0

16:56

06/12/2021

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and
Danil Karpushkin, Dmitry Vetrov

Keywords Paper

deep learning, optimization

0

0

0

0

14:26

02/02/2021

Frivolous Units: Wider Networks Are Not Really That Wide

Stephen Casper, Xavier Boix, Vanessa D'Amario and
Ling Guo, Martin Schrimpf, Kasper Vinken, Gabriel Kreiman

Keywords Paper

0

0

0

0

18:04

14/06/2020

Towards Unified INT8 Training for Convolutional Neural Network

Feng Zhu, Ruihao Gong, Fengwei Yu and
Xianglong Liu, Yanfei Wang, Zhelong Li, Xiuqi Yang, Junjie Yan

Keywords Paper

int8 training, gradient quantization, direction sensitive gradient clipping, learning rate scaling, gradient distribution

0

0

0

0

1:01

12/07/2020

Operation-Aware Soft Channel Pruning using Differentiable Masks

Minsoo Kang, Bohyung Han

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:56

03/05/2021

BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction

Yuhang Li, Ruihao Gong, Xu Tan and
Yang Yang, Peng Hu, Qi Zhang, fengwei yu, Wei Wang, Shi Gu

Keywords Paper

Second-order analysis, Mixed Precision, Post Training Quantization

0

0

0

0

4:36

03/05/2021

Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch

Aojun Zhou, Yukun Ma, Junnan Zhu and
Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li

Keywords Paper

sparsity, efficient training and inference.

0

0

0

0

5:09

23/08/2020

AutoShuffleNet: Learning permutation matrices via an exact lipschitz continuous penalty in deep convolutional neural networks

Jiancheng Lyu, Shuai Zhang, Yingyong Qi, Jack Xin

Keywords Paper

shufflenet, permutation, lipschitz continuous penalty, convolutional neural network

0

0

0

0

13:06

26/08/2020

Learning in Gated Neural Networks

Ashok Makkuva, Sewoong Oh, Sreeram Kannan, Pramod Viswanath

Keywords Paper

0

0

0

0

14:42

02/02/2021

Adversarial Turing Patterns from Cellular Automata

Nurislam Tursynbek, Ilya Vilkoviskiy, Maria Sindeeva, Ivan Oseledets

Keywords Paper

0

0

0

0

14:50

06/12/2021

Second-Order Neural ODE Optimizer

Guan-Horng Liu, Tianrong Chen, Evangelos Theodorou

Keywords Paper

deep learning, optimization, machine learning, vision

0

0

0

0

14:59

13/04/2021

A theoretical characterization of semi-supervised learning with self-training for gaussian mixture models

Samet Oymak, Talha Cihad Gulcu

Keywords Paper

1

1

0

0

2:59