Shuffling Recurrent Neural Networks

02/02/2021

Shuffling Recurrent Neural Networks

Michael Rotman, Lior Wolf

Keywords:

Abstract Paper Similar Papers

Abstract: We propose a novel recurrent neural network model, where the hidden state hₜ is obtained by permuting the vector elements of the previous hidden state hₜ₋₁ and adding the output of a learned function β(xₜ) of the input xₜ at time t. In our model, the prediction is given by a second learned function, which is applied to the hidden state s(hₜ). The method is easy to implement, extremely efficient, and does not suffer from vanishing nor exploding gradients. In an extensive set of experiments, the method shows competitive results, in comparison to the leading literature baselines. We share our implementation at https://github.com/rotmanmi/SRNN.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948774

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Spectra of the Conjugate Kernel and Neural Tangent Kernel for linear-width neural networks

Zhou Fan, Zhichao Wang

Keywords Paper

0

0

0

0

3:25

06/12/2021

Robust Implicit Networks via Non-Euclidean Contractions

Saber Jafarpour, Alexander Davydov, Anton Proskurnikov, Francesco Bullo

Keywords Paper

theory, deep learning, machine learning, robustness, vision

0

0

0

0

14:59

06/12/2020

Network Diffusions via Neural Mean-Field Dynamics

shushan He, Hongyuan Zha, Xiaojing Ye

Keywords Paper

0

0

0

0

3:21

18/07/2021

Dynamic Game Theoretic Neural Optimizer

Guan-Horng Liu, CHEN Chen, Evangelos Theodorou

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

16:40

03/05/2021

DrNAS: Dirichlet Neural Architecture Search

Xiangning Chen, Ruochen Wang, Minhao Cheng and
Xiaocheng Tang, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

5:00

19/08/2021

Sensitivity Direction Learning with Neural Networks Using Domain Knowledge as Soft Shape Constraints

Kazuyuki Wakasugi

Keywords Paper

Machine Learning, Explainable/Interpretable Machine Learning, Constraints and Data Mining; Constraints and Machine Learning

0

0

0

0

14:52

18/07/2021

Sparsifying Networks via Subdifferential Inclusion

Sagar Verma, Jean-Christophe Pesquet

Keywords Paper

Optimization, Convex Optimization

0

0

0

0

5:10

13/04/2021

Fast adaptation with linearized neural networks

Wesley Maddox, Shuai Tang, Pablo Moreno and
Andrew Gordon Wilson, Andreas Damianou

Keywords Paper

0

0

0

0

3:13

06/12/2021

Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

Kenneth Borup, Lars N Andersen

Keywords Paper

theory, deep learning, optimization

0

0

0

0

6:00

06/12/2020

Matrix Inference and Estimation in Multi-Layer Models

Parthe Pandit, Moji Sahraee Ardakan, Sundeep Rangan and
Phil Schniter, Alyson Fletcher

Keywords Paper

0

0

0

0

3:24

02/02/2021

Meta-Learning Framework with Applications to Zero-Shot Time-Series Forecasting

Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio

Keywords Paper

0

0

0

0

17:41

18/07/2021

On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent

Shahar Azulay, Edward Moroshko, Mor Shpigel Nacson and
Blake Woodworth, Nati Srebro, Amir Globerson, Daniel Soudry

Keywords Paper

, Probabilistic Methods, MCMC, Theory, Deep learning Theory

0

0

0

0

15:38

03/05/2021

Dataset Meta-Learning from Kernel Ridge-Regression

Timothy Nguyen, Zhourong Chen, Jaehoon Lee

Keywords Paper

dataset corruption, infinite-width networks, neural kernels, kernel-ridge regression, dataset compression, dataset distillation, meta-learning

0

0

0

0

4:59

26/04/2020

Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks

Wei Hu, Lechao Xiao, Jeffrey Pennington

Keywords Paper

deep learning theory, non-convex optimization, orthogonal initialization

0

0

0

0

5:10

12/07/2020

Information-Theoretic Local Minima Characterization and Regularization

Zhiwei Jia, Hao Su

Keywords Paper

Deep Learning - Theory

0

0

0

0

14:11

18/07/2021

The Heavy-Tail Phenomenon in SGD

Mert Gurbuzbalaban, Umut Simsekli, Lingjiong Zhu

Keywords Paper

Optimization, Stochastic Optimization

0

0

0

0

5:37

06/12/2020

Can Temporal-Diﬀerence and Q-Learning Learn Representation? A Mean-Field Theory

Yufeng Zhang, Qi Cai, Zhuoran Yang and
Yongxin Chen, Zhaoran Wang

Keywords Paper

0

0

0

0

3:02

03/05/2021

The Recurrent Neural Tangent Kernel

Sina Alemohammad, Jack Wang, Randall Balestriero, Richard Baraniuk

Keywords Paper

Gaussian Process, Recurrent Neural Network, Neural Tangent Kernel, Overparameterization

0

0

0

0

4:44

06/12/2021

$(\textrm{Implicit})^2$: Implicit Layers for Implicit Representations

Zhichun Huang, Shaojie Bai, J. Zico Kolter

Keywords Paper

deep learning, representation learning

1

0

0

1

12:23

02/02/2021

A General Class of Transfer Learning Regression without Implementation Cost

Shunya Minami, Song Liu, Stephen Wu and
Kenji Fukumizu, Ryo Yoshida

Keywords Paper

0

0

0

0

14:13

06/12/2021

A Near-Optimal Algorithm for Debiasing Trained Machine Learning Models

Ibrahim Alabdulmohsin, Mario Lucic

Keywords Paper

deep learning, machine learning, fairness

0

0

0

0

9:48

12/07/2020

Unique Properties of Wide Minima in Deep Networks

Rotem Mulayoff, Tomer Michaeli

Keywords Paper

Deep Learning - Theory

0

0

0

0

14:35

26/04/2020

RNNs Incrementally Evolving on an Equilibrium Manifold: A Panacea for Vanishing and Exploding Gradients?

Anil Kag, Ziming Zhang, Venkatesh Saligrama

Keywords Paper

novel recurrent neural architectures, learning representations of outputs or states

0

0

0

0

5:03

06/12/2020

Model Fusion via Optimal Transport

Sidak Pal Singh, Martin Jaggi

Keywords Paper

1

0

0

1

3:10

06/12/2021

Improving Compositionality of Neural Networks by Decoding Representations to Inputs

Mike Wu, Noah Goodman, Stefano Ermon

Keywords Paper

deep learning, machine learning, adversarial robustness and security, generative model

0

0

0

0

12:36

18/07/2021

FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Analysis

Baihe Huang, Xiaoxiao Li, Zhao Song, Xin Yang

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

4:49

03/05/2021

Coupled Oscillatory Recurrent Neural Network (coRNN): An accurate and (gradient) stable architecture for learning long time dependencies

T. Konstantin Rusch, Siddhartha Mishra

Keywords Paper

Long-term dependencies, Gradient stability, Oscillators, RNNs

0

0

0

0

13:38

04/08/2021

Modeling from Features: a Mean-field Framework for Over-parameterized Deep Neural Networks

Cong Fang, Jason Lee, Pengkun Yang, Tong Zhang

Keywords Paper

0

0

0

0

15:10

06/12/2021

Meta-Learning for Relative Density-Ratio Estimation

Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

Keywords Paper

deep learning, machine learning, meta learning

0

0

0

0

8:56

26/04/2020

Conservative Uncertainty Estimation By Fitting Prior Networks

Kamil Ciosek, Vincent Fortuin, Ryota Tomioka and
Katja Hofmann, Richard Turner

Keywords Paper

uncertainty quantification, deep learning, Gaussian process, epistemic uncertainty, random network, prior, Bayesian inference

0

0

0

1

5:06

12/07/2020

Operation-Aware Soft Channel Pruning using Differentiable Masks

Minsoo Kang, Bohyung Han

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:56

06/12/2020

Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach

Luofeng Liao, You-Lin Chen, Zhuoran Yang and
Bo Dai, Mladen Kolar, Zhaoran Wang

Keywords Paper

Theory -> Information Theory, Algorithms -> Stochastic Methods

0

0

0

0

3:23

26/04/2020

Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization

Satrajit Chatterjee

Keywords Paper

generalization, deep learning

0

0

0

0

5:01

06/12/2020

Benchmarking Deep Inverse Models over time, and the Neural-Adjoint method

Ben Ren, Willie Padilla, Jordan Malof

Keywords Paper

0

0

0

0

3:17

03/05/2021

Neural Jump Ordinary Differential Equations: Consistent Continuous-Time Prediction and Filtering

Calypso Herrera, Florian Krach, Josef Teichmann

Keywords Paper

irregular-observed data modelling, conditional expectation, Neural ODE

0

0

0

0

3:50

06/12/2021

Faster Directional Convergence of Linear Neural Networks under Spherically Symmetric Data

Dachao Lin, Ruoyu Sun, Zhihua Zhang

Keywords Paper

deep learning, optimization

0

0

0

0

11:29

02/02/2021

Lipschitz Lifelong Reinforcement Learning

Erwan Lecarpentier, David Abel, Kavosh Asadi and
Yuu Jinnai, Emmanuel Rachelson, Michael L. Littman

Keywords Paper

1

1

0

0

15:53

26/08/2020

A Linear-time Independence Criterion Based on a Finite Basis Approximation

Longfei Yan, W. Bastiaan Kleijn, thushara abhayapala

Keywords Paper

0

0

0

0

12:09

09/07/2020

Kernel and Rich Regimes in Overparametrized Models

Blake E Woodworth, Suriya Gunasekar, Jason Lee and
Edward Moroshko, Pedro Henrique Pamplona Savarese, Itay Golan, Daniel Soudry, Nathan Srebro

Keywords Paper

Neural networks/deep learning,

0

0

0

0

13:29

06/12/2021

Joint Inference for Neural Network Depth and Dropout Regularization

Kishan K C, Rui Li, MohammadMahdi Gilany

Keywords Paper

deep learning, generative model, continual learning

0

0

0

0

11:01