05/01/2021

Constrained Weight Optimization for Learning Without Activation Normalization

Daiki Ikami, Go Irie, Takashi Shibata

Abstract: Weight Normalization (WN) is an essential building block in deep learning. However, even state-of-the-art WN methods need to be combined with activation normalization methods, such as Batch Normalization (BN), to match BN's classification accuracy. In this paper, we aim to circumvent this issue with a weight normalization approach that can be used on its own while providing classification accuracy competitive with BN. Our approach mimics three fundamental properties of BN, namely, keeping the norm of the weights constant, setting the mean of the weights to zero, and simulating the stochastic perturbations caused by batch sampling bias. Unlike most existing WN methods, which rely on "reparametrization", our method directly optimizes the weights under proper constraints and thus avoids reparametrization's serious drawback, gradient explosion. Moreover, we propose an efficient and easy-to-implement algorithm that solves our constrained optimization problem without sacrificing its benefits. The results of classification experiments on three popular benchmark datasets demonstrate that our method is highly competitive with, or even better than, state-of-the-art normalization methods.
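
To make the three properties concrete, below is a minimal NumPy sketch of one plausible way to realize them as a projected-gradient update: after each gradient step, every filter is re-centered to zero mean and rescaled to a fixed norm, and optional Gaussian weight noise stands in for the simulated batch-sampling perturbation. The per-row projection, the function names (project_weights, constrained_step), and the additive Gaussian noise are illustrative assumptions for this sketch, not the paper's actual algorithm.

    import numpy as np

    def project_weights(W, target_norm, eps=1e-12):
        """Project each row of W (one output unit's filter) onto the
        constraint set named in the abstract: zero mean, constant norm."""
        W = W - W.mean(axis=1, keepdims=True)           # zero-mean constraint
        norms = np.linalg.norm(W, axis=1, keepdims=True)
        return target_norm * W / (norms + eps)          # constant-norm constraint

    def constrained_step(W, grad, lr, target_norm, noise_std=0.0, rng=None):
        """One projected-gradient step. The optional Gaussian weight noise
        is an assumed stand-in for the stochastic perturbation due to
        batch sampling bias that the abstract mentions."""
        rng = rng or np.random.default_rng()
        W = W - lr * grad                               # plain gradient step
        if noise_std > 0.0:
            W = W + noise_std * rng.standard_normal(W.shape)
        return project_weights(W, target_norm)          # re-enter feasible set

    # Hypothetical usage with a placeholder gradient:
    rng = np.random.default_rng(0)
    W = project_weights(rng.standard_normal((64, 256)), target_norm=1.0)
    grad = rng.standard_normal(W.shape)
    W = constrained_step(W, grad, lr=0.1, target_norm=1.0, noise_std=1e-3)

Note that projecting after the step keeps the weights feasible without introducing a reparametrized variable, which loosely mirrors the distinction the abstract draws between direct constrained optimization and reparametrization-based WN.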
