Why Spectral Normalization Stabilizes GANs: Analysis and Improvements

06/12/2021

Zinan Lin, Vyas Sekar, Giulia Fanti

Keywords: generative model

Abstract: Spectral normalization (SN) is a widely used technique for improving the stability and sample quality of Generative Adversarial Networks (GANs). However, current understanding of SN's efficacy is limited. In this work, we show that SN controls two important failure modes of GAN training: exploding and vanishing gradients. Our proofs illustrate a (perhaps unintentional) connection with the successful LeCun initialization. This connection helps to explain why the most popular implementation of SN for GANs requires no hyper-parameter tuning, whereas stricter implementations of SN have poor empirical performance out-of-the-box. Unlike LeCun initialization, which only controls gradient vanishing at the beginning of training, SN preserves this property throughout training. Building on this theoretical understanding, we propose a new spectral normalization technique: Bidirectional Scaled Spectral Normalization (BSSN), which incorporates insights from later improvements to LeCun initialization: Xavier initialization and Kaiming initialization. Theoretically, we show that BSSN gives better gradient control than SN. Empirically, we demonstrate that it outperforms SN in sample quality and training stability on several benchmark datasets.
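
For readers unfamiliar with the operation the abstract refers to, the following is a minimal sketch of the textbook idea behind spectral normalization: divide a weight matrix by an estimate of its largest singular value, obtained here by power iteration. It is not the authors' BSSN, nor the specific SN implementation the abstract discusses. The use of NumPy, the function names `spectral_norm_estimate` and `spectrally_normalize`, the iteration count, and the random test matrix are all assumptions made for illustration.

```python
import numpy as np


def spectral_norm_estimate(w, n_iters=200, seed=0):
    """Estimate the largest singular value of a 2-D matrix by power iteration.

    Hypothetical helper for illustration; assumes w is a nonzero 2-D array.
    """
    rng = np.random.default_rng(seed)
    u = rng.standard_normal(w.shape[0])
    u /= np.linalg.norm(u)
    for _ in range(n_iters):
        # Alternate between the right and left singular directions.
        v = w.T @ u
        v /= np.linalg.norm(v)
        u = w @ v
        u /= np.linalg.norm(u)
    # With unit vectors u and v, u^T W v never exceeds the true spectral norm.
    return float(u @ w @ v)


def spectrally_normalize(w, n_iters=200):
    """Rescale w so that its largest singular value is approximately 1."""
    return w / spectral_norm_estimate(w, n_iters)


# Illustrative input: a random matrix standing in for a layer's weights.
rng = np.random.default_rng(1)
w = rng.standard_normal((64, 128))
w_sn = spectrally_normalize(w)

# Check against an exact SVD; the value should be close to 1.
top_singular_value = np.linalg.svd(w_sn, compute_uv=False)[0]
print(top_singular_value)
```

In a training loop this estimate is usually refreshed with only a few iterations per step and the vector `u` is carried over between steps; the sketch recomputes it from scratch to stay self-contained.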

The talk and the corresponding paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

26/04/2020

Consistency Regularization for Generative Adversarial Networks

Han Zhang, Zizhao Zhang, Augustus Odena, Honglak Lee

Keywords: Generative Adversarial Networks, Consistency Regularization, GAN

06/12/2020

Differentiable Augmentation for Data-Efficient GAN Training

Shengyu Zhao, Zhijian Liu, Ji Lin, Jun-Yan Zhu, Song Han

06/12/2020

Improving GAN Training with Probability Ratio Clipping and Sample Reweighting

Yue Wu, Pan Zhou, Andrew Wilson, Eric Xing, Zhiting Hu

14/06/2020

Watch Your Up-Convolution: CNN Based Generative Deep Neural Networks Are Failing to Reproduce Spectral Distributions

Ricard Durall, Margret Keuper, Janis Keuper

Keywords: spectral regularization, gan, deepfake, up-convolution, generative models, frequency spectrum

12/07/2020

Bridging the Gap Between f-GANs and Wasserstein GANs

Jiaming Song, Stefano Ermon

Keywords: Deep Learning - Generative Models and Autoencoders

02/02/2021

Sparsity Aware Normalization for GANs

Idan Kligvasser, Tomer Michaeli

03/05/2021

Training GANs with Stronger Augmentations via Contrastive Discriminator

Jongheon Jeong, Jinwoo Shin

Keywords: visual representation learning, contrastive learning, unsupervised learning, data augmentation, generative adversarial networks

06/12/2021

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Kuan-Lin Chen, Ching-Hua Lee, Harinath Garudadri, Bhaskar D Rao

Keywords: optimization, vision

12/07/2020

Implicit competitive regularization in GANs

Florian Schaefer, Hongkai Zheng, Anima Anandkumar

Keywords: Deep Learning - Theory

06/12/2020

Training Generative Adversarial Networks by Solving Ordinary Differential Equations

Chongli Qin, Yan Wu, Jost Tobias Springenberg, Andy Brock, Jeff Donahue, Timothy Lillicrap, Pushmeet Kohli

18/07/2021

Accumulated Decoupled Learning with Gradient Staleness Mitigation for Convolutional Neural Networks

Huiping Zhuang, Zhenyu Weng, Fulin Luo, Kar-Ann Toh, Haizhou Li, Zhiping Lin

Keywords: Optimization, Distributed and Parallel Optimization

14/06/2020

On Positive-Unlabeled Classification in GAN

Tianyu Guo, Chang Xu, Jiajun Huang, Yunhe Wang, Boxin Shi, Chao Xu, Dacheng Tao

Keywords: generative adversarial nets, positive-unlabeled classification, image generation, deep learning, computer vision

06/12/2021

Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

Liming Jiang, Bo Dai, Wayne Wu, Chen Change Loy

Keywords: generative model

02/02/2021

Distribution Adaptive INT8 Quantization for Training CNNs

Kang Zhao, Sida Huang, Pan Pan, Yinghan Li, Yingya Zhang, Zhenyu Gu, Yinghui Xu

06/12/2020

ColdGANs: Taming Language GANs with Cautious Sampling Strategies

Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

26/04/2020

Stable Rank Normalization for Improved Generalization in Neural Networks and GANs

Amartya Sanyal, Philip H. Torr, Puneet K. Dokania

Keywords: Generalization, regularization, empirical Lipschitz

03/05/2021

Influence Estimation for Generative Adversarial Networks

Naoyuki Terashita, Hiroki Ohashi, Yuichi Nonaka, Takashi Kanemaru

Keywords: influence, data cleansing, generative adversarial networks

14/06/2020

MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks

Animesh Karnewar, Oliver Wang

Keywords: msg-gan, multi-scale gradients, generative adversarial network, high-res image synthesis, gan training stability

26/04/2020

Language GANs Falling Short

Massimo Caccia, Lucas Caccia, William Fedus, Hugo Larochelle, Joelle Pineau, Laurent Charlin

Keywords: NLP, GAN, MLE, adversarial, text generation, temperature

14/06/2020

Alleviation of Gradient Exploding in GANs: Fake Can Be Real

Song Tao, Jia Wang

Keywords: gans, mode collapse, instability, unbalanced generation, gradient exploding, fake-as-real

02/02/2021

Understanding Catastrophic Overfitting in Single-step Adversarial Training

Hoki Kim, Woojin Lee, Jaewook Lee

26/08/2020

Adversarial Robustness of Flow-Based Generative Models

Phillip Pope, Yogesh Balaji, Soheil Feizi

02/02/2021

Fast and Scalable Adversarial Training of Kernel SVM via Doubly Stochastic Gradients

Huimin Wu, Zhengmian Hu, Bin Gu

26/04/2020

Effect of Activation Functions on the Training of Overparametrized Neural Nets

Abhishek Panigrahi, Abhishek Shetty, Navin Goyal

Keywords: activation functions, deep learning theory, neural networks

26/04/2020

Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

Wei Hu, Zhiyuan Li, Dingli Yu

Keywords: deep learning theory, regularization, noisy labels

12/07/2020

Towards Understanding the Dynamics of the First-Order Adversaries

Zhun Deng, Hangfeng He, Jiaoyang Huang, Weijie Su

Keywords: Adversarial Examples

06/12/2020

Towards Better Generalization of Adaptive Gradient Methods

Yingxue Zhou, Belhal Karimi, Jinxing Yu, Zhiqiang Xu, Ping Li

06/12/2020

Understanding and Improving Fast Adversarial Training

Maksym Andriushchenko, Nicolas Flammarion

14/06/2020

Single-Step Adversarial Training With Dropout Scheduling

Vivek B.S., R. Venkatesh Babu

Keywords: adversarial training, robustness, efficient training, representation learning, generalization, supervised learning, recognition, classification, neural networks, deep learning

06/12/2021

An Infinite-Feature Extension for Bayesian ReLU Nets That Fixes Their Asymptotic Overconfidence

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords: deep learning, kernel methods

18/07/2021

What Are Bayesian Neural Network Posteriors Really Like?

Pavel Izmailov, Sharad Vikram, Matt Hoffman, Andrew Wilson

Keywords: Deep Learning, Bayesian Deep Learning

05/01/2021

InfoMax-GAN: Improved Adversarial Image Generation via Information Maximization and Contrastive Learning

Kwot Sin Lee, Ngoc-Trung Tran, Ngai-Man Cheung

06/12/2021

Data-Efficient Instance Generation from Instance Discrimination

Ceyuan Yang, Yujun Shen, Yinghao Xu, Bolei Zhou

Keywords: machine learning, generative model

26/08/2020

On Minimax Optimality of GANs for Robust Mean Estimation

Kaiwen Wu, Gavin Weiguang Ding, Ruitong Huang, Yaoliang Yu

03/05/2021

Certify or Predict: Boosting Certified Robustness with Compositional Architectures

Mark Niklas Mueller, Mislav Balunovic, Martin Vechev

Keywords: Certified Robustness, Adversarial Accuracy, Network Architecture, Provable Robustness, Robustness

06/12/2021

Boost Neural Networks by Checkpoints

Feng Wang, Guoyizhe Wei, Qiao Liu, Jinxiang Ou, Xian Wei, Hairong Lv

Keywords: deep learning

14/06/2020

Noise Robust Generative Adversarial Networks

Takuhiro Kaneko, Tatsuya Harada

Keywords: generative adversarial networks (gans), image synthesis, noise robust models, image denoising, deep generative models, adversarial training, reparameterization trick, transformation constraint, image restoration, weakly supervised learning

06/12/2020

ContraGAN: Contrastive Learning for Conditional Image Generation

Minguk Kang, Jaesik Park

Keywords: Neuroscience and Cognitive Science -> Brain Mapping, Neuroscience and Cognitive Science -> Visual Perception

06/12/2020

Towards a Better Global Loss Landscape of GANs

Ruoyu Sun, Tiantian Fang, Alex Schwing

03/05/2021

Robust Overfitting may be mitigated by properly learned smoothening

Tianlong Chen, Zhenyu Zhang, Sijia Liu, Shiyu Chang, Zhangyang Wang

Keywords: Robust Overfitting, Adversarial Training, Adversarial Robustness