Improved OOD Generalization via Adversarial Training and Pretraing

18/07/2021

Improved OOD Generalization via Adversarial Training and Pretraing

Mingyang Yi, Lu Hou, Jiacheng Sun, Lifeng Shang, Xin Jiang, Qun Liu, Zhiming Ma

Keywords: Theory, Deep learning Theory

Abstract Paper Similar Papers

Abstract: Recently, learning a model that generalizes well on out-of-distribution (OOD) data has attracted great attention in the machine learning community. In this paper, after defining OOD generalization by Wasserstein distance, we theoretically justify that a model robust to input perturbation also generalizes well on OOD data. Inspired by previous findings that adversarial training helps improve robustness, we show that models trained by adversarial training have converged excess risk on OOD data. Besides, in the paradigm of pre-training then fine-tuning, we theoretically justify that the input perturbation robust model in the pre-training stage provides an initialization that generalizes well on downstream OOD data. Finally, various experiments conducted on image classification and natural language understanding tasks verify our theoretical findings.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Function Contrastive Learning of Transferable Meta-Representations

Waleed Gondal, Shruti Joshi, Nasim Rahaman and
Stefan Bauer, Manuel Wuthrich, Bernhard Schölkopf

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:46

07/09/2020

Adversarial Concurrent Training: Optimizing Robustness and Accuracy Trade-off of Deep Neural Networks

Elahe Arani, Fahad Sarfraz, Bahram Zonooz

Keywords Paper

Adversarial Robustness, Generalization, Adversarial Training, Deep Learning, Collaborative Learning

0

0

0

0

3:39

06/12/2021

On Calibration and Out-of-Domain Generalization

Yoav Wald, Amir Feder, Daniel Greenfeld, Uri Shalit

Keywords Paper

machine learning, domain adaptation, causality

0

0

0

0

11:00

03/05/2021

Improving Transformation Invariance in Contrastive Representation Learning

Adam Foster, Rattana Pukdee, Tom Rainforth

Keywords Paper

transformation invariance, contrastive learning, representation learning

0

0

0

0

5:23

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

02/02/2021

DeepCollaboration: Collaborative Generative and Discriminative Models for Class Incremental Learning

Bo Cui, Guyue Hu, Shan Yu

Keywords Paper

0

0

0

0

15:13

18/07/2021

Contrastive Learning Inverts the Data Generating Process

Roland S. Zimmermann, Yash Sharma, Steffen Schneider and
Matthias Bethge, Wieland Brendel

Keywords Paper

Theory, Deep learning Theory

1

1

0

1

5:17

02/02/2021

Learning a Few-shot Embedding Model with Contrastive Learning

Chen Liu, Yanwei Fu, Chengming Xu and
Siqian Yang, Jilin Li, Chengjie Wang, Li Zhang

Keywords Paper

0

0

0

0

15:02

18/07/2021

Scalable Normalizing Flows for Permutation Invariant Densities

Marin Biloš, Stephan Günnemann

Keywords Paper

Deep Learning, Generative Models

0

0

0

0

5:10

03/05/2021

Learning perturbation sets for robust machine learning

Eric Wong, Zico Kolter

Keywords Paper

conditional variational autoencoder, adversarial examples, perturbation sets, robust machine learning

0

1

0

0

5:06

06/12/2021

Adversarial Training Helps Transfer Learning via Better Representations

Zhun Deng, Linjun Zhang, Kailas Vodrahalli and
Kenji Kawaguchi, James Zou

Keywords Paper

theory, deep learning, adversarial robustness and security, transfer learning, semi-supervised learning

0

0

0

0

9:01

16/11/2020

Meta Fine-Tuning Neural Language Models for Multi-Domain Text Mining

Chengyu Wang, Minghui Qiu, Jun Huang, Xiaofeng He

Keywords Paper

nlp tasks, fine-tuning, learning process, multi-domain tasks

0

0

0

0

9:58

06/12/2020

Conditioning and Processing: Techniques to Improve Information-Theoretic Generalization Bounds

Hassan Hafez-Kolahi, Zeinab Golgooni, Shohreh Kasaei, Mahdieh Soleymani

Keywords Paper

0

0

0

0

3:25

18/07/2021

Improved Denoising Diffusion Probabilistic Models

Alexander Nichol, Prafulla Dhariwal

Keywords Paper

Deep Learning, Generative Models, Theory, Game Theory and Computational Economics, Reinforcement Learning and Planning, Multi-Agent RL

0

0

0

0

4:25

02/02/2021

Towards Trustworthy Predictions from Deep Neural Networks with Fast Adversarial Calibration

Christian Tomani, Florian Buettner

Keywords Paper

0

1

0

0

15:26

03/05/2021

Trusted Multi-View Classification

Zongbo Han, Changqing Zhang, Huazhu FU, Joey T Zhou

Keywords Paper

Uncertainty Machine Learning, Multi-View Learning, Multi-Modal Learning

0

0

0

0

4:33

03/05/2021

Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting

Sayna Ebrahimi, Suzanne Petryk, Akash Gokul and
William Gan, Joseph E Gonzalez, Marcus Rohrbach, trevor darrell

Keywords Paper

Explainability, Catastrophic Forgetting, Continual Learning, XAI, Lifelong Learning

0

0

0

0

5:13

06/12/2020

Do Adversarially Robust ImageNet Models Transfer Better?

Hadi Salman, Andrew Ilyas, Logan Engstrom and
Ashish Kapoor, Aleksander Madry

Keywords Paper

0

0

0

0

4:16

22/11/2021

Cross-Modal Generative Augmentation for Visual Question Answering

Zixu Wang, Yishu Miao, Lucia Specia

Keywords Paper

visual question answering, data augmentation, generative model, multimodal machine learning

0

0

0

0

2:49

03/05/2021

Dataset Meta-Learning from Kernel Ridge-Regression

Timothy Nguyen, Zhourong Chen, Jaehoon Lee

Keywords Paper

dataset corruption, infinite-width networks, neural kernels, kernel-ridge regression, dataset compression, dataset distillation, meta-learning

0

0

0

0

4:59

06/12/2021

A Framework to Learn with Interpretation

Jayneel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc

Keywords Paper

deep learning, interpretability

0

0

0

0

14:05

14/06/2020

CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus

Florian Kluger, Eric Brachmann, Hanno Ackermann and
Carsten Rother, Michael Ying Yang, Bodo Rosenhahn

Keywords Paper

robust estimator, reinforcement learning, self-supervised, unsupervised, multi-model, ransac, dataset, vanishing points, homography, 3d reconstruction

0

0

0

0

1:00

26/04/2020

Frequency-based Search-control in Dyna

Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand

Keywords Paper

Model-based reinforcement learning, search-control, Dyna, frequency of a signal

0

0

0

0

4:32

16/11/2020

Improving Grammatical Error Correction Models with Purpose-Built Adversarial Examples

Lihao Wang, Xiaoqing Zheng

Keywords Paper

grammatical correction, sequence-to-sequence learning, neural networks, gec

0

0

0

0

11:40

06/12/2020

Submodular Meta-Learning

Arman Adibi, Aryan Mokhtari, Hamed Hassani

Keywords Paper

0

0

0

0

3:17

03/05/2021

Exploring Balanced Feature Spaces for Representation Learning

Bingyi Kang, Yu Li, Sain Xie and
Zehuan Yuan, Jiashi Feng

Keywords Paper

Representation Learning, Contrastive Learning, Long-Tailed Recognition

0

0

0

0

7:18

14/06/2020

Better Captioning With Sequence-Level Exploration

Jia Chen, Qin Jin

Keywords Paper

caption, sequece-level, diversity, precision

0

0

0

0

0:57

03/05/2021

Neurally Augmented ALISTA

Freya Behrens, Jonathan Sauder, Peter Jung

Keywords Paper

learned ISTA, unrolled algorithms, compressed sensing, sparse reconstruction

0

0

0

0

5:18

05/01/2021

Analyzing Deep Neural Network's Transferability via Frechet Distance

Yifan Ding, Liqiang Wang, Boqing Gong

Keywords Paper

0

0

0

0

4:59

16/11/2020

PatchBERT: Just-in-Time, Out-of-Vocabulary Patching

Sangwhan Moon, Naoaki Okazaki

Keywords Paper

natural processing, downstream tasks, mitigation, large models

0

0

0

0

7:02

03/05/2021

Learning and Evaluating Representations for Deep One-Class Classification

Kihyuk Sohn, Chun-Liang Li, Jinsung Yoon and
Minho Jin, Tomas Pfister

Keywords Paper

self-supervised learning, deep one-class classification

0

0

0

1

5:13

06/12/2021

Adversarial Reweighting for Partial Domain Adaptation

Xiang Gu, Xi Yu, yan yang and
Jian Sun, Zongben Xu

Keywords Paper

domain adaptation

0

0

0

1

9:03

26/04/2020

Theory and Evaluation Metrics for Learning Disentangled Representations

Kien Do, Truyen Tran

Keywords Paper

disentanglement, metrics

0

0

0

0

3:37

05/01/2021

Few-Shot Learning via Feature Hallucination With Variational Inference

Qinxuan Luo, Lingfeng Wang, Jingguo Lv and
Shiming Xiang, Chunhong Pan

Keywords Paper

0

0

0

0

4:56

14/06/2020

Mnemonics Training: Multi-Class Incremental Learning Without Forgetting

Yaoyao Liu, Yuting Su, An-An Liu and
Bernt Schiele, Qianru Sun

Keywords Paper

incremental learning, continual learning, classification, recognition, transfer learning, representation learning, bilevel optimization, online learning, imagenet, cifar-100

0

0

0

0

5:01

03/05/2021

Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning

Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang

Keywords Paper

Unsupervised Learning, Variational Autoencoders, Unsupervised Meta-learning, Meta-Learning

0

0

0

0

13:31

06/12/2021

Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time

Ferran Alet, Maria Bauza, Kenji Kawaguchi and
Nurullah Giray Kuru, Tomás Lozano-Pérez, Leslie Kaelbling

Keywords Paper

deep learning, optimization, machine learning, self-supervised learning, meta learning

0

0

0

0

15:05

26/08/2020

Deep Active Learning: Unified and Principled Method for Query and Training

Changjian Shui, Fan Zhou, Christian Gagné, Boyu Wang

Keywords Paper

0

0

0

0

12:12

08/12/2020

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

Chen Xu, Bojie Hu, Yufan Jiang and
Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

Keywords Paper

0

0

0

0

13:28

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24