Learning explanations that are hard to vary

03/05/2021

Giambattista Parascandolo, Alexander Neitz, Antonio Orvieto, Luigi Gresele, Bernhard Schoelkopf

Keywords: invariances, gradient alignment, consistency

Abstract: In this paper, we investigate the principle that good explanations are hard to vary in the context of deep learning. We show that averaging gradients across examples (akin to a logical OR of patterns) can favor memorization and "patchwork" solutions that sew together different strategies, instead of identifying invariances. To inspect this, we first formalize a notion of consistency for minima of the loss surface, which measures to what extent a minimum appears only when examples are pooled. We then propose and experimentally validate a simple alternative algorithm based on a logical AND, which focuses on invariances and prevents memorization in a set of real-world tasks. Finally, using a synthetic dataset with a clear distinction between invariant and spurious mechanisms, we dissect learning signals and compare this approach to well-established regularizers.
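
To make the OR/AND contrast in the abstract concrete, here is a minimal sketch in plain Python. It is an illustration under assumptions, not the authors' implementation: it reads the logical AND as keeping a gradient component only when its sign agrees across groups of examples (called environments here). The names `average_gradient`, `and_mask_gradient`, and `agreement_threshold` are hypothetical and do not come from the abstract.

```python
from typing import List, Sequence


def _sign(x: float) -> int:
    """Return -1, 0, or 1 depending on the sign of x."""
    return (x > 0) - (x < 0)


def average_gradient(env_grads: Sequence[Sequence[float]]) -> List[float]:
    """Plain averaging across environments (the 'logical OR' behaviour).

    A component survives as long as the environments do not cancel out,
    even if only one of them supports it.
    """
    n = len(env_grads)
    return [sum(column) / n for column in zip(*env_grads)]


def and_mask_gradient(
    env_grads: Sequence[Sequence[float]],
    agreement_threshold: float = 1.0,
) -> List[float]:
    """Averaging followed by a sign-agreement mask (an assumed 'AND' rule).

    For each parameter, the agreement score is |mean of signs| in [0, 1].
    It equals 1.0 only when every environment pushes in the same direction.
    Components below the (hypothetical) threshold are zeroed out.
    """
    n = len(env_grads)
    masked = []
    for column in zip(*env_grads):
        agreement = abs(sum(_sign(g) for g in column)) / n
        mean = sum(column) / n
        masked.append(mean if agreement >= agreement_threshold else 0.0)
    return masked


# Two environments, three parameters. The second parameter gets
# conflicting signs (-1.0 vs 3.0), so averaging still moves it,
# while the sign-agreement mask suppresses it.
grads = [[0.5, -1.0, 0.2],
         [0.3, 3.0, 0.4]]
avg = average_gradient(grads)
masked = and_mask_gradient(grads)
print(avg)
print(masked)
```

With the default threshold of 1.0 the mask demands unanimous agreement; lowering it would tolerate some disagreement, and a threshold of 0.0 reduces to plain averaging.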

The talk and the respective paper are published at the ICLR 2021 virtual conference.

Similar Papers

06/12/2020

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

Alekh Agarwal, Sham Kakade, Akshay Krishnamurthy, Wen Sun

3:34

06/12/2020

Functional Regularization for Representation Learning: A Unified Theoretical Perspective

Siddhant Garg, Yingyu Liang

3:19

18/07/2021

On Recovering from Modeling Errors Using Testing Bayesian Networks

Haiying Huang, Adnan Darwiche

Keywords: Probabilistic Methods, Graphical Models

5:09

12/07/2020

Structured Prediction with Partial Labelling through the Infimum Loss

Vivien Cabannes, Francis Bach, Alessandro Rudi

Keywords: Sequential, Network, and Time-Series Modeling

13:01

14/06/2020

Semi-Supervised Semantic Segmentation With Cross-Consistency Training

Yassine Ouali, Céline Hudelot, Myriam Tami

Keywords: semantic segmentation, semi-supervised learning, consistency training, semi-supervised semantic segmentation

1:01

06/12/2021

Gradient Starvation: A Learning Proclivity in Neural Networks

Mohammad Pezeshki, Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup, Guillaume Lajoie

Keywords: theory, deep learning, optimization, robustness

10:52

13/04/2021

A theoretical characterization of semi-supervised learning with self-training for gaussian mixture models

Samet Oymak, Talha Cihad Gulcu

2:59

06/12/2020

Untangling tradeoffs between recurrence and self-attention in artificial neural networks

Giancarlo Kerg, Bhargav Kanuparthi, Anirudh Goyal, Kyle Goyette, Yoshua Bengio, Guillaume Lajoie

3:20

18/07/2021

Robust Unsupervised Learning via L-statistic Minimization

Andreas Maurer, Daniela Angela Parletta, Andrea Paudice, Massimiliano Pontil

Keywords: Theory, Statistical Learning Theory

5:03

18/07/2021

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords: Theory, Statistical Learning Theory

5:06

13/04/2021

Semi-supervised learning with meta-gradient

Taihong Xiao, Xin-Yu Zhang, Haolin Jia, Ming-Ming Cheng, Ming-Hsuan Yang

2:56

13/04/2021

Contrastive learning of strong-mixing continuous-time stochastic processes

Bingbin Liu, Pradeep Ravikumar, Andrej Risteski

2:57

06/12/2021

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin, Danil Karpushkin, Dmitry Vetrov

Keywords: deep learning, optimization

14:26

06/12/2021

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

Jannik Kossen, Neil Band, Clare Lyle, Aidan Gomez, Thomas Rainforth, Yarin Gal

Keywords: deep learning, transformers

9:54

18/07/2021

Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning

Zixin Wen, Yuanzhi Li

Keywords: Theory, Deep Learning Theory

5:48

12/07/2020

Supervised learning: no loss no cry

Richard Nock, Aditya Menon

Keywords: Learning Theory

15:18

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords: machine learning, online learning

14:00

12/07/2020

On Implicit Regularization in $\beta$-VAEs

Abhishek Kumar, Ben Poole

Keywords: Deep Learning - Generative Models and Autoencoders

13:34

26/08/2020

Robust Learning from Discriminative Feature Feedback

Sanjoy Dasgupta, Sivan Sabato

14:37

06/12/2021

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

Xinyi Wang, Wenhu Chen, Michael Saxon, William Yang Wang

Keywords: deep learning, domain adaptation, causality, language

14:07

13/04/2021

On data efficiency of meta-learning

Maruan Al-Shedivat, Liam Li, Eric Xing, Ameet Talwalkar

3:24

12/07/2020

Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning

Qing Li, Siyuan Huang, Yining Hong, Yixin Chen, Ying Nian Wu, Song-Chun Zhu

Keywords: Applications - Computer Vision

14:01

14/09/2020

Learning Gradient Boosted Multi-label Classification Rules

Michael Rapp, Eneldo Loza Mencía, Johannes Fürnkranz, Vu-Linh Nguyen, Eyke Hüllermeier

Keywords: multi-label classification, gradient boosting, rule learning

15:45

06/12/2021

Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

Dibya Ghosh, Jad Rahme, Aviral Kumar, Amy Zhang, Ryan Adams, Sergey Levine

Keywords: reinforcement learning and planning

15:17

06/12/2020

A Variational Approach for Learning from Positive and Unlabeled Data

Hui Chen, Fangqing Liu, Yin Wang, Liyue Zhao, Hao Wu

3:13

18/07/2021

Learning from Similarity-Confidence Data

Yuzhou Cao, Lei Feng, Yitian Xu, Bo An, Gang Niu, Masashi Sugiyama

Keywords: Algorithms, Semi-Supervised Learning

4:05

13/04/2021

Adversarially robust estimate and risk analysis in linear regression

Yue Xing, Ruizhi Zhang, Guang Cheng

3:03

12/07/2020

Sequential Transfer in Reinforcement Learning with a Generative Model

Andrea Tirinzoni, Riccardo Poiani, Marcello Restelli

Keywords: Reinforcement Learning - General

10:54

26/04/2020

Empirical Bayes Transductive Meta-Learning with Synthetic Gradients

Shell Xu Hu, Pablo Moreno, Yang Xiao, Xi Shen, Guillaume Obozinski, Neil Lawrence, Andreas Damianou

Keywords: Meta-learning, Empirical Bayes, Synthetic Gradient, Information Bottleneck

4:47

06/12/2021

Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning

Zhenhuan Yang, Yunwen Lei, Puyu Wang, Tianbao Yang, Yiming Ying

Keywords: optimization, machine learning, privacy

14:40

13/04/2021

List learning with attribute noise

Mahdi Cheraghchi, Elena Grigorescu, Brendan Juba, Karl Wimmer, Ning Xie

2:51

19/08/2021

Time-Series Representation Learning via Temporal and Contextual Contrasting

Emadeldeen Eldele, Mohamed Ragab, Zhenghua Chen, Min Wu, Chee Keong Kwoh, Xiaoli Li, Cuntai Guan

Keywords: Machine Learning, Deep Learning, Semi-Supervised Learning, Time-series; Data Streams

12:35

06/12/2020

Learning from Aggregate Observations

Yivan Zhang, Nontawat Charoenphakdee, Zhenguo Wu, Masashi Sugiyama

3:21

18/07/2021

The Impact of Record Linkage on Learning from Feature Partitioned Data

Richard Nock, Stephen J Hardy, Wilko Henecka, Hamish Ivey-Law, Jakub Nabaglo, Giorgio Patrini, Guillaume Smith, Brian Thorne

Keywords: Theory, Statistical Learning Theory

6:02

18/07/2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Gu

Keywords: Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

5:16

06/12/2021

A Mathematical Framework for Quantifying Transferability in Multi-source Transfer Learning

Xinyi Tong, Xiangxiang Xu, Shao-Lun Huang, Lizhong Zheng

Keywords: theory, deep learning, machine learning, vision, transfer learning

13:27

02/02/2021

Fine-grained Generalization Analysis of Vector-Valued Learning

Liang Wu, Antoine Ledent, Yunwen Lei, Marius Kloft

13:54

06/12/2020

Modeling and Optimization Trade-off in Meta-learning

Katelyn Gao, Ozan Sener

3:21

19/08/2021

Abductive Learning with Ground Knowledge Base

Le-Wen Cai, Wang-Zhou Dai, Yu-Xuan Huang, Yu-Feng Li, Stephen Muggleton, Yuan Jiang

Keywords: Knowledge Representation and Reasoning, Diagnosis and Abductive Reasoning, Knowledge Aided Learning, Weakly Supervised Learning

12:59

13/04/2021

Fork or fail: Cycle-consistent training with many-to-one mappings

Qipeng Guo, Zhijing Jin, Ziyu Wang, Xipeng Qiu, Weinan Zhang, Jun Zhu, Zheng Zhang, David Wipf

3:31