Estimating Training Data Influence by Tracing Gradient Descent

06/12/2020

Estimating Training Data Influence by Tracing Gradient Descent

Garima Pruthi, Frederick Liu, Satyen Kale, Mukund Sundararajan

Keywords: Algorithms; Algorithms -> Online Learning; Optimization -> Combinatorial Optimization; Optimization -> Convex Optimization; The, Algorithms -> Bandit Algorithms

Abstract Paper Similar Papers

Abstract: We introduce a method called TracIn that computes the influence of a training example on a prediction made by the model. The idea is to trace how the loss on the test point changes during the training process whenever the training example of interest was utilized. We provide a scalable implementation of TracIn via: (a) a first-order gradient approximation to the exact computation, (b) saved checkpoints of standard training procedures, and (c) cherry-picking layers of a deep neural network. In contrast with previously proposed methods, TracIn is simple to implement; all it needs is the ability to work with gradients, checkpoints, and loss functions. The method is general. It applies to any machine learning model trained using stochastic gradient descent or a variant of it, agnostic of architecture, domain and task. We expect the method to be widely useful within processes that study and improve training data.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

13/04/2021

Learn to expect the unexpected: Probably approximately correct domain generalization

Vikas Garg, Adam Tauman Kalai, Katrina Ligett, Steven Wu

Keywords Paper

0

0

0

0

3:01

12/07/2020

Efficient Domain Generalization via Common-Specific Low-Rank Decomposition

Vihari Piratla, Praneeth Netrapalli, Sunita Sarawagi

Keywords Paper

Supervised Learning

0

0

0

0

14:51

18/07/2021

Model Performance Scaling with Multiple Data Sources

Tatsunori Hashimoto

Keywords Paper

Algorithms, Supervised Learning

0

0

0

1

4:50

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

06/12/2021

Training Neural Networks with Fixed Sparse Masks

Yi-Lin Sung, Varun Nair, Colin Raffel

Keywords Paper

deep learning, transfer learning

0

0

0

0

14:20

06/12/2021

Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

Kenneth Borup, Lars N Andersen

Keywords Paper

theory, deep learning, optimization

0

0

0

0

6:00

02/02/2021

Explaining Neural Matrix Factorization with Gradient Rollback

Carolin Lawrence, Timo Sztyler, Mathias Niepert

Keywords Paper

0

0

0

0

16:47

13/04/2021

Curriculum learning by optimizing learning dynamics

Tianyi Zhou, Shengjie Wang, Jeff Bilmes

Keywords Paper

0

0

0

0

3:03

03/05/2021

Dataset Meta-Learning from Kernel Ridge-Regression

Timothy Nguyen, Zhourong Chen, Jaehoon Lee

Keywords Paper

dataset corruption, infinite-width networks, neural kernels, kernel-ridge regression, dataset compression, dataset distillation, meta-learning

0

0

0

0

4:59

06/12/2021

Explicit loss asymptotics in the gradient descent training of neural networks

Maksim Velikanov, Dmitry Yarotsky

Keywords Paper

theory, deep learning, optimization

0

0

0

0

9:54

07/09/2020

Zero-Shot Domain Generalization

Udit Maniyar, Joseph K J, Aniket Anand Deshmukh and
Urun Dogan, Vineeth N Balasubramanian

Keywords Paper

Domain Generalization, zero-shot learning, semantic space, multi task learning, Learning with limited data, representation learning, classification

0

0

0

0

9:59

06/12/2020

Learning Differentiable Programs with Admissible Neural Heuristics

Ameesh Shah, Eric Zhan, Jennifer Sun and
Abhinav Verma, Yisong Yue, Swarat Chaudhuri

Keywords Paper

Algorithms -> Missing Data; Algorithms -> Uncertainty Estimation; Probabilistic Methods -> Causal Inference; Probabilistic Meth, Probabilistic Methods -> Bayesian Nonparametrics

0

0

0

0

3:28

04/11/2020

Retiarii: A Deep Learning Exploratory-Training Framework

Quanlu Zhang, Zhenhua Han, Fan Yang and
Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou

Keywords Paper

0

0

0

0

20:05

06/12/2021

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Jiayao Zhang, Hua Wang, Weijie Su

Keywords Paper

deep learning, optimization

0

0

0

0

13:45

26/04/2020

Gradients as Features for Deep Representation Learning

Fangzhou Mu, Yingyu Liang, Yin Li

Keywords Paper

representation learning, gradient features, deep learning

0

0

0

0

5:07

12/07/2020

Subspace Fitting Meets Regression: The Effects of Supervision and Orthonormality Constraints on Double Descent of Generalization Errors

Yehuda Dar, Paul Mayer, Lorenzo Luzi, Richard Baraniuk

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

15:39

06/12/2021

Adaptive Risk Minimization: Learning to Adapt to Domain Shift

Marvin Zhang, Henrik Marklund, Nikita Dhawan and
Abhishek Gupta, Sergey Levine, Chelsea Finn

Keywords Paper

machine learning, robustness, vision, domain adaptation

0

0

0

0

9:30

06/12/2021

Subquadratic Overparameterization for Shallow Neural Networks

ChaeHwan Song, Ali Ramezani-Kebrya, Thomas Pethick and
Armin Eftekhari, Volkan Cevher

Keywords Paper

theory, deep learning, optimization

0

0

0

0

5:23

06/12/2021

Shape your Space: A Gaussian Mixture Regularization Approach to Deterministic Autoencoders

Amrutha Saseendran, Kathrin Skubch, Stefan Falkner, Margret Keuper

Keywords Paper

generative model

0

0

0

0

12:18

18/07/2021

Learning a Universal Template for Few-shot Dataset Generalization

Eleni Triantafillou, Hugo Larochelle, Richard Zemel, Vincent Dumoulin

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:27

06/12/2020

SuperLoss: A Generic Loss for Robust Curriculum Learning

Thibault Castells, Philippe Weinzaepfel, Jerome Revaud

Keywords Paper

, Probabilistic Methods -> MCMC

0

0

0

0

3:26

26/04/2020

Gradient $\ell_1$ Regularization for Quantization Robustness

Milad Alizadeh, Arash Behboodi, Mart van Baalen and
Christos Louizos, Tijmen Blankevoort, Max Welling

Keywords Paper

quantization, regularization, robustness, gradient regularization

0

0

0

0

5:01

03/05/2021

For self-supervised learning, Rationality implies generalization, provably

Yamini Bansal, Gal Kaplun, Boaz Barak

Keywords Paper

Representation learning, Self-supervised learning, Generalization Bounds, Deep Learning Theory

0

0

0

0

7:23

13/04/2021

A theoretical characterization of semi-supervised learning with self-training for gaussian mixture models

Samet Oymak, Talha Cihad Gulcu

Keywords Paper

1

1

0

0

2:59

06/12/2020

Continuous Meta-Learning without Tasks

James Harrison, Apoorva Sharma, Chelsea Finn, Marco Pavone

Keywords Paper

0

0

0

0

3:09

05/01/2021

Multi-Loss Weighting With Coefficient of Variations

Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink

Keywords Paper

0

0

0

0

4:56

12/07/2020

Multi-Agent Determinantal Q-Learning

Yaodong Yang, Ying Wen, Jun Wang and
Liheng Chen, Kun Shao, David Mguni, Weinan Zhang

Keywords Paper

Planning, Control, and Multiagent Learning

0

0

0

0

15:58

06/12/2021

Model, sample, and epoch-wise descents: exact solution of gradient flow in the random feature model

Antoine Bodin, Nicolas Macris

Keywords Paper

deep learning, optimization

0

0

0

0

15:00

18/07/2021

Better Training using Weight-Constrained Stochastic Dynamics

Benedict Leimkuhler, Tiffany Vlaar, Timothée Pouchon, Amos Storkey

Keywords Paper

Deep Learning, Bayesian Deep Learning

0

0

0

0

5:14

18/07/2021

On the Proof of Global Convergence of Gradient Descent for Deep ReLU Networks with Linear Widths

Quynh Nguyen

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

4:43

18/07/2021

Sinkhorn Label Allocation: Semi-Supervised Classification via Annealed Self-Training

Kai Sheng Tai, Peter Bailis, Gregory Valiant

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

1

6:59

18/07/2021

Function Contrastive Learning of Transferable Meta-Representations

Waleed Gondal, Shruti Joshi, Nasim Rahaman and
Stefan Bauer, Manuel Wuthrich, Bernhard Schölkopf

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:46

12/07/2020

Operation-Aware Soft Channel Pruning using Differentiable Masks

Minsoo Kang, Bohyung Han

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:56

26/04/2020

Infinite-Horizon Differentiable Model Predictive Control

Sebastian East, Marco Gallieri, Jonathan Masci and
Jan Koutnik, Mark Cannon

Keywords Paper

Model Predictive Control, Riccati Equation, Imitation Learning, Safe Learning

0

0

0

0

4:56

26/04/2020

Learned step size quantization

Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani and
Rathinakumar Appuswamy, Dharmendra S. Modha

Keywords Paper

deep learning, low precision, classification, quantization

0

0

0

0

4:40

12/07/2020

Fast Adaptation to New Environments via Policy-Dynamics Value Functions

Roberta Raileanu, Max Goldstein, Arthur Szlam, Facebook Rob Fergus

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

13:39

05/01/2021

Towards Zero-Shot Learning With Fewer Seen Class Examples

Vinay Kumar Verma, Ashish Mishra, Anubha Pandey and
Hema A. Murthy, Piyush Rai

Keywords Paper

0

0

0

0

4:08

14/09/2020

Automatic Tuning of Stochastic Gradient Descent with Bayesian Optimisation

Victor Picheny, Vincent Dutordoir, Artem Artemev, Nicolas Durrande

Keywords Paper

learning rate, gaussian process, variational inference

0

0

0

0

15:13

03/08/2020

Batch norm with entropic regularization turns deterministic autoencoders into generative models

Amur Ghose, Abdullah Rashwan, Pascal Poupart

Keywords Paper

0

0

0

0

8:18

06/12/2020

On the training dynamics of deep networks with $L_2$ regularization

Aitor Lewkowycz, Guy Gur-Ari

Keywords Paper

0

0

0

0

3:24