HyDRA: Hypergradient Data Relevance Analysis for Interpreting Deep Neural Networks

02/02/2021

HyDRA: Hypergradient Data Relevance Analysis for Interpreting Deep Neural Networks

Yuanyuan Chen, Boyang Li, Han Yu, Pengcheng Wu, Chunyan Miao

Keywords:

Abstract Paper Similar Papers

Abstract: The behaviors of deep neural networks (DNNs) are notoriously resistant to human interpretations. In this paper, we propose Hypergradient Data Relevance Analysis, or HyDRA, which interprets the predictions made by DNNs as effects of their training data. Existing approaches generally estimate data contributions around the final model parameters and ignore how the training data shape the optimization trajectory. By unrolling the hypergradient of test loss w.r.t. the weights of training data, HyDRA assesses the contribution of training data toward test data points throughout the training trajectory. In order to accelerate computation, we remove the Hessian from the calculation and prove that, under moderate conditions, the approximation error is bounded. Corroborating this theoretical claim, empirical results indicate the error is indeed small. In addition, we quantitatively demonstrate that HyDRA outperforms influence functions in accurately estimating data contribution and detecting noisy data labels. The source code is available at https://github.com/cyyever/aaai_hydra.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38949069

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Single Layer Predictive Normalized Maximum Likelihood for Out-of-Distribution Detection

Koby Bibas, Meir Feder, Tal Hassner

Keywords Paper

theory, deep learning, machine learning

0

0

0

0

4:52

13/04/2021

On the generalization properties of adversarial training

Yue Xing, Qifan Song, Guang Cheng

Keywords Paper

0

0

0

0

3:05

18/07/2021

Generalization Guarantees for Neural Architecture Search with Train-Validation Split

Samet Oymak, Mingchen Li, Mahdi Soltanolkotabi

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:16

26/08/2020

Adversarial Robustness of Flow-Based Generative Models

Phillip Pope, Yogesh Balaji, Soheil Feizi

Keywords Paper

0

0

0

0

12:24

12/07/2020

Detecting Out-of-Distribution Examples with Gram Matrices

Chandramouli Shama Sastry, Sageev Oore

Keywords Paper

Fairness, Equity, Justice, and Safety

0

0

0

0

12:40

18/07/2021

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks

Bohan Wang, Qi Meng, Wei Chen, Tie-Yan Liu

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

16:53

26/04/2020

Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

Wei Hu, Zhiyuan Li, Dingli Yu

Keywords Paper

deep learning theory, regularization, noisy labels

0

0

0

0

5:13

18/07/2021

Uniform Convergence, Adversarial Spheres and a Simple Remedy

Gregor Bachmann, Seyed Moosavi, Thomas Hofmann

Keywords Paper

Theory, Deep learning Theory

0

2

0

0

5:52

26/04/2020

Implicit Bias of Gradient Descent based Adversarial Training on Separable Data

Yan Li, Ethan X.Fang, Huan Xu, Tuo Zhao

Keywords Paper

implicit bias, adversarial training, robustness, gradient descent

0

0

0

0

4:53

12/07/2020

Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime

Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli, Florent Krzakala

Keywords Paper

Deep Learning - Theory

0

0

0

0

15:11

03/05/2021

BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction

Yuhang Li, Ruihao Gong, Xu Tan and
Yang Yang, Peng Hu, Qi Zhang, fengwei yu, Wei Wang, Shi Gu

Keywords Paper

Second-order analysis, Mixed Precision, Post Training Quantization

0

0

0

0

4:36

26/04/2020

Effect of Activation Functions on the Training of Overparametrized Neural Nets

Abhishek Panigrahi, Abhishek Shetty, Navin Goyal

Keywords Paper

activation functions, deep learning theory, neural networks

0

0

0

0

5:13

06/12/2021

Stability & Generalisation of Gradient Descent for Shallow Neural Networks without the Neural Tangent Kernel

Dominic Richards, Ilja Kuzborskij

Keywords Paper

deep learning, optimization

0

0

0

0

11:09

14/06/2020

Watch Your Up-Convolution: CNN Based Generative Deep Neural Networks Are Failing to Reproduce Spectral Distributions

Ricard Durall, Margret Keuper, Janis Keuper

Keywords Paper

spectral regularization, gan, deepfake, up-convolution, generative models, frequency spectrum

0

0

0

0

1:00

19/04/2021

Neural data-to-text generation with LM-based text augmentation

Ernie Chang, Xiaoyu Shen, Dawei Zhu and
Vera Demberg, Hui Su

Keywords Paper

0

0

0

0

7:32

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

deep neural networks., instance-based label noise, Learning with noisy labels

0

0

0

0

5:18

12/07/2020

Towards Understanding the Dynamics of the First-Order Adversaries

Zhun Deng, Hangfeng He, Jiaoyang Huang, Weijie Su

Keywords Paper

Adversarial Examples

0

0

0

0

11:05

18/07/2021

DORO: Distributional and Outlier Robust Optimization

Runtian Zhai, Chen Dan, Zico Kolter, Pradeep Ravikumar

Keywords Paper

Probabilistic Methods, Robust statistics

0

0

0

1

5:06

06/12/2021

Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization

Yusuke Iwasawa, Yutaka Matsuo

Keywords Paper

deep learning, optimization, transformers, domain adaptation

0

0

0

0

13:50

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords Paper

data process, re collection, sota models, tacred

0

0

0

0

5:55

03/05/2021

For self-supervised learning, Rationality implies generalization, provably

Yamini Bansal, Gal Kaplun, Boaz Barak

Keywords Paper

Representation learning, Self-supervised learning, Generalization Bounds, Deep Learning Theory

0

0

0

0

7:23

06/12/2021

Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets

Max Ryabinin, Andrey Malinin, Mark Gales

Keywords Paper

machine learning

0

0

0

0

12:36

04/07/2020

Improved Natural Language Generation via Loss Truncation

Daniel Kang, Tatsunori Hashimoto

Keywords Paper

Natural Generation, optimization, estimation, distinguishability

0

0

0

0

10:35

03/05/2021

Implicit Gradient Regularization

David Barrett, Benoit Dherin

Keywords Paper

regularization, theory, deep learning, implicit regularization, deep learning theory, theoretical issues in deep learning

0

0

0

0

4:55

06/12/2021

Rethinking gradient sparsification as total error minimization

Atal Sahu, Aritra Dutta, Ahmed M. Abdelmoniem and
Trambak Banerjee, Marco Canini, Panos Kalnis

Keywords Paper

deep learning, optimization

0

0

0

0

12:31

02/02/2021

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

Qizhou Wang, Bo Han, Tongliang Liu and
Gang Niu, Jian Yang, Chen Gong

Keywords Paper

0

0

0

0

14:56

14/06/2020

On the Acceleration of Deep Learning Model Parallelism With Staleness

An Xu, Zhouyuan Huo, Heng Huang

Keywords Paper

layer-wise staleness, asynchronous model parallelism, convolutional neural networks.

0

0

0

0

1:01

03/05/2021

Overparameterisation and worst-case generalisation: friend or foe?

Aditya Krishna Menon, Ankit Singh Rawat, Sanjiv Kumar

Keywords Paper

worst-case generalisation, overparameterisation

0

0

0

0

5:01

18/07/2021

iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients

Miao Zhang, Steven Su, Shirui Pan and
Xiaojun Chang, Mohammad Abbasnejad, Reza Haffari

Keywords Paper

Algorithms, AutoML

0

0

0

0

5:19

26/04/2020

Adversarial AutoAugment

Xinyu Zhang, Qiang Wang, Jian Zhang, Zhao Zhong

Keywords Paper

Automatic Data Augmentation, Adversarial Learning, Reinforcement Learning

0

0

0

0

4:30

26/04/2020

Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets

Mingrui Liu, Youssef Mroueh, Jerret Ross and
Wei Zhang, Xiaodong Cui, Payel Das, Tianbao Yang

Keywords Paper

Generative Adversarial Nets, Adaptive Gradient Algorithms

0

0

0

0

5:08

26/08/2020

Adversarial Risk Bounds through Sparsity based Compression

Emilio Balda, Niklas Koep, Arash Behboodi, Rudolf Mathar

Keywords Paper

0

0

0

0

15:15

03/05/2021

ResNet After All: Neural ODEs and Their Numerical Solution

Katharina Ott, Prateek Katiyar, Philipp Hennig, Michael Tiemann

Keywords Paper

0

0

0

0

5:10

05/01/2021

Spike-Thrift: Towards Energy-Efficient Deep Spiking Neural Networks by Limiting Spiking Activity via Attention-Guided Compression

Souvik Kundu, Gourav Datta, Massoud Pedram, Peter A. Beerel

Keywords Paper

0

0

0

0

5:22

06/12/2020

HYDRA: Pruning Adversarially Robust Neural Networks

Vikash Sehwag, Shiqi Wang, Prateek Mittal, Suman Jana

Keywords Paper

0

0

0

0

3:14

26/04/2020

Mixed Precision DNNs: All you need is a good parametrization

Stefan Uhlich, Lukas Mauch, Fabien Cardinaux and
Kazuki Yoshiyama, Javier Alonso Garcia, Stephen Tiedemann, Thomas Kemp, Akira Nakamura

Keywords Paper

Deep Neural Network Compression, Quantization, Straight through gradients

1

0

0

0

5:11

05/04/2021

Adaptive Gradient Communication via Critical Learning Regime Identification

Saurabh Agarwal, Hongyi Wang, Kangwook Lee and
Shivaram Venkataraman, Dimitrios Papailiopoulos

Keywords Paper

0

0

0

0

4:23

05/04/2021

Adaptive Gradient Communication via Critical Learning Regime Identification

Saurabh Agarwal, Hongyi Wang, Kangwook Lee and
Shivaram Venkataraman, Dimitrios Papailiopoulos

Keywords Paper

0

0

0

0

21:08

18/07/2021

RATT: Leveraging Unlabeled Data to Guarantee Generalization

Saurabh Garg, Sivaraman Balakrishnan, Zico Kolter, Zachary Lipton

Keywords Paper

Probabilistic Methods, Graphical Models, Theory, Computational Complexity, Theory, Models of Learning and Generalization

0

0

0

1

17:27

06/12/2020

Sampling-Decomposable Generative Adversarial Recommender

Binbin Jin, Defu Lian, Zheng Liu and
Qi Liu, Jianhui Ma, Xing Xie, Enhong Chen

Keywords Paper

0

0

0

0

3:17