22/11/2021

MUSE: Feature Self-Distillation with Mutual Information and Self-Information

Yu Gong, Ye Yu, Gaurav Mittal, Greg Mori, Mei Chen

Keywords: knowledge distillation, self-distillation, mutual information

Abstract: We present a novel information-theoretic approach to introduce dependency among features of a deep convolutional neural network (CNN). The core idea of our proposed method, called MUSE, is to combine MUtual information and SElf-information to jointly improve the expressivity of all features extracted from different layers in a CNN. We present two realizations of MUSE---Additive Information and Multiplicative Information. Importantly, we argue and empirically demonstrate that, compared to other feature discrepancy functions, MUSE is a more effective proxy for introducing dependency and improving the expressivity of all features within the knowledge distillation framework. MUSE achieves superior performance over a variety of popular architectures and feature discrepancy functions for self-distillation and online distillation, and performs competitively with state-of-the-art methods for offline distillation. MUSE is also demonstrably versatile: it can be easily extended to CNN-based models on tasks beyond image classification, such as object detection.
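To make the high-level idea concrete, below is a minimal PyTorch sketch of a loss in the spirit the abstract describes: an additive combination of a mutual-information-style dependency term between a shallow and a deep feature's predictions, plus self-information (entropy) terms. The exact MUSE objectives (Additive and Multiplicative Information) are defined in the paper; the function names, the KL-based dependency surrogate, the sign and weights of the entropy terms, and the weights alpha/beta below are all illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def entropy(p, eps=1e-8):
    """Shannon entropy of a batch of categorical distributions
    (the expectation of self-information)."""
    return -(p * (p + eps).log()).sum(dim=1).mean()

def muse_style_loss(shallow_logits, deep_logits, alpha=1.0, beta=0.1):
    """Hypothetical additive combination of a dependency term and entropy terms.

    - KL(deep || shallow) pushes the shallow feature's prediction toward the
      deep feature's, a crude proxy for raising the mutual information
      between features from different layers.
    - The entropy terms act as a self-information regularizer on each
      prediction; whether this term is minimized or maximized (the sign of
      beta) would depend on the paper's actual formulation.
    """
    p_deep = F.softmax(deep_logits.detach(), dim=1)  # deepest feature acts as teacher
    log_p_shallow = F.log_softmax(shallow_logits, dim=1)
    dependency = F.kl_div(log_p_shallow, p_deep, reduction="batchmean")
    self_info = entropy(F.softmax(shallow_logits, dim=1)) \
        + entropy(F.softmax(deep_logits, dim=1))
    return alpha * dependency + beta * self_info

# Usage with dummy logits from two auxiliary classifiers at different depths.
shallow = torch.randn(4, 10, requires_grad=True)
deep = torch.randn(4, 10)
loss = muse_style_loss(shallow, deep)
loss.backward()
```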

Talk and paper published at the BMVC 2021 virtual conference.
