07/09/2020

SD-MTCNN: Self-Distilled Multi-Task CNN

Ankit Jha, Awanish Kumar, Biplab Banerjee, Vinay Namboodiri

Keywords: self-distillation, multi-task learning

Abstract: Multi-task learning (MTL) with convolutional neural networks (CNNs) trains a single network for multiple correlated tasks in concert. For accuracy-critical applications, performance is often boosted by resorting to a deeper network, which also increases model complexity. Such burdensome models are, however, difficult to deploy on mobile or edge devices. To strike a trade-off between the performance and complexity of CNNs in the MTL setting, we introduce the novel paradigm of self-distillation within the network. Different from traditional knowledge distillation (KD), which trains a Student to follow a cumbersome Teacher, our self-distilled multi-task CNN model, SD-MTCNN, aims at distilling knowledge from the deeper CNN layers into the shallow layers. Specifically, we follow a hard-sharing MTL setup in which all tasks share a generic feature encoder on top of which separate task-specific decoders are attached. Under this premise, SD-MTCNN distills the more abstract decoder features into the encoded feature space, which improves multi-task performance across different parts of the network. We validate SD-MTCNN on three benchmark datasets: CityScapes, NYUv2, and Mini-Taskonomy, and the results confirm the improved generalization capability of self-distilled multi-task CNNs in comparison to the literature and baselines.
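The abstract describes the setup only at a high level. A minimal, hypothetical PyTorch sketch of a hard-sharing MTL model with a self-distillation term of this kind is given below. The module names, channel sizes, and the use of an MSE loss between the shared encoder features and each decoder's deeper features are assumptions for illustration, not the authors' implementation.

```python
# Minimal sketch (not the authors' code): hard-sharing MTL with a
# self-distillation term that pulls the shared encoder features toward
# the more abstract, deeper features inside each task decoder.
# Module names, shapes, and the MSE distillation loss are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SharedEncoder(nn.Module):
    def __init__(self, in_ch=3, feat_ch=64):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.body(x)  # shared (shallow) feature map

class TaskDecoder(nn.Module):
    def __init__(self, feat_ch=64, out_ch=1):
        super().__init__()
        self.deep = nn.Sequential(  # deeper, task-specific layers
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
        )
        self.head = nn.Conv2d(feat_ch, out_ch, 1)

    def forward(self, shared_feat):
        deep_feat = self.deep(shared_feat)           # "teacher" features
        return self.head(deep_feat), deep_feat

def sd_mtl_loss(encoder, decoders, task_losses, x, targets, lam=0.1):
    """Total loss = sum of per-task losses + lam * self-distillation terms."""
    shared = encoder(x)
    total = 0.0
    for name, dec in decoders.items():
        pred, deep_feat = dec(shared)
        total = total + task_losses[name](pred, targets[name])
        # Self-distillation: match the shared (student) features to the
        # detached deeper (teacher) features of this task's decoder.
        total = total + lam * F.mse_loss(shared, deep_feat.detach())
    return total
```

In this sketch the distillation gradient only flows into the shared encoder (the decoder features are detached), which is one plausible way to realize "distilling deeper features into the shallow layers"; the paper should be consulted for the actual loss formulation and weighting.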

