Learning to Branch for Multi-Task Learning

12/07/2020

Pengsheng Guo, Chen-Yu Lee, Daniel Ulbricht

Keywords: Transfer, Multitask and Meta-learning

Abstract: Training multiple tasks jointly in one deep network reduces inference latency and, by sharing certain layers of the network, improves performance over the single-task counterpart. However, over-sharing a network can erroneously enforce over-generalization, causing negative knowledge transfer across tasks. Prior work relies on human intuition or pre-computed task relatedness scores to design ad hoc branching structures, which yields sub-optimal results and often requires considerable trial-and-error effort. In this work, we present an automated multi-task learning algorithm that learns where to share or branch within a network, designing an effective network topology that is directly optimized for multiple objectives across tasks. Specifically, we propose a novel tree-structured design space that casts a tree branching operation as a Gumbel-Softmax sampling procedure. This enables differentiable network splitting that is end-to-end trainable. We validate the proposed method on controlled synthetic data, CelebA, and Taskonomy.
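
The branching operation described in the abstract can be illustrated with a minimal sketch. It assumes that each child node holds a vector of logits over the candidate parent nodes in the previous layer and draws a relaxed one-hot selection with Gumbel-Softmax. The function names, shapes, and temperature value are hypothetical and are not taken from the paper. The sketch shows only the forward sampling step in NumPy; end-to-end training would need an automatic differentiation framework.

```python
import numpy as np


def gumbel_softmax(logits, tau, rng):
    """Draw a relaxed one-hot sample per row of `logits` (Gumbel-Softmax)."""
    # Gumbel(0, 1) noise via the inverse-CDF trick; u is kept away from 0.
    u = rng.uniform(low=1e-10, high=1.0, size=logits.shape)
    gumbel_noise = -np.log(-np.log(u))
    scaled = (logits + gumbel_noise) / tau
    # Numerically stable softmax over the last axis.
    scaled = scaled - scaled.max(axis=-1, keepdims=True)
    exp = np.exp(scaled)
    return exp / exp.sum(axis=-1, keepdims=True)


def branch(parent_outputs, branch_logits, tau, rng):
    """Mix parent features for each child node (hypothetical helper).

    parent_outputs: (num_parents, feature_dim) features from the previous layer
    branch_logits:  (num_children, num_parents) selection logits (assumed learnable)
    Returns the child inputs (num_children, feature_dim) and the sampled weights.
    """
    weights = gumbel_softmax(branch_logits, tau, rng)
    return weights @ parent_outputs, weights


rng = np.random.default_rng(0)
parent_outputs = rng.normal(size=(2, 4))   # two parent nodes, 4-dim features
branch_logits = np.zeros((3, 2))           # three child nodes, uniform prior
child_inputs, weights = branch(parent_outputs, branch_logits, tau=0.5, rng=rng)

# After training, a discrete tree could be read off by taking the most likely parent.
chosen_parent = weights.argmax(axis=1)
```

A lower temperature `tau` pushes each row of `weights` closer to one-hot, so each child effectively connects to a single parent; that is what makes the result readable as a tree.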

The talk and the corresponding paper were published at the ICML 2020 virtual conference.

Similar Papers

19/08/2021

GAEN: Graph Attention Evolving Networks

Min Shi, Yu Huang, Xingquan Zhu, Yufei Tang, Yuan Zhuang, Jianxun Liu

Keywords: Data Mining, Feature Extraction, Selection and Dimensionality Reduction, Mining Graphs, Semi Structured Data, Complex Data, Mining Spatial, Temporal Data

13:52

16/11/2020

The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning

Jan Blumenkamp, Amanda Prorok

4:51

26/04/2020

Gradients as Features for Deep Representation Learning

Fangzhou Mu, Yingyu Liang, Yin Li

Keywords: representation learning, gradient features, deep learning

5:07

18/07/2021

Learning Binary Decision Trees by Argmin Differentiation

Valentina Zantedeschi, Matt J. Kusner, Vlad Niculae

Keywords: Deep Learning

5:16

14/09/2020

Modeling Dynamic Heterogeneous Network for Link Prediction using Hierarchical Attention with Temporal RNN

Hansheng Xue, Luwei Yang, Wen Jiang, Yi Wei, Yi Hu, Yu Lin

Keywords: dynamic heterogeneous network, hierarchical attention, recurrent neural network, temporal self-attention

13:27

06/12/2020

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning

Ximeng Sun, Rameswar Panda, Rogerio Feris, Kate Saenko

3:13

02/02/2021

Parameterizing Branch-and-Bound Search Trees to Learn Branching Policies

Giulia Zarpellon, Jason Jo, Andrea Lodi, Yoshua Bengio

17:58

19/08/2021

DA-GCN: A Domain-aware Attentive Graph Convolution Network for Shared-account Cross-domain Sequential Recommendation

Lei Guo, Li Tang, Tong Chen, Lei Zhu, Quoc Viet Hung Nguyen, Hongzhi Yin

Keywords: Machine Learning, Recommender Systems, Personalization and User Modeling, Information Retrieval

14:02

02/02/2021

Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

Xueyi Li, Tianfei Zhou, Jianwu Li, Yi Zhou, Zhaoxiang Zhang

14:47

23/08/2020

Unsupervised differentiable multi-aspect network embedding

Chanyoung Park, Carl Yang, Qi Zhu, Donghyun Kim, Hwanjo Yu, Jiawei Han

Keywords: representation learning, network embedding, graph mining

19:55

02/02/2021

LRSC: Learning Representations for Subspace Clustering

Changsheng Li, Chen Yang, Bo Liu, Ye Yuan, Guoren Wang

15:09

02/02/2021

Deep Fusion Clustering Network

Wenxuan Tu, Sihang Zhou, Xinwang Liu, Xifeng Guo, Zhiping Cai, En Zhu, Jieren Cheng

16:32

30/11/2020

MTNAS: Search Multi-Task Networks for Autonomous Driving

Hao Liu, Dong Li, JinZhang Peng, Qingjie Zhao, Lu Tian, Yi Shan

9:06

23/08/2020

An efficient neighborhood-based interaction model for recommendation on heterogeneous graph

Jiarui Jin, Jiarui Qin, Yuchen Fang, Kounianhua Du, Weinan Zhang, Yong Yu, Zheng Zhang, Alexander J. Smola

Keywords: recommender system, heterogeneous information network, neighborhood-based interaction

13:53

14/09/2020

Neural Cross-Domain Collaborative Filtering with Shared Entities

Vijaikumar M, Shirish Shevade, Narasimha Murty

Keywords: cross-domain collaborative filtering, deep learning, neural networks, wide and deep framework, recommendation system

15:21

05/01/2021

Unsupervised Domain Adaptation in Semantic Segmentation via Orthogonal and Clustered Embeddings

Marco Toldo, Umberto Michieli, Pietro Zanuttigh

4:59

18/07/2021

Joining datasets via data augmentation in the label space for neural networks

Jake Zhao Zhao, Mingfeng Ou, linji Xue, Yunkai Cui, Sai Wu, Gang Chen

Keywords: Deep Learning, Theory, Statistical Physics of Learning, Optimization, Non-Convex Optimization; Theory

5:14

26/08/2020

Rep the Set: Neural Networks for Learning Set Representations

Konstantinos Skianis, Giannis Nikolentzos, Stratis Limnios, Michalis Vazirgiannis

14:19

14/09/2020

Progressive Supervision for Node Classification

Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi

Keywords: graph convolutional networks, progressive supervision, node classification

11:07

12/07/2020

Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Jang-Hyun Kim, Wonho Choo, Hyun Oh Song

Keywords: Deep Learning - Algorithms

13:40

26/04/2020

Progressive learning and disentanglement of hierarchical representations

Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang

Keywords: generative model, disentanglement, progressive learning, VAE

5:06

23/08/2020

HGCN: A heterogeneous graph convolutional network-based deep learning model toward collective classification

Zhihua Zhu, Xinxin Fan, Xiaokai Chu, Jingping Bi

Keywords: heterogeneous information network, heterogeneous graph convolutional network, collective classification, relational learning

15:08

06/12/2020

Graph Information Bottleneck

Tailin Wu, Hongyu Ren, Pan Li, Jure Leskovec

3:24

23/08/2020

Neural subgraph isomorphism counting

Xin Liu, Haojie Pan, Mutian He, Yangqiu Song, Xin Jiang, Lifeng Shang

Keywords: neural network, dynamic memory, subgraph isomorphism

13:43

03/05/2021

Evaluating the Disentanglement of Deep Generative Models through Manifold Topology

Sharon Zhou, Eric Zelikman, Fred Lu, Andrew Ng, Gunnar E Carlsson, Stefano Ermon

Keywords: generative models, disentanglement, evaluation

5:06

14/06/2020

Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization

Ruyi Ji, Longyin Wen, Libo Zhang, Dawei Du, Yanjun Wu, Chen Zhao, Xianglong Liu, Feiyue Huang

Keywords: fine-grained visual categorization (fgvc), neural tree, attention mechanism, convolutional neural networks (cnn)

1:02

23/08/2020

MultiSage: Empowering GCN with contextualized multi-embeddings on web-scale multipartite networks

Carl Yang, Aditya Pal, Andrew Zhai, Nikil Pancha, Jiawei Han, Charles Rosenberg, Jure Leskovec

Keywords: contextualized multi-embedding, web-scale training and inference, search and recommendation, graph neural network

16:50

26/04/2020

Learned step size quantization

Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha

Keywords: deep learning, low precision, classification, quantization

4:40

26/04/2020

Demystifying Inter-Class Disentanglement

Aviv Gabbay, Yedid Hoshen

Keywords: disentanglement, latent optimization, domain translation

4:55

19/04/2021

Dependency parsing with structure preserving embeddings

Ákos Kádár, Lan Xiao, Mete Kemertas, Federico Fancellu, Allan Jepson, Afsaneh Fazly

12:34

14/06/2020

Auto-Encoding Twin-Bottleneck Hashing

Yuming Shen, Jie Qin, Jiaxin Chen, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, Ling Shao

Keywords: image hashing, data retrieval, unsupervised learning, graph neural networks

1:00

26/04/2020

Continual Learning with Adaptive Weights (CLAW)

Tameem Adel, Han Zhao, Richard E. Turner

Keywords: Continual learning

4:58

03/05/2021

Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral

Lucio Dery, Yann Dauphin, David Grangier

Keywords: multitask learning, deep learning, pre-training, gradient decomposition

5:22

06/12/2021

Universal Graph Convolutional Networks

Di Jin, Zhizhi Yu, Cuiying Huo, Rui Wang, Xiao Wang, Dongxiao He, Jiawei Han

Keywords: graph learning

10:29

06/12/2021

Sequence-to-Sequence Learning with Latent Neural Grammars

Yoon Kim

Keywords: deep learning

14:31

03/05/2021

Estimating informativeness of samples with Smooth Unique Information

Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini, Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

Keywords: dataset summarization, ntk, stability theory, sample information, information theory

6:05

02/02/2021

TreeCaps: Tree-Based Capsule Networks for Source Code Processing

Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang

14:58

26/04/2020

Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers

Junjie LIU, Zhe XU, Runbin SHI, Ray C. C. Cheung, Hayden K.H. So

Keywords: neural network pruning, sparse learning, network compression, architecture search

4:49

22/11/2021

Progressive Growing of Points with Tree-structured Generators

Hyeontae Son, Young Min Kim

Keywords: Point cloud auto-encoder, Progressive growing, Tree-structured generators

2:40

23/08/2020

Predicting temporal sets with deep neural networks

Le Yu, Leilei Sun, Bowen Du, Chuanren Liu, Hui Xiong, Weifeng Lv

Keywords: graph convolutions, temporal sets, temporal data, sequence learning

13:36