07/09/2020

Branched Multi-Task Networks: Deciding what layers to share

Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool, Bert De Brabandere

Keywords: multi-task learning, neural architecture search, scene understanding, MTL, efficient, NAS, transfer learning, taskonomy, task affinity

Abstract: In the context of multi-task learning, neural networks with branched architectures have often been employed to jointly tackle the tasks at hand. Such ramified networks typically start with a number of shared layers, after which different tasks branch out into their own sequence of layers. Understandably, as the number of possible network configurations is combinatorially large, deciding what layers to share and where to branch out becomes cumbersome. Prior works have either relied on ad hoc methods to determine the level of layer sharing, which is suboptimal, or utilized neural architecture search techniques to establish the network design, which is considerably expensive. In this paper, we go beyond these limitations and propose an approach to automatically construct branched multi-task networks, by leveraging the employed tasks' affinities. Given a specific budget, i.e., the number of learnable parameters, the proposed approach generates architectures in which shallow layers are task-agnostic, whereas deeper ones gradually grow more task-specific. Extensive experimental analysis across numerous, diverse multi-tasking datasets shows that, for a given budget, our method consistently yields networks with the highest performance, while for a certain performance threshold it requires the least amount of learnable parameters.
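To make the architecture family concrete, below is a minimal sketch of a branched multi-task network in PyTorch. It is not the authors' implementation: the class name `BranchedMTLNet`, the `split_at` argument, and the placeholder heads are illustrative assumptions. In the paper's approach the split point would be derived from measured task affinities under a parameter budget; here it is simply passed in, so the sketch only shows the resulting structure (a task-agnostic trunk followed by task-specific branches).

```python
# Minimal sketch of a branched multi-task network (assumes PyTorch).
# In the paper, where to branch is decided from task affinities under a
# parameter budget; this sketch takes the split point as a plain argument.
import torch
import torch.nn as nn


def conv_block(in_ch, out_ch):
    """One convolutional stage; a stand-in for a real backbone layer."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
    )


class BranchedMTLNet(nn.Module):
    """Shared shallow layers, then one branch of deeper layers per task.

    This illustrative class supports a single split point: the first
    `split_at` stages are shared by all tasks, the remaining stages are
    duplicated per task. A tree of gradual splits, as in the paper, would
    generalize this to one partition of the tasks per depth.
    """

    def __init__(self, tasks, channels=(3, 32, 64, 128), split_at=2):
        super().__init__()
        assert 0 < split_at < len(channels) - 1
        # Task-agnostic trunk: shallow stages shared by every task.
        self.trunk = nn.Sequential(
            *[conv_block(channels[i], channels[i + 1]) for i in range(split_at)]
        )
        # Task-specific branches: deeper stages duplicated per task.
        self.branches = nn.ModuleDict({
            t: nn.Sequential(
                *[conv_block(channels[i], channels[i + 1])
                  for i in range(split_at, len(channels) - 1)]
            )
            for t in tasks
        })
        # One 1x1 prediction head per task (output channels are placeholders).
        self.heads = nn.ModuleDict({
            t: nn.Conv2d(channels[-1], 1, kernel_size=1) for t in tasks
        })

    def forward(self, x):
        shared = self.trunk(x)
        return {t: self.heads[t](self.branches[t](shared)) for t in self.branches}


# Usage: two dense-prediction tasks sharing the first two stages.
net = BranchedMTLNet(tasks=["depth", "normals"], split_at=2)
out = net(torch.randn(1, 3, 64, 64))
print({t: y.shape for t, y in out.items()})
```

Note the budget trade-off the abstract describes: moving `split_at` earlier duplicates more stages across tasks (more parameters, more task-specific capacity), while moving it later shares more of the network (fewer parameters, more potential task interference).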

Talk and paper published at the BMVC 2020 virtual conference.

