Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

04/07/2020

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

Fynn Schröder, Chris Biemann

Keywords: multi-task tasks, MTL, TL, MTL setups

Abstract Paper Similar Papers

Abstract: Multi-task learning (MTL) and transfer learning (TL) are techniques to overcome the issue of data scarcity when training state-of-the-art neural networks. However, finding beneficial auxiliary datasets for MTL or TL is a time- and resource-consuming trial-and-error approach. We propose new methods to automatically assess the similarity of sequence tagging datasets to identify beneficial auxiliary data for MTL or TL setups. Our methods can compute the similarity between any two sequence tagging datasets, \ie they do not need to be annotated with the same tagset or multiple labels in parallel. Additionally, our methods take tokens and their labels into account, which is more robust than only using either of them as an information source, as conducted in prior work. We empirically show that our similarity measures correlate with the change in test score of neural networks that use the auxiliary dataset for MTL to increase the main task performance. We provide an efficient, open-source implementation.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Validation Free and Replication Robust Volume-based Data Valuation

Xinyi Xu, Zhaoxuan Wu, Chuan Sheng Foo, Bryan Kian Hsiang Low

Keywords Paper

deep learning, robustness

0

0

0

0

14:46

02/02/2021

Latent Independent Excitation for Generalizable Sensor-based Cross-Person Activity Recognition

Hangwei Qian, Sinno Jialin Pan, Chunyan Miao

Keywords Paper

0

0

0

0

16:06

18/07/2021

Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation

Haoxiang Wang, Han Zhao, Bo Li

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:01

03/05/2021

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

Amy Zhang, Shagun Sodhani, Khimya Khetarpal, Joelle Pineau

Keywords Paper

bisimulation, block mdp, hidden-parameter mdp, multi-task reinforcement learning

0

0

0

0

4:17

02/02/2021

Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

Minsu Kim, Sunghun Joung, Seungryong Kim and
JungIn Park, Ig-Jae Kim, Kwanghoon Sohn

Keywords Paper

0

0

0

0

14:47

04/07/2020

Learning to Faithfully Rationalize by Construction

Sarthak Jain, Sarah Wiegreffe, Yuval Pinter, Byron C. Wallace

Keywords Paper

NLP, neural classification, training, automatic evaluations

0

0

0

0

11:55

03/05/2021

Distance-Based Regularisation of Deep Networks for Fine-Tuning

Henry Gouk, Timothy Hospedales, massimiliano pontil

Keywords Paper

Statistical Learning Theory, Transfer Learning, Deep Learning

0

0

0

0

4:57

02/02/2021

Bridging Towers of Multi-task Learning with a Gating Mechanism for Aspect-based Sentiment Analysis and Sequential Metaphor Identification

Rui Mao, Xiao Li

Keywords Paper

0

0

0

0

19:27

26/04/2020

Unsupervised Model Selection for Variational Disentangled Representation Learning

Sunny Duan, Loic Matthey, Andre Saraiva and
Nick Watters, Chris Burgess, Alexander Lerchner, Irina Higgins

Keywords Paper

unsupervised disentanglement metric, disentangling, representation learning

0

0

0

0

5:02

06/12/2020

Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift

Remi Tachet des Combes, Han Zhao, Yu-Xiang Wang, Geoffrey Gordon

Keywords Paper

0

0

0

0

3:19

18/07/2021

You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling

Zhanpeng Zeng, Yunyang Xiong, Sathya Ravi and
Shailesh Acharya, Glenn Fung, Vikas Singh

Keywords Paper

Applications, Natural Language Processing

0

0

0

0

5:16

18/07/2021

Quasi-global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data

Tao Lin, Praneeth Karimireddy, Sebastian Stich, Martin Jaggi

Keywords Paper

Deep Learning, Optimization for Deep Networks

0

0

0

0

5:14

12/07/2020

Learning Autoencoders with Relational Regularization

Hongteng Xu, Dixin Luo, Ricardo Henao and
Svati Shah, Lawrence Carin

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

13:59

12/07/2020

Enhancing Simple Models by Exploiting What They Already Know

Amit Dhurandhar, Karthikeyan Shanmugam, Ronny Luss

Keywords Paper

Supervised Learning

0

0

0

0

13:57

01/07/2020

Zero-Resource Cross-Domain Named Entity Recognition

Zihan Liu, Genta Indra Winata, Pascale Fung

Keywords Paper

0

0

0

0

5:15

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Paper

0

0

0

0

19:54

02/02/2021

DenserNet: Weakly Supervised Visual Localization Using Multi-Scale Feature Aggregation

Dongfang Liu, Yiming Cui, Liqi Yan and
Christos Mousas, Baijian Yang, Yingjie Chen

Keywords Paper

0

0

0

0

16:15

26/04/2020

Understanding Knowledge Distillation in Non-autoregressive Machine Translation

Chunting Zhou, Jiatao Gu, Graham Neubig

Keywords Paper

knowledge distillation, non-autoregressive neural machine translation

0

0

0

0

4:55

18/07/2021

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Jianfei Chen, Lianmin Zheng, Zhewei Yao and
Dequan Wang, Ion Stoica, Michael Mahoney, Joseph E Gonzalez

Keywords Paper

Algorithms, Large Scale Learning

0

0

0

0

18:54

26/04/2020

Sharing Knowledge in Multi-Task Deep Reinforcement Learning

Carlo D'Eramo, Davide Tateo, Andrea Bonarini and
Marcello Restelli, Jan Peters

Keywords Paper

Deep Reinforcement Learning, Multi-Task

0

0

0

0

4:27

06/12/2021

DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning

Hussein Hazimeh, Zhe Zhao, Aakanksha Chowdhery and
Maheswaran Sathiamoorthy, Yihua Chen, Rahul Mazumder, Lichan Hong, Ed Chi

Keywords Paper

deep learning, optimization

0

0

0

0

11:52

06/12/2021

AugMax: Adversarial Composition of Random Augmentations for Robust Training

Haotao Wang, Chaowei Xiao, Jean Kossaifi and
Zhiding Yu, Anima Anandkumar, Zhangyang Wang

Keywords Paper

deep learning, robustness, adversarial robustness and security

0

0

0

0

11:19

02/02/2021

A General Class of Transfer Learning Regression without Implementation Cost

Shunya Minami, Song Liu, Stephen Wu and
Kenji Fukumizu, Ryo Yoshida

Keywords Paper

0

0

0

0

14:13

06/12/2021

Neural Routing by Memory

Kaipeng Zhang, Zhenqiang Li, Zhifeng Li and
Wei Liu, Yoichi Sato

Keywords Paper

deep learning

0

0

0

0

6:41

06/12/2021

CAM-GAN: Continual Adaptation Modules for Generative Adversarial Networks

Sakshi Varshney, Vinay Kumar Verma, P. K. Srijith and
Lawrence Carin, Piyush Rai

Keywords Paper

generative model, representation learning, continual learning

0

0

0

0

14:50

06/12/2020

GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network

Prune Truong, Martin Danelljan, Luc V Gool, Radu Timofte

Keywords Paper

0

0

0

0

3:18

22/11/2021

Model Composition: Can Multiple Neural Networks Be Combined into a Single Network Using Only Unlabeled Data?

Amin Banitalebi-Dehkordi, Xinyu Kang, Yong Zhang

Keywords Paper

Model Composition, Combining Neural Networks, Pseudo Label, Self Training, Label Aggregation, Combining Models

0

0

0

0

2:58

06/12/2021

Contrastively Disentangled Sequential Variational Autoencoder

Junwen Bai, Weiran Wang, Carla Gomes

Keywords Paper

self-supervised learning, generative model, contrastive learning, representation learning, interpretability

0

0

0

0

12:53

23/08/2020

Spectrum-guided adversarial disparity learning

Zhe Liu, Lina Yao, Lei Bai and
Xianzhi Wang, Can Wang

Keywords Paper

adversarial autoencoder, generative models, intraclass variability, activity recognition

0

0

0

0

14:30

06/12/2021

On Calibration and Out-of-Domain Generalization

Yoav Wald, Amir Feder, Daniel Greenfeld, Uri Shalit

Keywords Paper

machine learning, domain adaptation, causality

0

0

0

0

11:00

06/12/2021

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

Julius von Kügelgen, Yash Sharma, Luigi Gresele and
Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

Keywords Paper

theory, self-supervised learning, representation learning

0

0

0

0

16:02

02/02/2021

Meta-Learning Framework with Applications to Zero-Shot Time-Series Forecasting

Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio

Keywords Paper

0

0

0

0

17:41

18/07/2021

On Linear Identifiability of Learned Representations

Geoffrey Roeder, Luke Metz, Durk Kingma

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

5:11

14/06/2020

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

Junbo Yin, Wenguan Wang, Qinghao Meng and
Ruigang Yang, Jianbing Shen

Keywords Paper

mot, multi-task learning, motion, affinity, attention, online

0

0

0

0

1:03

14/06/2020

Improving One-Shot NAS by Suppressing the Posterior Fading

Xiang Li, Chen Lin, Chuming Li and
Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang

Keywords Paper

neural architecture search automl classification imagenet bayesian

0

0

0

0

0:58

05/12/2020

Self-supervised learning for pairwise data refinement

Gustavo Hernandez Abrego, Bowen Liang, Wei Wang and
Zarana Parekh, Yinfei Yang, Yunhsuan Sung

Keywords Paper

0

0

0

0

15:17

06/12/2021

SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Jongheon Jeong, Sejun Park, Minkyu Kim and
Heung-Chang Lee, Do-Guk Kim, Jinwoo Shin

Keywords Paper

deep learning, machine learning, robustness, adversarial robustness and security

0

0

0

0

12:23

06/12/2021

Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time

Ferran Alet, Maria Bauza, Kenji Kawaguchi and
Nurullah Giray Kuru, Tomás Lozano-Pérez, Leslie Kaelbling

Keywords Paper

deep learning, optimization, machine learning, self-supervised learning, meta learning

0

0

0

0

15:05

30/11/2020

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

Keywords Paper

0

0

0

0

10:07

06/12/2021

Sparse Flows: Pruning Continuous-depth Models

Lucas Liebenwein, Ramin Hasani, Alexander Amini, Daniela Rus

Keywords Paper

deep learning, generative model

0

0

0

0

12:51