On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment

Abstract: Modern multilingual models are trained on concatenated text from multiple languages in hopes of conferring benefits to each (positive transfer), with the most pronounced benefits accruing to low-resource languages. However, recent work has shown that this approach can degrade performance on high-resource languages, a phenomenon known as negative interference. In this paper, we present the first systematic study of negative interference. We show that, contrary to previous belief, negative interference also impacts low-resource languages. While parameters are maximally shared to learn language-universal structures, we demonstrate that language-specific parameters do exist in multilingual models and they are a potential cause of negative interference. Motivated by these observations, we also present a meta-learning algorithm that obtains better cross-lingual transferability and alleviates negative interference, by adding language-specific layers as meta-parameters and training them in a manner that explicitly improves shared layers′ generalization on all languages. Overall, our results show that negative interference is more common than previously known, suggesting new directions for improving multilingual representations.

03/05/2021

supervised contrastive learning, pre-trained language model fine-tuning, natural language understanding, generalization, few-shot learning, robustness

4:44

08/12/2020

On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment

Zirui Wang, Zachary C. Lipton, Yulia Tsvetkov

Comments

Similar Papers

Contrastive Learning with Adversarial Perturbations for Conditional Text Generation

Seanie Lee, Dong Bok Lee, Sung Ju Hwang

Keywords Abstract Paper

contrastive learning, conditional text generation

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and Xisen Jin, Xiang Ren

Keywords Abstract Paper

machine learning, fairness, language

Text Classification with Negative Supervision

Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara and Chenhui Chu, Yuki Arase

Keywords Abstract Paper

Text Classification, text representation, text tasks, single- classifications

Feature Projection for Improved Text Classification

Qi Qin, Wenpeng Hu, Bing Liu

Keywords Abstract Paper

Text Classification, classification, sentiment classification, Bert classification

Supervised Contrastive Learning for Pre-trained Language Model Fine-tuning

Beliz Gunel, Jingfei Du, Alexis Conneau, Veselin Stoyanov

Keywords Abstract Paper

supervised contrastive learning, pre-trained language model fine-tuning, natural language understanding, generalization, few-shot learning, robustness

Joint Training for Learning Cross-lingual Embeddings with Sub-word Information without Parallel Corpora

Ali Hakimi Parizi, Paul Cook

Keywords Abstract Paper

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li, lemao liu, Shuming Shi

Keywords Abstract Paper

Negative Sampling, Unlabeled Entity Problem, Named Entity Recognition

A Semantically Consistent and Syntactically Variational Encoder-Decoder Framework for Paraphrase Generation

Wenqing Chen, Jidong Tian, Liqiang Xiao and Hao He, Yaohui Jin

Keywords Abstract Paper

Machine translationese: Effects of algorithmic bias on linguistic complexity in machine translation

Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam

Keywords Abstract Paper

Implicit unlikelihood training: Improving neural text generation with reinforcement learning

Evgeny Lagutin, Daniil Gavrilov, Pavel Kalaidin

Keywords Abstract Paper

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Ieva Staliūnaitė, Philip John Gorinski, Ignacio Iacobacci

Keywords Abstract Paper

Linguistic Features for Readability Assessment

Tovly Deutsch, Masoud Jasbi, Stuart Shieber

Keywords Abstract Paper

Improving Neural Language Generation with Spectrum Control

Lingxiao Wang, Jing Huang, Kevin Huang and Ziniu Hu, Guangtao Wang, Quanquan Gu

Keywords Abstract Paper

Scale-Aware Polar Representation for Arbitrarily-Shaped Text Detection

Yanguang Bi, Zhiqiang Hu

Keywords Abstract Paper

Educating Text Autoencoders: Latent Representation Guidance via Denoising

Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

Keywords Abstract Paper

Deep Learning - Generative Models and Autoencoders

What Have We Achieved on Text Summarization?

Dandan Huang, Leyang Cui, Sen Yang and Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang

Keywords Abstract Paper

text summarization, deep learning, automatic summarizers, summarization systems

CDL: Curriculum Dual Learning for Emotion-Controllable Response Generation

Lei Shen, Yang Feng

Keywords Abstract Paper

Emotion-Controllable Generation, training process, response tasks, CDL

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Abstract Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

Unsupervised Word Translation with Adversarial Autoencoder

Tasnim Mohiuddin, Shafiq Joty

Keywords Abstract Paper

Unsupervised Translation, machine translation, transfer learning, word task

Learning to summarize with human feedback

Nisan Stiennon, Long Ouyang, Jeffrey Wu and Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Keywords Abstract Paper

Small but Mighty: New Benchmarks for Split and Rephrase

Li Zhang, Huaiyu Zhu, Siddhartha Brahma, Yunyao Li

Keywords Abstract Paper

text task, fine-grained evaluation, automatic process, rule-based model

Joint Modelling of Emotion and Abusive Language Detection

Keywords Paper

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara and
Chenhui Chu, Yuki Arase

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Wenqing Chen, Jidong Tian, Liqiang Xiao and
Hao He, Yaohui Jin

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Lingxiao Wang, Jing Huang, Kevin Huang and
Ziniu Hu, Guangtao Wang, Quanquan Gu

Keywords Paper

Keywords Paper

Keywords Paper

Dandan Huang, Leyang Cui, Sen Yang and
Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Nisan Stiennon, Long Ouyang, Jeffrey Wu and
Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Pengyu Cheng, Weituo Hao, Siyang Yuan and
Shijing Si, Lawrence Carin

Keywords Paper

Keywords Paper

Xin Dong, Yaxin Zhu, Yupeng Zhang and
Zuohui Fu, Dongkuan Xu, Sen Yang, Gerard Melo

Keywords Paper

Keywords Paper

Keywords Paper

Yuchen Lu, Soumye Singhal, Florian Strub and
Olivier Pietquin, Aaron Courville

Keywords Paper

Keywords Paper