A Mutual Information Maximization Perspective of Language Representation Learning

26/04/2020

A Mutual Information Maximization Perspective of Language Representation Learning

Lingpeng Kong, Cyprien de Masson d'Autume, Lei Yu, Wang Ling, Zihang Dai, Dani Yogatama

Keywords:

Abstract Paper Similar Papers

Abstract: We show state-of-the-art word representation learning methods maximize an objective function that is a lower bound on the mutual information between different parts of a word sequence (i.e., a sentence). Our formulation provides an alternative perspective that unifies classical word embedding models (e.g., Skip-gram) and modern contextual embeddings (e.g., BERT, XLNet). In addition to enhancing our theoretical understanding of these methods, our derivation leads to a principled framework that can be used to construct new self-supervised tasks. We provide an example by drawing inspirations from related methods based on mutual information maximization that have been successful in computer vision, and introduce a simple self-supervised objective that maximizes the mutual information between a global sentence representation and n-grams in the sentence. Our analysis offers a holistic view of representation learning methods to transfer knowledge and translate progress across multiple domains (e.g., natural language processing, computer vision, audio processing).

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

16/11/2020

DualTKB: A Dual Learning Bridge between Text and Knowledge Base

Pierre Dognin, Igor Melnyk, Inkit Padhi and
Cicero Nogueira dos Santos, Payel Das

Keywords Paper

kb conversion, dual approach, generative models, weak supervision

0

0

0

0

11:47

14/06/2020

Semi-Supervised Semantic Segmentation With Cross-Consistency Training

Yassine Ouali, Céline Hudelot, Myriam Tami

Keywords Paper

semantic segmentation, semi-supervised learning, consistency training, semi-supervised semantic segmentation

0

0

0

0

1:01

26/08/2020

Context Mover's Distance & Barycenters: Optimal Transport of Contexts for Building Representations

Sidak Pal Singh, Andreas Hug, Aymeric Dieuleveut, Martin Jaggi

Keywords Paper

0

0

0

0

14:15

06/12/2021

CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation

Ankit Singh

Keywords Paper

domain adaptation, contrastive learning

0

0

0

0

6:24

06/12/2020

Variational Interaction Information Maximization for Cross-domain Disentanglement

HyeongJoo Hwang, Geon-Hyeong Kim, Seunghoon Hong, Kee-Eung Kim

Keywords Paper

Classification, Few-Shot Learning, Missing Data, Network Analysis, Adversarial Learning

0

0

0

0

3:28

16/11/2020

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong

Keywords Paper

document-level translation, translations, document-level model, selection module

0

0

0

0

11:36

18/07/2021

Robust Representation Learning via Perceptual Similarity Metrics

Saeid A Taghanaki, Kristy Choi, Amir Hosein Khasahmadi, Anirudh Goyal

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

4:48

19/04/2021

Meta-learning for effective multi-task and multilingual modelling

Ishan Tarunesh, Sushil Khyalia, Vishwajeet Kumar and
Ganesh Ramakrishnan, Preethi Jyothi

Keywords Paper

0

0

0

0

8:19

06/12/2021

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Hankook Lee, Kibok Lee, Kimin Lee and
Honglak Lee, Jinwoo Shin

Keywords Paper

self-supervised learning, contrastive learning, representation learning, transfer learning

0

0

0

0

10:10

26/04/2020

Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework

Zirui Wang, Jiateng Xie, Ruochen Xu and
Yiming Yang, Graham Neubig, Jaime G. Carbonell

Keywords Paper

Cross-lingual Representation

0

0

0

0

4:53

06/12/2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen and
Xiaoyan Sun, Houqiang Li

Keywords Paper

0

0

0

0

10:51

22/11/2021

Domain Attention Consistency for Multi-Source Domain Adaptation

Zhongying Deng, Kaiyang Zhou, Yongxin Yang, Tao Xiang

Keywords Paper

Transferable Attribute Learning, Domain Attention Consistency, Multi-Source Domain Adaptation

0

0

0

0

9:24

06/12/2021

Variational Multi-Task Learning with Gumbel-Softmax Priors

Jiayi Shen, Xiantong Zhen, Marcel Worring, Ling Shao

Keywords Paper

machine learning, generative model

0

0

0

0

13:09

06/12/2021

Learning Causal Semantic Representation for Out-of-Distribution Prediction

Chang Liu, Xinwei Sun, Jindong Wang and
Haoyue Tang, Tao Li, Tao Qin, Wei Chen, Tie-Yan Liu

Keywords Paper

generative model, domain adaptation, representation learning

0

0

0

0

14:29

05/12/2020

Self-supervised learning for pairwise data refinement

Gustavo Hernandez Abrego, Bowen Liang, Wei Wang and
Zarana Parekh, Yinfei Yang, Yunhsuan Sung

Keywords Paper

0

0

0

0

15:17

02/02/2021

DAST: Unsupervised Domain Adaptation in Semantic Segmentation Based on Discriminator Attention and Self-Training

Fei Yu, Mo Zhang, Hexin Dong and
Sheng Hu, Bin Dong, Li Zhang

Keywords Paper

0

0

0

0

13:43

22/11/2021

C4Net: Contextual Compression and Complementary Combination Network for Salient Object Detection

Hazarapet Tunanyan

Keywords Paper

salient object detection, c4net, excessiveness loss, complementary combination

0

0

0

0

3:04

03/05/2021

CPR: Classifier-Projection Regularization for Continual Learning

Sungmin Cha, Hsiang Hsu, Taebaek Hwang and
Flavio Calmon, Taesup Moon

Keywords Paper

regularization, wide local minima, continual learning

0

0

0

1

5:21

06/12/2020

Self-supervised Co-Training for Video Representation Learning

Tengda Han, Weidi Xie, Andrew Zisserman

Keywords Paper

0

0

0

0

3:08

04/07/2020

Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation

Arya D. McCarthy, Xian Li, Jiatao Gu, Ning Dong

Keywords Paper

Variational Translation, posterior collapse, auxiliary task, uncertainty

0

0

0

0

11:00

18/07/2021

Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Cunxiao Du, Zhaopeng Tu, Jing Jiang

Keywords Paper

Applications, Natural Language Processing

0

0

0

0

17:21

06/12/2021

CAM-GAN: Continual Adaptation Modules for Generative Adversarial Networks

Sakshi Varshney, Vinay Kumar Verma, P. K. Srijith and
Lawrence Carin, Piyush Rai

Keywords Paper

generative model, representation learning, continual learning

0

0

0

0

14:50

26/04/2020

FreeLB: Enhanced Adversarial Training for Natural Language Understanding

Chen Zhu, Yu Cheng, Zhe Gan and
Siqi Sun, Tom Goldstein, Jingjing Liu

Keywords Paper

0

0

0

0

5:26

26/04/2020

Improving Neural Language Generation with Spectrum Control

Lingxiao Wang, Jing Huang, Kevin Huang and
Ziniu Hu, Guangtao Wang, Quanquan Gu

Keywords Paper

0

0

0

0

4:58

02/02/2021

Learning a Few-shot Embedding Model with Contrastive Learning

Chen Liu, Yanwei Fu, Chengming Xu and
Siqian Yang, Jilin Li, Chengjie Wang, Li Zhang

Keywords Paper

0

0

0

0

15:02

02/02/2021

Exploring Auxiliary Reasoning Tasks for Task-oriented Dialog Systems with Meta Cooperative Learning

Bowen Qin, Min Yang, Lidong Bing and
Qingshan Jiang, Chengming Li, Ruifeng Xu

Keywords Paper

0

0

0

0

15:41

03/05/2021

Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers

Ben Eysenbach, Shreyas Chaudhari, Swapnil Asawa and
Sergey Levine, Ruslan Salakhutdinov

Keywords Paper

reinforcement learning, domain adaptation, transfer learning

0

0

0

0

4:31

18/07/2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee and
Sergey Levine, Shixiang Gu

Keywords Paper

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

0

0

0

0

5:16

06/12/2021

HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning

Shiming Chen, Guosen Xie, Yang Liu and
Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao

Keywords Paper

generative model, domain adaptation

0

0

0

0

9:19

06/12/2021

Contrastively Disentangled Sequential Variational Autoencoder

Junwen Bai, Weiran Wang, Carla Gomes

Keywords Paper

self-supervised learning, generative model, contrastive learning, representation learning, interpretability

0

0

0

0

12:53

06/12/2021

Aligning Pretraining for Detection via Object-Level Contrastive Learning

Fangyun Wei, Yue Gao, Zhirong Wu and
Han Hu, Stephen Lin

Keywords Paper

vision, contrastive learning, representation learning, transfer learning

0

0

0

0

10:23

01/07/2020

Adversarial Training for Commonsense Inference

Lis Pereira, Xiaodong Liu, Fei Cheng and
Masayuki Asahara, Ichiro Kobayashi

Keywords Paper

0

0

0

0

5:09

06/12/2021

Predicting What You Already Know Helps: Provable Self-Supervised Learning

Jason Lee, Qi Lei, Nikunj Saunshi, JIACHENG ZHUO

Keywords Paper

theory, self-supervised learning, representation learning

0

0

0

0

14:51

02/02/2021

Self-Supervised Attention-Aware Reinforcement Learning

Haiping Wu, Khimya Khetarpal, Doina Precup

Keywords Paper

0

0

0

0

14:04

03/05/2021

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu and
Luxi Xing, Heng Yu, Weihua Luo

Keywords Paper

hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

0

0

0

0

3:51

02/11/2020

Guided multi-branch learning systems for sound event detection with sound separation

Yuxin Huang, Liwei Lin, Shuo Ma and
Xiangdong Wang, Hong Liu, Yueliang Qian, Min Liu, Kazushige Ouchi

Keywords Paper

0

0

0

0

12:52

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Paper

0

0

0

0

19:54

25/04/2020

A Design Engineering Approach for Quantitatively Exploring Context-Aware Sentence Retrieval for Nonspeaking Individuals with Motor Disabilities

Per Ola Kristensson, James Lilley, Rolf Black, Annalu Waller

Keywords Paper

augmentative and alternative communication, design engineering, text entry, context-aware text entry, sentence prediction, information retrieval

0

0

0

0

14:57

16/11/2020

Semi-Supervised Bilingual Lexicon Induction with Two-way Interaction

Xu Zhao, Zihao Wang, Hao Wu, Yong Zhang

Keywords Paper

bilingual induction, prior transport, semi-supervision, bli

0

0

0

0

11:27

12/07/2020

Learning Autoencoders with Relational Regularization

Hongteng Xu, Dixin Luo, Ricardo Henao and
Svati Shah, Lawrence Carin

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

13:59