Estimating Mutual Information Between Dense Word Embeddings

04/07/2020

Estimating Mutual Information Between Dense Word Embeddings

Vitalii Zhelezniak, Aleksandar Savkov, Nils Hammerla

Keywords: Estimating Information, unsupervised tasks, Dense Embeddings, statistical view

Abstract Paper Similar Papers

Abstract: Word embedding-based similarity measures are currently among the top-performing methods on unsupervised semantic textual similarity (STS) tasks. Recent work has increasingly adopted a statistical view on these embeddings, with some of the top approaches being essentially various correlations (which include the famous cosine similarity). Another excellent candidate for a similarity measure is mutual information (MI), which can capture arbitrary dependencies between the variables and has a simple and intuitive expression. Unfortunately, its use in the context of dense word embeddings has so far been avoided due to difficulties with estimating MI for continuous data. In this work we go through a vast literature on estimating MI in such cases and single out the most promising methods, yielding a simple and elegant similarity measure for word embeddings. We show that mutual information is a viable alternative to correlations, gives an excellent signal that correlates well with human judgements of similarity and rivals existing state-of-the-art unsupervised methods.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Have We Solved The Hard Problem? It’s Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence

Wenqiang Lei, Yisong Miao, Runpeng Xie and
Bonnie Webber, Meichun Liu, Tat-Seng Chua, Nancy F. Chen

Keywords Paper

0

0

0

0

18:55

16/11/2020

When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models

Changlong Yu, Jialong Han, Peifeng Wang and
Yangqiu Song, Hongming Zhang, Wilfred Ng, Shuming Shi

Keywords Paper

hypernymy detection, pattern-based ones, distributional methods, pattern-based model

0

0

0

0

11:56

16/11/2020

Word Rotator's Distance

Sho Yokoi, Ryo Takahashi, Reina Akama and
Jun Suzuki, Kentaro Inui

Keywords Paper

assessing similarity, vector converter, word alignment, alignment-based approaches

0

0

0

0

11:32

19/04/2021

Framing word sense disambiguation as a multi-label problem for model-agnostic knowledge integration

Simone Conia, Roberto Navigli

Keywords Paper

0

0

0

0

6:38

16/11/2020

An Unsupervised Sentence Embedding Method by Mutual Information Maximization

Yan Zhang, Ruidan He, Zuozhu Liu and
Kwan Hui Lim, Lidong Bing

Keywords Paper

sentence-pair tasks, clustering, semantic search, downstream tasks

0

0

0

0

12:22

16/11/2020

A Simple Approach to Learning Unsupervised Multilingual Embeddings

Pratik Jawanpuria, Mayank Meghwanshi, Bamdev Mishra

Keywords Paper

learning alignment, unsupervised alignment, bilingual induction, cross-lingual similarity

0

0

0

0

5:03

26/04/2020

Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness

Yuexiang Zhai, Hermish Mehta, Zhengyuan Zhou, Yi Ma

Keywords Paper

L4-norm Maximization, Robust Dictionary Learning

0

0

0

0

5:05

16/11/2020

Wasserstein Distance Regularized Sequence Representation for Text Matching in Asymmetrical Domains

Weijie Yu, Chen Xu, Jun Xu and
Liang Pang, Xiaopeng Gao, Xiaozhao Wang, Ji-Rong Wen

Keywords Paper

real-world practices, text matching, matching models, match method

0

0

0

0

11:43

02/02/2021

Matching on Sets: Conquer Occluded Person Re-identification Without Alignment

Mengxi Jia, Xinhua Cheng, Yunpeng Zhai and
Shijian Lu, Siwei Ma, Yonghong Tian, Jian Zhang

Keywords Paper

0

0

0

0

15:02

05/01/2021

Saliency Driven Perceptual Image Compression

Yash Patel, Srikar Appalaraju, R. Manmatha

Keywords Paper

0

0

0

0

4:58

16/11/2020

Semantic Label Smoothing for Sequence to Sequence Problems

Michal Lukasik, Himanshu Jain, Aditya Menon and
Seungyeon Kim, Srinadh Bhojanapalli, Felix Yu, Sanjiv Kumar

Keywords Paper

classification, label de-noising, seqseq settings, machine translation

0

0

0

0

7:33

19/04/2021

Unsupervised extractive summarization using pointwise mutual information

Vishakh Padmakumar, He He

Keywords Paper

0

0

0

0

7:05

06/12/2021

Sliced Mutual Information: A Scalable Measure of Statistical Dependence

Ziv Goldfeld, Kristjan Greenewald

Keywords Paper

theory, machine learning

0

0

0

0

13:59

14/06/2020

Robust Reference-Based Super-Resolution With Similarity-Aware Deformable Convolution

Gyumin Shim, Jinsun Park, In So Kweon

Keywords Paper

reference-based super-resolution, self-similarity super-resolution, deformable convolution, non-local block, single-image super-resolution, perceptual-oriented super-resolution

0

0

0

0

1:01

18/07/2021

Which transformer architecture fits my data? A vocabulary bottleneck in self-attention

Noam Wies, Yoav Levine, Daniel Jannai, Amnon Shashua

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

5:11

02/02/2021

Locate Globally, Segment Locally: A Progressive Architecture With Knowledge Review Network for Salient Object Detection

Binwei Xu, Haoran Liang, Ronghua Liang, Peng Chen

Keywords Paper

0

0

0

0

18:32

03/05/2021

Support-set bottlenecks for video-text representation learning

Mandela Patrick, Po-Yao Huang, Yuki Asano and
Florian Metze, Alexander G Hauptmann, Joao F. Henriques, Andrea Vedaldi

Keywords Paper

contrastive learning, video-text learning, multi-modal learning, video representation learning

0

0

0

0

6:40

19/08/2021

CIMON: Towards High-quality Hash Codes

Xiao Luo, Daqing Wu, Zeyu Ma and
Chong Chen, Minghua Deng, Jinwen Ma, Zhongming Jin, Jianqiang Huang, Xian-Sheng Hua

Keywords Paper

Computer Vision, Recognition, Information Retrieval

0

0

0

0

14:20

19/08/2021

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

Guozhi Tang, Lele Xie, Lianwen Jin and
Jiapeng Wang, Jingdong Chen, Zhen Xu, Qianying Wang, Yaqiang Wu, Hui Li

Keywords Paper

Computer Vision, Language and Vision, Structural and Model-Based Approaches, Knowledge Representation and Reasoning, Information Extraction

0

0

0

0

14:33

13/04/2021

Exploiting equality constraints in causal inference

Chi Zhang, Carlos Cinelli, Bryant Chen, Judea Pearl

Keywords Paper

0

0

0

0

3:02

02/02/2021

Exploiting Relationship for Complex-scene Image Generation

Tianyu Hua, Hongdong Zheng, Yalong Bai and
Wei Zhang, Xiao-Ping Zhang, Tao Mei

Keywords Paper

0

0

0

0

15:01

02/02/2021

Rejection Sampling for Weighted Jaccard Similarity Revisited

Xiaoyun Li, Ping Li

Keywords Paper

0

0

0

0

19:21

06/12/2021

HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning

Shiming Chen, Guosen Xie, Yang Liu and
Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao

Keywords Paper

generative model, domain adaptation

0

0

0

0

9:19

08/12/2020

Manifold Learning-based Word Representation Refinement Incorporating Global and Local Information

Wenyu Zhao, Dong Zhou, Lin Li, Jinjun Chen

Keywords Paper

0

0

0

0

14:59

04/07/2020

How Does Selective Mechanism Improve Self-Attention Networks?

Xinwei Geng, Longyue Wang, Xing Wang and
Bing Qin, Ting Liu, Zhaopeng Tu

Keywords Paper

NLP tasks, natural inference, semantic labelling, machine translation

0

0

0

0

11:43

26/04/2020

Data-dependent Gaussian Prior Objective for Language Generation

Zuchao Li, Rui Wang, Kehai Chen and
Masso Utiyama, Eiichiro Sumita, Zhuosheng Zhang, Hai Zhao

Keywords Paper

Gaussian Prior Objective, Language Generation

0

0

0

0

14:27

16/11/2020

A Bilingual Generative Transformer for Semantic Sentence Embedding

John Wieting, Graham Neubig, Taylor Berg-Kirkpatrick

Keywords Paper

source separation, semantic encoding, data distributions, unsupervised evaluations

0

0

0

0

14:32

19/10/2020

Intent-driven similarity in e-commerce listings

Gilad Fuchs, Yoni Acriche, Idan Hasson, Pavel Petrov

Keywords Paper

machine learning, e-commerce, sentence similarity

0

0

0

0

9:57

16/11/2020

Sparse Text Generation

Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Keywords Paper

story completion, dialogue generation, text generators, language models

0

0

0

0

11:27

04/07/2020

Improving Candidate Generation for Low-resource Cross-lingual Entity Linking

Shuyan Zhou, Shruti Rijhwani, John Wieting and
Jaime Carbonell, Graham Neubig

Keywords Paper

Candidate Generation, Low-resource Linking, Cross-lingual linking, Cross-lingual XEL

0

0

0

0

12:03

07/09/2020

Advancing weakly supervised cross-domain alignment with optimal transport

Siyang Yuan, Ke Bai, Liqun Chen and
Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin Duke

Keywords Paper

Optimal Transport, Cross Domain Alignment

0

0

0

0

10:04

08/12/2020

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti and
Anna Korhonen, Goran Glavaš

Keywords Paper

0

0

0

0

13:01

30/11/2020

Scale-Aware Polar Representation for Arbitrarily-Shaped Text Detection

Yanguang Bi, Zhiqiang Hu

Keywords Paper

0

0

0

0

9:56

14/06/2020

Attention-Guided Hierarchical Structure Aggregation for Image Matting

Yu Qiao, Yuhao Liu, Xin Yang and
Dongsheng Zhou, Mingliang Xu, Qiang Zhang, Xiaopeng Wei

Keywords Paper

image matting, attention, hierarchical, aggregation, appearance cues

0

0

0

0

0:59

05/01/2021

The Devil Is in the Boundary: Exploiting Boundary Representation for Basis-Based Instance Segmentation

Myungchul Kim, Sanghyun Woo, Dahun Kim, In So Kweon

Keywords Paper

0

0

0

0

4:47

14/06/2020

Multi-Modality Cross Attention Network for Image and Sentence Matching

Xi Wei, Tianzhu Zhang, Yan Li and
Yongdong Zhang, Feng Wu

Keywords Paper

cross modal, retrieval, transformer, attention, intra-modality, inter-modality

0

0

0

0

0:59

19/04/2021

Generative text modeling through short run inference

Bo Pang, Erik Nijkamp, Tian Han, Ying Nian Wu

Keywords Paper

0

0

0

0

7:55

06/12/2020

Distributed Training with Heterogeneous Data: Bridging Median- and Mean-Based Algorithms

Xiangyi Chen, Tiancong Chen, Haoran Sun and
Steven Wu, Mingyi Hong

Keywords Paper

0

0

0

0

3:19

04/07/2020

Extracting Headless MWEs from Dependency Parse Trees: Parsing, Tagging, and Joint Modeling Approaches

Tianze Shi, Lillian Lee

Keywords Paper

parsing, tagging, predicting MWEs, identifying MWEs

0

0

0

0

11:55

06/12/2021

Towards Sample-Optimal Compressive Phase Retrieval with Sparse and Generative Priors

Zhaoqiang Liu, Subhroshekhar Ghosh, Jonathan Scarlett

Keywords Paper

theory, optimization, generative model

0

0

0

0

10:41