Mutual Information Gradient Estimation for Representation Learning

26/04/2020

Mutual Information Gradient Estimation for Representation Learning

Liangjian Wen, Yiji Zhou, Lirong He, Mingyuan Zhou, Zenglin Xu

Keywords: Mutual Information, Score Estimation, Representation Learning, Information Bottleneck

Abstract Paper Similar Papers

Abstract: Mutual Information (MI) plays an important role in representation learning. However, MI is unfortunately intractable in continuous and high-dimensional settings. Recent advances establish tractable and scalable MI estimators to discover useful representation. However, most of the existing methods are not capable of providing an accurate estimation of MI with low-variance when the MI is large. We argue that directly estimating the gradients of MI is more appealing for representation learning than estimating MI in itself. To this end, we propose the Mutual Information Gradient Estimator (MIGE) for representation learning based on the score estimation of implicit distributions. MIGE exhibits a tight and smooth gradient estimation of MI in the high-dimensional and large-MI settings. We expand the applications of MIGE in both unsupervised learning of deep representations based on InfoMax and the Information Bottleneck method. Experimental results have indicated significant performance improvement in learning useful representation.

1

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

26/04/2020

On Mutual Information Maximization for Representation Learning

Michael Tschannen, Josip Djolonga, Paul K. Rubenstein and
Sylvain Gelly, Mario Lucic

Keywords Paper

mutual information, representation learning, unsupervised learning, self-supervised learning

0

0

0

0

4:40

06/12/2020

Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation

Yogesh Balaji, Rama Chellappa, Soheil Feizi

Keywords Paper

0

0

0

0

3:24

12/07/2020

Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?

Kei Ota, Tomoaki Oiki, Devesh Jha and
Toshisada Mariyama, Daniel Nikovski

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:55

02/02/2021

DIBS: Diversity Inducing Information Bottleneck in Model Ensembles

Samarth Sinha, Homanga Bharadhwaj, Anirudh Goyal and
Hugo Larochelle, Animesh Garg, Florian Shkurti

Keywords Paper

0

0

0

0

16:26

03/05/2021

Share or Not? Learning to Schedule Language-Specific Capacity for Multilingual Translation

Biao Zhang, Ankur Bapna, Rico Sennrich, Orhan Firat

Keywords Paper

multilingual transformer, multilingual translation, language-specific modeling, conditional computation

0

0

0

0

15:04

16/11/2020

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning

Yuning Mao, Yanru Qu, Yiqing Xie and
Xiang Ren, Jiawei Han

Keywords Paper

single-document summarization, single-document sds, multi-document summarization, multi-document mds

0

0

0

0

10:58

06/12/2020

Hausdorff Dimension, Heavy Tails, and Generalization in Neural Networks

Umut Simsekli, Ozan Sener, George Deligiannidis, Murat Erdogdu

Keywords Paper

Deep Learning -> Supervised Deep Networks, Deep Learning -> Embedding Approaches

0

0

0

0

3:32

12/07/2020

Regularized Optimal Transport is Ground Cost Adversarial

François-Pierre Paty, Marco Cuturi

Keywords Paper

General Machine Learning Techniques

1

0

1

1

13:26

06/12/2021

Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark

Alexander Korotin, Lingxiao Li, Aude Genevay and
Justin Solomon, Alexander Filippov, Evgeny Burnaev

Keywords Paper

deep learning, machine learning, generative model, optimal transport

0

0

0

0

13:39

13/04/2021

Beyond marginal uncertainty: How accurately can bayesian regression models estimate posterior predictive correlations?

Chaoqi Wang, Shengyang Sun, Roger Grosse

Keywords Paper

0

0

0

0

3:01

26/04/2020

Understanding the Limitations of Variational Mutual Information Estimators

Jiaming Song, Stefano Ermon

Keywords Paper

0

0

0

0

5:04

06/12/2021

Network-to-Network Regularization: Enforcing Occam's Razor to Improve Generalization

Rohan Ghosh, Mehul Motani

Keywords Paper

theory, deep learning, machine learning

0

0

0

0

14:07

06/12/2021

A Faster Decentralized Algorithm for Nonconvex Minimax Problems

Wenhan Xian, Feihu Huang, Yanfu Zhang, Heng Huang

Keywords Paper

optimization, machine learning, adversarial robustness and security

0

0

0

0

13:59

13/04/2021

Federated learning with compression: Unified analysis and sharp guarantees

Farzin Haddadpour, Mohammad Mahdi Kamani, Aryan Mokhtari, Mehrdad Mahdavi

Keywords Paper

0

0

0

0

3:03

12/07/2020

LTF: A Label Transformation Framework for Correcting Label Shift

Jiaxian Guo, Mingming Gong, Tongliang Liu and
Kun Zhang, Dacheng Tao

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

13:15

06/12/2021

Predicting Deep Neural Network Generalization with Perturbation Response Curves

Yair Schiff, Brian Quanz, Payel Das, Pin-Yu Chen

Keywords Paper

deep learning

0

0

0

0

11:13

18/07/2021

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras, Thibault Séjourné, Rémi Flamary, Nicolas Courty

Keywords Paper

Algorithms, Optimal Transport

0

0

0

2

4:57

26/04/2020

Stochastic AUC Maximization with Deep Neural Networks

Mingrui Liu, Zhuoning Yuan, Yiming Ying, Tianbao Yang

Keywords Paper

Stochastic AUC Maximization, Deep Neural Networks

0

0

0

0

4:58

03/05/2021

Neural Topic Model via Optimal Transport

He Zhao, Dinh Phung, Viet Huynh and
Trung Le, Wray Buntine

Keywords Paper

optimal transport, document analysis, topic modelling

0

0

0

1

9:29

18/07/2021

Self Normalizing Flows

T. Anderson Keller, Jorn Peters, Priyank Jaini and
Emiel Hoogeboom, Patrick Forré, Max Welling

Keywords Paper

Deep Learning, Generative Models

0

1

1

0

4:24

26/08/2020

Learning with minibatch Wasserstein : asymptotic and gradient properties

Kilian Fatras, Younès Zine, Rémi Flamary and
Remi Gribonval, Nicolas Courty

Keywords Paper

0

0

0

1

12:59

06/12/2021

$(\textrm{Implicit})^2$: Implicit Layers for Implicit Representations

Zhichun Huang, Shaojie Bai, J. Zico Kolter

Keywords Paper

deep learning, representation learning

1

0

0

1

12:23

14/06/2020

Forward and Backward Information Retention for Accurate Binary Neural Networks

Haotong Qin, Ruihao Gong, Xianglong Liu and
Mingzhu Shen, Ziran Wei, Fengwei Yu, Jingkuan Song

Keywords Paper

model compression, binary neural networks, deep learning, quantization, computer vision

0

0

0

0

1:00

02/02/2021

GLISTER: Generalization based Data Subset Selection for Efficient and Robust Learning

Krishnateja Killamsetty, Durga Sivasubramanian, Ganesh Ramakrishnan, Rishabh Iyer

Keywords Paper

0

0

0

0

19:14

02/02/2021

Interpreting Neural Networks as Quantitative Argumentation Frameworks

Nico Potyka

Keywords Paper

0

0

0

0

20:29

07/09/2020

Few-Shot Learning with Complex-valued Neural Networks

Zhen Liu, Baochang Zhang, Guodong Guo

Keywords Paper

few-shot learning, complex-valued network, metric-learning, image classification

0

0

0

0

7:15

03/05/2021

Exploring the Uncertainty Properties of Neural Networks’ Implicit Priors in the Infinite-Width Limit

Ben Adlam, Jaehoon Lee, Lechao Xiao and
Jeffrey Pennington, Jasper Snoek

Keywords Paper

Deep Learning, Bayesian Neural Networks, Neural Network Gaussian Process, Infinite-Width Limit, Uncertainty, Gaussian Process

0

0

0

0

4:34

18/07/2021

Learning Generalized Intersection Over Union for Dense Pixelwise Prediction

Jiaqian Yu, Jingtao Xu, Yiwei Chen and
Weiming Li, Qiang Wang, ByungIn Yoo, Jae-Joon Han

Keywords Paper

Applications, Computer Vision

0

0

0

0

5:10

07/09/2020

Learning Effectively from Noisy Supervision for Weakly Supervised Semantic Segmentation

Wenbin Xie, Qiaoqiao Wei, Zheng Li, Hui Zhang

Keywords Paper

Semantic Segmentation, Weakly Supervised Semantic Segmentation, Self Attention

0

0

0

0

3:46

26/04/2020

Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks

Wei Hu, Lechao Xiao, Jeffrey Pennington

Keywords Paper

deep learning theory, non-convex optimization, orthogonal initialization

0

0

0

0

5:10

06/12/2020

Neural Methods for Point-wise Dependency Estimation

Yao-Hung Hubert Tsai, Han Zhao, Makoto Yamada and
LP Morency, Russ Salakhutdinov

Keywords Paper

0

0

0

0

3:21

06/12/2021

Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning

ZHENHUAN YANG, Yunwen Lei, Puyu Wang and
Tianbao Yang, Yiming Ying

Keywords Paper

optimization, machine learning, privacy

0

0

0

0

14:40

06/12/2020

Improved Analysis of Clipping Algorithms for Non-convex Optimization

Bohang Zhang, Jikai Jin, Cong Fang, Liwei Wang

Keywords Paper

0

0

0

0

3:16

18/07/2021

Outlier-Robust Optimal Transport

Debarghya Mukherjee, Aritra Guha, Justin Solomon and
Yuekai Sun, Mikhail Yurochkin

Keywords Paper

Algorithms, Meta-Learning, Algorithms, Few-Shot Learning, Algorithms, Optimal Transport

0

0

1

1

4:46

02/02/2021

Distribution Adaptive INT8 Quantization for Training CNNs

Kang Zhao, Sida Huang, Pan Pan and
Yinghan Li, Yingya Zhang, Zhenyu Gu, Yinghui Xu

Keywords Paper

0

0

0

0

16:42

14/06/2020

Closed-Loop Matters: Dual Regression Networks for Single Image Super-Resolution

Yong Guo, Jian Chen, Jingdong Wang and
Qi Chen, Jiezhang Cao, Zeshuai Deng, Yanwu Xu, Mingkui Tan

Keywords Paper

computer vision, image super-resolution, dual regression scheme, closed-loop

0

0

0

0

1:01

14/06/2020

Moving in the Right Direction: A Regularization for Deep Metric Learning

Deen Dayal Mohan, Nishant Sankaran, Dennis Fedorishin and
Srirangaraj Setlur, Venu Govindaraju

Keywords Paper

deep metric learning, regularization, image retrieval.

0

0

0

0

1:00

06/12/2021

Sliced Mutual Information: A Scalable Measure of Statistical Dependence

Ziv Goldfeld, Kristjan Greenewald

Keywords Paper

theory, machine learning

0

0

0

0

13:59

12/07/2020

Distance Metric Learning with Joint Representation Diversification

Xu Chu, Yang Lin, Xiting Wang and
Xin Gao, Qi Tong, Hailong Yu, Yasha Wang

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:32

06/12/2021

EF21: A New, Simpler, Theoretically Better, and Practically Faster Error Feedback

Peter Richtarik, Igor Sokolov, Ilyas Fatkhullin

Keywords Paper

optimization, machine learning

0

0

0

0

19:56