Variational Hetero-Encoder Randomized GANs for Joint Image-Text Modeling

26/04/2020

Hao Zhang, Bo Chen, Long Tian, Zhengjue Wang, Mingyuan Zhou

Keywords: Deep topic model, image generation, text generation, raster-scan-GAN, zero-shot learning

Abstract: For bidirectional joint image-text modeling, we develop variational hetero-encoder (VHE) randomized generative adversarial network (GAN), a versatile deep generative model that integrates a probabilistic text decoder, probabilistic image encoder, and GAN into a coherent end-to-end multi-modality learning framework. VHE randomized GAN (VHE-GAN) encodes an image to decode its associated text, and feeds the variational posterior as the source of randomness into the GAN image generator. We plug three off-the-shelf modules, including a deep topic model, a ladder-structured image encoder, and StackGAN++, into VHE-GAN, which already achieves competitive performance. This further motivates the development of VHE-raster-scan-GAN that generates photo-realistic images in not only a multi-scale low-to-high-resolution manner, but also a hierarchical-semantic coarse-to-fine fashion. By capturing and relating hierarchical semantic and visual concepts with end-to-end training, VHE-raster-scan-GAN achieves state-of-the-art performance in a wide variety of image-text multi-modality learning and generation tasks.
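The data flow described in the abstract (encode an image, decode its text from the latent code, and reuse the same posterior sample as the generator's noise) can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the linear "networks", the sizes, the Gaussian posterior, and the function names are hypothetical stand-ins, not the deep topic model, ladder-structured encoder, or StackGAN++ modules the authors plug in.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy sizes, chosen only for illustration.
IMG_DIM, LATENT_DIM, VOCAB = 64, 8, 50

# Hypothetical random linear maps standing in for deep networks.
W_mu = rng.normal(scale=0.1, size=(IMG_DIM, LATENT_DIM))
W_logvar = rng.normal(scale=0.1, size=(IMG_DIM, LATENT_DIM))
W_text = rng.normal(scale=0.1, size=(LATENT_DIM, VOCAB))
W_gen = rng.normal(scale=0.1, size=(LATENT_DIM, IMG_DIM))


def encode_image(x):
    """Image encoder: parameters of the variational posterior q(z | image)."""
    return x @ W_mu, x @ W_logvar


def sample_posterior(mu, logvar):
    """Reparameterized draw from q(z | image); the Gaussian form is an assumption."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps


def decode_text(z):
    """Text decoder: softmax word probabilities over a bag-of-words vocabulary."""
    logits = z @ W_text
    logits = logits - logits.max(axis=-1, keepdims=True)  # numerical stability
    p = np.exp(logits)
    return p / p.sum(axis=-1, keepdims=True)


def generate_image(z):
    """Generator: the posterior sample z replaces the usual independent noise."""
    return np.tanh(z @ W_gen)


def forward(x):
    """One pass: image -> posterior sample -> (text distribution, generated image)."""
    mu, logvar = encode_image(x)
    z = sample_posterior(mu, logvar)
    return decode_text(z), generate_image(z)


images = rng.normal(size=(4, IMG_DIM))
word_probs, fake_images = forward(images)
```

The point of the sketch is only that one latent sample feeds both the text decoder and the image generator; training losses, the discriminator, and the multi-scale raster-scan structure are omitted.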

The talk and the corresponding paper were published at the ICLR 2020 virtual conference.

Similar Papers

02/02/2021

Generalized Adversarially Learned Inference

Yatin Dandi, Homanga Bharadhwaj, Abhishek Kumar, Piyush Rai

16:22

02/02/2021

IB-GAN: Disentangled Representation Learning with Information Bottleneck Generative Adversarial Networks

Insu Jeon, Wonkwang Lee, Myeongjang Pyeon, Gunhee Kim

18:10

19/08/2021

AgeFlow: Conditional Age Progression and Regression with Normalizing Flows

Zhizhong Huang, Shouzhen Chen, Junping Zhang, Hongming Shan

Keywords: Computer Vision, Biometrics, Face and Gesture Recognition, Unsupervised Learning, 2D and 3D Computer Vision

8:20

14/06/2020

Learning Fused Pixel and Feature-Based View Reconstructions for Light Fields

Jinglei Shi, Xiaoran Jiang, Christine Guillemot

Keywords: light field, view synthesis, feature-based reconstruction, pixel-based reconstruction, deep learning, angular super-resolution

4:56

30/11/2020

Low-light Color Imaging via Dual Camera Acquisition

Peiyao Guo, Zhan Ma

7:28

12/07/2020

Representation Learning via Adversarially-Contrastive Optimal Transport

Anoop Cherian, Shuchin Aeron

Keywords: Representation Learning

14:47

02/02/2021

F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation

Daizong Liu, Dongdong Yu, Changhu Wang, Pan Zhou

16:59

30/11/2020

Mask-Ranking Network for Semi-Supervised Video Object Segmentation

Wenjing Li, Xiang Zhang, Yujie Hu, Yingqi Tang

5:36

22/11/2021

ExSinGAN: Learning an Explainable Generative Model from a Single Image

Zicheng Zhang, Congying Han, Tiande Guo

Keywords: single image generation, single image generative model, generative adversarial network, image synthesis

3:00

26/04/2020

Neural Outlier Rejection for Self-Supervised Keypoint Learning

Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus

Keywords: Self-Supervised Learning, Keypoint Detection, Outlier Rejection, Deep Learning

4:55

02/02/2021

End-to-End Differentiable Learning to HDR Image Synthesis for Multi-exposure Images

Junghee Kim, Siyeong Lee, Suk-Ju Kang

15:35

06/12/2020

CoMIR: Contrastive Multimodal Image Representation for Registration

Nicolas Pielawski, Elisabeth Wetzer, Johan Öfverstedt, Jiahao Lu, Carolina Wählby, Joakim Lindblad, Natasa Sladoje

2:55

16/11/2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference

Jianguo Zhang, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip Yu, Richard Socher, Caiming Xiong

Keywords: intent detection, detecting intents, oos detection, large-scale task

11:43

02/02/2021

Enhanced Regularizers for Attributional Robustness

Anindya Sarkar, Anirban Sarkar, Vineeth N Balasubramanian

17:30

03/08/2020

Walking on Two Legs: Learning Image Segmentation with Noisy Labels

Guohua Cheng, Hongli Ji, Yan Tian

10:02

06/12/2020

Predictive Information Accelerates Learning in RL

Kuang-Huei Lee, Ian Fischer, Anthony Liu, Yijie Guo, Honglak Lee, John Canny, Sergio Guadarrama

3:10

19/08/2021

EmbedMask: Embedding Coupling for Instance Segmentation

Hui Ying, Zhaojin Huang, Shu Liu, Tianjia Shao, Kun Zhou

Keywords: Computer Vision, Recognition

10:08

22/11/2021

Learning to Predict Convolutional Filters with Guidance for Conditional Image Generation

Lei Chen, Mengyao Zhai, Greg Mori

Keywords: conditional image generation, dynamic network

2:54

18/07/2021

Multi-Dimensional Classification via Sparse Label Encoding

Binbin Jia, Min-Ling Zhang

Keywords: Algorithms, Supervised Learning

16:52

06/12/2020

CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection

Qijian Zhang, Runmin Cong, Junhui Hou, Chongyi Li, Yao Zhao

Keywords: Theory -> Learning Theory

3:14

14/06/2020

Controllable Person Image Synthesis With Attribute-Decomposed GAN

Yifang Men, Yiming Mao, Yuning Jiang, Wei-Ying Ma, Zhouhui Lian

Keywords: image synthesis, pose transfer, generative adversarial networks, image editing, attribute separation, feature disentanglement, fashion ai

4:56

06/12/2021

Aligning Pretraining for Detection via Object-Level Contrastive Learning

Fangyun Wei, Yue Gao, Zhirong Wu, Han Hu, Stephen Lin

Keywords: vision, contrastive learning, representation learning, transfer learning

10:23

06/12/2020

Learning Representations from Audio-Visual Spatial Alignment

Pedro Morgado, Yi Li, Nuno Vasconcelos

3:21

03/05/2021

Disentangled Recurrent Wasserstein Autoencoder

Jun Han, Martin Min, Ligong Han, Li Erran Li, Xuan Zhang

Keywords: Recurrent Generative Model, Sequential Representation Learning, Disentanglement

9:17

22/11/2021

Inter-intra Variant Dual Representations for Self-supervised Video Recognition

Lin Zhang, Qi She, Zhengyang Shen, Changhu Wang

Keywords: video action recognition, self-supervised learning, contrastive learning, representation learning

2:55

06/12/2021

Associating Objects with Transformers for Video Object Segmentation

Zongxin Yang, Yunchao Wei, Yi Yang

Keywords: transformers

12:29

13/04/2021

Learning bijective feature maps for linear ICA

Alexander Camuto, Matthew Willetts, Chris Holmes, Brooks Paige, Stephen Roberts

3:02

22/11/2021

Structured Latent Embeddings for Recognizing Unseen Classes in Unseen Domains

Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N Balasubramanian, Fahad Shahbaz Khan, Ling Shao

Keywords: Zero-Shot, Domain Generalization, multimodal-alignment, domain-invariant, conceptual partition, semantics

2:48

14/06/2020

JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection

Keren Fu, Deng-Ping Fan, Ge-Peng Ji, Qijun Zhao

Keywords: visual saliency, salient object detection, rgb-d, depth information, joint learning, dense connections, multi-modal features, feature fusion, deep learning, encoder-decoder

1:01

06/12/2021

Instance-Conditioned GAN

Arantxa Casanova, Marlene Careil, Jakob Verbeek, Michal Drozdzal, Adriana Romero Soriano

Keywords: generative model

15:23

30/11/2020

Second Order enhanced Multi-glimpse Attention in Visual Question Answering

Qiang Sun, Binghui Xie, Yanwei Fu

7:20

02/02/2021

Contrastive Transformation for Self-supervised Correspondence Learning

Ning Wang, Wengang Zhou, Houqiang Li

13:41

30/11/2020

MIX'EM: Unsupervised Image Classification using a Mixture of Embeddings

Ali Varamesh, Tinne Tuytelaars

6:40

03/05/2021

Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders

Mangal Prakash, Alexander Krull, Florian Jug

Keywords: Variational Autoencoders, Noise model, Unsupervised denoising, Diversity denoising

4:56

14/06/2020

ActBERT: Learning Global-Local Video-Text Representations

Linchao Zhu, Yi Yang

Keywords: actbert, cross-modal pretraining, video and language, transformer, tangled transformer, instructional videos

4:58

06/12/2021

End-to-end Multi-modal Video Temporal Grounding

Yi-Wen Chen, Yi-Hsuan Tsai, Ming-Hsuan Yang

Keywords: self-supervised learning, transformers, vision, contrastive learning

8:46

30/11/2020

OpenGAN: Open Set Generative Adversarial Networks

Luke Ditria, Benjamin J. Meyer, Tom Drummond

10:09

06/12/2021

Data-Efficient GAN Training Beyond (Just) Augmentations: A Lottery Ticket Perspective

Tianlong Chen, Yu Cheng, Zhe Gan, Jingjing Liu, Zhangyang Wang

Keywords: generative model

11:30

30/11/2020

Image Captioning through Image Transformer

Sen He, Wentong Liao, Hamed R. Tavakoli, Michael Yang, Bodo Rosenhahn, Nicolas Pugeault

9:49

06/12/2020

Learning Semantic-aware Normalization for Generative Adversarial Networks

Heliang Zheng, Jianlong Fu, Yanhong Zeng, Jiebo Luo, Zheng-Jun Zha

3:11