Improving Text-to-Image Synthesis Using Contrastive Learning

Abstract: The goal of text-to-image synthesis is to generate a visually realistic image that matches a given text description. In practice, the captions annotated by humans for the same image have large variance in terms of contents and the choice of words. The linguistic discrepancy between the captions of the identical image leads to the synthetic images deviating from the ground truth. To address this issue, we propose a contrastive learning approach to improve the quality and enhance the semantic consistency of synthetic images. In the pretraining stage, we utilize the contrastive learning approach to learn the consistent textual representations for the captions corresponding to the same image. Furthermore, in the following stage of GAN training, we employ the contrastive learning method to enhance the consistency between the generated images from the captions related to the same image. We evaluate our approach over two popular text-to-image synthesis models, AttnGAN and DM-GAN, on datasets CUB and COCO, respectively. Experimental results have shown that our approach can effectively improve the quality of synthetic images in terms of three metrics: IS, FID and R-precision. Especially, on the challenging COCO dataset, our approach boosts the FID significantly by 29.60% over AttnGAn and by 21.96% over DM-GAN.

19/04/2021

representation learning, self-supervised learning, unsupervised learning, discrete representations, bag of visual words, image understanding, deep learning, convolutional neural networks

1:01

06/12/2021

Improving Text-to-Image Synthesis Using Contrastive Learning

Hui Ye, Xiulong Yang, Martin Takac, Rajshekhar Sunderraman, Shihao Ji

Comments

Similar Papers

Crisscrossed captions: Extended intramodal and intermodal semantic similarity judgments for MS-COCO

Zarana Parekh, Jason Baldridge, Daniel Cer and Austin Waters, Yinfei Yang

Keywords Abstract Paper

Each Attribute Matters: Contrastive Attention for Sentence-based Image Editing

Liuqing Zhao, Fan Lyu, Fuyuan Hu and Kaizhu Huang, Fenglei Xu, Linyan Li

Keywords Abstract Paper

Image manipulation, Generation adversarial network

Learning Representations by Predicting Bags of Visual Words

Spyros Gidaris, Andrei Bursuc, Nikos Komodakis and Patrick Pérez, Matthieu Cord

Keywords Abstract Paper

representation learning, self-supervised learning, unsupervised learning, discrete representations, bag of visual words, image understanding, deep learning, convolutional neural networks

Object-aware Contrastive Learning for Debiased Scene Representation

Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn and Chun-Liang Li, Jinwoo Shin

Keywords Abstract Paper

self-supervised learning, contrastive learning, representation learning

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

yaxing wang, Lu Yu, Joost van de Weijer

Keywords Abstract Paper

Algorithms -> Online Learning, Optimization -> Stochastic Optimization

Looking Beyond Single Images for Contrastive Semantic Segmentation Learning

FEIHU ZHANG, Philip Torr, Rene Ranftl, Stephan Richter

Keywords Abstract Paper

machine learning, vision, contrastive learning, representation learning

Visually Grounded Compound PCFGs

Yanpeng Zhao, Ivan Titov

Keywords Abstract Paper

exploiting groundings, language understanding, gradient estimates, fully-differentiable learning

VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer

Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal

Keywords Abstract Paper

language

More Grounded Image Captioning by Distilling Image-Text Matching Model

Yuanen Zhou, Meng Wang, Daqing Liu and Zhenzhen Hu, Hanwang Zhang

Keywords Abstract Paper

grounded image captioning, image-text matching, visual grounding, cross-task knowledge distillation

Meta-learning for effective multi-task and multilingual modelling

Ishan Tarunesh, Sushil Khyalia, Vishwajeet Kumar and Ganesh Ramakrishnan, Preethi Jyothi

Keywords Abstract Paper

Exploiting multimodal reinforcement learning for simultaneous machine translation

Julia Ive, Andy Mingren Li, Yishu Miao and Ozan Caglayan, Pranava Madhyastha, Lucia Specia

Keywords Abstract Paper

L2C: Describing visual differences needs semantic understanding of individuals

An Yan, Xin Wang, Tsu-Jui Fu, William Yang Wang

Keywords Abstract Paper

Self-supervised Pre-training and Contrastive Representation Learning for Multiple-choice Video QA

Seonhoon Kim, Seohyeong Jeong, Eunbyul Kim and Inho Kang, Nojun Kwak

Keywords Abstract Paper

Joint Training for Learning Cross-lingual Embeddings with Sub-word Information without Parallel Corpora

Ali Hakimi Parizi, Paul Cook

Keywords Abstract Paper

A Mutual Information Maximization Perspective of Language Representation Learning

Lingpeng Kong, Cyprien de Masson d'Autume, Lei Yu and Wang Ling, Zihang Dai, Dani Yogatama

Keywords Abstract Paper

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen and Xiaoyan Sun, Houqiang Li

Keywords Abstract Paper

Contrastive Learning with Adversarial Perturbations for Conditional Text Generation

Seanie Lee, Dong Bok Lee, Sung Ju Hwang

Keywords Abstract Paper

contrastive learning, conditional text generation

Learning to Decouple Relations: Few-Shot Relation Classification with Entity-Guided Attention and Confusion-Aware Training

Yingyao Wang, Junwei Bao, Guangyi Liu and Youzheng Wu, Xiaodong He, Bowen Zhou, Tiejun Zhao

Keywords Abstract Paper

Learning to Represent Image and Text with Denotation Graph

Bowen Zhang, Hexiang Hu, Vihan Jain and Eugene Ie, Fei Sha

Keywords Abstract Paper

cross-modal retrieval, referring expression, compositional recognition, pre-training

Cross-Modality Relevance for Reasoning on Language and Vision

Chen Zheng, Quan Guo, Parisa Kordjamshidi

Keywords Abstract Paper

Cross-Modality Relevance, Language Vision, visual answering, VQA

Contrastive Learning with Adversarial Examples

Chih-Hui Ho, Nuno Nvasconcelos

Keywords Abstract Paper

Shape-Texture Debiased Neural Network Training

Yinigwei Li, Qihang Yu, Mingxing Tan and Jieru Mei, Peng Tang, Wei Shen, Alan Yuille, Cihang Xie

Zarana Parekh, Jason Baldridge, Daniel Cer and
Austin Waters, Yinfei Yang

Keywords Paper

Liuqing Zhao, Fan Lyu, Fuyuan Hu and
Kaizhu Huang, Fenglei Xu, Linyan Li

Keywords Paper

Spyros Gidaris, Andrei Bursuc, Nikos Komodakis and
Patrick Pérez, Matthieu Cord

Keywords Paper

Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yuanen Zhou, Meng Wang, Daqing Liu and
Zhenzhen Hu, Hanwang Zhang

Keywords Paper

Ishan Tarunesh, Sushil Khyalia, Vishwajeet Kumar and
Ganesh Ramakrishnan, Preethi Jyothi

Keywords Paper

Julia Ive, Andy Mingren Li, Yishu Miao and
Ozan Caglayan, Pranava Madhyastha, Lucia Specia

Keywords Paper

Keywords Paper

Seonhoon Kim, Seohyeong Jeong, Eunbyul Kim and
Inho Kang, Nojun Kwak

Keywords Paper

Keywords Paper

Lingpeng Kong, Cyprien de Masson d'Autume, Lei Yu and
Wang Ling, Zihang Dai, Dani Yogatama

Keywords Paper

Chaoqun Wang, Shaobo Min, Xuejin Chen and
Xiaoyan Sun, Houqiang Li

Keywords Paper

Keywords Paper

Yingyao Wang, Junwei Bao, Guangyi Liu and
Youzheng Wu, Xiaodong He, Bowen Zhou, Tiejun Zhao

Keywords Paper

Bowen Zhang, Hexiang Hu, Vihan Jain and
Eugene Ie, Fei Sha

Keywords Paper

Keywords Paper

Keywords Paper

Yinigwei Li, Qihang Yu, Mingxing Tan and
Jieru Mei, Peng Tang, Wei Shen, Alan Yuille, Cihang Xie

Keywords Paper

Keywords Paper

Kaihao Zhang, Wenhan Luo, Yiran Zhong and
Lin Ma, Björn Stenger, Wei Liu, Hongdong Li

Keywords Paper

Keywords Paper

Fenglin Liu, Xuancheng Ren, Xian Wu and
Shen Ge, Wei Fan, Yuexian Zou, Xu Sun

Keywords Paper

Chen Zhu, Yu Cheng, Zhe Gan and
Siqi Sun, Tom Goldstein, Jingjing Liu

Keywords Paper

Keywords Paper

Keywords Paper

Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara and
Chenhui Chu, Yuki Arase

Keywords Paper

Bolei Xu, Jingxin Liu, Xianxu Hou and
Bozhi Liu, Guoping Qiu

Keywords Paper

Hongwei Xue, Yupan Huang, Bei Liu and
Houwen Peng, Jianlong Fu, Houqiang Li, Jiebo Luo

Keywords Paper

Keywords Paper

Yu Meng, Chenyan Xiong, Payal Bajaj and
saurabh tiwary, Paul Bennett, Jiawei Han, XIA SONG

Keywords Paper

Keywords Paper

Keywords Paper