ManiGAN: Text-Guided Image Manipulation

14/06/2020

Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip H.S. Torr

Keywords: image manipulation, natural language, generative adversarial networks, gan

Abstract: The goal of our paper is to semantically edit parts of an image matching a given text that describes desired attributes (e.g., texture, colour, and background), while preserving other contents that are irrelevant to the text. To achieve this, we propose a novel generative adversarial network (ManiGAN), which contains two key components: text-image affine combination module (ACM) and detail correction module (DCM). The ACM selects image regions relevant to the given text and then correlates the regions with corresponding semantic words for effective manipulation. Meanwhile, it encodes original image features to help reconstruct text-irrelevant contents. The DCM rectifies mismatched attributes and completes missing contents of the synthetic image. Finally, we suggest a new metric for evaluating image manipulation results, in terms of both the generation of new attributes and the reconstruction of text-irrelevant contents. Extensive experiments on the CUB and COCO datasets demonstrate the superior performance of the proposed method.
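The abstract names a text-image affine combination but does not spell out its form. As a minimal sketch only, assuming the combination is an element-wise scale-and-shift of text-derived hidden features, with the scale and shift predicted from image features, the idea might look like the following. The function name `affine_combine`, the vector shapes, and the toy values are all hypothetical and are not taken from the paper.

```python
from typing import List

Vector = List[float]


def affine_combine(hidden: Vector, scale: Vector, shift: Vector) -> Vector:
    """Element-wise affine combination: hidden * scale + shift.

    Assumption: `hidden` stands in for text-derived features, while `scale`
    and `shift` stand in for values predicted from the image features.
    """
    if not (len(hidden) == len(scale) == len(shift)):
        raise ValueError("all inputs must have the same length")
    return [h * w + b for h, w, b in zip(hidden, scale, shift)]


# Hypothetical toy values, chosen only to make the arithmetic easy to follow.
hidden = [1.0, 2.0, 3.0]
scale = [0.5, 1.0, 0.0]
shift = [0.0, 1.0, 2.0]

combined = affine_combine(hidden, scale, shift)
print(combined)  # [0.5, 3.0, 2.0]
```

In this toy case a scale of 0.0 discards the text feature and keeps only the image-derived shift, which is one way such a module could preserve text-irrelevant content; whether ManiGAN behaves this way is not stated in the abstract.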

Talk and the respective paper are published at the CVPR 2020 virtual conference.

Similar Papers

14/06/2020

STEFANN: Scene Text Editor Using Font Adaptive Neural Network

Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal

Keywords: scene image, scene text editor, font adaptive, font generation, font color transfer, single observation, computer vision, deep learning

1:00

22/11/2021

FacialGAN: Style Transfer and Attribute Manipulation on Synthetic Faces

Ricard Durall Lopez, Jireh Jam, Dominik Strassel, Moi Hoon Yap, Janis Keuper

Keywords: GAN, attribute manipulation, style transfer, face editing

2:55

06/12/2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen, Xiaoyan Sun, Houqiang Li

10:51

06/12/2020

Generative View Synthesis: From Single-view Semantics to Novel-view Images

Tewodros Amberbir Habtegebrial, Varun Jampani, Orazio Gallo, Didier Stricker

3:20

07/09/2020

Robust Scene Text Recognition Through Adaptive Image Enhancement

Ye Qian, Yuyang Wang, Feng Su

Keywords: text recognition, image enhancement, spatial rectification, end-to-end, scene text

7:50

14/06/2020

Show, Edit and Tell: A Framework for Editing Image Captions

Fawaz Sammani, Luke Melas-Kyriazi

Keywords: image captioning, image description, editing captions, sequence editing, copy mechanism, adaptive copy mechanism, selecting mechanism, copy lstm

1:01

14/06/2020

Predicting Sharp and Accurate Occlusion Boundaries in Monocular Depth Estimation Using Displacement Fields

Michaël Ramamonjisoa, Yuming Du, Vincent Lepetit

Keywords: monocular depth estimation, occlusion boundaries, occlusion contours, computer vision, deep learning

1:00

22/11/2021

SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder

Jonghyun Kim, Gen Li, Cheolkon Jung, Joongkyu Kim

Keywords: image-to-image translation, semantic image synthesis, image generation, superpixel, style encoder, graph self-attention

2:52

22/11/2021

MAGECally invert images for realistic editing

Asya Grechka, Jean Francois Goudou, Matthieu Cord

Keywords: gan inversion, gan, stylegan2, gan editing, image editing, gan projection, stylegan, semantic editing, latent space manipulation, latent editing

3:01

14/06/2020

RiFeGAN: Rich Feature Generation for Text-to-Image Synthesis From Prior Knowledge

Jun Cheng, Fuxiang Wu, Yanling Tian, Lei Wang, Dapeng Tao

Keywords: image synthesis, self-attentional embedding mixture, multi-captions, limited information, caption matching

1:01

16/11/2020

Compressive Summarization with Plausibility and Salience Modeling

Shrey Desai, Jiacheng Xu, Greg Durrett

Keywords: compressive systems, compressions, rouge, pre-trained model

12:04

02/02/2021

Self-Supervised Sketch-to-Image Synthesis

Bingchen Liu, Yizhe Zhu, Kunpeng Song, Ahmed Elgammal

14:42

30/11/2020

Horizontal Flipping Assisted Disentangled Feature Learning for Semi-Supervised Person Re-Identification

Gehan Hao, Yang Yang, Xue Zhou, Guanan Wang, Zhen Lei

5:09

19/08/2021

Text-based Person Search via Multi-Granularity Embedding Learning

Chengji Wang, Zhiming Luo, Yaojin Lin, Shaozi Li

Keywords: Computer Vision, Language and Vision, Recognition

12:25

06/12/2021

Implicit Semantic Response Alignment for Partial Domain Adaptation

Wenxiao Xiao, Zhengming Ding, Hongfu Liu

Keywords: domain adaptation, transfer learning

11:43

02/02/2021

FaceController: Controllable Attribute Editing for Face in the Wild

Zhiliang Xu, Xiyu Yu, Zhibin Hong, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding, Xiang Bai

14:26

22/11/2021

Each Attribute Matters: Contrastive Attention for Sentence-based Image Editing

Liuqing Zhao, Fan Lyu, Fuyuan Hu, Kaizhu Huang, Fenglei Xu, Linyan Li

Keywords: Image manipulation, Generative adversarial network

3:10

14/06/2020

StructEdit: Learning Structural Shape Variations

Kaichun Mo, Paul Guerrero, Li Yi, Hao Su, Peter Wonka, Niloy J. Mitra, Leonidas J. Guibas

Keywords: 3d vision, 3d graphics, shape editing, generative modeling, shape analysis, edit transfer, shape parts, shape structure, conditional generative model, variational auto-encoder

1:01

30/11/2020

MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network

Yi Wei, Zhe Gan, Wenbo Li, Siwei Lyu, Ming-Ching Chang, Lei Zhang, Jianfeng Gao, Pengchuan Zhang

7:42

22/11/2021

Reference Guided Image Inpainting using Facial Attributes

Dongsik Yoon, Youngsaeng Jin, Jeong-gi Kwak, Yuanming Li, David K Han, Hanseok Ko

Keywords: image inpainting, GAN, image completion, image manipulation

2:43

14/06/2020

Nested Scale-Editing for Conditional Image Synthesis

Lingzhi Zhang, Jiancong Wang, Yinshuang Xu, Jie Min, Tarmily Wen, James C. Gee, Jianbo Shi

Keywords: scale editing, identity recovery, image synthesis, super-resolution, image outpainting, text2image, cross-modal translation

1:01

18/07/2021

Markpainting: Adversarial Machine Learning meets Inpainting

David G Khachaturov, Ilia Shumailov, Yiren Zhao, Nicolas Papernot, Ross Anderson

Keywords: Social Aspects of Machine Learning, Privacy, Anonymity, and Security

4:28

14/06/2020

Fashion Editing With Adversarial Parsing Learning

Haoye Dong, Xiaodan Liang, Yixuan Zhang, Xujie Zhang, Xiaohui Shen, Zhenyu Xie, Bowen Wu, Jian Yin

Keywords: fashion editing, image generation, image synthesis, gan, generative adversarial network, image manipulation, human parsing, segmentation, image editing, virtual try-on

1:00

03/05/2021

Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis

Zhipeng Bao, Yu-Xiong Wang, Martial Hebert

Keywords: adversarial training, computer vision, object recognition, few-shot learning, generative models

5:11

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R Devon Hjelm, Shikhar Sharma

17:44

14/06/2020

Bringing Old Photos Back to Life

Ziyu Wan, Bo Zhang, Dongdong Chen, Pan Zhang, Dong Chen, Jing Liao, Fang Wen

Keywords: image restoration, low-level vision, image translation

4:41

14/06/2020

Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation

Zhonghao Wang, Mo Yu, Yunchao Wei, Rogerio Feris, Jinjun Xiong, Wen-mei Hwu, Thomas S. Huang, Honghui Shi

Keywords: semantic segmentation, domain adaptation, unsupervised learning, stuff matching, instance matching

1:00

19/08/2021

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Yuxuan Han, Jiaolong Yang, Ying Fu

Keywords: Computer Vision, 2D and 3D Computer Vision, Explainable/Interpretable Machine Learning

12:51

05/01/2021

Style Transfer by Rigid Alignment in Neural Net Feature Space

Suryabhan Singh Hada, Miguel A. Carreira-Perpinan

4:34

22/11/2021

An Adaptive Rectification Model for Arbitrary-Shaped Scene Text Recognition

Ye Qian, Long Chen, Feng Su

Keywords: scene text recognition, rectification, projective transformation

2:35

26/04/2020

Masked Based Unsupervised Content Transfer

Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano

4:38

14/06/2020

UCTGAN: Diverse Image Inpainting Based on Unsupervised Cross-Space Translation

Lei Zhao, Qihang Mo, Sihuan Lin, Zhizhong Wang, Zhiwen Zuo, Haibo Chen, Wei Xing, Dongming Lu

Keywords: image inpainting, diverse image inpainting, image completion, unsupervised cross-space translation, diverse image generation, deep-learning based inpainting, deep learning, multiple-solution inpainting

1:01

12/07/2020

Educating Text Autoencoders: Latent Representation Guidance via Denoising

Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

Keywords: Deep Learning - Generative Models and Autoencoders

17:06

19/08/2021

Context-Aware Image Inpainting with Learned Semantic Priors

Wendong Zhang, Junwei Zhu, Ying Tai, Yunbo Wang, Wenqing Chu, Bingbing Ni, Chengjie Wang, Xiaokang Yang

Keywords: Computer Vision, 2D and 3D Computer Vision, Deep Learning

13:26

02/02/2021

Learning Intact Features by Erasing-Inpainting for Few-shot Classification

Junjie Li, Zilei Wang, Xiaoming Hu

15:15

30/11/2020

Image Captioning through Image Transformer

Sen He, Wentong Liao, Hamed R. Tavakoli, Michael Yang, Bodo Rosenhahn, Nicolas Pugeault

9:49

14/06/2020

Cross-Domain Correspondence Learning for Exemplar-Based Image Translation

Pan Zhang, Bo Zhang, Dong Chen, Lu Yuan, Fang Wen

Keywords: exemplar based image translation, correspondence, gan, weak supervision learning

5:01

14/06/2020

Controllable Person Image Synthesis With Attribute-Decomposed GAN

Yifang Men, Yiming Mao, Yuning Jiang, Wei-Ying Ma, Zhouhui Lian

Keywords: image synthesis, pose transfer, generative adversarial networks, image editing, attribute separation, feature disentanglement, fashion ai

4:56

30/11/2020

Scale-Aware Polar Representation for Arbitrarily-Shaped Text Detection

Yanguang Bi, Zhiqiang Hu

9:56

22/11/2021

Contour-guided Image Completion with Perceptual Grouping

Morteza Rezanejad, Sidharth Gupta, Chandra Gummaluru, Ryan Marten, John Wilder, Michael Gruninger, Dirk B. Walther

Keywords: Image Completion, Inpainting, Perceptual Grouping, Stochastic Completion Fields, Contour Completion, Good Continuation, Perceptual Organisation

3:02