Zero-Shot Text-to-Image Generation

18/07/2021

Zero-Shot Text-to-Image Generation

Aditya Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, Ilya Sutskever

Keywords: Deep Learning, Generative Models

Abstract Paper Similar Papers

Abstract: Text-to-image generation has traditionally focused on finding better modeling assumptions for training on a fixed dataset. These assumptions might involve complex architectures, auxiliary losses, or side information such as object part labels or segmentation masks supplied during training. We describe a simple approach for this task based on a transformer that autoregressively models the text and image tokens as a single stream of data. With sufficient data and scale, our approach is competitive with previous domain-specific models when evaluated in a zero-shot fashion.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition

Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang

Keywords Paper

data augmentation, text recognition, joint training

0

0

0

0

0:59

06/12/2021

Shape your Space: A Gaussian Mixture Regularization Approach to Deterministic Autoencoders

Amrutha Saseendran, Kathrin Skubch, Stefan Falkner, Margret Keuper

Keywords Paper

generative model

0

0

0

0

12:18

26/04/2020

Learned step size quantization

Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani and
Rathinakumar Appuswamy, Dharmendra S. Modha

Keywords Paper

deep learning, low precision, classification, quantization

0

0

0

0

4:40

06/12/2021

Encoding Robustness to Image Style via Adversarial Feature Perturbations

Manli Shu, Zuxuan Wu, Micah Goldblum, Tom Goldstein

Keywords Paper

deep learning, machine learning, robustness, adversarial robustness and security, domain adaptation

0

0

0

0

7:36

06/12/2020

Robust Quantization: One Model to Rule Them All

Moran Shkolnik, Brian Chmiel, Ron Banner and
Gil Shomron, Yury Nahshan, Alex Bronstein, Uri Weiser

Keywords Paper

0

0

0

0

3:13

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

06/12/2021

Grounding inductive biases in natural images: invariance stems from variations in data

Diane Bouchacourt, Mark Ibrahim, Ari Morcos

Keywords Paper

machine learning, transformers

0

0

0

0

14:19

14/06/2020

Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task

Aritra Bhowmik, Stefan Gumhold, Carsten Rother, Eric Brachmann

Keywords Paper

sparse features, reinforcement learning, key point detection, feature description, feature matching, relative pose estimation, ransac, essential matrix, sift, superpoint

0

0

0

0

5:01

16/11/2020

Unsupervised Metric Relocalization Using Transform Consistency Loss

Mike Kasper, Fernando Nobre, Christoffer Heckman, Nima Keivan

Keywords Paper

0

0

0

0

3:58

06/12/2021

Referring Transformer: A One-step Approach to Multi-task Visual Grounding

Muchen Li, Leonid Sigal

Keywords Paper

transformers, vision

0

0

0

0

7:54

05/01/2021

Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-Shot Learning

Shivam Chandhok, Vineeth N Balasubramanian

Keywords Paper

0

0

0

0

4:59

06/12/2021

Meta Internal Learning

Raphael Bensadoun, Shir Gur, Tomer Galanti, Lior Wolf

Keywords Paper

vision, generative model, meta learning

0

0

0

0

7:41

05/01/2021

ClassMix: Segmentation-Based Data Augmentation for Semi-Supervised Learning

Viktor Olsson, Wilhelm Tranheden, Juliano Pinto, Lennart Svensson

Keywords Paper

0

0

0

0

4:58

06/12/2021

Domain Invariant Representation Learning with Domain Density Transformations

A. Tuan Nguyen, Toan Tran, Yarin Gal, Atilim Gunes Baydin

Keywords Paper

generative model, domain adaptation, representation learning

0

0

0

0

7:10

16/11/2020

Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models

Thuy-Trang Vu, Dinh Phung, Gholamreza Haffari

Keywords Paper

combinatorial problem, unsupervised tasks, named recognition, broad-coverage models

0

0

0

0

11:57

18/07/2021

Learning a Universal Template for Few-shot Dataset Generalization

Eleni Triantafillou, Hugo Larochelle, Richard Zemel, Vincent Dumoulin

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:27

22/11/2021

Mode-Guided Feature Augmentation for Domain Generalization

Muhammad Haris Khan, Syed Muhammad talha Zaidi, Salman Khan, Fahad Shahbaz Khan

Keywords Paper

out-of-domain robustness, domain generalization, domain adaptation, convolutional neural networks, data augmentation, feature augmentation, subspace similarity, covariate shift, in-domain generalization, robust objective function

0

0

0

0

2:56

30/11/2020

MTNAS: Search Multi-Task Networks for Autonomous Driving

Hao Liu, Dong Li, JinZhang Peng and
Qingjie Zhao, Lu Tian, Yi Shan

Keywords Paper

0

0

0

0

9:06

07/09/2020

Zero-Shot Domain Generalization

Udit Maniyar, Joseph K J, Aniket Anand Deshmukh and
Urun Dogan, Vineeth N Balasubramanian

Keywords Paper

Domain Generalization, zero-shot learning, semantic space, multi task learning, Learning with limited data, representation learning, classification

0

0

0

0

9:59

04/07/2020

A Generate-and-Rank Framework with Semantic Type Regularization for Biomedical Concept Normalization

Dongfang Xu, Zeyu Zhang, Steven Bethard

Keywords Paper

Biomedical Normalization, Concept normalization, Generate-and-Rank Framework, Semantic Regularization

0

0

0

0

11:36

06/12/2021

Scalable Rule-Based Representation Learning for Interpretable Classification

Zhuo Wang, Wei Zhang, Ning Liu, Jianyong Wang

Keywords Paper

optimization, machine learning, representation learning, interpretability

0

0

0

0

14:52

14/06/2020

Cross-Domain Face Presentation Attack Detection via Multi-Domain Disentangled Representation Learning

Guoqing Wang, Hu Han, Shiguang Shan, Xilin Chen

Keywords Paper

face presentation attack detection, face anti-spoofing, cross-domain, disentangled representation learning, multi-domain learning.

0

0

0

0

1:01

02/02/2021

Unsupervised Model Adaptation for Continual Semantic Segmentation

Serban Stan, Mohammad Rostami

Keywords Paper

0

0

0

0

15:56

14/06/2020

Learning Fast and Robust Target Models for Video Object Segmentation

Andreas Robinson, Felix Järemo Lawin, Martin Danelljan and
Fahad Shahbaz Khan, Michael Felsberg

Keywords Paper

video object segmentation, semi-supervised

0

0

0

0

4:57

14/06/2020

Towards Learning Structure via Consensus for Face Segmentation and Parsing

Iacopo Masi, Joe Mathai, Wael AbdAlmageed

Keywords Paper

face segmentation, occlusion detection, face, face parsing, semantic segmentation, smoothness, structure, pixel-wise loss, encoder-decoder

0

0

0

0

1:01

07/09/2020

BriNet: Towards Bridging the Intra-class and Inter-class Gaps in One-Shot Segmentation

Xianghui Yang, Bairun Wang, Xinchi Zhou and
Kaige Chen, Shuai Yi, Wanli Ouyang, Luping Zhou

Keywords Paper

Few-shot Semantic Segmentation, Few-shot learning, Semantic Segmentation

0

0

0

0

8:26

22/11/2021

Prototype-based Incremental Few-Shot Segmentation

Fabio Cermelli, Massimiliano Mancini, Yongqin Xian and
Zeynep Akata, Barbara Caputo

Keywords Paper

segmentation, incremental learning, continual learning, few shot learning, any shot learning, prototype, knowledge distillation

0

0

0

0

2:56

06/12/2021

Object DGCNN: 3D Object Detection using Dynamic Graphs

Yue Wang, Justin Solomon

Keywords Paper

vision, graph learning

0

0

0

0

10:03

12/07/2020

Learning Similarity Metrics for Numerical Simulations

Georg Kohl, Kiwon Um, Nils Thuerey

Keywords Paper

General Machine Learning Techniques

0

0

0

0

15:16

30/11/2020

Learning Local Feature Descriptors for Multiple Object Tracking

Dmytro Borysenko, Dmytro Mykheievskyi, Viktor Porokhonskyy

Keywords Paper

0

0

0

0

9:42

04/07/2020

A Transformer-based Approach for Source Code Summarization

Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

Keywords Paper

Source Summarization, summarization, ablation studies, Transformer-based Approach

0

0

0

0

6:14

07/09/2020

Adversarial Concurrent Training: Optimizing Robustness and Accuracy Trade-off of Deep Neural Networks

Elahe Arani, Fahad Sarfraz, Bahram Zonooz

Keywords Paper

Adversarial Robustness, Generalization, Adversarial Training, Deep Learning, Collaborative Learning

0

0

0

0

3:39

25/07/2020

Domain-adaptive neural automated essay scoring

Yue Cao, Hanqi Jin, Xiaojun Wan, Zhiwei Yu

Keywords Paper

domain adaptation, natural language processing, automated essay scoring, self-supervised learning

0

0

0

0

13:00

04/07/2020

Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks

Yiping Song, Zequn Liu, Wei Bi and
Rui Yan, Ming Zhang

Keywords Paper

Few-shot Tasks, open-domain systems, generative models, meta-learning framework

0

0

0

0

11:43

06/12/2021

$(\textrm{Implicit})^2$: Implicit Layers for Implicit Representations

Zhichun Huang, Shaojie Bai, J. Zico Kolter

Keywords Paper

deep learning, representation learning

1

0

0

1

12:23

06/12/2021

Adversarial Reweighting for Partial Domain Adaptation

Xiang Gu, Xi Yu, yan yang and
Jian Sun, Zongben Xu

Keywords Paper

domain adaptation

0

0

0

1

9:03

06/12/2021

Adaptive Risk Minimization: Learning to Adapt to Domain Shift

Marvin Zhang, Henrik Marklund, Nikita Dhawan and
Abhishek Gupta, Sergey Levine, Chelsea Finn

Keywords Paper

machine learning, robustness, vision, domain adaptation

0

0

0

0

9:30

05/01/2021

Towards Contextual Learning in Few-Shot Object Classification

Mathieu Page Fortin, Brahim Chaib-draa

Keywords Paper

0

0

0

0

4:57

14/06/2020

MaskGAN: Towards Diverse and Interactive Facial Image Manipulation

Cheng-Han Lee, Ziwei Liu, Lingyun Wu, Ping Luo

Keywords Paper

facial image manipulation, face segmentation, image synthesis, generative adversarial network

0

0

0

0

1:00

06/12/2020

Network-to-Network Translation with Conditional Invertible Neural Networks

Robin Rombach, Patrick Esser, Bjorn Ommer

Keywords Paper

0

0

0

0

3:25