Faces a la Carte: Text-to-Face Generation via Attribute Disentanglement

Abstract: Text-to-Face (TTF) synthesis is a challenging task with great potential for diverse computer vision applications. Compared to Text-to-Image (TTI) synthesis tasks, the textual description of faces can be much more complicated and detailed due to the variety of facial attributes and the parsing of high dimensional abstract natural language. In this paper, we propose a Text-to-Face model that not only produces images in high resolution (1024*1024) with text-to-image consistency, but also outputs multiple diverse faces to cover a wide range of unspecified facial features in a natural way. By fine-tuning the multi-label classifier and image encoder, our model obtains the adjustment vectors and image embeddings which are used to transform the input noise vector sampled from the normal distribution. Afterwards, the transformed noise vector is fed into a pre-trained high-resolution image generator to produce a set of faces with the desired facial attributes. We refer to our model as TTF-HD. Experimental results show that TTF-HD generates high-quality synthesised faces from free-

14/06/2020

radial distortion correction, face recognition, spatial transformer network, cascaded network, fisheye camera, wide-angle camera

1:00

26/04/2020

deblurring, face, multi-view, video, blur, GAN, novel view synthesis, inversion, deep learning, dataset

2:50

22/11/2021

3d flow, dense 3d facial motion capture, optical flow, scene flow, 3d reconstruction and tracking, in-the-wild monocular tracking, facial reenactment, expression recognition, performance capture, non-rigid facial deformations

1:01

14/06/2020

image synthesis, pose transfer, generative adversarial networks, image editing, attribute separation, feature disentanglement, fashion ai

4:56

30/11/2020

3d face dataset, face prediction, riggable model, 3d morphable model, dynamic details, deep neural network, displacement map

1:00

14/06/2020

generative models, bijective metric learning, blackbox face matcher, distillation framework, face synthesis, id preservation, feature-conditional structure, feature reconstruction, dibigan.

5:03

14/06/2020

single image view synthesis, view synthesis, differentiable rendering, point cloud, convolutional neural networks, generative networks

4:58

06/12/2021

3d face reconstruction, soft rasterization, differentiable rendering, free-form deformation, 3d morphable model, face parsing

5:01

05/01/2021

deep learning, visual inspection, unsupervised anomaly detection, anomaly localization, autoencoder, variational autoencoder, gradient descent, inpainting

4:19

22/11/2021

face reenactment, triplet perceptual loss, generative adversarial network, facial expression transformation, disentanglement of appearance and geometry information

1:01

14/06/2020