14/06/2020

Learning to Have an Ear for Face Super-Resolution

Givi Meishvili, Simon Jenni, Paolo Favaro

Keywords: super-resolution, audio, sound, face, multi-modal, inverting, gan, generative, adversarial, autoencoder

Abstract: We propose a novel method that uses both audio and a low-resolution image to perform extreme face super-resolution (a 16x increase of the input size). When the resolution of the input image is very low (e.g., 8x8 pixels), so much information is lost that important details of the original identity cannot be recovered from the image alone; audio, which carries information about facial attributes such as gender and age, can aid the recovery of a plausible high-resolution image. To combine the aural and visual modalities, we first build latent representations of a face from the audio track alone and from the low-resolution image alone, and then train a network to fuse these two representations. We show experimentally that audio can assist in recovering attributes such as gender, age, and identity, and thus improves the accuracy of the high-resolution reconstruction. Our procedure does not require human annotation and can therefore be trained directly on existing video datasets. Moreover, we show that our model learns a factorized representation of images and audio: it allows one to mix low-resolution images and audio from different videos and to generate realistic faces with semantically meaningful combinations.
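
To make the encode-fuse-decode idea in the abstract concrete, below is a minimal PyTorch sketch: one encoder maps the audio (here assumed to be a spectrogram) to a latent code, another maps the 8x8 low-resolution face to a latent code, a small fusion network combines the two, and a decoder outputs a 128x128 face (a 16x increase). All module names, layer sizes, and the concatenation-based fusion are illustrative assumptions, not the authors' architecture or training setup.

```python
# Minimal sketch of the audio-visual fusion idea from the abstract.
# Module names, sizes, and the fusion strategy are illustrative assumptions.
import torch
import torch.nn as nn


class AudioEncoder(nn.Module):
    """Maps a spectrogram (B, 1, F, T) to a latent vector."""
    def __init__(self, latent_dim=256):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(128, latent_dim)

    def forward(self, spec):
        return self.fc(self.conv(spec).flatten(1))


class LowResEncoder(nn.Module):
    """Maps an 8x8 RGB face (B, 3, 8, 8) to a latent vector."""
    def __init__(self, latent_dim=256):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.ReLU(),    # 8 -> 4
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),  # 4 -> 2
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(128, latent_dim)

    def forward(self, img_lr):
        return self.fc(self.conv(img_lr).flatten(1))


class Fusion(nn.Module):
    """Fuses the audio and image latent codes into a single code."""
    def __init__(self, latent_dim=256):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(2 * latent_dim, latent_dim), nn.ReLU(),
            nn.Linear(latent_dim, latent_dim),
        )

    def forward(self, z_audio, z_image):
        return self.mlp(torch.cat([z_audio, z_image], dim=1))


class Decoder(nn.Module):
    """Decodes the fused code to a 128x128 face (16x the 8x8 input)."""
    def __init__(self, latent_dim=256):
        super().__init__()
        self.fc = nn.Linear(latent_dim, 256 * 4 * 4)
        blocks, ch = [], 256
        for _ in range(5):  # 4 -> 8 -> 16 -> 32 -> 64 -> 128
            out_ch = max(ch // 2, 32)
            blocks += [nn.Upsample(scale_factor=2),
                       nn.Conv2d(ch, out_ch, 3, padding=1), nn.ReLU()]
            ch = out_ch
        blocks += [nn.Conv2d(ch, 3, 3, padding=1), nn.Tanh()]
        self.net = nn.Sequential(*blocks)

    def forward(self, z):
        return self.net(self.fc(z).view(-1, 256, 4, 4))


if __name__ == "__main__":
    audio_enc, image_enc = AudioEncoder(), LowResEncoder()
    fusion, decoder = Fusion(), Decoder()
    spec = torch.randn(2, 1, 64, 100)   # dummy spectrogram batch
    img_lr = torch.randn(2, 3, 8, 8)    # dummy 8x8 low-res faces
    z = fusion(audio_enc(spec), image_enc(img_lr))
    print(decoder(z).shape)             # torch.Size([2, 3, 128, 128])
```

The sketch only shows how the two modalities can be mapped into a shared latent space and combined; judging from the keywords (inverting, GAN), the full method additionally relies on generative adversarial training rather than a simple feed-forward decoder.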

The talk and the paper were presented at the CVPR 2020 virtual conference.

