Abstract:
It is hard for previous cross-view image translation methods, which directly adopt a simple encoder-decoder or U-Net structure, to generate high-quality images at the target view, especially when the views differ drastically or the deformation is severe. To ease this problem, we propose a novel two-stage framework with a new Cascaded Cross MLP-Mixer (CrossMLP) sub-network in the first stage and a refined pixel-level loss in the second stage. In the first stage, the CrossMLP sub-network learns the latent transformation cues between the image code and the semantic map code via our novel cross MLP-Mixer blocks. The coarse results are then generated progressively under the guidance of those cues. Moreover, in the second stage, we design a refined pixel-level loss that eases the noisy semantic label problem in the cross-view translation task in a much simpler fashion for better network optimization. Extensive experimental results on the Dayton~\cite{vo2016localizing} and CVUSA~\cite{workman2015wide} datasets show that our method generates significantly better results than state-of-the-art methods. The source code, data, and trained models will be made available.