SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation Synergized With Semantic Segmentation for Autonomous Driving

05/01/2021

SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation Synergized With Semantic Segmentation for Autonomous Driving

Varun Ravi Kumar, Marvin Klingner, Senthil Yogamani, Stefan Milz, Tim Fingscheidt, Patrick Mader

Keywords:

Abstract Paper Similar Papers

Abstract: State-of-the-art self-supervised learning approaches for monocular depth estimation usually suffer from scale ambiguity. They do not generalize well when applied on distance estimation for complex projection models such as in fisheye and omnidirectional cameras. This paper introduces a novel multi-task learning strategy to improve self-supervised monocular distance estimation on fisheye and pinhole camera images. Our contribution to this work is threefold: Firstly, we introduce a novel distance estimation network architecture using a self-attention based encoder coupled with robust semantic feature guidance to the decoder that can be trained in a one-stage fashion. Secondly, we integrate a generalized robust loss function, which improves performance significantly while removing the need for hyperparameter tuning with the reprojection loss. Finally, we reduce the artifacts caused by dynamic objects violating static world assumptions using a semantic masking strategy. We significantly improve upon the RMSE of previous work on fisheye by a 25% reduction in RMSE. As there is little work on fisheye cameras, we evaluated the proposed method on KITTI using a pinhole model. We achieved state-of-the-art performance among self-supervised methods without requiring an external scale estimation.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at WACV 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

07/09/2020

Non-Probabilistic Cosine Similarity Loss for Few-Shot Image Classification

Joonhyuk Kim, Inug Yoon, Gyeong-Moon Park, Jong-Hwan Kim

Keywords Paper

few-shot learning, image classification, NPC loss

0

0

0

0

4:59

22/11/2021

Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting

Vishnu Sanjay Ramiya Srinivasan, Rui Ma, Qiang Tang and
Zili Yi, Zhan Xu

Keywords Paper

high resolution video inpainting, spatial-temporal aggregation, residual aggregation, spatial-temporal attention, image alignment

0

0

0

0

2:58

17/08/2020

Learning temporal coherence via self-supervision for GAN-based video generation

Mengyu Chu, You Xie, Jonas Mayer and
Laura Leal-Taixé, Nils Thuerey

Keywords Paper

self-supervision, temporal cycle-consistency, video super-resolution, generative adversarial network, unpaired video translation

0

0

0

0

16:59

14/06/2020

ViewAL: Active Learning With Viewpoint Entropy for Semantic Segmentation

Yawar Siddiqui, Julien Valentin, Matthias Nießner

Keywords Paper

active learning, semantic segmentation, deep learning, view consistency

0

0

0

0

1:01

26/04/2020

Efficient and Information-Preserving Future Frame Prediction and Beyond

Wei Yu, Yichao Lu, Steve Easterbrook, Sanja Fidler

Keywords Paper

self-supervised learning, generative pre-training, video prediction, reversible architecture

0

0

0

0

4:18

06/12/2020

Provably Robust Metric Learning

Lu Wang, Xuanqing Liu, Jinfeng Yi and
Yuan Jiang, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

3:14

06/12/2021

Spectrum-to-Kernel Translation for Accurate Blind Image Super-Resolution

Guangpin Tao, Xiaozhong Ji, Wenzhuo Wang and
Shuo Chen, Chuming Lin, Yun Cao, Tong Lu, Donghao Luo, Ying Tai

Keywords Paper

deep learning, optimization, vision, generative model

0

0

0

0

12:00

14/06/2020

Erasing Integrated Learning: A Simple Yet Effective Approach for Weakly Supervised Object Localization

Jinjie Mai, Meng Yang, Wenfeng Luo

Keywords Paper

weakly supervised, object localization, adversarial erasing

0

0

0

0

5:00

26/04/2020

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning

Kimin Lee, Kibok Lee, Jinwoo Shin, Honglak Lee

Keywords Paper

Deep reinforcement learning, Generalization in visual domains

0

0

0

0

5:03

14/06/2020

Multi-Scale Interactive Network for Salient Object Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Keywords Paper

saliency detection, salient object detection, feature interaction strategy, scale-insensitive loss, multi-scale features, multi-level features, fully convolutional network, deep learning

0

0

0

0

1:01

14/06/2020

SmallBigNet: Integrating Core and Contextual Views for Video Classification

Xianhang Li, Yali Wang, Zhipeng Zhou, Yu Qiao

Keywords Paper

video classification, action recognition, temporal convolution, 3d maxpooling, shared convolution

0

0

0

0

1:00

07/09/2020

Unsupervised Monocular Depth Estimation with Multi-Baseline Stereo

Saad Imran, Muhammad Umar Karim Khan, Sikander Mukaram, Chong-Min Kyung

Keywords Paper

Unsupervised Monocular Depth, Small-Baseline, Wide-Baseline, Multi-Baseline, Stereo

0

0

0

0

4:32

05/01/2021

LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval

Reuben Tan, Huijuan Xu, Kate Saenko, Bryan A. Plummer

Keywords Paper

0

0

0

0

5:21

03/05/2021

Deconstructing the Regularization of BatchNorm

Yann Dauphin, Ekin Cubuk

Keywords Paper

understanding neural networks, batch normalization, regularization, deep learning

0

0

0

0

5:09

06/12/2020

Make One-Shot Video Object Segmentation Efficient Again

Tim Meinhardt, Laura Leal-Taixé

Keywords Paper

0

0

0

0

3:17

22/11/2021

A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark

Zhenxi Zhu, Limin Wang, Sheng Guo, Gangshan Wu

Keywords Paper

few-shot learning, classifier-based baseline, new benchmark, action recognition

0

0

0

0

2:58

14/06/2020

Equalization Loss for Long-Tailed Object Recognition

Jingru Tan, Changbao Wang, Buyu Li and
Quanquan Li, Wanli Ouyang, Changqing Yin, Junjie Yan

Keywords Paper

long tail, object detection, lvis, object recognition

0

0

0

0

1:00

06/12/2020

One-bit Supervision for Image Classification

Hengtong Hu, Lingxi Xie, Zewei Du and
Richang Hong, Qi Tian

Keywords Paper

0

0

0

0

3:14

18/07/2021

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection

Nadine Chang, Zhiding Yu, Yu-Xiong Wang and
Anima Anandkumar, Sanja Fidler, Jose Alvarez

Keywords Paper

Applications, Computer Vision

0

0

0

0

5:17

14/06/2020

Fast-MVSNet: Sparse-to-Dense Multi-View Stereo With Learned Propagation and Gauss-Newton Refinement

Zehao Yu, Shenghua Gao

Keywords Paper

multi-view stereo, sparse-to-dense, gauss-newton optimization, propagation, coarse-to-fine

0

0

0

0

1:01

06/12/2020

Self-supervised Co-Training for Video Representation Learning

Tengda Han, Weidi Xie, Andrew Zisserman

Keywords Paper

0

0

0

0

3:08

07/09/2020

ViewSynth: Learning Local Features from Depth using View Synthesis

Jisan Mahmud, Rajat Vikram Singh, Peri Akiva and
Spondon Kundu, Kuan-Chuan Peng, Jan-Michael Frahm

Keywords Paper

viewpoint invariant representation learning, depth representation learning, view synthesis, correspondence learning

0

0

0

0

10:00

07/09/2020

Meta-RetinaNet for Few-shot Object Detection

Shaoqi Li, Wenfeng Song, Shuai Li and
Aimin Hao, Hong Qin

Keywords Paper

Few shot, object detection, meta-learning, Meta-RetinaNet, Balanced Loss, coefficient vector

0

0

0

0

8:51

06/12/2021

Consistency Regularization for Variational Auto-Encoders

Samarth Sinha, Adji Bousso Dieng

Keywords Paper

deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

0

0

0

0

10:52

06/12/2020

Semantic Visual Navigation by Watching YouTube Videos

Matthew Chang, Arjun Gupta, Saurabh Gupta

Keywords Paper

0

0

0

0

3:16

05/01/2021

Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition

Theo Ayral, Marco Pedersoli, Simon Bacon, Eric Granger

Keywords Paper

0

0

0

0

5:00

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Paper

0

0

0

0

19:54

07/09/2020

Boosting Image and Video Compression via Learning Latent Residual Patterns

Yen-Chung Chen, Keng-Jui Chang, Yi-Hsuan Tsai, Wei-Chen Chiu

Keywords Paper

compression artifacts, image compression, video compression, latent residual

0

0

0

0

7:48

06/12/2020

Soft Contrastive Learning for Visual Localization

Janine Thoma, Danda Pani Paudel, Luc V Gool

Keywords Paper

0

0

0

0

3:18

14/06/2020

When to Use Convolutional Neural Networks for Inverse Problems

Nathaniel Chodosh, Simon Lucey

Keywords Paper

optimization, sparse coding, inverse problems, trajectory reconstruction, artifact removal

0

0

0

0

1:02

06/12/2020

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

yaxing wang, Lu Yu, Joost van de Weijer

Keywords Paper

Algorithms -> Online Learning, Optimization -> Stochastic Optimization

0

0

0

0

3:23

22/11/2021

Temporal Meta-Adaptor for Video Object Detection

Chi Wang, Yang Hua, ZHENG LU and
Jian Gao, Neil Robertson

Keywords Paper

video object detection, temporal aggregation, meta-learning, ImageNet VID

0

0

0

0

6:58

14/06/2020

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang and
Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords Paper

3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

0

0

0

0

5:01

06/12/2021

ReSSL: Relational Self-Supervised Learning with Weak Augmentation

Mingkai Zheng, Shan You, Fei Wang and
Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu

Keywords Paper

self-supervised learning, contrastive learning

0

0

0

0

6:35

06/12/2021

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc V Gool

Keywords Paper

self-supervised learning, vision, contrastive learning, representation learning

0

0

0

0

13:32

02/02/2021

HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

Xiaoyang Lyu, Liang Liu, Mengmeng Wang and
Xin Kong, Lina Liu, Yong Liu, Xinxin Chen, Yi Yuan

Keywords Paper

0

0

0

0

12:10

06/12/2021

Why Do Better Loss Functions Lead to Less Transferable Features?

Simon Kornblith, Ting Chen, Honglak Lee, Mohammad Norouzi

Keywords Paper

deep learning, machine learning, vision, transfer learning

0

0

0

0

9:26

02/02/2021

Fast and Compact Bilinear Pooling by Shifted Random Maclaurin

Tan Yu, Xiaoyun Li, Ping Li

Keywords Paper

0

0

0

0

14:24

14/06/2020

Scale-Space Flow for End-to-End Optimized Video Compression

Eirikur Agustsson, David Minnen, Nick Johnston and
Johannes Ballé, Sung Jin Hwang, George Toderici

Keywords Paper

learned video compression, scale-space flow, bilinear warping

0

0

0

0

0:55

14/06/2020

Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection

Zhongzheng Ren, Zhiding Yu, Xiaodong Yang and
Ming-Yu Liu, Yong Jae Lee, Alexander G. Schwing, Jan Kautz

Keywords Paper

weakly-supervised, object detection, video recognition, instance-aware, context-focused, memory-efficient

0

0

0

0

0:59