Where Are You? Localization from Embodied Dialog

16/11/2020

Where Are You? Localization from Embodied Dialog

Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James Rehg, Stefan Lee, Peter Anderson

Keywords: cooperative task, embodied dialog, cooperative localization, locator

Abstract Paper Similar Papers

Abstract: We present WHERE ARE YOU? (WAY), a dataset of ~6k dialogs in which two humans -- an Observer and a Locator -- complete a cooperative localization task. The Observer is spawned at random in a 3D environment and can navigate from first-person views while answering questions from the Locator. The Locator must localize the Observer in a detailed top-down map by asking questions and giving instructions. Based on this dataset, we define three challenging tasks: Localization from Embodied Dialog or LED (localizing the Observer from dialog history), Embodied Visual Dialog (modeling the Observer), and Cooperative Localization (modeling both agents). In this paper, we focus on the LED task -- providing a strong baseline model with detailed ablations characterizing both dataset biases and the importance of various modeling choices. Our best model achieves 32.7% success at identifying the Observer′s location within 3m in unseen buildings, vs. 70.4% for human Locators.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Learning Saliency Propagation for Semi-Supervised Instance Segmentation

Yanzhao Zhou, Xin Wang, Jianbin Jiao and
Trevor Darrell, Fisher Yu

Keywords Paper

semi-supervised, instance segmentation, saliency, propagation, message passing, multiple instance learning, partial-supervised, generalization

0

0

0

0

1:01

14/06/2020

Density-Based Clustering for 3D Object Detection in Point Clouds

Syeda Mariam Ahmed, Chee Meng Chew

Keywords Paper

3d object detection, edge-aware pointnet, instance segmentation, unsupervised clustering, cascaded modules, semantic segmentation, amodal bounding box detection

0

0

0

0

0:51

06/12/2021

Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision

Keji He, Yan Huang, Qi Wu and
Jianhua Yang, Dong An, Shuanglin Sima, Liang Wang

Keywords Paper

0

0

0

0

14:11

14/06/2020

Recognizing Objects From Any View With Object and Viewer-Centered Representations

Sainan Liu, Vincent Nguyen, Isaac Rehg, Zhuowen Tu

Keywords Paper

object-centered viewer-centered recognition classification

0

0

0

0

1:01

16/11/2020

Self-Supervised Learning of Scene-Graph Representations for Robotic Sequential Manipulation Planning

Son Nguyen, Ozgur Oguz Uni. of Stuttgart &, Max Planck Inst. for Intelligent Systems and
Valentin Hartmann, Marc Toussaint

Keywords Paper

0

0

0

0

5:01

30/11/2020

Adaptive Spotting: Deep Reinforcement Object Search in 3D Point Clouds

Onkar Krishna, Go Irie, Xiaomeng Wu and
Takahito Kawanishi, Kunio Kashino

Keywords Paper

0

0

0

0

6:58

14/06/2020

Learning Human-Object Interaction Detection Using Interaction Points

Tiancai Wang, Tong Yang, Martin Danelljan and
Fahad Shahbaz Khan, Xiangyu Zhang, Jian Sun

Keywords Paper

human-object interaction, interaction point, interaction grouping, keypoint detection

0

0

0

0

0:58

16/11/2020

Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous

Rose Wang, J. Chase Kew, Dennis Lee and
Tsang-Wei Lee, Tingnan Zhang, Brian Ichter, Jie Tan, Aleksandra Faust

Keywords Paper

0

0

0

0

4:29

14/06/2020

SPARE3D: A Dataset for SPAtial REasoning on Three-View Line Drawings

Wenyu Han, Siyuan Xiang, Chenhui Liu and
Ruoyu Wang, Chen Feng

Keywords Paper

reasoning, line-drawings, dataset

0

0

0

0

1:01

14/06/2020

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Li Jiang, Hengshuang Zhao, Shaoshuai Shi and
Shu Liu, Chi-Wing Fu, Jiaya Jia

Keywords Paper

instance segmentation, point cloud, 3d, scene understanding, indoor scenes, bottom-up, grouping, dual-set, scannet, s3dis

0

0

0

0

5:01

06/12/2021

Searching Parameterized AP Loss for Object Detection

Tao Chenxin, Zizhang Li, Xizhou Zhu and
Gao Huang, Yong Liu, jifeng dai

Keywords Paper

machine learning, vision

0

0

0

0

6:13

14/06/2020

GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping

Hao-Shu Fang, Chenxi Wang, Minghao Gou, Cewu Lu

Keywords Paper

robotics, grasping, 6d pose, grasp pose, manipulation, dataset, pick and place, bin picking

0

0

0

0

1:01

06/12/2021

REMIPS: Physically Consistent 3D Reconstruction of Multiple Interacting People under Weak Supervision

Mihai Fieraru, Mihai Zanfir, Teodor Szente and
Eduard Bazavan, Vlad Olaru, Cristian Sminchisescu

Keywords Paper

transformers

0

0

0

0

7:26

12/07/2020

MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time

XICHUAN ZHOU, YiCong Peng, Chunqiao Long and
Fengbo Ren, Cong Shi

Keywords Paper

Applications - Computer Vision

0

0

0

0

11:57

02/02/2021

PHASE: PHysically-grounded Abstract Social Events for Machine Social Perception

Aviv Netanyahu, Tianmin Shu, Boris Katz and
Andrei Barbu, Joshua B. Tenenbaum

Keywords Paper

0

0

0

0

14:05

06/12/2020

Multi-Plane Program Induction with 3D Box Priors

Yikai Li, Jiayuan Mao, Xiuming Zhang and
Bill Freeman, Josh Tenenbaum, Noah Snavely, Jiajun Wu

Keywords Paper

0

0

0

0

3:24

03/05/2021

Learning to Set Waypoints for Audio-Visual Navigation

Changan Chen, Sagnik Majumder, Ziad Al-Halah and
Ruohan Gao, Santhosh Kumar Ramakrishnan, Kristen Grauman

Keywords Paper

visual navigation, audio visual learning, embodied vision

0

0

0

0

5:04

02/02/2021

Semantic Consistency Networks for 3D Object Detection

Wenwen Wei, Ping Wei, Nanning Zheng

Keywords Paper

0

0

0

0

14:06

14/06/2020

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

Haozhe Qi, Chen Feng, Zhiguo Cao and
Feng Zhao, Yang Xiao

Keywords Paper

3d object tracking, end-to-end, point cloud, feature augmentation, target proposal, siamese network

0

0

0

0

5:01

22/11/2021

Point3D: tracking actions as moving points with 3D CNNs

Shentong Mo, Jingfei Xia, Xiaoqing Tan, Bhiksha Raj

Keywords Paper

Spatio-temporal action detection

0

0

0

0

3:02

05/12/2020

Point-of-interest oriented question answering with joint inference of semantic matching and distance correlation

Yifei Yuan, Jingbo Zhou, Wai Lam

Keywords Paper

0

0

0

0

13:14

25/07/2020

Immersive search: Using virtual reality to examine how a third dimension impacts the searching process

Austin R. Ward, Rob Capra

Keywords Paper

three-dimensional search, immersive search, virtual reality

0

0

0

0

9:58

03/05/2021

VTNet: Visual Transformer Network for Object Goal Navigation

Heming Du, Xin Yu, Liang Zheng

Keywords Paper

0

0

0

0

4:12

14/06/2020

G2L-Net: Global to Local Network for Real-Time 6D Pose Estimation With Embedding Vector Features

Wei Chen, Xi Jia, Hyung Jin Chang and
Jinming Duan, Aleš Leonardis

Keywords Paper

object pose estimation, point cloud, embedding vector features, real time

0

0

0

0

1:01

02/02/2021

Asking the Right Questions: Learning Interpretable Action Models Through Query Answering

Pulkit Verma, Shashank Rao Marpally, Siddharth Srivastava

Keywords Paper

0

0

0

0

18:48

05/01/2021

On the Generalization of Learning-Based 3D Reconstruction

Miguel Angel Bautista, Walter Talbott, Shuangfei Zhai and
Nitish Srivastava, Joshua M. Susskind

Keywords Paper

0

0

0

0

4:58

06/12/2021

SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video Decomposition

Rishabh Kabra, Daniel Zoran, Goker Erdogan and
Loic Matthey, Antonia Creswell, Matt Botvinick, Alexander Lerchner, Chris Burgess

Keywords Paper

self-supervised learning

0

0

0

0

14:42

14/06/2020

Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

Peiliang Li, Jieqi Shi, Shaojie Shen

Keywords Paper

3d object tracking, stereo cameras, autonomous driving

0

0

0

0

1:01

02/02/2021

Embodied Visual Active Learning for Semantic Segmentation

David Nilsson, Aleksis Pirinen, Erik Gärtner, Cristian Sminchisescu

Keywords Paper

0

0

0

0

18:49

14/06/2020

DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover’s Distance and Structured Classifiers

Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen

Keywords Paper

few-shot classification, meta learning, few-shot learning, classification, metric learning, convex, optimization, image retrieval

0

0

0

0

5:00

14/06/2020

FroDO: From Detections to 3D Objects

Martin Rünz, Kejie Li, Meng Tang and
Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

Keywords Paper

reconstruction, shape embedding, 3d vision, object detection, shape prior, object representation, monocular, sdf, pointcloud, inference

0

0

0

0

1:01

22/11/2021

Grid Cell Path Integration For Movement-Based Visual Object Recognition

Niels Leadholm, Marcus Lewis, Subutai Ahmad

Keywords Paper

biologically plausible, translation invariance, robustness, sequential vision, transsaccadic vision, grid cells, path integration, continual learning, predictive representations, Hebbian learning

0

0

0

0

11:21

22/11/2021

Attention to Action: Leveraging Attention for Object Navigation

Shi Chen, Qi Zhao

Keywords Paper

Object-goal Navigation, Attention, Visual Navigation

0

0

0

0

2:51

07/09/2020

Mixup-CAM: Weakly-supervised Semantic Segmentation via Uncertainty Regularization

Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung and
Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang

Keywords Paper

semantic segmentation, weakly-supervised learning, class activatin map, mixup augmentation, entropy regularization

0

0

0

0

8:22

26/10/2020

Adaptive Informative Path Planning with Multimodal Sensing

Shushman Choudhury, Nate Gruver, Mykel J. Kochenderfer

Keywords Paper

informative path planning, online POMDP solver, search-and-rescue

0

0

0

0

9:35

06/12/2021

Nested Counterfactual Identification from Arbitrary Surrogate Experiments

Juan Correa, Sanghack Lee, Elias Bareinboim

Keywords Paper

graph learning, causality, fairness

0

0

0

0

13:34

06/12/2021

3DP3: 3D Scene Perception via Probabilistic Programming

Nishad Gothoskar, Marco Cusumano-Towner, Ben Zinberg and
Matin Ghavamizadeh, Falk Pollok, Austin Garrett, Josh Tenenbaum, Dan Gutfreund, Vikash Mansinghka

Keywords Paper

deep learning, vision, generative model, graph learning

0

0

0

0

6:16

02/02/2021

Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers

Shijie Geng, Peng Gao, Moitreya Chatterjee and
Chiori Hori, Jonathan Le Roux, Yongfeng Zhang, Hongsheng Li, Anoop Cherian

Keywords Paper

0

0

0

0

19:36

14/06/2020

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

Xin Lin, Changxing Ding, Jinquan Zeng, Dacheng Tao

Keywords Paper

scene graph generation, message passing, visual genome dataset, visual relationship detection, open images dataset

0

0

0

0

4:56

16/11/2020

Hierarchical Robot Navigation in Novel Environments using Rough 2-D Maps

Chengguang Xu, Christopher Amato, Lawson Wong

Keywords Paper

0

0

0

0

4:53