Detecting Attended Visual Targets in Video

14/06/2020

Detecting Attended Visual Targets in Video

Eunji Chong, Yongxin Wang, Nataniel Ruiz, James M. Rehg

Keywords: attention, gaze, video, dataset, social scene understanding.

Abstract Paper Similar Papers

Abstract: We address the problem of detecting attention targets in video. Our goal is to identify where each person in each frame of a video is looking, and correctly handle the case where the gaze target is out-of-frame. Our novel architecture models the dynamic interaction between the scene and head features and infers time-varying attention targets. We introduce a new annotated dataset, VideoAttentionTarget, containing complex and dynamic patterns of real-world gaze behavior. Our experiments show that our model can effectively infer dynamic attention in videos. In addition, we apply our predicted attention maps to two social gaze behavior recognition tasks, and show that the resulting classifiers significantly outperform existing methods. We achieve state-of-the-art performance on three datasets: GazeFollow (static images), VideoAttentionTarget (videos), and VideoCoAtt (videos), and obtain the first results for automatically classifying clinically-relevant gaze behavior without wearable cameras or eye trackers.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

22/11/2021

V3GAN: Decomposing Background, Foreground and Motion for Video Generation

Arti Keshari, Sonam Gupta, Sukhendu Das

Keywords Paper

video generation, unconditional video generation, shuffling loss, feature level masking, unsupervised learning, GAN, foreground, background, motion decomposition

0

0

0

0

3:02

04/07/2020

Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA

Hyounghun Kim, Zineng Tang, Mohit Bansal

Keywords Paper

Dense-Caption Matching, Temporal VideoQA, answering questions, frame problem

0

0

0

0

10:56

05/01/2021

DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video

Cristian Rodriguez-Opazo, Edison Marrese-Taylor, Basura Fernando and
Hongdong Li, Stephen Gould

Keywords Paper

0

0

0

0

5:02

14/06/2020

Active Vision for Early Recognition of Human Actions

Boyu Wang, Lihan Huang, Minh Hoai

Keywords Paper

early recognition, active vision, view selection, action early prediction

0

0

0

0

1:01

05/01/2021

Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition

Zachary Wharton, Ardhendu Behera, Yonghuai Liu, Nik Bessis

Keywords Paper

0

0

0

0

5:30

06/12/2020

Video Frame Interpolation without Temporal Priors

Youjian Zhang, Chaoyue Wang, Dacheng Tao

Keywords Paper

0

0

0

0

3:18

02/02/2021

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation

Yang Fu, Linjie Yang, Ding Liu and
Thomas S. Huang, Humphrey Shi

Keywords Paper

0

0

0

0

16:24

30/11/2020

Transforming Multi-Concept Attention into Video Summarization

Yen-Ting Liu, Yu-Jhe Li, Yu-Chiang Frank Wang

Keywords Paper

0

0

0

0

7:07

05/01/2021

Integrating Human Gaze Into Attention for Egocentric Activity Recognition

Kyle Min, Jason J. Corso

Keywords Paper

0

0

0

0

4:56

14/06/2020

Syntax-Aware Action Targeting for Video Captioning

Qi Zheng, Chaoyue Wang, Dacheng Tao

Keywords Paper

video and language, video captioning, action predicting

0

0

0

0

1:01

14/06/2020

Searching for Actions on the Hyperbole

Teng Long, Pascal Mettes, Heng Tao Shen, Cees G. M. Snoek

Keywords Paper

video retrieval, hyperbolic learning, hierarchical, zero-shot learning, action recognition, hyperbolic geometry

0

0

0

0

1:00

22/11/2021

GTA: Global Temporal Attention for Video Action Understanding

Bo He, Xitong Yang, Zuxuan Wu and
Hao Chen, Ser-Nam Lim, Abhinav Shrivastava

Keywords Paper

action recognition, self-attention, temporal modeling

0

0

0

0

2:55

04/07/2020

TVQA+: Spatio-Temporal Grounding for Video Question Answering

Jie Lei, Licheng Yu, Tamara Berg, Mohit Bansal

Keywords Paper

Spatio-Temporal Grounding, Video Answering, Spatio-Temporal Answering, Spatio-Temporal Evidence

0

0

0

0

11:42

22/11/2021

CTRN: Class-Temporal Relational Network for Action Detection

Rui Dai, Srijan Das, Francois Bremond

Keywords Paper

action detection, graph reasoning, graph convolutional network, temporal modelling, multi-label classification

0

0

0

0

7:02

26/04/2020

CLEVRER: Collision Events for Video Representation and Reasoning

Kexin Yi, Chuang Gan, Yunzhu Li and
Pushmeet Kohli, Jiajun Wu, Antonio Torralba, Joshua B. Tenenbaum

Keywords Paper

Neuro-symbolic, Reasoning

0

0

0

0

4:53

22/11/2021

Deep Video Inpainting Detection

Peng Zhou, Ning Yu, Zuxuan Wu and
Larry Davis, Abhinav Shrivastava, Ser-Nam Lim

Keywords Paper

Video Inpainting Detection, Manipulation Detection, DeepFake Detection

0

0

0

0

3:01

14/06/2020

Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation

Gedas Bertasius, Lorenzo Torresani

Keywords Paper

instance segmentation, object detection, object tracking, video analysis.

0

0

0

0

4:59

02/02/2021

Temporal ROI Align for Video Object Recognition

Tao Gong, Kai Chen, Xinjiang Wang and
Qi Chu, Feng Zhu, Dahua Lin, Nenghai Yu, Huamin Feng

Keywords Paper

0

0

0

0

14:29

22/11/2021

Hierarchical Contrastive Motion Learning for Video Action Recognition

Xitong Yang, Xiaodong Yang, Sifei Liu and
Deqing Sun, Larry Davis, Jan Kautz

Keywords Paper

action recognition, motion hierarchy, motion representation, contrastive learning

0

0

0

0

8:29

02/02/2021

Activity Image-to-Video Retrieval by Disentangling Appearance and Motion

Liu Liu, Jiangtong Li, Li Niu and
Ruicong Xu, Liqing Zhang

Keywords Paper

0

0

0

1

14:34

14/06/2020

Beyond Short-Term Snippet: Video Relation Detection With Spatio-Temporal Global Context

Chenchen Liu, Yang Jin, Kehan Xu and
Guoqiang Gong, Yadong Mu

Keywords Paper

video visual relation detection, visual relation detection, deep learning

0

0

0

0

1:01

26/04/2020

Learning from Unlabelled Videos Using Contrastive Predictive Neural 3D Mapping

Adam W. Harley, Shrinidhi K. Lakshmikanth, Fangyu Li and
Xian Zhou, Hsiao-Yu Fish Tung, Katerina Fragkiadaki

Keywords Paper

3D feature learning, unsupervised learning, inverse graphics, object discovery

0

0

0

0

5:17

30/11/2020

Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose

Ardhendu Behera, Zachary Wharton, Pradeep Hewage, Swagat Kumar

Keywords Paper

0

0

0

0

10:19

14/06/2020

EmotiCon: Context-Aware Multimodal Emotion Recognition Using Frege’s Principle

Trisha Mittal, Pooja Guhan, Uttaran Bhattacharya and
Rohan Chandra, Aniket Bera, Dinesh Manocha

Keywords Paper

affective computing, perceived emotions, context understanding, multimodal, inter-agent interactions, depth maps, deep learning, background, attention maps

0

0

0

0

1:00

06/12/2020

Self-Supervised MultiModal Versatile Networks

Jean-Baptiste Alayrac, Adria Recasens, Rosalia Schneider and
Relja Arandjelović, Jason Ramapuram, Jeffrey De Fauw, Lucas Smaira, Sander Dieleman, Andrew Zisserman

Keywords Paper

1

0

0

0

3:25

14/06/2020

Action Genome: Actions As Compositions of Spatio-Temporal Scene Graphs

Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles

Keywords Paper

action recognition, scene graph, video understanding, relationships, composition, action, activity, video

0

0

0

0

1:01

02/02/2021

Contrastive Transformation for Self-supervised Correspondence Learning

Ning Wang, Wengang Zhou, Houqiang Li

Keywords Paper

0

0

0

0

13:41

25/07/2020

3D self-attention for unsupervised video quantization

Jingkuan Song, Ruimin Lang, Xiaosu Zhu and
Xing Xu, Lianli Gao, Heng Tao Shen

Keywords Paper

quantization, video retrieval, ann search

0

0

0

0

9:44

06/12/2021

SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video Decomposition

Rishabh Kabra, Daniel Zoran, Goker Erdogan and
Loic Matthey, Antonia Creswell, Matt Botvinick, Alexander Lerchner, Chris Burgess

Keywords Paper

self-supervised learning

0

0

0

0

14:42

14/06/2020

Unsupervised Learning From Video With Deep Neural Embeddings

Chengxu Zhuang, Tianwei She, Alex Andonian and
Max Sobol Mark, Daniel Yamins

Keywords Paper

unsupervised learning, self-supervised learning, video learning, contrastive learning, deep neural networks, action recognition, object recognition, two-pathway models

0

0

0

0

1:01

14/06/2020

Video Modeling With Correlation Networks

Heng Wang, Du Tran, Lorenzo Torresani, Matt Feiszli

Keywords Paper

action recognition, video classification, motion, correlation, temporal information, kinetics, something-something.

0

0

0

0

1:05

25/04/2020

TurkEyes: A Web-Based Toolbox for Crowdsourcing Attention Data

Anelise Newman, Barry McNamara, Camilo Fosco and
Yun Bin Zhang, Pat Sukhum, Matthew Tancik, Nam Wook Kim, Zoya Bylinskii

Keywords Paper

eye tracking, attention, crowdsourcing, interaction techniques

0

0

0

0

13:19

14/06/2020

Video Instance Segmentation Tracking With a Modified VAE Architecture

Chung-Ching Lin, Ying Hung, Rogerio Feris, Linglin He

Keywords Paper

video instance segmentation, video object tracking, variational autoencoder, vae, gaussian process, multi-task learning

0

0

0

0

1:01

30/11/2020

Learning Multi-Instance Sub-pixel Point Localization

Julien Schroeter, Tinne Tuytelaars, Kirill Sidorov, David Marshall

Keywords Paper

0

0

0

0

9:28

22/11/2021

Exemplar-Based Early Event Prediction in Video

ZEKUN ZHANG, FARRUKH M KORAISHY, Minh Hoai

Keywords Paper

early prediction, video prediction, exemplar

0

0

0

0

2:44

05/01/2021

Exploration of Spatial and Temporal Modeling Alternatives for HOI

Rishabh Dabral, Srijon Sarkar, Sai Praneeth Reddy, Ganesh Ramakrishnan

Keywords Paper

0

0

0

0

4:48

06/12/2020

Self-Learning Transformations for Improving Gaze and Head Redirection

Yufeng Zheng, Seonwook Park, Xucong Zhang and
Shalini De Mello, Otmar Hilliges

Keywords Paper

0

0

0

0

3:20

22/11/2021

Knowing What, Where and When to Look: Video Action modelling with Attention

Juan-Manuel Perez-Rua, Brais Martinez, Xiatian Zhu and
Antoine S Toisoul, Victor A Escorcia, Tao Xiang

Keywords Paper

Action recognition, Fine-grained action, video attention, Spatial attention, Channel attention, Temporal attention, Spatio-temporal attention, Feature refinement

0

0

0

0

2:46

14/06/2020

Spatio-Temporal Graph for Video Captioning With Knowledge Distillation

Boxiao Pan, Haoye Cai, De-An Huang and
Kuan-Hui Lee, Adrien Gaidon, Ehsan Adeli, Juan Carlos Niebles

Keywords Paper

video captioning, spatio-temporal graph, video understanding, vision and language, knowledge distillation, transformer, computer vision.

0

0

0

0

1:01

14/06/2020

Learning to Observe: Approximating Human Perceptual Thresholds for Detection of Suprathreshold Image Transformations

Alan Dolhasz, Carlo Harvey, Ian Williams

Keywords Paper

percetpion, jnd, vision, deep learning, image compositing, local distortions, subjective quality

0

0

0

0

1:01