Compositional Video Synthesis with Action Graphs

18/07/2021

Compositional Video Synthesis with Action Graphs

Amir Bar, Roei Herzig, Xiaolong Wang, Anna Rohrbach, Gal Chechik, Prof. Darrell, Amir Globerson

Keywords: Applications, Computer Vision

Abstract Paper Similar Papers

Abstract: Videos of actions are complex signals containing rich compositional structure in space and time. Current video generation methods lack the ability to condition the generation on multiple coordinated and potentially simultaneous timed actions. To address this challenge, we propose to represent the actions in a graph structure called Action Graph and present the new "Action Graph To Video" synthesis task. Our generative model for this task (AG2Vid) disentangles motion and appearance features, and by incorporating a scheduling mechanism for actions facilitates a timely and coordinated video generation. We train and evaluate AG2Vid on CATER and Something-Something V2 datasets, which results in videos that have better visual quality and semantic consistency compared to baselines. Finally, our model demonstrates zero-shot abilities by synthesizing novel compositions of the learned actions.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

25/07/2020

Learning to transfer graph embeddings for inductive graph based recommendation

Le Wu, Yonghui Yang, Lei Chen and
Defu Lian, Richang Hong, Meng Wang

Keywords Paper

graph neural network, content based recommendation, inductive graph learning

0

0

0

0

15:15

22/11/2021

Zero-Shot Action Recognition from Diverse Object-Scene Compositions

Carlo Bretti, Pascal Mettes

Keywords Paper

action recognition, zero-shot learning, object-scene compositions

0

0

0

0

2:43

22/11/2021

Conditional Model Selection for Efficient Video Understanding

Mihir Jain, Haitam Ben Yahia, Amir Ghodrati and
Amirhossein Habibian, Fatih Porikli

Keywords Paper

action recognition, efficient classification, efficient localization, conditional compute

0

0

0

0

2:49

22/11/2021

LARNet: Latent Action Representation for Human Action Synthesis

Naman Biyani, Aayush Jung Bahadur Rana, Shruti Vyas, Yogesh Rawat

Keywords Paper

action synthesis, video synthesis, joint generative model, human action generation, end-to-end learning, conditional video generation

0

0

0

0

3:02

05/01/2021

Only Time Can Tell: Discovering Temporal Data for Temporal Modeling

Laura Sevilla-Lara, Shengxin Zha, Zhicheng Yan and
Vedanuj Goswami, Matt Feiszli, Lorenzo Torresani

Keywords Paper

0

0

0

0

4:14

14/06/2020

Action Genome: Actions As Compositions of Spatio-Temporal Scene Graphs

Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles

Keywords Paper

action recognition, scene graph, video understanding, relationships, composition, action, activity, video

0

0

0

0

1:01

19/08/2021

Self-Supervised Video Action Localization with Adversarial Temporal Transforms

Guoqiang Gong, Liangfeng Zheng, Wenhao Jiang, Yadong Mu

Keywords Paper

Computer Vision, Action Recognition, Video

0

0

0

0

14:39

02/02/2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Ziyi Liu, Le Wang, Wei Tang and
Junsong Yuan, Nanning Zheng, Gang Hua

Keywords Paper

0

0

0

0

19:49

14/06/2020

G-TAD: Sub-Graph Localization for Temporal Action Detection

Mengmeng Xu, Chen Zhao, David S. Rojas and
Ali Thabet, Bernard Ghanem

Keywords Paper

temporal action detection, adaptive semantic context, subgraph localization, graph convolution, gcnext, graph alignment, thumos14, activitynet1.3

0

0

0

0

1:01

14/06/2020

Searching for Actions on the Hyperbole

Teng Long, Pascal Mettes, Heng Tao Shen, Cees G. M. Snoek

Keywords Paper

video retrieval, hyperbolic learning, hierarchical, zero-shot learning, action recognition, hyperbolic geometry

0

0

0

0

1:00

03/05/2021

gradSim: Differentiable simulation for system identification and visuomotor control

Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo and
Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jérôme Parent-Lévesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

Keywords Paper

3D scene understanding, Physical parameter estimation, System identification, Differentiable simulation, Differentiable physics, Differentiable rendering, 3D vision

0

0

0

0

5:01

02/02/2021

Anticipating Future Relations via Graph Growing for Action Prediction

Xinxiao Wu, Jianwei Zhao, Ruiqi Wang

Keywords Paper

0

0

0

0

14:44

22/11/2021

Dynamic Graph Warping Transformer for Video Alignment

Junyan Wang, Yang Long, Maurice Pagnucco, Yang Song

Keywords Paper

Video alignment, Transformer, Graph Neural Network

0

0

0

0

2:45

30/11/2020

Discovering Multi-Label Actor-Action Association in a Weakly Supervised Setting

Sovan Biswas, Juergen Gall

Keywords Paper

0

0

0

0

10:06

07/09/2020

ALBA: Reinforcement Learning for Video Object Segmentation

Shreyank Gowda, Panagiotis Eustratiadis, Timothy Hospedales, Laura Sevilla-Lara

Keywords Paper

video object segmentation, tracking

0

0

0

0

3:49

02/02/2021

A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization

Ashraful Islam, Chengjiang Long, Richard Radke

Keywords Paper

0

0

0

0

16:53

05/01/2021

Multi-Frame Recurrent Adversarial Network for Moving Object Segmentation

Prashant W. Patil, Akshay Dudhane, Subrahmanyam Murala

Keywords Paper

0

0

0

0

5:00

02/02/2021

Structured Co-reference Graph Attention for Video-grounded Dialogue

Junyeong Kim, Sunjae Yoon, Dahyun Kim, Chang D. Yoo

Keywords Paper

0

0

0

0

14:54

14/06/2020

ActionBytes: Learning From Trimmed Videos to Localize Actions

Mihir Jain, Amir Ghodrati, Cees G. M. Snoek

Keywords Paper

action localization, weakly-supervised, self-supervised learning, action proposals, zero-shot, thumos14, activitynet, multithumos, self-training, temporal segmentation

0

0

0

0

1:01

22/11/2021

Knowing What, Where and When to Look: Video Action modelling with Attention

Juan-Manuel Perez-Rua, Brais Martinez, Xiatian Zhu and
Antoine S Toisoul, Victor A Escorcia, Tao Xiang

Keywords Paper

Action recognition, Fine-grained action, video attention, Spatial attention, Channel attention, Temporal attention, Spatio-temporal attention, Feature refinement

0

0

0

0

2:46

02/02/2021

Temporal Relational Modeling with Self-Supervision for Action Segmentation

Dong Wang, Di Hu, Xingjian Li, Dejing Dou

Keywords Paper

0

0

0

0

17:10

02/02/2021

ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization

Ziyi Liu, Le Wang, Qilin Zhang and
Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

Keywords Paper

0

0

0

0

18:34

03/05/2021

VA-RED$^2$: Video Adaptive Redundancy Reduction

Bowen Pan, Rameswar Panda, Camilo L Fosco and
Chung-Ching Lin, Alex J Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris

Keywords Paper

0

0

0

0

5:02

22/11/2021

Unsupervised Spatio-temporal Latent Feature Clustering for Multiple-object Tracking and Segmentation

Abubakar Siddique, Reza Jalil Mozhdehi, Henry Medeiros

Keywords Paper

Unsupervised learning, Subspace clustering, Heterogeneous autoencoder, Constraints k-means, Multi-task learning, Uncertainty learning, MOTS

0

0

0

0

2:57

05/01/2021

How to Make a BLT Sandwich? Learning VQA Towards Understanding Web Instructional Videos

Shaojie Wang, Wentian Zhao, Ziyi Kou and
Jing Shi, Chenliang Xu

Keywords Paper

0

0

0

0

4:33

14/06/2020

ZSTAD: Zero-Shot Temporal Activity Detection

Lingling Zhang, Xiaojun Chang, Jun Liu and
Minnan Luo, Sen Wang, Zongyuan Ge, Alexander Hauptmann

Keywords Paper

zero-shot learning, temporal activity detetction, r-c3d, super class

0

0

0

0

1:01

22/11/2021

Few-Shot Temporal Action Localization with Query Adaptive Transformer

Sauradip Nag, Xiatian Zhu, Tao Xiang

Keywords Paper

temporal action localization, few shot learning, transformer, class imbalance, meta learning, action detection

0

0

0

0

2:56

14/06/2020

Beyond Short-Term Snippet: Video Relation Detection With Spatio-Temporal Global Context

Chenchen Liu, Yang Jin, Kehan Xu and
Guoqiang Gong, Yadong Mu

Keywords Paper

video visual relation detection, visual relation detection, deep learning

0

0

0

0

1:01

19/08/2021

Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang, Jiale Zhou, Xuming He

Keywords Paper

Computer Vision, Action Recognition, Deep Learning

0

0

0

0

6:20

03/05/2021

Self-Supervised Learning of Compressed Video Representations

Youngjae Yu, Sangho Lee, Gunhee Kim, Yale Song

Keywords Paper

self-supervised learning, Compressed videos

0

0

0

0

4:34

05/01/2021

Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos

Reza Ghoddoosian, Saif Sayed, Vassilis Athitsos

Keywords Paper

0

0

0

0

5:00

14/06/2020

Spatio-Temporal Graph for Video Captioning With Knowledge Distillation

Boxiao Pan, Haoye Cai, De-An Huang and
Kuan-Hui Lee, Adrien Gaidon, Ehsan Adeli, Juan Carlos Niebles

Keywords Paper

video captioning, spatio-temporal graph, video understanding, vision and language, knowledge distillation, transformer, computer vision.

0

0

0

0

1:01

05/01/2021

Supervoxel Attention Graphs for Long-Range Video Modeling

Yang Wang, Gedas Bertasius, Tae-Hyun Oh and
Abhinav Gupta, Minh Hoai, Lorenzo Torresani

Keywords Paper

0

0

0

0

2:01

22/11/2021

A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark

Zhenxi Zhu, Limin Wang, Sheng Guo, Gangshan Wu

Keywords Paper

few-shot learning, classifier-based baseline, new benchmark, action recognition

0

0

0

0

2:58

02/02/2021

Weakly-supervised Temporal Action Localization by Uncertainty Modeling

Pilhyeon Lee, Jinglu Wang, Yan Lu, Hyeran Byun

Keywords Paper

0

0

0

0

14:01

18/07/2021

Unsupervised Co-part Segmentation through Assembly

Qingzhe Gao, Bin Wang, Libin Liu, Baoquan Chen

Keywords Paper

Applications, Computer Vision

0

0

0

0

5:01

06/12/2021

Reformulating Zero-shot Action Recognition for Multi-label Actions

Alec Kerrigan, Kevin Duarte, Yogesh Rawat, Mubarak Shah

Keywords Paper

machine learning, vision

0

0

0

0

15:01

14/06/2020

Adaptive Interaction Modeling via Graph Operations Search

Haoxin Li, Wei-Shi Zheng, Yu Tao and
Haifeng Hu, Jian-Huang Lai

Keywords Paper

interaction recognition, adaptive interaction modeling, neural architecture search, graph operations, video action analysis

0

0

0

0

1:01

14/06/2020

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

Mohsen Fayyaz, Jürgen Gall

Keywords Paper

action segmentation, action recognition, weakly supervised, set

0

0

0

0

1:01

06/12/2021

Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks

Iulia Duta, Andrei L Nicolicioiu, Marius Leordeanu

Keywords Paper

deep learning, machine learning, graph learning

0

0

0

0

9:50