Gold Seeker: Information Gain From Policy Distributions for Goal-Oriented Vision-and-Language Reasoning

14/06/2020

Ehsan Abbasnejad, Iman Abbasnejad, Qi Wu, Javen Shi, Anton van den Hengel

Keywords: information-seeking agent, vision and language tasks, vqa, interactive agents, reinforcement learning

Abstract: As Computer Vision moves from passive analysis of pixels to active analysis of semantics, the breadth of information algorithms need to reason over has expanded significantly. One of the key challenges in this vein is the ability to identify the information required to make a decision, and select an action that will recover it. We propose a reinforcement-learning approach that maintains a distribution over its internal information, thus explicitly representing the ambiguity in what it knows, and needs to know, towards achieving its goal. Potential actions are then generated according to this distribution. For each potential action a distribution of the expected outcomes is calculated, and the value of the potential information gain assessed. The action taken is that which maximizes the potential information gain. We demonstrate this approach applied to two vision-and-language problems that have attracted significant recent interest, visual dialog and visual query generation. In both cases the method actively selects actions that will best reduce its internal uncertainty, and outperforms its competitors in achieving the goal of the challenge.
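The selection rule described in the abstract (keep a distribution over what is known, score each candidate action by its expected information gain, take the best one) can be illustrated with a small discrete toy example. This is a minimal sketch under stated assumptions, not the authors' method: the belief is a plain categorical distribution over hypothetical hidden states, each candidate action is represented by an assumed outcome-likelihood table, and the names `entropy`, `expected_information_gain`, and `select_action` are introduced here purely for illustration.

```python
import numpy as np


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    p = np.asarray(p, dtype=float)
    nz = p[p > 0]  # skip zero entries, since 0 * log(0) is taken as 0
    return float(-(nz * np.log(nz)).sum())


def expected_information_gain(belief, likelihood):
    """Expected entropy reduction of `belief` after observing one outcome.

    belief:     shape (S,), current distribution over hidden states.
    likelihood: shape (S, O), likelihood[s, o] = p(outcome o | state s)
                for a single candidate action (an assumed, known model).
    """
    belief = np.asarray(belief, dtype=float)
    likelihood = np.asarray(likelihood, dtype=float)
    joint = belief[:, None] * likelihood  # p(state, outcome)
    p_outcome = joint.sum(axis=0)         # p(outcome)
    gain = entropy(belief)
    for o, p_o in enumerate(p_outcome):
        if p_o > 0:
            posterior = joint[:, o] / p_o  # Bayes update for outcome o
            gain -= p_o * entropy(posterior)
    return gain


def select_action(belief, likelihoods):
    """Return the index of the action with the largest expected gain."""
    gains = [expected_information_gain(belief, lik) for lik in likelihoods]
    return int(np.argmax(gains)), gains


# Toy setup: four equally likely hidden states and three candidate actions.
belief = np.full(4, 0.25)
likelihoods = [
    np.full((4, 2), 0.5),                      # outcome is independent of the state
    np.array([[1, 0], [1, 0], [0, 1], [0, 1]]),  # splits the states into two halves
    np.array([[1, 0], [0, 1], [0, 1], [0, 1]]),  # only singles out state 0
]
best, gains = select_action(belief, likelihoods)
print(best, gains)
```

In this toy setup the action that splits the states into two equal halves removes the most uncertainty in expectation, so it is the one selected; the uninformative action has zero expected gain.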

Talk and the respective paper are published at CVPR 2020 virtual conference.

Similar Papers

26/04/2020

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Piyush Gupta, Nikaash Puri, Sukriti Verma, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Keywords: Deep Reinforcement Learning, Saliency maps, Chess, Atari games, Interpretable AI

4:59

22/11/2021

Single-Modal Entropy based Active Learning for Visual Question Answering

Dong-Jin Kim, Jae Won Cho, Jinsoo Choi, Yunjae Jung, In So Kweon

Keywords: Visual Question Answering, Vision and Language, Active Learning

2:42

06/12/2020

Predictive Information Accelerates Learning in RL

Kuang-Huei Lee, Ian Fischer, Anthony Liu, Yijie Guo, Honglak Lee, John Canny, Sergio Guadarrama

3:10

18/07/2021

Inverse Decision Modeling: Learning Interpretable Representations of Behavior

Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Keywords: Reinforcement Learning and Planning, Others

16:11

06/12/2020

Value-driven Hindsight Modelling

Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

3:20

13/04/2021

Linear models are robust optimal under strategic behavior

Wei Tang, Chien-Ju Ho, Yang Liu

3:32

23/08/2020

Spectrum-guided adversarial disparity learning

Zhe Liu, Lina Yao, Lei Bai, Xianzhi Wang, Can Wang

Keywords: adversarial autoencoder, generative models, intraclass variability, activity recognition

14:30

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords: Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

5:05

16/11/2020

CapWAP: Image Captioning with a Purpose

Adam Fisch, Kenton Lee, Ming-Wei Chang, Jonathan Clark, Regina Barzilay

Keywords: image task, visual images, captioning, capwap

11:26

02/02/2021

A Simple and Effective Self-Supervised Contrastive Learning Framework for Aspect Detection

Tian Shi, Liuqing Li, Ping Wang, Chandan K. Reddy

19:21

03/05/2021

Trusted Multi-View Classification

Zongbo Han, Changqing Zhang, Huazhu FU, Joey T Zhou

Keywords: Uncertainty Machine Learning, Multi-View Learning, Multi-Modal Learning

4:33

14/06/2020

Focus on Defocus: Bridging the Synthetic to Real Domain Gap for Depth Estimation

Maxim Maximov, Kevin Galim, Laura Leal-Taixé

Keywords: depth estimation, generalisation, depth from focus, blur estimation, depth

1:01

30/11/2020

Second Order enhanced Multi-glimpse Attention in Visual Question Answering

Qiang Sun, Binghui Xie, Yanwei Fu

7:20

22/11/2021

Attention to Action: Leveraging Attention for Object Navigation

Shi Chen, Qi Zhao

Keywords: Object-goal Navigation, Attention, Visual Navigation

2:51

02/02/2021

Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers

Shijie Geng, Peng Gao, Moitreya Chatterjee, Chiori Hori, Jonathan Le Roux, Yongfeng Zhang, Hongsheng Li, Anoop Cherian

19:36

06/12/2020

Beyond Individualized Recourse: Interpretable and Interactive Summaries of Actionable Recourses

Kai Rawal, Himabindu Lakkaraju

3:31

06/12/2020

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Masha Itkina, Boris Ivanovic, Ransalu Senanayake, Mykel J Kochenderfer, Marco Pavone

3:39

05/01/2021

Integrating Human Gaze Into Attention for Egocentric Activity Recognition

Kyle Min, Jason J. Corso

4:56

06/12/2021

Stateful Strategic Regression

Keegan Harris, Hoda Heidari, Steven Wu

Keywords: optimization

14:02

14/06/2020

Syntax-Aware Action Targeting for Video Captioning

Qi Zheng, Chaoyue Wang, Dacheng Tao

Keywords: video and language, video captioning, action predicting

1:01

06/12/2021

The Value of Information When Deciding What to Learn

Dilip Arumugam, Benjamin Van Roy

Keywords: theory, reinforcement learning and planning, bandits

9:27

02/02/2021

The Value-Improvement Path: Towards Better Representations for Reinforcement Learning

Will Dabney, André Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, David Silver

20:06

14/06/2020

Learning to Observe: Approximating Human Perceptual Thresholds for Detection of Suprathreshold Image Transformations

Alan Dolhasz, Carlo Harvey, Ian Williams

Keywords: perception, jnd, vision, deep learning, image compositing, local distortions, subjective quality

1:01

03/05/2021

Learning to Represent Action Values as a Hypergraph on the Action Vertices

Arash Tavakoli, Mehdi Fatemi, Petar Kormushev

Keywords: reinforcement learning, learning action representations, multi-dimensional discrete action spaces, structural inductive bias, structural credit assignment

3:43

07/09/2020

Adversarial Concurrent Training: Optimizing Robustness and Accuracy Trade-off of Deep Neural Networks

Elahe Arani, Fahad Sarfraz, Bahram Zonooz

Keywords: Adversarial Robustness, Generalization, Adversarial Training, Deep Learning, Collaborative Learning

3:39

14/06/2020

Counterfactual Vision and Language Learning

Ehsan Abbasnejad, Damien Teney, Amin Parvaneh, Javen Shi, Anton van den Hengel

Keywords: counterfactual reasoning, vision and language tasks, vqa

5:00

02/02/2021

A Continual Learning Framework for Uncertainty-Aware Interactive Image Segmentation

Ervine Zheng, Qi Yu, Rui Li, Pengcheng Shi, Anne Haake

14:21

26/04/2020

Frequency-based Search-control in Dyna

Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand

Keywords: Model-based reinforcement learning, search-control, Dyna, frequency of a signal

4:32

03/05/2021

Fast And Slow Learning Of Recurrent Independent Mechanisms

Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schoelkopf, Yoshua Bengio

Keywords: better generalization, modular representations, learning mechanisms

5:09

12/07/2020

Data Valuation using Reinforcement Learning

Jinsung Yoon, Sercan Arik, Tomas Pfister

Keywords: Deep Learning - Algorithms

14:35

02/02/2021

Hindsight and Sequential Rationality of Correlated Play

Dustin Morrill, Ryan D'Orazio, Reca Sarfati, Marc Lanctot, James R Wright, Amy R Greenwald, Michael Bowling

18:34

30/11/2020

Visualizing Color-wise Saliency of Black-Box Image Classification Models

Yuhki Hatakeyama, Hiroki Sakuma, Yoshinori Konishi, Kohei Suenaga

9:43

19/08/2021

Stochastic Actor-Executor-Critic for Image-to-Image Translation

Ziwei Luo, Jing Hu, Xin Wang, Siwei Lyu, Bin Kong, Youbing Yin, Qi Song, Xi Wu

Keywords: Machine Learning, Deep Reinforcement Learning, Learning Generative Models, Applications of Reinforcement Learning

5:34

14/06/2020

NestedVAE: Isolating Common Factors via Weak Supervision

Matthew J. Vowels, Necati Cihan Camgöz, Richard Bowden

Keywords: fairness, bias, representation learning, invariance, vae, variational, weakly supervised, information bottleneck

1:00

12/07/2020

Generating Programmatic Referring Expressions via Program Synthesis

Jiani Huang, Calvin Smith, Osbert Bastani, Rishabh Singh, Aws Albarghouthi, Mayur Naik

Keywords: Deep Learning - Algorithms

13:54

18/07/2021

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal, Christian Schroeder, Bei Peng, Wendelin Boehmer, Shimon Whiteson, Fei Sha

Keywords: Optimization, Convex Optimization, Reinforcement Learning and Planning, Multi-Agent RL, Algorithms, Large Scale Learning; Probabilistic Methods, Distributed Inference

20:08

26/04/2020

Progressive learning and disentanglement of hierarchical representations

Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang

Keywords: generative model, disentanglement, progressive learning, VAE

5:06

06/12/2020

Weakly-Supervised Reinforcement Learning for Controllable Behavior

Lisa Lee, Benjamin Eysenbach, Russ Salakhutdinov, Shixiang (Shane) Gu, Chelsea Finn

3:31

06/12/2021

Local Disentanglement in Variational Auto-Encoders Using Jacobian $L_1$ Regularization

Travers Rhodes, Daniel Lee

Keywords: representation learning

6:58

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords: reinforcement learning and planning, transformers

8:43