The role of syntactic planning in compositional image captioning

19/04/2021

Emanuele Bugliarello, Desmond Elliott

Abstract: Image captioning has focused on generalizing to images drawn from the same distribution as the training set, and not to the more challenging problem of generalizing to different distributions of images. Recently, Nikolaus et al. (2019) introduced a dataset to assess compositional generalization in image captioning, where models are evaluated on their ability to describe images with unseen adjective–noun and noun–verb compositions. In this work, we investigate different methods to improve compositional generalization by planning the syntactic structure of a caption. Our experiments show that jointly modeling tokens and syntactic tags enhances generalization in both RNN- and Transformer-based models, while also improving performance on standard metrics.
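As a rough illustration of what "jointly modeling tokens and syntactic tags" could look like on the data side, the sketch below interleaves each caption token with a part-of-speech tag so that a single decoder would predict the tag before the word. The interleaving scheme, the `<TAG>` format, and the function names `interleave` and `split_interleaved` are assumptions made for this example; the abstract does not specify how the paper combines tokens and tags.

```python
from typing import List, Tuple


def interleave(tokens: List[str], tags: List[str]) -> List[str]:
    """Build one target sequence of the form <TAG_1> tok_1 <TAG_2> tok_2 ...

    Hypothetical scheme: the tag comes first, so the syntactic category
    is "planned" before the word itself is generated.
    """
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")
    sequence: List[str] = []
    for tag, token in zip(tags, tokens):
        sequence.append(f"<{tag}>")
        sequence.append(token)
    return sequence


def split_interleaved(sequence: List[str]) -> Tuple[List[str], List[str]]:
    """Invert interleave(): recover the plain tokens and the tags."""
    if len(sequence) % 2 != 0:
        raise ValueError("interleaved sequence must have even length")
    tags = [item[1:-1] for item in sequence[0::2]]  # strip the angle brackets
    tokens = list(sequence[1::2])
    return tokens, tags


caption = ["a", "white", "dog", "runs"]
pos_tags = ["DET", "ADJ", "NOUN", "VERB"]
target = interleave(caption, pos_tags)
print(" ".join(target))
```

At evaluation time the tags would be stripped with `split_interleaved` so that standard captioning metrics are computed on the words only.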

The talk and the corresponding paper were published at the EACL 2021 virtual conference.

Similar Papers

06/12/2021

Learnable Fourier Features for Multi-dimensional Spatial Positional Encoding

Yang Li, Si Si, Gang Li, Cho-Jui Hsieh, Samy Bengio

Keywords: machine learning, transformers, vision

10:54

02/02/2021

Learning Intact Features by Erasing-Inpainting for Few-shot Classification

Junjie Li, Zilei Wang, Xiaoming Hu

15:15

06/12/2020

Learning Semantic-aware Normalization for Generative Adversarial Networks

Heliang Zheng, Jianlong Fu, zengyh Zeng, Jiebo Luo, Zheng-Jun Zha

3:11

02/02/2021

Tailoring Embedding Function to Heterogeneous Few-Shot Tasks by Global and Local Feature Adaptors

Su Lu, Han-Jia Ye, De-Chuan Zhan

14:21

22/11/2021

Image-Text Alignment using Adaptive Cross-attention with Transformer Encoder for Scene Graphs

Juyong Song, Sunghyun Choi

Keywords: cross-attention, multi-modal, retrieval, scene-graphs, graph neural networks, contrastive loss

3:01

25/07/2020

Regional relation modeling for visual place recognition

Yingying Zhu, Biao Li, Jiong Wang, Zhou Zhao

Keywords: convolutional neural network, visual place recognition, content-based image retrieval, relation modeling

14:11

22/11/2021

OODformer: Out-Of-Distribution Detection Transformer

Rajat Koner, Poulami Sinhamahapatra, Karsten Roscher, Stephan Günnemann, Volker Tresp

Keywords: Out-Of-Distribution Detection, Vision Transformer, Representation Learning

3:19

05/01/2021

Intra-Class Part Swapping for Fine-Grained Image Classification

Lianbo Zhang, Shaoli Huang, Wei Liu

4:43

02/11/2020

Guided multi-branch learning systems for sound event detection with sound separation

Yuxin Huang, Liwei Lin, Shuo Ma, Xiangdong Wang, Hong Liu, Yueliang Qian, Min Liu, Kazushige Ouchi

12:52

30/11/2020

Image Captioning through Image Transformer

Sen He, Wentong Liao, Hamed R. Tavakoli, Michael Yang, Bodo Rosenhahn, Nicolas Pugeault

9:49

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R Devon Hjelm, Shikhar Sharma

17:44

04/07/2020

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

Fynn Schröder, Chris Biemann

Keywords: multi-task tasks, MTL, TL, MTL setups

12:02

06/12/2020

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Masha Itkina, Boris Ivanovic, Ransalu Senanayake, Mykel J Kochenderfer, Marco Pavone

3:39

14/06/2020

ROAM: Recurrently Optimizing Tracking Model

Tianyu Yang, Pengfei Xu, Runbo Hu, Hua Chai, Antoni B. Chan

Keywords: resizable tracking model, recurrent neural optimizer, meta learning, random filter scaling, visual tracking

1:01

14/06/2020

Learning Fused Pixel and Feature-Based View Reconstructions for Light Fields

Jinglei Shi, Xiaoran Jiang, Christine Guillemot

Keywords: light field, view synthesis, feature-based reconstruction, pixel-based reconstruction, deep learning, angular super-resolution

4:56

06/12/2021

Grounding inductive biases in natural images: invariance stems from variations in data

Diane Bouchacourt, Mark Ibrahim, Ari Morcos

Keywords: machine learning, transformers

14:19

14/06/2020

Learning Texture Invariant Representation for Domain Adaptation of Semantic Segmentation

Myeongjin Kim, Hyeran Byun

Keywords: domain adaptation, segmentation, texture

1:01

07/09/2020

Mixup-CAM: Weakly-supervised Semantic Segmentation via Uncertainty Regularization

Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung, Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang

Keywords: semantic segmentation, weakly-supervised learning, class activation map, mixup augmentation, entropy regularization

8:22

16/11/2020

Unsupervised Metric Relocalization Using Transform Consistency Loss

Mike Kasper, Fernando Nobre, Christoffer Heckman, Nima Keivan

3:58

26/04/2020

Neural Outlier Rejection for Self-Supervised Keypoint Learning

Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus

Keywords: Self-Supervised Learning, Keypoint Detection, Outlier Rejection, Deep Learning

4:55

22/11/2021

Multi-Source Domain Adaptation via supervised contrastive learning and confident consistency regularization

Marin Scalbert, Florent Couzinié-Devy, Maria Vakalopoulou

Keywords: unsupervised domain adaptation, contrastive learning, semi-supervised learning, consistency regularization, domain shift

2:57

06/12/2021

Looking Beyond Single Images for Contrastive Semantic Segmentation Learning

FEIHU ZHANG, Philip Torr, Rene Ranftl, Stephan Richter

Keywords: machine learning, vision, contrastive learning, representation learning

14:48

06/12/2020

ICE-BeeM: Identifiable Conditional Energy-Based Deep Models Based on Nonlinear ICA

Ilyes Khemakhem, Ricardo Monti, Diederik P. Kingma, Aapo Hyvarinen

3:02

19/08/2021

Towards Cross-View Consistency in Semantic Segmentation While Varying View Direction

Xin Tong, Xianghua Ying, Yongjie Shi, He Zhao, Ruibin Wang

Keywords: Computer Vision, Recognition, Robotics and Vision

10:10

02/02/2021

Merging Statistical Feature via Adaptive Gate for Improved Text Classification

Xianming Li, Zongxi Li, Haoran Xie, Qing Li

14:56

18/07/2021

Generative Adversarial Transformers

Drew A. Hudson, Larry Zitnick

Keywords: Deep Learning, Architectures

5:15

02/02/2021

Towards Reusable Network Components by Learning Compatible Representations

Michael Gygli, Jasper Uijlings, Vittorio Ferrari

19:58

06/12/2021

Probabilistic Attention for Interactive Segmentation

Prasad Gabbur, Manjot Bilkhu, Javier Movellan

Keywords: transformers, vision

13:20

14/09/2020

On Saliency Maps and Adversarial Robustness

Puneet Mangla, Vedant Singh, Vineeth Balasubramanian

Keywords: adversarial robustness, saliency maps, deep neural networks

17:29

14/06/2020

Interpretable and Accurate Fine-grained Recognition via Region Grouping

Zixuan Huang, Yin Li

Keywords: interpretable deep model, fine-grained recognition, region-based recognition

4:58

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

Keywords: visual localization, camera relocalization, scene coordinate regression

1:01

06/12/2020

CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection

Qijian Zhang, Runmin Cong, Junhui Hou, Chongyi Li, Yao Zhao

Keywords: Theory -> Learning Theory

3:14

19/08/2021

Context-Aware Image Inpainting with Learned Semantic Priors

Wendong Zhang, Junwei Zhu, Ying Tai, Yunbo Wang, Wenqing Chu, Bingbing Ni, Chengjie Wang, Xiaokang Yang

Keywords: Computer Vision, 2D and 3D Computer Vision, Deep Learning

13:26

14/06/2020

Harmonizing Transferability and Discriminability for Adapting Object Detectors

Chaoqi Chen, Zebiao Zheng, Xinghao Ding, Yue Huang, Qi Dou

Keywords: unsupervised domain adaptation, cross-domain object detection, transfer learning, deep learning, hierarchical transferability calibration

1:01

14/06/2020

Articulation-Aware Canonical Surface Mapping

Nilesh Kulkarni, Abhinav Gupta, David F. Fouhey, Shubham Tulsiani

Keywords: canonical surface map, correspondence, articulation, single image to 3d, pose

1:01

06/12/2021

Manifold Topology Divergence: a Framework for Comparing Data Manifolds.

Serguei Barannikov, Ilya Trofimov, Grigorii Sotnikov, Ekaterina Trimbach, Alexander Korotin, Alexander Filippov, Evgeny Burnaev

Keywords: generative model

15:01

02/02/2021

Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Ardhendu Behera, Zachary Wharton, Pradeep R P G Hewage, Asish Bera

18:54

05/01/2021

Towards Fair Cross-Domain Adaptation via Generative Learning

Tongxin Wang, Zhengming Ding, Wei Shao, Haixu Tang, Kun Huang

4:56

30/11/2020

Second Order enhanced Multi-glimpse Attention in Visual Question Answering

Qiang Sun, Binghui Xie, Yanwei Fu

7:20

06/12/2020

Few-shot Image Generation with Elastic Weight Consolidation

Yijun Li, Richard Zhang, Jingwan (Cynthia) Lu, Eli Shechtman

3:16