SubICap: Towards Subword-Informed Image Captioning

Abstract: Existing Image Captioning (IC) systems model words as atomic units in captions and are unable to exploit the structural information in the words. This makes representation of rare words very difficult and out-of-vocabulary words impossible. Moreover, to avoid computational complexity, existing IC models operate over a modest sized vocabulary of frequent words, such that the identity of rare words is lost. In this work we address this common limitation of IC systems in dealing with rare words in the corpora. We decompose words into smaller constituent units `subwords' and represent captions as a sequence of subwords instead of words. This helps represent all words in the corpora using a significantly lower subword vocabulary, leading to better parameter learning. Using subword language modeling, our captioning system improves various metric scores, with a training vocabulary size approximately 90% less than the baseline and various state-of-the-art word-level models. Our quantitative and qualitative results and analysis signify the efficacy of our proposed approach.

18/07/2021

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

SubICap: Towards Subword-Informed Image Captioning

Naeha Sharif, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah

Comments

Similar Papers

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

Chao Jia, Yinfei Yang, Ye Xia and Yi-Ting Chen, Zarana Parekh, Hieu Pham, Quoc Le, Yun-Hsuan Sung, Zhen Li, Tom Duerig

Keywords Abstract Paper

Deep Learning, Embedding and Representation learning

Equalization Loss for Long-Tailed Object Recognition

Jingru Tan, Changbao Wang, Buyu Li and Quanquan Li, Wanli Ouyang, Changqing Yin, Junjie Yan

Keywords Abstract Paper

long tail, object detection, lvis, object recognition

HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks

Zhen Dong, Zhewei Yao, Daiyaan Arfeen and Amir Gholami, Michael Mahoney, Kurt Keutzer

Keywords Abstract Paper

Moving Down the Long Tail of Word Sense Disambiguation with Gloss Informed Bi-encoders

Terra Blevins, Luke Zettlemoyer

Keywords Abstract Paper

Word Disambiguation, Word WSD, WSD, sense disambiguation

Weakly-Supervised Salient Object Detection via Scribble Annotations

Jing Zhang, Xin Yu, Aixuan Li and Peipei Song, Bowen Liu, Yuchao Dai

Keywords Abstract Paper

rgb saliency detection, scribble annotation, weakly-supervised

Taking Notes on the Fly Helps Language Pre-Training

Qiyu Wu, Chen Xing, Yatao Li and Guolin Ke, Di He, Tie-Yan Liu

Keywords Abstract Paper

Natural Language Processing, Pre-training

Contemplating Real-World Object Classification

Ali Borji

Keywords Abstract Paper

Robustness, object recognition, deep learning, ObjectNet

Automatic differentiation variational inference with mixtures

Warren Morningstar, Sharad Vikram, Cusuh Ham and Andrew Gallagher, Joshua Dillon

Keywords Abstract Paper

From Saturation to Zero-Shot Visual Relationship Detection Using Local Context

Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos

Keywords Abstract Paper

Visual Relationship Detection, Scene Graph Generation, Zero-shot Classification, Local Context, Language Bias

Self-training pre-trained language models for zero- and few-shot multi-dialectal Arabic sequence labeling

Muhammad Khalifa, Muhammad Abdul-Mageed, Khaled Shaalan

Keywords Abstract Paper

From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers

Anne Lauscher, Vinit Ravishankar, Ivan Vulić, Goran Glavaš

Keywords Abstract Paper

zero-shot transfer, downstream transfer, resource-lean scenarios, pos tagging

Continuous Self-Attention Models with Neural ODE Networks

Jing Zhang, Peng Zhang, Baiwen Kong and Junqiu Wei, Xin Jiang

Keywords Abstract Paper

Accurate Post Training Quantization With Small Calibration Sets

Itay Hubara, Yury Nahshan, Yair Hanani and Ron Banner, Daniel Soudry

Keywords Abstract Paper

Algorithms, AutoML

Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation

Biao Zhang, Philip Williams, Ivan Titov, Rico Sennrich

Keywords Abstract Paper

Massively Translation, Zero-Shot Translation, neural translation, NMT

An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

Bhargavi Paranjape, Mandar Joshi, John Thickstun and Hannaneh Hajishirzi, Luke Zettlemoyer

Keywords Abstract Paper

language understanding, semi-supervised setting, complex models, explainer

Shapeshifter: a Parameter-efficient Transformer using Factorized Reshaped Matrices

Aliakbar Panahi, Seyran Saeedi, Tom Arodz

Keywords Abstract Paper

transformers

Disfluency correction using unsupervised and semi-supervised learning

Nikhil Saini, Drumil Trivedi, Shreya Khare and Tejas Dhamecha, Preethi Jyothi, Samarth Bharadwaj, Pushpak Bhattacharyya

Keywords Abstract Paper

Top-k Training of GANs: Improving GAN Performance by Throwing Away Bad Samples

Samarth Sinha, Zhengli Zhao, Anirudh Goyal ALIAS PARTH GOYAL and Colin A Raffel, Augustus Odena

Keywords Abstract Paper

Do sequence-to-sequence VAEs learn global features of sentences?

Tom Bosc, Pascal Vincent

Keywords Abstract Paper

generation, memorization, autoregressive models, variational autoencoder

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti and Anna Korhonen, Goran Glavaš

Keywords Abstract Paper

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li, lemao liu, Shuming Shi

Keywords Abstract Paper

Chao Jia, Yinfei Yang, Ye Xia and
Yi-Ting Chen, Zarana Parekh, Hieu Pham, Quoc Le, Yun-Hsuan Sung, Zhen Li, Tom Duerig

Keywords Paper

Jingru Tan, Changbao Wang, Buyu Li and
Quanquan Li, Wanli Ouyang, Changqing Yin, Junjie Yan

Keywords Paper

Zhen Dong, Zhewei Yao, Daiyaan Arfeen and
Amir Gholami, Michael Mahoney, Kurt Keutzer

Keywords Paper

Keywords Paper

Jing Zhang, Xin Yu, Aixuan Li and
Peipei Song, Bowen Liu, Yuchao Dai

Keywords Paper

Qiyu Wu, Chen Xing, Yatao Li and
Guolin Ke, Di He, Tie-Yan Liu

Keywords Paper

Keywords Paper

Warren Morningstar, Sharad Vikram, Cusuh Ham and
Andrew Gallagher, Joshua Dillon

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jing Zhang, Peng Zhang, Baiwen Kong and
Junqiu Wei, Xin Jiang

Keywords Paper

Itay Hubara, Yury Nahshan, Yair Hanani and
Ron Banner, Daniel Soudry

Keywords Paper

Keywords Paper

Bhargavi Paranjape, Mandar Joshi, John Thickstun and
Hannaneh Hajishirzi, Luke Zettlemoyer

Keywords Paper

Keywords Paper

Nikhil Saini, Drumil Trivedi, Shreya Khare and
Tejas Dhamecha, Preethi Jyothi, Samarth Bharadwaj, Pushpak Bhattacharyya

Keywords Paper

Samarth Sinha, Zhengli Zhao, Anirudh Goyal ALIAS PARTH GOYAL and
Colin A Raffel, Augustus Odena

Keywords Paper

Keywords Paper

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti and
Anna Korhonen, Goran Glavaš

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Aditya Kusupati, Matthew Wallingford, Vivek Ramanujan and
Raghav Somani, Jae Sung Park, Krishna Pillutla, Prateek Jain, Sham Kakade, Ali Farhadi

Keywords Paper

Pengfei Wang, Chengquan Zhang, Fei Qi and
Shanshan Liu, Xiaoqiang Zhang, Pengyuan Lyu, Junyu Han, Jingtuo Liu, Errui Ding, Guangming Shi

Keywords Paper

Keywords Paper

Keywords Paper

Jingjing Li, Wei Ji, Qi Bi and
Cheng Yan, Miao Zhang, Yongri Piao, Huchuan Lu, Li cheng

Keywords Paper

Sylvestre-Alvise Rebuffi, Sven Gowal, Dan Andrei Calian and
Florian Stimberg, Olivia Wiles, Timothy A Mann

Keywords Paper

Yiqin Yang, Xiaoteng Ma, Li Chenghao and
Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper