Crisscrossed captions: Extended intramodal and intermodal semantic similarity judgments for MS-COCO

19/04/2021

Crisscrossed captions: Extended intramodal and intermodal semantic similarity judgments for MS-COCO

Zarana Parekh, Jason Baldridge, Daniel Cer, Austin Waters, Yinfei Yang

Keywords:

Abstract Paper Similar Papers

Abstract: By supporting multi-modal retrieval training and evaluation, image captioning datasets have spurred remarkable progress on representation learning. Unfortunately, datasets have limited cross-modal associations: images are not paired with other images, captions are only paired with other captions of the same image, there are no negative associations and there are missing positive cross-modal associations. This undermines research into how inter-modality learning impacts intra-modality tasks. We address this gap with Crisscrossed Captions (CxC), an extension of the MS-COCO dataset with human semantic similarity judgments for 267,095 intra- and inter-modality pairs. We report baseline results on CxC for strong existing unimodal and multimodal models. We also evaluate a multitask dual encoder trained on both image-caption and caption-caption pairs that crucially demonstrates CxC’s value for measuring the influence of intra- and inter-modality learning.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EACL 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Diverse Image Captioning with Context-Object Split Latent Spaces

Shweta Mahajan, Stefan Roth

Keywords Paper

0

0

0

0

3:19

02/02/2021

Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning

Chaoqun Wang, Xuejin Chen, Shaobo Min and
Xiaoyan Sun, Houqiang Li

Keywords Paper

0

0

0

0

14:56

06/12/2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen and
Xiaoyan Sun, Houqiang Li

Keywords Paper

0

0

0

0

10:51

03/05/2021

Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models

Yuge Shi, Brooks Paige, Philip Torr, Siddharth N

Keywords Paper

Deep generative model, representation learning, multi-modal learning

0

0

0

0

5:09

06/12/2021

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Hankook Lee, Kibok Lee, Kimin Lee and
Honglak Lee, Jinwoo Shin

Keywords Paper

self-supervised learning, contrastive learning, representation learning, transfer learning

0

0

0

0

10:10

06/12/2020

Variational Interaction Information Maximization for Cross-domain Disentanglement

HyeongJoo Hwang, Geon-Hyeong Kim, Seunghoon Hong, Kee-Eung Kim

Keywords Paper

Classification, Few-Shot Learning, Missing Data, Network Analysis, Adversarial Learning

0

0

0

0

3:28

26/04/2020

Neural Outlier Rejection for Self-Supervised Keypoint Learning

Jiexiong Tang, Hanme Kim, Vitor Guizilini and
Sudeep Pillai, Rares Ambrus

Keywords Paper

Self-Supervised Learning, Keypoint Detection, Outlier Rejection, Deep Learning

0

0

0

0

4:55

06/12/2021

Looking Beyond Single Images for Contrastive Semantic Segmentation Learning

FEIHU ZHANG, Philip Torr, Rene Ranftl, Stephan Richter

Keywords Paper

machine learning, vision, contrastive learning, representation learning

0

0

0

0

14:48

06/12/2020

Contrastive Learning with Adversarial Examples

Chih-Hui Ho, Nuno Nvasconcelos

Keywords Paper

0

0

0

0

3:13

06/12/2021

Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos

Reuben Tan, Bryan Plummer, Kate Saenko and
Hailin Jin, Bryan Russell

Keywords Paper

optimization

0

0

0

0

12:28

22/11/2021

Improving Text-to-Image Synthesis Using Contrastive Learning

Hui Ye, Xiulong Yang, Martin Takac and
Rajshekhar Sunderraman, Shihao Ji

Keywords Paper

Text-to-Image Synthesis, Image Generation, Contrastive Learning, Image-text Matching, Siamese Structure, Cross-domain Representation Learning, Generative Adversarial Networks, Conditional Generative Adversarial Networks

0

0

0

0

3:28

06/12/2020

Self-Supervised Learning by Cross-Modal Audio-Video Clustering

Humam Alwassel, Dhruv Mahajan, Bruno Korbar and
Lorenzo Torresani, Bernard Ghanem, Du Tran

Keywords Paper

, Applications -> Computer Vision

0

0

0

0

3:17

02/02/2021

Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval

Qingchao Chen, Yang Liu, Samuel Albanie

Keywords Paper

0

0

0

0

15:19

02/02/2021

Self-Supervised Attention-Aware Reinforcement Learning

Haiping Wu, Khimya Khetarpal, Doina Precup

Keywords Paper

0

0

0

0

14:04

03/05/2021

Disentangled Recurrent Wasserstein Autoencoder

Jun Han, Martin Min, Ligong Han and
Li Erran Li, Xuan Zhang

Keywords Paper

Recurrent Generative Model, Sequential Representation Learning, Disentanglement

0

0

0

0

9:17

18/07/2021

Robust Representation Learning via Perceptual Similarity Metrics

Saeid A Taghanaki, Kristy Choi, Amir Hosein Khasahmadi, Anirudh Goyal

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

4:48

06/12/2020

CoMIR: Contrastive Multimodal Image Representation for Registration

Nicolas Pielawski, Elisabeth Wetzer, Johan Öfverstedt and
Jiahao Lu, Carolina Wählby, Joakim Lindblad, Natasa Sladoje

Keywords Paper

0

0

0

0

2:55

14/06/2020

A Disentangling Invertible Interpretation Network for Explaining Latent Representations

Patrick Esser, Robin Rombach, Björn Ommer

Keywords Paper

interpretability, inn, disentangling, generative models, invertible neural networks, autoencoders, normalizing flows, vae, explainable, xai

0

0

0

0

1:01

26/04/2020

Decoupling Representation and Classifier for Long-Tailed Recognition

Bingyi Kang, Saining Xie, Marcus Rohrbach and
Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis

Keywords Paper

long-tailed recognition, classification

0

0

0

1

5:00

14/06/2020

Fine-Grained Image-to-Image Transformation Towards Visual Recognition

Wei Xiong, Yutong He, Yixuan Zhang and
Wenhan Luo, Lin Ma, Jiebo Luo

Keywords Paper

generative adversarial networks, fine-grained image generation, non-local networks, feature modulation.

0

0

0

0

0:59

05/01/2021

Intra-Class Part Swapping for Fine-Grained Image Classification

Lianbo Zhang, Shaoli Huang, Wei Liu

Keywords Paper

0

0

0

0

4:43

03/05/2021

$i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

Kibok Lee, Yian Zhu, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin, Honglak Lee

Keywords Paper

self-supervised learning, unsupervised representation learning, data augmentation, MixUp, contrastive representation learning

0

0

0

0

5:04

07/09/2020

Unified Representation Learning for Cross Model Compatibility

Chien-Yi Wang, Ya-Liang Chang, Shang-Ta Yang and
Dong Chen, Shang-Hong Lai

Keywords Paper

representation learning, metric learning, face recognition, person re-identification, model compatibility, open-set recognition

0

0

0

0

3:14

03/05/2021

What Should Not Be Contrastive in Contrastive Learning

Tete Xiao, Xiaolong Wang, Alyosha Efros, trevor darrell

Keywords Paper

Representation learning, Contrastive learning, Self-supervised learning

0

0

0

0

4:56

06/12/2021

SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning

Talip Ucar, Ehsan Hajiramezanali, Lindsay Edwards

Keywords Paper

self-supervised learning, contrastive learning, representation learning

0

0

0

0

13:28

14/06/2020

Better Captioning With Sequence-Level Exploration

Jia Chen, Qin Jin

Keywords Paper

caption, sequece-level, diversity, precision

0

0

0

0

0:57

06/12/2021

Object-aware Contrastive Learning for Debiased Scene Representation

Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin

Keywords Paper

self-supervised learning, contrastive learning, representation learning

0

0

0

0

10:25

14/06/2020

Few-Shot Learning via Embedding Adaptation With Set-to-Set Functions

Han-Jia Ye, Hexiang Hu, De-Chuan Zhan, Fei Sha

Keywords Paper

few-shot learning, meta-learning, embedding learning, embedding adaptation, set-to-set

0

0

0

0

1:04

04/07/2020

Cross-Modality Relevance for Reasoning on Language and Vision

Chen Zheng, Quan Guo, Parisa Kordjamshidi

Keywords Paper

Cross-Modality Relevance, Language Vision, visual answering, VQA

0

0

0

0

10:59

14/06/2020

Multi-Mutual Consistency Induced Transfer Subspace Learning for Human Motion Segmentation

Tao Zhou, Huazhu Fu, Chen Gong and
Jianbing Shen, Ling Shao, Fatih Porikli

Keywords Paper

human motion segmentation, transfer subspace learning, multi-level features, multi-mutual consistency learning.

0

0

0

0

1:00

06/12/2021

Designing Counterfactual Generators using Deep Model Inversion

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan and
Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords Paper

optimization, representation learning, interpretability

0

0

0

0

13:28

14/06/2020

Hierarchically Robust Representation Learning

Qi Qian, Juhua Hu, Hao Li

Keywords Paper

representation learning, hierarchical robustness

0

0

0

0

1:01

19/04/2021

Meta-learning for effective multi-task and multilingual modelling

Ishan Tarunesh, Sushil Khyalia, Vishwajeet Kumar and
Ganesh Ramakrishnan, Preethi Jyothi

Keywords Paper

0

0

0

0

8:19

14/06/2020

JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection

Keren Fu, Deng-Ping Fan, Ge-Peng Ji, Qijun Zhao

Keywords Paper

visual saliency, salient object detection, rgb-d, depth information, joint learning, dense connections, multi-modal features, feature fusion, deep learning, encoder-decoder

0

0

0

0

1:01

06/12/2021

Variational Multi-Task Learning with Gumbel-Softmax Priors

Jiayi Shen, Xiantong Zhen, Marcel Worring, Ling Shao

Keywords Paper

machine learning, generative model

0

0

0

0

13:09

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Paper

0

0

0

0

19:54

07/09/2020

Conditional Attention for Content-based Image Retrieval

Zechao Hu, Adrian Bors

Keywords Paper

content based image retrieval, conditional attention

0

0

0

0

6:40

03/05/2021

Exploring Balanced Feature Spaces for Representation Learning

Bingyi Kang, Yu Li, Sain Xie and
Zehuan Yuan, Jiashi Feng

Keywords Paper

Representation Learning, Contrastive Learning, Long-Tailed Recognition

0

0

0

0

7:18

14/06/2020

Transformation GAN for Unsupervised Image Synthesis and Representation Learning

Jiayu Wang, Wengang Zhou, Guo-Jun Qi and
Zhongqian Fu, Qi Tian, Houqiang Li

Keywords Paper

gan, unsupervised learning, representation learning

0

0

0

0

1:00

06/12/2021

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc V Gool

Keywords Paper

self-supervised learning, vision, contrastive learning, representation learning

0

0

0

0

13:32