On the (in)effectiveness of images for text classification

19/04/2021

Chunpeng Ma, Aili Shen, Hiyori Yoshikawa, Tomoya Iwakura, Daniel Beck, Timothy Baldwin

Abstract: Images are core components of multi-modal learning in natural language processing (NLP), and results have varied substantially as to whether images improve NLP tasks or not. One confounding effect has been that previous NLP research has generally focused on sophisticated tasks (in varying settings), generally applied to English only. We focus on text classification, in the context of assigning named entity classes to a given Wikipedia page, where images generally complement the text and the Wikipedia page can be in one of a number of different languages. Our experiments across a range of languages show that images complement NLP models (including BERT) trained without external pre-training, but when combined with BERT models pre-trained on large-scale external data, images contribute nothing.

This is an embedded video. The talk and the corresponding paper were published at the EACL 2021 virtual conference.

Similar Papers

22/11/2021

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Keywords: fine-grained recognition, weakly-supervised recognition, fine-grained retrieval, unsupervised recognition, image-to-text retrieval, text-to-image retrieval, image classification

8:53

08/12/2020

Emergent Communication Pretraining for Few-Shot Machine Translation

Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen

14:42

16/11/2020

Unsupervised Natural Language Inference via Decoupled Multimodal Contrastive Learning

Wanyun Cui, Guangyu Zheng, Wei Wang

Keywords: natural problem, plain inference, task-agnostic pretraining, multimodal learning

11:25

16/11/2020

New Protocols and Negative Results for Textual Entailment Data Collection

Samuel R. Bowman, Jennimaria Palomaki, Livio Baldini Soares, Emily Pitler

Keywords: benchmarking, language understanding, transfer applications, crowdsourcing protocol

12:27

18/07/2021

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

Chao Jia, Yinfei Yang, Ye Xia, Yi-Ting Chen, Zarana Parekh, Hieu Pham, Quoc Le, Yun-Hsuan Sung, Zhen Li, Tom Duerig

Keywords: Deep Learning, Embedding and Representation learning

21:03

12/07/2020

On Variational Learning of Controllable Representations for Text without Supervision

Peng Xu, Jackie Chi Kit Cheung, Yanshuai Cao

Keywords: Representation Learning

14:51

22/11/2021

Text-Based Person Search with Limited Data

Xiao Han, Sen He, Li Zhang, Tao Xiang

Keywords: person re-identification, cross-modal image retrieval, fine-grained image retrieval, text-based person search

3:04

07/09/2020

From Saturation to Zero-Shot Visual Relationship Detection Using Local Context

Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos

Keywords: Visual Relationship Detection, Scene Graph Generation, Zero-shot Classification, Local Context, Language Bias

7:17

04/07/2020

Words Aren't Enough, Their Order Matters: On the Robustness of Grounding Visual Referring Expressions

Arjun Akula, Spandana Gella, Yaser Al-Onaizan, Song-Chun Zhu, Siva Reddy

Keywords: Robustness Expressions, Grounding Expressions, Visual recognition, natural understanding

6:53

05/01/2021

Multimodal Prototypical Networks for Few-Shot Learning

Frederik Pahde, Mihai Puscas, Tassilo Klein, Moin Nabi

4:56

06/12/2021

ReSSL: Relational Self-Supervised Learning with Weak Augmentation

Mingkai Zheng, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu

Keywords: self-supervised learning, contrastive learning

6:35

14/06/2020

Domain Decluttering: Simplifying Images to Mitigate Synthetic-Real Domain Shift and Improve Depth Estimation

Yunhan Zhao, Shu Kong, Daeyun Shin, Charless Fowlkes

Keywords: monocular depth prediction, real-synthetic domain shift, synthetic training data, domain adaptation, image inpainting, high-level domain gaps

1:01

14/06/2020

Webly Supervised Knowledge Embedding Model for Visual Reasoning

Wenbo Zheng, Lan Yan, Chao Gou, Fei-Yue Wang

Keywords: visual reasoning, webly supervised learning

1:01

19/08/2021

Few-shot Neural Human Performance Rendering from Sparse RGBD Videos

Anqi Pang, Xin Chen, Haimin Luo, Minye Wu, Jingyi Yu, Lan Xu

Keywords: Computer Vision, 2D and 3D Computer Vision, Biometrics, Face and Gesture Recognition, Motion and Tracking

11:02

12/07/2020

Data-Efficient Image Recognition with Contrastive Predictive Coding

Olivier Henaff

Keywords: Unsupervised and Semi-Supervised Learning

14:17

04/07/2020

Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting

Po-Yao Huang, Junjie Hu, Xiaojun Chang, Alexander Hauptmann

Keywords: Unsupervised Translation, Unsupervised MT, MT, alignment

12:17

14/06/2020

Unsupervised Learning for Intrinsic Image Decomposition From a Single Image

Yunfei Liu, Yu Li, Shaodi You, Feng Lu

Keywords: intrinsic image decomposition, unsupervised learning, distribution, priors, independence constraint, physical consistency constraint

1:00

16/11/2020

SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness

Nathan Ng, Kyunghyun Cho, Marzyeh Ghassemi

Keywords: data augmentation, ood generalization, robustness benchmarks, ssmba

10:26

04/07/2020

LINSPECTOR: Multilingual Probing Tasks for Word Representations

Gözde Gül Sahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych

Keywords: Word Representations, NLP, classification tasks, probing tasks

11:51

03/05/2021

Viewmaker Networks: Learning Views for Unsupervised Representation Learning

Alex Tamkin, Mike Wu, Noah Goodman

Keywords: representation learning, self-supervised, views, contrastive learning, unsupervised learning, data augmentation

5:03

06/12/2021

CLIP-It! Language-Guided Video Summarization

Medhini Narasimhan, Anna Rohrbach, Trevor Darrell

Keywords: transformers

6:14

05/01/2021

Hyperrealistic Image Inpainting With Hypergraphs

Gourav Wadhwa, Abhinav Dhall, Subrahmanyam Murala, Usman Tariq

3:51

14/06/2020

A Disentangling Invertible Interpretation Network for Explaining Latent Representations

Patrick Esser, Robin Rombach, Björn Ommer

Keywords: interpretability, inn, disentangling, generative models, invertible neural networks, autoencoders, normalizing flows, vae, explainable, xai

1:01

02/02/2021

Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning

Jiannan Ge, Hongtao Xie, Shaobo Min, Yongdong Zhang

16:22

02/02/2021

Non-Autoregressive Coarse-to-Fine Video Captioning

Bang Yang, Yuexian Zou, Fenglin Liu, Can Zhang

18:21

06/12/2020

Improving Natural Language Processing Tasks with Human Gaze-Guided Neural Attention

Ekta Sood, Simon Tannert, Philipp Mueller, Andreas Bulling

2:56

02/02/2021

A Continual Learning Framework for Uncertainty-Aware Interactive Image Segmentation

Ervine Zheng, Qi Yu, Rui Li, Pengcheng Shi, Anne Haake

14:21

08/12/2020

comp-syn: Perceptually Grounded Word Embeddings with Color

Bhargav Srinivasa Desikan, Tasker Hull, Ethan Nadler, Douglas Guilbeault, Aabir Abubakar Kar, Mark Chu, Donald Ruggiero Lo Sardo

9:37

14/06/2020

CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement

Ho Kei Cheng, Jihoon Chung, Yu-Wing Tai, Chi-Keung Tang

Keywords: segmentation refinement, high-resolution, 4k, semantic segmentation, scene parsing

1:01

14/06/2020

Instance-Aware Image Colorization

Jheng-Wei Su, Hung-Kuo Chu, Jia-Bin Huang

Keywords: colorization, instance-aware, deep learning, computer vision

1:01

02/02/2021

RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER

Lin Sun, Jiquan Wang, Kai Zhang, Yindu Su, Fangsheng Weng

17:21

14/06/2020

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension

Zhenfang Chen, Peng Wang, Lin Ma, Kwan-Yee K. Wong, Qi Wu

Keywords: compositional referring expression comprehension, visual reasoning

1:00

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords: Natural Inference, data augmentation, Robustifying Models, deep models

11:34

06/12/2020

Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

Nanbo Li, Cian Eastwood, Robert Fisher

3:19

30/11/2020

COG: COnsistent data auGmentation for object perception

Zewen He, Rui Wu, Dingqian Zhang

5:16

18/07/2021

Learning Transferable Visual Models From Natural Language Supervision

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

19:40

04/07/2020

Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage

Ashish V. Thapliyal, Radu Soricut

Keywords: Cross-modal Generation, Web-scale Coverage, Cross-modal tasks, Pivot Stabilization

11:43

26/04/2020

On the "steerability" of generative adversarial networks

Ali Jahanian, Lucy Chai, Phillip Isola

Keywords: generative adversarial network, latent space interpolation, dataset bias, model generalization

4:47

16/11/2020

An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels

Ilias Chalkidis, Manos Fergadiotis, Sotiris Kotitsas, Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos

Keywords: flat classification, hierarchical approaches, zero-shot learning, few learning

12:21

19/08/2021

Explaining Self-Supervised Image Representations with Visual Probing

Dominika Basaj, Witold Oleszkiewicz, Igor Sieradzki, Michał Górszczak, Barbara Rychalska, Tomasz Trzcinski, Bartosz Zieliński

Keywords: Computer Vision, Language and Vision, Unsupervised Learning, Explainability

11:03