The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

22/11/2021

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Keywords: fine-grained recognition, weakly-supervised recognition, fine-grained retrieval, unsupervised recognition, image-to-text retrieval, text-to-image retrieval, image classification

Abstract: Most of us are not experts in specific fields, such as ornithology. Nonetheless, we do have general image and language understanding capabilities that we use to match what we see to expert resources. This allows us to expand our knowledge and perform novel tasks without ad-hoc external supervision. On the contrary, machines have a much harder time consulting expert-curated knowledge bases unless trained specifically with that knowledge in mind. Thus, in this paper we consider a new problem: fine-grained image recognition without expert annotations, which we address by leveraging the vast knowledge available in web encyclopedias. First, we learn a model to describe the visual appearance of objects using non-expert image descriptions. We then train a fine-grained textual similarity model that matches image descriptions with documents on a sentence-level basis. We evaluate the method on two datasets and compare with several strong baselines and the state of the art in cross-modal retrieval. Code is available at: https://github.com/subhc/clever.
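
The sentence-level matching step described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: a simple bag-of-words cosine similarity stands in for the trained textual similarity model, and scoring a document by its best-matching sentence (max pooling) is an assumption made here for illustration. All function names and the toy data are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def split_sentences(document):
    """Naive sentence splitter on '.', '!' and '?' (an assumption of this sketch)."""
    return [s.strip() for s in re.split(r"[.!?]+", document) if s.strip()]


def score_document(description, document):
    """Score a document by its best-matching sentence (max pooling is assumed)."""
    query = Counter(tokenize(description))
    scores = (cosine(query, Counter(tokenize(s))) for s in split_sentences(document))
    return max(scores, default=0.0)


def rank_documents(description, documents):
    """Return document names sorted from best to worst match."""
    return sorted(
        documents,
        key=lambda name: score_document(description, documents[name]),
        reverse=True,
    )


# Hypothetical toy data, not taken from the paper or its datasets.
documents = {
    "northern cardinal": "The male is bright red with a black face mask. It has a prominent crest.",
    "blue jay": "The plumage is blue with a white chest. It has a black collar around the neck.",
}
description = "a red bird with a black mask and a crest"
ranking = rank_documents(description, documents)
print(ranking)
```

In this toy setup the description plays the role of a generated non-expert image description, and each dictionary entry plays the role of an encyclopedia article for one class. A learned similarity model would replace `cosine`; the surrounding ranking logic would stay the same.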

The talk and the corresponding paper were published at the BMVC 2021 virtual conference.

Similar Papers

14/06/2020

Webly Supervised Knowledge Embedding Model for Visual Reasoning

Wenbo Zheng, Lan Yan, Chao Gou, Fei-Yue Wang

Keywords: visual reasoning, webly supervised learning

1:01

08/12/2020

Emergent Communication Pretraining for Few-Shot Machine Translation

Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen

14:42

19/04/2021

On the (in)effectiveness of images for text classification

Chunpeng Ma, Aili Shen, Hiyori Yoshikawa, Tomoya Iwakura, Daniel Beck, Timothy Baldwin

6:15

26/04/2020

Variational Template Machine for Data-to-Text Generation

Rong Ye, Wenxian Shi, Hao Zhou, Zhongyu Wei, Lei Li

4:55

04/07/2020

Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting

Po-Yao Huang, Junjie Hu, Xiaojun Chang, Alexander Hauptmann

Keywords: Unsupervised Translation, Unsupervised MT, MT, alignment

12:17

06/12/2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Weili Nie, Zhiding Yu, Lei Mao, Ankit Patel, Yuke Zhu, Anima Anandkumar

3:23

03/05/2021

Concept Learners for Few-Shot Learning

Kaidi Cao, Maria Brbic, Jure Leskovec

Keywords: few-shot learning, meta learning

4:55

04/07/2020

Unsupervised Domain Clusters in Pretrained Language Models

Roee Aharoni, Yoav Goldberg

Keywords: NLP, data-driven domains, neural translation, Unsupervised Clusters

11:55

03/05/2021

Viewmaker Networks: Learning Views for Unsupervised Representation Learning

Alex Tamkin, Mike Wu, Noah Goodman

Keywords: representation learning, self-supervised, views, contrastive learning, unsupervised learning, data augmentation

5:03

19/01/2020

Visualization by Example

Chenglong Wang, Yu Feng, Rastislav Bodik, Alvin Cheung, Isil Dillig

Keywords: Program Synthesis, Data Visualization

20:36

16/11/2020

What Does My QA Model Know? Devising Controlled Probes using Expert

Kyle Richardson, Ashish Sabharwal

Keywords: knowledge challenges, benchmark tasks, diagnostic tasks, taxonomic reasoning

12:16

06/12/2020

Unsupervised Translation of Programming Languages

Baptiste Roziere, Marie-Anne Lachaux, Lowik Chanussot, Guillaume Lample

3:17

04/07/2020

Compositional Generalization by Factorizing Alignment and Translation

Jacob Russin, Jason Jo, Randall O'Reilly, Yoshua Bengio

Keywords: Compositional Generalization, Translation, natural processing, cognitive science

10:37

02/02/2021

KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning

Ye Liu, Yao Wan, Lifang He, Hao Peng, Philip S. Yu

17:52

16/11/2020

Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start

Wenpeng Yin, Nazneen Fatema Rajani, Dragomir Radev, Richard Socher, Caiming Xiong

Keywords: nlp problems, textual entailment, nlp task, downstream tasks

12:08

05/01/2021

Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization

Sadbhavana Babar, Sukhendu Das

5:01

06/12/2020

Network-to-Network Translation with Conditional Invertible Neural Networks

Robin Rombach, Patrick Esser, Bjorn Ommer

3:25

14/06/2020

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension

Zhenfang Chen, Peng Wang, Lin Ma, Kwan-Yee K. Wong, Qi Wu

Keywords: compositional referring expression comprehension, visual reasoning

1:00

08/12/2020

TableGPT: Few-shot Table-to-Text Generation with Table Structure Reconstruction and Content Matching

Heng Gong, Yawei Sun, Xiaocheng Feng, Bing Qin, Wei Bi, Xiaojiang Liu, Ting Liu

8:45

04/07/2020

Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen

Yixin Cao, Ruihao Shui, Liangming Pan, Min-Yen Kan, Zhiyuan Liu, Tat-Seng Chua

Keywords: Expertise Transfer, style transfer, text simplification, automatic evaluation

13:37

02/02/2021

HMS: A Hierarchical Solver with Dependency-Enhanced Understanding for Math Word Problem

Xin Lin, Zhenya Huang, Hongke Zhao, Enhong Chen, Qi Liu, Hao Wang, Shijin Wang

18:01

18/07/2021

Reasoning Over Virtual Knowledge Bases With Open Predicate Relations

Haitian Sun, Patrick Verga, Bhuwan Dhingra, Russ Salakhutdinov, William Cohen

Keywords: Applications, Natural Language Processing

4:55

12/07/2020

Data-Efficient Image Recognition with Contrastive Predictive Coding

Olivier Henaff

Keywords: Unsupervised and Semi-Supervised Learning

14:17

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su

Keywords: bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

11:49

19/10/2020

Feature extraction for large-scale text collections

Luke Gallagher, Antonio Mallia, J. Shane Culpepper, Torsten Suel, B. Barla Cambazoglu

Keywords: clueweb, feature index, feature extraction, feature repository, lambdamart, ltr, learning to rank, feature importance

9:41

22/11/2021

Text-Based Person Search with Limited Data

Xiao Han, Sen He, Li Zhang, Tao Xiang

Keywords: person re-identification, cross-modal image retrieval, fine-grained image retrieval, text-based person search

3:04

26/04/2020

Meta-Learning without Memorization

Mingzhang Yin, George Tucker, Mingyuan Zhou, Sergey Levine, Chelsea Finn

Keywords: meta-learning, memorization, regularization, overfitting, mutually-exclusive

5:09

20/08/2020

The Simple Essence of Algebraic Subtyping: Principal Type Inference with Subtyping Made Easy (Functional Pearl)

Lionel Parreaux

Keywords: subtyping, principal types, type inference

14:39

07/09/2020

From Saturation to Zero-Shot Visual Relationship Detection Using Local Context

Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos

Keywords: Visual Relationship Detection, Scene Graph Generation, Zero-shot Classification, Local Context, Language Bias

7:17

04/07/2020

LINSPECTOR: Multilingual Probing Tasks for Word Representations

Gözde Gül Sahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych

Keywords: Word Representations, NLP, classification tasks, probing tasks

11:51

14/06/2020

VQA With No Questions-Answers Training

Ben-Zion Vatashsky, Shimon Ullman

Keywords: visual question answering, vqa, vision and language, question understanding, visual reasoning, compositionality, explainable models, domain adaptation, zero-shot learning, abstract procedure

1:01

03/05/2021

Iterated learning for emergent systematicity in VQA

Ankit Vani, Max Schwarzer, Yuchen Lu, Eeshan Dhekane, Aaron Courville

Keywords: clevr, vqa, shapes, neural module network, cultural transmission, iterated learning, visual question answering, systematic generalization, compositionality

15:10

14/06/2020

The GAN That Warped: Semantic Attribute Editing With Unpaired Data

Garoe Dorta, Sara Vicente, Neill D. F. Campbell, Ivor J. A. Simpson

Keywords: image editing, warping, high resolution, unpaired data, deep neural networks

1:01

04/07/2020

Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage

Ashish V. Thapliyal, Radu Soricut

Keywords: Cross-modal Generation, Web-scale Coverage, Cross-modal tasks, Pivot Stabilization

11:43

03/05/2021

Learning Task-General Representations with Generative Neuro-Symbolic Modeling

Reuben Feinman, Brenden Lake

Keywords: probabilistic programs, neuro-symbolic models, few-shot concept learning, generative models

6:13

02/02/2021

What's the Best Place for an AI Conference, Vancouver or _______: Why Completing Comparative Questions is Difficult

Avishai Zagoury, Einat Minkov, Idan Szpektor, William W. Cohen

15:15

30/11/2020

A cost-effective method for improving and re-purposing large, pre-trained GANs by fine-tuning their class-embeddings

Qi Li, Long Mai, Michael A. Alcorn, Anh Nguyen

7:54

14/06/2020

OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold

Mohamed Yousef, Tom E. Bishop

Keywords: text recognition, weakly supervised, handwriting recognition, convolutional neural network fully convolutional, ctc

1:00

18/07/2021

A large-scale benchmark for few-shot program induction and synthesis

Ferran Alet, Javier Lopez-Contreras, James Koppel, Maxwell Nye, Armando Solar-Lezama, Tomas Lozano-Perez, Leslie Kaelbling, Josh Tenenbaum

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

5:07

04/07/2020

INFOTABS: Inference on Tables as Semi-structured Data

Vivek Gupta, Maitrey Mehta, Pegah Nokhiz, Vivek Srikumar

Keywords: INFOTABS, complex reasoning, modeling strategies, meaning fragments

11:38