PhraseCut: Language-Based Image Segmentation in the Wild

14/06/2020

PhraseCut: Language-Based Image Segmentation in the Wild

Chenyun Wu, Zhe Lin, Scott Cohen, Trung Bui, Subhransu Maji

Keywords: visual grounding, referring expressions, dataset, vision and language, segmentation

Abstract Paper Similar Papers

Abstract: We consider the problem of segmenting image regions given a natural language phrase, and study it on a novel dataset of 77,262 images and 345,486 phrase-region pairs. Our dataset is collected on top of the Visual Genome dataset and uses the existing annotations to generate a challenging set of referring phrases for which the corresponding regions are manually annotated. Phrases in our dataset correspond to multiple regions and describe a large number of object and stuff categories as well as their attributes such as color, shape, parts, and relationships with other entities in the image. Our experiments show that the scale and diversity of concepts in our dataset poses significant challenges to the existing state-of-the-art. We systematically handle the long-tail nature of these concepts and present a modular approach to combine category, attribute, and relationship cues that outperforms existing approaches.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

08/12/2020

Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case

Adam Dahlgren Lindström, Johanna Björklund, Suna Bensch, Frank Drewes

Keywords Paper

0

0

0

0

14:20

02/02/2021

Object Relation Attention for Image Paragraph Captioning

Li-Chuan Yang, Chih-Yuan Yang, Jane Yung-jen Hsu

Keywords Paper

0

0

0

0

15:03

30/11/2020

Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose

Ardhendu Behera, Zachary Wharton, Pradeep Hewage, Swagat Kumar

Keywords Paper

0

0

0

0

10:19

02/02/2021

Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax

Hao Fei, Fei Li, Bobo Li, Donghong Ji

Keywords Paper

0

0

0

0

16:10

04/07/2020

Paraphrase-Sense-Tagged Sentences

Anne Cocos, Chris Callison-Burch

Keywords Paper

natural tasks, ranking sentences, hypernym prediction, sense-aware models

0

0

0

0

9:29

14/06/2020

Referring Image Segmentation via Cross-Modal Progressive Comprehension

Shaofei Huang, Tianrui Hui, Si Liu and
Guanbin Li, Yunchao Wei, Jizhong Han, Luoqi Liu, Bo Li

Keywords Paper

referring segmentation, progressive comprehension, cross-modal, entity perception, relation-aware reasoning

0

0

0

0

1:01

30/11/2020

Visual Tracking by TridentAlign and Context Embedding

Janghoon Choi, Junseok Kwon, Kyoung Mu Lee

Keywords Paper

0

0

0

0

7:41

02/02/2021

Region-aware Global Context Modeling for Automatic Nerve Segmentation from Ultrasound Images

Huisi Wu, Jiasheng Liu, Wei Wang and
Zhenkun Wen, Jing Qin

Keywords Paper

0

0

0

0

15:15

08/12/2020

Modeling language evolution and feature dynamics in a realistic geographic environment

Rhea Kapur, Phillip Rogers

Keywords Paper

0

0

0

0

15:00

08/12/2020

What Meaning-Form Correlation Has to Compose With: A Study of MFC on Artificial and Natural Language

Timothee Mickus, Timothée Bernard, Denis Paperno

Keywords Paper

0

0

0

0

16:01

06/12/2021

Mixed Supervised Object Detection by Transferring Mask Prior and Semantic Similarity

Yan Liu, Zhijie Zhang, Li Niu and
Junjie Chen, Liqing Zhang

Keywords Paper

vision, transfer learning

0

0

0

0

9:11

16/11/2020

Interpretable Multi-dataset Evaluation for Named Entity Recognition

Jinlan Fu, Pengfei Liu, Graham Neubig

Keywords Paper

natural tasks, interpretable evaluation, named task, analysis tool

0

0

0

0

11:11

14/06/2020

Image Search With Text Feedback by Visiolinguistic Attention Learning

Yanbei Chen, Shaogang Gong, Loris Bazzani

Keywords Paper

vision and language, image search, text feedback, attention mechanism, transformer, multimodal learning, representation learning, composition, image retrieval, interactive image search

0

0

0

0

1:00

19/04/2021

Interpretability for morphological inflection: From character-level predictions to subword-level rules

Tatyana Ruzsics, Olga Sozinova, Ximena Gutierrez-Vasques, Tanja Samardzic

Keywords Paper

0

0

0

0

10:53

14/06/2020

Cross-Domain Document Object Detection: Benchmark Suite and Method

Kai Li, Curtis Wigington, Chris Tensmeyer and
Handong Zhao, Nikolaos Barmpalios, Vlad I. Morariu, Varun Manjunatha, Tong Sun, Yun Fu

Keywords Paper

document object detection, cross-domain object detection, evaluation benchmark

0

0

0

0

1:01

16/11/2020

What Does My QA Model Know? Devising Controlled Probes using Expert

Kyle Richardson, Ashish Sabharwal

Keywords Paper

knowledge challenges, benchmark tasks, diagnostic tasks, taxonomic reasoning

0

0

0

0

12:16

14/06/2020

Composed Query Image Retrieval Using Locally Bounded Features

Mehrdad Hosseinzadeh, Yang Wang

Keywords Paper

image retrieval, composed query, multi modal learning, self attention, cross modal attention

0

0

0

0

1:01

16/11/2020

QADiscourse - Discourse Relations as QA Pairs: Representation, Crowdsourcing and Baselines

Valentina Pyatkin, Ayal Klein, Reut Tsarfaty, Ido Dagan

Keywords Paper

natural understanding, predicting relations, discourse relations, question-and-answer pairs

0

0

0

0

11:22

16/11/2020

Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements

Yang Li, Gang Li, Luheng He and
Jingjie Zheng, Hong Li, Zhiwei Guan

Keywords Paper

mobile uis, automatically descriptions, widget captioning, multimodal task

0

0

0

0

11:16

07/09/2020

Advancing weakly supervised cross-domain alignment with optimal transport

Siyang Yuan, Ke Bai, Liqun Chen and
Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin Duke

Keywords Paper

Optimal Transport, Cross Domain Alignment

0

0

0

0

10:04

30/11/2020

RE-Net: A Relation Embedded Deep Model for AU Occurrence and Intensity Estimation

Huiyuan Yang, Lijun Yin

Keywords Paper

0

0

0

0

7:58

16/11/2020

Position-Aware Tagging for Aspect Sentiment Triplet Extraction

Lu Xu, Hao Li, Wei Lu, Lidong Bing

Keywords Paper

aspect extraction, triplet process, aste, pipeline approaches

0

0

0

0

11:46

14/06/2020

Dynamic Refinement Network for Oriented and Densely Packed Object Detection

Xingjia Pan, Yuqiang Ren, Kekai Sheng and
Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

Keywords Paper

object detection, oriented, densely packed, sku110k, feature selection, dynamic, anchor-free

0

0

0

0

5:01

19/08/2021

Generating Senses and RoLes: An End-to-End Model for Dependency- and Span-based Semantic Role Labeling

Rexhina Blloshmi, Simone Conia, Rocco Tripodi, Roberto Navigli

Keywords Paper

Natural Language Processing, Natural Language Semantics, Natural Language Generation, Natural Language Processing

0

0

0

0

15:18

06/12/2020

RANet: Region Attention Network for Semantic Segmentation

Dingguo Shen, Yuanfeng Ji, Ping Li and
Yi Wang, Di Lin

Keywords Paper

0

0

0

0

3:13

06/12/2021

SILG: The Multi-domain Symbolic Interactive Language Grounding Benchmark

Victor Zhong, Austin W. Hanjie, Sida Wang and
Karthik Narasimhan, Luke Zettlemoyer

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:52

14/06/2020

Vec2Face: Unveil Human Faces From Their Blackbox Features in Face Recognition

Chi Nhan Duong, Thanh-Dat Truong, Khoa Luu and
Kha Gia Quach, Hung Bui, Kaushik Roy

Keywords Paper

generative models, bijective metric learning, blackbox face matcher, distillation framework, face synthesis, id preservation, feature-conditional structure, feature reconstruction, dibigan.

0

0

0

0

5:03

04/07/2020

Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview

Deven Santosh Shah, H. Andrew Schwartz, Dirk Hovy

Keywords Paper

NLP, Natural Models, Conceptual Framework, mitigation techniques

0

0

0

0

11:52

19/08/2021

Multi-Hop Fact Checking of Political Claims

Wojciech Ostrowski, Arnav Arora, Pepa Atanasova, Isabelle Augenstein

Keywords Paper

Natural Language Processing, NLP Applications and Tools, Resources and Evaluation, Text Classification

0

0

0

0

12:47

14/06/2020

Learning Saliency Propagation for Semi-Supervised Instance Segmentation

Yanzhao Zhou, Xin Wang, Jianbin Jiao and
Trevor Darrell, Fisher Yu

Keywords Paper

semi-supervised, instance segmentation, saliency, propagation, message passing, multiple instance learning, partial-supervised, generalization

0

0

0

0

1:01

05/01/2021

Are These From the Same Place? Seeing the Unseen in Cross-View Image Geo-Localization

Royston Rodrigues, Masahiro Tani

Keywords Paper

0

0

0

0

4:57

14/06/2020

Graph-Structured Referring Expression Reasoning in the Wild

Sibei Yang, Guanbin Li, Yizhou Yu

Keywords Paper

graph-structured reasoning, ref-reasoning dataset, referring expression reasoning, scene graph, neural module, visual grounding, grounding referring expressions

0

0

0

0

4:58

13/04/2021

Learning bijective feature maps for linear ICA

Alexander Camuto, Matthew Willetts, Chris Holmes and
Brooks Paige, Stephen Roberts

Keywords Paper

0

0

0

0

3:02

07/09/2020

Multimodal Image Translation with Stochastic Style Representations and Mutual Information Loss

Sanghyeon Na, Seungjoo Yoo, Jaegul Choo

Keywords Paper

image-to-image translation, generative adversarial network

0

0

0

0

9:52

01/07/2020

Improving Slot Filling by Utilizing Contextual Information

Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

Keywords Paper

0

0

0

0

14:11

04/07/2020

Multi-Domain Named Entity Recognition with Genre-Aware and Agnostic Inference

Jing Wang, Mayank Kulkarni, Daniel Preotiuc-Pietro

Keywords Paper

Multi-Domain Recognition, Named recognition, domain models, NER

0

0

0

0

11:46

25/04/2020

Tempura: Query Analysis with Structural Templates

Tongshuang Wu, Kanit Wongsuphasawat, Donghao Ren and
Kayur Patel, Chris DuBois

Keywords Paper

natural language processing, error analysis, query analysis

0

0

0

0

15:05

16/11/2020

ENT-DESC: Entity Description Generation by Exploring Knowledge Graph

Liying Cheng, Dekun Wu, Lidong Bing and
Yan Zhang, Zhanming Jie, Wei Lu, Luo Si

Keywords Paper

knowledge-to-text generation, information loss, kg, graph-to-sequence models

0

0

0

0

11:03

16/11/2020

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

Keywords Paper

document extraction, semantic task, image resolution, structure extraction

0

0

0

0

11:26

16/11/2020

T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack

Boxin Wang, Hengzhi Pei, Boyuan Pan and
Qian Chen, Shuohang Wang, Bo Li

Keywords Paper

adversarial generation, nlp tasks, sentiment analysis, qa

0

0

0

0

11:59