Reinforced active learning for image segmentation

26/04/2020

Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher J. Pal

Keywords: semantic segmentation, active learning, reinforcement learning

Abstract: Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and time-consuming. Second, realistic segmentation datasets are highly unbalanced: some categories are much more abundant than others, biasing performance towards the most represented ones. In this paper, we focus human labelling effort on a small subset of a larger pool of data, minimizing this effort while maximizing the performance of a segmentation model on a hold-out set. We present a new active learning strategy for semantic segmentation based on deep reinforcement learning (RL). An agent learns a policy to select a subset of small, informative image regions, as opposed to entire images, to be labeled from a pool of unlabeled data. The region selection decision is based on the predictions and uncertainties of the segmentation model being trained. Our method proposes a new modification of the deep Q-network (DQN) formulation for active learning, adapting it to the large-scale nature of semantic segmentation problems. We test the proof of concept on CamVid and provide results on the large-scale dataset Cityscapes. On Cityscapes, our deep RL region-based DQN approach requires roughly 30% less additional labeled data than our most competitive baseline to reach the same performance. Moreover, we find that our method asks for more labels of under-represented categories compared to the baselines, improving their performance and helping to mitigate class imbalance.
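
To make the idea of region-level selection concrete, the following minimal sketch scores non-overlapping image tiles by the mean entropy of the segmentation model's per-pixel class probabilities and requests labels for the most uncertain tiles. This is not the learned DQN policy described in the abstract; it is a simple hand-crafted uncertainty heuristic of the kind such a policy could be compared against. All names (`pixel_entropy`, `region_scores`, `select_regions`), the tile size, and the toy data are assumptions made for illustration.

```python
import math
from typing import List, Tuple

# probs[row][col] holds the class-probability vector predicted for one pixel.
Probs = List[List[List[float]]]


def pixel_entropy(p: List[float]) -> float:
    """Shannon entropy (in nats) of one pixel's class distribution."""
    return -sum(q * math.log(q) for q in p if q > 0.0)


def region_scores(probs: Probs, region: int) -> List[Tuple[float, int, int]]:
    """Mean entropy of each non-overlapping region x region tile.

    Returns (score, top_row, left_col) tuples. Partial tiles at the
    image borders are skipped to keep the sketch short.
    """
    height, width = len(probs), len(probs[0])
    scores = []
    for top in range(0, height - region + 1, region):
        for left in range(0, width - region + 1, region):
            total = sum(
                pixel_entropy(probs[r][c])
                for r in range(top, top + region)
                for c in range(left, left + region)
            )
            scores.append((total / (region * region), top, left))
    return scores


def select_regions(probs: Probs, region: int, budget: int) -> List[Tuple[int, int]]:
    """Pick the `budget` most uncertain tiles, as (top_row, left_col)."""
    ranked = sorted(region_scores(probs, region), key=lambda s: s[0], reverse=True)
    return [(top, left) for _, top, left in ranked[:budget]]


# Toy 4x4 "image" with 2 classes: the model is confident everywhere
# except in the bottom-right 2x2 tile.
confident, unsure = [0.99, 0.01], [0.5, 0.5]
toy = [[unsure if r >= 2 and c >= 2 else confident for c in range(4)] for r in range(4)]
chosen = select_regions(toy, region=2, budget=1)
```

In this toy case the single tile selected is the bottom-right one, where every pixel has a uniform class distribution. A learned policy, as proposed in the paper, would replace the fixed entropy ranking with a selection rule trained to maximize downstream segmentation performance.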

This is an embedded video. The talk and the corresponding paper were published at the ICLR 2020 virtual conference.

Similar Papers

02/02/2021

Addressing Domain Gap via Content Invariant Representation for Semantic Segmentation

Li Gao, Lefei Zhang, Qian Zhang

16:16

06/12/2021

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning

Hanzhe Hu, Fangyun Wei, Han Hu, Qiwei Ye, Jinshi Cui, Liwei Wang

Keywords: vision, semi-supervised learning

11:53

02/02/2021

Consistency Regularization with High-dimensional Non-adversarial Source-guided Perturbation for Unsupervised Domain Adaptation in Segmentation

Kaihong Wang, Chenhongyi Yang, Margrit Betke

19:35

06/12/2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling

Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangyang Wang

Keywords: contrastive learning

10:12

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

15:15

22/09/2020

FISSA: Fusing item similarity models with self-attention networks for sequential recommendation

Jing Lin, Weike Pan, Zhong Ming

Keywords: Item Similarity Models, Sequential Recommendation, Gating Networks, Self-Attention

2:06

14/06/2020

Transferring and Regularizing Prediction for Semantic Segmentation

Yiheng Zhang, Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Dong Liu, Tao Mei

Keywords: semantic segmentation, domain adaptation, adversarial learning

0:58

03/05/2021

Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning

Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang

Keywords: Unsupervised Learning, Variational Autoencoders, Unsupervised Meta-learning, Meta-Learning

13:31

02/02/2021

Deep Metric Learning with Self-Supervised Ranking

Zheren Fu, Yan Li, Zhendong Mao, Quan Wang, Yongdong Zhang

12:36

14/06/2020

Steering Self-Supervised Feature Learning Beyond Local Pixel Statistics

Simon Jenni, Hailin Jin, Paolo Favaro

Keywords: self-supervised, representation learning, inpainting, unsupervised, feature learning, self-supervision, transformations, image statistics

5:01

19/08/2021

Dependent Multi-Task Learning with Causal Intervention for Image Captioning

Wenqing Chen, Jidong Tian, Caoyun Fan, Hao He, Yaohui Jin

Keywords: Machine Learning, Transfer, Adaptation, Multi-task Learning, Natural Language Generation, Language and Vision

12:02

03/05/2021

Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples

Ziang Yan, Yiwen Guo, Jian Liang, Changshui Zhang

Keywords: hard-label attack, adversarial attack, black-box attack, reinforcement learning

4:55

06/12/2020

Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

Guoliang Kang, Yunchao Wei, Yi Yang, Yueting Zhuang, Alexander Hauptmann

3:16

14/06/2020

Deep Semantic Clustering by Partition Confidence Maximisation

Jiabo Huang, Shaogang Gong, Xiatian Zhu

Keywords: deep clustering, cluster separability, separability measurement, semantic plausibility

1:00

02/02/2021

Bi-Classifier Determinacy Maximization for Unsupervised Domain Adaptation

Shuang Li, Fangrui Lv, Binhui Xie, Chi Harold Liu, Jian Liang, Chen Qin

14:07

18/07/2021

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Anima Anandkumar

Keywords: Reinforcement Learning and Planning, Multi-Agent RL

5:16

12/07/2020

Loss Function Search for Face Recognition

Xiaobo Wang, Shuo Wang, Shifeng Zhang, Cheng Chi, Tao Mei

Keywords: Applications - Computer Vision

12:35

14/06/2020

Selective Transfer With Reinforced Transfer Network for Partial Domain Adaptation

Zhihong Chen, Chao Chen, Zhaowei Cheng, Boyuan Jiang, Ke Fang, Xinyu Jin

Keywords: partial domain adaptation, selective transfer, pixel-level information, reconstruct error, reinforcement learning

1:01

06/12/2021

A Gaussian Process-Bayesian Bernoulli Mixture Model for Multi-Label Active Learning

Weishi Shi, Dayou Yu, Qi Yu

Keywords: machine learning, generative model, kernel methods, active learning

13:38

06/12/2021

Efficient Active Learning for Gaussian Process Classification by Error Reduction

Guang Zhao, Edward Dougherty, Byung-Jun Yoon, Francis Alexander, Xiaoning Qian

Keywords: optimization, machine learning, kernel methods, active learning

6:56

06/12/2020

Adversarial Style Mining for One-Shot Unsupervised Domain Adaptation

Yawei Luo, Ping Liu, Tao Guan, Junqing Yu, Yi Yang

3:22

06/12/2021

Local policy search with Bayesian optimization

Sarah Müller, Alexander von Rohr, Sebastian Trimpe

Keywords: theory, optimization, reinforcement learning and planning, active learning

11:42

02/02/2021

Spatial-temporal Causal Inference for Partial Image-to-video Adaptation

Jin Chen, Xinxiao Wu, Yao Hu, Jiebo Luo

20:01

30/11/2020

DEAL: Difficulty-aware Active Learning for Semantic Segmentation

Shuai Xie, Zunlei Feng, Ying Chen, Songtao Sun, Chao Ma, Mingli Song

9:41

30/11/2020

Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation

Siqi Yang, Lin Wu, Arnold Wiliem, Brian C. Lovell

6:32

19/08/2021

Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang, Jiale Zhou, Xuming He

Keywords: Computer Vision, Action Recognition, Deep Learning

6:20

14/06/2020

Prior Guided GAN Based Semantic Inpainting

Avisek Lahiri, Arnav Kumar Jain, Sanskar Agrawal, Pabitra Mitra, Prabir Kumar Biswas

Keywords: semantic inpainting, generative adversarial networks, video inpainting, facial keypoints, generative models

1:01

18/07/2021

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords: Theory, Statistical Learning Theory

5:06

14/06/2020

Towards Efficient Model Compression via Learned Global Ranking

Ting-Wu Chin, Ruizhou Ding, Cha Zhang, Diana Marculescu

Keywords: model compression, filter pruning, automl, image classification

5:03

07/09/2020

Multi-label Zero-shot Classification by Learning to Transfer from External Knowledge

He Huang, Wei Tang, Philip Yu, Yuanwei Chen, Wenhao Zheng, Qing-Guo Chen

Keywords: zero-shot learning, graph neural networks, multi-label classification

10:33

06/12/2021

See More for Scene: Pairwise Consistency Learning for Scene Classification

Gongwei Chen, Xinhang Song, Bohan Wang, Shuqiang Jiang

Keywords: deep learning, machine learning

9:15

22/11/2021

Siamese Prototypical Contrastive Learning

Shentong Mo, Zhun Sun, Chao Li

Keywords: self-supervised learning, contrastive learning, representation learning

2:50

06/12/2020

Sparse Graphical Memory for Robust Planning

Scott Emmons, Ajay Jain, Misha Laskin, Thanard Kurutach, Pieter Abbeel, Deepak Pathak

3:23

06/12/2020

Off-Policy Imitation Learning from Observations

Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

3:24

14/06/2020

Hierarchically Robust Representation Learning

Qi Qian, Juhua Hu, Hao Li

Keywords: representation learning, hierarchical robustness

1:01

18/07/2021

LAMDA: Label Matching Deep Domain Adaptation

Trung Le, Tuan Nguyen, Nhat Ho, Hung Bui, Dinh Phung

Keywords: Theory, Deep learning Theory

5:14

16/11/2020

DORB: Dynamically Optimizing Multiple Rewards with Bandits

Ramakanth Pasunuru, Han Guo, Mohit Bansal

Keywords: language tasks, optimization rewards, nlg tasks, question generation

11:34

26/08/2020

Discrete Action On-Policy Learning with Action-Value Critic

Yuguang Yue, Yunhao Tang, Mingzhang Yin, Mingyuan Zhou

14:23

05/01/2021

Multimodal Prototypical Networks for Few-Shot Learning

Frederik Pahde, Mihai Puscas, Tassilo Klein, Moin Nabi

4:56

13/04/2021

Comparing the value of labeled and unlabeled data in method-of-moments latent variable estimation

Mayee Chen, Benjamin Cohen-Wang, Stephen Mussmann, Frederic Sala, Christopher Re

3:04