Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

14/09/2020

Haw-Shiuan Chang, Shankar Vembu, Sunil Mohan, Rheeya Uppaal, Andrew McCallum

Abstract: Existing deep active learning algorithms achieve impressive sampling efficiency on natural language processing tasks. However, they exhibit several weaknesses in practice, including (a) inability to use uncertainty sampling with black-box models, (b) lack of robustness to labeling noise, and (c) lack of transparency. In response, we propose a transparent batch active sampling framework by estimating the error decay curves of multiple feature-defined subsets of the data. Experiments on four named entity recognition (NER) tasks demonstrate that the proposed methods significantly outperform diversification-based methods for black-box NER taggers, and can make the sampling process more robust to labeling noise when combined with uncertainty-based methods. Furthermore, the analysis of experimental results sheds light on the weaknesses of different active sampling strategies, and when traditional uncertainty-based or diversification-based methods can be expected to work well.

The talk and the respective paper were published at the ECML PKDD 2020 virtual conference.

Similar Papers

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords: Natural Inference, data augmentation, Robustifying Models, deep models

11:34

02/02/2021

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

Qizhou Wang, Bo Han, Tongliang Liu, Gang Niu, Jian Yang, Chen Gong

14:56

02/02/2021

Uncertainty-Aware Multi-View Representation Learning

Yu Geng, Zongbo Han, Changqing Zhang, Qinghua Hu

14:19

06/12/2021

DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples

Yi Xu, Jiandong Ding, Lu Zhang, Shuigeng Zhou

Keywords: deep learning, machine learning, semi-supervised learning

10:11

13/04/2021

Automatic differentiation variational inference with mixtures

Warren Morningstar, Sharad Vikram, Cusuh Ham, Andrew Gallagher, Joshua Dillon

3:05

04/07/2020

Perturbation Based Learning for Structured NLP tasks with Application to Dependency Parsing

Amichay Doitch, Ram Yazdi, Tamir Hazan, Roi Reichart

Keywords: Structured tasks, Dependency Parsing, NLP, sampling

10:53

06/12/2021

Instance-dependent Label-noise Learning under a Structural Causal Model

Yu Yao, Tongliang Liu, Mingming Gong, Bo Han, Gang Niu, Kun Zhang

Keywords: deep learning, causality

11:12

19/08/2021

Partial Multi-Label Optimal Margin Distribution Machine

Nan Cao, Teng Zhang, Hai Jin

Keywords: Machine Learning, Classification, Multi-instance; Multi-label; Multi-view learning, Weakly Supervised Learning

11:43

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords: data process, re collection, sota models, tacred

5:55

03/05/2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve, Kevin Duarte, Yogesh S Rawat, Mubarak Shah

Keywords: Deep Learning, Calibration, Uncertainty, Pseudo-Labeling, Semi-Supervised Learning

5:06

16/11/2020

SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness

Nathan Ng, Kyunghyun Cho, Marzyeh Ghassemi

Keywords: data augmentation, ood generalization, robustness benchmarks, ssmba

10:26

02/02/2021

Category Dictionary Guided Unsupervised Domain Adaptation for Object Detection

Shuai Li, Jianqiang Huang, Xian-Sheng Hua, Lei Zhang

15:00

06/12/2020

Understanding Anomaly Detection with Deep Invertible Networks through Hierarchies of Distributions and Features

Robin Schirrmeister, Yuxuan Zhou, Tonio Ball, Dan Zhang

3:21

02/02/2021

MASKER: Masked Keyword Regularization for Reliable Text Classification

Seung Jun Moon, Sangwoo Mo, Kimin Lee, Jaeho Lee, Jinwoo Shin

15:05

02/02/2021

DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation

Haoyue Bai, Rui Sun, Lanqing Hong, Fengwei Zhou, Nanyang Ye, Han-Jia Ye, S.-H. Gary Chan, Zhenguo Li

15:59

14/06/2020

Attack to Explain Deep Representation

Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Keywords: interpreting deep learning, adversarial attack, explanation attack, explainable ai, image generation

1:01

06/12/2021

Parallel Bayesian Optimization of Multiple Noisy Objectives with Expected Hypervolume Improvement

Samuel Daulton, Maximilian Balandat, Eytan Bakshy

Keywords: optimization, machine learning, kernel methods

9:08

04/07/2020

Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance

Prasetya Ajie Utama, Nafise Sadat Moosavi, Iryna Gurevych

Keywords: Debiasing Models, natural tasks, NLU tasks, debiasing methods

11:09

06/12/2020

Joints in Random Forests

Alvaro Correia, Robert Peharz, Cassio de Campos

2:28

06/12/2020

ColdGANs: Taming Language GANs with Cautious Sampling Strategies

Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

3:19

14/06/2020

Deep Generative Model for Robust Imbalance Classification

Xinyue Wang, Yilin Lyu, Liping Jing

Keywords: imbalance classification, deep generative classifier, generative model, robust classification

1:01

22/11/2021

Unsupervised Domain Adaptation of Black-Box Source Models

Haojian Zhang, Yabin Zhang, Kui Jia, Lei Zhang

Keywords: domain adaptation, black box, unsupervised, noisy label, iterative

2:57

14/06/2020

Multi-Scale Interactive Network for Salient Object Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Keywords: saliency detection, salient object detection, feature interaction strategy, scale-insensitive loss, multi-scale features, multi-level features, fully convolutional network, deep learning

1:01

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords: reinforcement learning and planning, transformers

8:43

14/06/2020

Conditional Gaussian Distribution Learning for Open Set Recognition

Xin Sun, Zhenning Yang, Chi Zhang, Keck-Voon Ling, Guohao Peng

Keywords: open set recognition, conditional variational auto-encoder, gaussian distribution learning, probabilistic ladder architecture

1:01

06/12/2021

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords: deep learning, machine learning, robustness, vision, domain adaptation

8:21

30/11/2020

dpVAEs: Fixing Sample Generation for Regularized VAEs

Riddhish Bhalodia, Iain Lee, Shireen Elhabian

7:54

08/12/2020

Natural Language Inference with Mixed Effects

William Gantt, Benjamin Kane, Aaron Steven White

8:31

06/12/2021

Understanding the Limits of Unsupervised Domain Adaptation via Data Poisoning

Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm

Keywords: robustness, domain adaptation

13:34

06/12/2021

STEP: Out-of-Distribution Detection in the Presence of Limited In-Distribution Labeled Data

Zhi Zhou, Lan-Zhe Guo, Zhanzhan Cheng, Yu-Feng Li, Shiliang Pu

Keywords: optimization, semi-supervised learning

11:24

12/07/2020

Robustness to Programmable String Transformations via Augmented Abstract Training

Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

Keywords: Adversarial Examples

14:49

16/11/2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Jack Hessel, Lillian Lee

Keywords: modeling interactions, multimodal tasks, visual answering, multimodal learning

12:02

03/05/2021

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li, Lemao Liu, Shuming Shi

Keywords: Negative Sampling, Unlabeled Entity Problem, Named Entity Recognition

4:49

06/12/2020

Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration

Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

3:17

06/12/2021

Multi-Label Learning with Pairwise Relevance Ordering

Ming-Kun Xie, Sheng-Jun Huang

Keywords: machine learning

3:56

07/09/2020

Object Detection as a Positive-Unlabeled Problem

Yuewei Yang, Kevin Liang, Lawrence Carin

Keywords: object detections, positive unlabeled learning

8:54

02/02/2021

Learning from Noisy Labels with Complementary Loss Functions

Deng-Bao Wang, Yong Wen, Lujia Pan, Min-Ling Zhang

14:00

13/04/2021

Comparing the value of labeled and unlabeled data in method-of-moments latent variable estimation

Mayee Chen, Benjamin Cohen-Wang, Stephen Mussmann, Frederic Sala, Christopher Re

3:04

12/07/2020

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula, Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords: Deep Learning - Algorithms

15:25

06/12/2020

Generalized Hindsight for Reinforcement Learning

Alex Li, Lerrel Pinto, Pieter Abbeel

3:20