A Rigorous Study on Named Entity Recognition: Can Fine-tuning Pretrained Model Lead to the Promised Land?

Abstract: Fine-tuning pretrained models has achieved promising performance on standard NER benchmarks. Generally, these benchmarks are blessed with strong name regularity, high mention coverage and sufficient context diversity. Unfortunately, when scaling NER to open situations, these advantages may no longer exist. This raises a critical question: can previously successful approaches still work well when facing these challenges? As no dataset is currently available to investigate this problem, this paper proposes to conduct randomization tests on standard benchmarks. Specifically, we erase name regularity, mention coverage and context diversity respectively from the benchmarks, in order to explore their impact on the generalization ability of models. To further verify our conclusions, we also construct a new open NER dataset that focuses on entity types with weaker name regularity and lower mention coverage. From both the randomization tests and the empirical experiments, we conclude that 1) name regularity is critical for models to generalize to unseen mentions; 2) high mention coverage may undermine model generalization ability; and 3) context patterns may not require enormous data to capture when using pretrained encoders.
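
One way to picture "erasing name regularity" is to swap every entity mention in a BIO-tagged sentence for a random string while leaving the context and the tags untouched. The snippet below is only a minimal sketch under that assumption; the abstract does not specify the exact procedure, and the function names (mention_spans, random_name, erase_name_regularity) and the BIO tag format are hypothetical choices made for illustration.

```python
import random
import string


def mention_spans(tags):
    """Return (start, end, type) spans from a list of BIO tags; end is exclusive."""
    spans, start, etype = [], None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel "O" closes a trailing span
        if tag.startswith("I-") and start is not None and tag[2:] == etype:
            continue  # still inside the current mention
        if start is not None:
            spans.append((start, i, etype))
            start, etype = None, None
        if tag.startswith("B-"):
            start, etype = i, tag[2:]
    return spans


def random_name(rng, n_tokens, length=6):
    """Build n_tokens random lowercase tokens (an assumed stand-in for a name)."""
    return [
        "".join(rng.choice(string.ascii_lowercase) for _ in range(length))
        for _ in range(n_tokens)
    ]


def erase_name_regularity(tokens, tags, rng, mapping):
    """Replace each mention with a random string of the same token length.

    `mapping` caches replacements so that the same surface mention is always
    rewritten to the same random string across sentences.
    """
    out = list(tokens)
    for start, end, _ in mention_spans(tags):
        key = tuple(tokens[start:end])
        if key not in mapping:
            mapping[key] = random_name(rng, end - start)
        out[start:end] = mapping[key]
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    shared = {}
    tokens = ["John", "Smith", "visited", "Paris"]
    tags = ["B-PER", "I-PER", "O", "B-LOC"]
    print(erase_name_regularity(tokens, tags, rng, shared))
```

Because the tags are unchanged, the rewritten sentence can be fed to the same training or evaluation code; only the surface form of the names is randomized.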

12/07/2020

02/02/2021

Hongyu Lin, Yaojie Lu, Jialong Tang, Xianpei Han, Le Sun, Zhicheng Wei, Nicholas Jing Yuan

Similar Papers

Adversarial Learning Guarantees for Linear Hypotheses and Neural Networks

Pranjal Awasthi, Natalie Frank, Mehryar Mohri

Learning Theory

Ranking vs. Classifying: Measuring Knowledge Base Completion Quality

Marina Speranskaya, Martin Schmitt, Benjamin Roth

knowledge base completion, knowledge graph embedding, classification, ranking

Is Automated Topic Model Evaluation Broken? The Incoherence of Coherence

Alexander Hoyle, Pranav Goel, Andrew Hian-Cheong, Denis Peskov, Jordan Boyd-Graber, Philip Resnik

Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multihop Question-Answering

Harsh Jhamtani, Peter Clark

multihop qa, multihop, eqasc, qasc

Learning the Parameters of Bayesian Networks from Uncertain Data

Segev Wasserkrug, Radu Marinescu, Sergey Zeltyn, Evgeny Shindin, Yishai A Feldman

Reliable Post hoc Explanations: Modeling Uncertainty in Explainability

Dylan Slack, Anna Hilgard, Sameer Singh, Himabindu Lakkaraju

robustness, interpretability

Meta-Cal: Well-controlled Post-hoc Calibration by Ranking

Xingchen Ma, Matthew B Blaschko

Algorithms, Supervised Learning

SAM: The Sensitivity of Attribution Methods to Hyperparameters

Naman Bansal, Chirag Agarwal, Anh Nguyen

xai, explainable, attribution, sensitivity, robustness, explanation, hyperparameters

Understanding the Limitations of Conditional Generative Models

Ethan Fetaya, Joern-Henrik Jacobsen, Will Grathwohl, Richard Zemel

Conditional Generative Models, Generative Classifiers, Robustness, Adversarial Examples

Learning from Context or Names? An Empirical Study on Neural Relation Extraction

Hao Peng, Tianyu Gao, Xu Han, Yankai Lin, Peng Li, Zhiyuan Liu, Maosong Sun, Jie Zhou

relation benchmarks, re scenarios, neural models, re models

MASKER: Masked Keyword Regularization for Reliable Text Classification

Seung Jun Moon, Sangwoo Mo, Kimin Lee, Jaeho Lee, Jinwoo Shin

MASSIVE: Tractable and Robust Bayesian Learning of Many-Dimensional Instrumental Variable Models

Ioan Gabriel Bucur, Tom Claassen, Tom Heskes

Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data

Lingkai Kong, Haoming Jiang, Yuchen Zhuang, Jie Lyu, Tuo Zhao, Chao Zhang

augmented training, in-distribution calibration, text classification, expectation error

Learning to Faithfully Rationalize by Construction

Sarthak Jain, Sarah Wiegreffe, Yuval Pinter, Byron C. Wallace

NLP, neural classification, training, automatic evaluations

Structured Dropout Variational Inference for Bayesian Neural Networks

Son Nguyen, Duong Nguyen, Khai Nguyen, Khoat Than, Hung Bui, Nhat Ho

deep learning, generative model

Further Analysis of Outlier Detection with Deep Generative Models

Ziyu Wang, Bin Dai, David P Wipf, Jun Zhu

DoLFIn: Distributions over Latent Features for Interpretability

Phong Le, Willem Zuidema

Goodness-of-fit test for mismatched self-exciting processes

Song Wei, Shixiang Zhu, Minghe Zhang, Yao Xie

Transfer-Based Semantic Anomaly Detection

Lucas Deecke, Lukas Ruff, Rob Vandermeulen, Hakan Bilen

Algorithms, Unsupervised Learning

Perturbation Based Learning for Structured NLP tasks with Application to Dependency Parsing

Amichay Doitch, Ram Yazdi, Tamir Hazan, Roi Reichart

Structured tasks, Dependency Parsing, NLP, sampling

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang, Yueting Zhuang, Zhen He, Zhuoye Ding, Yun Xiao, Bo Long

Sean Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, Jason Weston

Linyang Li, Ruotian Ma, Qipeng Guo, Xiangyang Xue, Xipeng Qiu

Muhammad Ferjad Naeem, Seong Joon Oh, Yunjey Choi, Youngjung Uh, Jaejun Yoo

Akash Kumar Mohankumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran

Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, Samuel R. Bowman

Bo Zhang, Yue Zhang, Rui Wang, Zhenghua Li, Min Zhang
