Interpretable Multi-dataset Evaluation for Named Entity Recognition

16/11/2020

Interpretable Multi-dataset Evaluation for Named Entity Recognition

Jinlan Fu, Pengfei Liu, Graham Neubig

Keywords: natural tasks, interpretable evaluation, named task, analysis tool

Abstract Paper Similar Papers

Abstract: With the proliferation of models for natural language processing tasks, it is even harder to understand the differences between models and their relative merits. Simply looking at differences between holistic metrics such as accuracy, BLEU, or F1 does not tell us why or how particular methods perform differently and how diverse datasets influence the model design choices. In this paper, we present a general methodology for interpretable evaluation for the named entity recognition (NER) task. The proposed evaluation method enables us to interpret the differences in models and datasets, as well as the interplay between them, identifying the strengths and weaknesses of current systems. By making our analysis tool available, we make it easy for future researchers to run similar analyses and drive progress in this area: https://github.com/neulab/InterpretEval

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Dynamic Refinement Network for Oriented and Densely Packed Object Detection

Xingjia Pan, Yuqiang Ren, Kekai Sheng and
Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

Keywords Paper

object detection, oriented, densely packed, sku110k, feature selection, dynamic, anchor-free

0

0

0

0

5:01

16/11/2020

What Does My QA Model Know? Devising Controlled Probes using Expert

Kyle Richardson, Ashish Sabharwal

Keywords Paper

knowledge challenges, benchmark tasks, diagnostic tasks, taxonomic reasoning

0

0

0

0

12:16

08/12/2020

ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation

Dario Stojanovski, Benno Krojer, Denis Peskov, Alexander Fraser

Keywords Paper

0

0

0

0

14:09

04/07/2020

Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview

Deven Santosh Shah, H. Andrew Schwartz, Dirk Hovy

Keywords Paper

NLP, Natural Models, Conceptual Framework, mitigation techniques

0

0

0

0

11:52

06/12/2021

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

machine learning, fairness, language

0

0

0

0

13:17

06/12/2021

Is Automated Topic Model Evaluation Broken? The Incoherence of Coherence

Alexander Hoyle, Pranav Goel, Andrew Hian-Cheong and
Denis Peskov, Jordan Boyd-Graber, Philip Resnik

Keywords Paper

0

0

0

0

15:00

14/06/2020

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension

Zhenfang Chen, Peng Wang, Lin Ma and
Kwan-Yee K. Wong, Qi Wu

Keywords Paper

compositional referring expression comprehension, visual reasoning

0

0

0

0

1:00

03/05/2021

Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models

Zirui Wang, Yulia Tsvetkov, Orhan Firat, Yuan Cao

Keywords Paper

Multilingual Modeling, Multi-task Learning

0

0

0

0

10:16

04/07/2020

Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Hanjie Chen, Guangtao Zheng, Yangfeng Ji

Keywords Paper

Text Classification, Generating explanations, natural processing, model prediction

0

0

0

0

11:47

23/08/2020

Diverse rule sets

Guangyi Zhang, Aristides Gionis

Keywords Paper

sampling, classifier, pattern mining, rule learning, diversification, rule sets

0

0

0

0

9:41

05/01/2021

ChartOCR: Data Extraction From Charts Images via a Deep Hybrid Framework

Junyu Luo, Zekun Li, Jinpeng Wang, Chin-Yew Lin

Keywords Paper

0

0

0

0

4:58

04/07/2020

Reverse Engineering Configurations of Neural Text Generation Models

Yi Tay, Dara Bahri, Che Zheng and
Clifford Brunk, Donald Metzler, Andrew Tomkins

Keywords Paper

Reverse Models, neural modeling, Neural Models, generative models

0

0

0

0

6:16

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

06/12/2020

Learning Deep Attribution Priors Based On Prior Knowledge

Ethan Weinberger, Joe Janizek, Su-In Lee

Keywords Paper

0

0

0

0

4:20

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords Paper

Natural Inference, data augmentation, Robustifying Models, deep models

0

0

0

0

11:34

16/11/2020

Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Wanrong Zhu, Xin Wang, Pradyumna Narayana and
Kazoo Sone, Sugato Basu, William Yang Wang

Keywords Paper

visually generation, vision-and-language tasks, cider, utilities

0

0

0

0

6:47

08/12/2020

Bayesian Methods for Semi-supervised Text Annotation

Kristian Miok, Gregor Pirs, Marko Robnik-Sikonja

Keywords Paper

0

0

0

0

11:18

26/08/2020

Model-Agnostic Counterfactual Explanations for Consequential Decisions

Amir-Hossein Karimi, Gilles Barthe, Borja Balle, Isabel Valera

Keywords Paper

0

0

0

0

16:25

06/12/2020

Beyond Individualized Recourse: Interpretable and Interactive Summaries of Actionable Recourses

Kai Rawal, Himabindu Lakkaraju

Keywords Paper

0

0

0

0

3:31

04/07/2020

Temporally-Informed Analysis of Named Entity Recognition

Shruti Rijhwani, Daniel Preotiuc-Pietro

Keywords Paper

named recognition, NLP tasks, Natural models, language use

0

0

0

0

11:30

06/12/2021

Self-Interpretable Model with Transformation Equivariant Interpretation

Yipei Wang, Xiaoqian Wang

Keywords Paper

deep learning, machine learning, vision

0

0

0

0

8:26

08/12/2020

Best Practices for Data-Efficient Modeling in NLG:How to Train Production-Ready Neural Models with Less Data

Ankit Arun, Soumya Batra, Vikas Bhardwaj and
Ashwini Challa, Pinar Donmez, Peyman Heidari, Hakan Inan, Shashank Jain, Anuj Kumar, Shawn Mei, Karthik Mohan, Michael White

Keywords Paper

0

0

0

0

15:01

16/11/2020

Do Explicit Alignments Robustly Improve Multilingual Encoders?

Shijie Wu, Mark Dredze

Keywords Paper

multilingual, unsupervised encoders, cross-lingual representation, contrastive objective

0

0

0

0

7:14

12/07/2020

Automatic Reparameterisation of Probabilistic Programs

Maria Gorinova, Dave Moore, Matthew Hoffman

Keywords Paper

Probabilistic Inference - Models and Probabilistic Programming

0

0

0

0

15:40

03/05/2021

CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding

Yanru Qu, Dinghan Shen, Yelong Shen and
Sandra Sajeev, Weizhu Chen, Jiawei Han

Keywords Paper

consistency training, contrastive learning, data augmentation, natural language understanding

0

0

0

0

6:02

07/09/2020

Advancing weakly supervised cross-domain alignment with optimal transport

Siyang Yuan, Ke Bai, Liqun Chen and
Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin Duke

Keywords Paper

Optimal Transport, Cross Domain Alignment

0

0

0

0

10:04

06/12/2021

Reliable Post hoc Explanations: Modeling Uncertainty in Explainability

Dylan Slack, Anna Hilgard, Sameer Singh, Himabindu Lakkaraju

Keywords Paper

robustness, interpretability

0

0

0

0

15:06

02/02/2021

Multi-Dimensional Explanation of Target Variables from Documents

Diego Antognini, Claudiu Musat, Boi Faltings

Keywords Paper

0

0

0

0

19:03

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

04/07/2020

Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training

Margaret Li, Stephen Roller, Ilia Kulikov and
Sean Welleck, Y-Lan Boureau, Kyunghyun Cho, Jason Weston

Keywords Paper

dialogue tasks, Unlikelihood Training, Generative models, maximum training

0

0

0

0

11:26

26/04/2020

Neural Module Networks for Reasoning over Text

Nitish Gupta, Kevin Lin, Dan Roth and
Sameer Singh, Matt Gardner

Keywords Paper

question answering, compositionality, neural module networks, multi-step reasoning, reading comprehension

0

0

0

0

4:36

03/08/2020

Automated Dependence Plots

David Inouye, Liu Leqi, Joon Sik Kim and
Bryon Aragam, Pradeep Ravikumar

Keywords Paper

0

0

0

0

8:35

02/02/2021

Do Response Selection Models Really Know What’s Next? Utterance Manipulation Strategies for Multi-turn Response Selection

Taesun Whang, Dongyub Lee, Dongsuk Oh and
Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

Keywords Paper

0

0

0

0

17:37

02/02/2021

LIREx: Augmenting Language Inference with Relevant Explanations

Xinyan Zhao, V.G.Vinod Vydiswaran

Keywords Paper

0

0

0

0

18:56

16/11/2020

LAReQA: Language-Agnostic Answer Retrieval from a Multilingual Pool

Uma Roy, Noah Constant, Rami Al-Rfou and
Aditya Barua, Aaron Phillips, Yinfei Yang

Keywords Paper

language-agnostic retrieval, cross-lingual tasks, cross-lingual retrieval, alignment

0

0

0

0

12:07

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords Paper

reinforcement learning and planning, transformers

0

0

0

0

8:43

02/02/2021

A Simple and Effective Self-Supervised Contrastive Learning Framework for Aspect Detection

Tian Shi, Liuqing Li, Ping Wang, Chandan K. Reddy

Keywords Paper

0

0

0

0

19:21

14/09/2020

Diversity-Based Generalization for Unsupervised Text Classification under Domain Shift

Jitin Krishnan, Hemant Purohit, Huzefa Rangwala

Keywords Paper

text classification, unsupervised domain adaptation, natural language processing, neural networks

0

0

0

0

16:13

19/04/2021

Learning from revisions: Quality assessment of claims in argumentation at scale

Gabriella Skitalinskaya, Jonas Klaff, Henning Wachsmuth

Keywords Paper

0

0

0

0

9:47

04/07/2020

A Two-Step Approach for Implicit Event Argument Detection

Zhisong Zhang, Xiang Kong, Zhengzhong Liu and
Xuezhe Ma, Eduard Hovy

Keywords Paper

Implicit Detection, implicit task, modeling, argument detection

0

0

0

0

6:47