Towards a decomposable metric for explainable evaluation of text generation from AMR

19/04/2021

Towards a decomposable metric for explainable evaluation of text generation from AMR

Juri Opitz, Anette Frank

Keywords:

Abstract Paper Similar Papers

Abstract: Systems that generate natural language text from abstract meaning representations such as AMR are typically evaluated using automatic surface matching metrics that compare the generated texts to reference texts from which the input meaning representations were constructed. We show that besides well-known issues from which such metrics suffer, an additional problem arises when applying these metrics for AMR-to-text evaluation, since an abstract meaning representation allows for numerous surface realizations. In this work we aim to alleviate these issues by proposing <span class="tex-math">ℳℱ𝛽</span>, a decomposable metric that builds on two pillars. The first is the <b>principle of meaning preservation <span class="tex-math">ℳ</span> </b>: it measures to what extent a given AMR can be reconstructed from the generated sentence using SOTA AMR parsers and applying (fine-grained) AMR evaluation metrics to measure the distance between the original and the reconstructed AMR. The second pillar builds on a <b>principle of (grammatical) form <span class="tex-math">ℱ</span></b> that measures the linguistic quality of the generated text, which we implement using SOTA language models. In two extensive pilot studies we show that fulfillment of both principles offers benefits for AMR-to-text evaluation, including explainability of scores. Since <span class="tex-math">ℳℱ𝛽</span> does not necessarily rely on gold AMRs, it may extend to other text generation tasks.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EACL 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Self-Attention with Cross-Lingual Position Representation

Liang Ding, Longyue Wang, Dacheng Tao

Keywords Paper

natural tasks, WMT'17 tasks, Cross-Lingual Representation, Position encoding

0

0

0

0

7:46

08/12/2020

A Deep Metric Learning Method for Biomedical Passage Retrieval

Andrés Rosso-Mateus, Fabio A. González, Manuel Montes-y-Gómez

Keywords Paper

0

0

0

0

14:58

19/08/2021

Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing

Yuntao Li, Bei Chen, Qian Liu and
Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

Keywords Paper

Natural Language Processing, Natural Language Semantics

0

0

0

0

12:37

30/11/2020

Scale-Aware Polar Representation for Arbitrarily-Shaped Text Detection

Yanguang Bi, Zhiqiang Hu

Keywords Paper

0

0

0

0

9:56

14/06/2020

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension

Zhenfang Chen, Peng Wang, Lin Ma and
Kwan-Yee K. Wong, Qi Wu

Keywords Paper

compositional referring expression comprehension, visual reasoning

0

0

0

0

1:00

16/11/2020

LAReQA: Language-Agnostic Answer Retrieval from a Multilingual Pool

Uma Roy, Noah Constant, Rami Al-Rfou and
Aditya Barua, Aaron Phillips, Yinfei Yang

Keywords Paper

language-agnostic retrieval, cross-lingual tasks, cross-lingual retrieval, alignment

0

0

0

0

12:07

08/12/2020

Informative Manual Evaluation of Machine Translation Output

Maja Popović

Keywords Paper

0

0

0

0

15:26

08/12/2020

AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations

Lifeng Han, Gareth Jones, Alan Smeaton

Keywords Paper

0

0

0

0

14:26

04/07/2020

Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation

Bo Pang, Erik Nijkamp, Wenjuan Han and
Linqi Zhou, Yixian Liu, Kewei Tu

Keywords Paper

Holistic Generation, Open-domain generation, Natural Processing, human evaluation

0

0

0

1

11:57

12/07/2020

Educating Text Autoencoders: Latent Representation Guidance via Denoising

Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

17:06

04/07/2020

CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task

Shuo Sun, Suzanna Sia, Kevin Duh

Keywords Paper

Machine Translation, Cross-Lingual Task, machine MT, MT

0

0

0

0

9:37

19/08/2021

Micro-Expression Recognition Enhanced by Macro-Expression from Spatial-Temporal Domain

Bin Xia, Shangfei Wang

Keywords Paper

Computer Vision, Biometrics, Face and Gesture Recognition

0

0

0

0

3:28

06/12/2020

Measuring Systematic Generalization in Neural Proof Generation with Transformers

Nicolas Gontier, Koustuv Sinha, Siva Reddy, Chris Pal

Keywords Paper

0

0

0

0

3:23

06/12/2021

BARTScore: Evaluating Generated Text as Text Generation

Weizhe Yuan, Graham Neubig, Pengfei Liu

Keywords Paper

0

0

0

0

13:47

16/11/2020

Word Rotator's Distance

Sho Yokoi, Ryo Takahashi, Reina Akama and
Jun Suzuki, Kentaro Inui

Keywords Paper

assessing similarity, vector converter, word alignment, alignment-based approaches

0

0

0

0

11:32

14/06/2020

Attention-Guided Hierarchical Structure Aggregation for Image Matting

Yu Qiao, Yuhao Liu, Xin Yang and
Dongsheng Zhou, Mingliang Xu, Qiang Zhang, Xiaopeng Wei

Keywords Paper

image matting, attention, hierarchical, aggregation, appearance cues

0

0

0

0

0:59

16/11/2020

A Bilingual Generative Transformer for Semantic Sentence Embedding

John Wieting, Graham Neubig, Taylor Berg-Kirkpatrick

Keywords Paper

source separation, semantic encoding, data distributions, unsupervised evaluations

0

0

0

0

14:32

03/05/2021

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu and
Luxi Xing, Heng Yu, Weihua Luo

Keywords Paper

hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

0

0

0

0

3:51

01/07/2020

Leveraging Sentence Similarity in Natural Language Generation: Improving Beam Search using Range Voting

Sebastian Borgeaud, Guy Emerson

Keywords Paper

0

0

0

0

8:58

01/07/2020

Word Embeddings as Tuples of Feature Probabilities

Siddharth Bhat, Alok Debnath, Souvik Banerjee, Manish Shrivastava

Keywords Paper

0

0

0

0

11:57

03/05/2021

CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding

Yanru Qu, Dinghan Shen, Yelong Shen and
Sandra Sajeev, Weizhu Chen, Jiawei Han

Keywords Paper

consistency training, contrastive learning, data augmentation, natural language understanding

0

0

0

0

6:02

06/12/2021

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition

Yichong Leng, Xu Tan, Linchen Zhu and
Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiangyang Li, Edward Lin, Tie-Yan Liu

Keywords Paper

0

0

0

0

13:44

16/11/2020

Simulated multiple reference training improves low-resource machine translation

Huda Khayrallah, Brian Thompson, Matt Post, Philipp Koehn

Keywords Paper

machine mt, mt, simulated training, simulated

0

0

0

0

6:56

16/11/2020

Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems

Jinghui Qin, Lihui Lin, Xiaodan Liang and
Rumin Zhang, Liang Lin

Keywords Paper

one-unknown mwps, universal, uet, semantically-aligned solver

0

0

0

0

11:08

08/12/2020

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP

Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

Keywords Paper

0

0

0

0

14:39

16/11/2020

Partially-Aligned Data-to-Text Generation with Distant Supervision

Zihao Fu, Bei Shi, Wai Lam and
Lidong Bing, Zhiyuan Liu

Keywords Paper

data-to-text task, generation task, dataset problem, over-generation problem

0

0

0

0

11:58

14/06/2020

More Grounded Image Captioning by Distilling Image-Text Matching Model

Yuanen Zhou, Meng Wang, Daqing Liu and
Zhenzhen Hu, Hanwang Zhang

Keywords Paper

grounded image captioning, image-text matching, visual grounding, cross-task knowledge distillation

0

0

0

0

1:01

04/07/2020

Considering Likelihood in NLP Classiﬁcation Explanations with Occlusion and Language Modeling

David Harbecke, Christoph Alt

Keywords Paper

NLP, NLP Explanations, Language Modeling, NLP models

0

0

0

0

12:01

04/07/2020

Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics

Nitika Mathur, Timothy Baldwin, Trevor Cohn

Keywords Paper

judging metrics, assessment, pairwise ranking, thresholding

0

0

0

0

11:39

19/08/2021

Generating Senses and RoLes: An End-to-End Model for Dependency- and Span-based Semantic Role Labeling

Rexhina Blloshmi, Simone Conia, Rocco Tripodi, Roberto Navigli

Keywords Paper

Natural Language Processing, Natural Language Semantics, Natural Language Generation, Natural Language Processing

0

0

0

0

15:18

04/07/2020

Facet-Aware Evaluation for Extractive Summarization

Yuning Mao, Liyuan Liu, Qi Zhu and
Xiang Ren, Jiawei Han

Keywords Paper

Facet-Aware Evaluation, Extractive Summarization, fine-grained evaluation, comparative analysis

0

0

0

0

11:43

08/12/2020

A Human Evaluation of AMR-to-English Generation Systems

Emma Manning, Shira Wein, Nathan Schneider

Keywords Paper

0

0

0

0

15:12

04/07/2020

Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model

Kosuke Takahashi, Katsuhito Sudoh, Satoshi Nakamura

Keywords Paper

Automatic Evaluation, machine translation, Cross-lingual Model, regression model

0

0

0

0

7:17

04/07/2020

Better Document-level Machine Translation with Bayes' Rule

Lei Yu, Laurent Sartran, Wojciech Stokowiec and
Wang Ling, Lingpeng Kong, Phil Blunsom, Chris Dyer

Keywords Paper

Document-level Translation, inference, Bayes Rule, document models

0

0

0

0

10:57

03/05/2021

Probing BERT in Hyperbolic Spaces

Boli Chen, Yao Fu, Guangwei Xu and
Pengjun Xie, Chuanqi Tan, Mosha Chen, Liping Jing

Keywords Paper

Sentiment, Syntax, Probe, BERT, Hyperbolic

0

0

0

0

5:10

04/07/2020

ECPE-2D: Emotion-Cause Pair Extraction based on Joint Two-Dimensional Representation, Interaction and Prediction

Zixiang Ding, Rui Xia, Jianfei Yu

Keywords Paper

Prediction, emotion-cause extraction, text analysis, Joint Representation

0

0

0

0

9:35

16/11/2020

COGS: A Compositional Generalization Challenge Based on Semantic Interpretation

Najoung Kim, Tal Linzen

Keywords Paper

compositional generalization, language architectures, cogs, lstms

0

0

0

0

11:42

19/04/2021

If you’ve got it, flaunt it: Making the most of fine-grained sentiment annotations

Jeremy Barnes, Lilja Øvrelid, Erik Velldal

Keywords Paper

0

0

0

0

12:36

16/11/2020

Wasserstein Distance Regularized Sequence Representation for Text Matching in Asymmetrical Domains

Weijie Yu, Chen Xu, Jun Xu and
Liang Pang, Xiaopeng Gao, Xiaozhao Wang, Ji-Rong Wen

Keywords Paper

real-world practices, text matching, matching models, match method

0

0

0

0

11:43

16/11/2020

Statistical Power and Translationese in Machine Translation Evaluation

Yvette Graham, Barry Haddow, Philipp Koehn

Keywords Paper

machine evaluation, human-parity mt, human translation, significance tests

0

0

0

0

11:48