Improving Truthfulness of Headline Generation

04/07/2020

Kazuki Matsumaru, Sho Takase, Naoaki Okazaki

Keywords: Truthfulness Generation, abstractive summarization, headline generation, automatic headlines

Abstract: Most studies on abstractive summarization report ROUGE scores between system and reference summaries. However, we have a concern about the truthfulness of generated summaries: whether all facts of a generated summary are mentioned in the source text. This paper explores improving the truthfulness in headline generation on two popular datasets. Analyzing headlines generated by the state-of-the-art encoder-decoder model, we show that the model sometimes generates untruthful headlines. We conjecture that one of the reasons lies in untruthful supervision data used for training the model. In order to quantify the truthfulness of article-headline pairs, we consider the textual entailment of whether an article entails its headline. After confirming quite a few untruthful instances in the datasets, this study hypothesizes that removing untruthful instances from the supervision data may remedy the problem of the untruthful behaviors of the model. Building a binary classifier that predicts an entailment relation between an article and its headline, we filter out untruthful instances from the supervision data. Experimental results demonstrate that the headline generation model trained on filtered supervision data shows no clear difference in ROUGE scores but remarkable improvements in automatic and manual evaluations of the generated headlines.
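
The filtering step described in the abstract can be illustrated with a minimal sketch. It assumes some scoring function that estimates whether an article entails its headline; the `token_overlap_score` function below is a hypothetical toy stand-in (simple token overlap), not the entailment classifier built in the paper, and the names `filter_truthful` and `threshold` are likewise illustrative assumptions.

```python
from typing import Callable, Iterable, List, Tuple

Pair = Tuple[str, str]  # (article, headline)


def token_overlap_score(article: str, headline: str) -> float:
    """Toy stand-in for an entailment classifier (assumption, not the paper's model).

    Returns the share of headline tokens that also occur in the article.
    """
    article_tokens = set(article.lower().split())
    headline_tokens = headline.lower().split()
    if not headline_tokens:
        return 0.0
    hits = sum(1 for tok in headline_tokens if tok in article_tokens)
    return hits / len(headline_tokens)


def filter_truthful(
    pairs: Iterable[Pair],
    entails: Callable[[str, str], float],
    threshold: float = 0.5,
) -> List[Pair]:
    """Keep only pairs whose entailment score reaches the threshold."""
    return [(a, h) for a, h in pairs if entails(a, h) >= threshold]


# Hypothetical supervision data: one supported headline, one unsupported.
pairs = [
    ("the city council approved a new budget on monday",
     "council approved new budget"),
    ("the city council approved a new budget on monday",
     "mayor resigns after budget vote"),
]

kept = filter_truthful(pairs, token_overlap_score, threshold=0.8)
print(len(kept), "of", len(pairs), "pairs kept for training")
```

In practice the `entails` argument would be replaced by a trained entailment classifier; passing it as a callable keeps the filtering logic independent of that choice.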

The talk and the accompanying paper were published at the ACL 2020 virtual conference.

Similar Papers

19/04/2021

Hidden biases in unreliable news detection datasets

Xiang Zhou, Heba Elfardy, Christos Christodoulopoulos, Thomas Butler, Mohit Bansal

10:57

16/11/2020

Multi-Fact Correction in Abstractive Text Summarization

Yue Dong, Shuohang Wang, Zhe Gan, Yu Cheng, Jackie Chi Kit Cheung, Jingjing Liu

Keywords: news summarization, factual inconsistency, pre-trained systems, extractive strategies

11:53

16/11/2020

Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning

Hanlu Wu, Tengfei Ma, Lingfei Wu, Tariro Manyumwa, Shouling Ji

Keywords: summarization task, document system, rouge, unsupervised learning

11:16

07/06/2021

Political Depolarization of News Articles Using Attribute-Aware Word Embeddings

Ruibo Liu, Lili Wang, Chenyan Jia, Soroush Vosoughi

Keywords: Qualitative and quantitative studies of social media, Trust, reputation, recommendation systems, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social media behavior, Measuring predi

6:25

02/02/2021

Fact-Enhanced Synthetic News Generation

Kai Shu, Yichuan Li, Kaize Ding, Huan Liu

13:47

03/08/2020

Adapting Text Embeddings for Causal Inference

Victor Veitch, Dhanya Sridhar, David Blei

8:51

16/11/2020

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Jian Guan, Minlie Huang

Keywords: open-ended generation, story generation, evaluating generation, constructing samples

11:26

02/02/2021

Generating Diversified Comments via Reader-Aware Topic Modeling and Saliency Detection

Wei Wang, Piji Li, Hai-Tao Zheng

15:11

04/07/2020

Unsupervised Opinion Summarization as Copycat-Review Generation

Arthur Bražinskas, Mirella Lapata, Ivan Titov

Keywords: Unsupervised Summarization, Copycat-Review Generation, Opinion summarization, automatically summaries

10:55

14/06/2020

Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing

Vedika Agarwal, Rakshith Shetty, Mario Fritz

Keywords: robustness, vqa, causality, gan, dataset, evaluation, automated semantic scene editing, data augmentation, invariance, covariance

1:00

16/11/2020

Few-Shot Learning for Opinion Summarization

Arthur Bražinskas, Mirella Lapata, Ivan Titov

Keywords: opinion summarization, automatic text, summary production, summarization mode

11:48

15/11/2020

Taming Type Annotations in Gradual Typing

John Peter Campora, Sheng Chen

Keywords: variational types, gradual typing, cast errors

14:33

16/11/2020

Small but Mighty: New Benchmarks for Split and Rephrase

Li Zhang, Huaiyu Zhu, Siddhartha Brahma, Yunyao Li

Keywords: text task, fine-grained evaluation, automatic process, rule-based model

6:58

02/02/2021

Contextualized Rewriting for Text Summarization

Guangsheng Bao, Yue Zhang

17:38

04/07/2020

Hooks in the Headline: Learning to Generate Headlines with Controlled Styles

Di Jin, Zhijing Jin, Joey Tianyi Zhou, Lisa Orii, Peter Szolovits

Keywords: Stylistic Generation, summarization tasks, automatic evaluation, summarization systems

13:42

19/04/2021

How to evaluate a summarizer: Study design and statistical analysis for manual linguistic quality evaluation

Julius Steen, Katja Markert

12:04

22/06/2020

Enriching Knowledge Bases with Interesting Negative Statements

Hiba Arnaout, Simon Razniewski, Gerhard Weikum

Keywords: information retrieval, knowledge bases, ranking, negation

5:25

07/06/2021

Textual Analysis and Timely Detection of Suspended Social Media Accounts

Dominic Seyler, Shulong Tan, Dingcheng Li, Jingyuan Zhang, Ping Li

Keywords: Qualitative and quantitative studies of social media, Credibility of online content, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social media behavior, Text categorization, topic

8:06

04/07/2020

Conditional Augmentation for Aspect Term Extraction via Masked Sequence-to-Sequence Generation

Kun Li, Chengbo Chen, Xiaojun Quan, Qing Ling, Yan Song

Keywords: Conditional Augmentation, Aspect Extraction, sentiment analysis, data augmentation

11:30

26/08/2020

Mitigating Overfitting in Supervised Classification from Two Unlabeled Datasets: A Consistent Risk Correction Approach

Nan Lu, Tianyi Zhang, Gang Niu, Masashi Sugiyama

10:16

26/04/2020

Learning The Difference That Makes A Difference With Counterfactually-Augmented Data

Divyansh Kaushik, Eduard Hovy, Zachary Lipton

Keywords: humans in the loop, annotation artifacts, text classification, sentiment analysis, natural language inference

4:25

04/07/2020

Improving Image Captioning Evaluation by Considering Inter References Variance

Yanzhi Yi, Hangyu Deng, Jinglu Hu

Keywords: Image Evaluation, Evaluating captions, system-level tasks, BERTScore

11:31

02/02/2021

The Style-Content Duality of Attractiveness: Learning to Write Eye-Catching Headlines via Disentanglement

Mingzhe Li, Xiuying Chen, Min Yang, Shen Gao, Dongyan Zhao, Rui Yan

16:14

22/06/2020

Cross-context News Corpus for Protest Events related Knowledge Base Construction

Ali Hürriyetoğlu, Erdem Yörük, Deniz Yüret, Osman Mutlu, Çağrı Yoltar, Fırat Duruşan, Burak Gürel

Keywords: protests, contentious politics, news, text classification, event extraction, social sciences, political sciences, computational social science

4:45

04/07/2020

On the Robustness of Language Encoders against Grammatical Errors

Fan Yin, Quanyu Long, Tao Meng, Kai-Wei Chang

Keywords: downstream applications, linguistic task, Language Encoders, pre-trained encoders

11:09

02/02/2021

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Zhao Wang, Aron Culotta

17:39

06/12/2020

A Unified View of Label Shift Estimation

Saurabh Garg, Yifan Wu, Sivaraman Balakrishnan, Zachary Lipton

3:18

01/07/2020

A Probabilistic Model with Commonsense Constraints for Pattern-based Temporal Fact Extraction

Yang Zhou, Tong Zhao, Meng Jiang

9:38

02/02/2021

Hierarchical Coherence Modeling for Document Quality Assessment

Dongliang Liao, Jin Xu, Gongfu Li, Yiru Wang

18:16

02/02/2021

Multi-Dimensional Explanation of Target Variables from Documents

Diego Antognini, Claudiu Musat, Boi Faltings

19:03

19/04/2021

StructSum: Summarization via structured representations

Vidhisha Balachandran, Artidoro Pagnoni, Jay Yoon Lee, Dheeraj Rajagopal, Jaime Carbonell, Yulia Tsvetkov

6:32

04/07/2020

Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction

Raphael Schumann, Lili Mou, Yao Lu, Olga Vechtomova, Katja Markert

Keywords: Unsupervised Summarization, Word-Level Extraction, Automatic summarization, Discrete Optimization

10:39

16/11/2020

CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation

Tianlu Wang, Xuezhi Wang, Yao Qin, Ben Packer, Kang Li, Jilin Chen, Alex Beutel, Ed Chi

Keywords: sentiment classification, model re-training, nlp models, cat-gen model

6:58

04/07/2020

Max-Margin Incremental CCG Parsing

Miloš Stanojević, Mark Steedman

Keywords: Incremental parsing, human processing, ASR, MT

11:39

14/06/2020

ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection

Yuxin Wang, Hongtao Xie, Zheng-Jun Zha, Mengting Xing, Zilong Fu, Yongdong Zhang

Keywords: scene text detection, arbitrary shapes, false-positive suppression, large scale variance

1:01

06/12/2020

Learning to summarize with human feedback

Nisan Stiennon, Long Ouyang, Jeffrey Wu, Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

3:17

02/02/2021

Re-TACRED: Addressing Shortcomings of the TACRED Dataset

George Stoica, Emmanouil Antonios Platanios, Barnabas Poczos

16:45

06/12/2021

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Zhengzhuo Xu, Zenghao Chai, Chun Yuan

Keywords: theory, machine learning

4:23

01/07/2020

Distilling the Evidence to Augment Fact Verification Models

Beatrice Portelli, Jason Zhao, Tal Schuster, Giuseppe Serra, Enrico Santus

10:24

16/11/2020

Detecting Word Sense Disambiguation Biases in Machine Translation for Model-Agnostic Adversarial Attacks

Denis Emelin, Ivan Titov, Rico Sennrich

Keywords: word disambiguation, nmt, prediction errors, adversarial strategy

12:57