Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

02/02/2021

Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

Ramakanth Pasunuru, Asli Celikyilmaz, Michel Galley, Chenyan Xiong, Yizhe Zhang, Mohit Bansal, Jianfeng Gao

Keywords:

Abstract Paper Similar Papers

Abstract: The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient largescale high-quality training datasets. We present two QMDS training datasets, which we construct using two data augmentation methods: (1) transferring the commonly used single-document CNN/Daily Mail summarization dataset to create the QMDSCNN dataset, and (2) mining search-query logs to create the QMDSIR dataset. These two datasets have complementary properties, i.e., QMDSCNN has real summaries but queries are simulated, while QMDSIR has real queries but simulated summaries. To cover both these real summary and query aspects, we build abstractive end-to-end neural network models on the combined datasets that yield new state-of-the-art transfer results on DUC datasets. We also introduce new hierarchical encoders that enable a more efficient encoding of the query together with multiple documents. Empirical results demonstrate that our data augmentation and encoding methods outperform baseline models on automatic metrics, as well as on human evaluations along multiple attributes.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38949305

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

16/11/2020

Coarse-to-Fine Query Focused Multi-Document Summarization

Yumo Xu, Mirella Lapata

Keywords Paper

modeling interactions, query summarization, assembling summaries, question answering

0

0

0

0

11:30

14/06/2020

Learning Meta Face Recognition in Unseen Domains

Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao and
Dong Cao, Zhen Lei, Stan Z. Li

Keywords Paper

face recognition, meta learning, domain generalization, metric learning

0

0

0

0

5:01

12/07/2020

PoKED: A Semi-Supervised System for Word Sense Disambiguation

Feng Wei

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

15:39

06/12/2021

Scalable Neural Data Server: A Data Recommender for Transfer Learning

Tianshi Cao, Sasha (Alexandre) Doubov, David Acuna, Sanja Fidler

Keywords Paper

machine learning, vision, transfer learning

0

0

0

0

12:54

14/06/2020

Factorized Higher-Order CNNs With an Application to Spatio-Temporal Emotion Estimation

Jean Kossaifi, Antoine Toisoul, Adrian Bulat and
Yannis Panagakis, Timothy M. Hospedales, Maja Pantic

Keywords Paper

tensor methods, deep learning, spatiotemporal, emotion, cnn, tensor decomposition, low-rank, valence, arousal

0

0

0

0

1:01

12/07/2020

Deep Reinforcement Learning with Smooth Policy

Qianli Shen, Yan Li, Haoming Jiang and
Zhaoran Wang, Tuo Zhao

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

9:51

02/02/2021

Precise Yet Efficient Semantic Calibration and Refinement in ConvNets for Real-time Polyp Segmentation from Colonoscopy Videos

Huisi Wu, Jiafu Zhong, Wei Wang and
Zhenkun Wen, Jing Qin

Keywords Paper

0

0

0

0

17:40

03/05/2021

The geometry of integration in text classification RNNs

Kyle Aitken, Vinay Ramasesh, Ankush Garg and
Yuan Cao, David Sussillo, Niru Maheswaranathan

Keywords Paper

interpretability, dynamical systems, reverse engineering, document classification, Recurrent neural networks

0

0

0

0

5:13

16/11/2020

Evaluating the Factual Consistency of Abstractive Text Summarization

Wojciech Kryscinski, Bryan McCann, Caiming Xiong, Richard Socher

Keywords Paper

assessing algorithms, natural inference, fact checking, auxiliary tasks

0

0

0

0

12:05

19/04/2021

Progressively pretrained dense corpus index for open-domain question answering

Wenhan Xiong, Hong Wang, William Yang Wang

Keywords Paper

0

0

0

0

12:15

19/10/2020

Efficient neural query auto completion

Sida Wang, Weiwei Guo, Huiji Gao, Bo Long

Keywords Paper

deep learning, query auto completion, neural language model

0

0

0

0

9:59

06/12/2020

VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain

Jinsung Yoon, Yao Zhang, James Jordon, Mihaela van der Schaar

Keywords Paper

0

0

0

0

3:25

06/12/2021

Revisiting Deep Learning Models for Tabular Data

Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko

Keywords Paper

deep learning, transformers

0

0

0

0

12:14

06/12/2021

Structured Reordering for Modeling Latent Alignments in Sequence Transduction

bailin wang, Mirella Lapata, Ivan Titov

Keywords Paper

language

0

0

0

0

15:00

25/07/2020

Query resolution for conversational search with limited supervision

Nikos Voskarides, Dan Li, Pengjie Ren and
Evangelos Kanoulas, Maarten Rijke

Keywords Paper

query resolution, conversational search

0

0

0

0

8:42

22/06/2020

Syntactic Question Abstraction and Retrieval for Data-Scarce Semantic Parsing

Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Minjoon Seo

Keywords Paper

Semantic Parsing, NLIDB, WikiSQL, Question Answering, SQL, Information Retrieval

0

0

0

0

4:37

05/04/2021

RL-Scope: Cross-stack Profiling for Deep Reinforcement Learning Workloads

James Gleeson, Sri Krishnan, Moshe Gabel and
Vijay Janapa Reddi, Eyal de Lara, Gennady Pekhimenko

Keywords Paper

0

0

0

0

20:00

05/04/2021

RL-Scope: Cross-stack Profiling for Deep Reinforcement Learning Workloads

James Gleeson, Sri Krishnan, Moshe Gabel and
Vijay Janapa Reddi, Eyal de Lara, Gennady Pekhimenko

Keywords Paper

0

0

0

0

5:17

02/02/2021

LREN: Low-Rank Embedded Network for Sample-Free Hyperspectral Anomaly Detection

Kai Jiang, Weiying Xie, Jie Lei and
Tao Jiang, Yunsong Li

Keywords Paper

0

0

0

0

12:56

08/12/2020

Domain Transfer based Data Augmentation for Neural Query Translation

Liang Yao, Baosong Yang, Haibo Zhang and
Boxing Chen, Weihua Luo

Keywords Paper

0

0

0

0

10:57

16/11/2020

PathQG: Neural Question Generation from Facts

Siyuan Wang, Zhongyu Wei, Zhihao Fan and
Zengfeng Huang, Weijian Sun, Qi Zhang, Xuanjing Huang

Keywords Paper

question generation, query learning, query-based generation, sequence problem

0

0

0

0

11:16

06/12/2021

Automatic Unsupervised Outlier Model Selection

Yue Zhao, Ryan Rossi, Leman Akoglu

Keywords Paper

machine learning, self-supervised learning, meta learning, clustering

0

0

0

0

15:08

16/11/2020

Neural Topic Modeling with Cycle-Consistent Adversarial Training

Xuemeng Hu, Rui Wang, Deyu Zhou, Yuxuan Xiong

Keywords Paper

neural modeling, deep models, adversarial-neural model, adversarially network

0

0

0

1

9:57

16/11/2020

Learning from Context or Names? An Empirical Study on Neural Relation Extraction

Hao Peng, Tianyu Gao, Xu Han and
Yankai Lin, Peng Li, Zhiyuan Liu, Maosong Sun, Jie Zhou

Keywords Paper

relation benchmarks, re scenarios, neural models, re models

0

0

0

0

11:56

19/08/2021

A Sequence-to-Set Network for Nested Named Entity Recognition

Zeqi Tan, Yongliang Shen, Shuai Zhang and
Weiming Lu, Yueting Zhuang

Keywords Paper

Natural Language Processing, Information Extraction, Named Entities

0

0

0

0

10:38

03/05/2021

IEPT: Instance-Level and Episode-Level Pretext Tasks for Few-Shot Learning

Manli Zhang, Jianhong Zhang, Zhiwu Lu and
Tao Xiang, Mingyu Ding, Songfang Huang

Keywords Paper

self-supervised learning, few-shot learning, episode-level pretext task

0

0

0

0

5:03

26/04/2020

Learning from Explanations with Neural Execution Tree

Ziqi Wang, Yujia Qin, Wenxuan Zhou and
Jun Yan, Qinyuan Ye, Leonardo Neves, Zhiyuan Liu, Xiang Ren

Keywords Paper

0

0

0

0

4:58

02/02/2021

High Dimensional Level Set Estimation with Bayesian Neural Network

Huong Ha, Sunil Gupta, Santu Rana, Svetha Venkatesh

Keywords Paper

0

0

0

0

19:14

04/11/2020

Retiarii: A Deep Learning Exploratory-Training Framework

Quanlu Zhang, Zhenhua Han, Fan Yang and
Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou

Keywords Paper

0

0

0

0

20:05

03/05/2021

QPLEX: Duplex Dueling Multi-Agent Q-Learning

Jianhao Wang, Zhizhou Ren, Terry Liu and
Yang Yu, Chongjie Zhang

Keywords Paper

Dueling structure, Value factorization, Multi-agent reinforcement learning

0

0

0

0

4:52

04/07/2020

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Keywords Paper

Neural training, NMT training, document-level training, NMT objective

0

0

0

0

6:48

22/11/2021

Prototype-based Incremental Few-Shot Segmentation

Fabio Cermelli, Massimiliano Mancini, Yongqin Xian and
Zeynep Akata, Barbara Caputo

Keywords Paper

segmentation, incremental learning, continual learning, few shot learning, any shot learning, prototype, knowledge distillation

0

0

0

0

2:56

25/07/2020

DVGAN: A minimax game for search result diversification combining explicit and implicit features

Jiongnan Liu, Zhicheng Dou, Xiaojie Wang and
Shuqi Lu, Ji-Rong Wen

Keywords Paper

generative adversarial network, search result diversification

0

0

0

0

12:46

26/04/2020

Variational Template Machine for Data-to-Text Generation

Rong Ye, Wenxian Shi, Hao Zhou and
Zhongyu Wei, Lei Li

Keywords Paper

0

0

0

0

4:55

23/08/2020

Targeted data-driven regularization for out-of-distribution generalization

Mohammad Mahdi Kamani, Sadegh Farhang, Mehrdad Mahdavi, James Z. Wang

Keywords Paper

data-driven regularization, out-of-distribution generalization, bilevel programming

0

0

0

0

6:36

02/02/2021

Dual Distribution Alignment Network for Generalizable Person Re-Identification

Peixian Chen, Pingyang Dai, Jianzhuang Liu and
Feng Zheng, Mingliang Xu, Qi Tian, Rongrong Ji

Keywords Paper

0

0

0

0

18:19

12/07/2020

Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Jang-Hyun Kim, Wonho Choo, Hyun Oh Song

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

13:40

04/07/2020

ClarQ: A large-scale and diverse dataset for Clarification Question Generation

Vaibhav Kumar, Alan W Black

Keywords Paper

Clarification Generation, Question answering, classifying questions, downstream question-answering

0

0

0

0

6:17

25/07/2020

Correlated features synthesis and alignment for zero-shot cross-modal retrieval

Xing Xu, Kaiyi Lin, Huimin Lu and
Lianli Gao, Heng Tao Shen

Keywords Paper

zero-shot learning, feature synthesis, cross-modal retrieval

0

0

0

0

9:11

02/02/2021

DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation

Haoyue Bai, Rui Sun, Lanqing Hong and
Fengwei Zhou, Nanyang Ye, Han-Jia Ye, S.-H. Gary Chan, Zhenguo Li

Keywords Paper

0

0

0

0

15:59