Unsupervised Opinion Summarization with Noising and Denoising

04/07/2020

Unsupervised Opinion Summarization with Noising and Denoising

Reinald Kim Amplayo, Mirella Lapata

Keywords: Unsupervised Summarization, supervised models, abstractive summarization, Noising

Abstract Paper Similar Papers

Abstract: The supervised training of high-capacity models on large datasets containing hundreds of thousands of document-summary pairs is critical to the recent success of deep learning techniques for abstractive summarization. Unfortunately, in most domains (other than news) such training data is not available and cannot be easily sourced. In this paper we enable the use of supervised learning for the setting where there are only documents available (e.g., product or business reviews) without ground truth summaries. We create a synthetic dataset from a corpus of user reviews by sampling a review, pretending it is a summary, and generating noisy versions thereof which we treat as pseudo-review input. We introduce several linguistically motivated noise generation functions and a summarization model which learns to denoise the input and generate the original review. At test time, the model accepts genuine reviews and generates a summary containing salient opinions, treating those that do not reach consensus as noise. Extensive automatic and human evaluation shows that our model brings substantial improvements over both abstractive and extractive baselines.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Unsupervised Opinion Summarization as Copycat-Review Generation

Arthur Bražinskas, Mirella Lapata, Ivan Titov

Keywords Paper

Unsupervised Summarization, Copycat-Review Generation, Opinion summarization, automatically summaries

0

0

0

0

10:55

18/07/2021

The Impact of Record Linkage on Learning from Feature Partitioned Data

Richard Nock, Stephen J Hardy, Wilko Henecka and
Hamish Ivey-Law, Jakub Nabaglo, Giorgio Patrini, Guillaume Smith, Brian Thorne

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

6:02

02/02/2021

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

Qizhou Wang, Bo Han, Tongliang Liu and
Gang Niu, Jian Yang, Chen Gong

Keywords Paper

0

0

0

0

14:56

02/02/2021

Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework

Huimin Chen, Yankai Lin, Fanchao Qi and
Jinyi Hu, Peng Li, Jie Zhou, Maosong Sun

Keywords Paper

0

0

0

0

14:57

02/02/2021

Meta Label Correction for Noisy Label Learning

Guoqing Zheng, Ahmed Hassan Awadallah, Susan Dumais

Keywords Paper

0

0

0

0

20:16

16/11/2020

Weakly-Supervised Aspect-Based Sentiment Analysis via Joint Aspect-Sentiment Topic Embedding

Jiaxin Huang, Yu Meng, Fang Guo and
Heng Ji, Jiawei Han

Keywords Paper

extracting aspects, classifying reviews, aspect-based analysis, aspect classification

0

0

0

0

11:23

02/02/2021

Learning to Purify Noisy Labels via Meta Soft Label Corrector

Yichen Wu, Jun Shu, Qi Xie and
Qian Zhao, Deyu Meng

Keywords Paper

0

0

0

0

13:01

26/04/2020

Revisiting Self-Training for Neural Sequence Generation

Junxian He, Jiatao Gu, Jiajun Shen, Marc'Aurelio Ranzato

Keywords Paper

self-training, semi-supervised learning, neural sequence generatioin

0

0

0

0

5:07

06/12/2021

Risk Bounds for Over-parameterized Maximum Margin Classification on Sub-Gaussian Mixtures

Yuan Cao, Quanquan Gu, Mikhail Belkin

Keywords Paper

deep learning, machine learning

0

0

0

0

13:47

16/11/2020

Few-Shot Learning for Opinion Summarization

Arthur Bražinskas, Mirella Lapata, Ivan Titov

Keywords Paper

opinion summarization, automatic text, summary production, summarization mode

0

0

0

0

11:48

04/07/2020

Parallel Corpus Filtering via Pre-trained Language Models

Boliang Zhang, Ajay Nagesh, Kevin Knight

Keywords Paper

machine models, WMT task, Parallel Filtering, Pre-trained Models

0

0

0

0

12:04

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

26/04/2020

Learning from Rules Generalizing Labeled Exemplars

Abhijeet Awasthi, Sabyasachi Ghosh, Rasna Goyal, Sunita Sarawagi

Keywords Paper

Learning from Rules, Learning from limited labeled data, Weakly Supervised Learning

0

0

0

0

5:18

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

02/02/2021

Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning

Elad Amrani, Rami Ben-Ari, Daniel Rotman, Alex Bronstein

Keywords Paper

0

0

0

0

14:04

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

0

0

0

0

12:23

02/02/2021

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and
Ekaterina Artemova, Irina Piontkovskaya

Keywords Paper

0

0

0

0

16:08

08/12/2020

Model-agnostic Methods for Text Classification with Inherent Noise

Kshitij Tayal, Rahul Ghosh, Vipin Kumar

Keywords Paper

0

0

0

0

8:46

02/02/2021

Improving Model Robustness by Adaptively Correcting Perturbation Levels with Active Queries

Kun-Peng Ning, Lue Tao, Songcan Chen, Sheng-Jun Huang

Keywords Paper

0

1

0

0

16:10

04/07/2020

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight

Hengyi Cai, Hongshen Chen, Yonghao Song and
Cheng Zhang, Xiaofang Zhao, Dawei Yin

Keywords Paper

Data Manipulation, Neural Generation, learning, dialogue generation

0

0

0

1

9:39

02/02/2021

Unsupervised Opinion Summarization with Content Planning

Reinald Kim Amplayo, Stefanos Angelidis, Mirella Lapata

Keywords Paper

0

0

0

0

16:03

06/12/2020

Unsupervised Data Augmentation for Consistency Training

Qizhe Xie, Zihang Dai, Eduard Hovy and
Thang Luong, Quoc V Le

Keywords Paper

0

0

0

0

3:29

05/01/2021

Facial Emotion Recognition With Noisy Multi-Task Annotations

Siwei Zhang, Zhiwu Huang, Danda Pani Paudel, Luc Van Gool

Keywords Paper

0

0

0

0

4:48

06/12/2020

Learning to summarize with human feedback

Nisan Stiennon, Long Ouyang, Jeffrey Wu and
Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Keywords Paper

0

0

0

0

3:17

04/07/2020

Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation

Bei Li, Hui Liu, Ziyang Wang and
Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li

Keywords Paper

Context-Aware Translation, document-level translation, document-level NMT, document-level

0

0

0

0

6:42

19/04/2021

Quantifying appropriateness of summarization data for curriculum learning

Ryuji Kano, Takumi Takahashi, Toru Nishino and
Motoki Taniguchi, Tomoki Taniguchi, Tomoko Ohkuma

Keywords Paper

0

0

0

0

5:13

03/05/2021

Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control

Zhuang Liu, Xuanlin Li, Bingyi Kang, trevor darrell

Keywords Paper

Deep Reinforcement Learning, Regularization, Continuous Control, Policy Optimization

0

0

0

0

8:45

06/12/2020

PLANS: Neuro-Symbolic Program Learning from Videos

Raphaël Dang-Nhu

Keywords Paper

0

0

0

0

3:52

14/06/2020

CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus

Florian Kluger, Eric Brachmann, Hanno Ackermann and
Carsten Rother, Michael Ying Yang, Bodo Rosenhahn

Keywords Paper

robust estimator, reinforcement learning, self-supervised, unsupervised, multi-model, ransac, dataset, vanishing points, homography, 3d reconstruction

0

0

0

0

1:00

19/08/2021

Multi-Scale Selective Feedback Network with Dual Loss for Real Image Denoising

Xiaowan Hu, Yuanhao Cai, Zhihong Liu and
Haoqian Wang, Yulun Zhang

Keywords Paper

Computer Vision, Computational Photography, Photometry, Shape from X, Deep Learning

0

0

0

0

9:52

02/02/2021

Beyond Class-Conditional Assumption: A Primary Attempt to Combat Instance-Dependent Label Noise

Pengfei Chen, Junjie Ye, Guangyong Chen and
Jingwei Zhao, Pheng-Ann Heng

Keywords Paper

0

0

0

0

14:08

26/08/2020

Robust Learning from Discriminative Feature Feedback

Sanjoy Dasgupta, Sivan Sabato

Keywords Paper

0

0

0

0

14:37

05/01/2021

EvidentialMix: Learning With Combined Open-Set and Closed-Set Noisy Labels

Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis and
Ian Reid, Gustavo Carneiro

Keywords Paper

0

0

0

0

4:58

06/12/2021

Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training

Yuanhao Cai, Xiaowan Hu, Haoqian Wang and
Yulun Zhang, Hanspeter Pfister, Donglai Wei

Keywords Paper

deep learning, adversarial robustness and security, vision, generative model, graph learning

0

0

0

0

3:05

22/11/2021

Noisy Annotation Refinement for Object Detection

Jiafeng Mao, Qing Yu, Yoko Yamakata, Kiyoharu Aizawa

Keywords Paper

noise-resistant object detection, robust learning, annotation refinement

0

0

0

0

3:00

18/11/2020

Boosting-based reliable model reuse

Yao-Xiang Ding, Zhi-Hua Zhou

Keywords Paper

1

1

0

0

11:59

13/04/2021

Good classifiers are abundant in the interpolating regime

Ryan Theisen, Jason Klusowski, Michael Mahoney

Keywords Paper

0

0

0

0

2:59

16/11/2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

Bryan Chen, Alexander Sax, Francis Lewis and
Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik, Lerrel Pinto

Keywords Paper

0

0

0

0

5:06

04/07/2020

Relabel the Noise: Joint Extraction of Entities and Relations via Cooperative Multiagents

Daoyuan Chen, Yaliang Li, Kai Lei, Ying Shen

Keywords Paper

entity extraction, re-labeling instances, extraction tasks, re-labeling instance

0

0

0

0

11:24