Globalizing BERT-based transformer architectures for long document summarization

19/04/2021

Globalizing BERT-based transformer architectures for long document summarization

Quentin Grail, Julien Perez, Eric Gaussier

Keywords:

Abstract Paper Similar Papers

Abstract: Fine-tuning a large language model on downstream tasks has become a commonly adopted process in the Natural Language Processing (NLP) (CITATION). However, such a process, when associated with the current transformer-based (CITATION) architectures, shows several limitations when the target task requires to reason with long documents. In this work, we introduce a novel hierarchical propagation layer that spreads information between multiple transformer windows. We adopt a hierarchical approach where the input is divided in multiple blocks independently processed by the scaled dot-attentions and combined between the successive layers. We validate the effectiveness of our approach on three extractive summarization corpora of long scientific papers and news articles. We compare our approach to standard and pre-trained language-model-based summarizers and report state-of-the-art results for long document summarization and comparable results for smaller document summarization.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EACL 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Variational Inference for Learning Representations of Natural Language Edits

Edison Marrese-Taylor, Machel Reid, Yutaka Matsuo

Keywords Paper

0

0

0

0

19:28

16/11/2020

Text Segmentation by Cross Segment Attention

Michal Lukasik, Boris Dadachev, Kishore Papineni, Gonçalo Simões

Keywords Paper

document segmentation, nlp tasks, downstream tasks, information retrieval

0

0

0

0

11:17

16/11/2020

Multi-Stage Pre-training for Low-Resource Domain Adaptation

Rong Zhang, Revanth Gangi Reddy, Md Arafat Sultan and
Vittorio Castelli, Anthony Ferritto, Radu Florian, Efsun Sarioglu Kayi, Salim Roukos, Avi Sil, Todd Ward

Keywords Paper

nlp tasks, fine-tuning, auxiliary tasks, lm transfer

0

0

0

0

6:56

04/07/2020

Improving Segmentation for Technical Support Problems

Kushal Chauhan, Abhirut Gupta

Keywords Paper

Segmentation, Technical Problems, attempted resolution, problem resolution

0

0

0

0

11:03

16/11/2020

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

Keywords Paper

document extraction, semantic task, image resolution, structure extraction

0

0

0

0

11:26

06/12/2021

UniDoc: Unified Pretraining Framework for Document Understanding

Jiuxiang Gu, Jason Kuen, Vlad I. Morariu and
Handong Zhao, Rajiv Jain, Nikolaos Barmpalios, Ani Nenkova, Tong Sun

Keywords Paper

self-supervised learning, transformers

0

0

0

0

6:45

04/07/2020

Neural Syntactic Preordering for Controlled Paraphrase Generation

Tanya Goyal, Greg Durrett

Keywords Paper

Controlled Generation, Paraphrasing sentences, machine translation, Neural Preordering

0

0

0

0

11:37

19/08/2021

Automatically Paraphrasing via Sentence Reconstruction and Round-trip Translation

Zilu Guo, Zhongqiang Huang, Kenny Q. Zhu and
Guandan Chen, Kaibo Zhang, Boxing Chen, Fei Huang

Keywords Paper

Natural Language Processing, Machine Translation, Natural Language Generation, NLP Applications and Tools

0

0

0

0

13:53

16/11/2020

Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation

Pei Zhang, Boxing Chen, Niyu Ge, Kai Fan

Keywords Paper

document-level translation, document-level systems, context-aware architecture, transformer

0

0

0

0

6:36

04/07/2020

Generalizing Natural Language Analysis through Span-relation Representations

Zhengbao Jiang, Wei Xu, Jun Araki, Graham Neubig

Keywords Paper

Natural Analysis, Natural processing, dependency parsing, semantic labeling

0

0

0

0

8:30

06/12/2020

Heavy-tailed Representations, Text Polarity Classification & Data Augmentation

Hamid Jalalzai, Pierre Colombo, Chloé Clavel and
Eric Gaussier, Giovanna Varni, Emmanuel Vignon, Anne Sabourin

Keywords Paper

0

0

0

0

2:57

08/12/2020

Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations

Xingyuan Zhao, Satoru Ozaki, Antonios Anastasopoulos and
Graham Neubig, Lori Levin

Keywords Paper

0

0

0

0

13:52

19/08/2021

Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing

Yuntao Li, Bei Chen, Qian Liu and
Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

Keywords Paper

Natural Language Processing, Natural Language Semantics

0

0

0

0

12:37

16/11/2020

ETC: Encoding Long and Structured Inputs in Transformers

Joshua Ainslie, Santiago Ontanon, Chris Alberti and
Vaclav Cvicek, Zachary Fisher, Philip Pham, Anirudh Ravula, Sumit Sanghai, Qifan Wang, Li Yang

Keywords Paper

natural tasks, encoding inputs, transformer models, transformer architecture

0

0

0

0

11:04

15/06/2020

Multi-modal synthesis of regular expressions

Qiaochu Chen, Xinyu Wang, Xi Ye and
Greg Durrett, Isil Dillig

Keywords Paper

Programming by Example, Programming by Natural Languages, Program Synthesis, Regular Expression

0

0

0

0

16:17

04/07/2020

TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing

Ziqing Yang, Yiming Cui, Zhipeng Chen and
Wanxiang Che, Ting Liu, Shijin Wang, Guoping Hu

Keywords Paper

Natural Processing, supervised tasks, text classification, reading comprehension

0

0

0

0

10:36

04/07/2020

SPECTER: Document-level Representation Learning using Citation-informed Transformers

Arman Cohan, Sergey Feldman, Iz Beltagy and
Doug Downey, Daniel Weld

Keywords Paper

Document-level Learning, Representation learning, natural systems, classification

0

0

0

0

13:07

03/05/2021

CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding

Yanru Qu, Dinghan Shen, Yelong Shen and
Sandra Sajeev, Weizhu Chen, Jiawei Han

Keywords Paper

consistency training, contrastive learning, data augmentation, natural language understanding

0

0

0

0

6:02

04/07/2020

Abstract Syntax as Interlingua: Scaling Up the Grammatical Framework from Controlled Languages to Robust Pipelines

Aarne Ranta, Krasimir Angelov, Normunds Gruzitis, Prasanth Kolachina

Keywords Paper

Abstract Syntax, controlled implementations, accurate generation, accurate translation

0

0

0

0

13:59

02/02/2021

Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling

Wenxuan Zhou, Kevin Huang, Tengyu Ma, Jing Huang

Keywords Paper

0

0

0

0

16:11

16/11/2020

ToTTo: A Controlled Table-To-Text Generation Dataset

Ankur Parikh, Xuezhi Wang, Sebastian Gehrmann and
Manaal Faruqui, Bhuwan Dhingra, Diyi Yang, Dipanjan Das

Keywords Paper

controlled task, high-precision generation, totto, dataset process

0

0

0

0

11:53

14/06/2020

Image Search With Text Feedback by Visiolinguistic Attention Learning

Yanbei Chen, Shaogang Gong, Loris Bazzani

Keywords Paper

vision and language, image search, text feedback, attention mechanism, transformer, multimodal learning, representation learning, composition, image retrieval, interactive image search

0

0

0

0

1:00

04/07/2020

Fact-based Text Editing

Hayate Iso, Chao Qiao, Hang Li

Keywords Paper

Fact-based Editing, text task, text editing, automatically dataset

0

0

0

0

12:41

03/05/2021

Structured Prediction as Translation between Augmented Natural Languages

Giovanni Paolini, Ben Athiwaratkun, Jason Krone and
Jie Ma, Alessandro Achille, RISHITA ANUBHAI, Cicero Nogueira dos Santos, Bing Xiang, Stefano Soatto

Keywords Paper

sequence to sequence, structured prediction, language models, transfer learning, few-shot learning, multi-task learning, generative modeling

0

0

0

0

12:16

26/04/2020

Neural Machine Translation with Universal Visual Representation

Zhuosheng Zhang, Kehai Chen, Rui Wang and
Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Keywords Paper

Neural Machine Translation, Visual Representation, Multimodal Machine Translation, Language Representation

0

0

0

0

4:50

03/05/2021

Filtered Inner Product Projection for Crosslingual Embedding Alignment

Vin Sachidananda, Ziyi Yang, Chenguang Zhu

Keywords Paper

multilingual representations, natural language processing, word embeddings

0

0

0

0

5:22

19/04/2021

StructSum: Summarization via structured representations

Vidhisha Balachandran, Artidoro Pagnoni, Jay Yoon Lee and
Dheeraj Rajagopal, Jaime Carbonell, Yulia Tsvetkov

Keywords Paper

0

0

0

0

6:32

04/07/2020

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Keywords Paper

Neural training, NMT training, document-level training, NMT objective

0

0

0

0

6:48

08/12/2020

Aspect-based Document Similarity for Research Papers

Malte Ostendorff, Terry Ruas, Till Blume and
Bela Gipp, Georg Rehm

Keywords Paper

0

0

0

0

14:50

04/07/2020

Efficient Strategies for Hierarchical Text Classification: External Knowledge and Auxiliary Tasks

Kervy Rivas Rojas, Gina Bustamante, Arturo Oncevay, Marco Antonio Sobrevilla Cabezudo

Keywords Paper

Hierarchical Classification, External Tasks, sequence-to-sequence problem, auxiliary bottom-up-classification

0

0

0

0

5:44

08/12/2020

Best Practices for Data-Efficient Modeling in NLG:How to Train Production-Ready Neural Models with Less Data

Ankit Arun, Soumya Batra, Vikas Bhardwaj and
Ashwini Challa, Pinar Donmez, Peyman Heidari, Hakan Inan, Shashank Jain, Anuj Kumar, Shawn Mei, Karthik Mohan, Michael White

Keywords Paper

0

0

0

0

15:01

02/02/2021

Contrastive Triple Extraction with Generative Transformer

Hongbin Ye, Ningyu Zhang, Shumin Deng and
Mosha Chen, Chuanqi Tan, Fei Huang, Huajun Chen

Keywords Paper

0

0

0

0

18:52

26/04/2020

Encoding word order in complex embeddings

Benyou Wang, Donghao Zhao, Christina Lioma and
Qiuchi Li, Peng Zhang, Jakob Grue Simonsen

Keywords Paper

word embedding, complex-valued neural network, position embedding

0

0

0

0

4:51

04/07/2020

Machine Reading of Historical Events

Or Honovich, Lucas Torroba Hennigen, Omri Abend, Shay B. Cohen

Keywords Paper

Machine Events, Machine reading, NLP, classification

0

0

0

0

12:01

02/02/2021

A Unified Pretraining Framework for Passage Ranking and Expansion

Ming Yan, Chenliang Li, Bin Bi and
Wei Wang, Songfang Huang

Keywords Paper

0

0

0

0

16:33

19/10/2020

Learning to generate reformulation actions for scalable conversational query understanding

Zihan Xu, Jiangang Zhu, Ling Geng and
Yang Yang, Bojia Lin, Daxin Jiang

Keywords Paper

contextual query reformulation, question answering, conversational query understanding

0

0

0

0

6:58

02/02/2021

Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax

Hao Fei, Fei Li, Bobo Li, Donghong Ji

Keywords Paper

0

0

0

0

16:10

08/12/2020

The ApposCorpus: a new multilingual, multi-domain dataset for factual appositive generation

Yova Kementchedjhieva, Di Lu, Joel Tetreault

Keywords Paper

0

0

0

0

14:55

08/12/2020

RANCC: Rationalizing Neural Networks via Concept Clustering

Housam Khalifa Bashier, Mi-Young Kim, Randy Goebel

Keywords Paper

0

0

0

0

12:10

06/12/2021

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling

Hongyu Gong, Yun Tang, Juan Pino, Xian Li

Keywords Paper

0

0

0

0

10:04