A Joint Model for Document Segmentation and Segment Labeling

04/07/2020

A Joint Model for Document Segmentation and Segment Labeling

Joe Barrow, Rajiv Jain, Vlad Morariu, Varun Manjunatha, Douglas Oard, Philip Resnik

Keywords: Document Labeling, Text segmentation, document segmentation, segment labeling

Abstract Paper Similar Papers

Abstract: Text segmentation aims to uncover latent structure by dividing text from a document into coherent sections. Where previous work on text segmentation considers the tasks of document segmentation and segment labeling separately, we show that the tasks contain complementary information and are best addressed jointly. We introduce Segment Pooling LSTM (S-LSTM), which is capable of jointly segmenting a document and labeling segments. In support of joint training, we develop a method for teaching the model to recover from errors by aligning the predicted and ground truth segments. We show that S-LSTM reduces segmentation error by 30% on average, while also improving segment labeling.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

25/07/2020

Neural unified review recommendation with cross attention

Hongtao Liu, Wenjun Wang, Hongyan Xu and
Qiyao Peng, Pengfei Jiao

Keywords Paper

cross attention, review analysis, recommender system

0

0

0

0

10:12

04/07/2020

Multi-Granularity Interaction Network for Extractive and Abstractive Multi-Document Summarization

Hanqi Jin, Tianming Wang, Xiaojun Wan

Keywords Paper

Extractive Summarization, Extractive , abstractive summarization, Multi-Granularity Network

0

0

0

0

10:38

19/08/2021

Step-Wise Hierarchical Alignment Network for Image-Text Matching

Zhong Ji, Kexin Chen, Haoran Wang

Keywords Paper

Computer Vision, Language and Vision

0

0

0

0

6:07

16/11/2020

Better Highlighting: Creating Sub-Sentence Summary Highlights

Sangwoo Cho, Kaiqiang Song, Chen Li and
Dong Yu, Hassan Foroosh, Fei Liu

Keywords Paper

highlighting, summarization, abstractive summarizers, determinantal processes

0

0

0

0

12:02

19/04/2021

Globalizing BERT-based transformer architectures for long document summarization

Quentin Grail, Julien Perez, Eric Gaussier

Keywords Paper

0

0

0

0

11:53

08/12/2020

Aspect-based Document Similarity for Research Papers

Malte Ostendorff, Terry Ruas, Till Blume and
Bela Gipp, Georg Rehm

Keywords Paper

0

0

0

0

14:50

19/04/2021

An end-to-end model for entity-level relation extraction using multi-instance learning

Markus Eberts, Adrian Ulges

Keywords Paper

0

0

0

0

11:33

06/12/2021

Referring Transformer: A One-step Approach to Multi-task Visual Grounding

Muchen Li, Leonid Sigal

Keywords Paper

transformers, vision

0

0

0

0

7:54

16/11/2020

Text Segmentation by Cross Segment Attention

Michal Lukasik, Boris Dadachev, Kishore Papineni, Gonçalo Simões

Keywords Paper

document segmentation, nlp tasks, downstream tasks, information retrieval

0

0

0

0

11:17

16/11/2020

What Have We Achieved on Text Summarization?

Dandan Huang, Leyang Cui, Sen Yang and
Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang

Keywords Paper

text summarization, deep learning, automatic summarizers, summarization systems

0

0

0

0

11:20

14/06/2020

Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation

Gen Luo, Yiyi Zhou, Xiaoshuai Sun and
Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji

Keywords Paper

referring expression comprehension, referring expression segmentation, multi-task learning, visual grounding, object detection

0

0

0

0

5:00

08/12/2020

Joint Training for Learning Cross-lingual Embeddings with Sub-word Information without Parallel Corpora

Ali Hakimi Parizi, Paul Cook

Keywords Paper

0

0

0

0

12:38

26/04/2020

A Mutual Information Maximization Perspective of Language Representation Learning

Lingpeng Kong, Cyprien de Masson d'Autume, Lei Yu and
Wang Ling, Zihang Dai, Dani Yogatama

Keywords Paper

0

0

0

0

4:13

08/12/2020

Mixup-Transformer: Dynamic Data Augmentation for NLP Tasks

Lichao Sun, Congying Xia, Wenpeng Yin and
Tingting Liang, Philip Yu, Lifang He

Keywords Paper

0

0

0

0

9:52

14/06/2020

Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition

Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang

Keywords Paper

data augmentation, text recognition, joint training

0

0

0

0

0:59

04/07/2020

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Keywords Paper

Neural training, NMT training, document-level training, NMT objective

0

0

0

0

6:48

04/07/2020

Jointly Learning to Align and Summarize for Neural Cross-Lingual Summarization

Yue Cao, Hui Liu, Xiaojun Wan

Keywords Paper

Neural Summarization, Cross-lingual summarization, cross-lingual training, pipeline methods

0

0

0

0

9:30

19/04/2021

The role of syntactic planning in compositional image captioning

Emanuele Bugliarello, Desmond Elliott

Keywords Paper

0

0

0

0

9:38

04/07/2020

Emerging Cross-lingual Structure in Pretrained Language Models

Alexis Conneau, Shijie Wu, Haoran Li and
Luke Zettlemoyer, Veselin Stoyanov

Keywords Paper

multilingual modeling, cross-lingual transfer, transfer, Cross-lingual Models

0

0

0

0

11:49

16/11/2020

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

Keywords Paper

document extraction, semantic task, image resolution, structure extraction

0

0

0

0

11:26

14/06/2020

Learning Selective Self-Mutual Attention for RGB-D Saliency Detection

Nian Liu, Ni Zhang, Junwei Han

Keywords Paper

rgb-d saliency detection, middle fusion, self-attention, mutual-attention, non-local network, two-stream cnn

0

0

0

0

1:01

19/08/2021

Dependent Multi-Task Learning with Causal Intervention for Image Captioning

Wenqing Chen, Jidong Tian, Caoyun Fan and
Hao He, Yaohui Jin

Keywords Paper

Machine Learning, Transfer, Adaptation, Multi-task Learning, Natural Language Generation, Language and Vision

0

0

0

0

12:02

06/12/2021

UniDoc: Unified Pretraining Framework for Document Understanding

Jiuxiang Gu, Jason Kuen, Vlad I. Morariu and
Handong Zhao, Rajiv Jain, Nikolaos Barmpalios, Ani Nenkova, Tong Sun

Keywords Paper

self-supervised learning, transformers

0

0

0

0

6:45

19/04/2021

Contrastive multi-document question generation

Woon Sang Cho, Yizhe Zhang, Sudha Rao and
Asli Celikyilmaz, Chenyan Xiong, Jianfeng Gao, Mengdi Wang, Bill Dolan

Keywords Paper

0

0

0

0

10:26

30/11/2020

Scale-Aware Polar Representation for Arbitrarily-Shaped Text Detection

Yanguang Bi, Zhiqiang Hu

Keywords Paper

0

0

0

0

9:56

16/11/2020

Dynamic Data Selection and Weighting for Iterative Back-Translation

Zi-Yi Dou, Antonios Anastasopoulos, Graham Neubig

Keywords Paper

neural translation, neural nmt, nmt, domain adaptation

0

0

0

0

11:30

26/04/2020

Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework

Zirui Wang, Jiateng Xie, Ruochen Xu and
Yiming Yang, Graham Neubig, Jaime G. Carbonell

Keywords Paper

Cross-lingual Representation

0

0

0

0

4:53

16/11/2020

Exploring and Predicting Transferability across NLP Tasks

Tu Vu, Tong Wang, Tsendsuren Munkhdalai and
Alessandro Sordoni, Adam Trischler, Andrew Mattarella-Micke, Subhransu Maji, Mohit Iyyer

Keywords Paper

language modeling, nlp tasks, text classification, question answering

0

0

0

0

10:55

16/11/2020

Pre-training for Abstractive Document Summarization by Reinstating Source Text

Yanyan Zou, Xingxing Zhang, Wei Lu and
Furu Wei, Ming Zhou

Keywords Paper

abstractive summarization, sequence-to-sequence problem, sentence reordering, next generation

0

0

0

0

10:25

03/05/2021

Structured Prediction as Translation between Augmented Natural Languages

Giovanni Paolini, Ben Athiwaratkun, Jason Krone and
Jie Ma, Alessandro Achille, RISHITA ANUBHAI, Cicero Nogueira dos Santos, Bing Xiang, Stefano Soatto

Keywords Paper

sequence to sequence, structured prediction, language models, transfer learning, few-shot learning, multi-task learning, generative modeling

0

0

0

0

12:16

06/12/2021

Diversity Enhanced Active Learning with Strictly Proper Scoring Rules

Wei Tan, Lan Du, Wray Buntine

Keywords Paper

machine learning, active learning

0

0

0

0

13:21

03/05/2021

Filtered Inner Product Projection for Crosslingual Embedding Alignment

Vin Sachidananda, Ziyi Yang, Chenguang Zhu

Keywords Paper

multilingual representations, natural language processing, word embeddings

0

0

0

0

5:22

25/07/2020

Leveraging adversarial training in self-learning for cross-lingual text classification

Xin Dong, Yaxin Zhu, Yupeng Zhang and
Zuohui Fu, Dongkuan Xu, Sen Yang, Gerard Melo

Keywords Paper

multilingual, semantics, text classification, cross-lingual

0

0

0

0

9:19

22/11/2021

Domain Attention Consistency for Multi-Source Domain Adaptation

Zhongying Deng, Kaiyang Zhou, Yongxin Yang, Tao Xiang

Keywords Paper

Transferable Attribute Learning, Domain Attention Consistency, Multi-Source Domain Adaptation

0

0

0

0

9:24

14/06/2020

12-in-1: Multi-Task Vision and Language Representation Learning

Jiasen Lu, Vedanuj Goswami, Marcus Rohrbach and
Devi Parikh, Stefan Lee

Keywords Paper

multi-task learning, visiolinguistic representations, visual question answering, image retrieval, referring expressions, multimodal learning, transformers, visual grounding, vision and language pretraining, bert

0

0

0

0

1:02

02/02/2021

Learning Intact Features by Erasing-Inpainting for Few-shot Classification

Junjie Li, Zilei Wang, Xiaoming Hu

Keywords Paper

0

0

0

0

15:15

04/07/2020

Aspect Sentiment Classification with Document-level Sentiment Preference Modeling

Xiao Chen, Changlong Sun, Jingjing Wang and
Shoushan Li, Luo Si, Min Zhang, Guodong Zhou

Keywords Paper

Aspect Classification, ASC, independent problem, information problem

0

0

0

0

10:12

06/12/2021

Re-ranking for image retrieval and transductive few-shot classification

Xi SHEN, Yang Xiao, Shell Hu and
Othman Sbai, Mathieu Aubry

Keywords Paper

machine learning, graph learning, meta learning, few shot learning

0

0

0

0

5:46

07/09/2020

Advancing weakly supervised cross-domain alignment with optimal transport

Siyang Yuan, Ke Bai, Liqun Chen and
Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin Duke

Keywords Paper

Optimal Transport, Cross Domain Alignment

0

0

0

0

10:04

04/07/2020

Benefits of Intermediate Annotations in Reading Comprehension

Dheeru Dua, Sameer Singh, Matt Gardner

Keywords Paper

Reading Comprehension, data collection, data process, Intermediate Annotations

0

0

0

0

6:01