Form2Seq : A Framework for Higher-Order Form Structure Extraction

16/11/2020

Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

Keywords: document extraction, semantic task, image resolution, structure extraction

Abstract: Document structure extraction has been a widely researched area for decades, with recent works performing it as a semantic segmentation task over document images using fully convolutional networks. Such methods are limited by image resolution, so they fail to disambiguate structures in dense regions, which appear commonly in forms. To mitigate this, we propose Form2Seq, a novel sequence-to-sequence (Seq2Seq) inspired framework for structure extraction using text, with a specific focus on forms, which leverages the relative spatial arrangement of structures. We discuss two tasks: 1) classification of low-level constituent elements (TextBlock and empty fillable Widget) into ten types such as field captions, list items, and others; 2) grouping of lower-level elements into higher-order constructs, such as Text Fields, ChoiceFields and ChoiceGroups, used as information collection mechanisms in forms. To achieve this, we arrange the constituent elements linearly in natural reading order and feed their spatial and textual representations to the Seq2Seq framework, which sequentially outputs a prediction for each element depending on the final task. We modify Seq2Seq for the grouping task and discuss improvements obtained through cascaded end-to-end training of the two tasks versus training in isolation. Experimental results show the effectiveness of our text-based approach, achieving an accuracy of 90% on the classification task and F1 scores of 75.82, 86.01, and 61.63 on the three group types listed above, respectively, outperforming segmentation baselines. Further, we show that our framework achieves state-of-the-art results for table structure recognition on the ICDAR 2013 dataset.
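The abstract mentions arranging constituent elements linearly in natural reading order before feeding them to the Seq2Seq model. The sketch below illustrates one simple way such an ordering could be computed from bounding boxes. It is not the authors' procedure: the `Element` class, the `reading_order` function, and the `line_tol` threshold are hypothetical names and choices introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Element:
    """Hypothetical form element: text plus a top-left anchored box."""
    text: str
    x: float  # left edge
    y: float  # top edge
    h: float  # height


def reading_order(elements, line_tol=0.5):
    """Order elements top-to-bottom, then left-to-right within a line.

    Assumption: two elements share a line when their top edges differ by
    at most line_tol times the height of the line's first element.
    """
    ordered = sorted(elements, key=lambda e: (e.y, e.x))
    lines = []
    for e in ordered:
        if lines and abs(e.y - lines[-1][0].y) <= line_tol * lines[-1][0].h:
            lines[-1].append(e)  # close enough vertically: same line
        else:
            lines.append([e])    # otherwise start a new line
    # Flatten, sorting each line by its left edge.
    return [e for line in lines for e in sorted(line, key=lambda e: e.x)]


if __name__ == "__main__":
    form = [
        Element("[widget]", 80, 98, 12),
        Element("Email", 10, 130, 12),
        Element("Name", 10, 100, 12),
        Element("Title", 10, 20, 14),
    ]
    print([e.text for e in reading_order(form)])
```

The resulting sequence is the kind of linear input a sequence model could consume; real forms with multiple columns or rotated text would need a more careful ordering than this line-grouping heuristic.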

The talk and the respective paper were published at the EMNLP 2020 virtual conference.

Similar Papers

19/04/2021

StructSum: Summarization via structured representations

Vidhisha Balachandran, Artidoro Pagnoni, Jay Yoon Lee, Dheeraj Rajagopal, Jaime Carbonell, Yulia Tsvetkov

6:32

16/11/2020

Multilevel Text Alignment with Cross-Document Attention

Xuhui Zhou, Nikolaos Pappas, Noah A. Smith

Keywords: text alignment, citation recommendation, plagiarism detection, predicting relationships

11:45

26/04/2020

Variational Template Machine for Data-to-Text Generation

Rong Ye, Wenxian Shi, Hao Zhou, Zhongyu Wei, Lei Li

4:55

04/07/2020

A Probabilistic Generative Model for Typographical Analysis of Early Modern Printing

Kartik Goyal, Chris Dyer, Christopher Warren, Maxwell G'Sell, Taylor Berg-Kirkpatrick

Keywords: Typographical Printing, clustering images, archiving process, Early printing

7:07

30/11/2020

Image Captioning through Image Transformer

Sen He, Wentong Liao, Hamed R. Tavakoli, Michael Yang, Bodo Rosenhahn, Nicolas Pugeault

9:49

15/11/2020

Structure Interpretation of Text Formats

Sumit Gulwani, Vu Le, Arjun Radhakrishna, Ivan Radiček, Mohammad Raza

Keywords: format diversity, program synthesis, data extraction

15:16

14/06/2020

ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection

Yuxin Wang, Hongtao Xie, Zheng-Jun Zha, Mengting Xing, Zilong Fu, Yongdong Zhang

Keywords: scene text detection, arbitrary shapes, false-positive suppression, large scale variance

1:01

19/08/2021

Text-based Person Search via Multi-Granularity Embedding Learning

Chengji Wang, Zhiming Luo, Yaojin Lin, Shaozi Li

Keywords: Computer Vision, Language and Vision, Recognition

12:25

30/11/2020

Query by Strings and Return Ranking Word Regions with Only One Look

Peng Zhao, Wenyuan Xue, Qingyong Li, Siqi Cai

6:46

16/11/2020

Compressive Summarization with Plausibility and Salience Modeling

Shrey Desai, Jiacheng Xu, Greg Durrett

Keywords: compressive systems, compressions, rouge, pre-trained model

12:04

19/08/2021

Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention

Wei Suo, MengYang Sun, Peng Wang, Qi Wu

Keywords: Computer Vision, Language and Vision, Structural and Model-Based Approaches, Knowledge Representation and Reasoning

17:31

06/12/2021

A Multi-Implicit Neural Representation for Fonts

Pradyumna Reddy, Zhifei Zhang, Matthew Fisher, Hailin Jin, Zhaowen Wang, Niloy Mitra

Keywords: deep learning, representation learning

8:42

04/07/2020

Benchmarking Multimodal Regex Synthesis with Complex Structures

Xi Ye, Qiaochu Chen, Isil Dillig, Greg Durrett

Keywords: Multimodal Synthesis, regular generation, regex tasks, StackOverflow

11:51

19/08/2021

Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing

Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

Keywords: Natural Language Processing, Natural Language Semantics

12:37

19/08/2021

Generating Senses and RoLes: An End-to-End Model for Dependency- and Span-based Semantic Role Labeling

Rexhina Blloshmi, Simone Conia, Rocco Tripodi, Roberto Navigli

Keywords: Natural Language Processing, Natural Language Semantics, Natural Language Generation

15:18

25/07/2020

SetRank: Learning a permutation-invariant ranking model for information retrieval

Liang Pang, Jun Xu, Qingyao Ai, Yanyan Lan, Xueqi Cheng, Jirong Wen

Keywords: learning to rank, permutation-invariant ranking model

14:02

19/08/2021

A Sequence-to-Set Network for Nested Named Entity Recognition

Zeqi Tan, Yongliang Shen, Shuai Zhang, Weiming Lu, Yueting Zhuang

Keywords: Natural Language Processing, Information Extraction, Named Entities

10:38

02/02/2021

Confidence-aware Non-repetitive Multimodal Transformers for TextCaps

Zhaokai Wang, Renda Bao, Qi Wu, Si Liu

15:04

22/11/2021

SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder

Jonghyun Kim, Gen Li, Cheolkon Jung, Joongkyu Kim

Keywords: image-to-image translation, semantic image synthesis, image generation, superpixel, style encoder, graph self-attention

2:52

06/12/2021

CentripetalText: An Efficient Text Instance Representation for Scene Text Detection

Tao Sheng, Jie Chen, Zhouhui Lian

Keywords: robustness

9:55

18/11/2020

Enhancing topic models by incorporating explicit and implicit external knowledge

Yang Hong, Xinhuai Tang, Tiancheng Tang, Yunlong Hu, Jintai Tian

9:57

02/02/2021

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps

Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu

15:58

02/02/2021

FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding

Yuwei Fang, Shuohang Wang, Zhe Gan, Siqi Sun, Jingjing Liu

17:39

06/12/2021

TopicNet: Semantic Graph-Guided Topic Discovery

Zhibin Duan, Yishi Xu, Bo Chen, Dongsheng Wang, Chaojie Wang, Mingyuan Zhou

Keywords: optimization, generative model, graph learning

10:15

14/06/2020

A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension

Yue Liao, Si Liu, Guanbin Li, Fei Wang, Yanjie Chen, Chen Qian, Bo Li

Keywords: referring expression comprehension, cross modality, correlation filtering, real-time, one stage

1:00

05/12/2020

Knowledge-enhanced named entity disambiguation for short text

Zhifan Feng, Qi Wang, Wenbin Jiang, Yajuan Lyu, Yong Zhu

14:40

12/09/2020

Plausible Reasoning about EL-Ontologies using Concept Interpolation

Yazmín Ibáñez-García, Víctor Gutiérrez-Basulto, Steven Schockaert

Keywords: Description logics-General, Commonsense reasoning-General, Knowledge representation languages-General, Concept formation, similarity-based reasoning-General

15:50

04/07/2020

Discourse-Aware Neural Extractive Text Summarization

Jiacheng Xu, Zhe Gan, Yu Cheng, Jingjing Liu

Keywords: Discourse-Aware Summarization, document encoding, extractive selection, text models

11:50

04/07/2020

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension

Hongyu Gong, Yelong Shen, Dian Yu, Jianshu Chen, Dong Yu

Keywords: Long-Text Comprehension, machine comprehension, MRC, question answering

11:25

30/11/2020

Branch Interaction Network for Person Re-identification

Zengming Tang, Jun Huang

4:51

06/12/2020

Hierarchical Granularity Transfer Learning

Shaobo Min, Hongtao Xie, Hantao Yao, Xuran Deng, Zheng-Jun Zha, Yongdong Zhang

3:07

19/04/2021

Globalizing BERT-based transformer architectures for long document summarization

Quentin Grail, Julien Perez, Eric Gaussier

11:53

04/07/2020

A Top-down Neural Architecture towards Text-level Parsing of Discourse Rhetorical Structure

Longyin Zhang, Yuqing Xing, Fang Kong, Peifeng Li, Guodong Zhou

Keywords: Text-level Structure, natural understanding, down-stream applications, text-level parsing

11:25

19/04/2021

Contrastive multi-document question generation

Woon Sang Cho, Yizhe Zhang, Sudha Rao, Asli Celikyilmaz, Chenyan Xiong, Jianfeng Gao, Mengdi Wang, Bill Dolan

10:26

26/04/2020

Plug and Play Language Models: A Simple Approach to Controlled Text Generation

Sumanth Dathathri, Andrea Madotto, Janice Lan, Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, Rosanne Liu

Keywords: controlled text generation, generative models, conditional generative models, language modeling, transformer

4:58

16/11/2020

Weakly-Supervised Aspect-Based Sentiment Analysis via Joint Aspect-Sentiment Topic Embedding

Jiaxin Huang, Yu Meng, Fang Guo, Heng Ji, Jiawei Han

Keywords: extracting aspects, classifying reviews, aspect-based analysis, aspect classification

11:23

06/12/2021

HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning

Shiming Chen, Guosen Xie, Yang Liu, Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao

Keywords: generative model, domain adaptation

9:19

02/02/2021

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

Pengfei Wang, Chengquan Zhang, Fei Qi, Shanshan Liu, Xiaoqiang Zhang, Pengyuan Lyu, Junyu Han, Jingtuo Liu, Errui Ding, Guangming Shi

18:06

06/12/2021

Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection

Jingjing Li, Wei Ji, Qi Bi, Cheng Yan, Miao Zhang, Yongri Piao, Huchuan Lu, Li Cheng

Keywords: vision

9:03

16/11/2020

Topic Modeling in Embedding Spaces

Adji Bousso Dieng, Francisco Ruiz, David Blei

Keywords: generative documents, topic modeling, topic models, embedded model

12:46