Structural encoding and pre-training matter: Adapting BERT for table-based fact verification

19/04/2021

Structural encoding and pre-training matter: Adapting BERT for table-based fact verification

Rui Dong, David Smith

Keywords:

Abstract Paper Similar Papers

Abstract: Growing concern with online misinformation has encouraged NLP research on fact verification. Since writers often base their assertions on structured data, we focus here on verifying textual statements given evidence in tables. Starting from the Table Parsing (TAPAS) model developed for question answering (Herzig et al., 2020), we find that modeling table structure improves a language model pre-trained on unstructured text. Pre-training language models on English Wikipedia table data further improves performance. Pre-training on a question answering task with column-level cell rank information achieves the best performance. With improved pre-training and cell embeddings, this approach outperforms the state-of-the-art Numerically-aware Graph Neural Network table fact verification model (GNN-TabFact), increasing statement classification accuracy from 72.2% to 73.9% even without modeling numerical information. Incorporating numerical information with cell rankings and pre-training on a question-answering task increases accuracy to 76%. We further analyze accuracy on statements implicating single rows or multiple rows and columns of tables, on different numerical reasoning subtasks, and on generalizing to detecting errors in statements derived from the ToTTo table-to-text generation dataset.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EACL 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

TaPas: Weakly Supervised Table Parsing via Pre-training

Jonathan Herzig, Pawel Krzysztof Nowak, Thomas Müller and
Francesco Piccinno, Julian Eisenschlos

Keywords Paper

Weakly Parsing, semantic task, question tables, SQA

0

0

0

0

12:49

02/02/2021

A Hybrid Probabilistic Approach for Table Understanding

Kexuan Sun, Harsha Rayudu, Jay Pujara

Keywords Paper

0

0

0

0

18:27

26/04/2020

TabFact: A Large-scale Dataset for Table-based Fact Verification

Wenhu Chen, Hongmin Wang, Jianshu Chen and
Yunkai Zhang, Hong Wang, Shiyang Li, Xiyou Zhou, William Yang Wang

Keywords Paper

Fact Verification, Tabular Data, Symbolic Reasoning

0

0

0

0

5:49

05/01/2021

Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context

Xinyi Zheng, Douglas Burdick, Lucian Popa and
Xu Zhong, Nancy Xin Ru Wang

Keywords Paper

0

0

0

0

5:08

19/04/2021

Expanding, retrieving and infilling: Diversifying cross-domain question generation with flexible templates

Xiaojing Yu, Anxiao Jiang

Keywords Paper

0

0

0

0

11:40

04/07/2020

A Methodology for Creating Question Answering Corpora Using Inverse Data Annotation

Jan Deriu, Katsiaryna Mlynchyk, Philippe Schläpfer and
Alvaro Rodrigo, Dirk von Grünigen, Nicolas Kaiser, Kurt Stockinger, Eneko Agirre, Mark Cieliebak

Keywords Paper

question answering, annotation, Inverse Annotation, intermediate representation

0

0

0

0

12:51

26/04/2020

CLN2INV: Learning Loop Invariants with Continuous Logic Networks

Gabriel Ryan, Justin Wong, Jianan Yao and
Ronghui Gu, Suman Jana

Keywords Paper

loop invariants, deep learning, logic learning

0

0

0

0

5:12

02/02/2021

Entity Guided Question Generation with Contextual Structure and Sequence Information Capturing

Qingbao Huang, Mingyi Fu, Linzhang Mo and
Yi Cai, Jingyun Xu, Pijian Li, Qing Li, Ho-fung Leung

Keywords Paper

0

0

0

0

19:41

02/02/2021

SMART: A Situation Model for Algebra Story Problems via Attributed Grammar

Yining Hong, Qing Li, Ran Gong and
Daniel Ciao, Siyuan Huang, Song-Chun Zhu

Keywords Paper

0

0

0

0

15:29

04/07/2020

A Novel Cascade Binary Tagging Framework for Relational Triple Extraction

Zhepei Wei, Jianlin Su, Yue Wang and
Yuan Tian, Yi Chang

Keywords Paper

Relational Extraction, large-scale construction, overlapping problem, relational task

0

0

0

0

11:05

16/11/2020

Table Fact Verification with Structure-Aware Transformer

Hongzhi Zhang, Yingyao Wang, Sirui Wang and
Xuezhi Cao, Fuzheng Zhang, Zhongyuan Wang

Keywords Paper

symbolic reasoning, pre-trained models, pre-trained transformers, table representation

0

0

0

0

6:52

02/06/2020

Detecting Synonymous Properties by Shared Data-Driven Definitions

Jan-Christoph Kalo, Stephan Mennicke, Philipp Ehler, Wolf-Tilo Balke

Keywords Paper

0

0

0

0

19:39

19/10/2020

Neural relation extraction on wikipedia tables for augmenting knowledge graphs

Erin Macdonald, Denilson Barbosa

Keywords Paper

information extraction, benchmarking, web tables

0

0

0

0

6:14

26/04/2020

Variational Template Machine for Data-to-Text Generation

Rong Ye, Wenxian Shi, Hao Zhou and
Zhongyu Wei, Lei Li

Keywords Paper

0

0

0

0

4:55

16/11/2020

Program Enhanced Fact Verification with Verbalization and Graph Attention Network

Xiaoyu Yang, Feng Nie, Yufei Feng and
Quan Liu, Zhigang Chen, Xiaodan Zhu

Keywords Paper

fact verification, real-life applications, symbolic operations, programs

0

0

0

0

12:04

26/04/2020

Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering

Akari Asai, Kazuma Hashimoto, Hannaneh Hajishirzi and
Richard Socher, Caiming Xiong

Keywords Paper

Multi-hop Open-domain Question Answering, Graph-based Retrieval, Multi-step Retrieval

0

0

0

0

5:15

16/11/2020

Neural Deepfake Detection with Factual Structure of Text

Wanjun Zhong, Duyu Tang, Zenan Xu and
Ruize Wang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

Keywords Paper

deepfake detection, automatically text, deepfake text, natural models

0

0

0

0

10:48

03/05/2021

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing

Tao Yu, Jason Wu, Xi V Lin and
bailin wang, Yi Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong

Keywords Paper

pre-training, nlp, semantic parsing, text-to-sql

0

0

0

0

5:13

04/07/2020

Logical Natural Language Generation from Open-Domain Tables

Wenhu Chen, Jianshu Chen, Yu Su and
Zhiyu Chen, William Yang Wang

Keywords Paper

Logical Generation, neural NLG, surface-level realizations, logical inference

0

0

0

0

11:48

06/12/2021

Ensembling Graph Predictions for AMR Parsing

Thanh Lam Hoang, Gabriele Picco, Yufang Hou and
Young-Suk Lee, Lam Nguyen, Dzung Phan, Vanessa Lopez, Ramon Fernandez Astudillo

Keywords Paper

deep learning, machine learning, graph learning, language

0

0

0

0

11:53

08/12/2020

TableGPT: Few-shot Table-to-Text Generation with Table Structure Reconstruction and Content Matching

Heng Gong, Yawei Sun, Xiaocheng Feng and
Bing Qin, Wei Bi, Xiaojiang Liu, Ting Liu

Keywords Paper

0

0

0

0

8:45

03/05/2021

Mathematical Reasoning via Self-supervised Skip-tree Training

Markus Rabe, Dennis Lee, Kshitij Bansal, Christian Szegedy

Keywords Paper

theorem proving, reasoning, self-supervised learning, mathematics, language modeling

0

0

0

0

10:12

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

02/02/2021

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Peng Shi, Patrick Ng, Zhiguo Wang and
Henghui Zhu, Alexander Hanbo Li, Jun Wang, Cicero Nogueira dos Santos, Bing Xiang

Keywords Paper

0

0

0

0

15:15

02/02/2021

Leveraging Table Content for Zero-shot Text-to-SQL with Meta-Learning

Yongrui Chen, Xinnan Guo, Chaojie Wang and
Jian Qiu, Guilin Qi, Meng Wang, Huiying Li

Keywords Paper

0

0

0

0

15:56

04/07/2020

Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing

Jiangming Liu, Shay B. Cohen, Mirella Lapata

Keywords Paper

Discourse Parsing, Dscorer, Fast Metric,

0

0

0

0

7:02

15/11/2020

Finding Bugs in Database Systems via Query Partitioning

Manuel Rigger, Zhendong Su

Keywords Paper

database testing, three-valued logic, DBMS testing, test oracle

0

0

0

0

15:01

02/02/2021

Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units

Ankur Mali, Alexander G. Ororbia, Daniel Kifer, C. Lee Giles

Keywords Paper

0

0

0

0

15:07

26/04/2020

In Search for a SAT-friendly Binarized Neural Network Architecture

Nina Narodytska, Hongce Zhang, Aarti Gupta, Toby Walsh

Keywords Paper

verification, Boolean satisfiability, Binarized Neural Networks

0

0

0

0

4:58

08/12/2020

Solving Math Word Problems with Multi-Encoders and Multi-Decoders

Yibin Shen, Cheqing Jin

Keywords Paper

0

0

0

0

13:26

04/07/2020

INFOTABS: Inference on Tables as Semi-structured Data

Vivek Gupta, Maitrey Mehta, Pegah Nokhiz, Vivek Srikumar

Keywords Paper

INFOTABS, complex reasoning, modeling strategies, meaning fragments

0

0

0

0

11:38

02/02/2021

A Bottom-Up DAG Structure Extraction Model for Math Word Problems

Yixuan Cao, Feng Hong, Hongwei Li, Ping Luo

Keywords Paper

0

0

0

0

14:01

02/02/2021

Graph-to-Graph: Towards Accurate and Interpretable Online Handwritten Mathematical Expression Recognition

Jin-Wen Wu, Fei Yin, Yan-Ming Zhang and
Xu-Yao Zhang, Cheng-Lin Liu

Keywords Paper

0

0

0

0

15:39

05/01/2021

ChartOCR: Data Extraction From Charts Images via a Deep Hybrid Framework

Junyu Luo, Zekun Li, Jinpeng Wang, Chin-Yew Lin

Keywords Paper

0

0

0

0

4:58

13/04/2021

PClean: Bayesian data cleaning at scale with domain-specific probabilistic programming

Alexander Lew, Monica Agrawal, David Sontag, Vikash Mansinghka

Keywords Paper

0

0

0

0

3:08

16/11/2020

PathQG: Neural Question Generation from Facts

Siyuan Wang, Zhongyu Wei, Zhihao Fan and
Zengfeng Huang, Weijian Sun, Qi Zhang, Xuanjing Huang

Keywords Paper

question generation, query learning, query-based generation, sequence problem

0

0

0

0

11:16

12/07/2020

Learning Reasoning Strategies in End-to-End Differentiable Proving

Pasquale Minervini, Tim Rocktäschel, Sebastian Riedel and
Edward Grefenstette, Pontus Stenetorp

Keywords Paper

Sequential, Network, and Time-Series Modeling

0

0

0

0

16:38

16/11/2020

IGSQL: Database Schema Interaction Graph Based Neural Model for Context-Dependent Text-to-SQL Generation

Yitao Cai, Xiaojun Wan

Keywords Paper

context-dependent task, context-dependent drawn, decoding phase, encoders

0

0

0

0

9:19

18/07/2021

SpreadsheetCoder: Formula Prediction from Semi-structured Context

Xinyun Chen, Petros Maniatis, Rishabh Singh and
Charles Sutton, Hanjun Dai, Max Lin, Denny Zhou

Keywords Paper

Algorithms, Structured Prediction

0

0

0

0

5:34

26/04/2020

Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension

Xinyun Chen, Chen Liang, Adams Wei Yu and
Denny Zhou, Dawn Song, Quoc V. Le

Keywords Paper

neural symbolic, reading comprehension, question answering

0

0

0

0

4:50