Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

02/02/2021

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Peng Shi, Patrick Ng, Zhiguo Wang, Henghui Zhu, Alexander Hanbo Li, Jun Wang, Cicero Nogueira dos Santos, Bing Xiang

Keywords:

Abstract Paper Similar Papers

Abstract: Most recently, there has been significant interest in learning contextual representations for various NLP tasks, by leveraging large scale text corpora to train powerful language models with self-supervised learning objectives, such as Masked Language Model (MLM). Based on a pilot study, we observe three issues of existing general-purpose language models when they are applied in the text-to-SQL semantic parsers: fail to detect the column mentions in the utterances, to infer the column mentions from the cell values, and to compose target SQL queries when they are complex. To mitigate these issues, we present a model pretraining framework, Generation-Augmented Pre-training (GAP), that jointly learns representations of natural language utterance and table schemas, by leveraging generation models to generate high-quality pre-train data. GAP Model is trained on 2 million utterance-schema pairs and 30K utterance-schema-SQL triples, whose utterances are generated by generation models. Based on experimental results, neural semantic parsers that leverage GAP Model as a representation encoder obtain new state-of-the-art results on both Spider and Criteria-to-SQL benchmarks.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948654

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

03/05/2021

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing

Tao Yu, Rui Zhang, Alex Polozov and
Christopher Meek, Ahmed H Awadallah

Keywords Paper

0

0

0

0

5:11

06/12/2020

Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge

Alon Talmor, Oyvind Tafjord, Peter Clark and
Yoav Goldberg, Jonathan Berant

Keywords Paper

0

0

0

0

3:28

12/07/2020

Retrieval Augmented Language Model Pre-Training

Kelvin Guu, Kenton Lee, Zora Tung and
Panupong Pasupat, Mingwei Chang

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

14:44

04/07/2020

Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension

Bo Zheng, Haoyang Wen, Yaobo Liang and
Nan Duan, Wanxiang Che, Daxin Jiang, Ming Zhou, Ting Liu

Keywords Paper

Document Modeling, Multi-grained Comprehension, machine comprehension, Graph Networks

0

0

0

0

10:51

03/05/2021

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing

Tao Yu, Jason Wu, Xi V Lin and
bailin wang, Yi Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong

Keywords Paper

pre-training, nlp, semantic parsing, text-to-sql

0

0

0

0

5:13

04/07/2020

Logical Natural Language Generation from Open-Domain Tables

Wenhu Chen, Jianshu Chen, Yu Su and
Zhiyu Chen, William Yang Wang

Keywords Paper

Logical Generation, neural NLG, surface-level realizations, logical inference

0

0

0

0

11:48

04/07/2020

INSET: Sentence Infilling with INter-SEntential Transformer

Yichen Huang, Yizhe Zhang, Oussama Elachqar, Yu Cheng

Keywords Paper

Sentence Infilling, Missing generation, sentence in-filling, natural generation

0

0

0

0

11:42

02/02/2021

Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings?

Xuancheng Ren, Xu Sun, Houfeng Wang, Qun Liu

Keywords Paper

0

0

0

0

16:04

04/07/2020

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Keywords Paper

Neural training, NMT training, document-level training, NMT objective

0

0

0

0

6:48

04/07/2020

Enhancing Cross-target Stance Detection with Transferable Semantic-Emotion Knowledge

Bowen Zhang, Min Yang, Xutao Li and
Yunming Ye, Xiaofei Xu, Kuai Dai

Keywords Paper

Cross-target Detection, Stance detection, knowledge transfer, stance classifier

0

0

0

0

11:57

04/07/2020

TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data

Pengcheng Yin, Graham Neubig, Wen-tau Yih, Sebastian Riedel

Keywords Paper

Joint Data, text-based tasks, semantic parsing, TaBERT

0

0

0

0

12:00

25/07/2020

Learning discriminative joint embeddings for efficient face and voice association

Rui Wang, Xin Liu, Yiu-ming Cheung and
Kai Cheng, Nannan Wang, Wentao Fan

Keywords Paper

bi-directional ranking constraint, face-voice association, cross-modal verification, discriminative joint embedding

0

0

0

0

8:33

08/12/2020

TableGPT: Few-shot Table-to-Text Generation with Table Structure Reconstruction and Content Matching

Heng Gong, Yawei Sun, Xiaocheng Feng and
Bing Qin, Wei Bi, Xiaojiang Liu, Ting Liu

Keywords Paper

0

0

0

0

8:45

16/11/2020

Learning to Represent Image and Text with Denotation Graph

Bowen Zhang, Hexiang Hu, Vihan Jain and
Eugene Ie, Fei Sha

Keywords Paper

cross-modal retrieval, referring expression, compositional recognition, pre-training

0

0

0

0

10:59

16/11/2020

Distilling Structured Knowledge for Text-Based Relational Reasoning

Jin Dong, Marc-Antoine Rondeau, William L. Hamilton

Keywords Paper

relational reasoning, reasoning task, cross-modal transfer, text-based systems

0

0

0

0

5:15

02/02/2021

Curriculum-Meta Learning for Order-Robust Continual Relation Extraction

Tongtong Wu, Xuekai Li, Yuan-Fang Li and
Gholamreza Haffari, Guilin Qi, Yujin Zhu, Guoqiang Xu

Keywords Paper

0

0

0

0

11:33

02/02/2021

HMS: A Hierarchical Solver with Dependency-Enhanced Understanding for Math Word Problem

Xin Lin, Zhenya Huang, Hongke Zhao and
Enhong Chen, Qi Liu, Hao Wang, Shijin Wang

Keywords Paper

0

0

0

0

18:01

04/07/2020

TaPas: Weakly Supervised Table Parsing via Pre-training

Jonathan Herzig, Pawel Krzysztof Nowak, Thomas Müller and
Francesco Piccinno, Julian Eisenschlos

Keywords Paper

Weakly Parsing, semantic task, question tables, SQA

0

0

0

0

12:49

22/06/2020

Exploiting Semantic Relations for Fine-grained Entity Typing

Hongliang Dai, Yangqiu Song, Xin Li

Keywords Paper

Fine-grained Entity Typing, Hypernym Extraction, Semantic Role Labeling

0

0

0

0

4:45

04/07/2020

Exploring Unexplored Generalization Challenges for Cross-Database Semantic Parsing

Alane Suhr, Ming-Wei Chang, Peter Shaw, Kenton Lee

Keywords Paper

Exploring Challenges, Cross-Database Parsing, cross-database XSP, cross-database

0

0

0

0

11:48

06/12/2020

Learning Sparse Prototypes for Text Generation

Junxian He, Taylor Berg-Kirkpatrick, Graham Neubig

Keywords Paper

0

0

0

0

3:22

02/02/2021

Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing

Run-Ze Wang, Zhen-Hua Ling, Jingbo Zhou, Yu Hu

Keywords Paper

0

0

0

0

15:13

16/11/2020

Improving AMR Parsing with Sequence-to-Sequence Pre-training

Dongqin Xu, Junhui Li, Muhua Zhu and
Min Zhang, Guodong Zhou

Keywords Paper

abstract parsing, amr parsing, sequence-to-sequence parsing, machine translation

0

0

0

0

11:42

06/12/2020

Latent Template Induction with Gumbel-CRFs

Yao Fu, Chuanqi Tan, Bin Bi and
Mosha Chen, Yansong Feng, Alexander Rush

Keywords Paper

0

0

0

0

3:14

16/11/2020

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

Tao Shen, Yi Mao, Pengcheng He and
Guodong Long, Adam Trischler, Weizhu Chen

Keywords Paper

self-supervised tasks, pre-training, entity linking, finetuning

0

0

0

0

11:38

04/07/2020

Exploiting the Syntax-Model Consistency for Neural Relation Extraction

Amir Pouran Ben Veyseh, Franck Dernoncourt, Dejing Dou, Thien Huu Nguyen

Keywords Paper

Neural Extraction, Relation Extraction, RE, syntactic injection

0

0

0

0

11:03

04/07/2020

Tabula nearly Rasa: Probing the linguistic knowledge of character-level neural language models trained on unsegmented text

Michael Hahn, Marco Baroni

Keywords Paper

natural tasks, morphological tasks, language usage, Tabula

0

0

0

0

14:40

16/11/2020

Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading

Yifan Gao, Chien-Sheng Wu, Jingjing Li and
Shafiq Joty, Steven C.H. Hoi, Caiming Xiong, Irwin King, Michael Lyu

Keywords Paper

document interpretation, dialog understanding, conversational reading, discern

0

0

0

0

11:47

06/12/2021

Think Big, Teach Small: Do Language Models Distil Occam’s Razor?

Gonzalo Jaimovitch-Lopez, David Castellano Falcón, Cesar Ferri, José Hernández-Orallo

Keywords Paper

machine learning, interpretability, few shot learning

0

0

0

0

12:12

04/07/2020

Semantic Parsing for English as a Second Language

Yuanyuan Zhao, Weiwei Sun, Junjie Cao, Xiaojun Wan

Keywords Paper

semantic parsing, second acquisition, Semantic Parsing, ESL

0

0

0

0

11:04

01/07/2020

DeepMet: A Reading Comprehension Paradigm for Token-level Metaphor Detection

Chuandong Su, Fumiyo Fukumoto, Xiaoxi Huang and
Jiyi Li, Rongbo Wang, Zhiqun Chen

Keywords Paper

0

0

0

0

10:37

06/12/2021

End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering

Devendra Singh, Siva Reddy, Will Hamilton and
Chris Dyer, Dani Yogatama

Keywords Paper

0

0

0

0

14:42

16/11/2020

Pre-training for Abstractive Document Summarization by Reinstating Source Text

Yanyan Zou, Xingxing Zhang, Wei Lu and
Furu Wei, Ming Zhou

Keywords Paper

abstractive summarization, sequence-to-sequence problem, sentence reordering, next generation

0

0

0

0

10:25

12/07/2020

Mapping natural-language problems to formal-language solutions using structured neural representations

Kezhen Chen, Qiuyuan Huang, Hamid Palangi and
Paul Smolensky, Ken Forbus, Jianfeng Gao

Keywords Paper

Applications - Neuroscience, Cognitive Science, Biology and Health

0

0

0

0

11:34

19/04/2021

Generating syntactically controlled paraphrases without using annotated parallel pairs

Kuan-Hao Huang, Kai-Wei Chang

Keywords Paper

0

0

0

1

10:41

06/12/2021

Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning

Christopher Rytting, David Wingate

Keywords Paper

language, transfer learning

0

0

0

0

14:16

12/07/2020

Pseudo-Masked Language Models for Unified Language Model Pre-Training

Hangbo Bao, Li Dong, Furu Wei and
Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Jianfeng Gao, Songhao Piao, Ming Zhou, Hsiao-Wuen Hon

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

13:55

12/07/2020

PoKED: A Semi-Supervised System for Word Sense Disambiguation

Feng Wei

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

15:39

04/07/2020

Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading

Yifan Gao, Chien-Sheng Wu, Shafiq Joty and
Caiming Xiong, Richard Socher, Irwin King, Michael Lyu, Steven C.H. Hoi

Keywords Paper

Conversational Reading, decision making, Explicit Tracker, Coarse-to-Fine Reasoning

0

0

0

0

11:51

16/11/2020

Learning Explainable Linguistic Expressions with Neural Inductive Logic Programming for Sentence Classification

Prithviraj Sen, Marina Danilevsky, Yunyao Li and
Siddhartha Brahma, Matthias Boehm, Laura Chiticariu, Rajasekar Krishnamurthy

Keywords Paper

interpretability models, sentence classification, le, human-machine models

0

0

0

0

9:42