16/11/2020

RNNs can generate bounded hierarchical languages with optimal memory

John Hewitt, Michael Hahn, Surya Ganguli, Percy Liang, Christopher D. Manning

Keywords: recurrent networks, RNNs, finite-precision setting

Abstract: Recurrent neural networks empirically generate natural language with high syntactic fidelity. However, their success is not well-understood theoretically. We provide theoretical insight into this success, proving in a finite-precision setting that RNNs can efficiently generate bounded hierarchical languages that reflect the scaffolding of natural language syntax. We introduce Dyck-$(k,m)$, the language of well-nested brackets (of $k$ types) and $m$-bounded nesting depth, reflecting the bounded memory needs and long-distance dependencies of natural language syntax. The best known results use $O(k^{\frac{m}{2}})$ memory (hidden units) to generate these languages. We prove that an RNN with $O(m \log k)$ hidden units suffices, an exponential reduction in memory, by an explicit construction. Finally, we show that no algorithm, even with unbounded computation, can suffice with $o(m \log k)$ hidden units.
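To make the memory bound concrete, here is a minimal sketch (not the paper's construction) of a Dyck-$(k,m)$ recognizer whose only state is a stack of at most $m$ bracket types; storing that stack takes about $m \log_2 k$ bits, matching the order of the $O(m \log k)$ hidden-unit bound. The token encoding and helper names below are illustrative assumptions, not notation from the paper.

```python
from math import ceil, log2

def is_dyck_km(tokens, k, m):
    """Check membership in Dyck-(k, m): well-nested brackets of k types
    with nesting depth at most m.

    Tokens are pairs (bracket_type, is_open); e.g. (3, True) is the
    opening bracket of type 3. This encoding is an illustrative choice.
    """
    stack = []  # types of currently open brackets
    for bracket_type, is_open in tokens:
        if is_open:
            stack.append(bracket_type)
            if len(stack) > m:        # nesting depth exceeds the bound m
                return False
        else:
            if not stack or stack[-1] != bracket_type:
                return False          # mismatched or unmatched closing bracket
            stack.pop()
    return not stack                  # every opened bracket must be closed

def memory_bits(k, m):
    # The recognizer's state is at most m bracket types, each needing
    # ceil(log2(k)) bits: on the order of m log k, as in the paper's bound.
    return m * ceil(log2(k))

# Example: the string "[ ( ) ]" with k = 2 bracket types and depth bound m = 2.
print(is_dyck_km([(0, True), (1, True), (1, False), (0, False)], k=2, m=2))  # True
print(memory_bits(k=2, m=2))  # 2
```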

