Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks

06/12/2021

Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks

Avi Schwarzschild, Eitan Borgnia, Arjun Gupta, Furong Huang, Uzi Vishkin, Micah Goldblum, Tom Goldstein

Keywords: deep learning

Abstract Paper Similar Papers

Abstract: Deep neural networks are powerful machines for visual pattern recognition, but reasoning tasks that are easy for humans may still be difficult for neural models. Humans possess the ability to extrapolate reasoning strategies learned on simple problems to solve harder examples, often by thinking for longer. For example, a person who has learned to solve small mazes can easily extend the very same search techniques to solve much larger mazes by spending more time. In computers, this behavior is often achieved through the use of algorithms, which scale to arbitrarily hard problem instances at the cost of more computation. In contrast, the sequential computing budget of feed-forward neural networks is limited by their depth, and networks trained on simple problems have no way of extending their reasoning to accommodate harder problems. In this work, we show that recurrent networks trained to solve simple problems with few recurrent steps can indeed solve much more complex problems simply by performing additional recurrences during inference. We demonstrate this algorithmic behavior of recurrent networks on prefix sum computation, mazes, and chess. In all three domains, networks trained on simple problem instances are able to extend their reasoning abilities at test time simply by "thinking for longer."

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

It's Not What Machines Can Learn, It's What We Cannot Teach

Gal Yehuda, Moshe Gabel, Assaf Schuster

Keywords Paper

Supervised Learning

0

0

0

0

10:41

02/02/2021

Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units

Ankur Mali, Alexander G. Ororbia, Daniel Kifer, C. Lee Giles

Keywords Paper

0

0

0

0

15:07

06/12/2020

Transferable Graph Optimizers for ML Compilers

yanqiz Zhou, Sudip Roy, Amirali Abdolrashidi and
Daniel Wong, Peter Ma, Qiumin Xu, Hanxiao Liu, Phitchaya Phothilimtha, Shen Wang, Anna Goldie, Azalia Mirhoseini, James Laudon

Keywords Paper

0

0

0

0

3:05

03/05/2021

Complex Query Answering with Neural Link Predictors

Erik Arakelyan, Daniel Daza, Pasquale Minervini, Michael Cochez

Keywords Paper

neural link prediction, complex query answering

0

0

0

0

15:28

12/07/2020

Learning What to Defer for Maximum Independent Sets

Sungsoo Ahn, Younggyo Seo, Jinwoo Shin

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:47

06/12/2020

Set2Graph: Learning Graphs From Sets

Hadar Serviansky, Nimrod Segol, Jonathan Shlomi and
Kyle Cranmer, Eilam Gross, Haggai Maron, Yaron Lipman

Keywords Paper

0

0

0

0

3:04

12/07/2020

Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?

Kei Ota, Tomoaki Oiki, Devesh Jha and
Toshisada Mariyama, Daniel Nikovski

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:55

06/12/2020

Neural Execution Engines: Learning to Execute Subroutines

Yujun Yan, Kevin Swersky, Danai Koutra and
Parthasarathy Ranganathan, Milad Hashemi

Keywords Paper

0

0

0

0

3:20

03/05/2021

Efficient Continual Learning with Modular Networks and Task-Driven Priors

Tom Veniat, Ludovic Denoyer, Marc'Aurelio Ranzato

Keywords Paper

Benchmark, Neural Network, Modular network, Lifelong learning, Continual learning

0

0

0

0

5:26

18/07/2021

A Novel Sequential Coreset Method for Gradient Descent Algorithms

Jiawei Huang, Ruomin Huang, wenjie liu and
Nikolaos Freris, Hu Ding

Keywords Paper

Optimization

0

0

0

0

5:15

22/06/2020

Learning Credal Sum-Product Networks

Amelie Levray, Vaishak Belle

Keywords Paper

credal networks, imprecise probabilities, tractable learning

0

0

0

0

5:10

06/12/2021

Learning and Generalization in RNNs

Abhishek Panigrahi, Navin Goyal

Keywords Paper

deep learning, optimization

0

0

0

0

15:51

06/12/2020

Understanding spiking networks through convex optimization

Allan Mancoo, Sander Keemink, Christian Machens

Keywords Paper

0

0

0

0

3:21

23/08/2020

Diverse rule sets

Guangyi Zhang, Aristides Gionis

Keywords Paper

sampling, classifier, pattern mining, rule learning, diversification, rule sets

0

0

0

0

9:41

13/04/2021

Faster & more reliable tuning of neural networks: Bayesian optimization with importance sampling

Setareh Ariafar, Zelda Mariet, Dana Brooks and
Jennifer Dy, Jasper Snoek

Keywords Paper

0

0

0

0

3:01

26/04/2020

Continual learning with hypernetworks

Johannes von Oswald, Christian Henning, João Sacramento, Benjamin F. Grewe

Keywords Paper

Continual Learning, Catastrophic Forgetting, Meta Model, Hypernetwork

0

0

0

0

5:04

26/04/2020

Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation

Byung Hoon Ahn, Prannoy Pilligundla, Amir Yazdanbakhsh, Hadi Esmaeilzadeh

Keywords Paper

Reinforcement Learning, Learning to Optimize, Combinatorial Optimization, Compilers, Code Optimization, Neural Networks, ML for Systems, Learning for Systems

0

0

0

0

4:55

26/04/2020

Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension

Xinyun Chen, Chen Liang, Adams Wei Yu and
Denny Zhou, Dawn Song, Quoc V. Le

Keywords Paper

neural symbolic, reading comprehension, question answering

0

0

0

0

4:50

06/12/2021

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

Keywords Paper

theory, machine learning, bandits, kernel methods, active learning

0

0

0

0

7:41

06/12/2021

Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks

Tolga Birdal, Aaron Lou, Leonidas Guibas, Umut Simsekli

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:38

26/04/2020

MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius

Runtian Zhai, Chen Dan, Di He and
Huan Zhang, Boqing Gong, Pradeep Ravikumar, Cho-Jui Hsieh, Liwei Wang

Keywords Paper

Adversarial Robustness, Provable Adversarial Defense, Randomized Smoothing, Robustness Certification

0

0

0

0

5:10

06/12/2021

Structured Reordering for Modeling Latent Alignments in Sequence Transduction

bailin wang, Mirella Lapata, Ivan Titov

Keywords Paper

language

0

0

0

0

15:00

06/12/2020

Learning Parities with Neural Networks

Amit Daniely, Eran Malach

Keywords Paper

0

0

0

0

3:21

06/12/2020

Pointer Graph Networks

Petar Veličković, Lars Buesing, Matt Overlan and
Razvan Pascanu, Oriol Vinyals, Charles Blundell

Keywords Paper

0

0

0

0

2:50

26/04/2020

In Search for a SAT-friendly Binarized Neural Network Architecture

Nina Narodytska, Hongce Zhang, Aarti Gupta, Toby Walsh

Keywords Paper

verification, Boolean satisfiability, Binarized Neural Networks

0

0

0

0

4:58

06/12/2020

GCOMB: Learning Budget-constrained Combinatorial Algorithms over Billion-sized Graphs

Sahil Manchanda, Akash MITTAL, Anuj Dhawan and
Sourav Medya, Sayan Ranu, Ambuj K Singh

Keywords Paper

0

0

0

0

3:21

12/07/2020

Learning Algebraic Multigrid Using Graph Neural Networks

Ilay Luz, Meirav Galun, Haggai Maron and
Ronen Basri, Irad Yavneh

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

14:32

06/12/2020

BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits

Mo Tiwari, Martin Zhang, James J Mayclin and
Sebastian Thrun, Chris Piech, Ilan Shomorony

Keywords Paper

0

0

0

0

3:16

22/06/2020

Efficiently learning structured distributions from untrusted batches

Sitan Chen, Jerry Li, Ankur Moitra

Keywords Paper

sum-of-squares, federated learning, VC complexity, Robust statistics

0

0

0

0

24:38

12/07/2020

Scalable Deep Generative Modeling for Sparse Graphs

Hanjun Dai, Azade Nazi, Yujia Li and
Bo Dai, Dale Schuurmans

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

12:19

12/07/2020

Searching to Exploit Memorization Effect in Learning with Noisy Labels

QUANMING YAO, Hansi Yang, Bo Han and
Gang Niu, James Kwok

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

12:25

06/12/2020

GPU-Accelerated Primal Learning for Extremely Fast Large-Scale Classification

John Halloran, David M Rocke

Keywords Paper

0

0

0

0

3:33

26/04/2020

Minimizing FLOPs to Learn Efficient Sparse Representations

Biswajit Paria, Chih-Kuan Yeh, Ian E.H. Yen and
Ning Xu, Pradeep Ravikumar, Barnabás Póczos

Keywords Paper

sparse embeddings, deep representations, metric learning, regularization

0

0

0

0

4:41

06/12/2021

Environment Generation for Zero-Shot Compositional Reinforcement Learning

Izzeddin Gur, Natasha Jaques, Yingjie Miao and
Jongwook Choi, Manoj Tiwari, Honglak Lee, Aleksandra Faust

Keywords Paper

reinforcement learning and planning, robustness, graph learning

0

0

0

0

8:40

06/12/2021

Contrastive Reinforcement Learning of Symbolic Reasoning Domains

Gabriel Poesia, WenXin Dong, Noah Goodman

Keywords Paper

reinforcement learning and planning, machine learning, contrastive learning

0

0

0

0

10:41

02/02/2021

Programmatic Strategies for Real-Time Strategy Games

Julian R. H. Mariño, Rubens O. Moraes, Tassiana C. Oliveira and
Claudio Toledo, Levi H. S. Lelis

Keywords Paper

0

0

0

0

19:22

30/11/2020

Fast and Differentiable Message Passing on Pairwise Markov Random Fields

Zhiwei Xu, Thalaiyasingam Ajanthan, Richard Hartley

Keywords Paper

0

0

0

0

9:41

06/12/2021

A Faster Decentralized Algorithm for Nonconvex Minimax Problems

Wenhan Xian, Feihu Huang, Yanfu Zhang, Heng Huang

Keywords Paper

optimization, machine learning, adversarial robustness and security

0

0

0

0

13:59

26/04/2020

Watch the Unobserved: A Simple Approach to Parallelizing Monte Carlo Tree Search

Anji Liu, Jianshu Chen, Mingze Yu and
Yu Zhai, Xuewen Zhou, Ji Liu

Keywords Paper

parallel Monte Carlo Tree Search (MCTS), Upper Confidence bound for Trees (UCT), Reinforcement Learning (RL)

0

0

0

0

14:43

18/07/2021

Provable Meta-Learning of Linear Representations

Nilesh Tripuraneni, Chi Jin, Michael Jordan

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:09