Dynamic Inference with Neural Interpreters

Abstract: Modern neural network architectures can leverage large amounts of data to generalize well within the training distribution. However, they are less capable of systematic generalization to data drawn from unseen but related distributions, a feat that is hypothesized to require compositional reasoning and reuse of knowledge. In this work, we present Neural Interpreters, an architecture that factorizes inference in a self-attention network as a system of modules, which we call _functions_. Inputs to the model are routed through a sequence of functions in a way that is end-to-end learned. The proposed architecture can flexibly compose computation along width and depth, and lends itself well to capacity extension after training. To demonstrate the versatility of Neural Interpreters, we evaluate it in two distinct settings: image classification and visual abstract reasoning on Raven Progressive Matrices. In the former, we show that Neural Interpreters perform on par with the vision transformer using fewer parameters, while being transferrable to a new task in a sample efficient manner. In the latter, we find that Neural Interpreters are competitive with respect to the state-of-the-art in terms of systematic generalization.

03/05/2021

Dynamic Inference with Neural Interpreters

Nasim Rahaman, Muhammad Waleed Gondal, Shruti Joshi, Peter Gehler, Yoshua Bengio, Francesco Locatello, Bernhard Schölkopf

Comments

Similar Papers

Neurally Augmented ALISTA

Freya Behrens, Jonathan Sauder, Peter Jung

Keywords Abstract Paper

learned ISTA, unrolled algorithms, compressed sensing, sparse reconstruction

Meta-Learning with Neural Tangent Kernels

Yufan Zhou, Zhenyi Wang, Jiayi Xian and Changyou Chen, Jinhui Xu

Keywords Abstract Paper

neural tangent kernel, meta-learning

Incremental training of a recurrent neural network exploiting a multi-scale dynamic memory

Antonio Carta, Alessandro Sperduti, Davide Bacciu

Keywords Abstract Paper

recurrent neural networks, linear dynamical systems, incremental learning

Learning to Recombine and Resample Data For Compositional Generalization

Ekin Akyürek, Afra Feyza Akyürek, Jacob Andreas

Keywords Abstract Paper

sequence models, language processing, compositional generalization, data augmentation, generative modeling

Self-Supervised Learning of Event-Based Optical Flow with Spiking Neural Networks

Jesse Hagenaars, Federico Paredes-Valles, Guido de Croon

Keywords Abstract Paper

deep learning, optimization, self-supervised learning

Flexible IR pipelines with capreolus

Andrew Yates, Kevin Martin Jose, Xinyu Zhang, Jimmy Lin

Keywords Abstract Paper

neural information retrieval, retrieval pipeline, ad hoc ranking

Learning Invariances in Neural Networks from Training Data

Greg Benton, Marc Finzi, Pavel Izmailov, Andrew Wilson

Keywords Abstract Paper

Fast Neural Network Adaptation via Parameter Remapping and Architecture Search

Jiemin Fang*, Yuzhu Sun*, Kangjian Peng* and Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

Keywords Abstract Paper

Meta-Learning Sparse Implicit Neural Representations

Jaeho Lee, Jihoon Tack, Namhoon Lee, Jinwoo Shin

Keywords Abstract Paper

deep learning, optimization, meta learning, representation learning

Logsig-RNN: a novel network for robust and efficient skeleton-based action recognition

Hao Ni, Shujian Liao, Weixin Yang and Kevin Schlegel, Terry J Lyons

Keywords Abstract Paper

skeleton-based action recognition, recurrent neural network, log-signature

Any-Precision Deep Neural Networks

Haichao Yu, Haoxiang Li, Humphrey Shi and Thomas S. Huang, Gang Hua

Keywords Abstract Paper

Adversarially-Trained Deep Nets Transfer Better: Illustration on Image Classification

Francisco Utrera, Evan Kravitz, N. Benjamin Erichson and Rajiv Khanna, Michael W Mahoney

Keywords Abstract Paper

adversarial training, limited data, influence functions, transfer learning

Auxiliary Learning by Implicit Differentiation

Aviv Navon, Idan Achituve, Haggai Maron and Gal Chechik, Ethan Fetaya

Keywords Abstract Paper

Multi-task Learning, Auxiliary Learning

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Tejas Gokhale, Rushil Anirudh, Bhavya Kailkhura and Jayaraman J. Thiagarajan, Chitta Baral, Yezhou Yang

Keywords Abstract Paper

Encoding Robustness to Image Style via Adversarial Feature Perturbations

Manli Shu, Zuxuan Wu, Micah Goldblum, Tom Goldstein

Keywords Abstract Paper

deep learning, machine learning, robustness, adversarial robustness and security, domain adaptation

F-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation

Konstantin Sofiiuk, Ilia Petrov, Olga Barinova, Anton Konushin

Keywords Abstract Paper

interactive segmentation, interactive, instance segmentation, segmentation, backpropagating refinement, refinement

Rethinking Neural Operations for Diverse Tasks

Nicholas Roberts, Mikhail Khodak, Tri Dao and Liam Li, Christopher Ré, Ameet S Talwalkar

Keywords Abstract Paper

deep learning, machine learning

APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

Tianzhe Wang, Kuan Wang, Han Cai and Ji Lin, Zhijian Liu, Hanrui Wang, Yujun Lin, Song Han

Keywords Abstract Paper

efficiency, model compression, joint design, neural architecture search, channel pruning, mixed-precision quantization

Finding the Optimal Network Depth in Classification Tasks

Bartosz Wójcik, Maciej Wołczyk, Klaudia Bałazy, Jacek Tabor

Keywords Abstract Paper

model compression and acceleration, multi-head networks

Towards Fast Adaptation of Neural Architectures with Meta Learning

Dongze Lian, Yin Zheng, Yintao Xu and Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao

Keywords Abstract Paper

Fast adaptation, Meta learning, NAS

Keywords Paper

Yufan Zhou, Zhenyi Wang, Jiayi Xian and
Changyou Chen, Jinhui Xu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jiemin Fang, Yuzhu Sun, Kangjian Peng* and
Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

Keywords Paper

Keywords Paper

Hao Ni, Shujian Liao, Weixin Yang and
Kevin Schlegel, Terry J Lyons

Keywords Paper

Haichao Yu, Haoxiang Li, Humphrey Shi and
Thomas S. Huang, Gang Hua

Keywords Paper

Francisco Utrera, Evan Kravitz, N. Benjamin Erichson and
Rajiv Khanna, Michael W Mahoney

Keywords Paper

Aviv Navon, Idan Achituve, Haggai Maron and
Gal Chechik, Ethan Fetaya

Keywords Paper

Tejas Gokhale, Rushil Anirudh, Bhavya Kailkhura and
Jayaraman J. Thiagarajan, Chitta Baral, Yezhou Yang

Keywords Paper

Keywords Paper

Keywords Paper

Nicholas Roberts, Mikhail Khodak, Tri Dao and
Liam Li, Christopher Ré, Ameet S Talwalkar

Keywords Paper

Tianzhe Wang, Kuan Wang, Han Cai and
Ji Lin, Zhijian Liu, Hanrui Wang, Yujun Lin, Song Han

Keywords Paper

Keywords Paper

Dongze Lian, Yin Zheng, Yintao Xu and
Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jonathan Gordon, Wessel P. Bruinsma, Andrew Y. K. Foong and
James Requeima, Yann Dubois, Richard E. Turner

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Hartmut Maennel, Ibrahim Alabdulmohsin, Ilya Tolstikhin and
Robert Baldock, Olivier Bousquet, Sylvain Gelly, Daniel Keysers

Keywords Paper

Ervine Zheng, Qi Yu, Rui Li and
Pengcheng Shi, Anne Haake

Keywords Paper

Keywords Paper

Alex Lamb, Anirudh Goyal, Agnieszka Słowik and
Michael Mozer, Philippe Beaudoin, Yoshua Bengio

Keywords Paper

Keywords Paper

Shizun Wang, Ming Lu, Kaixin Chen and
Jiaming Liu, Xiaoqi Li, Chuang Zhang, Ming Wu

Keywords Paper

Atli Kosson, Vitaliy Chiley, Abhi Venigalla and
Joel Hestness, Urs Koster

Keywords Paper

Atli Kosson, Vitaliy Chiley, Abhi Venigalla and
Joel Hestness, Urs Koster

Keywords Paper

Samarth Sinha, Homanga Bharadhwaj, Anirudh Goyal and
Hugo Larochelle, Animesh Garg, Florian Shkurti

Keywords Paper