Meta-trained agents implement Bayes-optimal agents

Abstract: Memory-based meta-learning is a powerful technique to build agents that adapt fast to any task within a target distribution. A previous theoretical study has argued that this remarkable performance is because the meta-training protocol incentivises agents to behave Bayes-optimally. We empirically investigate this claim on a number of prediction and bandit tasks. Inspired by ideas from theoretical computer science, we show that meta-learned and Bayes-optimal agents not only behave alike, but they even share a similar computational structure, in the sense that one agent system can approximately simulate the other. Furthermore, we show that Bayes-optimal agents are fixed points of the meta-learning dynamics. Our results suggest that memory-based meta-learning is a general technique for numerically approximating Bayes-optimal agents; that is, even for task distributions for which we currently don't possess tractable models.

06/12/2021

Meta-trained agents implement Bayes-optimal agents

Vlad Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro Ortega

Comments

Similar Papers

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Hung Le, Thommen Karimpanal George, Majid Abdolshah and Truyen Tran, Svetha Venkatesh

Keywords Abstract Paper

reinforcement learning and planning

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Abstract Paper

Efficient Training of Retrieval Models using Negative Cache

Erik Lindgren, Sashank Reddi, Ruiqi Guo, Sanjiv Kumar

Keywords Abstract Paper

deep learning, machine learning

Meta-Learning via Learning with Distributed Memory

Sudarshan Babu, Pedro Savarese, Michael Maire

Keywords Abstract Paper

deep learning, optimization, machine learning, vision, meta learning, online learning

Untangling tradeoffs between recurrence and self-attention in artificial neural networks

Giancarlo Kerg, bhargav104 Kanuparthi, Anirudh Goyal ALIAS PARTH GOYAL and Kyle Goyette, Yoshua Bengio, Guillaume Lajoie

Keywords Abstract Paper

Theoretical bounds on estimation error for meta-learning

James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI and Toniann Pitassi, Richard Zemel

Keywords Abstract Paper

meta learning, minimax risk, few-shot, lower bounds, learning theory

Neurally Augmented ALISTA

Freya Behrens, Jonathan Sauder, Peter Jung

Keywords Abstract Paper

learned ISTA, unrolled algorithms, compressed sensing, sparse reconstruction

Unsupervised Transfer Learning for Spatiotemporal Predictive Networks

Zhiyu Yao, Yunbo Wang, Mingsheng Long, Jianmin Wang

Keywords Abstract Paper

Sequential, Network, and Time-Series Modeling

Adversarial Regression with Doubly Non-negative Weighting Matrices

Tam Le, Truyen Nguyen, Makoto Yamada and Jose Blanchet, Viet Anh Nguyen

Keywords Abstract Paper

machine learning

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and Danil Karpushkin, Dmitry Vetrov

Keywords Abstract Paper

deep learning, optimization

Meta-Learning Requires Meta-Augmentation

Janarthanan Rajendran, Alex Irpan, Eric Jang

Keywords Abstract Paper

Noether Networks: meta-learning useful conserved quantities

Ferran Alet, Dylan Doblar, Allan Zhou and Josh Tenenbaum, Kenji Kawaguchi, Chelsea Finn

Keywords Abstract Paper

machine learning, vision, meta learning

Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time

Ferran Alet, Maria Bauza, Kenji Kawaguchi and Nurullah Giray Kuru, Tomás Lozano-Pérez, Leslie Kaelbling

Keywords Abstract Paper

deep learning, optimization, machine learning, self-supervised learning, meta learning

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Abstract Paper

Meta-Learning with Neural Tangent Kernels

Yufan Zhou, Zhenyi Wang, Jiayi Xian and Changyou Chen, Jinhui Xu

Keywords Abstract Paper

neural tangent kernel, meta-learning

Learning to Learn Dense Gaussian Processes for Few-Shot Learning

Ze Wang, Zichen Miao, Xiantong Zhen, Qiang Qiu

Keywords Abstract Paper

deep learning, optimization, generative model, meta learning, kernel methods, few shot learning

Continual learning with hypernetworks

Johannes von Oswald, Christian Henning, João Sacramento, Benjamin F. Grewe

Keywords Abstract Paper

Continual Learning, Catastrophic Forgetting, Meta Model, Hypernetwork

The Advantage of Conditional Meta-Learning for Biased Regularization and Fine Tuning

Giulia Denevi, Massimiliano Pontil, Carlo Ciliberto

Keywords Abstract Paper

Sequential Transfer in Reinforcement Learning with a Generative Model

Andrea Tirinzoni, Riccardo Poiani, Marcello Restelli

Keywords Abstract Paper

Reinforcement Learning - General

Semi-supervised Sequence Classification through Change Point Detection

Nauman Ahad, Mark A. Davenport

Keywords Abstract Paper

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Hung Le, Thommen Karimpanal George, Majid Abdolshah and
Truyen Tran, Svetha Venkatesh

Keywords Paper

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and
Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Paper

Keywords Paper

Keywords Paper

Giancarlo Kerg, bhargav104 Kanuparthi, Anirudh Goyal ALIAS PARTH GOYAL and
Kyle Goyette, Yoshua Bengio, Guillaume Lajoie

Keywords Paper

James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI and
Toniann Pitassi, Richard Zemel

Keywords Paper

Keywords Paper

Keywords Paper

Tam Le, Truyen Nguyen, Makoto Yamada and
Jose Blanchet, Viet Anh Nguyen

Keywords Paper

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and
Danil Karpushkin, Dmitry Vetrov

Keywords Paper

Keywords Paper

Ferran Alet, Dylan Doblar, Allan Zhou and
Josh Tenenbaum, Kenji Kawaguchi, Chelsea Finn

Keywords Paper

Ferran Alet, Maria Bauza, Kenji Kawaguchi and
Nurullah Giray Kuru, Tomás Lozano-Pérez, Leslie Kaelbling

Keywords Paper

Keywords Paper

Yufan Zhou, Zhenyi Wang, Jiayi Xian and
Changyou Chen, Jinhui Xu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Qi Cai, Yu Wang, Yingwei Pan and
Ting Yao, Tao Mei

Keywords Paper

Keywords Paper

Sayna Ebrahimi, Suzanne Petryk, Akash Gokul and
William Gan, Joseph E Gonzalez, Marcus Rohrbach, trevor darrell

Keywords Paper

Keywords Paper

Keywords Paper

Roland S. Zimmermann, Yash Sharma, Steffen Schneider and
Matthias Bethge, Wieland Brendel

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper