Active World Model Learning in Agent-rich Environments with Progress Curiosity

Abstract: World models are a family of predictive models that solve self-supervised problems on how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high-bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we simulate a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world stimuli. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning-progress-based curiosity signal. We show that $\gamma$-Progress is robust to "white noise" and naturally gives rise to an exploration policy that allocates attention in a balanced manner, with a preference towards agents displaying complex yet learnable behaviors. As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.

12/07/2020

Probing Emergent Semantics in Predictive Agents via Question Answering

Abhishek Das, Federico Carnevale, Hamza Merzic, Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Greg Wayne, Felix Hill

Domain Adaptation, Third-Person Imitation, Observational Imitation, Reinforcement Learning, Machine Learning, Mutual Information, Imitation Learning

4:51

03/05/2021

Partial Amortization, Model Predictive Control, Planning, Mutual Information, Skill Discovery, World Models, Model-Based Reinforcement Learning

5:10

03/05/2021

Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments

Daochen Zha, Wenye Ma, Lei Yuan, Xia Hu, Ji Liu

Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Tim Seyde, Igor Gilitschenski, Wilko Schwarting, Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

Zhaohan Guo, Bernardo Avila Pires, Mohammad Gheshlaghi Azar, Bilal Piot, Florent Altché, Jean-Bastien Grill, Remi Munos

Dong Ki Kim, Miao Liu, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan How


Reinforcement Learning and Planning, Multi-Agent RL, Algorithms, Representation Learning, Algorithms, Relational Learning

5:20

18/07/2021