On-line Learning of Planning Domains from Sensor Data in PAL: Scaling up to Large State Spaces

Abstract: We propose an approach to learn an extensional representation of a discrete deterministic planning domain from observations in a continuous space navigated by the agent actions. This is achieved through the use of a perception function providing the likelihood of a real-value observation being in a given state of the planning domain after executing an action. The agent learns an extensional representation of the domain (the set of states, the transitions from states to states caused by actions) and the perception function on-line, while it acts for accomplishing its task. In order to provide a practical approach that can scale up to large state spaces, a “draft” intensional (PDDL-based) model of the planning domain is used to guide the exploration of the environment and learn the states and state transitions. The proposed approach uses a novel algorithm to (i) construct the extensional representation of the domain by interleaving symbolic planning in the PDDL intensional representation and search in the state transition graph of the extensional representation; (ii) incrementally refine the intensional representation taking into account information about the actions that the agent cannot execute. An experimental analysis shows that the novel approach can scale up to large state spaces, thus overcoming the limits in scalability of the previous work.

13/04/2021

Automated reasoning, Epistemic reasoning, Planning, Multi-agent, Knowledge representation, Non-well-founded sets, Kripke structures

9:53

06/12/2021

Optimization -> Non-Convex Optimization; Theory -> Computational Complexity; Theory -> Learning Theory, Deep Learning -> Optimization for Deep Networks

3:19

06/12/2020

Zhaohan Guo, Bernardo Avila Pires, Mohammad Gheshlaghi Azar and
Bilal Piot, Florent Altché, Jean-Bastien Grill, Remi Munos

Applications -> Robotics; Reinforcement Learning and Planning -> Exploration; Reinforcement Learning and Planning -> Reinforcem, Algorithms -> Multitask and Transfer Learning

3:16

02/02/2021

Deep Learning, Generative Models, Applications, Computational Biology and Bioinformatics, Reinforcement Learning and Planning, Deep RL

5:44

26/04/2020

Vivek Veeriah, Tom Zahavy, Matteo Hessel and
Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Menghui Zhu, Minghuan Liu, Jian Shen and
Zhicheng Zhang, Sheng Chen, Weinan Zhang, Deheng Ye, Yong Yu, Qiang Fu, Wei Yang

Abhishek Das, Federico Carnevale, Hamza Merzic and
Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Greg Wayne, Feilx Hill