26/10/2020

Convex Hull Monte-Carlo Tree-Search

Michael Painter, Bruno Lacerda, Nick Hawes

Keywords: Multi-Objective Planning, Trial Based Heuristic Tree Search, Contextual Multi-Armed Bandits, Multi-Objective Optimisation

Abstract: This work investigates Monte-Carlo planning for agents in stochastic (and potentially large) environments that may have multiple objectives whose priorities are not known a priori or are not easy to quantify. We propose Convex Hull Monte-Carlo Tree-Search, which builds upon Trial Based Heuristic Tree Search and Convex Hull Value Iteration, as a solution to planning with multiple objectives in large environments. Moreover, we consider how to pose the problem of multi-objective planning as a contextual multi-armed bandit problem, giving a principled motivation for how to select actions from the view of contextual regret.

Talk and the respective paper are published at the ICAPS 2020 virtual conference.


Similar Papers