Improved POMDP Tree Search Planning with Prioritized Action Branching

Abstract: Online solvers for partially observable Markov decision processes have difficulty scaling to problems with large action spaces. This paper proposes a method called PA-POMCPOW to sample a subset of the action space that provides varying mixtures of exploitation and exploration for inclusion in a search tree. The proposed method first evaluates the action space according to a score function that is a linear combination of expected reward and expected information gain. The actions with the highest score are then added to the search tree during tree expansion. Experiments show that PA-POMCPOW is able to outperform existing state-of-the-art solvers on problems with large discrete action spaces.
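
The scoring step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The convex weighting `lam`, the particle belief, the entropy-based information gain estimate, the deterministic observation model, and every name below (`prioritized_actions`, `reward_fn`, `obs_fn`, the toy actions) are assumptions made for illustration. The sketch only shows how a linear combination of expected reward and expected information gain could rank actions before the top-scoring ones are offered to a search tree.

```python
import math
from collections import Counter


def entropy(weights):
    """Shannon entropy (in nats) of a discrete distribution given as unnormalized weights."""
    total = sum(weights)
    return -sum((w / total) * math.log(w / total) for w in weights if w > 0)


def expected_reward(belief, action, reward_fn):
    """Mean immediate reward of `action` over equally weighted belief particles."""
    return sum(reward_fn(s, action) for s in belief) / len(belief)


def expected_info_gain(belief, action, obs_fn):
    """Prior entropy minus expected posterior entropy.

    Assumes a deterministic observation model obs_fn(state, action) and
    hashable states; both are simplifications for this sketch.
    """
    groups = {}
    for s in belief:
        groups.setdefault(obs_fn(s, action), []).append(s)
    n = len(belief)
    posterior = sum(
        len(g) / n * entropy(list(Counter(g).values())) for g in groups.values()
    )
    return entropy(list(Counter(belief).values())) - posterior


def prioritized_actions(belief, actions, reward_fn, obs_fn, lam=0.5, k=2):
    """Return the k actions with the highest combined score.

    score(a) = lam * E[reward] + (1 - lam) * E[information gain]
    (the convex weighting is an assumed form of the linear combination).
    """
    def score(a):
        return (lam * expected_reward(belief, a, reward_fn)
                + (1 - lam) * expected_info_gain(belief, a, obs_fn))
    return sorted(actions, key=score, reverse=True)[:k]


# Hypothetical toy problem: a target hides in one of four cells.
belief = [0, 1, 2, 3]                      # uniform particle belief
actions = ["grab0", "grab3", "sense_half", "wait"]


def reward_fn(state, action):
    # Reward 1 only when grabbing the cell that holds the target.
    return 1.0 if action == f"grab{state}" else 0.0


def obs_fn(state, action):
    # Only the sensing action reveals anything: which half holds the target.
    return state < 2 if action == "sense_half" else None


print(prioritized_actions(belief, actions, reward_fn, obs_fn, lam=0.5, k=2))
```

Setting `lam=1.0` in this sketch ranks actions by expected reward alone, while smaller values favor the informative sensing action, which mirrors the mixture of exploitation and exploration that the abstract describes.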

02/02/2021

Deep Learning, Generative Models, Applications, Computer Vision; Applications, Visual Scene Analysis and Interpretation; Deep Learning, Adversarial Network, Algorithms, Online Learning Algorithms

03/05/2021

John Mern, Anil Yildiz, Lawrence Bush, Tapan Mukerji, Mykel J. Kochenderfer

Similar Papers

Bayesian Optimized Monte Carlo Planning

John Mern, Anil Yildiz, Zachary Sunberg, Tapan Mukerji, Mykel J. Kochenderfer

Pareto Optimization for Subset Selection with Dynamic Partition Matroid Constraints

Anh Viet Do, Frank Neumann

Efficient Online Estimation of Causal Effects by Deciding What to Observe

Shantanu Gupta, Zachary Lipton, David Childers

reinforcement learning and planning, graph learning, causality

Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions

Lars Buesing, Nicolas Heess, Theophane Weber

Online Bayesian Goal Inference for Boundedly Rational Planning Agents

Tan Zhi-Xuan, Jordyn Mann, Tom Silver, Josh Tenenbaum, Vikash Mansinghka

Online Action Recognition

Alejandro Suárez-Hernández, Javier Segovia-Aguas, Carme Torras, Guillem Alenyà

ME-MCTS: Online Generalization by Combining Multiple Value Estimators

Hendrik Baier, Michael Kaisers

Planning and Scheduling, Markov Decision Processes, Game Playing

Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits

Reda Ouhamma, Rémy Degenne, Vianney Perchet, Pierre Gaillard

bandits, online learning

Query-level early exit for additive learning-to-rank ensembles

Claudio Lucchese, Franco Maria Nardini, Salvatore Orlando, Raffaele Perego, Salvatore Trani

efficiency/effectiveness trade-offs, query-level early exit, additive regression trees, learning to rank

Joint Online Learning and Decision-making via Dual Mirror Descent

Alfonso Lobos Ruiz, Paul Grigas, Zheng Wen

Deep Learning, Generative Models, Applications, Computer Vision; Applications, Visual Scene Analysis and Interpretation; Deep Learning, Adversarial Network, Algorithms, Online Learning Algorithms

Learning to Make Decisions via Submodular Regularization

Ayya Alieva, Aiden Aceves, Jialin Song, Stephen Mayo, Yisong Yue, Yuxin Chen

Learning Gaussian Graphical Models via Multiplicative Weights

Anamay Chaturvedi, Jonathan Scarlett

Kernel Methods for Cooperative Multi-Agent Learning with Delays

Abhimanyu Dubey, Alex 'Sandy' Pentland

Planning, Control, and Multiagent Learning

Learning Aggregation Functions

Giovanni Pellegrini, Alessandro Tibo, Paolo Frasconi, Andrea Passerini, Manfred Jaeger

Machine Learning, Deep Learning, Multi-instance; Multi-label; Multi-view learning, Relational Learning

Optimization Methods for Interpretable Differentiable Decision Trees Applied to Reinforcement Learning

Andrew Silva, Matthew Gombolay, Taylor Killian, Ivan Jimenez, Sung-Hyun Son

Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search

Linnan Wang, Rodrigo Fonseca, Yuandong Tian

A Regression Approach to Learning-Augmented Online Algorithms

Keerti Anand, Rong Ge, Amit Kumar, Debmalya Panigrahi

theory, optimization

Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients

Brenden Petersen, Mikel Landajuela Larma, Terrell N Mundhenk, Claudio Santiago, Soo Kim, Joanne Kim

reinforcement learning, automated machine learning, symbolic regression

Regularized Online Allocation Problems: Fairness and Beyond

Santiago Balseiro, Haihao Lu, Vahab Mirrokni

Algorithms, Online Learning Algorithms

Policy Optimization as Online Learning with Mediator Feedback

Alberto Maria Metelli, Matteo Papini, Pierluca D'Oro, Marcello Restelli

Online Algorithms for Weighted Paging with Predictions

Zhihao Jiang, Debmalya Panigrahi and Kevin Sun

Online algorithms, paging

Through the Lens of Sequence Submodularity

Sara Bernardini, Fabio Fagnani, Chiara Piacentini
