Monte Carlo Tree Search With Iteratively Refining State Abstractions

06/12/2021

Samuel Sokota, Caleb Y Ho, Zaheen Ahmad, J. Zico Kolter

Keywords: reinforcement learning and planning

Abstract: Decision-time planning is the process of constructing a transient, local policy with the intent of using it to make the immediate decision. Monte Carlo tree search (MCTS), which has been leveraged to great success in Go, chess, shogi, Hex, Atari, and other settings, is perhaps the most celebrated decision-time planning algorithm. Unfortunately, in its original form, MCTS can degenerate to one-step search in domains with stochasticity. Progressive widening is one way to ameliorate this issue, but we argue that it possesses undesirable properties for some settings. In this work, we present a method, called abstraction refining, for extending MCTS to stochastic environments which, unlike progressive widening, leverages the geometry of the state space. We argue that leveraging the geometry of the space can offer advantages. To support this claim, we present a series of experimental examples in which abstraction refining outperforms progressive widening, given equal simulation budgets.
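
To make the progressive widening baseline mentioned in the abstract concrete, the following is a minimal sketch of one common formulation of the widening rule, not the authors' implementation. It assumes a node may add a newly sampled successor state only while its number of children is at most k * N^alpha, where N is the node's visit count; the names `should_widen`, `sample_next_state`, `count_children`, and the parameters `k` and `alpha` are hypothetical and chosen for illustration.

```python
import random


def should_widen(num_children: int, visit_count: int,
                 k: float = 1.0, alpha: float = 0.5) -> bool:
    """Widening test (assumed form): allow a new child while |children| <= k * N^alpha."""
    return num_children <= k * visit_count ** alpha


def sample_next_state(children, visit_count, sample_fn, rng,
                      k: float = 1.0, alpha: float = 0.5):
    """Either sample a fresh successor state or revisit an existing child."""
    if should_widen(len(children), visit_count, k, alpha):
        state = sample_fn(rng)      # draw a new successor from the environment model
        children.append(state)
        return state
    return rng.choice(children)     # otherwise reuse a previously sampled successor


def count_children(n_visits: int, seed: int = 0) -> int:
    """Simulate n_visits to one chance node with Gaussian (continuous) transitions."""
    rng = random.Random(seed)
    children = []
    for visit in range(1, n_visits + 1):
        sample_next_state(children, visit, lambda r: r.gauss(0.0, 1.0), rng)
    return len(children)
```

Without such a rule, every simulation in a continuous stochastic domain would create a new child, so no successor is ever revisited and the search stays one step deep. With the rule, the number of children grows sublinearly in the visit count. Note that the rule itself uses only counts and ignores how close sampled states are to one another, which is the contrast the abstract draws with abstraction refining.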

The talk and the accompanying paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

18/07/2021

Outlier-Robust Optimal Transport

Debarghya Mukherjee, Aritra Guha, Justin Solomon, Yuekai Sun, Mikhail Yurochkin

Keywords: Algorithms, Meta-Learning, Algorithms, Few-Shot Learning, Algorithms, Optimal Transport

4:46

02/02/2021

Bayesian Optimized Monte Carlo Planning

John Mern, Anil Yildiz, Zachary Sunberg, Tapan Mukerji, Mykel J. Kochenderfer

18:24

26/08/2020

MAP Inference for Customized Determinantal Point Processes via Maximum Inner Product Search

Insu Han, Jennifer Gillenwater

16:01

18/07/2021

Convex Regularization in Monte-Carlo Tree Search

Tuan Q Dam, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Keywords: Reinforcement Learning and Planning

4:52

03/05/2021

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner, Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords: reinforcement learning, goal reaching, Q-learning

4:49

06/12/2020

Normalizing Kalman Filters for Multivariate Time Series Analysis

Emmanuel de Bézenac, Syama Sundar Rangapuram, Konstantinos Benidis, Michael Bohlke-Schneider, Richard Kurle, Lorenzo Stella, Hilaf Hasson, Patrick Gallinari, Tim Januschowski

3:19

06/12/2021

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

Keywords: theory, machine learning, bandits, kernel methods, active learning

7:41

23/08/2020

Diverse rule sets

Guangyi Zhang, Aristides Gionis

Keywords: sampling, classifier, pattern mining, rule learning, diversification, rule sets

9:41

13/04/2021

Regularized ERM on random subspaces

Andrea Della Vecchia, Jaouad Mourtada, Ernesto De Vito, Lorenzo Rosasco

2:57

18/07/2021

World Model as a Graph: Learning Latent Landmarks for Planning

Lunjun Zhang, Ge Yang, Bradly Stadie

Keywords: Applications, Computer Vision, Algorithms, Classification; Applications, Computational Social Science; Applications, Visual Scene Analysis and Interpret, Reinforcement Learning and Planning, Deep RL

12:48

06/12/2020

Probabilistic Inference with Algebraic Constraints: Theoretical Limits and Practical Approximations

Zhe Zeng, Paolo Morettin, Fanqi Yan, Antonio Vergari, Guy Van den Broeck

3:17

12/07/2020

Flexible and Efficient Long-Range Planning Through Curious Exploration

Aidan Curtis, Minjian Xin, Dilip Arumugam, Kevin Feigelis, Daniel Yamins

Keywords: Reinforcement Learning - General

16:25

03/08/2020

Amortized Bayesian Optimization over Discrete Spaces

Kevin Swersky, Yulia Rubanova, David Dohan, Kevin Murphy

7:40

19/08/2021

Compressing Exact Cover Problems with Zero-suppressed Binary Decision Diagrams

Masaaki Nishino, Norihito Yasuda, Kengo Nakamura

Keywords: Knowledge Representation and Reasoning, Knowledge Representation Languages, Combinatorial Search and Optimisation

14:01

06/12/2020

Learning outside the Black-Box: The pursuit of interpretable models

Jonathan Crabbe, Yao Zhang, William Zame, Mihaela van der Schaar

3:16

12/07/2020

Handling the Positive-Definite Constraint in the Bayesian Learning Rule

Wu Lin, Mark Schmidt, Mohammad Emtiyaz Khan

Keywords: Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

14:51

06/12/2020

Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

Qi Zhou, Yufei Kuang, Zherui Qiu, Houqiang Li, Jie Wang

3:10

18/07/2021

Provably Correct Optimization and Exploration with Non-linear Policies

Fei Feng, Wotao Yin, Alekh Agarwal, Lin Yang

Keywords: Deep Learning, Adversarial Networks, Applications, Fairness, Accountability, and Transparency, Theory, RL, Decisions and Control Theory

5:03

06/12/2021

Reliable Causal Discovery with Improved Exact Search and Weaker Assumptions

Ignavier Ng, Yujia Zheng, Jiji Zhang, Kun Zhang

Keywords: generative model, graph learning, causality

7:46

06/12/2020

Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search

Linnan Wang, Rodrigo Fonseca, Yuandong Tian

3:21

02/02/2021

High-Dimensional Bayesian Optimization via Tree-Structured Additive Models

Eric Han, Ishank Arora, Jonathan Scarlett

19:46

22/11/2021

Planar Shape Based Registration for Multi-modal Geometry

Muxingzi Li, Florent Lafarge

Keywords: global registration, energy minimization, geometric primitives, point cloud, polygonal mesh

3:00

12/07/2020

A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton

Risheng Liu, Pan Mu, Xiaoming Yuan, Shangzhi Zeng, Jin Zhang

Keywords: Optimization - Non-convex

13:51

12/07/2020

Stronger and Faster Wasserstein Adversarial Attacks

Kaiwen Wu, Allen Wang, Yaoliang Yu

Keywords: Adversarial Examples

14:56

06/12/2021

Local Hyper-Flow Diffusion

Kimon Fountoulakis, Pan Li, Shenghao Yang

Keywords: optimization, graph learning, clustering

14:24

26/08/2020

Learning Hierarchical Interactions at Scale: A Convex Optimization Approach

Hussein Hazimeh, Rahul Mazumder

15:07

06/12/2021

Provably efficient, succinct, and precise explanations

Guy Blanc, Jane Lange, Li-Yang Tan

Keywords: theory

10:40

02/02/2021

Accelerated Combinatorial Search for Outlier Detection with Provable Bound on Sub-Optimality

Guihong Wan, Haim Schweitzer

15:04

18/07/2021

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras, Thibault Séjourné, Rémi Flamary, Nicolas Courty

Keywords: Algorithms, Optimal Transport

4:57

06/12/2020

An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

Julian Katz-Samuels, Lalit Jain, Zohar Karnin, Kevin Jamieson

3:20

03/05/2021

Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral

Lucio Dery, Yann Dauphin, David Grangier

Keywords: multitask learning, deep learning, pre-training, gradient decomposition

5:22

12/07/2020

Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Trevor Davis, Martin Schmid, Michael Bowling

Keywords: Planning, Control, and Multiagent Learning

15:44

06/12/2021

Learning Space Partitions for Path Planning

Kevin Yang, Tianjun Zhang, Chris Cummins, Brandon Cui, Benoit Steiner, Linnan Wang, Joseph Gonzalez, Dan Klein, Yuandong Tian

Keywords: optimization, reinforcement learning and planning

14:31

26/04/2020

Stochastic AUC Maximization with Deep Neural Networks

Mingrui Liu, Zhuoning Yuan, Yiming Ying, Tianbao Yang

Keywords: Stochastic AUC Maximization, Deep Neural Networks

4:58

06/12/2020

Hamiltonian Monte Carlo using an adjoint-differentiated Laplace approximation: Bayesian inference for latent Gaussian models and beyond

Charles Margossian, Aki Vehtari, Daniel Simpson, Raj Agrawal

3:05

06/12/2021

Structured Dropout Variational Inference for Bayesian Neural Networks

Son Nguyen, Duong Nguyen, Khai Nguyen, Khoat Than, Hung Bui, Nhat Ho

Keywords: deep learning, generative model

11:28

12/07/2020

Layered Sampling for Robust Optimization Problems

Hu Ding, Zixiu Wang

Keywords: Unsupervised and Semi-Supervised Learning

13:00

06/12/2020

Beyond the Mean-Field: Structured Deep Gaussian Processes Improve the Predictive Uncertainties

Jakob Lindinger, David Reeb, Christoph Lippert, Barbara Rakitsch

3:21

18/07/2021

Leveraging Non-uniformity in First-order Non-convex Optimization

Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Keywords: Theory, RL, Decisions and Control Theory

4:49

18/07/2021

Generalised Lipschitz Regularisation Equals Distributional Robustness

Zac Cranko, Zhan Shi, Xinhua Zhang, Richard Nock, Simon Kornblith

Keywords: Algorithms, Kernel Methods

5:18