Learning General Planning Policies from Small Examples Without Supervision

02/02/2021

Learning General Planning Policies from Small Examples Without Supervision

Guillem Francès, Blai Bonet, Hector Geffner

Keywords:

Abstract Paper Similar Papers

Abstract: Generalized planning is concerned with the computation of general policies that solve multiple instances of a planning domain all at once. It has been recently shown that these policies can be computed in two steps: first, a suitable abstraction in the form of a qualitative numerical planning problem (QNP) is learned from sample plans, then the general policies are obtained from the learned QNP using a planner. In this work, we introduce an alternative approach for computing more expressive general policies which does not require sample plans or a QNP planner. The new formulation is very simple and can be cast in terms that are more standard in machine learning: a large but finite pool of features is defined from the predicates in the planning examples using a general grammar, and a small subset of features is sought for separating “good” from “bad” state transitions, and goals from non-goals. The problems of finding such a “separating surface” while labeling the transitions as “good” or “bad” are jointly addressed as a single combinatorial optimization problem expressed as a Weighted Max-SAT problem. The advantage of looking for the simplest policy in the given feature space that solves the given examples, possibly non-optimally, is that many domains have no general, compact policies that are optimal. The approach yields general policies for a number of benchmark domains.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948959

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

General Policies, Representations, and Planning Width

Blai Bonet, Hector Geffner

Keywords Paper

0

0

0

0

18:23

03/05/2021

Differentiable Segmentation of Sequences

Erik Scharwächter, Jonathan Lennartz, Emmanuel Müller

Keywords Paper

warping functions, concept drift, change point detection, segmented models, segmentation, gradient descent

0

1

0

0

5:10

26/10/2020

Learning Neural Search Policies for Classical Planning

Pawel Gomoluch, Dalal Alrajeh, Alessandra Russo, Antonio Bucchiarone

Keywords Paper

classical planning, policy search, machine learning, cross-entropy method

0

0

0

0

9:55

26/08/2020

MAP Inference for Customized Determinantal Point Processes via Maximum Inner Product Search

Insu Han, Jennifer Gillenwater

Keywords Paper

0

0

0

0

16:01

02/02/2021

Endomorphisms of Classical Planning Tasks

Rostislav Horčík, Daniel Fišer

Keywords Paper

0

0

0

0

19:13

06/12/2020

Learning Linear Programs from Optimal Decisions

Yingcong Tan, Daria Terekhov, Andrew Delong

Keywords Paper

, Applications -> Privacy, Anonymity, and Security

0

0

0

0

3:21

12/07/2020

Efficient Continuous Pareto Exploration in Multi-Task Learning

Pingchuan Ma, Tao Du, Wojciech Matusik

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

14:56

02/02/2021

A Unified Framework for Planning with Learned Neural Network Transition Models

Buser Say

Keywords Paper

0

0

0

0

18:41

18/07/2021

World Model as a Graph: Learning Latent Landmarks for Planning

Lunjun Zhang, Ge Yang, Bradly Stadie

Keywords Paper

Applications, Computer Vision, Algorithms, Classification; Applications, Computational Social Science; Applications, Visual Scene Analysis and Interpret, Reinforcement Learning and Planning, Deep RL

0

0

0

0

12:48

03/05/2021

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner and
Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords Paper

reinforcement learning, goal reaching, Q-learning

0

0

0

0

4:49

02/02/2021

Multi-Party Campaigning

Martin Koutecký, Nimrod Talmon

Keywords Paper

0

0

0

0

20:36

15/06/2020

Question selection for interactive program synthesis

Ruyi Ji, Jingjing Liang, Yingfei Xiong and
Lu Zhang, Zhenjiang Hu

Keywords Paper

Program Synthesis, Interaction

0

0

0

0

16:07

26/08/2020

'Bring Your Own Greedy'+Max: Near-Optimal 1/2-Approximations for Submodular Knapsack

Grigory Yaroslavtsev, Samson Zhou, Dmitrii Avdiukhin

Keywords Paper

0

0

0

0

13:14

26/08/2020

Finite-Time Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation

Jun Sun, Gang Wang, Georgios B. Giannakis and
Qinmin Yang, Zaiyue Yang

Keywords Paper

0

0

0

0

17:07

06/12/2020

Distributionally Robust Federated Averaging

Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

Keywords Paper

0

0

0

0

3:11

14/06/2020

Efficient and Robust Shape Correspondence via Sparsity-Enforced Quadratic Assignment

Rui Xiang, Rongjie Lai, Hongkai Zhao

Keywords Paper

shape matching, non-rigid transformation, point cloud matching, quadratic assignment, sparsity control, laplace-beltrami operator, local distortion

0

0

0

0

1:01

18/07/2021

The Power of Adaptivity for Stochastic Submodular Cover

Rohan Ghuge, Anupam Gupta, viswanath nagarajan

Keywords Paper

Optimization, Stochastic Optimization

0

0

0

0

16:47

18/07/2021

Regularized Submodular Maximization at Scale

Ehsan Kazemi, shervin minaee, Moran Feldman, Amin Karbasi

Keywords Paper

Optimization, Combinatorial Optimization

0

0

0

0

5:17

06/12/2021

Neural Algorithmic Reasoners are Implicit Planners

Andreea-Ioana Deac, Petar Veličković, Ognjen Milinkovic and
Pierre-Luc Bacon, Jian Tang, Mladen Nikolic

Keywords Paper

deep learning, reinforcement learning and planning, self-supervised learning, generative model, graph learning

0

0

0

0

13:10

06/12/2021

Automatic and Harmless Regularization with Constrained and Lexicographic Optimization: A Dynamic Barrier Approach

Chengyue Gong, Xingchao Liu, Qiang Liu

Keywords Paper

optimization, machine learning, graph learning

0

0

0

0

15:32

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

06/12/2021

Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach

Fan Yang, Kai He, Linxiao Yang and
Hongxia Du, Jingbang Yang, Bo Yang, Liang Sun

Keywords Paper

optimization

0

0

0

0

4:43

19/08/2021

GSPL: A Succinct Kernel Model for Group-Sparse Projections Learning of Multiview Data

Danyang Wu, Jin Xu, Xia Dong and
Meng Liao, Rong Wang, Feiping Nie, Xuelong Li

Keywords Paper

Machine Learning, Learning Sparse Models, Multi-instance; Multi-label; Multi-view learning, Unsupervised Learning

0

0

0

0

11:48

14/09/2020

A General Machine Learning Framework for Survival Analysis

Andreas Bender, David Rügamer, Fabian Scheipl, Bernd Bischl

Keywords Paper

survival analysis, gradient boosting, neural networks, competing risks, multi-state models

0

0

0

0

13:37

03/05/2021

Wasserstein-2 Generative Networks

Alexander Korotin, Vage Egiazarian, Arip Asadulaev and
Alexander Safin, Evgeny Burnaev

Keywords Paper

input-convex neural networks, cycle-consistency regularization, non-minimax optimization, optimal transport maps, wasserstein-2 distance

0

0

0

1

5:10

06/12/2021

Parameter Inference with Bifurcation Diagrams

Gregory Szep, Neil Dalchau, Attila Csikász-Nagy

Keywords Paper

theory, generative model

0

0

0

0

15:07

03/05/2021

Economic Hyperparameter Optimization With Blended Search Strategy

Chi Wang, Qingyun Wu, Silu Huang, Amin Saied

Keywords Paper

COST, HYPERPARAMETER OPTIMIZATION

0

0

0

0

5:09

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords Paper

machine learning, online learning

0

0

0

0

14:00

12/07/2020

Conditional gradient methods for stochastically constrained convex minimization

Maria-Luiza Vladarean, Ahmet Alacaoglu, Ya-Ping Hsieh, Volkan Cevher

Keywords Paper

Optimization - Convex

0

0

0

0

14:50

18/07/2021

A Scalable Deterministic Global Optimization Algorithm for Clustering Problems

Kaixun Hua, Mingfei Shi, Yankai Cao

Keywords Paper

Algorithms, Clustering, Algorithms, AutoML, Optimization, Combinatorial Optimization

0

0

0

0

11:40

06/12/2020

Dual-Free Stochastic Decentralized Optimization with Variance Reduction

Hadrien Hendrikx, Francis Bach, Laurent Massoulié

Keywords Paper

0

0

0

0

3:28

09/07/2020

Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal

Alekh Agarwal, Sham Kakade, Lin Yang

Keywords Paper

Reinforcement learning, Sampling algorithms

0

0

0

0

15:13

02/02/2021

Submodular Span, with Applications to Conditional Data Summarization

Lilly Kumari, Jeff Bilmes

Keywords Paper

0

0

0

0

16:22

12/07/2020

dS^2LBI: Exploring Structural Sparsity on Deep Network via Differential Inclusion Paths

Yanwei Fu, Chen Liu, Donghao Li and
Xinwei Sun, Jinshan ZENG, Yuan Yao

Keywords Paper

Deep Learning - Algorithms

0

0

0

1

12:45

02/02/2021

On Exploiting Hitting Sets for Model Reconciliation

Stylianos Loukas Vasileiou, Alessandro Previti, William Yeoh

Keywords Paper

0

0

0

0

16:27

12/07/2020

Flexible and Efficient Long-Range Planning Through Curious Exploration

Aidan Curtis, Minjian Xin, Dilip Arumugam and
Kevin Feigelis, Daniel Yamins

Keywords Paper

Reinforcement Learning - General

0

0

0

0

16:25

06/12/2020

Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors

Karl Pertsch, Oleh Rybkin, Frederik Ebert and
Shenghao Zhou, Dinesh Jayaraman, Chelsea Finn, Sergey Levine

Keywords Paper

Applications -> Robotics; Reinforcement Learning and Planning -> Exploration; Reinforcement Learning and Planning -> Reinforcem, Algorithms -> Multitask and Transfer Learning

0

0

0

0

3:16

13/04/2021

Abstract value iteration for hierarchical reinforcement learning

Kishor Jothimurugan, Osbert Bastani, Rajeev Alur

Keywords Paper

0

0

0

0

2:57

14/09/2020

Model-based Clustering with HDBSCAN*

Michael Strobl, Joerg Sander, Ricardo Campello, Osmar Zaiane

Keywords Paper

hierarchical clustering, expectation maximization, model selection

0

0

0

0

15:31

18/07/2021

Bayesian Deep Learning via Subnetwork Inference

Erik Daxberger, Eric Nalisnick, James Allingham and
Javier Antorán, Jose Miguel Hernandez-Lobato

Keywords Paper

, Reinforcement Learning and Planning, Multi-Agent RL, Deep Learning, Bayesian Deep Learning

0

0

0

0

5:18