Integrating Acting, Planning, and Learning in Hierarchical Operational Models

26/10/2020

Integrating Acting, Planning, and Learning in Hierarchical Operational Models

Sunandita Patra, James Mason, Amit Kumar, Malik Ghallab, Paolo Traverso, Dana Nau

Keywords: integrated planning and acting, integrated planning and learning, hierarchical operational models, online planning, dynamic environments

Abstract Paper Similar Papers

Abstract: We present new planning and learning algorithms for use with the RAE (Refinement Acting Engine) acting procedure (Ghallab et al., 2016). RAE uses hierarchical operational models to perform tasks in dynamically changing environments. Our planning algorithm, UPOM, does a UCT-like search in the space of operational models in order to tell RAE which operational model to use for each task. Our learning strategies acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. Our experimental results show that UPOM and our learning strategies significantly improve RAE’s performance in four test domains using two different metrics: efficiency and success ratio.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICAPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

03/05/2021

Latent Skill Planning for Exploration and Transfer

Kevin Xie, Homanga Bharadhwaj, Danijar Hafner and
Animesh Garg, Florian Shkurti

Keywords Paper

Partial Amortization, Model Predictive Control, Planning, Mutual Information, Skill Discovery, World Models, Model-Based Reinforcement Learning

0

0

0

0

5:10

03/05/2021

Adaptive Procedural Task Generation for Hard-Exploration Problems

Kuan Fang, Yuke Zhu, Silvio Savarese, Li Fei-Fei

Keywords Paper

reinforcement learning, task generation, procedural generation, curriculum learning

0

0

0

0

5:06

19/08/2021

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

Weinan Zhang, Xihuai Wang, Jian Shen, Ming Zhou

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Multi-agent Learning

0

0

0

0

13:10

26/04/2020

Meta-Q-Learning

Rasool Fakoor, Pratik Chaudhari, Stefano Soatto, Alexander J. Smola

Keywords Paper

meta reinforcement learning, propensity estimation, off-policy

0

0

0

0

15:50

03/05/2021

Planning from Pixels using Inverse Dynamics Models

Keiran Paster, Sheila McIlraith, Jimmy Ba

Keywords Paper

model based reinforcement learning, deep learning, goal-conditioned reinforcement learning, deep reinforcement learning, multi-task learning

0

0

0

0

4:15

12/07/2020

Orthogonalized SGD and Nested Architectures for Anytime Neural Networks

Chengcheng Wan, Henry (Hank) Hoffmann, Shan Lu, Michael Maire

Keywords Paper

Deep Learning - General

0

0

0

0

14:36

06/12/2020

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

Keywords Paper

0

0

0

0

2:42

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

19/08/2021

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks

Menghui Zhu, Minghuan Liu, Jian Shen and
Zhicheng Zhang, Sheng Chen, Weinan Zhang, Deheng Ye, Yong Yu, Qiang Fu, Wei Yang

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Reinforcement Learning

0

0

0

0

11:28

14/06/2020

PointAugment: An Auto-Augmentation Framework for Point Cloud Classification

Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu

Keywords Paper

auto-augmentation framework, point cloud processing, sample-aware, jointly optimizing, classification

0

0

0

0

5:01

06/12/2020

EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning

Jiachen Li, Fan Yang, Masayoshi Tomizuka, Chiho Choi

Keywords Paper

0

0

0

0

3:21

02/02/2021

Harmonized Dense Knowledge Distillation Training for Multi-Exit Architectures

Xinglu Wang, Yingming Li

Keywords Paper

0

0

0

0

15:12

26/04/2020

RaCT: Toward Amortized Ranking-Critical Training For Collaborative Filtering

Sam Lobel, Chunyuan Li, Jianfeng Gao, Lawrence Carin

Keywords Paper

Collaborative Filtering, Recommender Systems, Actor-Critic, Learned Metrics

0

0

0

0

5:27

14/09/2020

Learning a Sequence of Sentiment Classification Tasks

Zixuan Ke, Bing Liu, Hao Wang, Lei Shu

Keywords Paper

0

0

0

0

14:23

25/07/2020

Sequential recommendation with self-attentive multi-adversarial network

Ruiyang Ren, Zhaoyang Liu, Yaliang Li and
Wayne Xin Zhao, Hui Wang, Bolin Ding, Ji-Rong Wen

Keywords Paper

sequential recommendation, adversarial training, self-attentive mechanism

0

0

0

0

15:12

06/12/2021

Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

Desi R Ivanova, Adam Foster, Steven Kleinegesse and
Michael Gutmann, Thomas Rainforth

Keywords Paper

0

0

0

0

14:45

06/12/2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and
Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Paper

0

0

0

0

3:20

02/02/2021

UBAR: Towards Fully End-to-End Task-Oriented Dialog System with GPT-2

Yunyi Yang, Yunhao Li, Xiaojun Quan

Keywords Paper

0

0

0

0

19:38

12/07/2020

Working Memory Graphs

Ricky Loynd, Roland Fernandez, Asli Celikyilmaz and
Adith Swaminathan, Matthew Hausknecht

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:36

13/04/2021

A theory of multiple-source adaptation with limited target labeled data

Yishay Mansour, Mehryar Mohri, Jae Ro and
Ananda Theertha Suresh, Ke Wu

Keywords Paper

0

0

0

0

2:39

26/10/2020

Exploring Context-Free Languages via Planning: The Case for Automating Machine Learning

Michael Katz, Parikshit Ram, Shirin Sohrabi, Octavian Udrea

Keywords Paper

Context-Free Grammar, HTN Planning, Classical Planning, AutoML

0

0

0

0

9:25

03/05/2021

Learning Task Decomposition with Ordered Memory Policy Network

Yuchen Lu, Yikang Shen, Siyuan Zhou and
Aaron Courville, Joshua B Tenenbaum, Chuang Gan

Keywords Paper

Task Segmentation, Network Inductive Bias, Hierarchical Imitation Learning

0

0

0

0

4:57

04/07/2020

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

Ryuichi Takanobu, Runze Liang, Minlie Huang

Keywords Paper

pretraining, Multi-Agent Learning, Role-Aware Decomposition, reinforcement learning

0

0

0

0

13:00

13/04/2021

On the importance of hyperparameter optimization for model-based reinforcement learning

Baohe Zhang, Raghu Rajan, Luis Pineda and
Nathan Lambert, André Biedenkapp, Kurtland Chua, Frank Hutter, Roberto Calandra

Keywords Paper

0

0

0

0

2:59

06/12/2021

On Effective Scheduling of Model-based Reinforcement Learning

Hang Lai, Jian Shen, Weinan Zhang and
Yimin Huang, Xing Zhang, Ruiming Tang, Yong Yu, Zhenguo Li

Keywords Paper

optimization, reinforcement learning and planning

0

0

0

0

10:28

06/12/2021

Heuristic-Guided Reinforcement Learning

Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

8:39

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

06/12/2020

Multi-Stage Influence Function

Hongge Chen, Si Si, Yang Li and
Ciprian Chelba, Sanjiv Kumar, Duane Boning, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

3:23

06/12/2020

Automatic Curriculum Learning through Value Disagreement

Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto

Keywords Paper

0

0

0

0

3:17

12/07/2020

Provably Efficient Model-based Policy Adaptation

Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:49

06/12/2021

Meta-learning to Improve Pre-training

Aniruddh Raghu, Jonathan Lorraine, Simon Kornblith and
Matthew McDermott, David Duvenaud

Keywords Paper

deep learning, optimization, graph learning, meta learning

0

0

0

0

12:57

06/12/2020

PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Henry Charlesworth, Giovanni Montana

Keywords Paper

0

0

0

0

3:20

12/07/2020

Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data

Felipe Petroski Such, Aditya Rawal, Joel Lehman and
Kenneth Stanley, Jeffrey Clune

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

7:25

12/07/2020

Causal Modeling for Fairness In Dynamical Systems

Elliot Creager, David Madras, Toniann Pitassi, Richard Zemel

Keywords Paper

Fairness, Equity, Justice, and Safety

0

0

0

0

13:02

06/12/2021

Generalized Proximal Policy Optimization with Sample Reuse

James Queeney, Yannis Paschalidis, Christos G Cassandras

Keywords Paper

optimization, reinforcement learning and planning

0

0

0

0

13:45

06/12/2020

Forethought and Hindsight in Credit Assignment

Veronica Chelu, Doina Precup, Hado van Hasselt

Keywords Paper

0

0

0

0

3:18

03/05/2021

Teaching with Commentaries

Aniruddh Raghu, Maithra Raghu, Simon Kornblith and
David Duvenaud, Geoffrey Hinton

Keywords Paper

hypergradients, metalearning, learning to teach

0

0

0

0

5:11

04/11/2020

Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads

Deepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka and
Amar Phanishayee, Matei Zaharia

Keywords Paper

0

0

1

0

20:14

02/02/2021

Value-Decomposition Multi-Agent Actor-Critics

Jianyu Su, Stephen Adams, Peter Beling

Keywords Paper

0

0

0

0

19:21

12/07/2020

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

Zhaohan Guo, Bernardo Avila Pires, Mohammad Gheshlaghi Azar and
Bilal Piot, Florent Altché, Jean-Bastien Grill, Remi Munos

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

12:47