Hierarchical IW

Implementation of the paper Hierarchical Width-Based Planning and Learning, appearing in the Proceedings of the 31st International Conference on Automated Planning and Scheduling (ICAPS 2021).

Summary

In this paper, we present a hierarchical approach to width-based planning. Based on two sets of high- and low-level features, we partition the state space into high- and low-level states, where each high-level state contains a subset of low-level states. Our approach generates each new high-level node by running a low-level search until it finds a state with different high-level features (i.e., a state that belongs to another high-level state). We use this approach to apply width-based planners at two levels of abstraction, and show that the width of a problem can be reduced by choosing appropriate high-level features. We present experiments in two settings:

Classical planning: We incrementally discover high-level feature candidates and show that our hierarchical approach HIW(1,1) (i.e., using width 1 at both levels of abstraction) outperforms IW(2) in several domains.
Pixel-based environments: We extend pi-IW with our hierarchical approach, producing pi-HIW, which learns a policy and a value function from the hierarchical plan and uses them to guide the low-level search. We use a downsampling of the image as high-level features, and show a large improvement over the baseline in sparse-reward Atari games where an agent moves against a fixed background (e.g. Montezuma's Revenge).
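The node-generation step described in the summary can be illustrated on a toy problem. The following is a minimal sketch, not the code in this repository: the grid world, the feature functions and all names (`step`, `low_features`, `high_features`, `low_level_search`) are assumptions made for illustration. Low-level features are the agent's coordinates, high-level features are the coarse block that contains the agent (a crude analogue of downsampling), and the low-level search is a breadth-first IW(1) that prunes states bringing no new feature value and stops as soon as the high-level features change.

```python
from collections import deque

# Toy deterministic grid world; every name here is illustrative.
SIZE = 6    # the grid is SIZE x SIZE
BLOCK = 3   # each high-level state is a BLOCK x BLOCK group of cells
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]


def step(state, action):
    """Move inside the grid; bumping into a wall leaves the state unchanged."""
    x, y = state[0] + action[0], state[1] + action[1]
    if 0 <= x < SIZE and 0 <= y < SIZE:
        return (x, y)
    return state


def low_features(state):
    """Low-level features: one atom per coordinate value."""
    return {("x", state[0]), ("y", state[1])}


def high_features(state):
    """High-level features: the coarse block that contains the state."""
    return (state[0] // BLOCK, state[1] // BLOCK)


def low_level_search(start):
    """Breadth-first IW(1) from `start`, stopping at the first state whose
    high-level features differ from those of `start`.

    Returns (new_state, actions) or None if the pruned search is exhausted.
    """
    seen_atoms = set(low_features(start))
    queue = deque([(start, [])])
    while queue:
        state, path = queue.popleft()
        for action in ACTIONS:
            nxt = step(state, action)
            if high_features(nxt) != high_features(start):
                return nxt, path + [action]  # entered another high-level state
            new_atoms = low_features(nxt) - seen_atoms
            if not new_atoms:
                continue  # width-1 pruning: no atom seen for the first time
            seen_atoms |= new_atoms
            queue.append((nxt, path + [action]))
    return None


result = low_level_search((0, 0))
```

Starting from `(0, 0)`, the search leaves the initial block after three moves. A high-level planner would then treat the returned state as the root of a new high-level node and repeat the process from there.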

Experiments

The experiments of the paper can be reproduced with the scripts incremental_HIW.py and pi_HIW.py, for the classical planning and pixel-based environments, respectively. To illustrate intermediate steps of these two scripts, we provide two more: planning_step.py, which performs a single planning step so that the resulting high-level feature candidates can be inspected, and online_hierarchical_planning.py, which performs on-line replanning with our hierarchical approach without learning. The scripts can be run with default parameters (which can be changed in the script itself) or with console arguments as follows:

python3 pi_HIW.py --hierarchical True --seed 1234 --env MontezumaRevengeNoFrameskip-v4 --atari-frameskip 15

See the help (-h) section for more details.
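Passing booleans such as `--hierarchical True` on the command line needs some care in Python, because `argparse` with `type=bool` treats any non-empty string (including "False") as true. The sketch below shows one common way to handle this. It is an illustration only and not the parser used by the scripts: the `str2bool` helper and all default values are assumptions, and only the flag names are taken from the example command above.

```python
import argparse


def str2bool(value):
    """Parse 'True'/'False'-style strings into a real boolean."""
    if isinstance(value, bool):
        return value
    lowered = value.strip().lower()
    if lowered in ("true", "t", "yes", "y", "1"):
        return True
    if lowered in ("false", "f", "no", "n", "0"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    # Illustrative parser; the default values are assumptions.
    parser = argparse.ArgumentParser(description="Illustrative argument parser")
    parser.add_argument("--hierarchical", type=str2bool, default=True)
    parser.add_argument("--seed", type=int, default=0)
    parser.add_argument("--env", type=str, default="MontezumaRevengeNoFrameskip-v4")
    parser.add_argument("--atari-frameskip", type=int, default=15)
    return parser


# Parse an explicit argument list instead of sys.argv so the example is self-contained.
args = build_parser().parse_args(
    ["--hierarchical", "False", "--seed", "1234", "--atari-frameskip", "15"]
)
```

Note that `argparse` exposes `--atari-frameskip` as `args.atari_frameskip`, with the dash replaced by an underscore.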

For Atari games, use the deterministic version of the gym environments, which can be specified by selecting v4 environments (e.g. "Breakout-v4"). Although the "NoFrameskip" environment is given, we set the frameskip anyway with the --atari-frameskip parameter (15 in our experiments).
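To make the effect of the frameskip parameter concrete, here is a minimal sketch of a frame-skipping wrapper: the same action is repeated for a fixed number of emulator frames and the rewards are summed. It is an assumption about how such a wrapper typically works, not a copy of atari_wrappers.py. `CountingEnv` is a hypothetical stand-in for a "NoFrameskip" environment, and the classic four-value `step` return (observation, reward, done, info) is assumed.

```python
class CountingEnv:
    """Hypothetical stand-in for a NoFrameskip environment: one frame per step."""

    def __init__(self, episode_length=40):
        self.episode_length = episode_length
        self.frame = 0

    def reset(self):
        self.frame = 0
        return self.frame

    def step(self, action):
        self.frame += 1
        reward = 1.0  # constant reward, so summed rewards are easy to check
        done = self.frame >= self.episode_length
        return self.frame, reward, done, {}


class FrameSkip:
    """Repeat each action for `skip` frames and accumulate the reward."""

    def __init__(self, env, skip):
        if skip < 1:
            raise ValueError("skip must be at least 1")
        self.env = env
        self.skip = skip

    def reset(self):
        return self.env.reset()

    def step(self, action):
        total_reward = 0.0
        for _ in range(self.skip):
            obs, reward, done, info = self.env.step(action)
            total_reward += reward
            if done:
                break  # do not step past the end of the episode
        return obs, total_reward, done, info


env = FrameSkip(CountingEnv(episode_length=40), skip=15)
env.reset()
first = env.step(0)
second = env.step(0)
third = env.step(0)
```

With a frameskip of 15 and a 40-frame episode, the third call ends the episode after only 10 frames, which is why the loop checks `done` after every inner step.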

Installation

Install the requirements (listed in requirements.txt).
Make sure that gridenvs and pddl2gym are on the Python path.
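A quick way to verify the second step is to ask Python whether it can locate both packages before running any script. The snippet below is a small sketch using only the standard library. It assumes that the importable module names match the package names given above (`gridenvs` and `pddl2gym`), and the commented path is a placeholder.

```python
import importlib.util
import sys


def missing_modules(names):
    """Return the top-level module names that Python cannot locate."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# If a package lives in a local checkout, its parent directory can be added
# at run time (placeholder path, adjust to your setup):
# sys.path.insert(0, "/path/to/checkout")

missing = missing_modules(["gridenvs", "pddl2gym"])
if missing:
    print("Not importable:", ", ".join(missing))
else:
    print("gridenvs and pddl2gym are importable.")
```

Setting the `PYTHONPATH` environment variable before launching Python has the same effect as the `sys.path.insert` call and does not require editing any script.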

