Dreamer PyTorch

PA: This repository is in maintenance mode. No new features will be added but bugfixes and contributions are welcome. Please create a pull request with any fixes you have!

Dream to Control: Learning Behaviors by Latent Imagination

Paper: https://arxiv.org/abs/1912.01603
Project Website: https://danijar.com/project/dreamer/
TensorFlow 2 implementation: https://github.com/danijar/dreamer
TensorFlow 1 implementation: https://github.com/google-research/dreamer

Results

Task	Average Return @ 1M	Dreamer Paper @ 1M
Acrobot Swingup	69.54	~300
Cartpole Balance	877.5	~990
Cartpole Balance Sparse	814	~900
Cartpole Swingup	633.6	~800
Cup Catch	885.1	~990
Finger Turn Hard	212.8	~550
Hopper Hop	219	~250
Hopper Stand	511.6	~990
Pendulum Swingup	724.9	~760
Quadruped Run	112.4	~450
Quadruped Walk	52.82	~650
Reacher Easy	962.8	~950
Walker Stand	956.8	~990

Table 1. Dreamer PyTorch vs. Paper Implementation

1 random seed for PyTorch, 5 for the paper
Code @ commit ccea6ae
37H for 1M steps on P100, 20H for 1M steps on V100

Installation

Install Python 3.11
Install Python Poetry

# clone the repo with rlpyt submodule
git clone --recurse-submodules https://github.com/juliusfrost/dreamer-pytorch.git
cd dreamer-pytorch

# Windows
cd setup/windows_cu118

# Linux
cd setup/linux_cu118

# install with poetry
poetry install

# install with pip
pip install -r requirements.txt

Running Experiments

To run experiments on Atari, run python main.py, and add any extra arguments you would like. For example, to run with a single gpu set --cuda-idx 0.

To run experiments on DeepMind Control, run python main_dmc.py. You can also set any extra arguments here.

Experiments will automatically be stored in data/local/yyyymmdd/run_#
You can use tensorboard to keep track of your experiment. Run tensorboard --logdir=data.

If you have trouble reproducing any results, please raise a GitHub issue with your logs and results. Otherwise, if you have success, please share your trained model weights with us and with the broader community!

Testing

To run tests:

pytest tests

If you want additional code coverage information:

pytest tests --cov=dreamer

Code structure

main.py run atari experiment
main_dmc.py run deepmind control experiment
dreamer dreamer code
- agents agent code used in sampling
  - atari_dreamer_agent.py Atari agent
  - dmc_dreamer_agent.py DeepMind Control agent
  - dreamer_agent.py basic sampling agent, exploration, contains shared methods
- algos algorithm specific code
  - dreamer_algo.py optimization algorithm, loss functions, hyperparameters
  - replay.py replay buffer
- envs environment specific code
  - action_repeat.py action repeat wrapper. ported from tf2 dreamer
  - atari.py Atari environments. ported from tf2 dreamer
  - dmc.py DeepMind Control Suite environment. ported from tf2 dreamer
  - env.py base classes for environment
  - modified_atari.py unused atari environment from rlpyt
  - normalize_actions.py normalize actions wrapper. ported from tf2 dreamer
  - one_hot.py one hot action wrapper. ported from tf2 dreamer
  - time_limit.py Time limit wrapper. ported from tf2 dreamer
  - wrapper.py Base environment wrapper class
- experiments currently not used
- models all models used in the agent
  - action.py Action model
  - agent.py Summarizes all models for agent module
  - dense.py Dense fully connected models. Used for Reward Model, Value Model, Discount Model.
  - distribution.py Distributions, TanH Bijector
  - observation.py Observation Model
  - rnns.py Recurrent State Space Model
- utils utility functions
  - logging.py logging videos
  - module.py freezing parameters

Name	Name	Last commit message	Last commit date
Latest commit dependabot[bot] Bump numpy from 1.25.2 to 1.26.0 (#115 ) Oct 10, 2023 f973caf · Oct 10, 2023 History 297 Commits
.github	.github	Bump actions/checkout from 3 to 4 (#111 )	Oct 10, 2023
dreamer	dreamer	format dreamer	May 2, 2023
rlpyt @ f04f23d	rlpyt @ f04f23d	add rlpt submodule	May 2, 2023
setup	setup	Bump pillow from 9.5.0 to 10.0.1 in /setup/linux_cu118 (#121 )	Oct 10, 2023
tests	tests	fix tests	May 2, 2023
.gitignore	.gitignore	remove rlpt from .gitignore	May 2, 2023
.gitmodules	.gitmodules	add rlpt submodule	May 2, 2023
.pre-commit-config.yaml	.pre-commit-config.yaml	add pre-commit	May 2, 2023
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Create CODE_OF_CONDUCT.md	Mar 30, 2020
CONTRIBUTING.md	CONTRIBUTING.md	Create CONTRIBUTING.md	Mar 30, 2020
LICENSE	LICENSE	Create LICENSE	Mar 30, 2020
README.md	README.md	Update README.md	Jun 26, 2023
main.py	main.py	format dreamer	May 2, 2023
main_dmc.py	main_dmc.py	format dreamer	May 2, 2023
poetry.lock	poetry.lock	Bump numpy from 1.25.2 to 1.26.0 (#115 )	Oct 10, 2023
pyproject.toml	pyproject.toml	Bump numpy from 1.25.2 to 1.26.0 (#115 )	Oct 10, 2023
requirements.txt	requirements.txt	add changes for builds to install from requirements.txt	Apr 27, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dreamer PyTorch

Results

Installation

Running Experiments

Testing

Code structure

About

Releases

Packages

Contributors 7

Languages

License

juliusfrost/dreamer-pytorch

Folders and files

Latest commit

History

Repository files navigation

Dreamer PyTorch

Results

Installation

Running Experiments

Testing

Code structure

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 7

Languages

Packages