A simple from-scratch implementation of A3C for the Atari game Riverraid-v0.
Joint work with @xmty777.
- Uses only torch data structures and a few commonly used torch functions.
- Implements its own Conv2d, Linear and LSTM layers, including the forward and backward passes.
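
To illustrate what a layer with a hand-written forward and backward pass can look like, here is a minimal sketch of a fully connected layer built on plain torch tensors. It is not the code from this repository: the class name `ManualLinear`, its attribute names and the weight initialization are assumptions made for the example.

```python
import torch


class ManualLinear:
    """Hypothetical fully connected layer y = x @ W.T + b with a manual backward pass."""

    def __init__(self, in_features, out_features):
        # Small random weights; the 0.01 scale is an illustrative choice.
        self.weight = torch.randn(out_features, in_features) * 0.01
        self.bias = torch.zeros(out_features)
        self.grad_weight = torch.zeros_like(self.weight)
        self.grad_bias = torch.zeros_like(self.bias)
        self._input = None

    def forward(self, x):
        # x has shape (batch, in_features); cache it for the backward pass.
        self._input = x
        return x @ self.weight.t() + self.bias

    def backward(self, grad_output):
        # grad_output has shape (batch, out_features): dLoss/dOutput.
        self.grad_weight = grad_output.t() @ self._input
        self.grad_bias = grad_output.sum(dim=0)
        # Gradient with respect to the input, handed to the previous layer.
        return grad_output @ self.weight
```

Calling `forward` on a batch and then `backward` with the gradient of the loss fills `grad_weight` and `grad_bias`, which an optimizer step could then consume. No autograd is involved.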
virtualenv --python=python3 venv
source venv/bin/activate
# install dependencies
pip install -r requirement.txt
# train
python A3C/my_main.py
The detailed hyperparameters are configured by the Python class Config in my_main.py.
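
As a rough idea of how a configuration class of this kind can be organized, the sketch below groups a few typical A3C settings in one place. It is not the actual `Config` from my_main.py: the class name `ExampleConfig`, every field name and every value except the environment name and the two output directories are assumptions.

```python
from dataclasses import dataclass


@dataclass
class ExampleConfig:
    """Hypothetical hyperparameter container; fields and values are illustrative only."""

    env_name: str = "Riverraid-v0"   # environment named in this README
    num_workers: int = 4             # assumed number of parallel actor-learners
    gamma: float = 0.99              # assumed discount factor
    learning_rate: float = 1e-4      # assumed optimizer step size
    model_dir: str = "./model"       # output directory mentioned in this README
    log_dir: str = "./log"           # output directory mentioned in this README


# Override a single value without touching the defaults.
cfg = ExampleConfig(gamma=0.95)
```

Keeping all settings in one class means a training run can be reproduced from a single object, and individual values can be overridden per experiment.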
Trained models and logs are stored in ./model and ./log, respectively.
Not implemented. You can write your own test file by referring to PB17111656.py.
Welcome to Deep Reinforcement Learning Part 1 : DQN
Policy Gradient Methods for Reinforcement Learning with Function Approximation