An implementation of the reinforcement learning for CartPole-v0 by policy optimization

grassking100/reinforcement_learning

[Figure: record]

The step plot of the result

[Figure: step]

Histogram of the results of 100 simulations (mean value: 199)

[Figure: hist]
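For context, CartPole-v0 ends an episode after at most 200 steps, so a mean of 199 over 100 simulations is close to the maximum. The evaluation loop behind such a histogram could look roughly like the sketch below. It is not taken from this repository: `run_episode`, `evaluate`, and `ToyEnv` are hypothetical names, and the environment is assumed to follow the classic Gym interface where `reset()` returns an observation and `step(action)` returns `(obs, reward, done, info)`. `ToyEnv` is only a stand-in so the snippet runs without Gym installed.

```python
from statistics import mean


def run_episode(env, policy, max_steps=200):
    """Run one episode and return how many steps it lasted."""
    obs = env.reset()
    steps = 0
    done = False
    while not done and steps < max_steps:
        action = policy(obs)
        obs, reward, done, info = env.step(action)
        steps += 1
    return steps


def evaluate(env, policy, n_episodes=100):
    """Run n_episodes episodes; return the episode lengths and their mean."""
    lengths = [run_episode(env, policy) for _ in range(n_episodes)]
    return lengths, mean(lengths)


class ToyEnv:
    """Hypothetical stand-in with a classic Gym-style reset/step interface.

    The episode ends when the agent picks action 0 or when the step
    counter reaches `horizon` (200, mirroring the CartPole-v0 step cap).
    """

    def __init__(self, horizon=200):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return float(self.t)

    def step(self, action):
        self.t += 1
        done = action != 1 or self.t >= self.horizon
        return float(self.t), 1.0, done, {}


# A policy that always picks action 1 survives every episode to the cap.
lengths, avg = evaluate(ToyEnv(), lambda obs: 1, n_episodes=100)
print(avg)
```

With a real environment, `ToyEnv()` would be replaced by the Gym environment object and the lambda by the trained policy; `lengths` is the list one would pass to a histogram plot. Newer Gym and Gymnasium releases return different tuples from `reset()` and `step()`, so the loop would need adjusting there.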

References

  1. CartPole-v0: https://gym.openai.com/envs/CartPole-v0/
  2. Ilyas, Andrew, et al. "A closer look at deep policy gradients." arXiv preprint arXiv:1811.02553 (2018).
