This is a repository for a class project in reinforcement learning.
Contributors:
Zoe Kanavas (zkanavas@ucdavis.edu)
Erin Musabandesu (enmusabandesu@ucdavis.edu)
Liam Lynch (wdlynch@ucdavis.edu)
UC Davis Google Drive Data Access:
- Sample_A (data folder)
- heuristic_info_all_samples.csv
Each run creates a folder, named by trial number, that holds the results (a pickled dictionary) and the figures. The trial number is also used as the random seed.
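As a rough illustration of the trial-number convention, here is a minimal sketch. The folder name `trial_<n>`, the `figs` subfolder, the file name `results.pkl`, and the function names are all hypothetical and not taken from this repository; only the idea (one folder per trial, trial number reused as the seed, results stored as a pickled dictionary) comes from the description above.

```python
import pickle
import random
from pathlib import Path


def setup_trial(trial, base_dir="."):
    """Create per-trial output folders and seed the RNG with the trial number."""
    trial_dir = Path(base_dir) / f"trial_{trial}"  # hypothetical naming scheme
    figs_dir = trial_dir / "figs"                  # hypothetical subfolder for figures
    figs_dir.mkdir(parents=True, exist_ok=True)    # also creates trial_dir
    random.seed(trial)                             # trial number doubles as the seed
    return trial_dir, figs_dir


def save_results(results, trial_dir):
    """Pickle the results dictionary into the trial folder."""
    path = Path(trial_dir) / "results.pkl"         # hypothetical file name
    with open(path, "wb") as f:
        pickle.dump(results, f)
    return path


def load_results(path):
    """Read a pickled results dictionary back from disk."""
    with open(path, "rb") as f:
        return pickle.load(f)
```

Reusing the trial number as the seed means that rerunning a trial should reproduce the same random draws, provided every source of randomness in the run is seeded the same way (for example NumPy's generator, if it is used).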
- Episodic semi-gradient SARSA (Sutton and Barto, p. 244) with linear function approximation
- Differential semi-gradient SARSA for continuing tasks (Sutton and Barto, p. 251) with linear function approximation
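The weight updates behind the two algorithms listed above can be sketched as follows. This is not the code in this repository: the function names, the plain-list feature vectors, and the step-size names `alpha` and `beta` are assumptions chosen for illustration. The action value is assumed linear, q(s, a) = w · x(s, a), and the second update follows the average-reward (differential) form that Sutton and Barto give for continuing tasks.

```python
import random


def q_hat(w, x):
    """Linear action-value estimate: dot product of weights and features."""
    return sum(wi * xi for wi, xi in zip(w, x))


def epsilon_greedy(w, features, state, actions, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q_hat(w, features(state, a)))


def episodic_sarsa_update(w, x, reward, x_next, alpha, gamma):
    """One episodic semi-gradient SARSA step.

    x is the feature vector of (S, A); x_next is the feature vector of
    (S', A'), or None when S' is terminal.
    """
    target = reward if x_next is None else reward + gamma * q_hat(w, x_next)
    delta = target - q_hat(w, x)
    # For a linear q_hat, the gradient with respect to w is just x.
    return [wi + alpha * delta * xi for wi, xi in zip(w, x)]


def differential_sarsa_update(w, avg_reward, x, reward, x_next, alpha, beta):
    """One differential semi-gradient SARSA step (continuing tasks).

    There is no discounting; the running average reward is subtracted instead.
    """
    delta = reward - avg_reward + q_hat(w, x_next) - q_hat(w, x)
    avg_reward = avg_reward + beta * delta
    w = [wi + alpha * delta * xi for wi, xi in zip(w, x)]
    return w, avg_reward
```

In a training loop, `epsilon_greedy` would choose A' in S' before each update, and the chosen pair (S', A') then becomes (S, A) for the next step.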