path-finding

This is a repository for a class project in reinforcement learning.

Data can be found here:

Creates a folder for results (pickled dictionary) and figs based on a trial number. The trial number is also taken as the random seed.

Episodic semigradient SARSA (Sutton and Barto, pg. 244) - with linear approximation function
Continuous semigradient SARSA (Sutton and Barto, pg. 251) - with linear approximation function

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
sarsa_cont_results_10		sarsa_cont_results_10
sarsa_cont_results_6		sarsa_cont_results_6
.gitignore		.gitignore
Episodic_figsfrompkl.py		Episodic_figsfrompkl.py
LICENSE		LICENSE
README.md		README.md
RL_testbed_final.py		RL_testbed_final.py
converting_sparselists_to_heuristics.py		converting_sparselists_to_heuristics.py
converting_sparselists_to_heuristics_2.py		converting_sparselists_to_heuristics_2.py