Authors: Nikhil Podila, Savinay Nagendra
This project was carried out by the authors in the final semester of their undergraduate degree (Bachelor of Engineering) under the guidance of Professor Koshy George at the Center of Intelligent Systems, PES Institute of Technology, Bangalore, India.
- The code requires MATLAB R2017a or later.
- Tabular Reinforcement Learning solutions:
  - CartPoleLearningSystem.m: Temporal Difference Learning (SARSA) algorithm, as explained in Sutton's dissertation, applied to the Inverted Pendulum problem
  - QLearningCartPole.m: Standard Q-Learning algorithm
  - QLearningCartPoleLeastTrials.m: Q-Learning algorithm with optimized hyperparameters
  - QLearningCartPoleThetaCheck.m: Q-Learning algorithm with a focus on the pendulum angle
  - SarsaLearningCartPole.m: Standard SARSA algorithm
  - SarsaLearningCartPoleLeastTrials.m: SARSA algorithm with optimized hyperparameters
  - SarsaLearningCartPoleThetaCheck.m: SARSA algorithm with a focus on the pendulum angle
- Reinforcement Learning with Function Approximation:
  - linfun1.m: SARSA algorithm with Linear Function Approximation
- Solutions integrated with Swing-up:
  - SarsaLearningCartPoleSwingUp.m: Tabular SARSA algorithm, with pendulum swing-up using the Energy Method
  - linfun1SwingUp.m: SARSA algorithm with Linear Function Approximation, with pendulum swing-up using the Energy Method
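
The scripts listed here are built around two tabular update rules, SARSA and Q-Learning. The snippet below is a minimal sketch of how the two differ on a single transition; it is not taken from the repository files. The table size, the values of `alpha` and `gamma`, and the transition `(s, a, r, sNext, aNext)` are all hypothetical and chosen only for illustration.

```matlab
% Toy tabular problem: 3 states, 2 actions (hypothetical sizes and values).
Q = zeros(3, 2);
Q(2, :) = [1.0, 2.0];   % pretend state 2 already has some learned values

alpha = 0.5;   % learning rate (assumed value)
gamma = 0.9;   % discount factor (assumed value)

% One observed transition: state s, action a, reward r, next state sNext.
s = 1; a = 1; r = 1; sNext = 2;
aNext = 1;     % action the behaviour policy actually takes in sNext

% SARSA (on-policy): bootstrap from the action actually taken next.
Qsarsa = Q;
tdSarsa = r + gamma * Q(sNext, aNext) - Q(s, a);
Qsarsa(s, a) = Q(s, a) + alpha * tdSarsa;

% Q-Learning (off-policy): bootstrap from the greedy action in sNext.
Qql = Q;
tdQ = r + gamma * max(Q(sNext, :)) - Q(s, a);
Qql(s, a) = Q(s, a) + alpha * tdQ;

fprintf('SARSA: %.2f, Q-Learning: %.2f\n', Qsarsa(s, a), Qql(s, a));
```

With these assumed numbers the SARSA target uses `Q(2, 1) = 1.0` while the Q-Learning target uses `max(Q(2, :)) = 2.0`, so the same transition gives updated values of 0.95 and 1.40 respectively. The two rules coincide whenever the next action happens to be the greedy one.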