Bachelor Project

This paper compares model-based dynamic programming with model-free reinforcement learning. We present the theory needed to formulate control problems for finite discrete-time dynamic systems and to obtain an optimal policy, that is, a policy that optimizes the objective function. Stochastic dynamic programming and Q-learning are then applied to a practical example problem to showcase the two approaches and their respective results. The results show that both methods are valid approaches to solving the example.
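To make the contrast concrete, here is a minimal sketch of the two approaches on a toy problem. It is not the example problem from the thesis and not the code in this repository: the two-state MDP `P`, the discount factor `GAMMA`, the infinite-horizon discounted objective, and all function names and hyperparameters are assumptions chosen for illustration. Value iteration uses the transition model directly, while tabular Q-learning only sees sampled transitions.

```python
import random

# Hypothetical toy MDP (an assumption, not the thesis example):
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(0.8, 1, 0.0), (0.2, 0, 0.0)]},
    1: {0: [(0.9, 1, 1.0), (0.1, 0, 1.0)], 1: [(1.0, 0, 0.0)]},
}
GAMMA = 0.9  # assumed discount factor
STATES, ACTIONS = [0, 1], [0, 1]


def q_from_v(V, s, a):
    """Expected one-step return of action a in state s, given values V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def value_iteration(tol=1e-8):
    """Model-based: repeat Bellman backups using the known model P."""
    V = [0.0 for _ in STATES]
    while True:
        V_new = [max(q_from_v(V, s, a) for a in ACTIONS) for s in STATES]
        converged = max(abs(x - y) for x, y in zip(V, V_new)) < tol
        V = V_new
        if converged:
            break
    policy = [max(ACTIONS, key=lambda a: q_from_v(V, s, a)) for s in STATES]
    return V, policy


def sample_step(rng, s, a):
    """Draw (next_state, reward) from P; the learner never reads P itself."""
    u, acc = rng.random(), 0.0
    for p, s2, r in P[s][a]:
        acc += p
        if u < acc:
            return s2, r
    return s2, r  # guard against floating-point round-off


def q_learning(steps=50_000, alpha=0.1, epsilon=0.3, seed=0):
    """Model-free: tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    Q = [[0.0 for _ in ACTIONS] for _ in STATES]
    s = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            a = rng.choice(ACTIONS)  # explore
        else:
            a = max(ACTIONS, key=lambda b: Q[s][b])  # exploit
        s2, r = sample_step(rng, s, a)
        # Temporal-difference update toward the sampled Bellman target.
        Q[s][a] += alpha * (r + GAMMA * max(Q[s2]) - Q[s][a])
        s = s2
    policy = [max(ACTIONS, key=lambda a: Q[s][a]) for s in STATES]
    return Q, policy
```

Calling `value_iteration()` and `q_learning()` returns a value table and a greedy policy each, so the two policies can be compared directly. With a constant step size `alpha`, the Q-values keep fluctuating around their limits, which is why only the greedy policies, not the exact values, should be expected to agree.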

Content

Code
Thesis
Presentation Slides
Python code for dynamic programming (dp), Q-learning (ql), and statistics

Folders and files

bac-thesis
code
presentation
DP_vs_RL_presentation_slides.pdf
Dynamic_Programming_vs_Reinforcement_Learning.pdf
README.md

NoahRuhmer/bachelor-thesis
