non-markovian-rl

Here are 3 public repositories matching this topic...

arnavkj1995 / Eta_Psi_Learning

Codebase for the ηψ-Learning algorithm, which learns a non-Markovian maximum state entropy exploration policy by combining predecessor and successor representations to estimate the state visitation distribution of a finite-length trajectory.

machine-learning reinforcement-learning deep-reinforcement-learning exploration non-markovian-rl

Updated Oct 23, 2023
Python

emuskardin / gridworld-gym

Scalable partially observable and/or non-Markovian gridworld for planning or reinforcement learning

reinforcement-learning gridworld pomdp partially-observable-environment non-markovian-rl scalable-gridworld

Updated May 18, 2022
Python

corazza / non-markovian-rl

Solving the problem of non-Markovian reward functions by giving agents access to a finite amount of memory

reinforcement-learning non-markovian-rl

Updated Aug 9, 2023
Rust
