Skip to content

Activity

added print_every for atari env score reporting

philtaborpushed 1 commit to master • 39718ec…f0be0fc • 
on Jan 6

changed avg reward calculation

philtaborpushed 1 commit to master • 5039730…39718ec • 
on Jan 6

added monitor wrapper for atari envs

philtaborpushed 2 commits to master • 47b706c…5039730 • 
on Jan 6

add action space seeding, as per sb3

philtaborpushed 1 commit to master • dbfda3c…47b706c • 
on Jan 3

add params to correctly calculate conv dims

philtaborpushed 1 commit to master • c644003…dbfda3c • 
on Jan 3

code formatting

philtaborpushed 1 commit to master • 4114407…c644003 • 
on Jan 3

add kwargs to stackframes reset call

philtaborpushed 1 commit to master • fc0efe2…4114407 • 
on Jan 3

atari dependencies no longer required

philtaborpushed 1 commit to master • e287e02…fc0efe2 • 
on Jan 3

fix dtype issue in converting arrays to tensors

philtaborpushed 1 commit to master • 160b5d4…e287e02 • 
on Jan 3

compatibility with new learner interface

philtaborpushed 1 commit to master • d3fc348…160b5d4 • 
on Jan 1

remove dependency on miniworld

philtaborpushed 1 commit to master • 4a60639…d3fc348 • 
on Jan 1

improve examples

philtaborpushed 1 commit to master • 70fee23…4a60639 • 
on Dec 31, 2024

performance improvements

philtaborpushed 1 commit to master • 33360d3…70fee23 • 
on Dec 31, 2024

performance improvements

philtaborpushed 1 commit to master • da305dc…33360d3 • 
on Dec 31, 2024

add value estimation function

philtaborpushed 1 commit to master • e9cb268…da305dc • 
on Dec 31, 2024

better score handling

philtaborpushed 1 commit to master • eed4282…e9cb268 • 
on Dec 31, 2024

change actor/network architecture

philtaborpushed 1 commit to master • 2931eb9…eed4282 • 
on Dec 31, 2024

remove adv norm and add swap_and_flatten func

philtaborpushed 1 commit to master • 3e54162…2931eb9 • 
on Dec 31, 2024

change wrappers

philtaborpushed 1 commit to master • 5cf325b…3e54162 • 
on Dec 31, 2024

change atari wrappers

philtaborpushed 1 commit to master • 9c45830…5cf325b • 
on Dec 31, 2024

return env.reset()

philtaborpushed 1 commit to master • 7b9dfe7…9c45830 • 
on Dec 31, 2024

change batch sampling for ppo

philtaborpushed 1 commit to master • 71cd90e…7b9dfe7 • 
on Dec 31, 2024

add pixel environments

philtaborpushed 1 commit to master • 2f49a20…71cd90e • 
on Dec 4, 2024

code cleanup

philtaborpushed 1 commit to master • 46f2f83…2f49a20 • 
on Dec 4, 2024

cnn factory added

philtaborpushed 1 commit to master • 23b90da…46f2f83 • 
on Dec 4, 2024

ppo for atari

philtaborpushed 1 commit to master • 3552d7e…23b90da • 
on Dec 4, 2024

atari for ppo

philtaborpushed 1 commit to master • 0bd7a78…3552d7e • 
on Nov 28, 2024

pytorch observation wrapper

philtaborpushed 1 commit to master • 0024d6e…0bd7a78 • 
on Nov 27, 2024

added new wrappers from sb3

philtaborpushed 1 commit to master • 9d3eb54…0024d6e • 
on Nov 27, 2024

better conv output dim calculation

philtaborpushed 1 commit to master • 943757b…9d3eb54 • 
on Nov 27, 2024