Agent (Player) unwrapped from the _FabricModule
does not consider the chosen precision during env interaction
#236
Labels
bug
Something isn't working
enhancement
New feature or request
help wanted
Extra attention is needed
Every time, during the environment interaction, the we call
agent.module
to unwrap the agent from the distributed strategy, we also unwrap the agent from the precision plugin, this means that if we are training an agent withfloat16
orbfloat16
then the environment interaction happens infloat32
.I suggest to wrap every
player
agent with a_FabricModule
, i.e._FabricModule(agent, precision=fabric.precision)
so to unwrap the agent from the strategy but maintaining the precision plugin.cc @michele-milesi
The text was updated successfully, but these errors were encountered: