Agent (Player) unwrapped from the `_FabricModule` does not consider the chosen precision during env interaction #236

belerico · 2024-03-16T17:01:25Z

Every time we call `agent.module` during environment interaction to unwrap the agent from the distributed strategy, we also unwrap it from the precision plugin. This means that if we are training an agent with float16 or bfloat16, the environment interaction still happens in float32.

I suggest wrapping every player agent in a `_FabricModule`, i.e. `_FabricModule(agent, precision=fabric.precision)`, so that the agent is unwrapped from the strategy while keeping the precision plugin.
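To make the effect concrete, here is a minimal sketch in plain PyTorch. It does not use Lightning's real `_FabricModule`; `PrecisionWrapper` is a hypothetical stand-in that I am assuming behaves roughly like a precision-aware wrapper, i.e. it runs the forward pass under `torch.autocast`. The toy `nn.Linear` "agent" and the CPU/bfloat16 choice are also just assumptions for illustration.

```python
import torch
from torch import nn


class PrecisionWrapper(nn.Module):
    """Hypothetical stand-in for a precision-aware wrapper (NOT Lightning's _FabricModule)."""

    def __init__(self, module: nn.Module, dtype: torch.dtype, device_type: str = "cpu") -> None:
        super().__init__()
        self.module = module
        self.dtype = dtype
        self.device_type = device_type

    def forward(self, *args, **kwargs):
        # Run the wrapped module under autocast so eligible ops use the chosen dtype.
        with torch.autocast(device_type=self.device_type, dtype=self.dtype):
            return self.module(*args, **kwargs)


agent = nn.Linear(4, 2)  # toy "agent", assumed for illustration
wrapped = PrecisionWrapper(agent, dtype=torch.bfloat16)
obs = torch.randn(1, 4)

with torch.no_grad():
    out_wrapped = wrapped(obs)           # forward pass goes through the precision context
    out_unwrapped = wrapped.module(obs)  # unwrapping bypasses it and falls back to float32

print(out_wrapped.dtype, out_unwrapped.dtype)
```

Calling `.module` skips the wrapper's `forward`, so the autocast context is never entered. Keeping a precision-only wrapper around the player, as proposed above, would avoid that while still dropping the distributed strategy.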

cc @michele-milesi

belerico added the bug (Something isn't working) label Mar 16, 2024

belerico added the enhancement (New feature or request) and help wanted (Extra attention is needed) labels Mar 23, 2024

This was referenced Mar 25, 2024

fabric precision: 32 bit using far less memory than bf16 #240

Closed

Fix/player precision plugin #244

Merged

belerico closed this as completed in #244 Mar 27, 2024
