some distribution creation doesn' t do anything #166

HiddeLekanne · 2023-12-02T22:13:45Z

Lines like:

predicted_values = Independent(
        Normal(critic(imagined_trajectories), 1, validate_args=validate_args),
        1,
        validate_args=validate_args,
    ).mean

and

    predicted_rewards = Independent(
        Normal(world_model.reward_model(imagined_trajectories), 1, validate_args=validate_args),
        1,
        validate_args=validate_args,
    ).mean

Don't do anything.

It's because you're not sampling from the distribution, your simply taking the mean, which is what you started with anyways. You can confirm it by running a training session with and without the whole distribution creation and see that the model learns exactly the same thing.

Lines are from DreamerV1

The text was updated successfully, but these errors were encountered:

HiddeLekanne · 2023-12-02T22:18:06Z

same should be for the .mode versions in DreamerV2, because for a normal distribution the mode equals the mean.

belerico · 2023-12-04T13:55:06Z

Hi @HiddeLekanne, yeah you're right: this should give us the same trained agent. I will try it asap

michele-milesi linked a pull request Dec 12, 2023 that will close this issue

feat: added optimizations #168

Merged

4 tasks

belerico closed this as completed in #168 Dec 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

some distribution creation doesn' t do anything #166

some distribution creation doesn' t do anything #166

HiddeLekanne commented Dec 2, 2023

HiddeLekanne commented Dec 2, 2023

belerico commented Dec 4, 2023

some distribution creation doesn' t do anything #166

some distribution creation doesn' t do anything #166

Comments

HiddeLekanne commented Dec 2, 2023

HiddeLekanne commented Dec 2, 2023

belerico commented Dec 4, 2023