[RLlib] Introduce IMPALA off_policyness test with GPU #31485

ArturNiederfahrenhorst · 2023-01-05T23:46:49Z

Signed-off-by: Artur Niederfahrenhorst artur@anyscale.com

Why are these changes needed?

IMPALA runs with a GPU learner "by original design". A compilation test without GPU is fine but something like off_policyness depends on learning/sampling speed and we should test it for our standard impala settings (gpu=1, num_rollout_workers=2) - with a GPU and without competing over resources. Only then will we be able to get meaningful measurements of it's off-policyness.
This PR also tries to deflake IMPALA tests.

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

gjoliver

this is great man!! separate testing of difference things into different tests!!
just 1 quick question.

gjoliver · 2023-01-09T19:48:37Z

rllib/execution/buffers/mixin_replay_buffer.py

@@ -71,7 +71,7 @@ def __init__(
        """
        self.capacity = capacity
        self.replay_ratio = replay_ratio
-        self.replay_proportion = None
+        self.replay_proportion = 1


why do you flip this flag here? it looks important.

Uh good point. That's by accident. I wondered why it was None.
I'll change it back and ping you after tests are green again.

ArturNiederfahrenhorst

suggested changes from jun

rllib/execution/buffers/mixin_replay_buffer.py

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

gjoliver

awesome!

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

initial

8f64d16

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

ArturNiederfahrenhorst assigned sven1977 Jan 5, 2023

ArturNiederfahrenhorst requested review from sven1977, gjoliver, avnishn, smorad, maxpumperla, kouroshHakha and krfricke as code owners January 5, 2023 23:46

ArturNiederfahrenhorst added 2 commits January 6, 2023 01:29

move op test to own file and test

19f5c25

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

exlcusive tests

687ce19

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

gjoliver reviewed Jan 9, 2023

View reviewed changes

ArturNiederfahrenhorst commented Jan 9, 2023

View reviewed changes

rllib/execution/buffers/mixin_replay_buffer.py Outdated Show resolved Hide resolved

Update rllib/execution/buffers/mixin_replay_buffer.py

1cf6355

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

gjoliver approved these changes Jan 10, 2023

View reviewed changes

gjoliver merged commit 4a070fc into ray-project:master Jan 10, 2023

AmeerHajAli pushed a commit that referenced this pull request Jan 12, 2023

[RLlib] Introduce IMPALA off_policyness test with GPU (#31485)

467b1d1

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Introduce IMPALA off_policyness test with GPU #31485

[RLlib] Introduce IMPALA off_policyness test with GPU #31485

ArturNiederfahrenhorst commented Jan 5, 2023 •

edited

Loading

gjoliver left a comment

gjoliver Jan 9, 2023

ArturNiederfahrenhorst Jan 9, 2023

ArturNiederfahrenhorst left a comment

gjoliver left a comment

[RLlib] Introduce IMPALA off_policyness test with GPU #31485

[RLlib] Introduce IMPALA off_policyness test with GPU #31485

Conversation

ArturNiederfahrenhorst commented Jan 5, 2023 • edited Loading

Why are these changes needed?

gjoliver left a comment

Choose a reason for hiding this comment

gjoliver Jan 9, 2023

Choose a reason for hiding this comment

ArturNiederfahrenhorst Jan 9, 2023

Choose a reason for hiding this comment

ArturNiederfahrenhorst left a comment

Choose a reason for hiding this comment

gjoliver left a comment

Choose a reason for hiding this comment

ArturNiederfahrenhorst commented Jan 5, 2023 •

edited

Loading