
[RLlib] Remove unneeded args from offline learning examples #26666

Merged

Conversation

@ArturNiederfahrenhorst (Contributor) commented Jul 18, 2022

Why are these changes needed?

We largely got rid of replay buffers in the context of offline RL in RLlib, but the examples were never updated to match. This PR brings the offline learning examples back in line.

#26665
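
As a rough illustration of the kind of cleanup involved (the algorithm choice, paths, and stopping criteria below are made up for illustration and are not taken from this PR's diff): offline algorithms read experiences directly from the input files, so buffer-related keys can simply be dropped from the example configs.

```python
from ray import tune

# A minimal sketch of an offline learning run without any buffer args.
tune.run(
    "CQL",
    stop={"training_iteration": 5},
    config={
        "env": "Pendulum-v1",
        "input": "/tmp/pendulum-out",  # previously recorded experiences
        # Keys like "replay_buffer_config" used to show up in the offline
        # examples; offline algorithms sample directly from the input
        # reader, so no replay buffer needs to be configured here.
    },
)
```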

Checks

  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

@ArturNiederfahrenhorst (Contributor, Author)

I am actually not a fan of renaming learning_starts -> min_size: min_size sounds like the size of the replay buffer itself, not the number of samples that must be in the buffer before learning starts.

This comment is directed at another PR, which moves the learning_starts parameter, but that PR had to be merged into this one because the two interfere. The comment has been resolved in the other PR.
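
To make the naming concern concrete, here is a hypothetical buffer config (the key names and values are shown purely for illustration; the actual rename is discussed in the other PR):

```python
# Hypothetical replay buffer config illustrating the ambiguity:
replay_buffer_config = {
    "capacity": 50_000,        # how many items the buffer can hold at most
    "learning_starts": 1_000,  # samples to collect before training begins
}
# A key called "min_size" could easily be misread as a second
# capacity-style setting rather than as a warm-up threshold.
```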

```
@@ -262,7 +262,7 @@ def training(
        if grad_clip is not None:
            self.grad_clip = grad_clip
        if optimization_config is not None:
            self.optimization_config = optimization_config
```
nice catch man!!
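
For context, the hunk above follows a common guarded-assignment pattern in RLlib's config classes; here is a minimal sketch of that pattern (class and attribute names are assumed, not taken from this PR):

```python
# Minimal sketch of the guarded-assignment pattern (names assumed):
class ExampleConfig:
    grad_clip = None
    optimization_config = None

    def training(self, grad_clip=None, optimization_config=None):
        # Only overwrite a setting if the caller actually passed a value,
        # so omitted arguments keep their previously configured defaults.
        if grad_clip is not None:
            self.grad_clip = grad_clip
        if optimization_config is not None:
            self.optimization_config = optimization_config
        return self  # allow chained calls, e.g. cfg.training(...).rollouts(...)
```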

@kouroshHakha (Contributor) left a comment

Approved, conditioned on all tests passing.

@sven1977 (Contributor)

Hey @gjoliver, could you approve this so it can be merged, now that the questions have been addressed?

@ArturNiederfahrenhorst dismissed gjoliver’s stale review on August 17, 2022, 13:44, for the reason given in his comment above.

@sven1977 merged commit f7b4c5a into ray-project:master on Aug 17, 2022
@ArturNiederfahrenhorst deleted the removebufferconfigs branch on September 21, 2022, 10:22