Make general components #370

pilgrimygy · 2021-07-14T16:37:30Z

Move `GaussianNetwork` and `DuelingNetwork` into RLCore as general components, and add a general evaluate function.
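For readers unfamiliar with the dueling architecture, here is a minimal sketch of the idea, not the code from this PR. The name `DuelingSketch` and its fields are hypothetical, and plain functions stand in for neural network layers. It assumes the usual aggregation Q = V + A - mean(A), with actions along the first dimension and the batch along the second.

```julia
using Statistics: mean

# Hypothetical stand-in for a dueling head; not the struct defined in this PR.
struct DuelingSketch{B,V,A}
    base::B  # shared feature extractor
    val::V   # maps features to a 1×batch state value
    adv::A   # maps features to an n_actions×batch advantage
end

function (m::DuelingSketch)(state)
    h = m.base(state)
    v = m.val(h)
    a = m.adv(h)
    # Q(s, a) = V(s) + A(s, a) - mean over actions of A(s, a)
    return v .+ a .- mean(a, dims=1)
end

# Toy usage: plain functions replace the layers (an assumption for illustration).
m = DuelingSketch(identity, h -> sum(h, dims=1), h -> 2 .* h)
q = m([1.0 2.0; 3.0 4.0])
```

Keeping such a wrapper independent of any particular learner is presumably what makes it reusable as a general component.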

findmyway

Remember to update NEWS.md.
The CI failure seems to be unrelated to this PR. I'll look into it.

...ningCore/src/policies/q_based_policies/learners/approximators/neural_network_approximator.jl

src/ReinforcementLearningZoo/Manifest.toml

findmyway · 2021-07-16T15:32:25Z

It's quite strange that the tests now run much slower...

pilgrimygy · 2021-07-17T03:27:16Z

Thanks @findmyway for finding so many bugs!

findmyway · 2021-07-17T04:19:58Z

@pilgrimygy could you confirm that the JuliaRL_BasicDQN_CartPole experiment in this PR can be finished in less than a minute?

~~It seems everything works as usual on my local machine. Not sure why it is so slow in CI.~~

Now I get it. Will fix it soon.

pilgrimygy · 2021-07-17T04:33:28Z

Well, why is `JuliaRL_BasicDQN_CartPole` so slow in CI?

findmyway · 2021-07-17T05:33:59Z

I must say, this is the weirdest thing I've seen in Julia so far. And I really don't know why...

(@v1.6) pkg> activate src/ReinforcementLearningExperiments/

(ReinforcementLearningExperiments) pkg> build

(ReinforcementLearningExperiments) pkg> test

┌ Warning: `InplaceableThunk(t::Thunk, add!)` is deprecated, use `InplaceableThunk(add!, t)` instead.
│   caller = ip:0x0
└ @ Core :-1

Then it gets stuck, just like what we see in CI.

However, if we run it in the REPL:

(@v1.6) pkg> activate src/ReinforcementLearningExperiments/

julia> using ReinforcementLearningExperiments

julia> ex = E`JuliaRL_BasicDQN_CartPole`;

julia> run(ex)

Things work as usual.

Make general components

bb45370

findmyway requested changes Jul 15, 2021

findmyway mentioned this pull request Jul 15, 2021

It seems a breaking change was introduced after v0.6.14 FluxML/Zygote.jl#1029

Closed

Update

40f140b

pilgrimygy requested a review from findmyway July 16, 2021 02:25

findmyway requested changes Jul 16, 2021

findmyway added 2 commits July 16, 2021 11:57

pass CI temporarily

17377c3

resolve comments

974f0da

findmyway linked an issue Jul 16, 2021 that may be closed by this pull request

Can evaluate function be used as a component of RLcore? #369

Closed

remove OpenSpiel from RLExps to keep this package usable not only on Linux

fc60978

update dependencies to see if it still stuck in Github Action

6dd9d00

remove Project.toml in tests folder

f0a1831

findmyway added 2 commits July 17, 2021 13:54

hotfix to downgrade ChainRulesCore

20300c5

hotfix

3b3be75

findmyway mentioned this pull request Jul 17, 2021

Change argument order on InplaceableThunk and fix deprecated tests JuliaDiff/ChainRulesCore.jl#396

Merged

findmyway merged commit 32aa394 into JuliaReinforcementLearning:master Jul 17, 2021

pilgrimygy deleted the component branch July 17, 2021 07:31
