You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm wondering if we should wait for the issue #227 to be solved before trying to evaluate the improvement of that new reward or even discuss the value of the different importance weights used in the reward.
I don't think so because it is really hard to make the agent learn in problems with objective functions with CPReward. We can stick with CPReward for problems like nqueens for example, but I think we will need this reward to evaluate the improvement of learning with the new heterogeneous pipeline.
Test it and make it work on:
The text was updated successfully, but these errors were encountered: