-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Eval reward is much lower than rollout reward. #2
Comments
Sorry for the delay in replying. |
Thanks for your reply.
Now the |
When I run the command below,
I found that the
eval/mean_reward
is much lower thanrollout/ep_rew_mean
. For example,Considering the action noise,
eval/mean_reward
should have been a little higher thanrollout/ep_rew_mean
. However, the case seems to be the opposite. I found that some environment wrappers could cause similar issues (DLR-RM/stable-baselines3#181). So does it have something to do with theConstraintEnvWrapper
? Or any other explanations to this observation?The text was updated successfully, but these errors were encountered: