You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I found that missing evaluation is caused by processes stuck in evaluate_performance(). It is possible that some policies fail start to play Breakout, preventing episodes from being terminated. If so, it might be necessary to use epsilon-greedy-like action selection in addition to sampling from softmax policies in test runs.
In
scores.txt
of the current uploaded trained model, evaluation results at55000000
and56000000
are missing.async-rl/trained_model/breakout/scores.txt
Line 55 in 0ec501c
I don't know why and whether it can affect performance. I need to check.
The text was updated successfully, but these errors were encountered: