Some evaluation results are missing #5

Open
muupan opened this issue May 11, 2016 · 2 comments

Comments

@muupan
Owner

muupan commented May 11, 2016

In scores.txt of the currently uploaded trained model, the evaluation results at 55000000 and 56000000 are missing.

54000000 41383.44816946983 448.7 408.0 133.6006071177157

I don't know why this happens or whether it affects performance. I need to check.

@muupan
Owner Author

muupan commented May 15, 2016

I found that the missing evaluations are caused by processes getting stuck in evaluate_performance(). It is possible that some policies fail to start playing Breakout, which prevents episodes from terminating. If so, it might be necessary to use epsilon-greedy-like action selection in addition to sampling from softmax policies in test runs.
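
A minimal sketch of what epsilon-greedy-like selection on top of softmax sampling could look like. This is not the repository's code: `sample_action`, its `logits` argument, and the default `epsilon` value are hypothetical names and values chosen for illustration.

```python
import math
import random


def sample_action(logits, epsilon=0.05, rng=random):
    """Sample an action index from softmax(logits).

    With probability epsilon, ignore the policy and pick a uniformly
    random action, so that a policy which never chooses some action
    still takes it occasionally. (Hypothetical helper, not from the repo.)
    """
    n_actions = len(logits)
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    # Numerically stable softmax weights; random.choices normalizes them.
    max_logit = max(logits)
    weights = [math.exp(x - max_logit) for x in logits]
    return rng.choices(range(n_actions), weights=weights, k=1)[0]
```

With `epsilon=0.0` this reduces to plain softmax sampling, so the extra randomness can be limited to test runs.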

@muupan
Owner Author

muupan commented May 17, 2016

It didn't occur for Space Invaders. For Breakout we might need to force long episodes to finish.
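
One way to force long episodes to finish is a step cap in the evaluation loop. The sketch below assumes a hypothetical environment interface (`reset()` returning a state, `step(action)` returning `(state, reward, terminal)`); `run_capped_episode`, `max_steps`, and `StuckEnv` are illustrative names, not part of this repository.

```python
def run_capped_episode(env, policy, max_steps=10000):
    """Run one evaluation episode, stopping after max_steps at the latest.

    Returns (total_reward, steps_taken, truncated), where truncated is
    True if the cap was hit before the environment reported a terminal
    state. (Hypothetical helper; the env interface is an assumption.)
    """
    state = env.reset()
    total_reward = 0.0
    for step in range(max_steps):
        state, reward, terminal = env.step(policy(state))
        total_reward += reward
        if terminal:
            return total_reward, step + 1, False
    return total_reward, max_steps, True


class StuckEnv:
    """Toy environment that never terminates, like a game that never starts."""

    def reset(self):
        return 0

    def step(self, action):
        return 0, 0.0, False


# The capped loop returns instead of hanging on a stuck episode.
total, steps, truncated = run_capped_episode(StuckEnv(), lambda s: 0, max_steps=50)
```

Returning the `truncated` flag lets the caller decide whether to count capped episodes in the evaluation scores or report them separately.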
