Some evaluation results are missing #5

muupan · 2016-05-11T02:05:08Z

In scores.txt of the currently uploaded trained model, the evaluation results at 55000000 and 56000000 are missing.

async-rl/trained_model/breakout/scores.txt

Line 55 in 0ec501c

54000000 41383.44816946983 448.7 408.0 133.6006071177157

I don't know why this happens or whether it affects performance. I need to check.


muupan · 2016-05-15T03:45:18Z

I found that the missing evaluations are caused by processes getting stuck in evaluate_performance(). It is possible that some policies fail to start playing Breakout, which prevents episodes from terminating. If so, it might be necessary to use epsilon-greedy-like action selection in addition to sampling from softmax policies in test runs.
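As a rough illustration of the epsilon-greedy-like idea, here is a minimal sketch of mixing a small amount of uniform random action selection into softmax sampling. It is not the code in this repository: `sample_action`, `probs`, and `epsilon` are hypothetical names, and it assumes the policy output is already a list of action probabilities.

```python
import random


def sample_action(probs, epsilon, rng):
    """Pick an action index from softmax probabilities.

    With probability `epsilon` the action is drawn uniformly at random,
    so an action the policy almost never picks (such as the one that
    launches the ball) still has a chance of being taken eventually.
    All names here are hypothetical, for illustration only.
    """
    n_actions = len(probs)
    if rng.random() < epsilon:
        # Exploration branch: ignore the policy and act uniformly.
        return rng.randrange(n_actions)
    # Default branch: sample from the policy's softmax distribution.
    return rng.choices(range(n_actions), weights=probs)[0]


if __name__ == "__main__":
    rng = random.Random(0)
    # A policy that never selects action 0 on its own.
    probs = [0.0, 0.5, 0.5]
    actions = [sample_action(probs, 0.05, rng) for _ in range(1000)]
    print("times action 0 was taken:", actions.count(0))
```

With `epsilon` set to 0 this reduces to plain softmax sampling, so the choice of `epsilon` controls how far test runs deviate from the learned policy.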

muupan · 2016-05-17T07:59:15Z

It didn't occur for Space Invaders. For Breakout we might need to force long episodes to finish.
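One way to force long episodes to finish is a per-episode step cap in the evaluation loop. The sketch below only illustrates that idea under assumed names: `StuckEnv`, `run_episode`, and `max_steps` are hypothetical and do not correspond to the actual environment interface or to evaluate_performance() in this repository.

```python
class StuckEnv:
    """Toy stand-in for a game that never ends, e.g. when the ball is never launched."""

    def reset(self):
        self.t = 0

    def step(self, action):
        self.t += 1
        # Returns (reward, is_terminal); this toy episode never terminates.
        return 0.0, False


def run_episode(env, policy, max_steps):
    """Run one evaluation episode, stopping after at most `max_steps` steps.

    Returns (total_reward, steps_taken, truncated), where `truncated`
    is True when the cap was hit before the episode ended on its own.
    """
    env.reset()
    total_reward = 0.0
    for step in range(max_steps):
        reward, done = env.step(policy())
        total_reward += reward
        if done:
            return total_reward, step + 1, False
    return total_reward, max_steps, True


if __name__ == "__main__":
    total, steps, truncated = run_episode(StuckEnv(), lambda: 0, max_steps=100)
    print(total, steps, truncated)
```

Returning the `truncated` flag lets the caller decide whether to count capped episodes in the reported scores or to log them separately.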

muupan mentioned this issue May 11, 2016

a3c_ale.py won't completely quit after finishing #6

Open
