Supervised learning & AlphaZero.jl #2

Open · StepHaze opened this issue Jun 8, 2022 · 0 comments

StepHaze commented Jun 8, 2022

Sorry to disturb you again.
I'm building a project with AlphaZero.jl. I created the files for a new board game and started training, but learning is VERY slow and I'm afraid it will take an eternity.
So I decided to train on games played by strong players instead (i.e. supervised learning).

Jonathan Laurent wrote:
"I guess what you've have to do is generate many samples of the kind that are stored in AlphaZero's memory buffer. You can take these samples either from human play data or have other players play against each other to generate data. If you do so, be careful to add some exploration so that the same game is not played again and again and that you get some diversity in your data. Once you've got the data, you can either use the Trainer utility in learning.jl or just write your training procedure yourself in Flux."

I still don't understand in what format the games and moves are stored in the memory buffer.
Please help me.
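
For reference, here is a minimal, self-contained sketch of the kind of supervised training the quoted advice describes: (state, target policy, game outcome) samples used to fit a two-headed network in Flux. This is not AlphaZero.jl's actual API; the `Sample` struct, `STATE_DIM`, `NUM_ACTIONS`, and the helper functions are hypothetical placeholders, it assumes a recent Flux version (explicit-style optimisers via `Flux.setup`), and the exact layout of the samples stored in AlphaZero.jl's memory buffer should be checked against the package source (e.g. its memory and learning code).

```julia
# A minimal sketch (NOT AlphaZero.jl's actual API) of supervised training on
# expert-game samples with Flux. All names below are hypothetical placeholders.

using Flux
using Random: shuffle

# Hypothetical supervised sample: a flattened board encoding, a probability
# distribution over actions (e.g. normalized visit counts or a one-hot of the
# expert move), and the game outcome from the current player's perspective.
struct Sample
    state  :: Vector{Float32}
    policy :: Vector{Float32}
    value  :: Float32          # in [-1, 1]
end

const STATE_DIM   = 9 * 3   # placeholder: adapt to your game's encoding
const NUM_ACTIONS = 9       # placeholder: number of action slots

# Two-headed network: shared trunk, policy head (logits), value head (tanh).
make_net() = Chain(
    Dense(STATE_DIM => 64, relu),
    Dense(64 => 64, relu),
    Parallel(tuple,
        Dense(64 => NUM_ACTIONS),   # policy logits
        Dense(64 => 1, tanh)),      # scalar value estimate
)

# AlphaZero-style loss: policy cross-entropy + value mean squared error.
function loss(net, states, policies, values)
    p_logits, v = net(states)
    Flux.logitcrossentropy(p_logits, policies) + Flux.mse(vec(v), values)
end

# Stack samples into batch matrices (one sample per column, Flux convention).
function make_batch(samples)
    states   = reduce(hcat, [s.state  for s in samples])
    policies = reduce(hcat, [s.policy for s in samples])
    values   = Float32[s.value for s in samples]
    states, policies, values
end

function train!(net, samples; epochs = 10, batchsize = 32)
    opt = Flux.setup(Adam(1e-3), net)
    for _ in 1:epochs, batch in Iterators.partition(shuffle(samples), batchsize)
        states, policies, values = make_batch(batch)
        grads = Flux.gradient(m -> loss(m, states, policies, values), net)
        Flux.update!(opt, net, grads[1])
    end
    net
end

# Usage with random placeholder data; real samples would come from expert games.
samples = [Sample(rand(Float32, STATE_DIM),
                  Flux.softmax(rand(Float32, NUM_ACTIONS)),
                  rand([-1f0, 0f0, 1f0]))
           for _ in 1:256]
net = train!(make_net(), samples)
```

The loss mirrors the usual AlphaZero objective (policy cross-entropy plus value mean squared error); when generating the samples, you would still want the exploration and diversity the quoted advice mentions so the buffer is not filled with near-identical games.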
