You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Sorry to disturb you again.
I'm making a project with AlphaZero.jl. I created files for a new board game and started a training, but the learning is VERY slow. I'm afraid it will take eternity.
So I decided to use played games of good players (i.e. supervised learning)
Jonathan Laurent wrote:
"I guess what you've have to do is generate many samples of the kind that are stored in AlphaZero's memory buffer. You can take these samples either from human play data or have other players play against each other to generate data. If you do so, be careful to add some exploration so that the same game is not played again and again and that you get some diversity in your data. Once you've got the data, you can either use the Trainer utility in learning.jl or just write your training procedure yourself in Flux."
I still don't understand, in which format the games and moves are stored in memory buffer.
Please help me.
The text was updated successfully, but these errors were encountered:
Sorry to disturb you again.
I'm making a project with AlphaZero.jl. I created files for a new board game and started a training, but the learning is VERY slow. I'm afraid it will take eternity.
So I decided to use played games of good players (i.e. supervised learning)
Jonathan Laurent wrote:
"I guess what you've have to do is generate many samples of the kind that are stored in AlphaZero's memory buffer. You can take these samples either from human play data or have other players play against each other to generate data. If you do so, be careful to add some exploration so that the same game is not played again and again and that you get some diversity in your data. Once you've got the data, you can either use the Trainer utility in learning.jl or just write your training procedure yourself in Flux."
I still don't understand, in which format the games and moves are stored in memory buffer.
Please help me.
The text was updated successfully, but these errors were encountered: