reproduce training dd #4
Hi. Thank you for your interest in our work. Unfortunately, I recently graduated and left my lab, so I can no longer access the tensor/ directory. I have a few questions to help narrow down the problem.
Please ping me if you have further issues. Thank you!
I already have your checkpoint models. I am currently working on training with my own data ("one round dialogue", i.e. one turn from speaker A plus one turn from the other speaker), so the first step for me is to reproduce your training.
I am also wondering how long training the dd model took you. One epoch takes me 9.5 hours on 4 Tesla K40c GPUs.
To reproduce training, can you share the training "tensor/" directory?
I am facing an overfitting situation: the training loss is close to zero.
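One way to confirm that a near-zero training loss really is overfitting is to log the loss on a held-out validation set after each epoch and see where it stops improving. The following is only a minimal sketch under that assumption; the function name, the `patience` parameter, and the loss values are hypothetical and are not taken from this repository.

```python
def best_epoch_before_overfit(val_losses, patience=2):
    """Return the 0-based epoch with the best validation loss if the
    validation loss then failed to improve for `patience` consecutive
    epochs; return None if that never happens."""
    best_val = float("inf")
    best_epoch = None
    epochs_without_improvement = 0
    for epoch, val_loss in enumerate(val_losses):
        if val_loss < best_val:
            # New best validation loss: remember it and reset the counter.
            best_val = val_loss
            best_epoch = epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return best_epoch
    return None


# Hypothetical per-epoch validation losses: the loss bottoms out at
# epoch 2 and then rises, even if the training loss keeps falling.
val_losses = [2.1, 1.5, 1.4, 1.6, 1.9]
stop_at = best_epoch_before_overfit(val_losses, patience=2)
```

If the function returns an epoch, the checkpoint saved at that epoch is a reasonable candidate to keep; if it returns None, the validation loss was still improving and the low training loss alone is not evidence of overfitting.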