reproduce training dd #4

hppy139 · 2024-12-16T07:49:42Z

To reproduce training, can you share the training "tensor/" directory?
I am facing a overfitting situation that "train loss is close to zero".

ddehun · 2024-12-16T08:08:44Z

Hi. Thank you for your interest in our work.

Unfortunately, I recently left my lab by graduation, so I cannot access the tensor/ directory.

I have some questions to guess the problems.

Which hyperparameters do you use to train your reranker? I think https://github.com/ddehun/DEnsity/blob/master/scripts/train.sh this script can be used to obtain similar results to the paper.
It would be also good to evaluate your model in the downstream task (i.e., meta-evaluation dataset for dialogue).
If your targeted corpus is either ConvAI2 or DailyDialogue, You can also use the pre-released checkpoints in README (https://drive.google.com/drive/folders/1IUUg6xsmEr28oed2yPqIA2m6xsQ9yNRd).

Please ping me if you have further issues. Thank you!

hppy139 · 2024-12-16T08:24:05Z

And actually I have got your checkpoints models, I'm recently working on training on my own data("one round dialogue", i.e. speaker A one turn + speaker one turn). So, the first step for me is to reproduce your training.
As for "1": yes, I am using the same parameters as this "DATASET=dd; temp=0.1; weight=1; lr=5e-5".
It's hard for me to distinguish which part got wrong...
(Bad to hear that "Unfortunately, I recently left my lab by graduation, so I cannot access the tensor/ directory." o(╥﹏╥)o)

hppy139 · 2024-12-16T08:28:47Z

Besides, I'm wondering your time cost in training dd model. One epoch costs me 9.5 hours by using 4 Tesla K40c GPUs.

hppy139 changed the title ~~reproduce training~~ reproduce training dd Dec 16, 2024

hppy139 closed this as completed Dec 16, 2024

hppy139 reopened this Dec 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reproduce training dd #4

reproduce training dd #4

hppy139 commented Dec 16, 2024

ddehun commented Dec 16, 2024

hppy139 commented Dec 16, 2024 •

edited

Loading

hppy139 commented Dec 16, 2024

reproduce training dd #4

reproduce training dd #4

Comments

hppy139 commented Dec 16, 2024

ddehun commented Dec 16, 2024

hppy139 commented Dec 16, 2024 • edited Loading

hppy139 commented Dec 16, 2024

hppy139 commented Dec 16, 2024 •

edited

Loading