
The optimizer may track some values across different training routines #136

Closed
DavdGao opened this issue Jun 2, 2022 · 0 comments · Fixed by #177
DavdGao commented Jun 2, 2022

  • Since the optimizer is initialized within the context, different training routines share the same optimizer instance.
  • Some optimizers, such as Adam, track past state (e.g., running averages of the gradient and its square).

Therefore, the optimizer may carry state variables over from one training routine to the next. Since the server broadcasts a freshly initialized model at the start of each training routine, carrying over stale optimizer state is likely unnecessary, and may even be wrong.
