Add on_optimizer_step to callback options #31095

dhruvbpai · 2024-05-28T21:30:20Z

Add an on_optimizer_step callback option to TrainerCallback.

Aside: This is my first open source pull request, so any feedback would be much appreciated!

The test tests/trainer/test_trainer_callback.py has been modified appropriately to invoke the new callback method.
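
For readers who want to try the new hook, here is a minimal sketch of a callback that overrides it. The class name `OptimizerStepCounter` and its `optimizer_steps` attribute are hypothetical and not part of this PR; the method signature is assumed to follow the other `TrainerCallback` events (`args`, `state`, `control`, `**kwargs`).

```python
try:
    from transformers import TrainerCallback
except ImportError:
    # Fallback so the sketch can still be imported without transformers installed.
    TrainerCallback = object


class OptimizerStepCounter(TrainerCallback):
    """Hypothetical callback: counts how often on_optimizer_step fires."""

    def __init__(self):
        self.optimizer_steps = 0

    def on_optimizer_step(self, args, state, control, **kwargs):
        # Per this PR's commits, the hook is invoked after the optimizer's step() call.
        self.optimizer_steps += 1
```

An instance would be passed to the Trainer through its `callbacks` argument, for example `Trainer(..., callbacks=[OptimizerStepCounter()])`.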

Fixes #31033 (issue)

Reviewers

As tagged in the initial issue: @muellerzr @younesbelkada

younesbelkada

This looks great to me, thanks!

HuggingFaceDocBuilderDev · 2024-05-29T09:06:10Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

muellerzr

Makes sense to me as well! Nice job. cc @amyeroberts for final review

LysandreJik

Awesome, LGTM! Thanks a lot 🤗

PangziZhang523 · 2024-06-20T03:12:15Z

Hello, thank you for your contribution. I used on_optimizer_step to print the gradient, and all the printed values were None, but the final grad_norm had a value. Why is that?

colmon46 · 2024-07-19T06:22:43Z

Same question as @PangziZhang523. Do you know how to fix this problem, please?

Arunprakash-A · 2024-08-21T11:25:29Z

It is working fine for me.

YeLuoSuiYou · 2024-10-15T03:30:51Z

Hi @colmon46, if we use DeepSpeed or FSDP, we should use self.model_wrapped to get the gradients of every layer. But do you know how we can get self.model_wrapped in callbacks? Thank you.
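
To narrow down the question above, a small diagnostic can report which parameters have no `.grad` at the moment the hook runs. This is only a sketch: `summarize_grads` is a hypothetical helper, it assumes a PyTorch-style module exposing `named_parameters()`, and it assumes the Trainer hands the model to callbacks as `kwargs["model"]`. It does not solve the DeepSpeed/FSDP case raised here; it only shows whether gradients are visible on the object the callback receives.

```python
def summarize_grads(model):
    """Return (names whose grad is None, {name: gradient norm}) for a model."""
    missing, norms = [], {}
    for name, param in model.named_parameters():
        if param.grad is None:
            # No gradient is attached to this parameter object.
            missing.append(name)
        else:
            norms[name] = float(param.grad.norm())
    return missing, norms
```

Inside `on_optimizer_step`, one could call `summarize_grads(kwargs["model"])` and print both results. If every name ends up in `missing`, the gradients are not stored on that object, which would be consistent with the suggestion to look at the wrapped model instead.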

Gaoyg · 2024-11-07T06:35:02Z

Same question. Is there a solution already? Thanks.

dhruvbpai added 4 commits May 28, 2024 13:41

Modified test (a69f0e9)

Added on_optimizer_step to callbacks (95a4579)

Move callback after step is called (5dcca4c)

Added on optimizer step callback (87e740f)

younesbelkada approved these changes May 29, 2024

younesbelkada requested a review from muellerzr May 29, 2024 08:45

muellerzr approved these changes May 29, 2024

muellerzr requested a review from amyeroberts May 29, 2024 14:15

LysandreJik approved these changes May 29, 2024

LysandreJik merged commit 5c88253 into huggingface:main May 29, 2024
21 checks passed

dhruvbpai deleted the before_optimizer_step branch May 29, 2024 19:20
