add functions to inspect model and optimizer status to trainer.py #29838

CKeibel · 2024-03-24T10:58:01Z

What does this PR do?

Adds three inspection functions to trainer.py: get_num_trainable_parameters returns the number of parameters that require grad, get_learning_rates returns the learning rates of the optimizer's parameter groups, and get_optimizer_group returns the optimizer groups for parameters.
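
For readers who want a concrete picture of what these three helpers compute, here is a minimal sketch in plain PyTorch. It is not the code added by this PR: the functions below are standalone (the PR adds methods to the Trainer), and the behavior of get_optimizer_group when no parameter is passed is an assumption made for illustration. The ValueError for a missing optimizer mirrors the commit "add tests and raise ValueError when optimizer is None".

```python
import torch
from torch import nn


def get_num_trainable_parameters(model: nn.Module) -> int:
    # Count only the parameters that will receive gradients.
    return sum(p.numel() for p in model.parameters() if p.requires_grad)


def get_learning_rates(optimizer):
    # One learning rate per optimizer parameter group.
    if optimizer is None:
        raise ValueError("optimizer is None; set it up before inspecting it")
    return [group["lr"] for group in optimizer.param_groups]


def get_optimizer_group(optimizer, param=None):
    # With a parameter: return the group that contains it.
    # Without one (assumed behavior): return the parameters of every group.
    if optimizer is None:
        raise ValueError("optimizer is None; set it up before inspecting it")
    if param is not None:
        for group in optimizer.param_groups:
            # Compare by identity; `in` on tensors would compare element-wise.
            if any(p is param for p in group["params"]):
                return group
    return [group["params"] for group in optimizer.param_groups]


model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

num_trainable = get_num_trainable_parameters(model)  # 4 * 2 weights + 2 biases
learning_rates = get_learning_rates(optimizer)
weight_group = get_optimizer_group(optimizer, model.weight)
```

Here num_trainable is 10 and learning_rates is [0.1], since a single parameter group holds both the weight and the bias.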

Fixes #29016

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@amyeroberts @muellerzr

muellerzr

Very nice PR! Any chance you could add some basic tests in tests/trainer/test_trainer.py for these?

CKeibel · 2024-03-25T13:40:20Z

> Very nice PR! Any chance you could add some basic tests in tests/trainer/test_trainer.py for these?

Of course, I wasn't sure where to add them, but now I see that I missed that test_trainer.py exists. I will add the tests later. :)

HuggingFaceDocBuilderDev · 2024-03-25T13:53:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

muellerzr · 2024-03-25T15:10:08Z

Great, thanks! I'll give a ✅ after, great job and very clean implementation :)

amyeroberts

Nice - thanks for adding!

+1 for adding tests. Once those are done, you can ping for a final review and we can merge 🤗

muellerzr · 2024-03-25T22:51:10Z

tests/trainer/test_trainer.py

+        out_features = 64
+        # in_features * out_features + bias
+        expected_num_params = in_features * out_features + out_features
+        model = nn.Sequential(nn.Linear(in_features, out_features))


Let's perhaps make this two linear layers, one that's frozen and one that's not, so that the trainable-parameter check also covers the case where we have different optimizer groups? :)

CKeibel · 2024-03-26

Yes, you're right, that would definitely be better :)
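
The suggestion above can be sketched in plain PyTorch as follows. This is an illustration only, not the test that was merged: the layer sizes are hypothetical, and the count is computed directly from the model instead of through the Trainer method, so the example stays self-contained. It follows the same arithmetic as the diff (in_features * out_features + bias) and shows that freezing the second layer removes its parameters from the trainable count.

```python
from torch import nn

# Hypothetical sizes, chosen only for illustration.
in_features, hidden_features, out_features = 128, 64, 32

model = nn.Sequential(
    nn.Linear(in_features, hidden_features),
    nn.Linear(hidden_features, out_features),
)


def count_trainable(module: nn.Module) -> int:
    # Same rule as in the PR description: only parameters that require grad.
    return sum(p.numel() for p in module.parameters() if p.requires_grad)


# in_features * out_features + bias, per layer
layer_1 = in_features * hidden_features + hidden_features
layer_2 = hidden_features * out_features + out_features

trainable_before = count_trainable(model)  # both layers are trainable

# Freeze the last layer so it no longer contributes to the count.
for param in model[-1].parameters():
    param.requires_grad = False

trainable_after = count_trainable(model)  # only the first layer remains

assert trainable_before == layer_1 + layer_2
assert trainable_after == layer_1
```

With these sizes the first layer has 8256 parameters and the second 2080, so the count drops from 10336 to 8256 after freezing.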

amyeroberts

Nice tests! Thanks for adding!

Just a small nit on the torch condition - then we're good to merge!

tests/trainer/test_trainer.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

muellerzr

Thanks! Looks good to me as well, let's fix that spacing though 😉

tests/trainer/test_trainer.py

Co-authored-by: Zach Mueller <muellerzr@gmail.com>

CKeibel · 2024-03-27T17:47:44Z

Is there anything else to do? :) @amyeroberts @muellerzr

amyeroberts · 2024-03-28T10:37:12Z

@CKeibel Nope - merging now!

CKeibel · 2024-03-28T11:00:07Z

Excellent thanks, then I'll start looking for a new issue to work on. :)

amyeroberts · 2024-03-28T14:24:09Z

Looking forward to the future PRs :) Thanks again for contributing to improving the library!

add functions to get number of params which require grad, get optimizer group for parameters and get learning rates of param groups to trainer.py

afb02ba

muellerzr reviewed Mar 25, 2024

amyeroberts reviewed Mar 25, 2024

add tests and raise ValueError when optimizer is None

b0c39dc

muellerzr reviewed Mar 25, 2024

CKeibel added 2 commits March 26, 2024 08:23

add second layer to test and freeze its weights

d7fac7e

check if torch is available before running tests

6b29066

amyeroberts approved these changes Mar 26, 2024

use decorator to check if torch is available

ab2cb72

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

muellerzr approved these changes Mar 26, 2024

fix test indentation

383220e

Co-authored-by: Zach Mueller <muellerzr@gmail.com>

amyeroberts merged commit aac7099 into huggingface:main Mar 28, 2024
21 checks passed
