Fix weight loading of weight_g_idx compressed-tensors parameters #35741
What does this PR do?
`modeling_utils.py` contains logic which replaces the `weight_g` substring in a parameter key with `parametrizations.weight.original0`. This incorrectly rewrites `compressed-tensors` model parameters whose key contains `weight_g_idx`, resulting in a loading error.

transformers/src/transformers/modeling_utils.py
Line 4380 in 02a492a
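The collision can be reproduced in isolation. The following is a minimal sketch, not the actual code in `modeling_utils.py`: the helper name `rename_weight_norm_key` and both example keys are hypothetical, chosen only to show how a plain substring replacement also matches `weight_g_idx`.

```python
# Hypothetical helper: mirrors the substring replacement described above,
# not the exact implementation in transformers.
def rename_weight_norm_key(key: str) -> str:
    return key.replace("weight_g", "parametrizations.weight.original0")


# A weight-norm parameter key: this is the case the rename is meant for.
weight_norm_key = "encoder.conv.weight_g"

# A compressed-tensors parameter key (illustrative name): it merely
# contains "weight_g" as a prefix of "weight_g_idx".
compressed_key = "model.layers.0.self_attn.q_proj.weight_g_idx"

print(rename_weight_norm_key(weight_norm_key))
print(rename_weight_norm_key(compressed_key))  # the key is mangled
```

The second key ends in `parametrizations.weight.original0_idx` after the rename, so it no longer matches any parameter on the model.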
This PR adds logic to `src/transformers/quantizers/quantizer_compressed_tensors.py` that temporarily turns off the `weight_norm` attribute so that the substring replacement in `modeling_utils.py` does not occur during weight loading. The attribute is turned back on in `_process_model_after_weight_loading`. This allows the models to be loaded correctly.
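The hide-then-restore pattern used by the fix can be sketched as below. This is an illustration under stated assumptions, not the code in this PR: `parametrizations` is a stand-in namespace rather than the real torch module, `load_keys` is a toy loader that is assumed to gate the rename on `hasattr(..., "weight_norm")`, and `ToyQuantizer` with its two hook names is hypothetical.

```python
import types

# Stand-in for the module that exposes `weight_norm` (hypothetical, not torch).
parametrizations = types.SimpleNamespace(weight_norm=lambda module: module)


def load_keys(keys):
    """Toy loader: renames `weight_g` keys only while `weight_norm` is exposed.

    The hasattr gate is an assumption made for this sketch.
    """
    if hasattr(parametrizations, "weight_norm"):
        return [k.replace("weight_g", "parametrizations.weight.original0") for k in keys]
    return list(keys)


class ToyQuantizer:
    """Hypothetical quantizer hooks showing the hide-then-restore pattern."""

    def __init__(self):
        self._saved_weight_norm = None

    def before_weight_loading(self):
        # Hide the attribute so the loader skips the substring rename.
        if hasattr(parametrizations, "weight_norm"):
            self._saved_weight_norm = parametrizations.weight_norm
            del parametrizations.weight_norm

    def after_weight_loading(self):
        # Restore the attribute once loading has finished.
        if self._saved_weight_norm is not None:
            parametrizations.weight_norm = self._saved_weight_norm
            self._saved_weight_norm = None


keys = ["model.layers.0.mlp.up_proj.weight_g_idx"]  # illustrative key
quantizer = ToyQuantizer()
quantizer.before_weight_loading()
loaded = load_keys(keys)          # keys pass through unchanged
quantizer.after_weight_loading()  # `weight_norm` is visible again
```

Because the attribute is restored afterwards, code that runs after loading still sees `weight_norm` as before.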
Who can review?
@SunMarc
@ArthurZucker