[`GPTQ`, `CompressedTensors`] Fix unsafe imports and metada check #34815

vasqu · 2024-11-19T17:13:23Z

What does this PR do?

Fixes some unsafe imports for GPTQ and CompressedTensors, as well as an unsafe metadata check for GPTQ: package metadata can't be checked if the package isn't installed.

I tried looking into integrating the env validation function before even creating the quantizers but that would require a complete restructure, especially since the env is checked after creation to get additional info like device map etc. So I kept it as simple as I could and wrapped the unsafe stuff.
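For readers unfamiliar with the pattern, here is a minimal sketch of what "wrapping the unsafe stuff" can look like. It is not the code from this PR: `HypotheticalQuantizer`, `is_package_available`, `get_installed_version`, and the package name `some_missing_backend` are all made up for illustration. The idea is only that construction never imports the optional backend or reads its metadata unless the package is actually present, and the hard failure is left to the environment check.

```python
import importlib.metadata
import importlib.util
from typing import Optional


def is_package_available(name: str) -> bool:
    """Return True if the top-level package `name` can be found, without importing it."""
    return importlib.util.find_spec(name) is not None


def get_installed_version(name: str) -> Optional[str]:
    """Return the installed version of `name`, or None if no metadata exists for it."""
    try:
        return importlib.metadata.version(name)
    except importlib.metadata.PackageNotFoundError:
        return None


class HypotheticalQuantizer:
    """Illustrative only; not the transformers GPTQ or CompressedTensors quantizer."""

    required_package = "some_missing_backend"  # hypothetical package name

    def __init__(self) -> None:
        # Construction must not fail just because the optional backend is absent.
        self.backend_available = is_package_available(self.required_package)
        # Only look up metadata when the package exists; otherwise skip the check.
        self.backend_version = (
            get_installed_version(self.required_package)
            if self.backend_available
            else None
        )

    def validate_environment(self) -> None:
        # The user-facing error is raised here, after construction.
        if not self.backend_available:
            raise ImportError(
                f"This quantizer requires `{self.required_package}` to be installed."
            )
```

With this shape, creating the object always succeeds and the missing dependency surfaces as a clear `ImportError` from `validate_environment` rather than as an import or metadata error during creation.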

Minimal reproducible script to trigger gptq errors (when optimum is not installed):

from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig


model_id = "facebook/opt-125m"
tokenizer = AutoTokenizer.from_pretrained(model_id)
dataset = ["auto-gptq is an easy-to-use model quantization library with user-friendly apis, based on GPTQ algorithm."]
gptq_config = GPTQConfig(bits=4, dataset=dataset, tokenizer=tokenizer)


quantized_model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", quantization_config=gptq_config)

Fixes #34765

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@SunMarc @MekkCyber

src/transformers/quantizers/quantizer_gptq.py

MekkCyber · 2024-11-22T11:35:30Z

Thanks for the fix @vasqu, LGTM !

SunMarc

Thanks for the fixes !

HuggingFaceDocBuilderDev · 2024-11-22T16:43:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker

Looks good, but I think we should always call validate_environment before even initializing the quantizer (so probably update the superclass, HfQuantizer, to call it at init time), no?

vasqu · 2024-11-25T16:19:00Z

@ArthurZucker
See my comments above:

I tried looking into integrating the env validation function before even creating the quantizers but that would require a complete restructure, especially since the env is checked after creation to get additional info like device map etc. So I kept it as simple as I could and wrapped the unsafe stuff.

The problem is that some quantizers need additional information that is not available at creation; another option I could see is to add a post_init to create those unsafe parts :)
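A rough sketch of that post_init idea, under stated assumptions: the classes below are hypothetical and do not mirror `HfQuantizer` or any real transformers signature. Construction stores only plain data, the environment check runs once extra information such as the device map is known, and only then does a `post_init` hook build anything that needs the optional dependency. The standard-library `json` module stands in for an optional backend so the example runs anywhere.

```python
import importlib.util
from typing import Any, Dict, Optional, Tuple


class QuantizerSketch:
    """Hypothetical base class; not the real HfQuantizer."""

    required_packages: Tuple[str, ...] = ()

    def __init__(self, config: Dict[str, Any]) -> None:
        # Store plain data only, so creating the object is always safe.
        self.config = config
        self.backend: Optional[Any] = None

    def validate_environment(self, device_map: Optional[Dict[str, Any]] = None) -> None:
        # Runs after creation, once extra info (e.g. the device map) is available.
        missing = [
            p for p in self.required_packages if importlib.util.find_spec(p) is None
        ]
        if missing:
            raise ImportError(f"Missing required packages: {', '.join(missing)}")
        # Dependencies are present, so the unsafe parts can be built now.
        self.post_init(device_map)

    def post_init(self, device_map: Optional[Dict[str, Any]]) -> None:
        """Hook for subclasses to create dependency-backed objects."""


class JsonBackedQuantizer(QuantizerSketch):
    # `json` is a stand-in for an optional backend package.
    required_packages = ("json",)

    def post_init(self, device_map: Optional[Dict[str, Any]]) -> None:
        import json  # imported lazily, only after validation passed

        self.backend = json
        self.device_map = device_map or {}


class MissingBackendQuantizer(QuantizerSketch):
    # Hypothetical package name that is assumed not to be installed.
    required_packages = ("hypothetical_missing_backend",)
```

The trade-off is that anything touching `self.backend` has to tolerate it being `None` until validation has run.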

vasqu · 2024-12-08T20:33:07Z

gentle ping @ArthurZucker

ArthurZucker

Thanks for explaining

ArthurZucker · 2024-12-20T10:08:33Z

Not sure if it was fixed by recent quantizer compressed changes, let's resolve conflicts and good to merge

vasqu · 2024-12-24T18:03:27Z

Forgot about this PR myself 😅 should be good to merge now! Thx for the merges here, ci sometimes do be flaky tho

MekkCyber · 2024-12-24T18:18:21Z

will merge it once the ci is green @vasqu 🤗

vasqu added 2 commits November 19, 2024 18:03

fix gptq creation when optimum is not installed + fix metadata checking

e32bcdc

fix compressed tensors as well

3b19ea9

vasqu commented Nov 19, 2024

src/transformers/quantizers/quantizer_gptq.py (outdated, resolved)

style

f3c7f78

vasqu force-pushed the fix-quant-loading branch from 3c927c3 to f3c7f78 Compare November 19, 2024 17:27

vasqu added 2 commits November 19, 2024 18:28

Merge remote-tracking branch 'upstream/main' into fix-quant-loading

8b1b76a

pray for ci luck on flaky tests :prayge:

d2e0edc

MekkCyber self-requested a review November 20, 2024 11:15

vasqu changed the title from "Fix quant loading for gptq and compressed tensors" to "[GPTQ, CompressedTensors] Fix unsafe imports and metada check" Nov 22, 2024

MekkCyber requested review from SunMarc and removed request for MekkCyber November 22, 2024 11:40

SunMarc approved these changes Nov 22, 2024

SunMarc requested a review from ArthurZucker November 22, 2024 16:13

ArthurZucker reviewed Nov 25, 2024

ArthurZucker approved these changes Dec 20, 2024

SunMarc and others added 4 commits December 23, 2024 18:22

Merge branch 'main' into fix-quant-loading

ef92834

Merge branch 'main' into fix-quant-loading

ddd4e1b

Merge branch 'main' into fix-quant-loading

c6c6746

trigger ci

7765a95

MekkCyber merged commit 24c91f0 into huggingface:main Dec 24, 2024
25 checks passed

vasqu deleted the fix-quant-loading branch December 24, 2024 18:33
