Fix: Falcon tie_word_embeddings in GGUF #35715

MekkCyber · 2025-01-15T15:56:24Z

What does this PR do?

In the modeling_gguf_pytorch_utils.py file, the value of tie_word_embeddings is determined by the presence (or absence) of the tensor output.weight in the GGUF file. While this approach is generally a good indicator, the Falcon architecture is an exception to the rule as you can see here : https://huggingface.co/tiiuae/falcon-7b/tree/main?show_file_info=model-00002-of-00002.safetensors (hf format) and here https://huggingface.co/tensorblock/falcon-7b-GGUF/tree/main?show_file_info=falcon-7b-Q2_K.gguf (gguf format)

In Falcon, word_embeddings are tied to the lm_head weights. Despite this, output.weight is still present in the GGUF file, and lm_head is included in the Hugging Face model format. To handle this edge case, I added an exception array for such architectures.

This issue was causing a subtle error related to parameters not being on the same device, which was only discoverable in multi-GPU settings.

Who can review ?

@SunMarc @Isotr0py

HuggingFaceDocBuilderDev · 2025-01-15T16:25:17Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Isotr0py · 2025-01-15T16:36:09Z

But I remember tie_word_embeddings is not used by modeling_falcon.py?

BTW, if the issue about tie_word_embedding is the case, can we also update the falcon conversion test to cover this edge case?

MekkCyber · 2025-01-15T16:45:59Z

tie_word_embeddings is not used directly in the modeling file of the model, instead it's used in modeling_utils.py : https://github.com/huggingface/transformers/blob/main/src/transformers/modeling_utils.py#L1858
It was the failure of the Falcon conversion test that made me realize the error, and now it passes successfully.

Isotr0py

LGTM! Thanks for fixing!

SunMarc

LGTM ! Let's do that for falcon but if more issues starts to appear, let's modify the tests slightly to not take into account the device of the tensors

* fix falcon tie_word_embeddings * fix style

fix falcon tie_word_embeddings

757765c

MekkCyber requested review from Rocketknight1 and ArthurZucker as code owners January 15, 2025 15:56

fix style

2c59f0b

MekkCyber requested review from SunMarc and removed request for Rocketknight1 and ArthurZucker January 15, 2025 15:57

Merge branch 'main' into fix_ggml_falcon_tiewordembeddings

8c313a0

Isotr0py approved these changes Jan 15, 2025

View reviewed changes

Merge branch 'main' into fix_ggml_falcon_tiewordembeddings

17fec72

SunMarc approved these changes Jan 16, 2025

View reviewed changes

SunMarc merged commit fd4f14c into main Jan 16, 2025
26 checks passed

SunMarc deleted the fix_ggml_falcon_tiewordembeddings branch January 16, 2025 12:18

MekkCyber mentioned this pull request Jan 21, 2025

Fix : BLOOM tie_word_embeddings in GGUF #35812

Merged

bursteratom pushed a commit to bursteratom/transformers that referenced this pull request Jan 31, 2025

Fix: Falcon tie_word_embeddings in GGUF (huggingface#35715)

96f120d

* fix falcon tie_word_embeddings * fix style

elvircrn pushed a commit to elvircrn/transformers that referenced this pull request Feb 13, 2025

Fix: Falcon tie_word_embeddings in GGUF (huggingface#35715)

d36d7a2

* fix falcon tie_word_embeddings * fix style

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Falcon tie_word_embeddings in GGUF #35715

Fix: Falcon tie_word_embeddings in GGUF #35715

MekkCyber commented Jan 15, 2025 •

edited

Loading

HuggingFaceDocBuilderDev commented Jan 15, 2025

Isotr0py commented Jan 15, 2025 •

edited

Loading

MekkCyber commented Jan 15, 2025

Isotr0py left a comment

SunMarc left a comment

Fix: Falcon tie_word_embeddings in GGUF #35715

Fix: Falcon tie_word_embeddings in GGUF #35715

Conversation

MekkCyber commented Jan 15, 2025 • edited Loading

What does this PR do?

Who can review ?

HuggingFaceDocBuilderDev commented Jan 15, 2025

Isotr0py commented Jan 15, 2025 • edited Loading

MekkCyber commented Jan 15, 2025

Isotr0py left a comment

Choose a reason for hiding this comment

SunMarc left a comment

Choose a reason for hiding this comment

MekkCyber commented Jan 15, 2025 •

edited

Loading

Isotr0py commented Jan 15, 2025 •

edited

Loading