LLM: fix mpt load_low_bit issue #10075

JinBridger · 2024-02-02T07:04:22Z

https://github.com/analytics-zoo/nano/issues/948

Enable tie_word_embeddings when save mpt model. Otherwise, mpt that saved with save_low_bit cannot run normally.

This may because MPT model requires tie_word_embeddings=True while we manually save this variable as False when saving low bit models.

leonardozcm

LGTM， the last layer of mpt is a customized normal layer instead of linear which is usually seen in other llm models.

* fix * retry * retry

fix

c9812dd

leonardozcm approved these changes Feb 2, 2024

View reviewed changes

JinBridger added 2 commits February 5, 2024 09:16

retry

b0bded6

retry

700b95e

Oscilloscope98 merged commit a9da1a5 into intel:main Feb 5, 2024
19 checks passed

Jasonzzt pushed a commit to Jasonzzt/BigDL that referenced this pull request Feb 19, 2024

LLM: fix mpt load_low_bit issue (intel#10075)

3fa67c0

* fix * retry * retry

liu-shaojun pushed a commit that referenced this pull request Mar 25, 2024

LLM: fix mpt load_low_bit issue (#10075)

ad05010

* fix * retry * retry

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLM: fix mpt load_low_bit issue #10075

LLM: fix mpt load_low_bit issue #10075

JinBridger commented Feb 2, 2024 •

edited

Loading

leonardozcm left a comment

LLM: fix mpt load_low_bit issue #10075

LLM: fix mpt load_low_bit issue #10075

Conversation

JinBridger commented Feb 2, 2024 • edited Loading

leonardozcm left a comment

Choose a reason for hiding this comment

JinBridger commented Feb 2, 2024 •

edited

Loading