
Vocab size mismatch #3900

Closed
eswarthammana opened this issue Nov 2, 2023 · 10 comments
@eswarthammana

Llama_2_7B-chat: vocab size mismatch (model has -1, but tokenizer.model has 32000).

@yaashwardhan

This is an easy fix.

There should be a .json file (probably params.json) inside the llama-2-7b-chat folder.
Open the JSON file and change "vocab_size" from -1 to 32000.

Here is my params.json file:

{"dim": 4096, "multiple_of": 256, "n_heads": 32, "n_layers": 32, "norm_eps": 1e-06, "vocab_size": 32000}

@eswarthammana

Thanks for the fix. I had downloaded an older version of the repo a few days back and did not face this issue there. :)

@ishowshao

Same issue. Why does Meta write -1?

@lorddoskias

I also hit this, and I think the correct way to fix it is in the convert script: simply remove "vocab_size" when it equals -1, which results in it being derived from tok_embeddings.weight.
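
A sketch of how that could look in the convert script's params loading (illustrative only, not the actual llama.cpp code; the function and argument names here are made up):

import json

def load_params(path, tok_embeddings_rows):
    # tok_embeddings_rows: number of rows in tok_embeddings.weight, i.e. the
    # vocabulary size implied by the checkpoint itself.
    with open(path) as f:
        params = json.load(f)

    # Meta ships vocab_size = -1 as a placeholder in some params.json files.
    # Drop it and fall back to the size derived from the embedding matrix.
    if params.get("vocab_size", -1) == -1:
        params["vocab_size"] = tok_embeddings_rows

    return params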

lorddoskias pushed a commit to lorddoskias/llama.cpp that referenced this issue Nov 6, 2023
When vocab_size is detected to be -1, simply remove its value from the parsed params.json and fall back to using tok_embeddings.weight.

Fixes ggerganov#3900
@kaiwren

kaiwren commented Nov 16, 2023

+1 I ran into this also (Exception: Vocab size mismatch (model has -1, but ../llama/tokenizer.model has 32000)) today with llama-2-7b.

@guertsen

guertsen commented Nov 23, 2023

Had the same issue with llama-2-7b.

@rsbepvb

rsbepvb commented Nov 26, 2023

Confirming this fixed the issue with the most recent Llama download.

@ursachec

ursachec commented Dec 7, 2023

Ran into a similar issue today, using the current tip of master bcc0eb4:

$ python3 llama.cpp/convert.py ./Magicoder-S-DS-6.7B/ --outtype f16 --outfile magicoder-S-DS-6.7B.FP16.gguf
//...
Exception: Vocab size mismatch (model has 32256, but Magicoder-S-DS-6.7B/tokenizer.model combined with Magicoder-S-DS-6.7B/added_tokens.json has 32022)
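
For mismatches like this, one quick diagnostic is to recount the tokenizer side yourself and compare it with the model's reported vocab size (a rough sketch, assuming the sentencepiece package and the file names from the command above):

import json
from sentencepiece import SentencePieceProcessor

# Paths are illustrative; point them at the model directory in question.
sp = SentencePieceProcessor(model_file="Magicoder-S-DS-6.7B/tokenizer.model")
base = sp.vocab_size()

with open("Magicoder-S-DS-6.7B/added_tokens.json") as f:
    added = len(json.load(f))

# This is the "tokenizer.model combined with added_tokens.json" count from the
# exception; compare it with the vocab size the model itself reports (32256 here).
print(f"tokenizer vocab: {base} + {added} added = {base + added}")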

@FotieMConstant

I confirm this fixed the issue with Llama 2 7B models.


This issue was closed because it has been inactive for 14 days since being marked as stale.
