Count model parameters and size using only the GGUF tensors #10285

slaren · 2024-11-14T01:19:29Z

Discussed in #10274

The number of parameters and size of the model currently is calculated from the tensors created after the model is loaded, which in some cases may contain duplicated tensors, resulting in an inaccurate and inconsistent reporting of the model size. To address this, llama_model_n_params and llama_model_size should be modified to return the value as calculated in llama_model_loader::n_elements and n_bytes, which could be stored in llama_model while loading the model.

The text was updated successfully, but these errors were encountered:

FirstTimeEZ · 2024-11-15T09:07:01Z

This issue is fixed by #10286

fixes #10285

…nov#10286) fixes ggerganov#10285

slaren added bug Something isn't working good first issue Good for newcomers labels Nov 14, 2024

FirstTimeEZ mentioned this issue Nov 14, 2024

save number of parameters and the size in llama_model, fixes #10285 #10286

Merged

1 task

slaren pushed a commit that referenced this issue Nov 16, 2024

llama : save number of parameters and the size in llama_model (#10286)

89e4caa

fixes #10285

slaren closed this as completed in #10286 Nov 16, 2024

arthw pushed a commit to arthw/llama.cpp that referenced this issue Nov 18, 2024

llama : save number of parameters and the size in llama_model (ggerga…

0d12d86

…nov#10286) fixes ggerganov#10285

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Count model parameters and size using only the GGUF tensors #10285

Count model parameters and size using only the GGUF tensors #10285

slaren commented Nov 14, 2024

FirstTimeEZ commented Nov 15, 2024

Count model parameters and size using only the GGUF tensors #10285

Count model parameters and size using only the GGUF tensors #10285

Comments

slaren commented Nov 14, 2024

Discussed in #10274

FirstTimeEZ commented Nov 15, 2024