[Bug]: ValidationError when loading fp8-dynamic model with empty "sparsity_config" #12044
Closed
Labels
bug
Your current environment
OS: Ubuntu Server 22.04 LTS
GPU: Nvidia H200
Driver: 550.127.08
The output of `pip list`
Model Input Dumps
No response
🐛 Describe the bug
When starting vllm like this:
The following error occurs:
The model in question is cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic from Hugging Face: https://huggingface.co/cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic/blob/main/config.json
The error does not occur with vllm 0.6.4.post1 or 0.6.5; it first appears in 0.6.6.
When the line containing
"sparsity_config": {}
is removed from the model's config.json, the error doesn't happen and the model works fine even with 0.6.6.post1. While this may be considered a workaround, it would be better to fix this issue, as there are potentially many models with an empty sparsity_config.
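For anyone who needs the workaround before a fix lands, here is a minimal sketch that automates the manual edit on a local copy of the model. It is not part of vllm: the helper names `strip_empty_sparsity_config` and `patch_config` are made up for this example, and it assumes the key should be dropped wherever it appears in the JSON (top level or nested) but only when its value is an empty object.

```python
import json
from pathlib import Path


def strip_empty_sparsity_config(node):
    """Return a copy of `node` without any "sparsity_config" key whose value is {}.

    Hypothetical helper for this workaround; non-empty sparsity configs are kept.
    """
    if isinstance(node, dict):
        return {
            key: strip_empty_sparsity_config(value)
            for key, value in node.items()
            if not (key == "sparsity_config" and value == {})
        }
    if isinstance(node, list):
        return [strip_empty_sparsity_config(item) for item in node]
    return node


def patch_config(path):
    """Rewrite the config.json at `path` in place; return True if it changed."""
    path = Path(path)
    original = json.loads(path.read_text(encoding="utf-8"))
    patched = strip_empty_sparsity_config(original)
    if patched == original:
        return False
    path.write_text(json.dumps(patched, indent=2) + "\n", encoding="utf-8")
    return True
```

Calling `patch_config` on the config.json of a locally downloaded model (the location depends on your setup) should have the same effect as deleting the line by hand. Back up the original file first, since the script rewrites it in place.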