StaticCache
Bad generation results with Llama after v4.39.0
#30417
Labels
StaticCache
Bad generation results with Llama after v4.39.0
#30417
System Info
transformers version: 4.41.0.dev0
Platform: Linux-5.15.0-89-generic-x86_64-with-glibc2.
Python version: 3.10.
Huggingface_hub version: 0.20.
Safetensors version: 0.4.
Accelerate version: 0.21.0
Who can help?
@ArthurZucker @gante
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
The generation output quality with the current 4.41.0.dev0 version is very bad compared to the previous 4.39.0 version, at least with Llama. With quantized models, it outputs complete gibberish. The same code works totally fine with 4.39.0
Expected behavior
The output should be the same as with the previous 4.39.0 version
The text was updated successfully, but these errors were encountered: