Update to latest upstream BLOOM implementation / BLOOM Quantization does not work #228
I am able to perform inference using the f16 model found here: https://huggingface.co/nouamanetazi/bloomz-560m-ggml/tree/main. But when I use `llm` to quantize it to the q4_0 format, the model produces gibberish.
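For reproducibility, here is a sketch of the steps involved. This assumes the `llm` CLI's architecture-subcommand layout (`llm bloom quantize <source> <dest> <format>` and `llm bloom infer`); the exact subcommand names, flags, and the f16 filename inside the Hugging Face repo are assumptions and may differ between versions:

```sh
# Assumed reproduction steps; exact CLI syntax may differ between llm versions.

# Fetch the known-good f16 GGML model (filename assumed from the HF repo).
git clone https://huggingface.co/nouamanetazi/bloomz-560m-ggml

# Quantize the f16 model to q4_0 (subcommand layout assumed).
llm bloom quantize bloomz-560m-ggml/ggml-model-f16.bin bloomz-560m-q4_0.bin q4_0

# Inference is coherent with the f16 model but produces gibberish with q4_0.
llm bloom infer -m bloomz-560m-q4_0.bin -p "Translate to French: Hello"
```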