
quantize: k_quants.c:73: nearest_int: Assertion `fval <= 4194303.f' failed. #2982

Closed
cebtenzzre opened this issue Sep 3, 2023 · 3 comments
Labels: bug (Something isn't working)

@cebtenzzre (Collaborator):

While trying to quantize Huginn-22b-Prototype to Q5_0, I ran into this assertion failure while quantizing the output tensor:

[ 331/ 363]                        output.weight - [ 6656, 32000,     1,     1], type =    f16, quantizing to q6_K .. quantize: k_quants.c:73: nearest_int: Assertion `fval <= 4194303.f' failed.
quantize: k_quants.c:73: nearest_int: Assertion `fval <= 4194303.f' failed.

It happens here:

#5  0x00007fed68032d26 in __assert_fail (assertion=0x557583a83303 "fval <= 4194303.f", file=0x557583a832f8 "k_quants.c", 
    line=73, function=0x557583a83338 <__PRETTY_FUNCTION__.31> "nearest_int") at assert.c:101
#6  0x0000557583a6c991 in nearest_int (fval=-nan(0x400000)) at k_quants.c:73
#7  0x0000557583a7142c in quantize_row_q6_K_reference (x=0x7fed18577010, y=0x7fecdd38c210, k=16384) at k_quants.c:1092
#8  0x0000557583a71cad in ggml_quantize_q6_K (src=0x7fed18577010, dst=0x7fecdd38c210, n=16384, k=16384, hist=0x7feccc000b70)
    at k_quants.c:1200
#9  0x00005575839dad38 in ggml_quantize_chunk (type=GGML_TYPE_Q6_K, src=0x7fed18537010, dst=0x7fecdd37f010, start=65536, 
    n=16384, hist=0x7feccc000b70) at ggml.c:19527
@cebtenzzre added the bug (Something isn't working) label on Sep 3, 2023
@KerfuffleV2 (Collaborator):

#2434 might fix this if implemented?

@ikawrakow (Contributor):

Does #3010 solve it?

To trigger this assertion, all weights in a block of 256 must be zero. This has never happened before, so I wonder how meaningful this model is. I am somewhat surprised, though, to see NaN rather than Inf as the argument triggering the assert in the nearest_int() function. Any chance there are already NaNs in the fp16 model?

@KerfuffleV2 No, #2434 will not solve it. The zeros (or NaNs) will remain zeros (or NaNs) after normalization, so the unforeseen situation will still arise and the assert will still be triggered.

@KerfuffleV2 (Collaborator):

> so the unforeseen situation will still arise and the assert will still be triggered.

My mistake. I was thinking it was a model that just had particularly large values in the weights.

jart added a commit to Mozilla-Ocho/llamafile that referenced this issue Apr 1, 2024
This assertion fails when quantizing Mixtral 8x7b as Q5_K_M, because I
used `convert.py --outtype f32` and the Mixtral weights use bf16 which
has a much larger exponent range than the K quantizer is expecting. If
--outtype f16 is used then the assert doesn't fail.

See ggerganov/llama.cpp#2982
cc: @JohannesGaessler
jart added a commit to jart/llama.cpp that referenced this issue Apr 25, 2024
This assertion fails when quantizing Mixtral 8x7b as Q5_K_M, because I
used `convert.py --outtype f32` and the Mixtral weights use bf16 which
has a much larger exponent range than the K quantizer is expecting. If
--outtype f16 is used then the assert doesn't fail.

See ggerganov#2982
3 participants