
Introduction of gemm4xN and gemmMx4 for Q4_0 and Q8_0 for better performance results #8908

Merged
ggerganov merged 1 commit into ggerganov:master from Srihari-mcw:q8_0_q4_0_fp16_delta_multiply_parallel on Aug 31, 2024