Release v0.4.6 · intel/auto-round

Highlights:

1 set torch compile to false by default in #447
2 Fix packing hang and force to fp16 at exporting in #430
3 align auto_quantizer with Transformers 4.49 in #437

What's Changed

Fix packing hang, torch compile and force to fp16 at exporting by @wenhuach21 in #430
fix nblocks issues by @wenhuach21 in #432
rm gc collect in packing by @wenhuach21 in #438
align auto_quantizer with main branch in Transformers by @WeiweiZhang1 in #437
[HPU]Fix compile bug when quant layer by @yiliu30 in #441
remove tricky setting in mxfp4 by @wenhuach21 in #445
fix bug of evaluate user model by @n1ck-guo in #444
Refine funcs by @WeiweiZhang1 in #446
set torch compile to false by default by @WeiweiZhang1 in #447

Full Changelog: v0.4.5...v0.4.6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.4.6

Highlights:

What's Changed

Contributors