Highlights:
1 set torch compile to false by default in #447
2 Fix packing hang and force to fp16 at exporting in #430
3 align auto_quantizer with Transformers 4.49 in #437
What's Changed
- Fix packing hang, torch compile and force to fp16 at exporting by @wenhuach21 in #430
- fix nblocks issues by @wenhuach21 in #432
- rm gc collect in packing by @wenhuach21 in #438
- align auto_quantizer with main branch in Transformers by @WeiweiZhang1 in #437
- [HPU]Fix compile bug when quant layer by @yiliu30 in #441
- remove tricky setting in mxfp4 by @wenhuach21 in #445
- fix bug of evaluate user model by @n1ck-guo in #444
- Refine funcs by @WeiweiZhang1 in #446
- set torch compile to false by default by @WeiweiZhang1 in #447
Full Changelog: v0.4.5...v0.4.6