Feature request
Hi, thanks for the library! It would be great if the optimizers could also run on the CPU. For example, I would like to use adamw_8bit to full-finetune an 8B model on a single 24GB GPU (RTX 4090). With DeepSpeed offload the GPU memory is fine, but the CPU memory requirement is still very large, partly because the offloaded optimizer is a standard fp32 AdamW, which needs 8 × 8 = 64 GB for the optimizer states alone.
This package provides the very helpful adamw_8bit, so I would appreciate it if it could be used in the setting above, hopefully shrinking the optimizer state from 64 GB to 8 × 2 = 16 GB.
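As a rough sketch of the arithmetic above (my own back-of-the-envelope numbers, assuming two Adam moment tensors per parameter and ignoring quantization metadata such as blockwise scaling factors):

```python
# Back-of-the-envelope optimizer-state memory for an 8B-parameter model.
# Assumes AdamW keeps two moment tensors (m and v) per parameter.
params = 8e9

fp32_state_bytes = params * 2 * 4  # two fp32 states: 8 bytes per parameter
int8_state_bytes = params * 2 * 1  # two int8 states: 2 bytes per parameter

print(f"fp32 AdamW states:  {fp32_state_bytes / 1e9:.0f} GB")  # ~64 GB
print(f"8-bit AdamW states: {int8_state_bytes / 1e9:.0f} GB")  # ~16 GB
```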
Motivation
(see above)
Your contribution
Yes
See #1021. I proposed that this should be a step on the path to implementing cross-platform support (especially Apple Silicon, since CUDA and Apple Silicon can't run on the same hardware, which makes validation complicated).