From 2b5a885689b8ab5fd444cce50b89c51cc079322a Mon Sep 17 00:00:00 2001 From: Adam Stachowicz Date: Mon, 25 Nov 2024 15:47:49 +0200 Subject: [PATCH] [SW-196761] Add G3 in T5-L README --- examples/summarization/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/examples/summarization/README.md b/examples/summarization/README.md index e1fc98c4dd..46173c63f6 100644 --- a/examples/summarization/README.md +++ b/examples/summarization/README.md @@ -179,7 +179,7 @@ python ../gaudi_spawn.py \ ## Using DeepSpeed -Here is an example on 8 HPUs on Gaudi2 with DeepSpeed-ZeRO3 to fine-tune [FLAN-T5 XXL](https://huggingface.co/google/flan-t5-xxl): +Here is an example on 8 HPUs on Gaudi2/Gaudi3 with DeepSpeed-ZeRO3 to fine-tune [FLAN-T5 XXL](https://huggingface.co/google/flan-t5-xxl): ```bash PT_HPU_MAX_COMPOUND_OP_SIZE=512 python ../gaudi_spawn.py \ --world_size 8 --use_deepspeed run_summarization.py \