[Broken] Generation with Sequential Model #854
Comments
I don't have the bandwidth to handle this for now. Would appreciate it if someone could take a look.
Will take a look!
Is this error solved now? I encountered the same problem when training and running inference with a 125M GPT-2 model according to the guidelines.
If you encountered the same problem, it's safe to say that it's not solved.
Same problem here. Could you please help check it?
Same here. Would temporarily commenting the line out affect anything?
Sorry all. This happened due to a line that snuck into the neox 2.0 release. To clarify for all, there are three pipeline parallelism cases in gpt-neox:

- `pipe_parallel_size == 0`: the model is wrapped in a standard `nn.Sequential` module (Line 416 in c64bacc). This is done to reduce memory overhead and latency. This case is rarely used, and is where the issue you're seeing above lies.
- `pipe_parallel_size == 1`: the most common case. The model is wrapped in a single `GPT2ModelPipe` module, which makes it easier for both DeepSpeed and us to handle. This should be the default case, which we just resolved in #866.
- `pipe_parallel_size > 1`: for large models that require multiple pipeline stages to distribute the model. This case remains unchanged.

What you should do:

- If you trained a model and need those weights to stay in a sequential module, we're working on a fix and will share it shortly.
- If you don't need those weights and instead simply need to run inference on a public model, apply the patch in #866 and run again.
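To make the three cases concrete, here is a minimal sketch of the dispatch. `PipelineStub`, `build_model`, and the layer handling are illustrative stand-ins, not the actual gpt-neox code; in the real repo the pipeline cases use `GPT2ModelPipe` (a DeepSpeed `PipelineModule` subclass) built from layer specs:

```python
import torch.nn as nn

class PipelineStub(nn.Module):
    """Hypothetical stand-in for GPT2ModelPipe; here it just
    runs the layers in order on a single device."""

    def __init__(self, layers, num_stages):
        super().__init__()
        self.num_stages = num_stages
        self.layers = nn.ModuleList(layers)

    def forward(self, x):
        for layer in self.layers:
            x = layer(x)
        return x

def build_model(layers, pipe_parallel_size):
    """Dispatch over the three pipeline-parallelism cases above."""
    if pipe_parallel_size == 0:
        # Case 1: plain nn.Sequential -- lowest overhead, rarely used,
        # and the path where the missing clear_cache method lived.
        return nn.Sequential(*layers)
    # Case 2 (== 1): a single pipeline stage, the common default.
    # Case 3 (> 1): the model is split across pipe_parallel_size stages.
    return PipelineStub(layers, num_stages=pipe_parallel_size)

model = build_model([nn.Linear(8, 8), nn.ReLU(), nn.Linear(8, 2)],
                    pipe_parallel_size=1)
```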
@satpalsr @FourWinds021 @DaoD @yizhilll Have your issues been resolved by the recent patches?
Yes.
Steps to reproduce: take any config and set

```yaml
"pipe-parallel-size": 1
```

Run

```bash
python deepy.py generate.py configs/70M-deduped.yml -i input_prompt.txt -o prompt_out.txt
```

This brings the sequential model into action.
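Why this setting exercised the sequential path in the 2.0 release: per the maintainer comment above, a conversion line flattened the single-stage pipeline module. A hedged sketch of that logic (`maybe_flatten` is a hypothetical name and the wrapping is simplified; `to_sequential()` is the DeepSpeed `PipelineModule` method that returns a plain `nn.Sequential`):

```python
def maybe_flatten(model, is_pipe_parallel):
    """Sketch of the conversion that snuck into the 2.0 release
    (simplified; the real logic lives in gpt-neox's training setup)."""
    if not is_pipe_parallel:
        # DeepSpeed's PipelineModule.to_sequential() yields a plain
        # nn.Sequential, which (as the traceback suggests) ends up
        # behind the SequentialWrapper that lacked clear_cache().
        model = model.to_sequential()
    return model
```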
Error:

```
'SequentialWrapper' object has no attribute 'clear_cache'
```
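For anyone reading along before the official fix lands, the shape of the fix would be to give the wrapper the cache-reset method that `generate.py` calls on the model. A minimal sketch, assuming the wrapper holds the layers and that stateful layers expose their own `clear_cache`; everything besides the `SequentialWrapper` and `clear_cache` names (which come from the traceback) is illustrative:

```python
import torch.nn as nn

class SequentialWrapper(nn.Module):
    """Illustrative stand-in for gpt-neox's SequentialWrapper."""

    def __init__(self, layers):
        super().__init__()
        self.sequential = nn.Sequential(*layers)

    def forward(self, x):
        return self.sequential(x)

    def clear_cache(self):
        # Hypothetical fix: forward the cache reset to any layer that
        # keeps inference state, instead of raising AttributeError.
        for layer in self.sequential:
            if hasattr(layer, "clear_cache"):
                layer.clear_cache()
```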