RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! in ch05/01_main-chapter-code/gpt_generate.py #435

xfpg21421 · 2024-11-14T08:31:12Z

Bug description

gpt = GPTModel(gpt_config)
load_weights_into_gpt(gpt, params)
gpt.to(device)
gpt.eval()

tokenizer = tiktoken.get_encoding("gpt2")
torch.manual_seed(123)

token_ids = generate(
    model=gpt,
    idx=text_to_token_ids(input_prompt, tokenizer),
    max_new_tokens=25,
    context_size=gpt_config["context_length"],
    top_k=50,
    temperature=1.0
)

print("Output text:\n", token_ids_to_text(token_ids, tokenizer))

When the code above runs on a GPU, the following error is raised:
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper_CUDA__index_select)

The gpt model has been moved to the GPU, but the idx tensor returned by text_to_token_ids(input_prompt, tokenizer) is still on the CPU. The call should be changed to text_to_token_ids(input_prompt, tokenizer).to(device)
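For illustration, here is a minimal sketch of the same pattern outside the book's code. It assumes a toy nn.Embedding layer as a stand-in for GPTModel (toy_model and the token IDs are hypothetical, not taken from gpt_generate.py), since an embedding lookup is the kind of index operation that raises this error when the weights and the indices live on different devices. It falls back to the CPU when no GPU is available.

```python
import torch
import torch.nn as nn

# Use the GPU if one is available, otherwise fall back to the CPU
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

torch.manual_seed(123)

# Hypothetical stand-in for GPTModel: a single embedding layer
toy_model = nn.Embedding(num_embeddings=100, embedding_dim=8)
toy_model.to(device)
toy_model.eval()

# Tensors are created on the CPU by default, as with text_to_token_ids(...)
idx = torch.tensor([[1, 5, 42]])

# The fix: move the input to the device that holds the model weights
idx = idx.to(device)

with torch.no_grad():
    out = toy_model(idx)

print(out.shape)                 # torch.Size([1, 3, 8])
print(out.device == idx.device)  # True
```

Without the idx.to(device) line, the lookup would fail on a CUDA machine with the device-mismatch error quoted above, and it would still run without error on a CPU-only machine, which is probably why the missing transfer is easy to overlook.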

What operating system are you using?

Linux

Where do you run your code?

Other cloud environment (AWS, Azure, GCP)

Environment


rasbt · 2024-11-14T10:03:41Z

Good catch, thanks for reporting this. Looks like it was missing a .to(device) for the prompt.

xfpg21421 added the "bug" label Nov 14, 2024

xfpg21421 assigned rasbt Nov 14, 2024

rasbt mentioned this issue Nov 14, 2024

Add missing device transfer in optional gpt_generate.py code #436

Merged

rasbt closed this as completed in #436 Nov 14, 2024
