PyTorch: check point support #401

tensorbuffer · 2019-12-24T01:26:03Z

Hello, when I try to use the web version to load pytorch checkpoint file, I got this error: Error loading PyTorch model. File does not contain root module or state dictionary in 'checkpoint.pth'.
The checkpoint is saved as:
checkpoint = {
'model': model_without_ddp.state_dict(),
'optimizer': optimizer.state_dict(),
'lr_scheduler': lr_scheduler.state_dict(),
'epoch': epoch,
'args': args}
torch.save(
checkpoint, path_to_file)

This is pytorch standard way of saving checkpoint:
https://pytorch.org/tutorials/beginner/saving_loading_models.html#saving-loading-a-general-checkpoint-for-inference-and-or-resuming-training

lutzroeder · 2019-12-25T06:12:36Z

Can you share the model file?

tensorbuffer · 2019-12-25T18:08:08Z

https://drive.google.com/file/d/1eIyTCGctlInencZBUwvBokPQikGz03Oq/view?usp=sharing

lutzroeder · 2019-12-25T21:40:01Z

features.1.skip_add.observer.scale tensor is null.

tensorbuffer · 2020-01-03T07:20:59Z

great, thanks! I can load the checkpoint file.
However all that's shown are 'Module' blocks, there's no connection between them. Is this the right visualization?

abhigoku10 · 2020-01-04T10:23:45Z

@tensorbuffer @lutzroeder @demid5111 @scottcjt i am also facing the same issues not able to visualize any connection between the modules , any idea how to obtain the connections ??

tensorbuffer · 2020-01-06T18:13:54Z

I think this is because pytorch graph is dynamic, the graph is generated on the fly, so you have to feed an input and execute the model to get the graph.

simphide · 2021-07-18T19:47:17Z

Hi, I also have this problem with the models of this project: https://github.com/BloodAxe/Kaggle-2020-Alaska2/releases

Does anybody have an idea how I can see the connections etc.? Thank you!

lutzroeder · 2021-07-18T20:36:32Z

@simphide the files include only training weights. See #720.

lutzroeder added the no repro label Dec 24, 2019

lutzroeder closed this as completed Dec 24, 2019

lutzroeder added a commit that referenced this issue Dec 25, 2019

Fix PyTorch null tensor (#401)

29d184a

Repository owner deleted a comment from tensorbuffer Dec 25, 2019

lutzroeder changed the title ~~pytorch check point support~~ PyTorch: check point support Aug 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PyTorch: check point support #401

PyTorch: check point support #401

tensorbuffer commented Dec 24, 2019

lutzroeder commented Dec 25, 2019 •

edited

Loading

tensorbuffer commented Dec 25, 2019

lutzroeder commented Dec 25, 2019 •

edited

Loading

tensorbuffer commented Jan 3, 2020

abhigoku10 commented Jan 4, 2020

tensorbuffer commented Jan 6, 2020

simphide commented Jul 18, 2021

lutzroeder commented Jul 18, 2021 •

edited

Loading

PyTorch: check point support #401

PyTorch: check point support #401

Comments

tensorbuffer commented Dec 24, 2019

lutzroeder commented Dec 25, 2019 • edited Loading

tensorbuffer commented Dec 25, 2019

lutzroeder commented Dec 25, 2019 • edited Loading

tensorbuffer commented Jan 3, 2020

abhigoku10 commented Jan 4, 2020

tensorbuffer commented Jan 6, 2020

simphide commented Jul 18, 2021

lutzroeder commented Jul 18, 2021 • edited Loading

lutzroeder commented Dec 25, 2019 •

edited

Loading

lutzroeder commented Dec 25, 2019 •

edited

Loading

lutzroeder commented Jul 18, 2021 •

edited

Loading