loading pretrained model #37

Closed
jawaechan opened this issue Aug 25, 2021 · 9 comments

@jawaechan
Hi,
I downloaded your trained model BiDet-SSD300-VOC_66.0.pth and want to load it with the network in bidet.ssd.py, but I get a size-mismatch error:

RuntimeError: Error(s) in loading state_dict for BiDetSSD:
size mismatch for conf.0.weight: copying a param with shape torch.Size([84, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([324, 512, 3, 3]).
size mismatch for conf.0.bias: copying a param with shape torch.Size([84]) from checkpoint, the shape in current model is torch.Size([324]).
size mismatch for conf.1.weight: copying a param with shape torch.Size([126, 1024, 3, 3]) from checkpoint, the shape in current model is torch.Size([486, 1024, 3, 3]).
size mismatch for conf.1.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([486]).
size mismatch for conf.2.weight: copying a param with shape torch.Size([126, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([486, 512, 3, 3]).
size mismatch for conf.2.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([486]).
size mismatch for conf.3.weight: copying a param with shape torch.Size([126, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([486, 256, 3, 3]).
size mismatch for conf.3.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([486]).
size mismatch for conf.4.weight: copying a param with shape torch.Size([84, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([324, 256, 3, 3]).
size mismatch for conf.4.bias: copying a param with shape torch.Size([84]) from checkpoint, the shape in current model is torch.Size([324]).
size mismatch for conf.5.weight: copying a param with shape torch.Size([84, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([324, 256, 3, 3]).
size mismatch for conf.5.bias: copying a param with shape torch.Size([84]) from checkpoint, the shape in current model is torch.Size([324]).
Can you advise how to fix it?

Thanks

@samagalhaes commented Aug 25, 2021

Hi,
I am using it now and it looks fine. How are you loading the weights? Are you using the internal function?

# assumes build_bidet_ssd has been imported from the repo's SSD module
ssd = build_bidet_ssd('test', 300, 21, nms_conf_thre=0.03)

model_path = build_dir + "/pretrain/BiDet-SSD300-VOC_66.0.pth"
ssd.load_weights(model_path)

@jawaechan (Author)

Hi,
I use the following code:

model = build_bidet_ssd('train', 300, 80, nms_conf_thre=0.03)
vgg_weights = torch.load('./pretrained/BiDet-SSD300-VOC_66.0.pth')
model.load_state_dict(vgg_weights, strict=True)

Thanks

@samagalhaes

Hi,

That should also work. I think you need to set strict to False. In any case, the best option is to use the load function.
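
For reference, a minimal sketch of both loading paths, assuming build_bidet_ssd and the model's load_weights helper are imported from the repo's SSD module. Note that in PyTorch, load_state_dict(strict=False) only tolerates missing or unexpected keys; it still raises on size-mismatched tensors, so the model must be built with the checkpoint's class count (21 for this VOC model):

import torch

# build with 21 classes (20 VOC classes + background) to match the checkpoint
model = build_bidet_ssd('train', 300, 21, nms_conf_thre=0.03)

# option 1: the repo's own loader
model.load_weights('./pretrained/BiDet-SSD300-VOC_66.0.pth')

# option 2: plain PyTorch; strict=False ignores extra/missing keys only
weights = torch.load('./pretrained/BiDet-SSD300-VOC_66.0.pth', map_location='cpu')
model.load_state_dict(weights, strict=False)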

@jawaechan (Author)

I tried your code. When the parameter num_classes is set to 21 there is no problem, but I get the error when it is set to 80.

Thanks

@samagalhaes

Mind that the pre-trained SSD was trained on the PASCAL VOC dataset, which has only 21 classes. To use the COCO dataset, you need to train your own network.
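
A minimal sketch of one common way to reuse the VOC checkpoint as an initialization for a COCO model (an assumption on my part, not the repo's documented workflow): copy only the tensors whose name and shape match the new model, leave the class-dependent conf heads freshly initialized, then train on COCO.

import torch

# 81 assumes COCO's 80 classes plus background, matching the 324 = 4 * 81
# conf-head widths in the error above; adjust to the repo's convention
model = build_bidet_ssd('train', 300, 81, nms_conf_thre=0.03)

ckpt = torch.load('./pretrained/BiDet-SSD300-VOC_66.0.pth', map_location='cpu')
state = model.state_dict()
# keep only checkpoint tensors whose name and shape both match the new model
state.update({k: v for k, v in ckpt.items()
              if k in state and v.shape == state[k].shape})
model.load_state_dict(state)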

@Wuziyi616 (Contributor)

@samagalhaes Thanks for your kind reply. And yes @Jawae, the pre-trained model is for VOC, which has 21 classes; it can't be used for COCO, which has 81 classes.
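
To see where the mismatched shapes come from: an SSD confidence head outputs num_anchors * num_classes channels per location, and SSD300 uses [4, 6, 6, 6, 4, 4] anchors across its six detection feature maps (a sketch based on the standard SSD300 configuration):

for n in [4, 6, 6, 6, 4, 4]:   # anchors per location at each feature map
    print(n * 21, n * 81)      # VOC vs. COCO conf-head output channels
# prints 84 324, 126 486, ... exactly the checkpoint vs. model shapes in the error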

@jawaechan (Author)

One final question: what is the loss value when the training converges?

@Wuziyi616 (Contributor)

Sorry for my late reply. If I remember correctly, on the VOC dataset the loc_loss + cls_loss is around 5, with roughly 1.6 and 3.2 for the two terms (sorry, I can't remember exactly which value belongs to which loss term).

@Wuziyi616 (Contributor)

But the loss value doesn't actually determine the final mAP; I will refer you to this issue.
