You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for your work. I find the pretrained weight you converted from jax to pytorch has some incompatible key in backbone.
You deifined vit_small_resnet50d_s3_224 here. Howerver, the resnet50d has so many incompatible key that I cant load pretrained checkpoint of jx_vit_base_resnet50_224_in21k-6f7c7740.pth.
Further, I also try resnetv2 your provided in timm.models.resnetv2, the layer name is close to pretrained weight, after I change their key name to be consistent, it notices that their weights has different shape.
`size mismatch for patch_embed.backbone.stages_0.blocks.1.norm1.bias: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([256]).
size mismatch for patch_embed.backbone.stages_0.blocks.1.norm3.weight: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([64]).
size mismatch for patch_embed.backbone.stages_0.blocks.1.norm3.bias: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([64]).
size mismatch for patch_embed.backbone.stages_0.blocks.2.norm1.weight: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([256]).`
Can your provide some advice or example for how to load the hybrid model's pretrained weight, especially backbone?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Thanks for your work. I find the pretrained weight you converted from jax to pytorch has some incompatible key in backbone.
You deifined vit_small_resnet50d_s3_224 here. Howerver, the resnet50d has so many incompatible key that I cant load pretrained checkpoint of jx_vit_base_resnet50_224_in21k-6f7c7740.pth.
Further, I also try resnetv2 your provided in timm.models.resnetv2, the layer name is close to pretrained weight, after I change their key name to be consistent, it notices that their weights has different shape.
Can your provide some advice or example for how to load the hybrid model's pretrained weight, especially backbone?
Beta Was this translation helpful? Give feedback.
All reactions