Replies: 1 comment
-
Hi, I think you could mix features from frame-wise extractors such as resnet50 and CLIP. Also, note that RAFT extracts a full-resolution frame with optical flow directions. I will convert it to a discussion as it is not an issue. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
thanks for your great code!
In my recently works,i have to mix the features from different network ,but the out features' sizes were not match.
I want to mix the feature from the resnet50 and RAFT(or I3D )
I don't know how to deal with that,could some one help me?😥
Beta Was this translation helpful? Give feedback.
All reactions