Details about prediction on KITTI dataset #65

Open
gxytcrc opened this issue Jul 9, 2024 · 1 comment

Comments

@gxytcrc

gxytcrc commented Jul 9, 2024

Hello! Thank you for your excellent work. I have a question regarding prediction on KITTI. Should the input image be resized to [420, 560] when evaluating v2? I have already tried resizing, but the predicted intrinsics deviate significantly from the intrinsics provided by KITTI. Is this normal? Additionally, what other preprocessing steps should be performed before feeding the image into the model?

@lpiccinelli-eth
Owner

The predicted intrinsics correspond to the size of the image that is actually fed to the network. The infer method takes care of rescaling the predicted intrinsics to match the original input shape. If you are using your own forward method, you have to do this rescaling yourself.
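For anyone calling the forward pass directly, here is a minimal sketch of that rescaling. It is not the repository's code: `rescale_intrinsics` is a hypothetical helper, it assumes a standard 3x3 pinhole matrix with no skew, and it ignores half-pixel offset conventions.

```python
import numpy as np

def rescale_intrinsics(K, from_hw, to_hw):
    """Rescale a 3x3 pinhole intrinsics matrix from one image size to another.

    K        : 3x3 matrix [[fx, 0, cx], [0, fy, cy], [0, 0, 1]]
    from_hw  : (height, width) the intrinsics currently refer to
    to_hw    : (height, width) the intrinsics should refer to
    """
    sy = to_hw[0] / from_hw[0]  # vertical scale factor
    sx = to_hw[1] / from_hw[1]  # horizontal scale factor
    K = np.asarray(K, dtype=np.float64).copy()
    K[0, 0] *= sx  # fx
    K[0, 2] *= sx  # cx
    K[1, 1] *= sy  # fy
    K[1, 2] *= sy  # cy
    return K

# Illustrative values only: intrinsics predicted on a 420x560 input,
# mapped back to a 375x1242 original image.
K_net = np.array([[400.0, 0.0, 280.0],
                  [0.0, 400.0, 210.0],
                  [0.0, 0.0, 1.0]])
K_orig = rescale_intrinsics(K_net, from_hw=(420, 560), to_hw=(375, 1242))
```

Comparing the network output with the KITTI calibration only makes sense after this step, since both matrices must refer to the same resolution.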

V2 does not require a fixed image shape. The infer method takes care of that too: it keeps the input image's aspect ratio and resizes the image to fit the maximum dimension seen during training.
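The resize described above can be sketched roughly as follows. `fit_to_max_dim` and the `max_dim` value are assumptions for illustration; the actual infer method may apply further constraints that are not described in this thread.

```python
def fit_to_max_dim(height, width, max_dim):
    """Return a new (height, width) with the same aspect ratio whose
    longer side equals max_dim (a hypothetical training-time limit)."""
    scale = max_dim / max(height, width)
    new_h = max(1, round(height * scale))
    new_w = max(1, round(width * scale))
    return new_h, new_w

# Illustrative call: a wide image scaled so its width becomes 600 pixels.
target_hw = fit_to_max_dim(400, 1200, max_dim=600)
```

Because both sides are scaled by the same factor, the aspect ratio of the input is preserved, which is why a very wide image stays very wide after this step.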

KITTI image shapes are outside the training domain (they are very elongated), so the results may be somewhat off. It would be better to pad the image so that its aspect ratio falls within the range seen during training.
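One possible way to do such padding is sketched below. This is an assumption-laden example rather than the project's method: `pad_to_ratio` is a hypothetical helper, `max_ratio` is a placeholder because the training ratio range is not given in this thread, and only wide images (width greater than height) are handled. Padding the top shifts the principal point, so `cy` is adjusted accordingly.

```python
import numpy as np

def pad_to_ratio(image, max_ratio, K=None):
    """Zero-pad top and bottom so that width / height <= max_ratio.

    image     : array of shape (H, W) or (H, W, C)
    max_ratio : largest allowed width/height ratio (hypothetical value)
    K         : optional 3x3 intrinsics of the unpadded image
    Returns the padded image, the adjusted intrinsics (or None),
    and the (top, bottom) padding in pixels.
    """
    h, w = image.shape[:2]
    target_h = max(h, int(np.ceil(w / max_ratio)))
    pad_total = target_h - h
    top = pad_total // 2
    bottom = pad_total - top
    pad_width = [(top, bottom), (0, 0)] + [(0, 0)] * (image.ndim - 2)
    padded = np.pad(image, pad_width, mode="constant")
    if K is not None:
        K = np.asarray(K, dtype=np.float64).copy()
        K[1, 2] += top  # principal point moves down by the top padding
    return padded, K, (top, bottom)

# Illustrative call on a KITTI-like 375x1242 image with placeholder intrinsics.
img = np.zeros((375, 1242, 3), dtype=np.uint8)
K = np.array([[720.0, 0.0, 621.0], [0.0, 720.0, 187.5], [0.0, 0.0, 1.0]])
padded, K_pad, (top, bottom) = pad_to_ratio(img, max_ratio=2.0, K=K)
```

After inference on the padded image, the prediction can be cropped back with `pred[top:top + 375]`, and `top` subtracted from the predicted `cy`, to return to the original frame.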
