Computer Science > Computer Vision and Pattern Recognition

arXiv:2211.14950v1 (cs)

[Submitted on 27 Nov 2022 (this version), latest version 16 Apr 2024 (v2)]

Title: GRelPose: Generalizable End-to-End Relative Camera Pose Regression

Authors: Fadi Khatib, Yuval Margalit, Meirav Galun, Ronen Basri

Abstract: This paper proposes a generalizable, end-to-end deep learning-based method for relative pose regression between two images. Given two images of the same scene captured from different viewpoints, our algorithm predicts the relative rotation and translation between the two respective cameras. Despite recent progress in the field, current deep learning-based methods exhibit only limited generalization to scenes not seen in training. Our approach introduces a network architecture that extracts a grid of coarse features for each input image using the pre-trained LoFTR network. It then relates corresponding features in the two images, and finally uses a convolutional network to recover the relative rotation and translation between the respective cameras. Our experiments indicate that the proposed architecture can generalize to novel scenes, obtaining higher accuracy than existing deep learning-based methods in various settings and datasets, in particular with limited training data.
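
To make the regression target concrete, the sketch below computes a relative rotation and translation from two known camera poses. It is not the authors' code. It assumes the world-to-camera convention x_cam = R x_world + t, which the abstract does not specify, and all function names are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(a):
    """Transpose of a 3x3 matrix."""
    return [[a[j][i] for j in range(3)] for i in range(3)]

def matvec(a, v):
    """Product of a 3x3 matrix and a 3-vector."""
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]

def relative_pose(R1, t1, R2, t2):
    """Relative pose (R, t) such that x_cam2 = R x_cam1 + t.

    Assumes each camera maps world points as x_cam = R_i x_world + t_i.
    """
    R = matmul(R2, transpose(R1))           # R = R2 R1^T
    Rt1 = matvec(R, t1)
    t = [t2[i] - Rt1[i] for i in range(3)]  # t = t2 - R t1
    return R, t

def rotation_angle_deg(R):
    """Angle of a rotation matrix in degrees, from its trace."""
    trace = R[0][0] + R[1][1] + R[2][2]
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))

def rot_z(deg):
    """Rotation about the z-axis by `deg` degrees (illustrative helper)."""
    a = math.radians(deg)
    return [[math.cos(a), -math.sin(a), 0.0],
            [math.sin(a), math.cos(a), 0.0],
            [0.0, 0.0, 1.0]]

# Two illustrative cameras: yaw 10 and 40 degrees, the second one shifted.
R_rel, t_rel = relative_pose(rot_z(10), [0.0, 0.0, 0.0],
                             rot_z(40), [1.0, 0.0, 0.0])
angle = rotation_angle_deg(R_rel)
```

Under this convention, a pair (R, t) of this form is the kind of quantity a relative pose regressor is trained to predict; the angle of the residual rotation between a prediction and the ground truth is a common way to measure rotation error.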

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.14950 [cs.CV]
	(or arXiv:2211.14950v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.14950

Submission history

From: Fadi Khatib
[v1] Sun, 27 Nov 2022 22:01:47 UTC (9,549 KB)
[v2] Tue, 16 Apr 2024 12:40:41 UTC (8,170 KB)
