Computer Science > Computer Vision and Pattern Recognition

arXiv:2405.01353v1 (cs)

[Submitted on 2 May 2024]

Title: Sparse multi-view hand-object reconstruction for unseen environments

Authors: Yik Lung Pang, Changjae Oh, Andrea Cavallaro

Abstract: Recent works in hand-object reconstruction mainly focus on the single-view and dense multi-view settings. On the one hand, single-view methods can leverage learned shape priors to generalise to unseen objects but are prone to inaccuracies due to occlusions. On the other hand, dense multi-view methods are very accurate but cannot easily adapt to unseen objects without further data collection. In contrast, sparse multi-view methods can take advantage of the additional views to tackle occlusion, while keeping the computational cost low compared to dense multi-view methods. In this paper, we consider the problem of hand-object reconstruction with unseen objects in the sparse multi-view setting. Given multiple RGB images of the hand and object captured at the same time, our model SVHO combines the predictions from each view into a unified reconstruction without optimisation across views. We train our model on a synthetic hand-object dataset and evaluate directly on a real-world recorded hand-object dataset with unseen objects. We show that while reconstruction of unseen hands and objects from RGB is challenging, additional views can help improve the reconstruction quality.

Comments:	Camera-ready version. Paper accepted to CVPRW 2024. 8 pages, 7 figures, 1 table
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.01353 [cs.CV]
	(or arXiv:2405.01353v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2405.01353

Submission history

From: Yik Lung Pang
[v1] Thu, 2 May 2024 15:01:25 UTC (6,375 KB)
