Visual Programming for Zero-Shot Open-Vocabulary 3D Visual Grounding.

Visual Programming for Zero-shot Open-Vocabulary 3D ... - arXiv

Nov 26, 2023 · We propose a novel visual programming approach for zero-shot open-vocabulary 3DVG, leveraging the capabilities of large language models (LLMs).

[CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary ...

github.com › CurryYuan

Zero-shot 3DVG identifies the location of target objects using programmatic representation generated by LLMs, i.e., target category, anchor category, and ...

Scholarly articles for Visual Programming for Zero-Shot Open-Vocabulary 3D Visual Grounding.

scholar.google.com › citations

Visual programming for zero-shot open-vocabulary 3d …
Yuan · Cited by 10

[PDF] Visual Programming for Zero-shot Open-Vocabulary 3D Visual ...

openaccess.thecvf.com › content › CVPR2024 › papers › Yuan_Visua...

3D Visual Grounding (3DVG) aims to localize specific objects within 3D scenes by using a series of textual descriptions. This has become a crucial component ...

Visual Programming for Zero-shot Open-Vocabulary 3D Visual ...

curryyuan.github.io › ...

We propose a novel visual programming approach for zero-shot open-vocabulary 3DVG, leveraging the capabilities of large language models (LLMs).

Visual Programming for Zero-shot Open-Vocabulary 3D ... - arXiv

arxiv.org › html

3D Visual Grounding (3DVG) aims at localizing 3D object based on textual descriptions. Conventional supervised methods for 3DVG often necessitate extensive ...

Visual Programming for Zero-Shot Open-Vocabulary ... - IEEE Xplore

ieeexplore.ieee.org › iel8

3D Visual Grounding (3DVG) aims to localize specific objects within 3D scenes by using a series of textual descriptions. This has become a crucial component ...

Visual Programming for Zero-Shot Open-Vocabulary 3D Visual ...

www.computer.org › csdl › proceedings-article › cvpr

We propose a novel visual programming approach for zero-shot open-vocabulary 3DVG, leveraging the capabilities of large language models (LLMs).

Visual Programming for Zero-shot Open-Vocabulary 3D Visual ...

openaccess.thecvf.com › content › CVPR2024 › supplemental › Yuan_Vis...

Answer: Based on the description, we are looking for a storage shelf that is white in color and is above a desk with a chair in front.

Visual Programming for Zero-Shot Open-Vocabulary 3D Visual ...

www.researchgate.net › ... › 3D Visualization

Sep 24, 2024 · An LLM is then used to reason which object satisfies the grounding relationship. ZS3DVG [3] follows a similar pipeline but requires the LLM to ...

Visual Programming for Zero-shot Open-Vocabulary ... - BibSonomy

www.bibsonomy.org › bibtex

Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding. Z. Yuan, J. Ren, C. Feng, H. Zhao, S. Cui, and Z. Li. CoRR, (2023).
