Computer Science > Computer Vision and Pattern Recognition

arXiv:2209.06970 (cs)

[Submitted on 14 Sep 2022 (v1), last revised 17 Oct 2022 (this version, v2)]

Title:Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

Authors:Chen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando De la Torre

View PDF

Abstract:Generative models (e.g., GANs, diffusion models) learn the underlying data distribution in an unsupervised manner. However, many applications of interest require sampling from a particular region of the output space or sampling evenly over a range of characteristics. For efficient sampling in these scenarios, we propose Generative Visual Prompt (PromptGen), a framework for distributional control over pre-trained generative models by incorporating knowledge of other off-the-shelf models. PromptGen defines control as energy-based models (EBMs) and samples images in a feed-forward manner by approximating the EBM with invertible neural networks, avoiding optimization at inference. Our experiments demonstrate how PromptGen can efficiently sample from several unconditional generative models (e.g., StyleGAN2, StyleNeRF, diffusion autoencoder, NVAE) in a controlled or/and de-biased manner using various off-the-shelf models: (1) with the CLIP model as control, PromptGen can sample images guided by text, (2) with image classifiers as control, PromptGen can de-bias generative models across a set of attributes or attribute combinations, and (3) with inverse graphics models as control, PromptGen can sample images of the same identity in different poses. (4) Finally, PromptGen reveals that the CLIP model shows a "reporting bias" when used as control, and PromptGen can further de-bias this controlled distribution in an iterative manner. The code is available at this https URL.

Comments:	NeurIPS 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:2209.06970 [cs.CV]
	(or arXiv:2209.06970v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2209.06970

Submission history

From: Chen Henry Wu [view email]
[v1] Wed, 14 Sep 2022 22:55:18 UTC (48,580 KB)
[v2] Mon, 17 Oct 2022 16:53:10 UTC (48,422 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators