Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.06909 (cs)

[Submitted on 13 Dec 2022 (v1), last revised 12 Apr 2023 (this version, v2)]

Title: Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting

Authors: Su Wang, Chitwan Saharia, Ceslee Montgomery, Jordi Pont-Tuset, Shai Noy, Stefano Pellegrini, Yasumasa Onoe, Sarah Laszlo, David J. Fleet, Radu Soricut, Jason Baldridge, Mohammad Norouzi, Peter Anderson, William Chan

Abstract: Text-guided image editing can have a transformative impact in supporting creative applications. A key challenge is to generate edits that are faithful to the input text prompts while remaining consistent with the input images. We present Imagen Editor, a cascaded diffusion model built by fine-tuning Imagen on text-guided image inpainting. Imagen Editor's edits are faithful to the text prompts, which is accomplished by using object detectors to propose inpainting masks during training. In addition, Imagen Editor captures fine details in the input image by conditioning the cascaded pipeline on the original high-resolution image. To improve qualitative and quantitative evaluation, we introduce EditBench, a systematic benchmark for text-guided image inpainting. EditBench evaluates inpainting edits on natural and generated images, exploring objects, attributes, and scenes. Through extensive human evaluation on EditBench, we find that object masking during training leads to across-the-board improvements in text-image alignment (such that Imagen Editor is preferred over DALL-E 2 and Stable Diffusion). We also find that, as a cohort, these models are better at object rendering than text rendering, and handle material/color/size attributes better than count/shape attributes.
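
The object-masking idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `(x0, y0, x1, y1)` box format, and the random-box fallback when nothing is detected are all assumptions made for illustration, and the detector itself is left out (its output is passed in as a plain list of boxes).

```python
import random


def box_to_mask(height, width, box):
    """Rasterize a box (x0, y0, x1, y1), upper bounds exclusive, into a 0/1 mask."""
    x0, y0, x1, y1 = box
    # Clamp the box to the image bounds.
    x0, x1 = max(0, x0), min(width, x1)
    y0, y1 = max(0, y0), min(height, y1)
    return [
        [1 if (x0 <= x < x1 and y0 <= y < y1) else 0 for x in range(width)]
        for y in range(height)
    ]


def propose_mask(height, width, detections, rng):
    """Propose a training inpainting mask from detector boxes (hypothetical helper).

    Assumption: if the detector returns no boxes, fall back to a random box.
    """
    if detections:
        # Mask one detected object chosen at random.
        box = rng.choice(detections)
    else:
        # Fallback: a random non-empty box inside the image.
        x0 = rng.randrange(width)
        y0 = rng.randrange(height)
        x1 = rng.randrange(x0 + 1, width + 1)
        y1 = rng.randrange(y0 + 1, height + 1)
        box = (x0, y0, x1, y1)
    return box_to_mask(height, width, box)
```

For example, `propose_mask(4, 6, [(1, 1, 3, 3)], random.Random(0))` masks the 2x2 region covered by the single detected box. The intuition is that a mask covering a whole object pushes the model to rely on the text prompt rather than on surrounding pixels.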

Comments:	CVPR 2023 Camera Ready
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2212.06909 [cs.CV]
	(or arXiv:2212.06909v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.06909

Submission history

From: Su Wang
[v1] Tue, 13 Dec 2022 21:25:11 UTC (9,956 KB)
[v2] Wed, 12 Apr 2023 22:42:08 UTC (9,956 KB)
