Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image

Authors: Yu Zhao, Hao Fei, Xiangtai Li, Libo Qin, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang, Jianguo Wei

Abstract: In the visual spatial understanding (VSU) area, spatial image-to-text (SI2T) and spatial text-to-image (ST2I) are two fundamental tasks that appear in dual form. Existing methods for standalone SI2T or ST2I perform imperfectly in spatial understanding, due to the difficulty of 3D-wise spatial feature modeling. In this work, we consider modeling the SI2T and ST2I together under a dual learning framework. During the dual framework, we then propose to represent the 3D spatial scene features with a novel 3D scene graph (3DSG) representation that can be shared and beneficial to both tasks. Further, inspired by the intuition that the easier 3D$\to$image and 3D$\to$text processes also exist symmetrically in the ST2I and SI2T, respectively, we propose the Spatial Dual Discrete Diffusion (SD$^3$) framework, which utilizes the intermediate features of the 3D$\to$X processes to guide the hard X$\to$3D processes, such that the overall ST2I and SI2T will benefit each other. On the visual spatial understanding dataset VSD, our system outperforms the mainstream T2I and I2T methods significantly. Further in-depth analysis reveals how our dual learning strategy advances.

Submitted 20 October, 2024; originally announced October 2024.

arXiv:2410.14538 [pdf, other]

Nearly query-optimal classical shadow estimation of unitary channels

Authors: Zihao Li, Changhao Yi, You Zhou, Huangjun Zhu

Abstract: Classical shadow estimation (CSE) is a powerful tool for learning properties of quantum states and quantum processes. Here we consider the CSE task for quantum unitary channels. By querying an unknown unitary channel $\mathcal{U}$ multiple times in quantum experiments, the goal is to learn a classical description of $\mathcal{U}$ such that one can later use it to accurately predict many different linear properties of the channel, i.e., the expectation values of arbitrary observables measured on the output of $\mathcal{U}$ upon arbitrary input states. Based on collective measurements on multiple systems, we propose a query efficient protocol for this task, whose query complexity achieves a quadratic advantage over previous best approach for this problem, and almost saturates the information-theoretic lower bound. To enhance practicality, we also present a variant protocol using only single-copy measurements, which still offers better query performance than any previous protocols that do not use additional quantum memories. In addition to linear properties, our protocol can also be applied to simultaneously predict many non-linear properties such as out-of-time-ordered correlators. Given the importance of CSE, this work may represent a significant advance in the study of learning unitary channels.

Submitted 18 October, 2024; originally announced October 2024.

Comments: 13+23 pages, 3 figures, and 1+5 tables; comments and suggestions are welcome!

arXiv:2410.13688 [pdf, other]

Variational Quantum Framework for Nonlinear PDE Constrained Optimization Using Carleman Linearization

Authors: Abeynaya Gnanasekaran, Amit Surana, Hongyu Zhu

Abstract: We present a novel variational quantum framework for nonlinear partial differential equation (PDE) constrained optimization problems. The proposed work extends the recently introduced bi-level variational quantum PDE constrained optimization (BVQPCO) framework for linear PDE to a nonlinear setting by leveraging Carleman linearization (CL). The CL framework allows one to transform a system of polynomial ordinary differential equations (ODE), i.e., ODE with a polynomial vector field, into an infinite but linear system of ODE. For instance, such polynomial ODEs naturally arise when the PDE are semi-discretized in the spatial dimensions. By truncating the CL system to a finite order, one obtains a finite system of linear ODE to which the linear BVQPCO framework can be applied. In particular, the finite system of linear ODE is discretized in time and embedded as a system of linear equations. The variational quantum linear solver (VQLS) is used to solve the linear system for given optimization parameters and evaluate the design cost/objective function, and a classical black box optimizer is used to select the next set of parameter values based on this evaluated cost. We present detailed computational error and complexity analysis and prove that, under suitable assumptions, our proposed framework can provide potential advantage over classical techniques. We implement our framework using the PennyLane library and apply it to solve an inverse Burgers' problem. We also explore an alternative tensor product decomposition which exploits the sparsity/structure of the linear system arising from PDE discretization to facilitate the computation of VQLS cost functions.
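
The truncation step the abstract describes can be illustrated on a toy problem. The sketch below is not the authors' implementation and involves no quantum solver: it assumes a scalar quadratic ODE dx/dt = a*x + b*x^2 (chosen here only for illustration), lifts it with the monomials y_k = x^k, truncates the resulting linear system at a finite order, and solves it classically with NumPy. The function names `carleman_matrix`, `carleman_solve`, and `exact_solution` are hypothetical.

```python
import numpy as np


def carleman_matrix(a, b, order):
    """Truncated Carleman matrix for dx/dt = a*x + b*x**2 (toy example).

    With y_k = x**k, the chain rule gives dy_k/dt = k*a*y_k + k*b*y_{k+1}.
    Truncation at `order` drops the coupling to y_{order+1}.
    """
    A = np.zeros((order, order))
    for k in range(1, order + 1):
        A[k - 1, k - 1] = k * a
        if k < order:
            A[k - 1, k] = k * b
    return A


def carleman_solve(a, b, x0, t, order):
    """Approximate x(t) from the truncated linear system dy/dt = A y.

    Assumes a != 0, so the eigenvalues k*a are distinct and A is
    diagonalizable; the matrix exponential is applied via eigendecomposition.
    """
    A = carleman_matrix(a, b, order)
    y0 = np.array([x0 ** k for k in range(1, order + 1)], dtype=float)
    evals, evecs = np.linalg.eig(A)
    coeffs = np.linalg.solve(evecs, y0)        # y0 in the eigenbasis
    y_t = evecs @ (np.exp(evals * t) * coeffs)  # y(t) = exp(A t) y0
    return float(np.real(y_t[0]))               # first component approximates x(t)


def exact_solution(a, b, x0, t):
    """Closed-form solution of the same Bernoulli-type ODE, for comparison."""
    e = np.exp(a * t)
    return a * x0 * e / (a + b * x0 * (1.0 - e))


if __name__ == "__main__":
    a, b, x0, t = -1.0, -0.5, 0.5, 1.0
    for order in (1, 2, 4, 6):
        err = abs(carleman_solve(a, b, x0, t, order) - exact_solution(a, b, x0, t))
        print(f"order={order}: abs error = {err:.2e}")
```

For these parameters (a dissipative linear term and a weak nonlinearity) the error shrinks as the truncation order grows; that behavior is specific to this toy setting and is not a claim about the paper's error analysis.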