subscribe to arXiv mailings

Cryogenic Control and Readout Integrated Circuits for Solid-State Quantum Computing

Authors: Lingxiao Lei, Heng Huang, Pingxing Chen, Mingtang Deng

Abstract: In the pursuit of quantum computing, solid-state quantum systems, particularly superconducting ones, have made remarkable advancements over the past two decades. However, achieving fault-tolerant quantum computing for next-generation applications necessitates the integration of several million qubits, which presents significant challenges in terms of interconnection complexity and latency that are… ▽ More In the pursuit of quantum computing, solid-state quantum systems, particularly superconducting ones, have made remarkable advancements over the past two decades. However, achieving fault-tolerant quantum computing for next-generation applications necessitates the integration of several million qubits, which presents significant challenges in terms of interconnection complexity and latency that are currently unsolvable with state-of-the-art room-temperature control and readout electronics. Recently, cryogenic integrated circuits (ICs), including CMOS radio-frequency ICs and rapid-single-flux-quantum-logic ICs, have emerged as potential alternatives to room-temperature electronics. Unlike their room-temperature counterparts, these ICs are deployed within cryostats to enhance scalability by reducing the number and length of transmission lines. Additionally, operating at cryogenic temperatures can suppress electronic noise and improve qubit control fidelity. However, for CMOS ICs specifically, circuit design uncertainties arise due to a lack of reliable models for cryogenic field effect transistors as well as issues related to severe fickle noises and power dissipation at cryogenic temperatures. This paper provides a comprehensive review of recent research on both types of cryogenic control and readout ICs but primarily focuses on the more mature CMOS technology. The discussion encompasses principles underlying control and readout techniques employed in cryogenic CMOS ICs along with their architectural designs; characterization and modeling approaches for field effect transistors under cryogenic conditions; as well as fundamental concepts pertaining to rapid single flux quantum circuits. △ Less

Submitted 21 October, 2024; originally announced October 2024.

arXiv:2410.11925 [pdf, other]

A Study of Decay Rate of Bound Negative Muons

Authors: Jian-Bo Deng, Miao-Yi Deng, Shi-Jie Ma, Rui-Bo Wang, Qi-Qi Fan, Peng-Zhang He, Yi-Peng He, Shuo-Wen Li, Xian-Ru Hu

Abstract: A number of experiments show that the decay lifetimes of muons bound to atomic nuclei are longer than the decay lifetimes of free muons. In this paper, a scheme of extending quantum mechanics (EQM) is proposed to resolve this problem. The Schr$\ddot{\text{o}}$dinger's equation is obtained to prove the validation of this attempt. The decay ratio of bound muons is also calculated in EQM, and the res… ▽ More A number of experiments show that the decay lifetimes of muons bound to atomic nuclei are longer than the decay lifetimes of free muons. In this paper, a scheme of extending quantum mechanics (EQM) is proposed to resolve this problem. The Schr$\ddot{\text{o}}$dinger's equation is obtained to prove the validation of this attempt. The decay ratio of bound muons is also calculated in EQM, and the result is in good agreement with the experimental data. △ Less

Submitted 15 October, 2024; originally announced October 2024.

Comments: 5 pages, 1 figure, 2 tables

arXiv:2410.06646 [pdf, other]

Exploration of Halo Substructures in IoM Space with \textit{Gaia} DR3

Authors: Haoyang Liu, Cuihua Du, Dashuang Ye, Jian Zhang, Mingji Deng

Abstract: Using kinematic data from the Gaia Data Release 3 catalog, along with metallicity estimates robustly derived from Gaia XP spectra, we have explored the Galactic stellar halo in search of both known and potentially new substructures. By applying the HDBSCAN clustering algorithm in IoM space (i.e. $E,L_{z}$ and $L_{\perp}$$ = \sqrt{L_{x}^2+L_{y}^2}$), we identified 5 previously known substructures:… ▽ More Using kinematic data from the Gaia Data Release 3 catalog, along with metallicity estimates robustly derived from Gaia XP spectra, we have explored the Galactic stellar halo in search of both known and potentially new substructures. By applying the HDBSCAN clustering algorithm in IoM space (i.e. $E,L_{z}$ and $L_{\perp}$$ = \sqrt{L_{x}^2+L_{y}^2}$), we identified 5 previously known substructures: Gaia-Sausage-Enceladus (GSE), Helmi Streams, I'itoi + Sequoia and Hot Thick Disc. We additionally found NGC 3201 and NGC 5139 in this work, and NGC 3201 shares similar distributions in phase space and metallicties to Arjuna, which possibly implies that they have the same origin. Three newly discovered substructures are Prograde Substructure 1 (PG1), Prograde Substructure 2 (PG2) and the Low Energy Group. PG1, with a higher $V_φ$ than typical GSE member stars, is considered as either a low eccentricity and metal-rich part of GSE or part of the metal-poor disc. PG2, sharing kinematic similarities with Aleph, is thought to be its relatively highly eccentric component or the mixture of Aleph and disc. The Low Energy Group, whose metal-poor component of metallicity distribution function has a mean value [M/H] $\sim$ $-$1.29 (compared to that of Heracles [M/H] $\sim$ $-$1.26), may have associations with Heracles. △ Less

Submitted 9 October, 2024; originally announced October 2024.

Comments: 14 Pages, 8 Figure, accepted for publication in ApJ

arXiv:2409.08459 [pdf, other]

Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design

Authors: Lingyao Li, Songhua Hu, Yinpei Dai, Min Deng, Parisa Momeni, Gabriel Laverghetta, Lizhou Fan, Zihui Ma, Xi Wang, Siyuan Ma, Jay Ligatti, Libby Hemphill

Abstract: As urban populations grow, the need for accessible urban design has become urgent. Traditional survey methods for assessing public perceptions of accessibility are often limited in scope. Crowdsourcing via online reviews offers a valuable alternative to understanding public perceptions, and advancements in large language models can facilitate their use. This study uses Google Maps reviews across t… ▽ More As urban populations grow, the need for accessible urban design has become urgent. Traditional survey methods for assessing public perceptions of accessibility are often limited in scope. Crowdsourcing via online reviews offers a valuable alternative to understanding public perceptions, and advancements in large language models can facilitate their use. This study uses Google Maps reviews across the United States and fine-tunes Llama 3 model with the Low-Rank Adaptation technique to analyze public sentiment on accessibility. At the POI level, most categories -- restaurants, retail, hotels, and healthcare -- show negative sentiments. Socio-spatial analysis reveals that areas with higher proportions of white residents and greater socioeconomic status report more positive sentiment, while areas with more elderly, highly-educated residents exhibit more negative sentiment. Interestingly, no clear link is found between the presence of disabilities and public sentiments. Overall, this study highlights the potential of crowdsourcing for identifying accessibility challenges and providing insights for urban planners. △ Less

Submitted 12 September, 2024; originally announced September 2024.

arXiv:2409.06201 [pdf, other]

doi 10.1145/3687996

An Eulerian Vortex Method on Flow Maps

Authors: Sinan Wang, Yitong Deng, Molin Deng, Hong-Xing Yu, Junwei Zhou, Duowen Chen, Taku Komura, Jiajun Wu, Bo Zhu

Abstract: We present an Eulerian vortex method based on the theory of flow maps to simulate the complex vortical motions of incompressible fluids. Central to our method is the novel incorporation of the flow-map transport equations for line elements, which, in combination with a bi-directional marching scheme for flow maps, enables the high-fidelity Eulerian advection of vorticity variables. The fundamental… ▽ More We present an Eulerian vortex method based on the theory of flow maps to simulate the complex vortical motions of incompressible fluids. Central to our method is the novel incorporation of the flow-map transport equations for line elements, which, in combination with a bi-directional marching scheme for flow maps, enables the high-fidelity Eulerian advection of vorticity variables. The fundamental motivation is that, compared to impulse $\mathbf{m}$, which has been recently bridged with flow maps to encouraging results, vorticity $\boldsymbolω$ promises to be preferable for its numerical stability and physical interpretability. To realize the full potential of this novel formulation, we develop a new Poisson solving scheme for vorticity-to-velocity reconstruction that is both efficient and able to accurately handle the coupling near solid boundaries. We demonstrate the efficacy of our approach with a range of vortex simulation examples, including leapfrog vortices, vortex collisions, cavity flow, and the formation of complex vortical structures due to solid-fluid interactions. △ Less

Submitted 14 September, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

Comments: Accepted at ACM Transactions on Graphics (SIGGRAPH Asia 2024)

arXiv:2409.05282 [pdf, other]

Improving Tree Probability Estimation with Stochastic Optimization and Variance Reduction

Authors: Tianyu Xie, Musu Yuan, Minghua Deng, Cheng Zhang

Abstract: Probability estimation of tree topologies is one of the fundamental tasks in phylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs) provide a powerful probabilistic graphical model for tree topology probability estimation by properly leveraging the hierarchical structure of phylogenetic trees. However, the expectation maximization (EM) method currently used for learning SB… ▽ More Probability estimation of tree topologies is one of the fundamental tasks in phylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs) provide a powerful probabilistic graphical model for tree topology probability estimation by properly leveraging the hierarchical structure of phylogenetic trees. However, the expectation maximization (EM) method currently used for learning SBN parameters does not scale up to large data sets. In this paper, we introduce several computationally efficient methods for training SBNs and show that variance reduction could be the key for better performance. Furthermore, we also introduce the variance reduction technique to improve the optimization of SBN parameters for variational Bayesian phylogenetic inference (VBPI). Extensive synthetic and real data experiments demonstrate that our methods outperform previous baseline methods on the tasks of tree topology probability estimation as well as Bayesian phylogenetic inference using SBNs. △ Less

Submitted 8 September, 2024; originally announced September 2024.

Comments: 23 pages, 6 figures, 7 tables

arXiv:2409.03264 [pdf, other]

A Potential Dynamical Origin of The Galactic Disk Warp: The Gaia-Sausage-Enceladus Major Merger

Authors: Mingji Deng, Cuihua Du, Yanbin Yang, Jiwei Liao, Dashuang Ye

Abstract: Previous studies have revealed that the Galactic warp is a long-lived, nonsteady, and asymmetric structure. There is a need for a model that accounts for the warp's long-term evolution. Given that this structure has persisted for over 5 Gyrs, its timeline may coincide with the completion of Gaia-Sausage-Enceladus (GSE) merger. Recent studies indicate that the GSE, the significant merger of our Gal… ▽ More Previous studies have revealed that the Galactic warp is a long-lived, nonsteady, and asymmetric structure. There is a need for a model that accounts for the warp's long-term evolution. Given that this structure has persisted for over 5 Gyrs, its timeline may coincide with the completion of Gaia-Sausage-Enceladus (GSE) merger. Recent studies indicate that the GSE, the significant merger of our Galaxy, was likely a gas-rich merger and the large amount of gas introduced could have created a profound impact on the Galactic morphology. This study utilizes GIZMO simulation code to construct a gas-rich GSE merger. By reconstructing the observed characteristics of the GSE, we successfully reproduce the disk warp and capture nearly all of its documented features that aligns closely with observational data from both stellar and gas disks. This simulation demonstrates the possibility that the single major merger could generate the Galactic warp amplitude and precession. Furthermore, the analysis of the warp's long-term evolution may offer more clues into the formation history of the Milky Way. △ Less

Submitted 5 September, 2024; originally announced September 2024.

Comments: 14 pages, 7 Figure, accepted for publication in ApJ

arXiv:2409.01075 [pdf, other]

Vortex: Efficient Sample-Free Dynamic Tensor Program Optimization via Hardware-aware Strategy Space Hierarchization

Authors: Yangjie Zhou, Honglin Zhu, Qian Qiu, Weihao Cui, Zihan Liu, Cong Guo, Siyuan Feng, Jintao Meng, Haidong Lan, Jingwen Leng, Wenxi Zhu, Minwen Deng

Abstract: Dynamic-shape deep neural networks (DNNs) are rapidly evolving, attracting attention for their ability to handle variable input sizes in real-time applications. However, existing compilation optimization methods for such networks often rely heavily on predefined samples to guide the compilation process, which restricts their adaptability and efficiency. These sample-driven methods struggle to effi… ▽ More Dynamic-shape deep neural networks (DNNs) are rapidly evolving, attracting attention for their ability to handle variable input sizes in real-time applications. However, existing compilation optimization methods for such networks often rely heavily on predefined samples to guide the compilation process, which restricts their adaptability and efficiency. These sample-driven methods struggle to efficiently manage the diverse and unpredictable shapes encountered in real-world scenarios, often resulting in suboptimal performance. To tackle these issues, we introduce Vortex, a hardware-driven and sample-free compiler tailored for dynamic-shape tensor programs. Vortex capitalizes on detailed hardware information and hierarchizes the strategy space to facilitate high-performance code generation without relying on runtime shape samples. It features a unique bidirectional compilation workflow, combining top-down abstraction for aligning tensor program execution with hardware hierarchies and bottom-up kernel construction to narrow the search space, enabling Vortex to achieve remarkable efficiency. Comprehensive evaluations confirm that Vortex reduces compilation time by $176\times$ compared to the existing dynamic-shape compiler. Additionally, it substantially outperforms existing vendor-provided libraries and dynamic-shape compilers on both CPU and GPU platforms, delivering speedups of $2.53\times$ and $3.01\times$, respectively. △ Less

Submitted 2 September, 2024; originally announced September 2024.

arXiv:2408.10556 [pdf, other]

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks

Authors: Yun Qu, Boyuan Wang, Jianzhun Shao, Yuhang Jiang, Chen Chen, Zhenbin Ye, Lin Liu, Junfeng Yang, Lin Lai, Hongyang Qin, Minwen Deng, Juchao Zhuo, Deheng Ye, Qiang Fu, Wei Yang, Guang Yang, Lanxiao Huang, Xiangyang Ji

Abstract: The advancement of Offline Reinforcement Learning (RL) and Offline Multi-Agent Reinforcement Learning (MARL) critically depends on the availability of high-quality, pre-collected offline datasets that represent real-world complexities and practical applications. However, existing datasets often fall short in their simplicity and lack of realism. To address this gap, we propose Hokoff, a comprehens… ▽ More The advancement of Offline Reinforcement Learning (RL) and Offline Multi-Agent Reinforcement Learning (MARL) critically depends on the availability of high-quality, pre-collected offline datasets that represent real-world complexities and practical applications. However, existing datasets often fall short in their simplicity and lack of realism. To address this gap, we propose Hokoff, a comprehensive set of pre-collected datasets that covers both offline RL and offline MARL, accompanied by a robust framework, to facilitate further research. This data is derived from Honor of Kings, a recognized Multiplayer Online Battle Arena (MOBA) game known for its intricate nature, closely resembling real-life situations. Utilizing this framework, we benchmark a variety of offline RL and offline MARL algorithms. We also introduce a novel baseline algorithm tailored for the inherent hierarchical action space of the game. We reveal the incompetency of current offline RL approaches in handling task complexity, generalization and multi-task learning. △ Less

Submitted 20 August, 2024; originally announced August 2024.

arXiv:2408.07341 [pdf, other]

Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration

Authors: Xiaogen Zhou, Yiyou Sun, Min Deng, Winnie Chiu Wing Chu, Qi Dou

Abstract: Multimodal learning leverages complementary information derived from different modalities, thereby enhancing performance in medical image segmentation. However, prevailing multimodal learning methods heavily rely on extensive well-annotated data from various modalities to achieve accurate segmentation performance. This dependence often poses a challenge in clinical settings due to limited availabi… ▽ More Multimodal learning leverages complementary information derived from different modalities, thereby enhancing performance in medical image segmentation. However, prevailing multimodal learning methods heavily rely on extensive well-annotated data from various modalities to achieve accurate segmentation performance. This dependence often poses a challenge in clinical settings due to limited availability of such data. Moreover, the inherent anatomical misalignment between different imaging modalities further complicates the endeavor to enhance segmentation performance. To address this problem, we propose a novel semi-supervised multimodal segmentation framework that is robust to scarce labeled data and misaligned modalities. Our framework employs a novel cross modality collaboration strategy to distill modality-independent knowledge, which is inherently associated with each modality, and integrates this information into a unified fusion layer for feature amalgamation. With a channel-wise semantic consistency loss, our framework ensures alignment of modality-independent information from a feature-wise perspective across modalities, thereby fortifying it against misalignments in multimodal scenarios. Furthermore, our framework effectively integrates contrastive consistent learning to regulate anatomical structures, facilitating anatomical-wise prediction alignment on unlabeled data in semi-supervised segmentation tasks. Our method achieves competitive performance compared to other multimodal methods across three tasks: cardiac, abdominal multi-organ, and thyroid-associated orbitopathy segmentations. It also demonstrates outstanding robustness in scenarios involving scarce labeled data and misaligned modalities. △ Less

Submitted 3 September, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

arXiv:2407.18256 [pdf, other]

Kibble-Zurek Behavior in the Boundary-obstructed Phase Transitions

Authors: Menghua Deng, Zhoujian Sun, Fuxiang Li

Abstract: We study the nonadiabatic dynamics of a two-dimensional higher-order topological insulator when the system is slowly quenched across the boundary-obstructed phase transition, which is characterized by edge band gap closing. We find that the number of excitations produced after the quench exhibits power-law scaling behaviors with the quench rate. Boundary conditions can drastically modify the sca… ▽ More We study the nonadiabatic dynamics of a two-dimensional higher-order topological insulator when the system is slowly quenched across the boundary-obstructed phase transition, which is characterized by edge band gap closing. We find that the number of excitations produced after the quench exhibits power-law scaling behaviors with the quench rate. Boundary conditions can drastically modify the scaling behaviors: The scaling exponent is found to be $α=1/2$ for hybridized and fully open boundary conditions, and $α=2$ for periodic boundary condition. We argue that the exponent $α=1/2$ cannot be explained by the Kibble-Zurek mechanism unless we adopt an effective dimension $d^{\rm eff}=1$ instead of the real dimension $d=2$. For comparison, we also investigate the slow quench dynamics across the bulk-obstructed phase transitions and a single multicritical point, which obeys the Kibble-Zurek mechanism with dimension $d=2$. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 6+10 pages, 3=7 figures

arXiv:2407.03713 [pdf, other]

Compositions of the Hercules-Aquila Cloud and Virgo Over-density

Authors: Dashuang Ye, Cuihua Du, Mingji Deng, Jiwei Liao, Yang Huang, Jianrong Shi, Jun Ma

Abstract: Based on a sample of K giant from Large sky Area Multi-Object fiber Spectroscopic Telescope (LAMOST) Data Release 8 and a sample of RR Lyrae (RRL) from \textit{Gaia} Data Release 3, we investigate the compositions of the Hercules-Aquila Cloud (HAC) and Virgo Over-density (VOD) and their collective contribution to the tilt and triaxiality of the stellar halo ($r\,\textless\,40\,{\rm kpc}$) as well… ▽ More Based on a sample of K giant from Large sky Area Multi-Object fiber Spectroscopic Telescope (LAMOST) Data Release 8 and a sample of RR Lyrae (RRL) from \textit{Gaia} Data Release 3, we investigate the compositions of the Hercules-Aquila Cloud (HAC) and Virgo Over-density (VOD) and their collective contribution to the tilt and triaxiality of the stellar halo ($r\,\textless\,40\,{\rm kpc}$) as well as two breaks at $\approx15\,{\rm kpc}$ and 30\,kpc. We apply the Gaussian mixture model (GMM) to divide the stellar halo into the isotropic component and the radially biased anisotropic component, namely Gaia-Sausage-Enceladus (GSE), and find that both HAC and VOD are dominated by the GSE debris stars with weights of $0.67^{+0.09}_{-0.07}$ and $0.57^{+0.07}_{-0.06}$, respectively. In addition, using the K giants with orbital parameters, we identify the member stars of known substructures, including GSE, Sagittarius (Sgr), Helmi Streams, Sequoia, Thamnos, Pontus, Wukong, and Metal-weak Thick Disk (MWTD), to probe the compositions of low-eccentricity stars in the HAC and VOD regions. In density fittings of the RRL sample, we note that the absence of HAC and VOD has a weak effect on the shape of halo. Finally, we find that the radially biased anisotropic halo contributes majorly to the stellar halo that can be modelled with a tilted triaxial ellipsoid and a doubly broken power law with breaking radii at $18.08^{+2.04}_{-3.22}\,{\rm kpc}$ and $33.03^{+1.30}_{-1.21}\,{\rm kpc}$. This has important significance for understanding the status of large diffuse over-densities in the Milky Way. △ Less

Submitted 4 July, 2024; originally announced July 2024.

Comments: 11 pages, 9 figures, accepted for publication in MNRAS

arXiv:2406.20098 [pdf, other]

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Authors: Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen

Abstract: Multimodal large language models (MLLMs) have shown impressive success across modalities such as image, video, and audio in a variety of understanding and generation tasks. However, current MLLMs are surprisingly poor at understanding webpage screenshots and generating their corresponding HTML code. To address this problem, we propose Web2Code, a benchmark consisting of a new large-scale webpage-t… ▽ More Multimodal large language models (MLLMs) have shown impressive success across modalities such as image, video, and audio in a variety of understanding and generation tasks. However, current MLLMs are surprisingly poor at understanding webpage screenshots and generating their corresponding HTML code. To address this problem, we propose Web2Code, a benchmark consisting of a new large-scale webpage-to-code dataset for instruction tuning and an evaluation framework for the webpage understanding and HTML code translation abilities of MLLMs. For dataset construction, we leverage pretrained LLMs to enhance existing webpage-to-code datasets as well as generate a diverse pool of new webpages rendered into images. Specifically, the inputs are webpage images and instructions, while the responses are the webpage's HTML code. We further include diverse natural language QA pairs about the webpage content in the responses to enable a more comprehensive understanding of the web content. To evaluate model performance in these tasks, we develop an evaluation framework for testing MLLMs' abilities in webpage understanding and web-to-code generation. Extensive experiments show that our proposed dataset is beneficial not only to our proposed tasks but also in the general visual domain, while previous datasets result in worse performance. We hope our work will contribute to the development of general MLLMs suitable for web-based content generation and task automation. Our data and code will be available at https://github.com/MBZUAI-LLM/web2code. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: Website at https://mbzuai-llm.github.io/webpage2code/

arXiv:2406.18016 [pdf, other]

Universal scaling of quantum state transport in one-dimensional topological chain under nonadiabatic dynamics

Authors: Lingzi Huang, Menghua Deng, Chen Sun, Fuxiang Li

Abstract: When a system is driven across a continuous phase transition, the density of topological defects demonstrates a power-law scaling behavior versus the quenching rate, as predicted by Kibble-Zurek mechanism. In this study, we generalized this idea and address the scaling of quantum state transport in a one-dimensional topological system subject to a linear drive through its topological quantum phase… ▽ More When a system is driven across a continuous phase transition, the density of topological defects demonstrates a power-law scaling behavior versus the quenching rate, as predicted by Kibble-Zurek mechanism. In this study, we generalized this idea and address the scaling of quantum state transport in a one-dimensional topological system subject to a linear drive through its topological quantum phase transition point. We illustrate the power-law dependencies of the quantum state's transport distance, width, and peak magnitude on the driving velocity. Crucially, the power-law exponents are distinct for the edge state and bulk state. Our results offer a novel perspective on quantum state transfer and enriches the field of Kibble-Zurek behaviors and nonadiabatic quantum dynamics. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 10 pages, 8 figures; version accepted by PRB

arXiv:2406.11838 [pdf, other]

Autoregressive Image Generation without Vector Quantization

Authors: Tianhong Li, Yonglong Tian, He Li, Mingyang Deng, Kaiming He

Abstract: Conventional wisdom holds that autoregressive models for image generation are typically accompanied by vector-quantized tokens. We observe that while a discrete-valued space can facilitate representing a categorical distribution, it is not a necessity for autoregressive modeling. In this work, we propose to model the per-token probability distribution using a diffusion procedure, which allows us t… ▽ More Conventional wisdom holds that autoregressive models for image generation are typically accompanied by vector-quantized tokens. We observe that while a discrete-valued space can facilitate representing a categorical distribution, it is not a necessity for autoregressive modeling. In this work, we propose to model the per-token probability distribution using a diffusion procedure, which allows us to apply autoregressive models in a continuous-valued space. Rather than using categorical cross-entropy loss, we define a Diffusion Loss function to model the per-token probability. This approach eliminates the need for discrete-valued tokenizers. We evaluate its effectiveness across a wide range of cases, including standard autoregressive models and generalized masked autoregressive (MAR) variants. By removing vector quantization, our image generator achieves strong results while enjoying the speed advantage of sequence modeling. We hope this work will motivate the use of autoregressive generation in other continuous-valued domains and applications. Code is available at: https://github.com/LTH14/mar △ Less

Submitted 28 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Comments: Tech report. Code: https://github.com/LTH14/mar

arXiv:2405.11733 [pdf, other]

doi 10.1088/0256-307X/41/9/090301

Simulating a Chern Insulator with C = $\pm$2 on Synthetic Floquet Lattice

Authors: Lingxiao Lei, Weichen Wang, Guangyao Huang, Shun Hu, Xi Cao, Xinfang Zhang, Mingtang Deng, Pingxing Chen

Abstract: The synthetic Floquet lattice, generated by multiple strong drives with mutually incommensurate frequencies, provides a powerful platform for the quantum simulation of topological phenomena. In this study, we propose a 4-band tight-binding model of the Chern insulator with a Chern number C = $\pm$2 by coupling two layers of the half-BHZ lattice and subsequently mapping it onto the Floquet lattice… ▽ More The synthetic Floquet lattice, generated by multiple strong drives with mutually incommensurate frequencies, provides a powerful platform for the quantum simulation of topological phenomena. In this study, we propose a 4-band tight-binding model of the Chern insulator with a Chern number C = $\pm$2 by coupling two layers of the half-BHZ lattice and subsequently mapping it onto the Floquet lattice to simulate its topological properties. To determine the Chern number of our Floquet-version model, we extend the energy pumping method proposed by Martin et al. [Phys. Rev. X 7, 041008 (2017)] and the topological oscillation method introduced by Boyers et al. [Phys. Rev. Lett. 125, 160505 (2020)], followed by numerical simulations for both methodologies. The simulation results demonstrate the successful extraction of the Chern number using either of these methods, providing an excellent prediction of the phase diagram that closely aligns with the theoretical one derived from the original bilayer half-BHZ model. Finally, we briefly discuss a potential experimental implementation for our model. Our work demonstrates significant potential for simulating complex topological matter using quantum computing platforms, thereby paving the way for constructing a more universal simulator for non-interacting topological quantum states and advancing our understanding of these intriguing phenomena. △ Less

Submitted 25 August, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

Journal ref: Chinese Physics Letters, Vol. 41, No. 9, 2024

arXiv:2405.09672 [pdf, other]

doi 10.1145/3658180

Eulerian-Lagrangian Fluid Simulation on Particle Flow Maps

Authors: Junwei Zhou, Duowen Chen, Molin Deng, Yitong Deng, Yuchen Sun, Sinan Wang, Shiying Xiong, Bo Zhu

Abstract: We propose a novel Particle Flow Map (PFM) method to enable accurate long-range advection for incompressible fluid simulation. The foundation of our method is the observation that a particle trajectory generated in a forward simulation naturally embodies a perfect flow map. Centered on this concept, we have developed an Eulerian-Lagrangian framework comprising four essential components: Lagrangian… ▽ More We propose a novel Particle Flow Map (PFM) method to enable accurate long-range advection for incompressible fluid simulation. The foundation of our method is the observation that a particle trajectory generated in a forward simulation naturally embodies a perfect flow map. Centered on this concept, we have developed an Eulerian-Lagrangian framework comprising four essential components: Lagrangian particles for a natural and precise representation of bidirectional flow maps; a dual-scale map representation to accommodate the mapping of various flow quantities; a particle-to-grid interpolation scheme for accurate quantity transfer from particles to grid nodes; and a hybrid impulse-based solver to enforce incompressibility on the grid. The efficacy of PFM has been demonstrated through various simulation scenarios, highlighting the evolution of complex vortical structures and the details of turbulent flows. Notably, compared to NFM, PFM reduces computing time by up to 49 times and memory consumption by up to 41%, while enhancing vorticity preservation as evidenced in various tests like leapfrog, vortex tube, and turbulent flow. △ Less

Submitted 15 May, 2024; originally announced May 2024.

arXiv:2404.18578 [pdf, other]

Scheme for braiding Majorana zero modes in vortices using an STT-matrix

Authors: Guangyao Huang, Xinfang Zhang, Xiaofeng Yi, Jibang Fu, Weichen Wang, Mingtang Deng

Abstract: Recently conducted experiments on two-dimensional topological superconductors have revealed various indications of Majorana zero modes (MZMs). However, progress in the manipulation of MZM braiding has been limited, impeding the realization of topological quantum computing. In this study, we propose a potential braiding scheme based on a spintronic device matrix. This scheme involves utilizing a ma… ▽ More Recently conducted experiments on two-dimensional topological superconductors have revealed various indications of Majorana zero modes (MZMs). However, progress in the manipulation of MZM braiding has been limited, impeding the realization of topological quantum computing. In this study, we propose a potential braiding scheme based on a spintronic device matrix. This scheme involves utilizing a matrix composed of spin-transfer torque devices (STT-matrix) alongside a two-dimensional topological superconductor material. By programming the ON/OFF states of the spintronic devices within the STT-matrix, it becomes possible to manipulate vortices hosting MZMs in the two-dimensional topological superconductor. To further investigate this concept, we construct a time-dependent Ginzburg-Landau model and perform numerical simulations to analyze vortex-driving dynamics, MZM braiding processes, and MZM fusion phenomena. Our findings demonstrate that this system exhibits high versatility and flexibility in manipulating vortices. With advancements in spintronic device technology, our proposed scheme offers a feasible and practical method for operating MZMs within vortices present in topological superconductors. △ Less

Submitted 25 August, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.09294 [pdf, other]

doi 10.1103/PhysRevA.109.043324

Miscibility of Binary Bose-Einstein Condensates with $p$-wave Interaction

Authors: Min Deng, Ming Xue, Jinghan Pang, Hui Luo, Zhiguo Wang, Jinbin Li, Dayou Yang

Abstract: We investigate the ground-state phase diagram of a binary mixture of Bose-Einstein condensates (BECs) with competing interspecies $s$- and $p$-wave interactions. Exploiting a pseudopotential model for the $l=1$ partial wave, we derive an extended Gross-Pitaevskii (GP) equation for the BEC mixture that incorporates both $s$- and $p$-wave interactions. Based on it, we study the miscible-immiscible t… ▽ More We investigate the ground-state phase diagram of a binary mixture of Bose-Einstein condensates (BECs) with competing interspecies $s$- and $p$-wave interactions. Exploiting a pseudopotential model for the $l=1$ partial wave, we derive an extended Gross-Pitaevskii (GP) equation for the BEC mixture that incorporates both $s$- and $p$-wave interactions. Based on it, we study the miscible-immiscible transition of a binary BEC mixture in the presence of interspecies $p$-wave interaction, by combining numerical solution of the GP equation and Gaussian variational analysis. Our study uncovers a dual effect -- either enhance or reduce miscibility -- of positive interspecies $p$-wave interaction, which can be precisely controlled by adjusting relevant experimental parameters. By complete characterizing the miscibility phase diagram, we establish a promising avenue towards experimental control of the miscibility of binary BEC mixtures via high partial-wave interactions. △ Less

Submitted 14 April, 2024; originally announced April 2024.

Comments: 10+3 pages, 6 figures, Phys. Rev. A (2024)

Journal ref: Phys. Rev. A 109, 043324 (2024)

arXiv:2403.16684 [pdf]

Electrically tunable, rapid spin-orbit torque induced modulation of colossal magnetoresistance in Mn$_3$Si$_2$Te$_6$ nanoflakes

Authors: Cheng Tan, Mingxun Deng, Yuanjun Yang, Linlin An, Weifeng Ge, Sultan Albarakati, Majid Panahandeh-Fard, James Partridge, Dimitrie Culcer, Bin Lei, Tao Wu, Xiangde Zhu, Mingliang Tian, Xianhui Chen, Rui-Qiang Wang, Lan Wang

Abstract: As a quasi-layered ferrimagnetic material, Mn$_3$Si$_2$Te$_6$ nanoflakes exhibit magnetoresistance behaviour that is fundamentally different from their bulk crystal counterparts. They offer three key properties crucial for spintronics. Firstly, at least 10^6 times faster response comparing to that exhibited by bulk crystals has been observed in current-controlled resistance and magnetoresistance.… ▽ More As a quasi-layered ferrimagnetic material, Mn$_3$Si$_2$Te$_6$ nanoflakes exhibit magnetoresistance behaviour that is fundamentally different from their bulk crystal counterparts. They offer three key properties crucial for spintronics. Firstly, at least 10^6 times faster response comparing to that exhibited by bulk crystals has been observed in current-controlled resistance and magnetoresistance. Secondly, ultra-low current density is required for resistance modulation (~ 5 A/cm$^2$). Thirdly, electrically gate-tunable magnetoresistance has been realized. Theoretical calculations reveal that the unique magnetoresistance behaviour in the Mn$_3$Si$_2$Te$_6$ nanoflakes arises from a magnetic field induced band gap shift across the Fermi level. The rapid current induced resistance variation is attributed to spin-orbit torque, an intrinsically ultra-fast process (~nanoseconds). This study suggests promising avenues for spintronic applications. In addition, it highlights Mn$_3$Si$_2$Te$_6$ nanoflakes as a suitable platform for investigating the intriguing physics underlying chiral orbital moments, magnetic field induced band variation and spin torque. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: 22 pages,4 figures

arXiv:2402.17356 [pdf, ps, other]

Modulation of chiral anomaly and bilinear magnetoconductivity in Weyl semimetals by impurity-resonance states

Authors: Mei-Wei Hu, Zhuo-Yan Fang, Hou-Jian Duan, Mou Yang, Ming-Xun Deng, Rui-Qiang Wang

Abstract: The phenomenon of nonlinear transport has attracted tremendous interest within the condensed matter community. We present a theoretical framework for nonlinear transport based on the nonequilibrium retarded Green's function, and examine the impact of disorder on nonlinear magnetotransport in Weyl semimetals (WSMs). It is demonstrated that bilinear magnetoconductivity can be induced in disordered W… ▽ More The phenomenon of nonlinear transport has attracted tremendous interest within the condensed matter community. We present a theoretical framework for nonlinear transport based on the nonequilibrium retarded Green's function, and examine the impact of disorder on nonlinear magnetotransport in Weyl semimetals (WSMs). It is demonstrated that bilinear magnetoconductivity can be induced in disordered WSMs by several mechanisms, including impurity-induced tilting of the Weyl cones, Lorentz-force-induced normal orbital magnetic moment, and chiral anomaly arising from the Berry-curvature-induced anomalous orbital magnetic moment. Additionally, we observe that the localization of Weyl fermions by impurity scattering will lead to resonant dips in both the chiral chemical potential and magnetoconductivity when the Fermi energy approaches the impurity resonance states. Our findings offer a theoretical proposition for modulating nonreciprocal transport in topological semimetals. △ Less

Submitted 27 February, 2024; originally announced February 2024.

Comments: 5 figures

arXiv:2401.02017 [pdf, other]

The origin of High-velocity stars considering the impact of the Large Magellanic Cloud

Authors: Jiwei Liao, Cuihua Du, Mingji Deng, Dashuang Ye, Hefan Li, Yang Huang, Jianrong Shi, Jun Ma

Abstract: Utilizing astrometric parameters sourced from \textit{Gaia} Data Release 3 and radial velocities obtained from various spectroscopic surveys, we identify 519 high-velocity stars (HiVels) with a total velocity in the Galactocentric restframe greater than 70\% of their local escape velocity under the {\tt\string Gala} {\tt\string MilkyWayPotential}. Our analysis reveals that the majority of these Hi… ▽ More Utilizing astrometric parameters sourced from \textit{Gaia} Data Release 3 and radial velocities obtained from various spectroscopic surveys, we identify 519 high-velocity stars (HiVels) with a total velocity in the Galactocentric restframe greater than 70\% of their local escape velocity under the {\tt\string Gala} {\tt\string MilkyWayPotential}. Our analysis reveals that the majority of these HiVels are metal-poor late-type giants, and we show 9 HiVels that are unbound candidates to the Galaxy with escape probabilities of 50\%. To investigate the origins of these HiVels, we classify them into four categories and consider the impact of the Large Magellanic Cloud (LMC) potential on their backward-integration trajectories. Specifically, we find that one of the HiVels can track back to the Galactic Center, and three HiVels may originate from the Sagittarius dwarf spheroidal galaxy (Sgr dSph). Furthermore, some HiVels appear to be ejected from the Galactic disk, while others formed within the Milky Way or have an extragalactic origin. Given that the LMC has a significant impact on the orbits of Sgr dSph, we examine the reported HiVels that originate from the Sgr dSph, with a few of them passing within the half-light radius of the Sgr dSph. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: 17 pages, 5figures, accepted for publication in AJ

arXiv:2401.01111 [pdf, ps, other]

RKKY signals characterizing the topological phase transitions in Floquet Dirac semimetals

Authors: Hou-Jian Duan, Shi-Ming Cai, Xing Wei, Yong-Chi Chen, Yong-Jia Wu, Ming-Xun Deng, Ruiqiang Wang, Mou Yang

Abstract: Recently, the Floquet ${\rm Na_3Bi}$-type material has been proposed as an ideal platform for realizing various phases, i.e., the spin-degenerate Dirac semimetal (DSM) can be turned into the Weyl semimetal (WSM), and even to the Weyl half-metal (WHM). Instead of the conventional electrical methods, we use the RKKY interaction to characterize the topological phase transitions in this paper. It is f… ▽ More Recently, the Floquet ${\rm Na_3Bi}$-type material has been proposed as an ideal platform for realizing various phases, i.e., the spin-degenerate Dirac semimetal (DSM) can be turned into the Weyl semimetal (WSM), and even to the Weyl half-metal (WHM). Instead of the conventional electrical methods, we use the RKKY interaction to characterize the topological phase transitions in this paper. It is found that detecting the Ising term $J_I$ is feasible for distinguishing the phase transition of DSM/WSM, since the emergence of $J_I$ is induced by the broken spin degeneracy. For the case with impurities deposited on $z$ axis (the line connecting the Weyl points), the Heisenberg term $J_H$ coexists with $J_I$ in the WSM, while $J_H$ is filtered out and only $J_I$ survives in the WHM. This magnetic filtering effect is a reflection of the fully spin-polarized property (one spin band is in the WSM phase while the other is gapped) of the WHM, and it can act a signal to capture the phase transition of WSM/WHM. This signal can not be disturbed unless the direction of the impurities greatly deviates from $z$ axis. Interestingly, as the impurities are moved into the $x$-$y$ plane, there arises another signal (a dip structure for $J_H$ at the phase boundary), which can also identify the phase transition of WSM/WHM. Furthermore, we have verified that all magnetic signals are robust to the term that breaks the electron-hole symmetry. Besides characterizing the phase transitions, our results also suggest that the Floquet DSMs are power platforms for controlling the magnetic interaction. △ Less

Submitted 4 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

Comments: 15 pages, 10 figures

MSC Class: 81V15 ACM Class: J.2

arXiv:2312.06062 [pdf, other]

Randomised benchmarking for characterizing and forecasting correlated processes

Authors: Xinfang Zhang, Zhihao Wu, Gregory A. L. White, Zhongcheng Xiang, Shun Hu, Zhihui Peng, Yong Liu, Dongning Zheng, Xiang Fu, Anqi Huang, Dario Poletti, Kavan Modi, Junjie Wu, Mingtang Deng, Chu Guo

Abstract: The development of fault-tolerant quantum processors relies on the ability to control noise. A particularly insidious form of noise is temporally correlated or non-Markovian noise. By combining randomized benchmarking with supervised machine learning algorithms, we develop a method to learn the details of temporally correlated noise. In particular, we can learn the time-independent evolution opera… ▽ More The development of fault-tolerant quantum processors relies on the ability to control noise. A particularly insidious form of noise is temporally correlated or non-Markovian noise. By combining randomized benchmarking with supervised machine learning algorithms, we develop a method to learn the details of temporally correlated noise. In particular, we can learn the time-independent evolution operator of system plus bath and this leads to (i) the ability to characterize the degree of non-Markovianity of the dynamics and (ii) the ability to predict the dynamics of the system even beyond the times we have used to train our model. We exemplify this by implementing our method on a superconducting quantum processor. Our experimental results show a drastic change between the Markovian and non-Markovian regimes for the learning accuracies. △ Less

Submitted 10 December, 2023; originally announced December 2023.

Comments: 4 pages, 3 figures

arXiv:2310.07837 [pdf, other]

Measuring Feature Sparsity in Language Models

Authors: Mingyang Deng, Lucas Tao, Joe Benton

Abstract: Recent works have proposed that activations in language models can be modelled as sparse linear combinations of vectors corresponding to features of input text. Under this assumption, these works aimed to reconstruct feature directions using sparse coding. We develop metrics to assess the success of these sparse coding techniques and test the validity of the linearity and sparsity assumptions. We… ▽ More Recent works have proposed that activations in language models can be modelled as sparse linear combinations of vectors corresponding to features of input text. Under this assumption, these works aimed to reconstruct feature directions using sparse coding. We develop metrics to assess the success of these sparse coding techniques and test the validity of the linearity and sparsity assumptions. We show our metrics can predict the level of sparsity on synthetic sparse linear activations, and can distinguish between sparse linear data and several other distributions. We use our metrics to measure levels of sparsity in several language models. We find evidence that language model activations can be accurately modelled by sparse linear combinations of features, significantly more so than control datasets. We also show that model activations appear to be sparsest in the first and final layers. △ Less

Submitted 13 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

arXiv:2310.07371 [pdf, other]

doi 10.1364/OL.494560

Experimental quantum natural gradient optimization in photonics

Authors: Yizhi Wang, Shichuan Xue, Yaxuan Wang, Jiangfang Ding, Weixu Shi, Dongyang Wang, Yong Liu, Yingwen Liu, Xiang Fu, Guangyao Huang, Anqi Huang, Mingtang Deng, Junjie Wu

Abstract: Variational quantum algorithms (VQAs) combining the advantages of parameterized quantum circuits and classical optimizers, promise practical quantum applications in the Noisy Intermediate-Scale Quantum era. The performance of VQAs heavily depends on the optimization method. Compared with gradient-free and ordinary gradient descent methods, the quantum natural gradient (QNG), which mirrors the geom… ▽ More Variational quantum algorithms (VQAs) combining the advantages of parameterized quantum circuits and classical optimizers, promise practical quantum applications in the Noisy Intermediate-Scale Quantum era. The performance of VQAs heavily depends on the optimization method. Compared with gradient-free and ordinary gradient descent methods, the quantum natural gradient (QNG), which mirrors the geometric structure of the parameter space, can achieve faster convergence and avoid local minima more easily, thereby reducing the cost of circuit executions. We utilized a fully programmable photonic chip to experimentally estimate the QNG in photonics for the first time. We obtained the dissociation curve of the He-H$^+$ cation and achieved chemical accuracy, verifying the outperformance of QNG optimization on a photonic device. Our work opens up a vista of utilizing QNG in photonics to implement practical near-term quantum applications. △ Less

Submitted 11 October, 2023; originally announced October 2023.

Journal ref: Optics Letters Vol. 48, Issue 14, pp. 3745-3748 (2023)

arXiv:2310.00585 [pdf, other]

doi 10.1364/OL.505084

Quantum generative adversarial learning in photonics

Authors: Yizhi Wang, Shichuan Xue, Yaxuan Wang, Yong Liu, Jiangfang Ding, Weixu Shi, Dongyang Wang, Yingwen Liu, Xiang Fu, Guangyao Huang, Anqi Huang, Mingtang Deng, Junjie Wu

Abstract: Quantum Generative Adversarial Networks (QGANs), an intersection of quantum computing and machine learning, have attracted widespread attention due to their potential advantages over classical analogs. However, in the current era of Noisy Intermediate-Scale Quantum (NISQ) computing, it is essential to investigate whether QGANs can perform learning tasks on near-term quantum devices usually affecte… ▽ More Quantum Generative Adversarial Networks (QGANs), an intersection of quantum computing and machine learning, have attracted widespread attention due to their potential advantages over classical analogs. However, in the current era of Noisy Intermediate-Scale Quantum (NISQ) computing, it is essential to investigate whether QGANs can perform learning tasks on near-term quantum devices usually affected by noise and even defects. In this Letter, using a programmable silicon quantum photonic chip, we experimentally demonstrate the QGAN model in photonics for the first time, and investigate the effects of noise and defects on its performance. Our results show that QGANs can generate high-quality quantum data with a fidelity higher than 90\%, even under conditions where up to half of the generator's phase shifters are damaged, or all of the generator and discriminator's phase shifters are subjected to phase noise up to 0.04$π$. Our work sheds light on the feasibility of implementing QGANs on NISQ-era quantum hardware. △ Less

Submitted 1 October, 2023; originally announced October 2023.

Journal ref: Optics Letters Vol. 48, Issue 20, pp. 5197-5200 (2023)

arXiv:2309.13579 [pdf, other]

Seeing Is Not Always Believing: Invisible Collision Attack and Defence on Pre-Trained Models

Authors: Minghang Deng, Zhong Zhang, Junming Shao

Abstract: Large-scale pre-trained models (PTMs) such as BERT and GPT have achieved great success in diverse fields. The typical paradigm is to pre-train a big deep learning model on large-scale data sets, and then fine-tune the model on small task-specific data sets for downstream tasks. Although PTMs have rapidly progressed with wide real-world applications, they also pose significant risks of potential at… ▽ More Large-scale pre-trained models (PTMs) such as BERT and GPT have achieved great success in diverse fields. The typical paradigm is to pre-train a big deep learning model on large-scale data sets, and then fine-tune the model on small task-specific data sets for downstream tasks. Although PTMs have rapidly progressed with wide real-world applications, they also pose significant risks of potential attacks. Existing backdoor attacks or data poisoning methods often build up the assumption that the attacker invades the computers of victims or accesses the target data, which is challenging in real-world scenarios. In this paper, we propose a novel framework for an invisible attack on PTMs with enhanced MD5 collision. The key idea is to generate two equal-size models with the same MD5 checksum by leveraging the MD5 chosen-prefix collision. Afterwards, the two ``same" models will be deployed on public websites to induce victims to download the poisoned model. Unlike conventional attacks on deep learning models, this new attack is flexible, covert, and model-independent. Additionally, we propose a simple defensive strategy for recognizing the MD5 chosen-prefix collision and provide a theoretical justification for its feasibility. We extensively validate the effectiveness and stealthiness of our proposed attack and defensive method on different models and data sets. △ Less

Submitted 7 May, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

Comments: 10 pages, 4 figures

ACM Class: I.2.0

arXiv:2309.08233 [pdf, ps, other]

Quantum Hall effect in topological Dirac semimetals modulated by the Lifshitz transition of the Fermi arc surface states

Authors: Tao-Rui Qin, Zhuo-Hua Chen, Tian-Xing Liu, Fu-Yang Chen, Hou-Jian Duan, Ming-Xun Deng, Rui-Qiang Wang

Abstract: We investigate the magnetotransport of topological Dirac semimetals (DSMs) by taking into account the Lifshitz transition of the Fermi arc surface states. We demonstrate that a bulk momentum-dependent gap term, which is usually neglected in study of the bulk energy-band topology, can cause the Lifshitz transition by developing an additional Dirac cone for the surface to prevent the Fermi arcs from… ▽ More We investigate the magnetotransport of topological Dirac semimetals (DSMs) by taking into account the Lifshitz transition of the Fermi arc surface states. We demonstrate that a bulk momentum-dependent gap term, which is usually neglected in study of the bulk energy-band topology, can cause the Lifshitz transition by developing an additional Dirac cone for the surface to prevent the Fermi arcs from connecting the bulk Dirac points. As a result, the Weyl orbits can be turned off by the surface Dirac cone without destroying the bulk Dirac points. In response to the surface Lifshitz transition, the Weyl-orbit mechanism for the 3D quantum Hall effect (QHE) in topological DSMs will break down. The resulting quantized Hall plateaus can be thickness-dependent, similar to the Weyl-orbit mechanism, but their widths and quantized values become irregular. Accordingly, we propose that apart from the bulk Weyl nodes and Fermi arcs, the surface Lifshitz transition is also crucial for realizing stable Weyl orbits and 3D QHE in real materials. △ Less

Submitted 15 September, 2023; originally announced September 2023.

Comments: 7

arXiv:2309.08100 [pdf]

Research on Joint Representation Learning Methods for Entity Neighborhood Information and Description Information

Authors: Le Xiao, Xin Shan, Yuhua Wang, Miaolei Deng

Abstract: To address the issue of poor embedding performance in the knowledge graph of a programming design course, a joint represen-tation learning model that combines entity neighborhood infor-mation and description information is proposed. Firstly, a graph at-tention network is employed to obtain the features of entity neigh-boring nodes, incorporating relationship features to enrich the structural infor… ▽ More To address the issue of poor embedding performance in the knowledge graph of a programming design course, a joint represen-tation learning model that combines entity neighborhood infor-mation and description information is proposed. Firstly, a graph at-tention network is employed to obtain the features of entity neigh-boring nodes, incorporating relationship features to enrich the structural information. Next, the BERT-WWM model is utilized in conjunction with attention mechanisms to obtain the representation of entity description information. Finally, the final entity vector representation is obtained by combining the vector representations of entity neighborhood information and description information. Experimental results demonstrate that the proposed model achieves favorable performance on the knowledge graph dataset of the pro-gramming design course, outperforming other baseline models. △ Less

Submitted 14 September, 2023; originally announced September 2023.

arXiv:2307.06914 [pdf, ps, other]

Uniform sets with few progressions via colorings

Authors: Mingyang Deng, Jonathan Tidor, Yufei Zhao

Abstract: Ruzsa asked whether there exist Fourier-uniform subsets of $\mathbb Z/N\mathbb Z$ with density $α$ and 4-term arithmetic progression (4-APs) density at most $α^C$, for arbitrarily large $C$. Gowers constructed Fourier uniform sets with density $α$ and 4-AP density at most $α^{4+c}$ for some small constant $c>0$. We show that an affirmative answer to Ruzsa's question would follow from the existence… ▽ More Ruzsa asked whether there exist Fourier-uniform subsets of $\mathbb Z/N\mathbb Z$ with density $α$ and 4-term arithmetic progression (4-APs) density at most $α^C$, for arbitrarily large $C$. Gowers constructed Fourier uniform sets with density $α$ and 4-AP density at most $α^{4+c}$ for some small constant $c>0$. We show that an affirmative answer to Ruzsa's question would follow from the existence of an $N^{o(1)}$-coloring of $[N]$ without symmetrically colored 4-APs. For a broad and natural class of constructions of Fourier-uniform subsets of $\mathbb Z/N\mathbb Z$, we show that Ruzsa's question is equivalent to our arithmetic Ramsey question. We prove analogous results for all even-length APs. For each odd $k\geq 5$, we show that there exist $U^{k-2}$-uniform subsets of $\mathbb Z/N\mathbb Z$ with density $α$ and $k$-AP density at most $α^{c_k \log(1/α)}$. We also prove generalizations to arbitrary one-dimensional patterns. △ Less

Submitted 13 July, 2023; originally announced July 2023.

Comments: 20 pages

arXiv:2306.14878 [pdf, other]

Restart Sampling for Improving Generative Processes

Authors: Yilun Xu, Mingyang Deng, Xiang Cheng, Yonglong Tian, Ziming Liu, Tommi Jaakkola

Abstract: Generative processes that involve solving differential equations, such as diffusion models, frequently necessitate balancing speed and quality. ODE-based samplers are fast but plateau in performance while SDE-based samplers deliver higher sample quality at the cost of increased sampling time. We attribute this difference to sampling errors: ODE-samplers involve smaller discretization errors while… ▽ More Generative processes that involve solving differential equations, such as diffusion models, frequently necessitate balancing speed and quality. ODE-based samplers are fast but plateau in performance while SDE-based samplers deliver higher sample quality at the cost of increased sampling time. We attribute this difference to sampling errors: ODE-samplers involve smaller discretization errors while stochasticity in SDE contracts accumulated errors. Based on these findings, we propose a novel sampling algorithm called Restart in order to better balance discretization errors and contraction. The sampling method alternates between adding substantial noise in additional forward steps and strictly following a backward ODE. Empirically, Restart sampler surpasses previous SDE and ODE samplers in both speed and accuracy. Restart not only outperforms the previous best SDE results, but also accelerates the sampling speed by 10-fold / 2-fold on CIFAR-10 / ImageNet $64 \times 64$. In addition, it attains significantly better sample quality than ODE samplers within comparable sampling times. Moreover, Restart better balances text-image alignment/visual quality versus diversity than previous samplers in the large-scale text-to-image Stable Diffusion model pre-trained on LAION $512 \times 512$. Code is available at https://github.com/Newbeeer/diffusion_restart_sampling △ Less

Submitted 1 November, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: Code is available at https://github.com/Newbeeer/diffusion_restart_sampling

arXiv:2304.11688 [pdf, other]

TGNN: A Joint Semi-supervised Framework for Graph-level Classification

Authors: Wei Ju, Xiao Luo, Meng Qu, Yifan Wang, Chong Chen, Minghua Deng, Xian-Sheng Hua, Ming Zhang

Abstract: This paper studies semi-supervised graph classification, a crucial task with a wide range of applications in social network analysis and bioinformatics. Recent works typically adopt graph neural networks to learn graph-level representations for classification, failing to explicitly leverage features derived from graph topology (e.g., paths). Moreover, when labeled data is scarce, these methods are… ▽ More This paper studies semi-supervised graph classification, a crucial task with a wide range of applications in social network analysis and bioinformatics. Recent works typically adopt graph neural networks to learn graph-level representations for classification, failing to explicitly leverage features derived from graph topology (e.g., paths). Moreover, when labeled data is scarce, these methods are far from satisfactory due to their insufficient topology exploration of unlabeled data. We address the challenge by proposing a novel semi-supervised framework called Twin Graph Neural Network (TGNN). To explore graph structural information from complementary views, our TGNN has a message passing module and a graph kernel module. To fully utilize unlabeled data, for each module, we calculate the similarity of each unlabeled graph to other labeled graphs in the memory bank and our consistency loss encourages consistency between two similarity distributions in different embedding spaces. The two twin modules collaborate with each other by exchanging instance similarity knowledge to fully explore the structure information of both labeled and unlabeled data. We evaluate our TGNN on various public datasets and show that it achieves strong performance. △ Less

Submitted 23 April, 2023; originally announced April 2023.

Comments: Accepted by Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (IJCAI 2022)

arXiv:2304.02995 [pdf, ps, other]

Growth of Sobolev norms for 2D cubic nonlinear Schrödinger equation with partial harmonic potential

Authors: Mingming Deng, Xiaoyan Su, Jiqiang Zheng

Abstract: In this paper, we study the $2$D cubic nonlinear Schrödinger equation (NLS) with the partial harmonic potential. First, we prove the local well-posedness in Bourgain spaces by establishing a key bilinear estimate associated with the partial harmonic oscillator. Then, we give the polynomial bound of the Sobolev norms for the solutions using the method of the Planchon, Tzvetkov, and Visciglia. In this paper, we study the $2$D cubic nonlinear Schrödinger equation (NLS) with the partial harmonic potential. First, we prove the local well-posedness in Bourgain spaces by establishing a key bilinear estimate associated with the partial harmonic oscillator. Then, we give the polynomial bound of the Sobolev norms for the solutions using the method of the Planchon, Tzvetkov, and Visciglia. △ Less

Submitted 6 April, 2023; originally announced April 2023.

Comments: 19pages

MSC Class: 35A01; 35B40; 35Q55

arXiv:2303.12381 [pdf, other]

doi 10.1016/j.physa.2023.129286

GSQAS: Graph Self-supervised Quantum Architecture Search

Authors: Zhimin He, Maijie Deng, Shenggen Zheng, Lvzhou Li, Haozhen Situ

Abstract: Quantum Architecture Search (QAS) is a promising approach to designing quantum circuits for variational quantum algorithms (VQAs). However, existing QAS algorithms require to evaluate a large number of quantum circuits during the search process, which makes them computationally demanding and limits their applications to large-scale quantum circuits. Recently, predictor-based QAS has been proposed… ▽ More Quantum Architecture Search (QAS) is a promising approach to designing quantum circuits for variational quantum algorithms (VQAs). However, existing QAS algorithms require to evaluate a large number of quantum circuits during the search process, which makes them computationally demanding and limits their applications to large-scale quantum circuits. Recently, predictor-based QAS has been proposed to alleviate this problem by directly estimating the performances of circuits according to their structures with a predictor trained on a set of labeled quantum circuits. However, the predictor is trained by purely supervised learning, which suffers from poor generalization ability when labeled training circuits are scarce. It is very time-consuming to obtain a large number of labeled quantum circuits because the gate parameters of quantum circuits need to be optimized until convergence to obtain their ground-truth performances. To overcome these limitations, we propose GSQAS, a graph self-supervised QAS, which trains a predictor based on self-supervised learning. Specifically, we first pre-train a graph encoder on a large number of unlabeled quantum circuits using a well-designed pretext task in order to generate meaningful representations of circuits. Then the downstream predictor is trained on a small number of quantum circuits' representations and their labels. Once the encoder is trained, it can apply to different downstream tasks. In order to better encode the spatial topology information and avoid the huge dimension of feature vectors for large-scale quantum circuits, we design a scheme to encode quantum circuits as graphs. Simulation results on searching circuit structures for variational quantum eigensolver and quantum state classification show that GSQAS outperforms the state-of-the-art predictor-based QAS, achieving better performance with fewer labeled circuits. △ Less

Submitted 22 March, 2023; originally announced March 2023.

Comments: 22 pages, 11 figures

arXiv:2302.09258 [pdf, ps, other]

Digital Privacy Under Attack: Challenges and Enablers

Authors: Baobao Song, Mengyue Deng, Shiva Raj Pokhrel, Qiujun Lan, Robin Doss, Gang Li

Abstract: Users have renewed interest in protecting their private data in the digital space. When they don't believe that their privacy is sufficiently covered by one platform, they will readily switch to another. Such an increasing level of privacy awareness has made privacy preservation an essential research topic. Nevertheless, new privacy attacks are emerging day by day. Therefore, a holistic survey to… ▽ More Users have renewed interest in protecting their private data in the digital space. When they don't believe that their privacy is sufficiently covered by one platform, they will readily switch to another. Such an increasing level of privacy awareness has made privacy preservation an essential research topic. Nevertheless, new privacy attacks are emerging day by day. Therefore, a holistic survey to compare the discovered techniques on attacks over privacy preservation and their mitigation schemes is essential in the literature. We develop a study to fill this gap by assessing the resilience of privacy-preserving methods to various attacks and conducting a comprehensive review of countermeasures from a broader perspective. First, we introduce the fundamental concepts and critical components of privacy attacks. Second, we comprehensively cover major privacy attacks targeted at anonymous data, statistical aggregate data, and privacy-preserving models. We also summarize popular countermeasures to mitigate these attacks. Finally, some promising future research directions and related issues in the privacy community are envisaged. We believe this survey will successfully shed some light on privacy research and encourage researchers to entirely understand the resilience of different existing privacy-preserving approaches. △ Less

Submitted 18 February, 2023; originally announced February 2023.

arXiv:2302.00249 [pdf, ps, other]

On the growth of high Sobolev norms of the cubic nonlinear Schrödinger equation on $\mathbb{R}\times \mathbb{T}$

Authors: Mingming Deng, Kailong Yang

Abstract: We consider the cubic nonlinear Schrödinger equation on product manifolds $\mathbb{R}\times \mathbb{T}$. In this paper, we obtain polynomial bounds on the growth in time of high Sobolev norms of the solutions. The main ingredient of the proof is to establish an iteration bound, which is based on the idea used by Bourgain in \cite{B1}. We consider the cubic nonlinear Schrödinger equation on product manifolds $\mathbb{R}\times \mathbb{T}$. In this paper, we obtain polynomial bounds on the growth in time of high Sobolev norms of the solutions. The main ingredient of the proof is to establish an iteration bound, which is based on the idea used by Bourgain in \cite{B1}. △ Less

Submitted 1 February, 2023; originally announced February 2023.

Comments: 15 pages, Comments are welcome!

arXiv:2301.09333 [pdf, ps, other]

doi 10.1137/1.9781611977554.ch113

Approximating Knapsack and Partition via Dense Subset Sums

Authors: Mingyang Deng, Ce Jin, Xiao Mao

Abstract: Knapsack and Partition are two important additive problems whose fine-grained complexities in the $(1-\varepsilon)$-approximation setting are not yet settled. In this work, we make progress on both problems by giving improved algorithms. - Knapsack can be $(1 - \varepsilon)$-approximated in $\tilde O(n + (1/\varepsilon) ^ {2.2} )$ time, improving the previous… ▽ More Knapsack and Partition are two important additive problems whose fine-grained complexities in the $(1-\varepsilon)$-approximation setting are not yet settled. In this work, we make progress on both problems by giving improved algorithms. - Knapsack can be $(1 - \varepsilon)$-approximated in $\tilde O(n + (1/\varepsilon) ^ {2.2} )$ time, improving the previous $\tilde O(n + (1/\varepsilon) ^ {2.25} )$ by Jin (ICALP'19). There is a known conditional lower bound of $(n+\varepsilon)^{2-o(1)}$ based on $(\min,+)$-convolution hypothesis. - Partition can be $(1 - \varepsilon)$-approximated in $\tilde O(n + (1/\varepsilon) ^ {1.25} )$ time, improving the previous $\tilde O(n + (1/\varepsilon) ^ {1.5} )$ by Bringmann and Nakos (SODA'21). There is a known conditional lower bound of $(1/\varepsilon)^{1-o(1)}$ based on Strong Exponential Time Hypothesis. Both of our new algorithms apply the additive combinatorial results on dense subset sums by Galil and Margalit (SICOMP'91), Bringmann and Wellnitz (SODA'21). Such techniques have not been explored in the context of Knapsack prior to our work. In addition, we design several new methods to speed up the divide-and-conquer steps which naturally arise in solving additive problems. △ Less

Submitted 23 January, 2023; originally announced January 2023.

Comments: To appear in SODA 2023. Corrects minor mistakes in Lemma 3.3 and Lemma 3.5 in the proceedings version of this paper

arXiv:2211.12350 [pdf, ps, other]

Indirect magnetic signals mediated by a single surface band in Weyl semimetals

Authors: Hou-Jian Duan, Yong-Jia Wu, Ming-Xun Deng, Ruiqiang Wang, Mou Yang

Abstract: Recently, abundant transport phenomena characterizing the surface states of Weyl semimetals (WSMs) have been reported. To generate these phenomena, electrons have to complete a closed intersurface orbit. Due to the unavoidable impurities in real materials, this orbit would be destroyed by the impurity scattering, which limits the detection of the surface states in WSMs. Here, we investigate the RK… ▽ More Recently, abundant transport phenomena characterizing the surface states of Weyl semimetals (WSMs) have been reported. To generate these phenomena, electrons have to complete a closed intersurface orbit. Due to the unavoidable impurities in real materials, this orbit would be destroyed by the impurity scattering, which limits the detection of the surface states in WSMs. Here, we investigate the RKKY interaction between magnetic impurities, solely mediated by a single surface band, in semi-infinite WSMs. It is found that peculiar oscillations and slowly decaying laws of the RKKY interaction can act as the signals to capture the dispersive nature of the surface states of WSMs. The underlying physics is attributed to two effects: the band-edge effect and the bending effect of the surface band, which can control the RKKY interaction individually or compete with each other to produce more complex magnetic behaviors. In addition, the band-edge effect together with the finite Fermi energy would result in another interesting oscillation with battering pattern. All the results are significantly different from that in previous literatures where surface states have to couple with bulk states (or other surface states of different spins) to generate nonzero magnetic interaction. Compared to the previous models of surface states, the model here is more practical and is helpful for the deeper understanding of the surface magnetic properties in WSMs. △ Less

Submitted 14 December, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

Comments: 12 pages, 9 figures

MSC Class: 81V15 ACM Class: J.2

arXiv:2209.08483 [pdf, other]

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning

Authors: Hua Wei, Jingxiao Chen, Xiyang Ji, Hongyang Qin, Minwen Deng, Siqin Li, Liang Wang, Weinan Zhang, Yong Yu, Lin Liu, Lanxiao Huang, Deheng Ye, Qiang Fu, Wei Yang

Abstract: This paper introduces Honor of Kings Arena, a reinforcement learning (RL) environment based on Honor of Kings, one of the world's most popular games at present. Compared to other environments studied in most previous work, ours presents new generalization challenges for competitive reinforcement learning. It is a multi-agent problem with one agent competing against its opponent; and it requires th… ▽ More This paper introduces Honor of Kings Arena, a reinforcement learning (RL) environment based on Honor of Kings, one of the world's most popular games at present. Compared to other environments studied in most previous work, ours presents new generalization challenges for competitive reinforcement learning. It is a multi-agent problem with one agent competing against its opponent; and it requires the generalization ability as it has diverse targets to control and diverse opponents to compete with. We describe the observation, action, and reward specifications for the Honor of Kings domain and provide an open-source Python-based interface for communicating with the game engine. We provide twenty target heroes with a variety of tasks in Honor of Kings Arena and present initial baseline results for RL-based methods with feasible computing resources. Finally, we showcase the generalization challenges imposed by Honor of Kings Arena and possible remedies to the challenges. All of the software, including the environment-class, are publicly available at https://github.com/tencent-ailab/hok_env . The documentation is available at https://aiarena.tencent.com/hok/doc/ . △ Less

Submitted 18 October, 2022; v1 submitted 18 September, 2022; originally announced September 2022.

Comments: Accepted by NeurIPS 2022

Journal ref: Advances in Neural Information Processing Systems, 2022, 35: 11881-11892

arXiv:2208.13186 [pdf, other]

Large-scale full-programmable quantum walk and its applications

Authors: Yizhi Wang, Yingwen Liu, Junwei Zhan, Shichuan Xue, Yuzhen Zheng, Ru Zeng, Zhihao Wu, Zihao Wang, Qilin Zheng, Dongyang Wang, Weixu Shi, Xiang Fu, Ping Xu, Yang Wang, Yong Liu, Jiangfang Ding, Guangyao Huang, Chunlin Yu, Anqi Huang, Xiaogang Qiang, Mingtang Deng, Weixia Xu, Kai Lu, Xuejun Yang, Junjie Wu

Abstract: With photonics, the quantum computational advantage has been demonstrated on the task of boson sampling. Next, developing quantum-enhanced approaches for practical problems becomes one of the top priorities for photonic systems. Quantum walks are powerful kernels for developing new and useful quantum algorithms. Here we realize large-scale quantum walks using a fully programmable photonic quantum… ▽ More With photonics, the quantum computational advantage has been demonstrated on the task of boson sampling. Next, developing quantum-enhanced approaches for practical problems becomes one of the top priorities for photonic systems. Quantum walks are powerful kernels for developing new and useful quantum algorithms. Here we realize large-scale quantum walks using a fully programmable photonic quantum computing system. The system integrates a silicon quantum photonic chip, enabling the simulation of quantum walk dynamics on graphs with up to 400 vertices and possessing full programmability over quantum walk parameters, including the particle property, initial state, graph structure, and evolution time. In the 400-dimensional Hilbert space, the average fidelity of random entangled quantum states after the whole on-chip circuit evolution reaches as high as 94.29$\pm$1.28$\%$. With the system, we demonstrated exponentially faster hitting and quadratically faster mixing performance of quantum walks over classical random walks, achieving more than two orders of magnitude of enhancement in the experimental hitting efficiency and almost half of the reduction in the experimental evolution time for mixing. We utilize the system to implement a series of quantum applications, including measuring the centrality of scale-free networks, searching targets on Erdös-Rényi networks, distinguishing non-isomorphic graph pairs, and simulating the topological phase of higher-order topological insulators. Our work shows one feasible path for quantum photonics to address applications of practical interests in the near future. △ Less

Submitted 28 August, 2022; originally announced August 2022.

arXiv:2208.07722 [pdf]

doi 10.1109/TGRS.2023.3243042

Unsupervised domain adaptation semantic segmentation of high-resolution remote sensing imagery with invariant domain-level prototype memory

Authors: Jingru Zhu, Ya Guo, Geng Sun, Libo Yang, Min Deng, Jie Chen

Abstract: Semantic segmentation is a key technique involved in automatic interpretation of high-resolution remote sensing (HRS) imagery and has drawn much attention in the remote sensing community. Deep convolutional neural networks (DCNNs) have been successfully applied to the HRS imagery semantic segmentation task due to their hierarchical representation ability. However, the heavy dependency on a large n… ▽ More Semantic segmentation is a key technique involved in automatic interpretation of high-resolution remote sensing (HRS) imagery and has drawn much attention in the remote sensing community. Deep convolutional neural networks (DCNNs) have been successfully applied to the HRS imagery semantic segmentation task due to their hierarchical representation ability. However, the heavy dependency on a large number of training data with dense annotation and the sensitiveness to the variation of data distribution severely restrict the potential application of DCNNs for the semantic segmentation of HRS imagery. This study proposes a novel unsupervised domain adaptation semantic segmentation network (MemoryAdaptNet) for the semantic segmentation of HRS imagery. MemoryAdaptNet constructs an output space adversarial learning scheme to bridge the domain distribution discrepancy between source domain and target domain and to narrow the influence of domain shift. Specifically, we embed an invariant feature memory module to store invariant domain-level context information because the features obtained from adversarial learning only tend to represent the variant feature of current limited inputs. This module is integrated by a category attention-driven invariant domain-level context aggregation module to current pseudo invariant feature for further augmenting the pixel representations. An entropy-based pseudo label filtering strategy is used to update the memory module with high-confident pseudo invariant feature of current target images. Extensive experiments under three cross-domain tasks indicate that our proposed MemoryAdaptNet is remarkably superior to the state-of-the-art methods. △ Less

Submitted 14 February, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

Comments: 17 pages, 12 figures and 8 tables

Journal ref: IEEE Transactions on Geoscience and Remote Sensing, 2023

arXiv:2207.13876 [pdf, other]

Characterization of the John A. Galt telescope for radio holography with CHIME

Authors: Alex Reda, Tristan Pinsonneault-Marotte, Meiling Deng, Mandana Amiri, Kevin Bandura, Arnab Chakraborty, Simon Foreman, Mark Halpern, Alex S. Hill, Carolin Höfer, Joseph Kania, T. L. Landecker, Joshua MacEachern, Kiyoshi Masui, Juan Mena-Parra, Nikola Milutinovic, Laura Newburgh, Anna Ordog, Sourabh Paul, J. Richard Shaw, Seth R. Siegel, Rick Smegal, Haochen Wang, Dallas Wulf

Abstract: The Canadian Hydrogen Intensity Mapping Experiment (CHIME) will measure the 21 cm emission of astrophysical neutral hydrogen to probe large scale structure at redshifts z=0.8-2.5. However, detecting the 21 cm signal beneath substantially brighter foregrounds remains a key challenge. Due to the high dynamic range between 21 cm and foreground emission, an exquisite calibration of instrument systemat… ▽ More The Canadian Hydrogen Intensity Mapping Experiment (CHIME) will measure the 21 cm emission of astrophysical neutral hydrogen to probe large scale structure at redshifts z=0.8-2.5. However, detecting the 21 cm signal beneath substantially brighter foregrounds remains a key challenge. Due to the high dynamic range between 21 cm and foreground emission, an exquisite calibration of instrument systematics, notably the telescope beam, is required to successfully filter out the foregrounds. One technique being used to achieve a high fidelity measurement of the CHIME beam is radio holography, wherein signals from each of CHIME's analog inputs are correlated with the signal from a co-located reference antenna, the 26 m John A. Galt telescope, as the 26 m Galt telescope tracks a bright point source transiting over CHIME. In this work we present an analysis of several of the Galt telescope's properties. We employ driftscan measurements of several bright sources, along with background estimates derived from the 408 MHz Haslam map, to estimate the Galt system temperature. To determine the Galt telescope's beam shape, we perform and analyze a raster scan of the bright radio source Cassiopeia A. Finally, we use early holographic measurements to measure the Galt telescope's geometry with respect to CHIME for the holographic analysis of the CHIME and Galt interferometric data set. △ Less

Submitted 30 September, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

arXiv:2207.12461 [pdf, other]

Antenna characterization for the HIRAX experiment

Authors: Emily R. Kuhn, Benjamin R. B. Saliwanchik, Kevin Bandura, Michele Bianco, H. Cynthia Chiang, Devin Crichton, Meiling Deng, Sindhu Gaddam, Kit Gerodias, Austin Gumba, Maile Harris, Kavilan Moodley, V. Mugundhan, Laura Newburgh, Jeffrey Peterson, Elizabeth Pieters, Anna R. Polish, Alexandre Refregier, Ajith Sampath, Mario G. Santos, Onkabetse Sengate, Jonathan Sievers, Ema Smith, Will Tyndall, Anthony Walters , et al. (2 additional authors not shown)

Abstract: The Hydrogen Intensity and Real-time Analysis eXperiment (HIRAX) aims to improve constraints on the dark energy equation of state through measurements of large-scale structure at high redshift ($0.8<z<2.5$), while serving as a state-of-the-art fast radio burst detector. Bright galactic foregrounds contaminate the 400--800~MHz HIRAX frequency band, so meeting the science goals will require precise… ▽ More The Hydrogen Intensity and Real-time Analysis eXperiment (HIRAX) aims to improve constraints on the dark energy equation of state through measurements of large-scale structure at high redshift ($0.8<z<2.5$), while serving as a state-of-the-art fast radio burst detector. Bright galactic foregrounds contaminate the 400--800~MHz HIRAX frequency band, so meeting the science goals will require precise instrument characterization. In this paper we describe characterization of the HIRAX antenna, focusing on measurements of the antenna beam and antenna noise temperature. Beam measurements of the current HIRAX antenna design were performed in an anechoic chamber and compared to simulations. We report measurement techniques and results, which find a broad and symmetric antenna beam for $ν<$650MHz, and elevated cross-polarization levels and beam asymmetries for $ν>$700MHz. Noise temperature measurements of the HIRAX feeds were performed in a custom apparatus built at Yale. In this system, identical loads, one cryogenic and the other at room temperature, are used to take a differential (Y-factor) measurement from which the noise of the system is inferred. Several measurement sets have been conducted using the system, involving CHIME feeds as well as four of the HIRAX active feeds. These measurements give the first noise temperature measurements of the HIRAX feed, revealing a $\sim$60K noise temperature (relative to 30K target) with 40K peak- to-peak frequency-dependent features, and provide the first demonstration of feed repeatability. Both findings inform current and future feed designs. △ Less

Submitted 25 July, 2022; originally announced July 2022.

Comments: 20 pages, 14 figures, SPIE proceedings

arXiv:2207.06151 [pdf, ps, other]

doi 10.1088/1748-0221/17/11/T11003

Study on SiPM performance at low temperatures between $-60^{\circ}$C and $-20^{\circ}$C

Authors: C. Zhong, F. J. Luo, B. Zheng, X. D. Wang, M. Y. Bu, J. Zou, M. N. Deng

Abstract: Radon is the main background source of dark matter and neutrino experiments. Radon concentration ($\rm mBq/m^3$) measurement by liquid scintillation detector is a highly sensitive method at low temperatures using silicon photomultipliers (SiPMs) arrays. The SiPM performance characteristics are closely related to the lower detection limit of the detector. In this study, we built an automatic and ac… ▽ More Radon is the main background source of dark matter and neutrino experiments. Radon concentration ($\rm mBq/m^3$) measurement by liquid scintillation detector is a highly sensitive method at low temperatures using silicon photomultipliers (SiPMs) arrays. The SiPM performance characteristics are closely related to the lower detection limit of the detector. In this study, we built an automatic and accurate low-temperature measurement system to study the single photoelectron spectrum, SPE resolution, optical crosstalk, and after-pulse of the SiPM at different temperatures. As a result, we obtained the variation trend of the SiPM parameters at different temperatures, and the SiPM optimal working conditions were obtained, which can improve the detector's sensitivity △ Less

Submitted 26 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

arXiv:2205.12548 [pdf, other]

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Authors: Mingkai Deng, Jianyu Wang, Cheng-Ping Hsieh, Yihan Wang, Han Guo, Tianmin Shu, Meng Song, Eric P. Xing, Zhiting Hu

Abstract: Prompting has shown impressive success in enabling large pretrained language models (LMs) to perform diverse NLP tasks, especially when only few downstream data are available. Automatically finding the optimal prompt for each task, however, is challenging. Most existing work resorts to tuning soft prompt (e.g., embeddings) which falls short of interpretability, reusability across LMs, and applicab… ▽ More Prompting has shown impressive success in enabling large pretrained language models (LMs) to perform diverse NLP tasks, especially when only few downstream data are available. Automatically finding the optimal prompt for each task, however, is challenging. Most existing work resorts to tuning soft prompt (e.g., embeddings) which falls short of interpretability, reusability across LMs, and applicability when gradients are not accessible. Discrete prompt, on the other hand, is difficult to optimize, and is often created by "enumeration (e.g., paraphrasing)-then-selection" heuristics that do not explore the prompt space systematically. This paper proposes RLPrompt, an efficient discrete prompt optimization approach with reinforcement learning (RL). RLPrompt formulates a parameter-efficient policy network that generates the desired discrete prompt after training with reward. To overcome the complexity and stochasticity of reward signals by the large LM environment, we incorporate effective reward stabilization that substantially enhances the training efficiency. RLPrompt is flexibly applicable to different types of LMs, such as masked (e.g., BERT) and left-to-right models (e.g., GPTs), for both classification and generation tasks. Experiments on few-shot classification and unsupervised text style transfer show superior performance over a wide range of existing finetuning or prompting methods. Interestingly, the resulting optimized prompts are often ungrammatical gibberish text; and surprisingly, those gibberish prompts are transferrable between different LMs to retain significant performance, indicating LM prompting may not follow human language patterns. △ Less

Submitted 22 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

Comments: EMNLP 2022 Camera Ready. Code available at https://github.com/mingkaid/rl-prompt

arXiv:2202.13484 [pdf, ps, other]

On Problems Related to Unbounded SubsetSum: A Unified Combinatorial Approach

Authors: Mingyang Deng, Xiao Mao, Ziqian Zhong

Abstract: Unbounded SubsetSum is a classical textbook problem: given integers $w_1,w_2,\cdots,w_n\in [1,u],~c,u$, we need to find if there exists $m_1,m_2,\cdots,m_n\in \mathbb{N}$ satisfying $c=\sum_{i=1}^n w_im_i$. In its all-target version, $t\in \mathbb{Z}_+$ is given and answer for all integers $c\in[0,t]$ is required. In this paper, we study three generalizations of this simple problem: All-Target Unb… ▽ More Unbounded SubsetSum is a classical textbook problem: given integers $w_1,w_2,\cdots,w_n\in [1,u],~c,u$, we need to find if there exists $m_1,m_2,\cdots,m_n\in \mathbb{N}$ satisfying $c=\sum_{i=1}^n w_im_i$. In its all-target version, $t\in \mathbb{Z}_+$ is given and answer for all integers $c\in[0,t]$ is required. In this paper, we study three generalizations of this simple problem: All-Target Unbounded Knapsack, All-Target CoinChange and Residue Table. By new combinatorial insights into the structures of solutions, we present a novel two-phase approach for such problems. As a result, we present the first near-linear algorithms for CoinChange and Residue Table, which runs in $\tilde{O}(u+t)$ and $\tilde{O}(u)$ time deterministically. We also show if we can compute $(\min,+)$ convolution for $n$-length arrays in $T(n)$ time, then All-Target Unbounded Knapsack can be solved in $\tilde{O}(T(u)+t)$ time, thus establishing sub-quadratic equivalence between All-Target Unbounded Knapsack and $(\min,+)$ convolution. △ Less

Submitted 27 February, 2022; originally announced February 2022.

arXiv:2202.03627 [pdf]

Single-frame label-free cell tomography at speed of more than 10,000 volumes per second

Authors: Baoliang Ge, Yanping He, Mo Deng, Md Habibur Rahman, Yijin Wang, Ziling Wu, Chung Hong N. Wong, Michael K. Chan, Yi-Ping Ho, Liting Duan, Zahid Yaqoob, Peter T. C. So, George Barbastathis, Renjie Zhou

Abstract: Three-dimensional (3D) image cytometers may significantly improve the cell analysis accuracy to facilitate biological discoveries and clinical diagnosis, but their development is curbed by the low imaging throughput. Here we report SIngle-frame LAbel-free Cell Tomography (SILACT) with diffraction-limited resolution and unprecedented imaging speed of over 10,000 volumes/second. SILACT is built on a… ▽ More Three-dimensional (3D) image cytometers may significantly improve the cell analysis accuracy to facilitate biological discoveries and clinical diagnosis, but their development is curbed by the low imaging throughput. Here we report SIngle-frame LAbel-free Cell Tomography (SILACT) with diffraction-limited resolution and unprecedented imaging speed of over 10,000 volumes/second. SILACT is built on a unique interferometric microscope with angle-multiplexing illumination and a pre-trained physics-incorporating Deep Neural Network for efficient 3D Refractive Index (RI) reconstruction, from which 3D morphological and biophysical parameters of cells are extracted. With microfluidics and a high-speed camera, SILACT is capable of imaging over 20,000 cells/second and distinguishing different cell species during rapid measurements of large cell quantities, as well as visualizing shear-induced 3D transient deformation of red blood cells on a sub-millisecond scale. △ Less

Submitted 7 February, 2022; originally announced February 2022.

Comments: Baoliang Ge, Yanping He, Mo Deng contributed equally to this work

arXiv:2202.01242 [pdf, other]

doi 10.3847/1538-4357/acb13f

Detection of Cosmological 21 cm Emission with the Canadian Hydrogen Intensity Mapping Experiment

Authors: CHIME Collaboration, Mandana Amiri, Kevin Bandura, Tianyue Chen, Meiling Deng, Matt Dobbs, Mateus Fandino, Simon Foreman, Mark Halpern, Alex S. Hill, Gary Hinshaw, Carolin Höfer, Joseph Kania, T. L. Landecker, Joshua MacEachern, Kiyoshi Masui, Juan Mena-Parra, Nikola Milutinovic, Arash Mirhosseini, Laura Newburgh, Anna Ordog, Ue-Li Pen, Tristan Pinsonneault-Marotte, Ava Polzin, Alex Reda , et al. (8 additional authors not shown)

Abstract: We present a detection of 21-cm emission from large-scale structure (LSS) between redshift 0.78 and 1.43 made with the Canadian Hydrogen Intensity Mapping Experiment (CHIME). Radio observations acquired over 102 nights are used to construct maps which are foreground filtered and stacked on the angular and spectral locations of luminous red galaxies (LRG), emission line galaxies (ELG), and quasars… ▽ More We present a detection of 21-cm emission from large-scale structure (LSS) between redshift 0.78 and 1.43 made with the Canadian Hydrogen Intensity Mapping Experiment (CHIME). Radio observations acquired over 102 nights are used to construct maps which are foreground filtered and stacked on the angular and spectral locations of luminous red galaxies (LRG), emission line galaxies (ELG), and quasars (QSO) from the eBOSS clustering catalogs. We find decisive evidence for a detection when stacking on all three tracers of LSS, with the logarithm of the Bayes Factor equal to 18.9 (LRG), 10.8 (ELG), and 56.3 (QSO). An alternative frequentist interpretation, based on the likelihood-ratio test, yields a detection significance of $7.1σ$ (LRG), $5.7σ$ (ELG), and $11.1σ$ (QSO). These are the first 21-cm intensity mapping measurements made with an interferometer. We constrain the effective clustering amplitude of neutral hydrogen (HI), defined as $\mathcal{A}_{\rm HI}\equiv 10^{3}\,Ω_\mathrm{HI}\left(b_\mathrm{HI}+\langle\,fμ^{2}\rangle\right)$, where $Ω_\mathrm{HI}$ is the cosmic abundance of HI, $b_\mathrm{HI}$ is the linear bias of HI, and $\langle\,fμ^{2}\rangle=0.552$ encodes the effect of redshift-space distortions at linear order. We find $\mathcal{A}_\mathrm{HI}=1.51^{+3.60}_{-0.97}$ for LRGs $(z=0.84)$, $\mathcal{A}_\mathrm{HI}=6.76^{+9.04}_{-3.79}$ for ELGs $(z=0.96)$, and $\mathcal{A}_\mathrm{HI}=1.68^{+1.10}_{-0.67}$ for QSOs $(z=1.20)$, with constraints limited by modeling uncertainties at nonlinear scales. We are also sensitive to bias in the spectroscopic redshifts of each tracer, and find a non-zero bias $Δ\,v= -66 \pm 20 \mathrm{km/s}$ for the QSOs. We split the QSO catalog into three redshift bins and have a decisive detection in each, with the upper bin at $z=1.30$ producing the highest redshift 21-cm intensity mapping measurement thus far. △ Less

Submitted 2 February, 2022; originally announced February 2022.

Comments: 66 pages, 30 figures

arXiv:2201.11822 [pdf, other]

doi 10.3847/1538-4357/ac6b9f

Using the Sun to Measure the Primary Beam Response of the Canadian Hydrogen Intensity Mapping Experiment

Authors: CHIME Collaboration, Mandana Amiri, Kevin Bandura, Anja Boskovic, Jean-François Cliche, Meiling Deng, Matt Dobbs, Mateus Fandino, Simon Foreman, Mark Halpern, Alex S. Hill, Gary Hinshaw, Carolin Höfer, Joseph Kania, T. L. Landecker, Joshua MacEachern, Kiyoshi Masui, Juan Mena-Parra, Laura Newburgh, Anna Ordog, Tristan Pinsonneault-Marotte, Ava Polzin, Alex Reda, J. Richard Shaw, Seth R. Siegel , et al. (5 additional authors not shown)

Abstract: We present a beam pattern measurement of the Canadian Hydrogen Intensity Mapping Experiment (CHIME) made using the Sun as a calibration source. As CHIME is a pure drift scan instrument, we rely on the seasonal North-South motion of the Sun to probe the beam at different elevations. This semiannual range in elevation, combined with the radio brightness of the Sun, enables a beam measurement which s… ▽ More We present a beam pattern measurement of the Canadian Hydrogen Intensity Mapping Experiment (CHIME) made using the Sun as a calibration source. As CHIME is a pure drift scan instrument, we rely on the seasonal North-South motion of the Sun to probe the beam at different elevations. This semiannual range in elevation, combined with the radio brightness of the Sun, enables a beam measurement which spans ~7,200 square degrees on the sky without the need to move the telescope. We take advantage of observations made near solar minimum to minimize the impact of solar variability, which is observed to be <10% in intensity over the observation period. The resulting data set is highly complementary to other CHIME beam measurements -- both in terms of angular coverage and systematics -- and plays an important role in the ongoing program to characterize the CHIME primary beam. △ Less

Submitted 3 May, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

Comments: 11 pages, 9 figures, Accepted by ApJ

Journal ref: ApJ 923 100 (2022)

Showing 1–50 of 156 results for author: Deng, M