Skip to main content

Showing 1–50 of 478 results for author: Ye, F

  1. arXiv:2410.14088  [pdf, other

    cs.DC

    Overcoming Memory Constraints in Quantum Circuit Simulation with a High-Fidelity Compression Framework

    Authors: Boyuan Zhang, Bo Fang, Fanjiang Ye, Yida Gu, Nathan Tallent, Guangming Tan, Dingwen Tao

    Abstract: Full-state quantum circuit simulation requires exponentially increased memory size to store the state vector as the number of qubits scales, presenting significant limitations in classical computing systems. Our paper introduces BMQSim, a novel state vector quantum simulation framework that employs lossy compression to address the memory constraints on graphics processing unit (GPU) machines. BMQS… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  2. arXiv:2410.13994  [pdf, other

    cond-mat.str-el

    Vacancy-induced suppression of CDW order and its impact on magnetic order in kagome antiferromagnet FeGe

    Authors: Mason L. Klemm, Saif Siddique, Yuan-Chun Chang, Sijie Xu, Yaofeng Xie, Tanner Legvold, Mehrdad T. Kiani, Feng Ye, Huibo Cao, Yiqing Hao, Wei Tian, Hubertus Luetkens, Masaaki Matsuda, Douglas Natelson, Zurab Guguchia, Chien-Lung Huang, Ming Yi, Judy J. Cha, Pengcheng Dai

    Abstract: Two-dimensional (2D) kagome lattice metals are interesting because they display flat electronic bands, Dirac points, Van Hove singularities, and can have interplay between charge density wave (CDW), magnetic order, and superconductivity. In kagome lattice antiferromagnet FeGe, a short-range CDW order was found deep within an antiferromagnetically ordered state, interacting with the magnetic order.… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  3. arXiv:2410.13782  [pdf, other

    cs.LG q-bio.QM

    DPLM-2: A Multimodal Diffusion Protein Language Model

    Authors: Xinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, Quanquan Gu

    Abstract: Proteins are essential macromolecules defined by their amino acid sequences, which determine their three-dimensional structures and, consequently, their functions in all living organisms. Therefore, generative protein modeling necessitates a multimodal approach to simultaneously model, understand, and generate both sequences and structures. However, existing methods typically use separate models f… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  4. arXiv:2410.12457  [pdf, other

    cs.LG cs.AI

    Sharpness-Aware Black-Box Optimization

    Authors: Feiyang Ye, Yueming Lyu, Xuehao Wang, Masashi Sugiyama, Yu Zhang, Ivor Tsang

    Abstract: Black-box optimization algorithms have been widely used in various machine learning problems, including reinforcement learning and prompt fine-tuning. However, directly optimizing the training loss value, as commonly done in existing black-box optimization methods, could lead to suboptimal model quality and generalization performance. To address those problems in black-box optimization, we propose… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 27 pages, 5 figures

  5. Spontaneous Symmetry Breaking In Nonlinear Binary Periodic Systems

    Authors: Ruihan Peng, Qidong Fu, Yejia Chen, Weidong Luo, Changming Huang, Fangwei Ye

    Abstract: Spontaneous symmetry breaking (SSB) occurs when modes of asymmetric profile appear in a symmetric, double-well potential, due to the nonlinearity of the potential exceeding a critical value. In this study, we examine SSB in a periodic potential where the unit cell itself is a symmetric double-well, in both one-dimensional and two-dimensional periodic systems. Using the tight-binding model, we deri… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

    Journal ref: Phys. Rev. A 110, 043513 (2024)

  6. arXiv:2410.03090  [pdf, other

    cs.CL cs.LG

    UNComp: Uncertainty-Aware Long-Context Compressor for Efficient Large Language Model Inference

    Authors: Jing Xiong, Jianghan Shen, Fanghua Ye, Chaofan Tao, Zhongwei Wan, Jianqiao Lu, Xun Wu, Chuanyang Zheng, Zhijiang Guo, Lingpeng Kong, Ngai Wong

    Abstract: Deploying large language models (LLMs) is challenging due to their high memory and computational demands, especially during long-context inference. While key-value (KV) caching accelerates inference by reusing previously computed keys and values, it also introduces significant memory overhead. Existing KV cache compression methods such as eviction and merging typically compress the KV cache after… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  7. arXiv:2410.02719  [pdf, other

    cs.CL

    UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation

    Authors: Zixuan Li, Jing Xiong, Fanghua Ye, Chuanyang Zheng, Xun Wu, Jianqiao Lu, Zhongwei Wan, Xiaodan Liang, Chengming Li, Zhenan Sun, Lingpeng Kong, Ngai Wong

    Abstract: We present UncertaintyRAG, a novel approach for long-context Retrieval-Augmented Generation (RAG) that utilizes Signal-to-Noise Ratio (SNR)-based span uncertainty to estimate similarity between text chunks. This span uncertainty enhances model calibration, improving robustness and mitigating semantic inconsistencies introduced by random chunking. Leveraging this insight, we propose an efficient un… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  8. arXiv:2409.15119  [pdf, other

    cs.AI

    Log-normal Mutations and their Use in Detecting Surreptitious Fake Images

    Authors: Ismail Labiad, Thomas Bäck, Pierre Fernandez, Laurent Najman, Tom Sander, Furong Ye, Mariia Zameshina, Olivier Teytaud

    Abstract: In many cases, adversarial attacks are based on specialized algorithms specifically dedicated to attacking automatic image classifiers. These algorithms perform well, thanks to an excellent ad hoc distribution of initial attacks. However, these attacks are easily detected due to their specific initial distribution. We therefore consider other black-box attacks, inspired from generic black-box opti… ▽ More

    Submitted 25 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: log-normal mutations and their use in detecting surreptitious fake images

  9. Structural and electronic transformations in TiO2 induced by electric current

    Authors: Tyler C. Sterling, Feng Ye, Seohyeon Jo, Anish Parulekar, Yu Zhang, Gang Cao, Rishi Raj, Dmitry Reznik

    Abstract: In-situ diffuse neutron scattering experiments revealed that when electric current is passed through single crystals of rutile TiO2 under conditions conducive to flash sintering, it induces the formation of parallel planes of oxygen vacancies. Specifically, a current perpendicular to the c-axis generates planes normal to the (132) reciprocal lattice vector, whereas currents aligned with the c-axis… ▽ More

    Submitted 21 October, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

  10. arXiv:2409.08070  [pdf, other

    physics.optics

    All-optical Fourier neural network using partially coherent light

    Authors: Jianwei Qin, Yanbing Liu, Yan Liu, Xun Liu, Wei Li, Fangwei Ye

    Abstract: Optical neural networks present distinct advantages over traditional electrical counterparts, such as accelerated data processing and reduced energy consumption. While coherent light is conventionally employed in optical neural networks, our study proposes harnessing spatially incoherent light in all-optical Fourier neural networks. Contrary to numerical predictions of declining target recognition… ▽ More

    Submitted 20 September, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

    Comments: 19 pages,5 figures

  11. arXiv:2409.06744  [pdf, other

    q-bio.QM cs.AI cs.LG q-bio.BM

    ProteinBench: A Holistic Evaluation of Protein Foundation Models

    Authors: Fei Ye, Zaixiang Zheng, Dongyu Xue, Yuning Shen, Lihao Wang, Yiming Ma, Yan Wang, Xinyou Wang, Xiangxin Zhou, Quanquan Gu

    Abstract: Recent years have witnessed a surge in the development of protein foundation models, significantly improving performance in protein prediction and generative tasks ranging from 3D structure prediction and protein design to conformational dynamics. However, the capabilities and limitations associated with these models remain poorly understood due to the absence of a unified evaluation framework. To… ▽ More

    Submitted 7 October, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 30 pages, 2 figures and 15 tables

  12. arXiv:2409.05324  [pdf, other

    cs.CV

    FIF-UNet: An Efficient UNet Using Feature Interaction and Fusion for Medical Image Segmentation

    Authors: Xiaolin Gou, Chuanlin Liao, Jizhe Zhou, Fengshuo Ye, Yi Lin

    Abstract: Nowadays, pre-trained encoders are widely used in medical image segmentation because of their ability to capture complex feature representations. However, the existing models fail to effectively utilize the rich features obtained by the pre-trained encoder, resulting in suboptimal segmentation results. In this work, a novel U-shaped model, called FIF-UNet, is proposed to address the above issue, i… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  13. arXiv:2409.00578  [pdf, other

    cond-mat.stat-mech

    Exact moments for a run and tumble particle in a harmonic trap with a finite tumble time

    Authors: Aoran Sun, Fangfu Ye, Rudolf Podgornik

    Abstract: We study the problem of a run and tumble particle in a harmonic trap, with a finite run and tumble time, by a direct integration of the equation of motion. An exact 1D steady state distribution, diagram laws and a programmable Volterra difference equation are derived to calculate any order of moments in any other dimension, both for steady state as well as the Laplace transform in time for the int… ▽ More

    Submitted 31 August, 2024; originally announced September 2024.

    Comments: 12 pages 5 figures

  14. arXiv:2407.17011  [pdf, other

    cs.CL

    Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism

    Authors: Anhao Zhao, Fanghua Ye, Jinlan Fu, Xiaoyu Shen

    Abstract: Large language models (LLMs) exhibit remarkable in-context learning (ICL) capabilities. However, the underlying working mechanism of ICL remains poorly understood. Recent research presents two conflicting views on ICL: One emphasizes the impact of similar examples in the demonstrations, stressing the need for label correctness and more shots. The other attributes it to LLMs' inherent ability of ta… ▽ More

    Submitted 9 October, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

  15. arXiv:2407.16531  [pdf, other

    cond-mat.str-el quant-ph

    1-Form Symmetric Projected Entangled-Pair States

    Authors: Yi Tan, Ji-Yao Chen, Didier Poilblanc, Fei Ye, Jia-Wei Mei

    Abstract: The 1-form symmetry, manifesting as loop-like symmetries, has gained prominence in the study of quantum phases, deepening our understanding of symmetry. However, the role of 1-form symmetries in Projected Entangled-Pair States (PEPS), two-dimensional tensor network states, remains largely underexplored. We present a novel framework for understanding 1-form symmetries within tensor networks, specif… ▽ More

    Submitted 1 August, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: 6 pages, 1 figure. In memory of T. M. Rice for his invaluable teaching on the physics of strongly correlated systems

  16. arXiv:2407.15399  [pdf, other

    cs.CL cs.AI cs.CR

    Imposter.AI: Adversarial Attacks with Hidden Intentions towards Aligned Large Language Models

    Authors: Xiao Liu, Liangzhi Li, Tong Xiang, Fuying Ye, Lu Wei, Wangyue Li, Noa Garcia

    Abstract: With the development of large language models (LLMs) like ChatGPT, both their vast applications and potential vulnerabilities have come to the forefront. While developers have integrated multiple safety mechanisms to mitigate their misuse, a risk remains, particularly when models encounter adversarial inputs. This study unveils an attack mechanism that capitalizes on human conversation strategies… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  17. arXiv:2407.13246  [pdf, other

    cs.CV

    STS MICCAI 2023 Challenge: Grand challenge on 2D and 3D semi-supervised tooth segmentation

    Authors: Yaqi Wang, Yifan Zhang, Xiaodiao Chen, Shuai Wang, Dahong Qian, Fan Ye, Feng Xu, Hongyuan Zhang, Qianni Zhang, Chengyu Wu, Yunxiang Li, Weiwei Cui, Shan Luo, Chengkai Wang, Tianhao Li, Yi Liu, Xiang Feng, Huiyu Zhou, Dongyun Liu, Qixuan Wang, Zhouhao Lin, Wei Song, Yuanlin Li, Bing Wang, Chunshi Wang , et al. (2 additional authors not shown)

    Abstract: Computer-aided design (CAD) tools are increasingly popular in modern dental practice, particularly for treatment planning or comprehensive prognosis evaluation. In particular, the 2D panoramic X-ray image efficiently detects invisible caries, impacted teeth and supernumerary teeth in children, while the 3D dental cone beam computed tomography (CBCT) is widely used in orthodontics and endodontics d… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  18. arXiv:2407.07640  [pdf, other

    cond-mat.str-el

    Single Crystal Diffuse Neutron Scattering Study of the Dipole-Octupole Quantum Spin Ice Candidate Ce$_2$Zr$_2$O$_7$: No Apparent Octupolar Correlations Above $T = 0.05$ K

    Authors: E. M. Smith, R. Schäfer, J. Dudemaine, B. Placke, B. Yuan, Z. Morgan, F. Ye, R. Moessner, O. Benton, A. D. Bianchi, B. D. Gaulin

    Abstract: The insulating magnetic pyrochlore Ce$_2$Zr$_2$O$_7$ has attracted much attention as a quantum spin ice candidate with dipole-octupole character that permits spin ice phases based not only on magnetic dipole moments but also allows for even-more-exotic octupole-based spin ice phases. This work reports low-temperature neutron diffraction measurements on single crystal Ce$_2$Zr$_2$O$_7$ with $Q$-cov… ▽ More

    Submitted 5 October, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  19. arXiv:2407.05439  [pdf, other

    cond-mat.mes-hall quant-ph

    Stabilizing an individual charge fluctuator in a Si/SiGe quantum dot

    Authors: Feiyang Ye, Ammar Ellaboudy, John M. Nichol

    Abstract: Charge noise is a major obstacle to improved gate fidelities in silicon spin qubits. Numerous methods exist to mitigate charge noise, including improving device fabrication, dynamical decoupling, and real-time parameter estimation. In this work, we demonstrate a new class of techniques to mitigate charge noise in semiconductor quantum dots by controlling the noise sources themselves. Using two dif… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  20. arXiv:2407.04272  [pdf, other

    cs.LG cs.DC

    Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

    Authors: Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao

    Abstract: DLRM is a state-of-the-art recommendation system model that has gained widespread adoption across various industry applications. The large size of DLRM models, however, necessitates the use of multiple devices/GPUs for efficient training. A significant bottleneck in this process is the time-consuming all-to-all communication required to collect embedding data from all devices. To mitigate this, we… ▽ More

    Submitted 1 October, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: camera-ready version for SC '24

  21. arXiv:2407.02015  [pdf, ps, other

    math.NA

    Robust First and Second-Order Differentiation for Regularized Optimal Transport

    Authors: Xingjie Li, Fei Lu, Molei Tao, Felix X. -F. Ye

    Abstract: Applications such as unbalanced and fully shuffled regression can be approached by optimizing regularized optimal transport (OT) distances, such as the entropic OT and Sinkhorn distances. A common approach for this optimization is to use a first-order optimizer, which requires the gradient of the OT distance. For faster convergence, one might also resort to a second-order optimizer, which addition… ▽ More

    Submitted 20 October, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    MSC Class: 68Q25; 68R10; 68U05

  22. arXiv:2407.01445  [pdf, other

    cs.LG cs.CV

    FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources

    Authors: Xiyuan Wei, Fanjiang Ye, Ori Yonay, Xingyu Chen, Baixi Sun, Dingwen Tao, Tianbao Yang

    Abstract: Existing studies of training state-of-the-art Contrastive Language-Image Pretraining (CLIP) models on large-scale data involve hundreds of or even thousands of GPUs due to the requirement of a large batch size. However, such a large amount of resources is not accessible to most people. While advanced compositional optimization techniques for optimizing global contrastive losses have been demonstra… ▽ More

    Submitted 2 October, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 29 pages

  23. arXiv:2406.13922  [pdf, ps, other

    cs.IT

    Explicit Performance Bound of Finite Blocklength Coded MIMO: Time-Domain versus Spatiotemporal Channel Coding

    Authors: Feng Ye, Xiaohu You, Jiamin Li, Chuan Zhang, Chen Ji

    Abstract: In the sixth generation (6G), ultra-reliable low-latency communications (URLLC) will further develop to achieve TKu extreme connectivity, and multiple-input multiple-output (MIMO) is expected to be a key enabler for its realization. Since the latency constraint can be represented by the blocklength of a codeword, it is essential to analyze different coded MIMO schemes under finite blocklength regi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures

  24. arXiv:2406.13249  [pdf, other

    cs.CL cs.AI cs.IR

    R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation

    Authors: Fuda Ye, Shuangyin Li, Yongqi Zhang, Lei Chen

    Abstract: Retrieval augmented generation (RAG) has been applied in many scenarios to augment large language models (LLMs) with external documents provided by retrievers. However, a semantic gap exists between LLMs and retrievers due to differences in their training objectives and architectures. This misalignment forces LLMs to passively accept the documents provided by the retrievers, leading to incomprehen… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  25. arXiv:2406.12844  [pdf, other

    cs.LG cs.AI

    Synergizing Foundation Models and Federated Learning: A Survey

    Authors: Shenghui Li, Fanghua Ye, Meng Fang, Jiaxu Zhao, Yun-Hin Chan, Edith C. -H. Ngai, Thiemo Voigt

    Abstract: The recent development of Foundation Models (FMs), represented by large language models, vision transformers, and multimodal models, has been making a significant impact on both academia and industry. Compared with small-scale models, FMs have a much stronger demand for high-volume data during the pre-training phase. Although general FMs can be pre-trained on data collected from open sources such… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  26. arXiv:2406.00114  [pdf, other

    cs.RO cs.NE

    Dynamic Multi-Objective Lion Swarm Optimization with Multi-strategy Fusion: An application in 6R robot trajectory planning

    Authors: Bao Liu, Tianbao Liu, Zhongshuo Hu, Fei Ye, Lei Gao

    Abstract: The advancement of industrialization has spurred the development of innovative swarm intelligence algorithms, with Lion Swarm Optimization (LSO) notable for its robustness, parallelism, simplicity, and efficiency. While LSO excels in single-objective optimization, its multi-objective variants face challenges such as poor initialization, local optima entrapment, and so on. This study proposes Dynam… ▽ More

    Submitted 7 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

  27. arXiv:2405.18973  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Codimension-Two Spiral Spin-Liquid in the Effective Honeycomb-Lattice Compound Cs$_3$Fe$_2$Cl$_9$

    Authors: Shang Gao, Chris Pasco, Otkur Omar, Qiang Zhang, Daniel M. Pajerowski, Feng Ye, Matthias Frontzek, Andrew F. May, Matthew B. Stone, Andrew D. Christianson

    Abstract: A codimension-two spiral spin-liquid is a correlated paramagnetic state with one-dimensional ground state degeneracy hosted within a three-dimensional lattice. Here, via neutron scattering experiments and numerical simulations, we establish the existence of a codimension-two spiral spin-liquid in the effective honeycomb-lattice compound Cs$_3$Fe$_2$Cl$_9$ and demonstrate the selective visibility o… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 21 pages, 23 figures

  28. arXiv:2405.16252  [pdf, other

    math.GT math.DG

    2-torsion in instanton Floer homology

    Authors: Zhenkun Li, Fan Ye

    Abstract: This paper studies the existence of $2$-torsion in instanton Floer homology with $\mathbb{Z}$ coefficients for closed $3$-manifolds and singular knots. First, we show that the non-existence of $2$-torsion in the framed instanton Floer homology $I^\sharp(S_n^3(K);\mathbb{Z})$ of any nonzero integral $n$-surgery along a knot $K$ in $S^3$ would imply that $K$ is fibered. Also, we show that… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 41 pages, 17 figures; comments are welcome

  29. arXiv:2405.13249  [pdf

    cond-mat.mtrl-sci

    Structural Properties of Plastically Deformed SrTiO3 and KTaO3

    Authors: Issam Khayr, Sajna Hameed, Jakov Budić, Xing He, Richard Spieker, Ana Najev, Zinan Zhao, Li Yue, Matthew Krogstad, Feng Ye, Yaohua Liu, Raymond Osborn, Stephan Rosenkranz, Yuan Li, Damjan Pelc, Martin Greven

    Abstract: Dislocation engineering has the potential to open new avenues toward the exploration and modification of the properties of quantum materials. Strontium titanate (SrTiO3, STO) and potassium tantalate (KTaO3, KTO) are incipient ferroelectrics that show metallization and superconductivity at extremely low charge carrier concentrations, and have been the subject of resurgent interest. These materials… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  30. arXiv:2405.13215  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Role of stacking defects on the magnetic behavior of CrCl$_3$

    Authors: John A. Schneeloch, Adam A. Aczel, Feng Ye, Despina Louca

    Abstract: In the study of van der Waals-layered magnetic materials, the properties of CrCl$_3$ continue to attract attention. This compound is reported to undergo antiferromagnetic (AFM) ordering below $\sim$14 K, with a ferromagneticlike region proposed to exist between 14 and 17 K. Ideally, the crystal structure is rhombohedral (R) below $\sim$235 K, separated from a higher-temperature monoclinic (M) phas… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Supplement is part of pdf file

  31. arXiv:2405.13034  [pdf, other

    cs.CL cs.AI cs.HC

    Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality

    Authors: Jiahuan Pei, Irene Viola, Haochen Huang, Junxiao Wang, Moonisa Ahsan, Fanghua Ye, Jiang Yiming, Yao Sai, Di Wang, Zhumin Chen, Pengjie Ren, Pablo Cesar

    Abstract: Autonomous artificial intelligence (AI) agents have emerged as promising protocols for automatically understanding the language-based environment, particularly with the exponential development of large language models (LLMs). However, a fine-grained, comprehensive understanding of multimodal environments remains under-explored. This work designs an autonomous workflow tailored for integrating AI a… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024

  32. arXiv:2405.10411  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Nanoscale structural correlations in a model cuprate superconductor

    Authors: Zachary W. Anderson, Marin Spaić, Nikolaos Biniskos, Liam Thompson, Biqiong Yu, Jack Zwettler, Yaohua Liu, Feng Ye, Garrett E. Granroth, Matthew Krogstad, Raymond Osborn, Damjan Pelc, Martin Greven

    Abstract: Understanding the extent and role of inhomogeneity is a pivotal challenge in the physics of cuprate superconductors. While it is known that structural and electronic inhomogeneity is prevalent in the cuprates, it has proven difficult to disentangle compound-specific features from universally relevant effects. Here we combine advanced neutron and x-ray diffuse scattering with numerical modeling to… ▽ More

    Submitted 14 October, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures. V2 changes: exchanged Fig. 4 and 5.; added neutron panels to Fig. 4.; fixed incorrect neutron panels in Fig. 5 and updated related text; added 3D-DeltaPDF resolution paragraph: clarified text about correlation lengths, temperature control, absence of occupational disorder and CuO2 plane buckling; removed appendix; typo fixes, small changes to phrasing, etc

  33. arXiv:2405.05542  [pdf, other

    cs.RO cs.MA

    Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning

    Authors: Yuchen Shi, Shihong Duan, Cheng Xu, Ran Wang, Fangwen Ye, Chau Yuen

    Abstract: This work introduces a novel value decomposition algorithm, termed \textit{Dynamic Deep Factor Graphs} (DDFG). Unlike traditional coordination graphs, DDFG leverages factor graphs to articulate the decomposition of value functions, offering enhanced flexibility and adaptability to complex value function structures. Central to DDFG is a graph structure generation policy that innovatively generates… ▽ More

    Submitted 7 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE TPAMI

  34. arXiv:2405.03692  [pdf, other

    eess.IV cs.NI eess.SY

    Imitation Learning for Adaptive Video Streaming with Future Adversarial Information Bottleneck Principle

    Authors: Shuoyao Wang, Jiawei Lin, Fangwei Ye

    Abstract: Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcement learning (RL)-based ABR algorithms may benefit the average Quality of Experience (QoE) but suffers from fluctuating performance in individual video sessions. In this paper, we present a novel appr… ▽ More

    Submitted 12 March, 2024; originally announced May 2024.

    Comments: submitted to IEEE Journal

  35. arXiv:2404.13396  [pdf

    cond-mat.mtrl-sci

    Angle-Resolved Magneto-Chiral Anisotropy in a Non-Centrosymmetric Atomic Layer Superlattice

    Authors: Long Cheng, Mingrui Bao, Jingxian Zhang, Xue Zhang, Qun Yang, Qiang Li, Hui Cao, Dawei Qiu, Jia Liu, Fei Ye, Qing Wang, Genhao Liang, Hui Li, Guanglei Cheng, Hua Zhou, Jian-Min Zuo, Xiaodong Zhou, Jian Shen, Zhifeng Zhu, Sai Mu, Wenbo Wang, Xiaofang Zhai

    Abstract: Chirality in solid-state materials has sparked significant interest due to potential applications of topologically-protected chiral states in next-generation information technology. The electrical magneto-chiral effect (eMChE), arising from relativistic spin-orbit interactions, shows great promise for developing chiral materials and devices for electronic integration. Here we demonstrate an angle-… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  36. arXiv:2403.10971  [pdf, other

    cs.CV

    Task-Aware Low-Rank Adaptation of Segment Anything Model

    Authors: Xuehao Wang, Feiyang Ye, Yu Zhang

    Abstract: The Segment Anything Model (SAM), with its remarkable zero-shot capability, has been proven to be a powerful foundation model for image segmentation tasks, which is an important task in computer vision. However, the transfer of its rich semantic information to multiple different downstream tasks remains unexplored. In this paper, we propose the Task-Aware Low-Rank Adaptation (TA-LoRA) method, whic… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  37. arXiv:2403.06568  [pdf, other

    cs.AI

    Better Understandings and Configurations in MaxSAT Local Search Solvers via Anytime Performance Analysis

    Authors: Furong Ye, Chuan Luo, Shaowei Cai

    Abstract: Though numerous solvers have been proposed for the MaxSAT problem, and the benchmark environment such as MaxSAT Evaluations provides a platform for the comparison of the state-of-the-art solvers, existing assessments were usually evaluated based on the quality, e.g., fitness, of the best-found solutions obtained within a given running time budget. However, concerning solely the final obtained solu… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  38. arXiv:2403.06144  [pdf, other

    cs.CY

    Simulating Family Conversations using LLMs: Demonstration of Parenting Styles

    Authors: Frank Tian-fang Ye, Xiaozi Gao

    Abstract: This study presents a framework for conducting psychological and linguistic research through simulated conversations using large language models (LLMs). The proposed methodology offers significant advantages, particularly for simulating human interactions involving potential unethical language or behaviors that would be impermissible in traditional experiments with human participants. As a demonst… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  39. arXiv:2403.03310  [pdf, other

    quant-ph cs.LG

    Graph Learning for Parameter Prediction of Quantum Approximate Optimization Algorithm

    Authors: Zhiding Liang, Gang Liu, Zheyuan Liu, Jinglei Cheng, Tianyi Hao, Kecheng Liu, Hang Ren, Zhixin Song, Ji Liu, Fanny Ye, Yiyu Shi

    Abstract: In recent years, quantum computing has emerged as a transformative force in the field of combinatorial optimization, offering novel approaches to tackling complex problems that have long challenged classical computational methods. Among these, the Quantum Approximate Optimization Algorithm (QAOA) stands out for its potential to efficiently solve the Max-Cut problem, a quintessential example of com… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  40. arXiv:2402.18567  [pdf, other

    cs.LG q-bio.BM

    Diffusion Language Models Are Versatile Protein Learners

    Authors: Xinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, Quanquan Gu

    Abstract: This paper introduces diffusion protein language model (DPLM), a versatile protein language model that demonstrates strong generative and predictive capabilities for protein sequences. We first pre-train scalable DPLMs from evolutionary-scale protein sequences within a generative self-supervised discrete diffusion probabilistic framework, which generalizes language modeling for proteins in a princ… ▽ More

    Submitted 16 October, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: ICML 2024 camera-ready version

  41. arXiv:2402.18070  [pdf, other

    cs.AR eess.SP

    A Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband Processing

    Authors: Limin Jiang, Yi Shi, Haiqin Hu, Qingyu Deng, Siyi Xu, Yintao Liu, Feng Yuan, Si Wang, Yihao Shen, Fangfang Ye, Shan Cao, Zhiyuan Jiang

    Abstract: Wireless baseband processing (WBP) is a key element of wireless communications, with a series of signal processing modules to improve data throughput and counter channel fading. Conventional hardware solutions, such as digital signal processors (DSPs) and more recently, graphic processing units (GPUs), provide various degrees of parallelism, yet they both fail to take into account the cyclical and… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 7 figures, conference

  42. arXiv:2402.16983  [pdf, other

    cond-mat.str-el

    Thermal evolution of spin excitations in honeycomb Ising antiferromagnetic FePSe3

    Authors: Lebing Chen, Xiaokun Teng, Ding Hu, Feng Ye, Garrett E. Granroth, Ming Yi, Jae-Ho Chung, Robert J. Birgeneau, Pengcheng Dai

    Abstract: We use elastic and inelastic neutron scattering (INS) to study the antiferromagnetic (AF) phase transitions and spin excitations in the two-dimensional (2D) zig-zag antiferromagnet FePSe$_3$. By determining the magnetic order parameter across the AF phase transition, we conclude that the AF phase transition in FePSe$_3$ is first-order in nature. In addition, our INS measurements reveal that the sp… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  43. arXiv:2402.07654  [pdf, other

    cs.NE

    Impact of spatial transformations on landscape features of CEC2022 basic benchmark problems

    Authors: Haoran Yin, Diederick Vermetten, Furong Ye, Thomas H. W. Bäck, Anna V. Kononova

    Abstract: When benchmarking optimization heuristics, we need to take care to avoid an algorithm exploiting biases in the construction of the used problems. One way in which this might be done is by providing different versions of each problem but with transformations applied to ensure the algorithms are equipped with mechanisms for successfully tackling a range of problems. In this paper, we investigate sev… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  44. arXiv:2402.07616  [pdf, other

    cs.CL cs.AI

    Anchor-based Large Language Models

    Authors: Jianhui Pang, Fanghua Ye, Derek Fai Wong, Xin He, Wanshun Chen, Longyue Wang

    Abstract: Large language models (LLMs) predominantly employ decoder-only transformer architectures, necessitating the retention of keys/values information for historical tokens to provide contextual information and avoid redundant computation. However, the substantial size and parameter volume of these LLMs require massive GPU memory. This memory demand increases with the length of the input text, leading t… ▽ More

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: The paper has been accepted by the ACL2024 conference. Work was done when Jianhui Pang and Fanghua Ye were interning at Tencent AI Lab

  45. arXiv:2401.17669  [pdf, other

    eess.SP

    Compression before Fusion: Broadcast Semantic Communication System for Heterogeneous Tasks

    Authors: Mingze Gong, Shuoyao Wang, Fangwei Ye, Suzhi Bi

    Abstract: Semantic communication has emerged as new paradigm shifts in 6G from the conventional syntax-oriented communications. Recently, the wireless broadcast technology has been introduced to support semantic communication system toward higher communication efficiency. Nevertheless, existing broadcast semantic communication systems target on general representation within one stage and fail to balance the… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  46. arXiv:2401.17141  [pdf, other

    cond-mat.str-el

    Incipient nematicity from electron flat bands in a kagome metal

    Authors: Nathan Drucker, Thanh Nguyen, Manasi Mandal, Phum Siriviboon, Yujie Quan, Artittaya Boonkird, Ryotaro Okabe, Fankang Li, Kaleb Buragge, Fumiaki Funuma, Masaaki Matsuda, Douglas Abernathy, Travis Williams, Songxue Chi, Feng Ye, Christie Nelson, Bolin Liao, Pavel Volkov, Mingda Li

    Abstract: Engineering new quantum phases requires fine tuning of the electronic, orbital, spin, and lattice degrees of freedom. To this end, the kagome lattice with flat bands has garnered great attention by hosting various topological and correlated phases, when the flat band is at the Fermi level. Here we discover unconventional nematiciy in kagome metal CoSn, where flat bands are fully occupied below the… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 37 pages, 5 main figures, 14 supplementary figures

  47. arXiv:2401.14541  [pdf, other

    cond-mat.mes-hall quant-ph

    Characterization of individual charge fluctuators in Si/SiGe quantum dots

    Authors: Feiyang Ye, Ammar Ellaboudy, Dylan Albrecht, Rohith Vudatha, N. Tobias Jacobson, John M. Nichol

    Abstract: Electron spins in silicon quantum dots are excellent qubits due to their long coherence times, scalability, and compatibility with advanced semiconductor technology. Although high gate fidelities can be achieved with spin qubits, charge noise in the semiconductor environment still hinders further improvements. Despite the importance of charge noise, key questions about the specific nature of the f… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  48. SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning

    Authors: Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian

    Abstract: Elucidating the reasoning process with structured explanations from question to answer is crucial, as it significantly enhances the interpretability, traceability, and trustworthiness of question-answering (QA) systems. However, structured explanations demand models to perform intricately structured reasoning, which poses great challenges. Most existing methods focus on single-step reasoning throu… ▽ More

    Submitted 27 September, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Camera ready version for ACL 2024 Main Conference

  49. arXiv:2401.12794  [pdf, other

    cs.CL

    Benchmarking LLMs via Uncertainty Quantification

    Authors: Fanghua Ye, Mingming Yang, Jianhui Pang, Longyue Wang, Derek F. Wong, Emine Yilmaz, Shuming Shi, Zhaopeng Tu

    Abstract: The proliferation of open-source Large Language Models (LLMs) from various institutions has highlighted the urgent need for comprehensive evaluation methods. However, current evaluation platforms, such as the widely recognized HuggingFace open LLM leaderboard, neglect a crucial aspect -- uncertainty, which is vital for thoroughly assessing LLMs. To bridge this gap, we introduce a new benchmarking… ▽ More

    Submitted 25 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 25 pages, preprints

  50. arXiv:2401.11929  [pdf, other

    cs.LG

    Parsimony or Capability? Decomposition Delivers Both in Long-term Time Series Forecasting

    Authors: Jinliang Deng, Feiyang Ye, Du Yin, Xuan Song, Ivor W. Tsang, Hui Xiong

    Abstract: Long-term time series forecasting (LTSF) represents a critical frontier in time series analysis, characterized by extensive input sequences, as opposed to the shorter spans typical of traditional approaches. While longer sequences inherently offer richer information for enhanced predictive precision, prevailing studies often respond by escalating model complexity. These intricate models can inflat… ▽ More

    Submitted 16 October, 2024; v1 submitted 22 January, 2024; originally announced January 2024.