Skip to main content

Showing 1–50 of 5,051 results for author: Yang, Z

  1. arXiv:2410.15891  [pdf, other

    cs.GR cs.CV

    TexPro: Text-guided PBR Texturing with Procedural Material Modeling

    Authors: Ziqiang Dang, Wenqi Dong, Zesong Yang, Bangbang Yang, Liang Li, Yuewen Ma, Zhaopeng Cui

    Abstract: In this paper, we present TexPro, a novel method for high-fidelity material generation for input 3D meshes given text prompts. Unlike existing text-conditioned texture generation methods that typically generate RGB textures with baked lighting, TexPro is able to produce diverse texture maps via procedural material modeling, which enables physical-based rendering, relighting, and additional benefit… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: In submission. Supplementary material is included at the end of the main paper (5 pages, 2 figures)

  2. arXiv:2410.15698  [pdf, other

    cs.LG

    Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces

    Authors: Jifeng Hu, Sili Huang, Li Shen, Zhejian Yang, Shengchao Hu, Shisong Tang, Hechang Chen, Yi Chang, Dacheng Tao, Lichao Sun

    Abstract: Continual offline reinforcement learning (CORL) has shown impressive ability in diffusion-based lifelong learning systems by modeling the joint distributions of trajectories. However, most research only focuses on limited continual task settings where the tasks have the same observation and action space, which deviates from the realistic demands of training agents in various environments. In view… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  3. arXiv:2410.15281  [pdf, other

    cs.RO cs.AI cs.CL cs.HC

    Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Simulation, and Real-Vehicle Experiment

    Authors: Can Cui, Yunsheng Ma, Zichong Yang, Yupeng Zhou, Peiran Liu, Juanwu Lu, Lingxi Li, Yaobin Chen, Jitesh H. Panchal, Amr Abdelraouf, Rohit Gupta, Kyungtae Han, Ziran Wang

    Abstract: With the broader usage and highly successful development of Large Language Models (LLMs), there has been a growth of interest and demand for applying LLMs to autonomous driving technology. Driven by their natural language understanding and reasoning ability, LLMs have the potential to enhance various aspects of autonomous driving systems, from perception and scene understanding to language interac… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  4. arXiv:2410.15252  [pdf, other

    cs.CL cs.AI

    Lossless KV Cache Compression to 2%

    Authors: Zhen Yang, J. N. Han, Kan Wu, Ruobing Xie, An Wang, Xingwu Sun, Zhanhui Kang

    Abstract: Large language models have revolutionized data processing in numerous domains, with their ability to handle extended context reasoning receiving notable recognition. To speed up inference, maintaining a key-value (KV) cache memory is essential. Nonetheless, the growing demands for KV cache memory create significant hurdles for efficient implementation. This work introduces a novel architecture, Cr… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

  5. arXiv:2410.15127  [pdf, other

    cs.LG cs.AI

    Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks

    Authors: Zixuan Yang, Jiaqi Zheng, Guihai Chen

    Abstract: Ensuring verifiable and interpretable safety of deep reinforcement learning (DRL) is crucial for its deployment in real-world applications. Existing approaches like verification-in-the-loop training, however, face challenges such as difficulty in deployment, inefficient training, lack of interpretability, and suboptimal performance in property satisfaction and reward performance. In this work, we… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

  6. arXiv:2410.14716  [pdf, other

    cs.LG cs.AI cs.CL

    A Systematic Survey on Large Language Models for Algorithm Design

    Authors: Fei Liu, Yiming Yao, Ping Guo, Zhiyuan Yang, Xi Lin, Xialiang Tong, Mingxuan Yuan, Zhichao Lu, Zhenkun Wang, Qingfu Zhang

    Abstract: Algorithm Design (AD) is crucial for effective problem-solving across various domains. The advent of Large Language Models (LLMs) has notably enhanced the automation and innovation within this field, offering new perspectives and superior solutions. Over the past three years, the integration of LLMs into AD (LLM4AD) has progressed significantly, finding applications in diverse areas such as optimi… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  7. SPFresh: Incremental In-Place Update for Billion-Scale Vector Search

    Authors: Yuming Xu, Hengyu Liang, Jin Li, Shuotao Xu, Qi Chen, Qianxi Zhang, Cheng Li, Ziyue Yang, Fan Yang, Yuqing Yang, Peng Cheng, Mao Yang

    Abstract: Approximate Nearest Neighbor Search (ANNS) is now widely used in various applications, ranging from information retrieval, question answering, and recommendation, to search for similar high-dimensional vectors. As the amount of vector data grows continuously, it becomes important to support updates to vector index, the enabling technique that allows for efficient and accurate ANNS on vectors. Beca… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: SOSP 23

  8. arXiv:2410.14111  [pdf, other

    cs.ET

    HyCiM: A Hybrid Computing-in-Memory QUBO Solver for General Combinatorial Optimization Problems with Inequality Constraints

    Authors: Yu Qian, Zeyu Yang, Kai Ni, Alptekin Vardar, Thomas Kämpfe, Xunzhao Yin

    Abstract: Computationally challenging combinatorial optimization problems (COPs) play a fundamental role in various applications. To tackle COPs, many Ising machines and Quadratic Unconstrained Binary Optimization (QUBO) solvers have been proposed, which typically involve direct transformation of COPs into Ising models or equivalent QUBO forms (D-QUBO). However, when addressing COPs with inequality constrai… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  9. arXiv:2410.13987  [pdf, other

    cs.CL

    RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs

    Authors: Jiatan Huang, Mingchen Li, Zonghai Yao, Zhichao Yang, Yongkang Xiao, Feiyun Ouyang, Xiaohan Li, Shuo Han, Hong Yu

    Abstract: Answering complex real-world questions often requires accurate retrieval from textual knowledge graphs (TKGs). The scarcity of annotated data, along with intricate topological structures, makes this task particularly challenging. As the nature of relational path information could enhance the inference ability of Large Language Models (LLMs), efficiently retrieving more complex relational path info… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  10. arXiv:2410.13915  [pdf, other

    cs.SI cs.AI cs.CY

    A Simulation System Towards Solving Societal-Scale Manipulation

    Authors: Maximilian Puelma Touzel, Sneheel Sarangi, Austin Welch, Gayatri Krishnakumar, Dan Zhao, Zachary Yang, Hao Yu, Ethan Kosak-Hine, Tom Gibbs, Andreea Musulan, Camille Thibault, Busra Tugce Gurbuz, Reihaneh Rabbany, Jean-François Godbout, Kellin Pelrine

    Abstract: The rise of AI-driven manipulation poses significant risks to societal trust and democratic processes. Yet, studying these effects in real-world settings at scale is ethically and logistically impractical, highlighting a need for simulation tools that can model these dynamics in controlled settings to enable experimentation with possible defenses. We present a simulation environment designed to ad… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  11. arXiv:2410.13905  [pdf, other

    cs.SI cs.AI cs.IR cs.LG

    P4GCN: Vertical Federated Social Recommendation with Privacy-Preserving Two-Party Graph Convolution Networks

    Authors: Zheng Wang, Wanwan Wang, Yimin Huang, Zhaopeng Peng, Ziqi Yang, Cheng Wang, Xiaoliang Fan

    Abstract: In recent years, graph neural networks (GNNs) have been commonly utilized for social recommendation systems. However, real-world scenarios often present challenges related to user privacy and business constraints, inhibiting direct access to valuable social information from other platforms. While many existing methods have tackled matrix factorization-based social recommendations without direct so… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  12. arXiv:2410.13748  [pdf, other

    hep-ex

    Test of lepton flavour universality with $B_s^0 \rightarrow φ\ell^+\ell^-$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1124 additional authors not shown)

    Abstract: Lepton flavour universality in rare $b\rightarrow s$ transitions is tested for the first time using $B_s^0$ meson decays. The measurements are performed using $pp$ collision data collected by the LHCb experiment between 2011 and 2018, corresponding to a total integrated luminosity of 9$\,{\rm fb}^{-1}$. Branching fraction ratios between the $B_s^0 \rightarrow φe^+e^-$ and… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3513/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-032, CERN-EP-2024-255

  13. arXiv:2410.13659  [pdf, ps, other

    hep-ph

    Neutralino dark matter in the extension of MSSM with two triplets and singlet

    Authors: Zhong-Jun Yang, Jin-Lei Yang, Shu-Min Zhao, Xing-Gang Wu, Tai-Fu Feng

    Abstract: In an extension of MSSM with two triplets and a singlet, called the TNMSSM, there are seven neutralinos which can enrich the study of cold dark matter if one expects that the weakly interacting massive particle (WIMP) is responsible for the observation of Planck satellite. Such a model, compared to the MSSM, can naturally offer a solution to the $μ$ problem, and its lightest neutralino, which is b… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  14. arXiv:2410.13515  [pdf, other

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  15. arXiv:2410.13478  [pdf, other

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  16. arXiv:2410.13428  [pdf, other

    cs.IR

    Generate and Instantiate What You Prefer: Text-Guided Diffusion for Sequential Recommendation

    Authors: Guoqing Hu, Zhangyi Yang, Zhibo Cai, An Zhang, Xiang Wang

    Abstract: Recent advancements in generative recommendation systems, particularly in the realm of sequential recommendation tasks, have shown promise in enhancing generalization to new items. Among these approaches, diffusion-based generative recommendation has emerged as an effective tool, leveraging its ability to capture data distributions and generate high-quality samples. Despite effectiveness, two prim… ▽ More

    Submitted 18 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

  17. arXiv:2410.13405  [pdf, other

    cs.AR cs.CR

    Trinity: A General Purpose FHE Accelerator

    Authors: Xianglong Deng, Shengyu Fan, Zhicheng Hu, Zhuoyu Tian, Zihao Yang, Jiangrui Yu, Dingyuan Cao, Dan Meng, Rui Hou, Meng Li, Qian Lou, Mingzhe Zhang

    Abstract: In this paper, we present the first multi-modal FHE accelerator based on a unified architecture, which efficiently supports CKKS, TFHE, and their conversion scheme within a single accelerator. To achieve this goal, we first analyze the theoretical foundations of the aforementioned schemes and highlight their composition from a finite number of arithmetic kernels. Then, we investigate the challenge… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: To be appeared in MICRO 2024. The first ASIC-based FHE accelerator which supports both CKKS, TFHE and their conversions. Provide new SOTA performance record for CKKS, TFHE and conversion

  18. arXiv:2410.13402  [pdf, other

    astro-ph.IM

    Monte Carlo Simulation of Angular Response of GRID Detectors for GRID Mission

    Authors: Qize Liu, Xiaofan Pan, Xutao Zheng, Huaizhong Gao, Longhao Li, Qidong Wang, Zirui Yang, Chenchong Tang, Wenxuan Wu, Jianping Cheng, Zhi Zeng, Ming Zeng, Hua Feng, Binbin Zhang, Zhonghai Wang, Rong Zhou, Yuanyuan Liu, Lin Lin, Jiayong Zhong, Jianyong Jiang, Wentao Han, Yang Tian, Benda Xu, GRID Collaboration

    Abstract: The Gamma-Ray Integrated Detectors (GRID) are a space science mission that employs compact gamma-ray detectors mounted on NanoSats in low Earth orbit (LEO) to monitor the transient gamma-ray sky. Owing to the unpredictability of the time and location of gamma-ray bursts (GRBs), obtaining the photon responses of gamma-ray detectors at various incident angles is important for the scientific analysis… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 15 pages, 9 figures

  19. arXiv:2410.13368  [pdf, other

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Utilizing 4.5${~\rm{fb}}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  20. arXiv:2410.13191  [pdf, other

    cs.CL cs.AI

    MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback

    Authors: Zonghai Yao, Aditya Parashar, Huixue Zhou, Won Seok Jang, Feiyun Ouyang, Zhichao Yang, Hong Yu

    Abstract: Automatic question generation (QG) is essential for AI and NLP, particularly in intelligent tutoring, dialogue systems, and fact verification. Generating multiple-choice questions (MCQG) for professional exams, like the United States Medical Licensing Examination (USMLE), is particularly challenging, requiring domain expertise and complex multi-hop reasoning for high-quality questions. However, cu… ▽ More

    Submitted 18 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Equal contribution for the first two authors

  21. arXiv:2410.13178  [pdf, other

    cs.LG cs.AI

    GeSubNet: Gene Interaction Inference for Disease Subtype Network Generation

    Authors: Ziwei Yang, Zheng Chen, Xin Liu, Rikuto Kotoge, Peng Chen, Yasuko Matsubara, Yasushi Sakurai, Jimeng Sun

    Abstract: Retrieving gene functional networks from knowledge databases presents a challenge due to the mismatch between disease networks and subtype-specific variations. Current solutions, including statistical and deep learning methods, often fail to effectively integrate gene interaction knowledge from databases or explicitly learn subtype-specific interactions. To address this mismatch, we propose GeSubN… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: Under review as a conference paper at ICLR 2025

  22. arXiv:2410.12836  [pdf, other

    cs.GR cs.AI cs.CV cs.HC

    EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

    Authors: Kaizhi Zheng, Xiaotong Chen, Xuehai He, Jing Gu, Linjie Li, Zhengyuan Yang, Kevin Lin, Jianfeng Wang, Lijuan Wang, Xin Eric Wang

    Abstract: Given the steep learning curve of professional 3D software and the time-consuming process of managing large 3D assets, language-guided 3D scene editing has significant potential in fields such as virtual reality, augmented reality, and gaming. However, recent approaches to language-guided 3D scene editing either require manual interventions or focus only on appearance modifications without support… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  23. PC-Planner: Physics-Constrained Self-Supervised Learning for Robust Neural Motion Planning with Shape-Aware Distance Function

    Authors: Xujie Shen, Haocheng Peng, Zesong Yang, Juzhan Xu, Hujun Bao, Ruizhen Hu, Zhaopeng Cui

    Abstract: Motion Planning (MP) is a critical challenge in robotics, especially pertinent with the burgeoning interest in embodied artificial intelligence. Traditional MP methods often struggle with high-dimensional complexities. Recently neural motion planners, particularly physics-informed neural planners based on the Eikonal equation, have been proposed to overcome the curse of dimensionality. However, th… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

    Comments: Accepted to SIGGRAPH Asia 2024 Conference. Project Page: https://zju3dv.github.io/pc-planner

  24. arXiv:2410.12794  [pdf, other

    cs.IR cs.AI

    Disaggregating Embedding Recommendation Systems with FlexEMR

    Authors: Yibo Huang, Zhenning Yang, Jiarong Xing, Yi Dai, Yiming Qiu, Dingming Wu, Fan Lai, Ang Chen

    Abstract: Efficiently serving embedding-based recommendation (EMR) models remains a significant challenge due to their increasingly large memory requirements. Today's practice splits the model across many monolithic servers, where a mix of GPUs, CPUs, and DRAM is provisioned in fixed proportions. This approach leads to suboptimal resource utilization and increased costs. Disaggregating embedding operations… ▽ More

    Submitted 27 September, 2024; originally announced October 2024.

  25. arXiv:2410.12669  [pdf, other

    cs.CV

    3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation

    Authors: Dewei Zhou, Ji Xie, Zongxin Yang, Yi Yang

    Abstract: The increasing demand for controllable outputs in text-to-image generation has spurred advancements in multi-instance generation (MIG), allowing users to define both instance layouts and attributes. However, unlike image-conditional generation methods such as ControlNet, MIG techniques have not been widely adopted in state-of-the-art models like SD2 and SDXL, primarily due to the challenge of buil… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 10 pages

  26. arXiv:2410.12624  [pdf

    cond-mat.supr-con

    Field-free superconducting diode effect and magnetochiral anisotropy in FeTe0.7Se0.3 junctions with the inherent asymmetric barrier

    Authors: Shengyao Li, Ya Deng, Dianyi Hu, Chao Zhu, Zherui Yang, Wanghao Tian, Xueyan Wang, Ming Yue, Qiong Wu, Zheng Liu, Xiao Renshaw Wang

    Abstract: Nonreciprocal electrical transport, characterized by an asymmetric relationship between current and voltage, plays a crucial role in modern electronic industries. Recent studies have extended this phenomenon to superconductors, introducing the concept of the superconducting diode effect (SDE). The SDE is characterized by unequal critical supercurrents along opposite directions. Due to the requirem… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  27. arXiv:2410.12620  [pdf, other

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  28. arXiv:2410.11607  [pdf, other

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  29. arXiv:2410.11288  [pdf, ps, other

    math.CO

    Maximal and maximum induced matchings in connected graphs

    Authors: Bo-Jun Yuan, Zhao-Yu Yang, Lu Zheng, Shi-Cai Gong

    Abstract: An induced matching in a graph is a set of edges whose endpoints induce a $1$-regular subgraph. Gupta et al. (2012,\cite{Gupta}) showed that every $n$-vertex graph has at most $10^{\frac{n}{5}}\approx 1.5849^n$ maximal induced matchings, which is attained by the disjoint union of copies of the complete graph $K_5$. In this paper, we show that the maximum number of maximal and maximum induced mat… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  30. arXiv:2410.11208  [pdf, other

    cs.CV cs.LG

    DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models

    Authors: Zhengyang Yu, Zhaoyuan Yang, Jing Zhang

    Abstract: Recent text-to-image personalization methods have shown great promise in teaching a diffusion model user-specified concepts given a few images for reusing the acquired concepts in a novel context. With massive efforts being dedicated to personalized generation, a promising extension is personalized editing, namely to edit an image using personalized concepts, which can provide a more precise guida… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at NeurIPS 2024

  31. arXiv:2410.10663  [pdf, other

    cs.CV cs.LG

    Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework

    Authors: Zhengwei Yang, Yuke Li, Qiang Sun, Basura Fernando, Heng Huang, Zheng Wang

    Abstract: Most existing studies on few-shot learning focus on unimodal settings, where models are trained to generalize on unseen data using only a small number of labeled examples from the same modality. However, real-world data are inherently multi-modal, and unimodal approaches limit the practical applications of few-shot learning. To address this gap, this paper introduces the Cross-modal Few-Shot Learn… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 19 pages, 7 figures

  32. arXiv:2410.10539  [pdf

    cond-mat.str-el

    Incommensurate Transverse Peierls Transition

    Authors: F. Z. Yang, K. F. Luo, Weizhe Zhang, Xiaoyu Guo, W. R. Meier, H. Ni, H. X. Li, P. Mercado Lozano, G. Fabbris, A. H. Said, C. Nelson, T. T. Zhang, A. F. May, M. A. McGuire, R. Juneja, L. Lindsay, H. N. Lee, J. -M. Zuo, M. F. Chi, X. Dai, Liuyan Zhao, H. Miao

    Abstract: In one-dimensional quantum materials, conducting electrons and the underlying lattices can undergo a spontaneous translational symmetry breaking, known as Peierls transition. For nearly a century, the Peierls transition has been understood within the paradigm of electron-electron interactions mediated by longitudinal acoustic phonons. This classical picture has recently been revised in topological… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Supplementary materials are available upon request

  33. arXiv:2410.10253  [pdf, other

    cs.LG cs.AI cs.NE

    Feedback Favors the Generalization of Neural ODEs

    Authors: Jindou Jia, Zihan Yang, Meng Wang, Kexin Guo, Jianfei Yang, Xiang Yu, Lei Guo

    Abstract: The well-known generalization problem hinders the application of artificial neural networks in continuous-time prediction tasks with varying latent dynamics. In sharp contrast, biological systems can neatly adapt to evolving environments benefiting from real-time feedback mechanisms. Inspired by the feedback philosophy, we present feedback neural networks, showing that a feedback loop can flexibly… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 22 pages, 17 figures

  34. arXiv:2410.10248  [pdf

    cs.CR

    Yuan: Research on the Concept of Digital World Analogue Scientific Infrastructure and Science Popularization Communication Based on Suzhou Gardens Pattern

    Authors: Zhang Lvyang, Lu Wen, Zhao Yang, Li Jiaqi, Zhai Lidong

    Abstract: In the current digital era, high security relies significantly on advanced concepts such as native security. However, the design and implementation of these concepts face challenges in enterprises and organizations. Leveraging advancements in Large Language Models (LLMs), we draw inspiration from the design principles of Suzhou Gardens, a UNESCO World Heritage site. By examining its core features,… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  35. arXiv:2410.10148  [pdf, other

    cs.LG cs.AI cs.CL

    $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs

    Authors: Junkang Wu, Xue Wang, Zhengyi Yang, Jiancan Wu, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He

    Abstract: Aligning large language models (LLMs) with human values and intentions is crucial for their utility, honesty, and safety. Reinforcement learning from human feedback (RLHF) is a popular approach to achieve this alignment, but it faces challenges in computational efficiency and training stability. Recent methods like Direct Preference Optimization (DPO) and Simple Preference Optimization (SimPO) hav… ▽ More

    Submitted 19 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

  36. arXiv:2410.09733  [pdf, other

    cs.CV

    MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models

    Authors: Hang Hua, Yunlong Tang, Ziyun Zeng, Liangliang Cao, Zhengyuan Yang, Hangfeng He, Chenliang Xu, Jiebo Luo

    Abstract: The advent of large Vision-Language Models (VLMs) has significantly advanced multimodal understanding, enabling more sophisticated and accurate integration of visual and textual information across various tasks, including image and video captioning, visual question answering, and cross-modal retrieval. Despite VLMs' superior capabilities, researchers lack a comprehensive understanding of their com… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 21 pages, 15 figures

  37. arXiv:2410.09383  [pdf, ps, other

    cs.LG stat.ML

    Deep Transfer Learning: Model Framework and Error Analysis

    Authors: Yuling Jiao, Huazhen Lin, Yuchen Luo, Jerry Zhijian Yang

    Abstract: This paper presents a framework for deep transfer learning, which aims to leverage information from multi-domain upstream data with a large number of samples $n$ to a single-domain downstream task with a considerably smaller number of samples $m$, where $m \ll n$, in order to enhance performance on downstream task. Our framework has several intriguing features. First, it allows the existence of bo… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

  38. arXiv:2410.09340  [pdf, other

    math.AP

    Global well-posedness and uniform-in-time vanishing damping limit for the inviscid Oldroyd-B model

    Authors: Xinyu Cheng, Zhaonan Luo, Zhaojie Yang, Cheng Yuan

    Abstract: In this paper, we consider global strong solutions and uniform-in-time vanishing damping limit for the inviscid Oldroyd-B model in R^d, where d=2 and 3. The well-recognized problem of the global existence of smooth solutions for the 2D inviscid Oldroyd-B model without smallness assumptions is open due to the complex structure of Q. Therefore improving the smallness assumptions, especially in lower… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 91 pages, 14 figures

  39. arXiv:2410.09151  [pdf, other

    astro-ph.HE

    A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné , et al. (1758 additional authors not shown)

    Abstract: The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 15 pages of text including references, 4 figures, 5 tables

    Report number: LIGO-P2400192

  40. arXiv:2410.08666  [pdf, other

    cs.LG cs.AI

    DeltaDQ: Ultra-High Delta Compression for Fine-Tuned LLMs via Group-wise Dropout and Separate Quantization

    Authors: Yanfeng Jiang, Zelan Yang, Bohua Chen, Shen Li, Yong Li, Tao Li

    Abstract: Large language models achieve exceptional performance on various downstream tasks through supervised fine-tuning. However, the diversity of downstream tasks and practical requirements makes deploying multiple full-parameter fine-tuned models challenging. Current methods that compress the delta weight struggle to achieve ultra-high compression, failing to minimize the deployment overhead. To addres… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  41. arXiv:2410.08603  [pdf, other

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773\,GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  42. arXiv:2410.08474  [pdf, other

    cs.CV cs.CL

    SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models

    Authors: Haotian Xia, Zhengbang Yang, Junbo Zou, Rhys Tracy, Yuqing Wang, Chi Lu, Christopher Lai, Yanjun He, Xun Shao, Zhuoqing Xie, Yuan-fang Wang, Weining Shen, Hanjie Chen

    Abstract: Multimodal Large Language Models (MLLMs) are advancing the ability to reason about complex sports scenarios by integrating textual and visual information. To comprehensively evaluate their capabilities, we introduce SPORTU, a benchmark designed to assess MLLMs across multi-level sports reasoning tasks. SPORTU comprises two key components: SPORTU-text, featuring 900 multiple-choice questions with h… ▽ More

    Submitted 19 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

  43. arXiv:2410.08249  [pdf, other

    cs.LG cs.AI

    Federated Graph Learning for Cross-Domain Recommendation

    Authors: Ziqi Yang, Zhaopeng Peng, Zihui Wang, Jianzhong Qi, Chaochao Chen, Weike Pan, Chenglu Wen, Cheng Wang, Xiaoliang Fan

    Abstract: Cross-domain recommendation (CDR) offers a promising solution to the data sparsity problem by enabling knowledge transfer across source and target domains. However, many recent CDR models overlook crucial issues such as privacy as well as the risk of negative transfer (which negatively impact model performance), especially in multi-domain settings. To address these challenges, we propose FedGCDR,… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS'24

  44. arXiv:2410.07985  [pdf, other

    cs.CL

    Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

    Authors: Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Qingxiu Dong, Lei Li, Chenghao Ma, Liang Chen, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Shanghaoran Quan, Ge Zhang, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang

    Abstract: Recent advancements in large language models (LLMs) have led to significant breakthroughs in mathematical reasoning capabilities. However, existing benchmarks like GSM8K or MATH are now being solved with high accuracy (e.g., OpenAI o1 achieves 94.8% on MATH dataset), indicating their inadequacy for truly challenging these models. To bridge this gap, we propose a comprehensive and challenging bench… ▽ More

    Submitted 10 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: 26 Pages, 17 Figures

  45. arXiv:2410.07626  [pdf, other

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  46. arXiv:2410.07516  [pdf, other

    cs.SE

    Exploring and Lifting the Robustness of LLM-powered Automated Program Repair with Metamorphic Testing

    Authors: Pengyu Xue, Linhao Wu, Zhen Yang, Xinyi Li, Zhongxing Yu, Zhi Jin, Ge Li, Yan Xiao, Jingwen Wu

    Abstract: In recent years, Large language model-powered Automated Program Repair (LAPR) techniques have achieved state-of-the-art bug-fixing performance and have been pervasively applied and studied in both industry and academia. Nonetheless, LLMs were proved to be highly sensitive to input prompts, with slight differences in the expressions of semantically equivalent programs potentially causing repair fai… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  47. arXiv:2410.07076  [pdf, other

    cs.CL cs.AI cs.LG

    MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

    Authors: Zonglin Yang, Wanhao Liu, Ben Gao, Tong Xie, Yuqiang Li, Wanli Ouyang, Soujanya Poria, Erik Cambria, Dongzhan Zhou

    Abstract: Scientific discovery contributes largely to human society's prosperity, and recent progress shows that LLMs could potentially catalyze this process. However, it is still unclear whether LLMs can discover novel and valid hypotheses in chemistry. In this work, we investigate this central research question: Can LLMs automatically discover novel and valid chemistry research hypotheses given only a che… ▽ More

    Submitted 12 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: Code and Benchmark are available at https://github.com/ZonglinY/MOOSE-Chem.git

  48. arXiv:2410.06911  [pdf, other

    cs.RO cs.AI

    Combining Planning and Diffusion for Mobility with Unknown Dynamics

    Authors: Yajvan Ravan, Zhutian Yang, Tao Chen, Tomás Lozano-Pérez, Leslie Pack Kaelbling

    Abstract: Manipulation of large objects over long horizons (such as carts in a warehouse) is an essential skill for deployable robotic systems. Large objects require mobile manipulation which involves simultaneous manipulation, navigation, and movement with the object in tow. In many real-world situations, object dynamics are incredibly complex, such as the interaction of an office chair (with a rotating ba… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: Submitted to ICRA 2025

  49. arXiv:2410.06777  [pdf, other

    cs.CV

    HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding

    Authors: Keliang Li, Zaifei Yang, Jiahe Zhao, Hongze Shen, Ruibing Hou, Hong Chang, Shiguang Shan, Xilin Chen

    Abstract: The significant advancements in visual understanding and instruction following from Multimodal Large Language Models (MLLMs) have opened up more possibilities for broader applications in diverse and universal human-centric scenarios. However, existing image-text data may not support the precise modality alignment and integration of multi-grained information, which is crucial for human-centric visu… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  50. arXiv:2410.06719  [pdf, other

    cs.CV cs.AI

    Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques

    Authors: Benyuan Meng, Qianqian Xu, Zitai Wang, Zhiyong Yang, Xiaochun Cao, Qingming Huang

    Abstract: Diffusion models are powerful generative models, and this capability can also be applied to discrimination. The inner activations of a pre-trained diffusion model can serve as features for discriminative tasks, namely, diffusion feature. We discover that diffusion feature has been hindered by a hidden yet universal phenomenon that we call content shift. To be specific, there are content difference… ▽ More

    Submitted 18 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2410.03558