Skip to main content

Showing 1–50 of 4,336 results for author: Gao, Y

  1. arXiv:2410.16020  [pdf, other

    cs.CV

    START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation

    Authors: Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao

    Abstract: Domain Generalization (DG) aims to enable models to generalize to unseen target domains by learning from multiple source domains. Existing DG methods primarily rely on convolutional neural networks (CNNs), which inherently learn texture biases due to their limited receptive fields, making them prone to overfitting source domains. While some works have introduced transformer-based methods (ViTs) fo… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS2024. The code is available at https://github.com/lingeringlight/START

  2. arXiv:2410.15039  [pdf, other

    astro-ph.SR

    Double-edged sword: the influence of tidal interaction on stellar activity in binaries

    Authors: Yuedan Ding, Shidi Zhang, Henggeng Han, Wenyuan Cui, Song Wang, Min Fang, Yawei Gao

    Abstract: Using the LAMOST DR7 low-resolution spectra, we carried out a systematic study of stellar chromospheric activity in both single and binary stars. We constructed a binary sample and a single-star sample, mainly using the binary belt and the main sequence in the Hertzsprung-Russell diagram, respectively. By comparing the $S$ indices between single and binary stars within each color bin, we found for… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 10 pages,7 figures. Accepted for publication in ApJ

  3. arXiv:2410.14321  [pdf, other

    cs.CR cs.PL cs.SE

    From Solitary Directives to Interactive Encouragement! LLM Secure Code Generation by Natural Language Prompting

    Authors: Shigang Liu, Bushra Sabir, Seung Ick Jang, Yuval Kansal, Yansong Gao, Kristen Moore, Alsharif Abuadbba, Surya Nepal

    Abstract: Large Language Models (LLMs) have shown remarkable potential in code generation, making them increasingly important in the field. However, the security issues of generated code have not been fully addressed, and the usability of LLMs in code generation still requires further exploration. This work introduces SecCode, a framework that leverages an innovative interactive encouragement prompting (E… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  4. arXiv:2410.14268  [pdf, other

    cs.CL cs.LG

    MoDification: Mixture of Depths Made Easy

    Authors: Chen Zhang, Meizhi Zhong, Qimeng Wang, Xuantao Lu, Zheyu Ye, Chengqiang Lu, Yan Gao, Yao Hu, Kehai Chen, Min Zhang, Dawei Song

    Abstract: Long-context efficiency has recently become a trending topic in serving large language models (LLMs). And mixture of depths (MoD) is proposed as a perfect fit to bring down both latency and memory. In this paper, however, we discover that MoD can barely transform existing LLMs without costly training over an extensive number of tokens. To enable the transformations from any LLMs to MoD ones, we sh… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 12 pages, 9 figures, 5 tables, work in progress

  5. arXiv:2410.13748  [pdf, other

    hep-ex

    Test of lepton flavour universality with $B_s^0 \rightarrow φ\ell^+\ell^-$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1124 additional authors not shown)

    Abstract: Lepton flavour universality in rare $b\rightarrow s$ transitions is tested for the first time using $B_s^0$ meson decays. The measurements are performed using $pp$ collision data collected by the LHCb experiment between 2011 and 2018, corresponding to a total integrated luminosity of 9$\,{\rm fb}^{-1}$. Branching fraction ratios between the $B_s^0 \rightarrow φe^+e^-$ and… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3513/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-032, CERN-EP-2024-255

  6. arXiv:2410.13588  [pdf, other

    cs.IR cs.SI

    Cross-Domain Sequential Recommendation via Neural Process

    Authors: Haipeng Li, Jiangxia Cao, Yiwen Gao, Yunhuai Liu, Shuchao Pang

    Abstract: Cross-Domain Sequential Recommendation (CDSR) is a hot topic in sequence-based user interest modeling, which aims at utilizing a single model to predict the next items for different domains. To tackle the CDSR, many methods are focused on domain overlapped users' behaviors fitting, which heavily relies on the same user's different-domain item sequences collaborating signals to capture the synergy… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: Work in progress

  7. arXiv:2410.13515  [pdf, other

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  8. arXiv:2410.13478  [pdf, other

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  9. arXiv:2410.13368  [pdf, other

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Utilizing 4.5${~\rm{fb}}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  10. arXiv:2410.13276  [pdf, other

    cs.CL

    SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

    Authors: Yizhao Gao, Zhichen Zeng, Dayou Du, Shijie Cao, Hayden Kwok-Hay So, Ting Cao, Fan Yang, Mao Yang

    Abstract: Attention is the cornerstone of modern Large Language Models (LLMs). Yet its quadratic complexity limits the efficiency and scalability of LLMs, especially for those with a long-context window. A promising approach addressing this limitation is to leverage the sparsity in attention. However, existing sparsity-based solutions predominantly rely on predefined patterns or heuristics to approximate sp… ▽ More

    Submitted 18 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

  11. arXiv:2410.13267  [pdf, other

    cs.SD cs.CL eess.AS

    CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

    Authors: Shangda Wu, Yashan Wang, Ruibin Yuan, Zhancheng Guo, Xu Tan, Ge Zhang, Monan Zhou, Jing Chen, Xuefeng Mu, Yuejie Gao, Yuanliang Dong, Jiafeng Liu, Xiaobing Li, Feng Yu, Maosong Sun

    Abstract: Challenges in managing linguistic diversity and integrating various musical modalities are faced by current music information retrieval systems. These limitations reduce their effectiveness in a global, multimodal music environment. To address these issues, we introduce CLaMP 2, a system compatible with 101 languages that supports both ABC notation (a text-based musical notation format) and MIDI (… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 17 pages, 10 figures, 4 tables

  12. arXiv:2410.13119  [pdf, other

    astro-ph.GA

    PGC 44685: A Dwarf Star-forming Lenticular Galaxy with Wolf-Rayet Population

    Authors: Shiying Lu, Qiusheng Gu, Yulong Gao, Yong Shi, Luwenjia Zhou, Rubén García-Benito, Xiangdong Li, Jiantong Cui, Xin Li, Liuze Long, Zhengyi Chen

    Abstract: Lenticular galaxies (S0s) are formed mainly from the gas stripping of spirals in the cluster. But how S0s form and evolve in the field is still untangled. Based on spatially resolved observations from the optical Hispanic Astronomical Center in Andalusia 3.5-m telescope with the PPAK Integral Field Spectroscopy instrument and NOrthern Extended Millimeter Array, we study a dwarf (M*<10^9 Msun) S0,… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 19 pages, 12 figures, 3 tables, ApJ accepted

  13. arXiv:2410.12620  [pdf, other

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  14. arXiv:2410.12464  [pdf, other

    cs.MA

    Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning

    Authors: Qian Wang, Yuchen Gao, Zhenheng Tang, Bingqiao Luo, Bingsheng He

    Abstract: While many studies prove more advanced LLMs perform better on tasks such as math and coding, we notice that in cryptocurrency trading, stronger LLMs work worse than weaker LLMs often. To study how this counter-intuitive phenomenon occurs, we examine the LLM reasoning processes on making trading decisions. We find that separating the reasoning process into factual and subjective components can lead… ▽ More

    Submitted 17 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

  15. arXiv:2410.12438  [pdf

    eess.SY

    Modeling, Prediction and Risk Management of Distribution System Voltages with Non-Gaussian Probability Distributions

    Authors: Yuanhai Gao, Xiaoyuan Xu, Zheng Yan, Mohammad Shahidehpour, Bo Yang, Xinping Guan

    Abstract: High renewable energy penetration into power distribution systems causes a substantial risk of exceeding voltage security limits, which needs to be accurately assessed and properly managed. However, the existing methods usually rely on the joint probability models of power generation and loads provided by probabilistic prediction to quantify the voltage risks, where inaccurate prediction results c… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  16. arXiv:2410.12342  [pdf, other

    cs.CV cs.AI

    TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant

    Authors: Guopeng Li, Qiang Wang, Ke Yan, Shouhong Ding, Yuan Gao, Gui-Song Xia

    Abstract: Most knowledge distillation (KD) methodologies predominantly focus on teacher-student pairs with similar architectures, such as both being convolutional neural networks (CNNs). However, the potential and flexibility of KD can be greatly improved by expanding it to novel Cross-Architecture KD (CAKD), where the knowledge of homogeneous and heterogeneous teachers can be transferred flexibly to a give… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 18 pages, 6 figures, and 12 tables

  17. arXiv:2410.11607  [pdf, other

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  18. arXiv:2410.11495  [pdf, other

    eess.SP

    GBSense: A GHz-Bandwidth Compressed Spectrum Sensing System

    Authors: Zihang Song, Xingjian Zhang, Zhe Chen, Rahim Tafazolli, Yue Gao

    Abstract: This paper presents GBSense, an innovative compressed spectrum sensing system designed for GHz-bandwidth signals in dynamic spectrum access (DSA) applications. GBSense introduces an efficient approach to periodic nonuniform sampling, capturing wideband signals using significantly lower sampling rates compared to traditional Nyquist sampling. By integrating time-interleaved analog-to-digital conver… ▽ More

    Submitted 18 October, 2024; v1 submitted 15 October, 2024; originally announced October 2024.

  19. arXiv:2410.11173  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Ferroaxial phonons in chiral and polar NiCo2TeO6

    Authors: V. A. Martinez, Y. Gao, J. Yang, F. Lyzwa, Z. Liu, C. J. Won, K. Du, V. Kiryukhin, S-W. Cheong, A. A. Sirenko

    Abstract: Perfect circular dichroism has been observed in the Raman scattering by the optical phonons in single chiral domain NiCo2TeO6 crystals. The selection rules for the optical phonons are determined by the combination of the chiral structure C and the electric polarization P along the c-axis. These two symmetry operations are equivalent to the ferroaxial order (C dot P) = A, so the observed optical ph… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  20. arXiv:2410.11002  [pdf, other

    eess.SP

    Optimizing Radio Access Technology Selection and Precoding in CV-Aided ISAC Systems

    Authors: Yulan Gao, Ziqiang Ye, Ming Xiao, Yue Xiao

    Abstract: Integrated Sensing and Communication (ISAC) systems promise to revolutionize wireless networks by concurrently supporting high-resolution sensing and high-performance communication. This paper presents a novel radio access technology (RAT) selection framework that capitalizes on vision sensing from base station (BS) cameras to optimize both communication and perception capabilities within the ISAC… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  21. arXiv:2410.10454  [pdf, other

    cs.CV

    Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks

    Authors: Xinyue Liu, Yunlong Gao, Linlin Zong, Bo Xu

    Abstract: Meta-learning has emerged as a prominent technology for few-shot text classification and has achieved promising performance. However, existing methods often encounter difficulties in drawing accurate class prototypes from support set samples, primarily due to probable large intra-class differences and small inter-class differences within the task. Recent approaches attempt to incorporate external… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted by EMNLP 2024 Findings

  22. arXiv:2410.10083  [pdf, other

    cs.AI

    Beyond Graphs: Can Large Language Models Comprehend Hypergraphs?

    Authors: Yifan Feng, Chengwu Yang, Xingliang Hou, Shaoyi Du, Shihui Ying, Zongze Wu, Yue Gao

    Abstract: Existing benchmarks like NLGraph and GraphQA evaluate LLMs on graphs by focusing mainly on pairwise relationships, overlooking the high-order correlations found in real-world data. Hypergraphs, which can model complex beyond-pairwise relationships, offer a more robust framework but are still underexplored in the context of LLMs. To address this gap, we introduce LLM4Hypergraph, the first comprehen… ▽ More

    Submitted 16 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

  23. arXiv:2410.09834  [pdf, other

    cs.CV eess.IV

    Towards Defining an Efficient and Expandable File Format for AI-Generated Contents

    Authors: Yixin Gao, Runsen Feng, Xin Li, Weiping Li, Zhibo Chen

    Abstract: Recently, AI-generated content (AIGC) has gained significant traction due to its powerful creation capability. However, the storage and transmission of large amounts of high-quality AIGC images inevitably pose new challenges for recent file formats. To overcome this, we define a new file format for AIGC images, named AIGIF, enabling ultra-low bitrate coding of AIGC images. Unlike compressing AIGC… ▽ More

    Submitted 15 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

  24. arXiv:2410.09747  [pdf, other

    cs.CV cs.AI cs.DC cs.LG cs.RO

    t-READi: Transformer-Powered Robust and Efficient Multimodal Inference for Autonomous Driving

    Authors: Pengfei Hu, Yuhang Qian, Tianyue Zheng, Ang Li, Zhe Chen, Yue Gao, Xiuzhen Cheng, Jun Luo

    Abstract: Given the wide adoption of multimodal sensors (e.g., camera, lidar, radar) by autonomous vehicles (AVs), deep analytics to fuse their outputs for a robust perception become imperative. However, existing fusion methods often make two assumptions rarely holding in practice: i) similar data distributions for all inputs and ii) constant availability for all sensors. Because, for example, lidars have v… ▽ More

    Submitted 17 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

    Comments: 14 pages, 16 figures

  25. arXiv:2410.08860  [pdf, other

    cs.CL cs.CV

    Audio Description Generation in the Era of LLMs and VLMs: A Review of Transferable Generative AI Technologies

    Authors: Yingqiang Gao, Lukas Fischer, Alexa Lintner, Sarah Ebling

    Abstract: Audio descriptions (ADs) function as acoustic commentaries designed to assist blind persons and persons with visual impairments in accessing digital media content on television and in movies, among other settings. As an accessibility service typically provided by trained AD professionals, the generation of ADs demands significant human effort, making the process both time-consuming and costly. Rec… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  26. arXiv:2410.08696  [pdf, other

    cs.CL

    AMPO: Automatic Multi-Branched Prompt Optimization

    Authors: Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Bin Benjamin Zhu, Xiaodi Sun, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang

    Abstract: Prompt engineering is very important to enhance the performance of large language models (LLMs). When dealing with complex issues, prompt engineers tend to distill multiple patterns from examples and inject relevant solutions to optimize the prompts, achieving satisfying results. However, existing automatic prompt optimization techniques are only limited to producing single flow instructions, stru… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 13 pages, 7 figures, 6 tables

  27. arXiv:2410.08603  [pdf, other

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773\,GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  28. arXiv:2410.08601  [pdf, other

    cs.CL

    StraGo: Harnessing Strategic Guidance for Prompt Optimization

    Authors: Yurong Wu, Yan Gao, Bin Benjamin Zhu, Zineng Zhou, Xiaodi Sun, Sheng Yang, Jian-Guang Lou, Zhiming Ding, Linjun Yang

    Abstract: Prompt engineering is pivotal for harnessing the capabilities of large language models (LLMs) across diverse applications. While existing prompt optimization methods improve prompt effectiveness, they often lead to prompt drifting, where newly generated prompts can adversely impact previously successful cases while addressing failures. Furthermore, these methods tend to rely heavily on LLMs' intri… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 19 pages, 3 figures, 20 tables

  29. arXiv:2410.08527  [pdf, other

    cs.CL cs.AI cs.LG

    Scaling Laws for Predicting Downstream Performance in LLMs

    Authors: Yangyi Chen, Binxuan Huang, Yifan Gao, Zhengyang Wang, Jingfeng Yang, Heng Ji

    Abstract: Precise estimation of downstream performance in large language models (LLMs) prior to training is essential for guiding their development process. Scaling laws analysis utilizes the statistics of a series of significantly smaller sampling language models (LMs) to predict the performance of the target LLM. For downstream performance prediction, the critical challenge lies in the emergent abilities… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  30. arXiv:2410.08500  [pdf, other

    cs.RO cs.AI

    Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning

    Authors: Yunpeng Gao, Zhigang Wang, Linglin Jing, Dong Wang, Xuelong Li, Bin Zhao

    Abstract: Aerial Vision-and-Language Navigation (VLN) is a novel task enabling Unmanned Aerial Vehicles (UAVs) to navigate in outdoor environments through natural language instructions and visual cues. It remains challenging due to the complex spatial relationships in outdoor aerial scenes. In this paper, we propose an end-to-end zero-shot framework for aerial VLN tasks, where the large language model (LLM)… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: Submitted to ICRA 2025

  31. arXiv:2410.08100  [pdf, other

    cs.CV

    CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation

    Authors: Xiaoyan Jiang, Licheng Jiang, Anjie Wang, Kaiying Zhu, Yongbin Gao

    Abstract: Integrating grayscale and depth data in road inspection robots could enhance the accuracy, reliability, and comprehensiveness of road condition assessments, leading to improved maintenance strategies and safer infrastructure. However, these data sources are often compromised by significant background noise from the pavement. Recent advancements in Diffusion Probabilistic Models (DPM) have demonstr… ▽ More

    Submitted 12 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

  32. arXiv:2410.08041  [pdf, ps, other

    cs.LG cs.AI math.OC

    On the Convergence of (Stochastic) Gradient Descent for Kolmogorov--Arnold Networks

    Authors: Yihang Gao, Vincent Y. F. Tan

    Abstract: Kolmogorov--Arnold Networks (KANs), a recently proposed neural network architecture, have gained significant attention in the deep learning community, due to their potential as a viable alternative to multi-layer perceptrons (MLPs) and their broad applicability to various scientific tasks. Empirical investigations demonstrate that KANs optimized via stochastic gradient descent (SGD) are capable of… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  33. arXiv:2410.07626  [pdf, other

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  34. arXiv:2410.07591  [pdf, other

    eess.SP

    Robustness and Security Enhancement of Radio Frequency Fingerprint Identification in Time-Varying Channels

    Authors: Lu Yang, Seyit Camtepe, Yansong Gao, Vicky Liu, Dhammika Jayalath

    Abstract: Radio frequency fingerprint identification (RFFI) is becoming increasingly popular, especially in applications with constrained power, such as the Internet of Things (IoT). Due to subtle manufacturing variations, wireless devices have unique radio frequency fingerprints (RFFs). These RFFs can be used with pattern recognition algorithms to classify wireless devices. However, Implementing reliable R… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 15 pages

  35. arXiv:2410.07520  [pdf, ps, other

    cs.CL

    News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News

    Authors: Tarun Jain, Yufei Gao, Sridhar Vanga, Karan Singla

    Abstract: Large Language Models (LLMs) have fast become an essential tools to many conversational chatbots due to their ability to provide coherent answers for varied queries. Datasets used to train these LLMs are often a mix of generic and synthetic samples, thus lacking the verification needed to provide correct and verifiable answers for T.V. News. We collect and share a large collection of QA pairs ex… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 5 pages, under review at ICASSP 2025

  36. arXiv:2410.06549  [pdf, other

    cs.LG cs.AI cs.SI

    DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector

    Authors: Jinghan Li, Yuan Gao, Jinda Lu, Junfeng Fang, Congcong Wen, Hui Lin, Xiang Wang

    Abstract: Graph Anomaly Detection (GAD) is crucial for identifying abnormal entities within networks, garnering significant attention across various fields. Traditional unsupervised methods, which decode encoded latent representations of unlabeled data with a reconstruction focus, often fail to capture critical discriminative content, leading to suboptimal anomaly detection. To address these challenges, we… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  37. arXiv:2410.06500  [pdf, other

    hep-ex

    Search for the radiative decays $D^+\toγρ^+$ and $D^+\toγK^{*+}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (648 additional authors not shown)

    Abstract: We search for the radiative decays $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ using 20.3~fb$^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and the upper limits on the branching fractions of $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ at 90\% confidence level ar… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  38. arXiv:2410.05736  [pdf, ps, other

    hep-ex

    Observation of an axial-vector state in the study of $ψ(3686) \to φηη'$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (625 additional authors not shown)

    Abstract: Using (2712.4 $\pm$ 14.3)$\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, a partial wave analysis of the decay $ψ(3686) \to φηη' $ is performed with the covariant tensor approach. An axial-vector state with a mass near 2.3 $\rm GeV/c^2$ is observed for the first time. Its mass and width are measured to be 2316… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  39. arXiv:2410.05091  [pdf, ps, other

    cs.DB cs.DC

    DIMS: Distributed Index for Similarity Search in Metric Spaces

    Authors: Yifan Zhu, Chengyang Luo, Tang Qian, Lu Chen, Yunjun Gao, Baihua Zheng

    Abstract: Similarity search finds objects that are similar to a given query object based on a similarity metric. As the amount and variety of data continue to grow, similarity search in metric spaces has gained significant attention. Metric spaces can accommodate any type of data and support flexible distance metrics, making similarity search in metric spaces beneficial for many real-world applications, suc… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

  40. arXiv:2410.05021  [pdf, other

    cs.LG cs.CL

    DEPT: Decoupled Embeddings for Pre-training Language Models

    Authors: Alex Iacob, Lorenzo Sani, Meghdad Kurmanji, William F. Shen, Xinchi Qiu, Dongqi Cai, Yan Gao, Nicholas D. Lane

    Abstract: Language model pre-training benefits from diverse data to enhance performance across domains and languages. However, training on such heterogeneous corpora requires extensive and costly efforts. Since these data sources vary lexically, syntactically, and semantically, they cause negative interference or the ``curse of multilinguality''. We propose a novel pre-training framework to alleviate this c… ▽ More

    Submitted 20 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

  41. arXiv:2410.04798  [pdf, other

    cs.CL

    DAPE V2: Process Attention Score as Feature Map for Length Extrapolation

    Authors: Chuanyang Zheng, Yihang Gao, Han Shi, Jing Xiong, Jiankai Sun, Jingyao Li, Minbin Huang, Xiaozhe Ren, Michael Ng, Xin Jiang, Zhenguo Li, Yu Li

    Abstract: The attention mechanism is a fundamental component of the Transformer model, contributing to interactions among distinct tokens, in contrast to earlier feed-forward neural networks. In general, the attention scores are determined simply by the key-query products. However, this work's occasional trial (combining DAPE and NoPE) of including additional MLPs on attention scores without position encodi… ▽ More

    Submitted 10 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: Tech Report. Compared to DAPE, this work (DAPE V2) further analyzes the length extrapolation problem and translate the length extrapolation issue into a well-understood feature map processing problem. arXiv admin note: text overlap with arXiv:2405.14722

  42. Penalized Sparse Covariance Regression with High Dimensional Covariates

    Authors: Yuan Gao, Zhiyuan Zhang, Zhanrui Cai, Xuening Zhu, Tao Zou, Hansheng Wang

    Abstract: Covariance regression offers an effective way to model the large covariance matrix with the auxiliary similarity matrices. In this work, we propose a sparse covariance regression (SCR) approach to handle the potentially high-dimensional predictors (i.e., similarity matrices). Specifically, we use the penalization method to identify the informative predictors and estimate their associated coefficie… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

    MSC Class: 62J99; 62P20

  43. arXiv:2410.03101  [pdf, other

    astro-ph.EP

    Constraining the Presence of Companion Planets in Hot Jupiter Planetary System Using TTV Observation from TESS

    Authors: Zixin Zhang, Wenqin Wang, Xinyue Ma, Zhangliang Chen, Yonghao Wang, Cong Yu, Shangfei Liu, Yang Gao, Baitian Tang, Bo Ma

    Abstract: The presence of another planetary companion in a transiting exoplanet system can impact its transit light curve, leading to sinusoidal transit timing variations (TTV). By utilizing both $χ^2$ and RMS analysis, we have combined the TESS observation data with an N-body simulation to investigate the existence of an additional planet in the system and put a limit on its mass. We have developed CMAT, a… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted for publication in ApJS

  44. arXiv:2410.02502  [pdf, other

    hep-ex

    Measurement of the effective leptonic weak mixing angle

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1117 additional authors not shown)

    Abstract: Using $pp$ collision data at $\sqrt{s}=13$ TeV, recorded by the LHCb experiment between 2016 and 2018 and corresponding to an integrated luminosity of $5.4$ fb$^{-1}$, the forward-backward asymmetry in the $pp \to Z/γ^{*} \to μ^+μ^-$ process is measured. The measurement is carried out in ten intervals of the difference between the muon pseudorapidities, within a fiducial region covering dimuon mas… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3360/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-028, CERN-EP-2024-230

  45. arXiv:2410.02421  [pdf, other

    hep-ex

    Search for lepton number violating decays of $D_s^+\to h^-h^0e^+e^+$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (650 additional authors not shown)

    Abstract: Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector operating at the BEPCII collider at center-of-mass energies from 4.128 to 4.226 GeV, a search for the Majorana neutrino $ν_m$ is conducted in the lepton-number-violating decays of $D_s^+\to h^-h^0e^+e^+$. Here, $h^-$ represents a $K^-$ or $π^-$, and $h^0$ represents a $π^0$, $K_S^0$ or $φ$. No significant signal is… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  46. arXiv:2410.02358  [pdf

    eess.SY

    Cross-Domain Comparative Analysis of Digital Twins and Universalised Solutions

    Authors: Guanyu Xiong, Yan Gao, Haijiang Li

    Abstract: Digitalisation is one of the main drivers of most economic sectors nowadays and the digital twin, as a reification of digitalisation for complex systems has attracted much attention from both academics and industry. There have been studies focusing on digital twins in a specific sector while there are few exercising insightful comparisons of digital twins from different domains. Considering the di… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  47. arXiv:2410.02215  [pdf, other

    quant-ph cond-mat.str-el

    Fermionic tensor network contraction for arbitrary geometries

    Authors: Yang Gao, Huanchen Zhai, Johnnie Gray, Ruojing Peng, Gunhee Park, Wen-Yuan Liu, Eirik F. Kjønstad, Garnet Kin-Lic Chan

    Abstract: We describe our implementation of fermionic tensor network contraction on arbitrary lattices within both a globally ordered and locally ordered formalism. We provide a pedagogical description of these two conventions as implemented for the quimb library. Using hyperoptimized approximate contraction strategies, we present benchmark fermionic projected entangled pair states simulations of finite Hub… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 9 pages, 9 figures

  48. arXiv:2410.01841  [pdf

    eess.AS cs.AI cs.CL cs.IR cs.SD

    A GEN AI Framework for Medical Note Generation

    Authors: Hui Yi Leong, Yi Fan Gao, Shuai Ji, Bora Kalaycioglu, Uktu Pamuksuz

    Abstract: The increasing administrative burden of medical documentation, particularly through Electronic Health Records (EHR), significantly reduces the time available for direct patient care and contributes to physician burnout. To address this issue, we propose MediNotes, an advanced generative AI framework designed to automate the creation of SOAP (Subjective, Objective, Assessment, Plan) notes from medi… ▽ More

    Submitted 27 September, 2024; originally announced October 2024.

    Comments: 8 Figures, 7 page, IEEE standard research paper

  49. arXiv:2410.00526  [pdf, other

    cs.CL

    Benchmarking Large Language Models for Conversational Question Answering in Multi-instructional Documents

    Authors: Shiwei Wu, Chen Zhang, Yan Gao, Qimeng Wang, Tong Xu, Yao Hu, Enhong Chen

    Abstract: Instructional documents are rich sources of knowledge for completing various tasks, yet their unique challenges in conversational question answering (CQA) have not been thoroughly explored. Existing benchmarks have primarily focused on basic factual question-answering from single narrative documents, making them inadequate for assessing a model`s ability to comprehend complex real-world instructio… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  50. arXiv:2410.00425  [pdf, other

    cs.RO cs.AI

    ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI

    Authors: Stone Tao, Fanbo Xiang, Arth Shukla, Yuzhe Qin, Xander Hinrichsen, Xiaodi Yuan, Chen Bao, Xinsong Lin, Yulin Liu, Tse-kai Chan, Yuan Gao, Xuanlin Li, Tongzhou Mu, Nan Xiao, Arnav Gurha, Zhiao Huang, Roberto Calandra, Rui Chen, Shan Luo, Hao Su

    Abstract: Simulation has enabled unprecedented compute-scalable approaches to robot learning. However, many existing simulation frameworks typically support a narrow range of scenes/tasks and lack features critical for scaling generalizable robotics and sim2real. We introduce and open source ManiSkill3, the fastest state-visual GPU parallelized robotics simulator with contact-rich physics targeting generali… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: Project website: http://maniskill.ai/