
Showing 1–50 of 4,313 results for author: Liu, T

  1. arXiv:2410.15989  [pdf, other]

    physics.space-ph physics.plasm-ph

    Interaction of the Prominence Plasma within the Magnetic Cloud of an ICME with the Earth's Bow Shock

    Authors: Hadi Madanian, Li-Jen Chen, Jonathan Ng, Michael J. Starkey, Stephen A. Fuselier, Naoki Bessho, Daniel J. Gershman, Terry Z. Liu

    Abstract: The magnetic cloud within an interplanetary coronal mass ejection (ICME) is characterized by high magnetic field intensities. In this study, we investigate the interaction of a magnetic cloud carrying a density structure with the Earth's bow shock during the ICME event on 24 April 2023. Elevated abundances of cold protons and heavier ions, namely alpha particles, and singly charged helium ions associa…

    Submitted 21 October, 2024; originally announced October 2024.

  2. Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue

    Authors: Longxuan Ma, Jiapeng Li, Mingda Li, Wei-Nan Zhang, Ting Liu

    Abstract: Document-grounded dialogue (DGD) uses documents as external knowledge for dialogue generation. Correctly understanding the dialogue context is crucial for selecting knowledge from the document and generating proper responses. In this paper, we propose using a dialogue policy to help the dialogue understanding in DGD. Our dialogue policy consists of two kinds of guiding signals: utterance function…

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: 29 pages, 9 figures, 14 tables, TOIS 2024

    ACM Class: I.2.7

    Journal ref: ACM Transactions on Information Systems, Volume 42, Issue 2, 08 November 2023

  3. arXiv:2410.15913  [pdf, other]

    astro-ph.GA astro-ph.SR

    The magnetic field in quiescent star-forming filament G16.96+0.27

    Authors: Qi-Lao Gu, Tie Liu, Zhi-Qiang Shen, Sihan Jiao, Julien Montillaud, Mika Juvela, Xing Lu, Chang Won Lee, Junhao Liu, Pak Shing Li, Xunchuan Liu, Doug Johnstone, Woojin Kwon, Kee-Tae Kim, Ken'ichi Tatematsu, Patricio Sanhueza, Isabelle Ristorcelli, Patrick Koch, Qizhou Zhang, Kate Pattle, Naomi Hirano, Dana Alina, James Di Francesco

    Abstract: We present 850 μm thermal dust polarization observations with a resolution of 14.4" (~0.13 pc) towards an infrared dark cloud G16.96+0.27 using JCMT/POL-2. The average magnetic field orientation, which roughly agrees with the larger-scale magnetic field orientation traced by the Planck 353 GHz data, is approximately perpendicular to the filament structure. The estimated plane-of-sky magnetic field…

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted by ApJ. 13 pages, 5 figures

  4. arXiv:2410.15765  [pdf, other]

    physics.geo-ph cs.LG

    SeisLM: a Foundation Model for Seismic Waveforms

    Authors: Tianlin Liu, Jannes Münchmeyer, Laura Laurenti, Chris Marone, Maarten V. de Hoop, Ivan Dokmanić

    Abstract: We introduce the Seismic Language Model (SeisLM), a foundational model designed to analyze seismic waveforms -- signals generated by Earth's vibrations such as the ones originating from earthquakes. SeisLM is pretrained on a large collection of open-source seismic datasets using a self-supervised contrastive loss, akin to BERT in language modeling. This approach allows the model to learn general s…

    Submitted 21 October, 2024; originally announced October 2024.

  5. arXiv:2410.15750  [pdf, ps, other]

    math.AP

    Normalized solutions for a class of Sobolev critical Schrodinger systems

    Authors: Houwang Li, Tianhao Liu, Wenming Zou

    Abstract: This paper focuses on the existence and multiplicity of normalized solutions for the coupled Schrodinger system with Sobolev critical coupling term. We present several existence and multiplicity results under some explicit conditions. Furthermore, we present a non-existence result for the defocusing case. This paper, together with the paper [T. Bartsch, H. W. Li and W. M. Zou. Calc. Var. Partial D…

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Any comments are welcome

  6. arXiv:2410.15651  [pdf, other]

    cs.LG

    Understanding and Alleviating Memory Consumption in RLHF for LLMs

    Authors: Jin Zhou, Hanmei Yang, Steven Tang, Mingcan Xiang, Hui Guan, Tongping Liu

    Abstract: Fine-tuning with Reinforcement Learning with Human Feedback (RLHF) is essential for aligning large language models (LLMs). However, RLHF often encounters significant memory challenges. This study is the first to examine memory usage in the RLHF context, exploring various memory management strategies and unveiling the reasons behind excessive memory consumption. Additionally, we introduce a simple…

    Submitted 21 October, 2024; originally announced October 2024.

  7. arXiv:2410.15333  [pdf, other]

    astro-ph.GA

    The ALMA-QUARKS Survey: Fibers' role in star formation unveiled in an intermediate-mass protocluster region of the Vela D cloud

    Authors: Dongting Yang, HongLi Liu, Tie Liu, Anandmayee Tej, Xunchuan Liu, Jinhua He, Guido Garay, Amelia Stutz, Lei Zhu, Sheng-Li Qin, Fengwei Xu, Pak-Shing Li, Mika Juvela, Pablo Garcia, Paul F. Goldsmith, Siju Zhang, Xindi Tang, Patricio Sanhueza, Shanghuo Li, Chang Won Lee, Swagat Ranjan Das, Wenyu Jiao, Xiaofeng Mai, Prasanta Gorai, Yichen Zhang, et al. (10 additional authors not shown)

    Abstract: In this paper, we present a detailed analysis of the IRS 17 filament within the intermediate-mass protocluster IRAS 08448-4343 (of $\sim\,10^3\,\rm M_{\odot}$), using ALMA data from the ATOMS 3-mm and QUARKS 1.3-mm surveys. The IRS 17 filament, which spans $\sim$54000 au ($0.26\,\rm pc$) in length and $\sim$4000 au ($0.02\,\rm pc$) in width, exhibits a complex, multi-component velocity field, and…

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 19 pages, 10 figures, 4 tables, accepted by ApJ

  8. arXiv:2410.15229  [pdf]

    cs.CV cs.LG physics.app-ph physics.med-ph

    Deep Learning-based Detection of Bacterial Swarm Motion Using a Single Image

    Authors: Yuzhu Li, Hao Li, Weijie Chen, Keelan O'Riordan, Neha Mani, Yuxuan Qi, Tairan Liu, Sridhar Mani, Aydogan Ozcan

    Abstract: Distinguishing between swarming and swimming, the two principal forms of bacterial movement, holds significant conceptual and clinical relevance. This is because bacteria that exhibit swarming capabilities often possess unique properties crucial to the pathogenesis of infectious diseases and may also have therapeutic potential. Here, we report a deep learning-based swarming classifier that rapidly…

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 17 Pages, 4 Figures

  9. arXiv:2410.15061  [pdf, other]

    cond-mat.dis-nn quant-ph

    Classifying extended, localized and critical states in quasiperiodic lattices via unsupervised learning

    Authors: Bohan Zheng, Siyu Zhu, Xingping Zhou, Tong Liu

    Abstract: Classification of quantum phases is one of the most important areas of research in condensed matter physics. In this work, we obtain the phase diagram of one-dimensional quasiperiodic models via unsupervised learning. Firstly, we choose two advanced unsupervised learning algorithms, Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Ordering Points To Identify the Clustering…

    Submitted 19 October, 2024; originally announced October 2024.

  10. arXiv:2410.14361  [pdf, other]

    cs.CL

    Efficiently Computing Susceptibility to Context in Language Models

    Authors: Tianyu Liu, Kevin Du, Mrinmaya Sachan, Ryan Cotterell

    Abstract: One strength of modern language models is their ability to incorporate information from a user-input context when answering queries. However, they are not equally sensitive to the subtle changes to that context. To quantify this, Du et al. (2024) give an information-theoretic metric to measure such sensitivity. Their metric, susceptibility, is defined as the degree to which contexts can influence…

    Submitted 18 October, 2024; originally announced October 2024.

  11. arXiv:2410.13515  [pdf, other]

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  12. arXiv:2410.13496  [pdf, other]

    cs.RO

    State Estimation Transformers for Agile Legged Locomotion

    Authors: Chen Yu, Yichu Yang, Tianlin Liu, Yangwei You, Mingliang Zhou, Diyun Xiang

    Abstract: We propose a state estimation method that can accurately predict the robot's privileged states to push the limits of quadruped robots in executing advanced skills such as jumping in the wild. In particular, we present the State Estimation Transformers (SET), an architecture that casts the state estimation problem as conditional sequence modeling. SET outputs the robot states that are hard to obtai…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: Accepted by IROS 2024

  13. arXiv:2410.13478  [pdf, other]

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be…

    Submitted 17 October, 2024; originally announced October 2024.

  14. arXiv:2410.13408  [pdf, other]

    cs.LG cs.AI cs.CL

    MoR: Mixture of Ranks for Low-Rank Adaptation Tuning

    Authors: Chuanyu Tang, Yilong Chen, Zhenyu Zhang, Junyuan Shang, Wenyuan Zhang, Yong Huang, Tingwen Liu

    Abstract: Low-Rank Adaptation (LoRA) drives research to align its performance with full fine-tuning. However, significant challenges remain: (1) Simply increasing the rank size of LoRA does not effectively capture high-rank information, which leads to a performance bottleneck. (2) MoE-style LoRA methods substantially increase parameters and inference latency, contradicting the goals of efficient fine-tuning…

    Submitted 17 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: 11 pages, 7 figures

  15. arXiv:2410.13369  [pdf, other]

    astro-ph.HE

    A Neutron Capture Explanation for the 10 MeV Emission Line Seen in GRB 221009A

    Authors: Jiahuan Zhu, Hua Feng, Tong Liu

    Abstract: The brightest ever gamma-ray burst (GRB) 221009A displays a significant emission line component around 10 MeV. As the GRB central engine is neutron-rich, we propose that the emission line could be originally due to the 2.223 MeV gamma-rays following neutron capture with protons. The measured line profile can be adequately fitted with a neutron capture model that involves thermal broadening and a b…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 5 pages, 2 figures, 1 table, Submitted to ApJ Letters on July 9, 2024, referees' reports not received so far

  16. arXiv:2410.13368  [pdf, other]

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Utilizing $4.5~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  17. arXiv:2410.13351  [pdf, other]

    cs.CL cs.AI cs.LG

    Representation Learning of Structured Data for Medical Foundation Models

    Authors: Vijay Prakash Dwivedi, Viktor Schlegel, Andy T. Liu, Thanh-Tung Nguyen, Abhinav Ramesh Kashyap, Jeng Wei, Wei-Hsian Yin, Stefan Winkler, Robby T. Tan

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various domains, including healthcare. However, their ability to effectively represent structured non-textual data, such as the alphanumeric medical codes used in records like ICD-10 or SNOMED-CT, is limited and has been particularly exposed in recent research. This paper examines the challenges LLMs face in processing me…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024 Workshop on Unifying Representations in Neural Models (UniReps 2024)

  18. arXiv:2410.13051  [pdf, other]

    cs.LG cs.CL cs.IR

    Supply Chain Network Extraction and Entity Classification Leveraging Large Language Models

    Authors: Tong Liu, Hadi Meidani

    Abstract: Supply chain networks are critical to the operational efficiency of industries, yet their increasing complexity presents significant challenges in mapping relationships and identifying the roles of various entities. Traditional methods for constructing supply chain networks rely heavily on structured datasets and manual data collection, limiting their scope and efficiency. In contrast, recent adva…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 11 pages, 4 figures

  19. arXiv:2410.12620  [pdf, other]

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  20. arXiv:2410.12474  [pdf, other]

    cs.CV cs.LG

    Mind the Gap Between Prototypes and Images in Cross-domain Finetuning

    Authors: Hongduan Tian, Feng Liu, Zhanke Zhou, Tongliang Liu, Chengqi Zhang, Bo Han

    Abstract: In cross-domain few-shot classification (CFC), recent works mainly focus on adapting a simple transformation head on top of a frozen pre-trained backbone with few labeled data to project embeddings into a task-specific metric space where classification can be performed by measuring similarities between image instance and prototype representations. Technically, an assumption implicitly adopted in s…

    Submitted 20 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

  21. arXiv:2410.12089  [pdf, other]

    astro-ph.CO

    BICEP/Keck XVIII: Measurement of BICEP3 polarization angles and consequences for constraining cosmic birefringence and inflation

    Authors: BICEP/Keck Collaboration, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, D. Beck, J. J. Bock, H. Boenish, V. Buza, J. R. Cheshire IV, J. Connors, J. Cornelison, M. Crumrine, A. J. Cukierman, E. Denison, L. Duband, M. Eiben, B. D. Elwood, S. Fatigoni, J. P. Filippini, A. Fortes, M. Gao, et al. (60 additional authors not shown)

    Abstract: We use a custom-made calibrator to measure individual detectors' polarization angles of BICEP3, a small aperture telescope observing the cosmic microwave background (CMB) at 95 GHz from the South Pole. We describe our calibration strategy and the statistical and systematic uncertainties associated with the measurement. We reach an unprecedented precision for such a measurement on a CMB experiment, wi…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 29 Pages, 17 Figures, 6 Tables, as submitted to PRD

  22. arXiv:2410.11607  [pdf, other]

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  23. arXiv:2410.10118  [pdf, other]

    cs.LG physics.chem-ph

    Physical Consistency Bridges Heterogeneous Data in Molecular Multi-Task Learning

    Authors: Yuxuan Ren, Dihan Zheng, Chang Liu, Peiran Jin, Yu Shi, Lin Huang, Jiyan He, Shengjie Luo, Tao Qin, Tie-Yan Liu

    Abstract: In recent years, machine learning has demonstrated impressive capability in handling molecular science tasks. To support various molecular properties at scale, machine learning models are trained in the multi-task learning paradigm. Nevertheless, data of different molecular properties are often not aligned: some quantities, e.g. equilibrium structure, demand more cost to compute than others, e.g.…

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at NeurIPS 2024

  24. arXiv:2410.10100  [pdf, other]

    astro-ph.GA astro-ph.HE

    Could the inter-band lag of active galactic nucleus vary randomly?

    Authors: Zhen-Bo Su, Zhen-Yi Cai, Jun-Xian Wang, Tinggui Wang, Yongquan Xue, Min-Xuan Cai, Lulu Fan, Hengxiao Guo, Zhicheng He, Zizhao He, Xu-Fan Hu, Ji-an Jiang, Ning Jiang, Wen-Yong Kang, Lei Lei, Guilin Liu, Teng Liu, Zhengyan Liu, Zhenfeng Sheng, Mouyuan Sun, Wen Zhao

    Abstract: The inter-band lags among the optical broad-band continua of active galactic nuclei (AGNs) have been intensively explored over the past decade. However, the nature of the lags remains under debate. Here utilizing two distinct scenarios for AGN variability, i.e., the thermal fluctuation of accretion disk and the reprocessing of both the accretion disk and clouds in the broad line region, we show th…

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 16 pages, 10 figures. Accepted for publication in Astrophysical Journal, comments are welcome!

  25. arXiv:2410.09908  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning

    Authors: Pengfei Jin, Peng Shu, Sekeun Kim, Qing Xiao, Sifan Song, Cheng Chen, Tianming Liu, Xiang Li, Quanzheng Li

    Abstract: Foundation models have become a cornerstone in deep learning, with techniques like Low-Rank Adaptation (LoRA) offering efficient fine-tuning of large models. Similarly, methods such as Retrieval-Augmented Generation (RAG), which leverage vectorized databases, have further improved model performance by grounding outputs in external information. While these approaches have demonstrated notable succe…

    Submitted 13 October, 2024; originally announced October 2024.

  26. arXiv:2410.09845  [pdf, other]

    cs.CV

    Understanding Robustness of Parameter-Efficient Tuning for Image Classification

    Authors: Jiacheng Ruan, Xian Gao, Suncheng Xiang, Mingye Xie, Ting Liu, Yuzhuo Fu

    Abstract: Parameter-efficient tuning (PET) techniques calibrate the model's predictions on downstream tasks by freezing the pre-trained models and introducing a small number of learnable parameters. However, despite the numerous PET methods proposed, their robustness has not been thoroughly investigated. In this paper, we systematically explore the robustness of four classical PET techniques (e.g., VPT, Ada…

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 5 pages, 2 figures. Work in Progress

  27. arXiv:2410.09674  [pdf, other]

    eess.IV cs.CV cs.LG cs.NE

    EG-SpikeFormer: Eye-Gaze Guided Transformer on Spiking Neural Networks for Medical Image Analysis

    Authors: Yi Pan, Hanqi Jiang, Junhao Chen, Yiwei Li, Huaqin Zhao, Yifan Zhou, Peng Shu, Zihao Wu, Zhengliang Liu, Dajiang Zhu, Xiang Li, Yohannes Abate, Tianming Liu

    Abstract: Neuromorphic computing has emerged as a promising energy-efficient alternative to traditional artificial intelligence, predominantly utilizing spiking neural networks (SNNs) implemented on neuromorphic hardware. Significant advancements have been made in SNN-based convolutional neural networks (CNNs) and Transformer architectures. However, their applications in the medical imaging domain remain un…

    Submitted 12 October, 2024; originally announced October 2024.

  28. arXiv:2410.09540  [pdf, ps, other]

    gr-qc astro-ph.HE

    Effects of orbital eccentricity on continuous gravitational waveforms from triaxially-deformed precessing neutron stars in tight binaries

    Authors: Wen-Fan Feng, Tan Liu, Yan Wang, Lijing Shao

    Abstract: The successful detection of continuous gravitational waves (GWs) from spinning neutron stars (NSs) will shape our understanding of the physical properties of dense matter under extreme conditions. Binary population synthesis simulations show that forthcoming space-borne GW detectors may be capable of detecting some tight Galactic double NSs (DNSs) with 10-minute orbital periods. Successfully searc…

    Submitted 12 October, 2024; originally announced October 2024.

  29. arXiv:2410.09401  [pdf, other]

    cs.CR cs.AI

    A Novel Approach to Malicious Code Detection Using CNN-BiLSTM and Feature Fusion

    Authors: Lixia Zhang, Tianxu Liu, Kaihui Shen, Cheng Chen

    Abstract: With the rapid advancement of Internet technology, the threat of malware to computer systems and network security has intensified. Malware affects individual privacy and security and poses risks to critical infrastructures of enterprises and nations. The increasing quantity and complexity of malware, along with its concealment and diversity, challenge traditional detection techniques. Static detec…

    Submitted 12 October, 2024; originally announced October 2024.

  30. arXiv:2410.08613  [pdf, other]

    cs.CV cs.AI

    Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation

    Authors: Zhe Dong, Yuzhe Sun, Yanfeng Gu, Tianzhu Liu

    Abstract: Given a natural language expression and a remote sensing image, the goal of referring remote sensing image segmentation (RRSIS) is to generate a pixel-level mask of the target object identified by the referring expression. In contrast to natural scenarios, expressions in RRSIS often involve complex geospatial relationships, with target objects of interest that vary significantly in scale and lack…

    Submitted 11 October, 2024; originally announced October 2024.

  31. arXiv:2410.08603  [pdf, other]

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773 GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with a significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and…

    Submitted 11 October, 2024; originally announced October 2024.

  32. arXiv:2410.07985  [pdf, other]

    cs.CL

    Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

    Authors: Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Qingxiu Dong, Lei Li, Chenghao Ma, Liang Chen, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Shanghaoran Quan, Ge Zhang, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang

    Abstract: Recent advancements in large language models (LLMs) have led to significant breakthroughs in mathematical reasoning capabilities. However, existing benchmarks like GSM8K or MATH are now being solved with high accuracy (e.g., OpenAI o1 achieves 94.8% on MATH dataset), indicating their inadequacy for truly challenging these models. To bridge this gap, we propose a comprehensive and challenging bench…

    Submitted 10 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: 26 Pages, 17 Figures

  33. arXiv:2410.07626  [pdf, other]

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  34. arXiv:2410.07594  [pdf]

    eess.SY

    Design and Characterization of High Efficiency Single-stage Electromagnetic Coil Guns

    Authors: Sophia Chen, Annie Peng, Ava Chen, Takyiu Liu

    Abstract: This study presents several novel approaches to improve the efficiency of a single-stage coil gun. Conventional designs typically feature a uniformly wound solenoid and a ferrite projectile. For our research, we constructed a microcontroller-based prototype to test several new enhancements, including the use of a bipolar current pulse, a stepped multilayer coil with non-uniform winding densities,…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 10 pages, 23 figures

  35. arXiv:2410.07524  [pdf, other]

    cs.CL cs.AI cs.LG

    Upcycling Large Language Models into Mixture of Experts

    Authors: Ethan He, Abhinav Khattar, Ryan Prenger, Vijay Korthikanti, Zijie Yan, Tong Liu, Shiqing Fan, Ashwath Aithal, Mohammad Shoeybi, Bryan Catanzaro

    Abstract: Upcycling pre-trained dense language models into sparse mixture-of-experts (MoE) models is an efficient approach to increase the model capacity of already trained models. However, optimal techniques for upcycling at scale remain unclear. In this work, we conduct an extensive study of upcycling methods and hyperparameters for billion-parameter scale language models. We propose a novel "virtual grou…

    Submitted 9 October, 2024; originally announced October 2024.

  36. arXiv:2410.07198  [pdf, other]

    physics.class-ph nlin.PS

    Nonlinear Coupling between Magnetic Gears

    Authors: Tianchi Liu

    Abstract: This study investigates the complex nonlinear coupling of magnetic gears arranged in proximity on a plane. Acknowledging the rich array of geometric and electromagnetic parameters involved, we initiate our exploration with a simplified model. By nondimensionalizing the key variables, we derive a novel nonlinear dynamics framework that abstracts away electromagnetic dependencies. Our approach inclu…

    Submitted 24 September, 2024; originally announced October 2024.

    Comments: 11 pages, 7 figures

  37. arXiv:2410.07033  [pdf, ps, other]

    astro-ph.HE

    Probing blackbody components in gamma-ray bursts from black hole neutrino-dominated accretion flows

    Authors: Xiao-Yan Li, Tong Liu, Bao-Quan Huang, Guo-Yu Li, Da-Bin Lin, Zhi-Lin Chen, Yun Wang

    Abstract: A stellar-mass black hole (BH) surrounded by a neutrino-dominated accretion flow (NDAF) is generally considered to be the central engine of gamma-ray bursts (GRBs). Neutrinos escaping from the disk will annihilate out of the disk to produce the fireball that could power GRBs with blackbody (BB) components. The initial GRB jet power and fireball launch radius are related to the annihilation luminos…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 8 pages, 1 table, 1 figure, accepted for publication in ApJ

  38. arXiv:2410.06811  [pdf, other]

    cs.CV

    Rethinking the Evaluation of Visible and Infrared Image Fusion

    Authors: Dayan Guan, Yixuan Wu, Tianzhu Liu, Alex C. Kot, Yanfeng Gu

    Abstract: Visible and Infrared Image Fusion (VIF) has garnered significant interest across a wide range of high-level vision tasks, such as object detection and semantic segmentation. However, the evaluation of VIF methods remains challenging due to the absence of ground truth. This paper proposes a Segmentation-oriented Evaluation Approach (SEA) to assess VIF methods by incorporating the semantic segmentat…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: The code has been released at https://github.com/Yixuan-2002/SEA/

  39. arXiv:2410.06730  [pdf, other]

    astro-ph.HE astro-ph.GA

    Systematic collapse of the accretion disc in AGN confirmed by UV photometry and broad line spectra

    Authors: Jia-Lai Kang, Chris Done, Scott Hagen, Matthew J. Temple, John D. Silverman, Junyao Li, Teng Liu

    Abstract: A recent study on the spectral energy distribution (SED) of AGN combined unobscured X-ray sources from the eROSITA eFEDS Survey with high quality optical imaging from Subaru's Hyper Suprime-Cam (HSC). The HSC data enabled accurate host galaxy subtraction as well as giving a uniform black hole mass estimator from the stellar mass. The resulting stacked optical/X-ray SEDs for black holes at fixed ma…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 10 pages, 3 figures, 2 appendices. Submitted to MNRAS. Comments are very welcome!

  40. arXiv:2410.06511  [pdf, other]

    cs.CL cs.AI cs.DC cs.LG

    TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training

    Authors: Wanchao Liang, Tianyu Liu, Less Wright, Will Constable, Andrew Gu, Chien-Chin Huang, Iris Zhang, Wei Feng, Howard Huang, Junjie Wang, Sanket Purandare, Gokul Nadathur, Stratos Idreos

    Abstract: The development of large language models (LLMs) has been instrumental in advancing state-of-the-art natural language processing applications. Training LLMs with billions of parameters and trillions of tokens requires sophisticated distributed systems that enable composing and comparing several state-of-the-art techniques in order to efficiently scale across thousands of accelerators. However, exist…

    Submitted 8 October, 2024; originally announced October 2024.

  41. arXiv:2410.06500  [pdf, other]

    hep-ex

    Search for the radiative decays $D^+\toγρ^+$ and $D^+\toγK^{*+}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (648 additional authors not shown)

    Abstract: We search for the radiative decays $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ using 20.3~fb$^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and the upper limits on the branching fractions of $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ at 90\% confidence level ar…

    Submitted 8 October, 2024; originally announced October 2024.

  42. arXiv:2410.05808  [pdf, other]

    cs.CV

    Vision Transformer based Random Walk for Group Re-Identification

    Authors: Guoqing Zhang, Tianqi Liu, Wenxuan Fang, Yuhui Zheng

    Abstract: Group re-identification (re-ID) aims to match groups with the same people under different cameras, and mainly involves the challenges of changes in group members and layout. Most existing methods usually use the k-nearest neighbor algorithm to update node features to consider changes in group membership, but these methods cannot solve the problem of group layout changes. To this end, we propose a no…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 6 pages

  43. arXiv:2410.05736  [pdf, ps, other]

    hep-ex

    Observation of an axial-vector state in the study of $ψ(3686) \to φηη'$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (625 additional authors not shown)

    Abstract: Using (2712.4 $\pm$ 14.3)$\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, a partial wave analysis of the decay $ψ(3686) \to φηη' $ is performed with the covariant tensor approach. An axial-vector state with a mass near 2.3 $\rm GeV/c^2$ is observed for the first time. Its mass and width are measured to be 2316…

    Submitted 8 October, 2024; originally announced October 2024.

  44. arXiv:2410.05317  [pdf, other]

    cs.LG cs.AI cs.CV

    Accelerating Diffusion Transformers with Token-wise Feature Caching

    Authors: Chang Zou, Xuyang Liu, Ting Liu, Siteng Huang, Linfeng Zhang

    Abstract: Diffusion transformers have shown significant effectiveness in both image and video synthesis at the expense of huge computation costs. To address this problem, feature caching methods have been introduced to accelerate diffusion transformers by caching the features in previous timesteps and reusing them in the following timesteps. However, previous caching methods ignore that different tokens exh…

    Submitted 14 October, 2024; v1 submitted 4 October, 2024; originally announced October 2024.

  45. arXiv:2410.05281  [pdf, other]

    cs.CE cond-mat.mtrl-sci cs.LG physics.comp-ph

    Micrometer: Micromechanics Transformer for Predicting Mechanical Responses of Heterogeneous Materials

    Authors: Sifan Wang, Tong-Rui Liu, Shyam Sankaran, Paris Perdikaris

    Abstract: Heterogeneous materials, crucial in various engineering applications, exhibit complex multiscale behavior, which challenges the effectiveness of traditional computational methods. In this work, we introduce the Micromechanics Transformer (Micrometer), an artificial intelligence (AI) framework for predicting the mechanical response of heterogeneous materials, bridging the gap between advanced…

    Submitted 23 September, 2024; originally announced October 2024.

    Comments: 36 pages, 12 figures, 9 tables

  46. arXiv:2410.05218  [pdf, other]

    cs.LG cs.CL stat.ML

    Density estimation with LLMs: a geometric investigation of in-context learning trajectories

    Authors: Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls

    Abstract: Large language models (LLMs) demonstrate remarkable emergent abilities to perform in-context learning across various tasks, including time series forecasting. This work investigates LLMs' ability to estimate probability density functions (PDFs) from data observed in-context; such density estimation (DE) is a fundamental task underlying many probabilistic modeling problems. We leverage the Intensiv…

    Submitted 9 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

  47. arXiv:2410.04524  [pdf, other]

    cs.CL

    Towards Secure Tuning: Mitigating Security Risks Arising from Benign Instruction Fine-Tuning

    Authors: Yanrui Du, Sendong Zhao, Jiawei Cao, Ming Ma, Danyang Zhao, Fenglei Fan, Ting Liu, Bing Qin

    Abstract: Instruction Fine-Tuning (IFT) has become an essential method for adapting base Large Language Models (LLMs) into variants for professional and private use. However, researchers have raised concerns over a significant decrease in LLMs' security following IFT, even when the IFT process involves entirely benign instructions (termed Benign IFT). Our study represents a pioneering effort to mitigate the…

    Submitted 6 October, 2024; originally announced October 2024.

  48. arXiv:2410.04503  [pdf, other]

    cs.CL cs.AI

    LRHP: Learning Representations for Human Preferences via Preference Pairs

    Authors: Chenglong Wang, Yang Gan, Yifu Huo, Yongyu Mu, Qiaozhi He, Murun Yang, Tong Xiao, Chunliang Zhang, Tongran Liu, Jingbo Zhu

    Abstract: To improve human-preference alignment training, current research has developed numerous preference datasets consisting of preference pairs labeled as "preferred" or "dispreferred". These preference pairs are typically used to encode human preferences into a single numerical value through reward modeling, which acts as a reward signal during reinforcement learning from human feedback (RLHF). Howeve…

    Submitted 6 October, 2024; originally announced October 2024.

  49. arXiv:2410.04407  [pdf, other]

    cs.CL

    Lens: Rethinking Multilingual Enhancement for Large Language Models

    Authors: Weixiang Zhao, Yulin Hu, Jiahe Guo, Xingyu Sui, Tongtong Wu, Yang Deng, Yanyan Zhao, Bing Qin, Wanxiang Che, Ting Liu

    Abstract: Despite the growing global demand for large language models (LLMs) that serve users from diverse linguistic backgrounds, most cutting-edge LLMs remain predominantly English-centric. This creates a performance gap across languages, restricting access to advanced AI services for non-English speakers. Current methods to enhance multilingual capabilities largely rely on data-driven post-training techn…

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 21 pages, 9 figures, 5 tables

  50. arXiv:2410.04358  [pdf]

    physics.med-ph

    Enabling Clinical Use of Linear Energy Transfer in Proton Therapy for Head and Neck Cancer -- A Review of Implications for Treatment Planning and Adverse Events Study

    Authors: Jingyuan Chen, Yunze Yang, Hongying Feng, Chenbin Liu, Lian Zhang, Jason M. Holmes, Zhengliang Liu, Haibo Lin, Tianming Liu, Charles B. Simone II, Nancy Y. Lee, Steven E. Frank, Daniel J. Ma, Samir H. Patel, Wei Liu

    Abstract: Proton therapy offers significant advantages due to its unique physical and biological properties, particularly the Bragg peak, enabling precise dose delivery to tumors while sparing healthy tissues. However, the clinical implementation is challenged by the oversimplification of the relative biological effectiveness (RBE) as a fixed value of 1.1, which does not account for the complex interplay be…

    Submitted 6 October, 2024; originally announced October 2024.