Skip to main content

Showing 1–50 of 7,745 results for author: Zhang, C

  1. arXiv:2410.16237  [pdf, other

    cs.MA

    IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems

    Authors: Yihuan Mao, Yipeng Kang, Peilun Li, Ning Zhang, Wei Xu, Chongjie Zhang

    Abstract: As large language model (LLM) agents increasingly integrate into our infrastructure, their robust coordination and message synchronization become vital. The Byzantine Generals Problem (BGP) is a critical model for constructing resilient multi-agent systems (MAS) under adversarial attacks. It describes a scenario where malicious agents with unknown identities exist in the system-situations that, in… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2410.15768  [pdf, other

    cs.CV cs.AI cs.GR

    Learning to Synthesize Graphics Programs for Geometric Artworks

    Authors: Qi Bing, Chaoyi Zhang, Weidong Cai

    Abstract: Creating and understanding art has long been a hallmark of human ability. When presented with finished digital artwork, professional graphic artists can intuitively deconstruct and replicate it using various drawing tools, such as the line tool, paint bucket, and layer features, including opacity and blending modes. While most recent research in this field has focused on art generation, proposing… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: ICPR 2024

  3. arXiv:2410.15760  [pdf, other

    cs.CV cs.AI

    DeepIcon: A Hierarchical Network for Layer-wise Icon Vectorization

    Authors: Qi Bing, Chaoyi Zhang, Weidong Cai

    Abstract: In contrast to the well-established technique of rasterization, vectorization of images poses a significant challenge in the field of computer graphics. Recent learning-based methods for converting raster images to vector formats frequently suffer from incomplete shapes, redundant path prediction, and a lack of accuracy in preserving the semantics of the original content. These shortcomings severe… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted as Oral Presentation at DICTA 2024

  4. arXiv:2410.15526  [pdf, other

    cs.LG cs.DC

    SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training

    Authors: Jinda Jia, Cong Xie, Hanlin Lu, Daoce Wang, Hao Feng, Chengming Zhang, Baixi Sun, Haibin Lin, Zhi Zhang, Xin Liu, Dingwen Tao

    Abstract: Recent years have witnessed a clear trend towards language models with an ever-increasing number of parameters, as well as the growing training overhead and memory usage. Distributed training, particularly through Sharded Data Parallelism (ShardedDP) which partitions optimizer states among workers, has emerged as a crucial technique to mitigate training time and memory usage. Yet, a major challeng… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024

  5. arXiv:2410.15336  [pdf, other

    stat.ML cs.LG

    Diffusion-PINN Sampler

    Authors: Zhekun Shi, Longlin Yu, Tianyu Xie, Cheng Zhang

    Abstract: Recent success of diffusion models has inspired a surge of interest in developing sampling techniques using reverse diffusion processes. However, accurately estimating the drift term in the reverse stochastic differential equation (SDE) solely from the unnormalized target density poses significant challenges, hindering existing methods from achieving state-of-the-art performance. In this paper, we… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 33 pages, 7 figures

  6. arXiv:2410.15099  [pdf

    cond-mat.supr-con

    A new approach to N-doped di-molybdenum carbide with enhanced superconductivity via Urea

    Authors: Longfu Li, Lei Shi, Lingyong Zeng, Kuan Li, Peifeng Yu, Kangwang Wang, Chao Zhang, Rui Chen, Zaichen Xiang, Yunwei Zhang, Huixia Luo

    Abstract: Chemical doping is a critical factor in the development of new superconductors or optimizing the superconducting transition temperature (Tc) of the parent superconducting materials. Herein, a new simple urea approach is developed to synthesize the N-doped alfa-Mo2C. Benefiting from the simple urea method, a broad superconducting dome is found in the Mo2C1-xNx compositions. XRD results show that th… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 15 pages, 6 Figures, 1 Table

    Journal ref: Chin. Phys. Lett. 2024

  7. arXiv:2410.14309  [pdf, other

    cs.CL cs.AI

    LoGU: Long-form Generation with Uncertainty Expressions

    Authors: Ruihan Yang, Caiqi Zhang, Zhisong Zhang, Xinting Huang, Sen Yang, Nigel Collier, Dong Yu, Deqing Yang

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities, they still struggle with generating factually incorrect content (i.e., hallucinations). A promising approach to mitigate this issue is enabling models to express uncertainty when unsure. Previous research on uncertainty modeling has primarily focused on short-form QA, but realworld applications often require much longer respon… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  8. arXiv:2410.14268  [pdf, other

    cs.CL cs.LG

    MoDification: Mixture of Depths Made Easy

    Authors: Chen Zhang, Meizhi Zhong, Qimeng Wang, Xuantao Lu, Zheyu Ye, Chengqiang Lu, Yan Gao, Yao Hu, Kehai Chen, Min Zhang, Dawei Song

    Abstract: Long-context efficiency has recently become a trending topic in serving large language models (LLMs). And mixture of depths (MoD) is proposed as a perfect fit to bring down both latency and memory. In this paper, however, we discover that MoD can barely transform existing LLMs without costly training over an extensive number of tokens. To enable the transformations from any LLMs to MoD ones, we sh… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 12 pages, 9 figures, 5 tables, work in progress

  9. arXiv:2410.14215  [pdf, other

    eess.SP cs.IT

    Jamming Detection and Channel Estimation for Spatially Correlated Beamspace Massive MIMO

    Authors: Pengguang Du, Cheng Zhang, Yindi Jing, Chao Fang, Zhilei Zhang, Yongming Huang

    Abstract: In this paper, we investigate the problem of jamming detection and channel estimation during multi-user uplink beam training under random pilot jamming attacks in beamspace massive multi-input-multi-output (MIMO) systems. For jamming detection, we distinguish the signals from the jammer and the user by projecting the observation signals onto the pilot space. By using the multiple projected observa… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 13 pages, 9 figures. The paper has been submitted to an IEEE journal for possible publication

  10. arXiv:2410.14144  [pdf, other

    cs.CL cs.AI

    A Lightweight Multi Aspect Controlled Text Generation Solution For Large Language Models

    Authors: Chenyang Zhang, Jiayi Lin, Haibo Tong, Bingxuan Hou, Dongyu Zhang, Jialin Li, Junli Wang

    Abstract: Large language models (LLMs) show remarkable abilities with instruction tuning. However, they fail to achieve ideal tasks when lacking high-quality instruction tuning data on target tasks. Multi-Aspect Controllable Text Generation (MCTG) is a representative task for this dilemma, where aspect datasets are usually biased and correlated. Existing work exploits additional model structures and strateg… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  11. arXiv:2410.13854  [pdf, other

    cs.CL cs.AI cs.CV cs.CY

    Can MLLMs Understand the Deep Implication Behind Chinese Images?

    Authors: Chenhao Zhang, Xi Feng, Yuelin Bai, Xinrun Du, Jinchang Hou, Kaixin Deng, Guangzeng Han, Qinrui Li, Bingli Wang, Jiaheng Liu, Xingwei Qu, Yifei Zhang, Qixuan Zhao, Yiming Liang, Ziqiang Liu, Feiteng Fang, Min Yang, Wenhao Huang, Chenghua Lin, Ge Zhang, Shiwen Ni

    Abstract: As the capabilities of Multimodal Large Language Models (MLLMs) continue to improve, the need for higher-order capability evaluation of MLLMs is increasing. However, there is a lack of work evaluating MLLM for higher-order perception and understanding of Chinese visual content. To fill the gap, we introduce the **C**hinese **I**mage **I**mplication understanding **Bench**mark, **CII-Bench**, which… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 32 pages,18 figures. Project Page: https://cii-bench.github.io/ Code: https://github.com/MING_X/CII-Bench Dataset: https://huggingface.co/datasets/m-a-p/CII-Bench

  12. arXiv:2410.13837  [pdf, other

    cs.LG cs.AI cs.RO

    ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

    Authors: Chen Bo Calvin Zhang, Zhang-Wei Hong, Aldo Pacchiano, Pulkit Agrawal

    Abstract: Reward shaping is a critical component in reinforcement learning (RL), particularly for complex tasks where sparse rewards can hinder learning. While shaping rewards have been introduced to provide additional guidance, selecting effective shaping functions remains challenging and computationally expensive. This paper introduces Online Reward Selection and Policy Optimization (ORSO), a novel approa… ▽ More

    Submitted 19 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: preprint, 35 pages, 23 figures

  13. arXiv:2410.13748  [pdf, other

    hep-ex

    Test of lepton flavour universality with $B_s^0 \rightarrow φ\ell^+\ell^-$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1124 additional authors not shown)

    Abstract: Lepton flavour universality in rare $b\rightarrow s$ transitions is tested for the first time using $B_s^0$ meson decays. The measurements are performed using $pp$ collision data collected by the LHCb experiment between 2011 and 2018, corresponding to a total integrated luminosity of 9$\,{\rm fb}^{-1}$. Branching fraction ratios between the $B_s^0 \rightarrow φe^+e^-$ and… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3513/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-032, CERN-EP-2024-255

  14. arXiv:2410.13515  [pdf, other

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  15. arXiv:2410.13478  [pdf, other

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  16. arXiv:2410.13368  [pdf, other

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Utilizing 4.5${~\rm{fb}}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  17. arXiv:2410.13306  [pdf, other

    astro-ph.IM physics.ao-ph

    The cloud cover and meteorological parameters at the Lenghu site on the Tibetan Plateau

    Authors: Ruiyue Li, Fei He, Licai Deng, Xiaodian Chen, Fan Yang, Yong Zhao, Bo Zhang, Chunguang Zhang, Chen Yang, Tian Lan

    Abstract: The cloud cover and meteorological parameters serve as fundamental criteria for the qualification of an astronomical observatory working in optical and infrared wavelengths. In this paper, we present a systematic assessment of key meteorological parameters at the Lenghu site. The datasets adopted in this study includes the meteorological parameters collected at the local weather stations at the si… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: accepted for publication in MNRAS

  18. arXiv:2410.13285  [pdf, other

    cs.CV

    Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery

    Authors: Chuyu Zhang, Peiyan Gu, Xueyang Yu, Xuming He

    Abstract: We tackle the generalized category discovery (GCD) problem, which aims to discover novel classes in unlabeled datasets by leveraging the knowledge of known classes. Previous works utilize the known class knowledge through shared representation spaces. Despite their progress, our analysis experiments show that novel classes can achieve impressive clustering results on the feature space of a known c… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: Underreview. The first two authors contribute equally

  19. arXiv:2410.13260  [pdf, other

    cs.CR

    Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach

    Authors: Luyao Zou, Quang Hieu Vo, Kitae Kim, Huy Q. Le, Chu Myaet Thwal, Chaoning Zhang, Choong Seon Hong

    Abstract: In this paper, cyber-attack prevention for the prosumer-based electric vehicle (EV) charging stations (EVCSs) is investigated, which covers two aspects: 1) cyber-attack detection on prosumers' network traffic (NT) data, and 2) cyber-attack intervention. To establish an effective prevention mechanism, several challenges need to be tackled, for instance, the NT data per prosumer may be non-independe… ▽ More

    Submitted 18 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: 27 pages, 12 figures

  20. arXiv:2410.13247  [pdf, other

    cs.SE cs.AI cs.HC

    Enhancing Sentiment Analysis with Collaborative AI: Architecture, Predictions, and Deployment Strategies

    Authors: Chaofeng Zhang, Jia Hou, Xueting Tan, Caijuan Chen, Hiroshi Hashimoto

    Abstract: The advancement of large language model (LLM) based artificial intelligence technologies has been a game-changer, particularly in sentiment analysis. This progress has enabled a shift from highly specialized research environments to practical, widespread applications within the industry. However, integrating diverse AI models for processing complex multimodal data and the associated high costs of… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  21. arXiv:2410.13246  [pdf, other

    cs.CL cs.AI

    Atomic Calibration of LLMs in Long-Form Generations

    Authors: Caiqi Zhang, Ruihan Yang, Zhisong Zhang, Xinting Huang, Sen Yang, Dong Yu, Nigel Collier

    Abstract: Large language models (LLMs) often suffer from hallucinations, posing significant challenges for real-world applications. Confidence calibration, which estimates the underlying uncertainty of model predictions, is essential to enhance the LLMs' trustworthiness. Existing research on LLM calibration has primarily focused on short-form tasks, providing a single confidence score at the response level… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  22. arXiv:2410.12896  [pdf, other

    cs.CL

    A Survey on Data Synthesis and Augmentation for Large Language Models

    Authors: Ke Wang, Jiahui Zhu, Minjie Ren, Zeming Liu, Shiwei Li, Zongye Zhang, Chenkai Zhang, Xiaoyu Wu, Qiqi Zhan, Qingjie Liu, Yunhong Wang

    Abstract: The success of Large Language Models (LLMs) is inherently linked to the availability of vast, diverse, and high-quality data for training and evaluation. However, the growth rate of high-quality data is significantly outpaced by the expansion of training datasets, leading to a looming data exhaustion crisis. This underscores the urgent need to enhance data efficiency and explore new data sources.… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  23. arXiv:2410.12790  [pdf, other

    cs.CV cs.LG

    Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models

    Authors: Ce Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie

    Abstract: Test-time adaptation, which enables models to generalize to diverse data with unlabeled test samples, holds significant value in real-world scenarios. Recently, researchers have applied this setting to advanced pre-trained vision-language models (VLMs), developing approaches such as test-time prompt tuning to further extend their practical applicability. However, these methods typically focus sole… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024. Project page: https://zhangce01.github.io/DPE-CLIP

  24. arXiv:2410.12620  [pdf, other

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  25. arXiv:2410.12496  [pdf, other

    cs.DB cs.PL cs.SE

    Finding Logic Bugs in Spatial Database Engines via Affine Equivalent Inputs

    Authors: Wenjing Deng, Qiuyang Mang, Chengyu Zhang, Manuel Rigger

    Abstract: Spatial Database Management Systems (SDBMSs) aim to store, manipulate, and retrieve spatial data. SDBMSs are employed in various modern applications, such as geographic information systems, computer-aided design tools, and location-based services. However, the presence of logic bugs in SDBMSs can lead to incorrect results, substantially undermining the reliability of these applications. Detecting… ▽ More

    Submitted 17 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

  26. arXiv:2410.12474  [pdf, other

    cs.CV cs.LG

    Mind the Gap Between Prototypes and Images in Cross-domain Finetuning

    Authors: Hongduan Tian, Feng Liu, Zhanke Zhou, Tongliang Liu, Chengqi Zhang, Bo Han

    Abstract: In cross-domain few-shot classification (CFC), recent works mainly focus on adapting a simple transformation head on top of a frozen pre-trained backbone with few labeled data to project embeddings into a task-specific metric space where classification can be performed by measuring similarities between image instance and prototype representations. Technically, an assumption implicitly adopted in s… ▽ More

    Submitted 20 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

  27. arXiv:2410.12444  [pdf, other

    cs.CL

    Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models

    Authors: Mengze Hong, Yuanfeng Song, Di Jiang, Lu Wang, Zichang Guo, Chen Jason Zhang

    Abstract: Reliable responses of service chatbots are often achieved by employing retrieval-based methods that restrict answers to a knowledge base comprising predefined question-answer pairs (QA pairs). To accommodate potential variations in how a customer's query may be expressed, it emerges as the favored solution to augment these QA pairs with similar questions that are possibly diverse while remaining s… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  28. arXiv:2410.12428  [pdf, other

    cs.CL cs.AI

    Conformity in Large Language Models

    Authors: Xiaochen Zhu, Caiqi Zhang, Tom Stafford, Nigel Collier, Andreas Vlachos

    Abstract: The conformity effect describes the tendency of individuals to align their responses with the majority. Studying this bias in large language models (LLMs) is crucial, as LLMs are increasingly used in various information-seeking and decision-making tasks as conversation partners to improve productivity. Thus, conformity to incorrect responses can compromise their effectiveness. In this paper, we ad… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 16 pages (8 pages main body), 14 figures

  29. arXiv:2410.12347  [pdf, ps, other

    cs.GT

    Guaranteeing MMS for All but One Agent When Allocating Indivisible Chores

    Authors: Jiawei Qiu, Xiaowei Wu, Cong Zhang, Shengwei Zhou

    Abstract: We study the problem of allocating $m$ indivisible chores to $n$ agents with additive cost functions under the fairness notion of maximin share (MMS). In this work, we propose a notion called $α$-approximate all-but-one maximin share ($α$-AMMS) which is a stronger version of $α$-approximate MMS. An allocation is called $α$-AMMS if $n-1$ agents are guaranteed their MMS values and the remaining agen… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 18 pages, 7 figures

  30. arXiv:2410.12089  [pdf, other

    astro-ph.CO

    BICEP/Keck XVIII: Measurement of BICEP3 polarization angles and consequences for constraining cosmic birefringence and inflation

    Authors: BICEP/Keck Collaboration, :, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, D. Beck, J. J. Bock, H. Boenish, V. Buza, J. R. Cheshire IV, J. Connors, J. Cornelison, M. Crumrine, A. J. Cukierman, E. Denison, L. Duband, M. Eiben, B. D. Elwood, S. Fatigoni, J. P. Filippini, A. Fortes, M. Gao , et al. (60 additional authors not shown)

    Abstract: We use a custom-made calibrator to measure individual detectors' polarization angles of BICEP3, a small aperture telescope observing the cosmic microwave background (CMB) at 95GHz from the South Pole. We describe our calibration strategy and the statistical and systematic uncertainties associated with the measurement. We reach an unprecedented precision for such measurement on a CMB experiment, wi… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 29 Pages, 17 Figures, 6 Tables, as submitted to PRD

  31. arXiv:2410.11710  [pdf, other

    cs.CL

    MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

    Authors: Pei Wang, Yanan Wu, Zekun Wang, Jiaheng Liu, Xiaoshuai Song, Zhongyuan Peng, Ken Deng, Chenchen Zhang, Jiakai Wang, Junran Peng, Ge Zhang, Hangyu Guo, Zhaoxiang Zhang, Wenbo Su, Bo Zheng

    Abstract: Large Language Models (LLMs) have displayed massive improvements in reasoning and decision-making skills and can hold natural conversations with users. Recently, many tool-use benchmark datasets have been proposed. However, existing datasets have the following limitations: (1). Insufficient evaluation scenarios (e.g., only cover limited tool-use scenes). (2). Extensive evaluation costs (e.g., GPT… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  32. arXiv:2410.11607  [pdf, other

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  33. arXiv:2410.11576  [pdf, other

    cs.LG stat.ML

    The Best of Both Worlds: On the Dilemma of Out-of-distribution Detection

    Authors: Qingyang Zhang, Qiuxuan Feng, Joey Tianyi Zhou, Yatao Bian, Qinghua Hu, Changqing Zhang

    Abstract: Out-of-distribution (OOD) detection is essential for model trustworthiness which aims to sensitively identify semantic OOD samples and robustly generalize for covariate-shifted OOD samples. However, we discover that the superior OOD detection performance of state-of-the-art methods is achieved by secretly sacrificing the OOD generalization ability. Specifically, the classification accuracy of thes… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurlPS24. Code is available at https://github.com/QingyangZhang/DUL

  34. arXiv:2410.11560  [pdf, other

    cs.CV

    PSVMA+: Exploring Multi-granularity Semantic-visual Adaption for Generalized Zero-shot Learning

    Authors: Man Liu, Huihui Bai, Feng Li, Chunjie Zhang, Yunchao Wei, Meng Wang, Tat-Seng Chua, Yao Zhao

    Abstract: Generalized zero-shot learning (GZSL) endeavors to identify the unseen categories using knowledge from the seen domain, necessitating the intrinsic interactions between the visual features and attribute semantic features. However, GZSL suffers from insufficient visual-semantic correspondences due to the attribute diversity and instance diversity. Attribute diversity refers to varying semantic gran… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: Accepted to TPAMI 2024. arXiv admin note: text overlap with arXiv:2303.15322

  35. arXiv:2410.11285  [pdf, other

    cs.CV

    Scalable Indoor Novel-View Synthesis using Drone-Captured 360 Imagery with 3D Gaussian Splatting

    Authors: Yuanbo Chen, Chengyu Zhang, Jason Wang, Xuefan Gao, Avideh Zakhor

    Abstract: Scene reconstruction and novel-view synthesis for large, complex, multi-story, indoor scenes is a challenging and time-consuming task. Prior methods have utilized drones for data capture and radiance fields for scene reconstruction, both of which present certain challenges. First, in order to capture diverse viewpoints with the drone's front-facing camera, some approaches fly the drone in an unsta… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: Accepted to ECCV 2024 S3DSGR Workshop

  36. arXiv:2410.10894  [pdf, other

    stat.ML cs.LG

    COME: Test-time adaption by Conservatively Minimizing Entropy

    Authors: Qingyang Zhang, Yatao Bian, Xinke Kong, Peilin Zhao, Changqing Zhang

    Abstract: Machine learning models must continuously self-adjust themselves for novel data distribution in the open world. As the predominant principle, entropy minimization (EM) has been proven to be a simple yet effective cornerstone in existing test-time adaption (TTA) methods. While unfortunately its fatal limitation (i.e., overconfidence) tends to result in model collapse. For this issue, we propose to… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: Ongoing work

  37. arXiv:2410.10551  [pdf, other

    eess.IV cs.CV

    Preserving Cardiac Integrity: A Topology-Infused Approach to Whole Heart Segmentation

    Authors: Chenyu Zhang, Wenxue Guan, Xiaodan Xing, Guang Yang

    Abstract: Whole heart segmentation (WHS) supports cardiovascular disease (CVD) diagnosis, disease monitoring, treatment planning, and prognosis. Deep learning has become the most widely used method for WHS applications in recent years. However, segmentation of whole-heart structures faces numerous challenges including heart shape variability during the cardiac cycle, clinical artifacts like motion and poor… ▽ More

    Submitted 17 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

  38. arXiv:2410.10298  [pdf, other

    cs.CV

    ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object

    Authors: Jiwei Chen, Laiyan Ding, Chi Zhang, Feifei Li, Rui Huang

    Abstract: Vision-based BEV (Bird-Eye-View) 3D object detection has recently become popular in autonomous driving. However, objects with a high similarity to the background from a camera perspective cannot be detected well by existing methods. In this paper, we propose 2D Region-oriented Attention for a BEV-based 3D Object Detection Network (ROA-BEV), which can make the backbone focus more on feature learnin… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  39. arXiv:2410.09828  [pdf, other

    gr-qc

    The semiclassical propagator for coherent state on twisted geometry

    Authors: Gaoping Long, Hongguang Liu, Cong Zhang

    Abstract: A new set of twisted geometric variables is introduced to parametrize the holonomy-flux phase space in loop quantum gravity. It is verified that these new geometric variables, after symplectic reduction with respect to the Gauss constraint, form a Poisson algebra which is analogue to that in quantum mechanics. This property ensures that these new geometric variables provide a simple path measure,… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

  40. arXiv:2410.09591  [pdf, other

    cs.CR

    Unlearn and Burn: Adversarial Machine Unlearning Requests Destroy Model Accuracy

    Authors: Yangsibo Huang, Daogao Liu, Lynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Milad Nasr, Amer Sinha, Chiyuan Zhang

    Abstract: Machine unlearning algorithms, designed for selective removal of training data from models, have emerged as a promising approach to growing privacy concerns. In this work, we expose a critical yet underexplored vulnerability in the deployment of unlearning systems: the assumption that the data requested for removal is always part of the original training set. We present a threat model where an att… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

  41. arXiv:2410.09151  [pdf, other

    astro-ph.HE

    A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné , et al. (1758 additional authors not shown)

    Abstract: The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 15 pages of text including references, 4 figures, 5 tables

    Report number: LIGO-P2400192

  42. arXiv:2410.08871  [pdf, other

    cs.CE

    Adaptive optimization of wave energy conversion in oscillatory wave surge converters via SPH simulation and deep reinforcement learning

    Authors: Mai Ye, Chi Zhang, Yaru Ren, Ziyuan Liu, Oskar J. Haidn, Xiangyu Hu

    Abstract: The nonlinear damping characteristics of the oscillating wave surge converter (OWSC) significantly impact the performance of the power take-off system. This study presents a framework by integrating deep reinforcement learning (DRL) with numerical simulations of OWSC to identify optimal adaptive damping policy under varying wave conditions, thereby enhancing wave energy harvesting efficiency. Firs… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 67 pages and 25 figures

  43. arXiv:2410.08603  [pdf, other

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773\,GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  44. arXiv:2410.08582  [pdf, ps, other

    cs.CV

    DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention

    Authors: Nguyen Huu Bao Long, Chenyu Zhang, Yuzhi Shi, Tsubasa Hirakawa, Takayoshi Yamashita, Tohgoroh Matsui, Hironobu Fujiyoshi

    Abstract: Vision Transformers with various attention modules have demonstrated superior performance on vision tasks. While using sparsity-adaptive attention, such as in DAT, has yielded strong results in image classification, the key-value pairs selected by deformable points lack semantic relevance when fine-tuning for semantic segmentation tasks. The query-aware sparsity attention in BiFormer seeks to focu… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 20 pages, 7 figures. arXiv admin note: text overlap with arXiv:2303.08810 by other authors

    Journal ref: ACCV 2024

  45. arXiv:2410.08478  [pdf, other

    cs.IR cs.AI cs.LG

    Personalized Item Representations in Federated Multimodal Recommendation

    Authors: Zhiwei Li, Guodong Long, Jing Jiang, Chengqi Zhang

    Abstract: Federated recommendation systems are essential for providing personalized recommendations while protecting user privacy. However, current methods mainly rely on ID-based item embeddings, neglecting the rich multimodal information of items. To address this, we propose a Federated Multimodal Recommendation System, called FedMR. FedMR uses a foundation model on the server to encode multimodal item da… ▽ More

    Submitted 14 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: 12 pages, 4 figures, 5 tables, conference

  46. arXiv:2410.08102  [pdf, other

    cs.CL

    Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining

    Authors: Tianyi Bai, Ling Yang, Zhen Hao Wong, Jiahui Peng, Xinlin Zhuang, Chi Zhang, Lijun Wu, Jiantao Qiu, Wentao Zhang, Binhang Yuan, Conghui He

    Abstract: Efficient data selection is crucial to accelerate the pretraining of large language models (LLMs). While various methods have been proposed to enhance data efficiency, limited research has addressed the inherent conflicts between these approaches to achieve optimal data selection for LLM pretraining. To tackle this problem, we propose a novel multi-agent collaborative data selection mechanism. In… ▽ More

    Submitted 14 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

  47. arXiv:2410.07983  [pdf, other

    quant-ph

    Characterizing Quantum Codes via the Coefficients in Knill-Laflamme Conditions

    Authors: Mengxin Du, Chao Zhang, Yiu-Tung Poon, Bei Zeng

    Abstract: Quantum error correction (QEC) is essential for protecting quantum information against noise, yet understanding the structure of the Knill-Laflamme (KL) coefficients $λ_{ij}$ from the condition $PE_i^\dagger E_j P = λ_{ij} P$ remains challenging, particularly for nonadditive codes. In this work, we introduce the signature vector $\vecλ(P)$, composed of the off-diagonal KL coefficients $λ_{ij}$, wh… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 18 pages, 2 figures

  48. arXiv:2410.07626  [pdf, other

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  49. arXiv:2410.07538  [pdf, other

    cs.LG

    Rank Aggregation in Crowdsourcing for Listwise Annotations

    Authors: Wenshui Luo, Haoyu Liu, Yongliang Ding, Tao Zhou, Sheng wan, Runze Wu, Minmin Lin, Cong Zhang, Changjie Fan, Chen Gong

    Abstract: Rank aggregation through crowdsourcing has recently gained significant attention, particularly in the context of listwise ranking annotations. However, existing methods primarily focus on a single problem and partial ranks, while the aggregation of listwise full ranks across numerous problems remains largely unexplored. This scenario finds relevance in various applications, such as model quality a… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 19 pages

  50. arXiv:2410.07484  [pdf, other

    cs.AI

    WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents

    Authors: Siyu Zhou, Tianyi Zhou, Yijun Yang, Guodong Long, Deheng Ye, Jing Jiang, Chengqi Zhang

    Abstract: Can large language models (LLMs) directly serve as powerful world models for model-based agents? While the gaps between the prior knowledge of LLMs and the specified environment's dynamics do exist, our study reveals that the gaps can be bridged by aligning an LLM with its deployed environment and such "world alignment" can be efficiently achieved by rule learning on LLMs. Given the rich prior kno… ▽ More

    Submitted 11 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: 35 pages, including references and appendix. Code is available at https://github.com/elated-sawyer/WALL-E