Showing 1–50 of 2,620 results for author: Zhu, H

  1. arXiv:2410.15312  [pdf, other]

    cs.CV cs.AI

    Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image

    Authors: Yu Zhao, Hao Fei, Xiangtai Li, Libo Qin, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang, Jianguo Wei

    Abstract: In the visual spatial understanding (VSU) area, spatial image-to-text (SI2T) and spatial text-to-image (ST2I) are two fundamental tasks that appear in dual form. Existing methods for standalone SI2T or ST2I perform imperfectly in spatial understanding, due to the difficulty of 3D-wise spatial feature modeling. In this work, we consider modeling the SI2T and ST2I together under a dual learning fram…

    Submitted 20 October, 2024; originally announced October 2024.

  2. arXiv:2410.14538  [pdf, other]

    quant-ph

    Nearly query-optimal classical shadow estimation of unitary channels

    Authors: Zihao Li, Changhao Yi, You Zhou, Huangjun Zhu

    Abstract: Classical shadow estimation (CSE) is a powerful tool for learning properties of quantum states and quantum processes. Here we consider the CSE task for quantum unitary channels. By querying an unknown unitary channel $\mathcal{U}$ multiple times in quantum experiments, the goal is to learn a classical description of $\mathcal{U}$ such that one can later use it to accurately predict many different…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 13+23 pages, 3 figures, and 1+5 tables; comments and suggestions are welcome!

  3. arXiv:2410.13688  [pdf, other]

    quant-ph

    Variational Quantum Framework for Nonlinear PDE Constrained Optimization Using Carleman Linearization

    Authors: Abeynaya Gnanasekaran, Amit Surana, Hongyu Zhu

    Abstract: We present a novel variational quantum framework for nonlinear partial differential equation (PDE) constrained optimization problems. The proposed work extends the recently introduced bi-level variational quantum PDE constrained optimization (BVQPCO) framework for linear PDE to a nonlinear setting by leveraging Carleman linearization (CL). CL framework allows one to transform a system of polynomia…

    Submitted 17 October, 2024; originally announced October 2024.

  4. arXiv:2410.13575  [pdf, other]

    quant-ph math-ph

    Third moments of qudit Clifford orbits and 3-designs based on magic orbits

    Authors: Huangjun Zhu, Chengsi Mao, Changhao Yi

    Abstract: When the local dimension $d$ is an odd prime, the qudit Clifford group is only a 2-design, but not a 3-design, unlike the qubit counterpart. This distinction and its extension to Clifford orbits have profound implications for many applications in quantum information processing. In this work we systematically delve into general qudit Clifford orbits with a focus on the third moments and potential a…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 56+56 pages, 12 figures, and 6+1 tables; comments and suggestions are very welcome! See also the companion paper "The Magic in Qudit Shadow Estimation based on the Clifford Group"

  5. arXiv:2410.13572  [pdf, other]

    quant-ph

    The Magic in Qudit Shadow Estimation based on the Clifford Group

    Authors: Chengsi Mao, Changhao Yi, Huangjun Zhu

    Abstract: Shadow estimation is a sample-efficient protocol for learning the properties of a quantum system through randomized measurements, but the current understanding on qudit shadow estimation is quite limited compared with the qubit setting. Here we clarify the sample complexity of qudit shadow estimation based on the Clifford group, where the local dimension $d$ is an odd prime. Notably, we show that…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 8+20 pages, 5+6 figures, and 2 tables; comments and suggestions are very welcome! See also the companion paper "Third moments of qudit Clifford orbits and 3-designs based on magic orbits"

  6. arXiv:2410.13515  [pdf, other]

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  7. arXiv:2410.13478  [pdf, other]

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be…

    Submitted 17 October, 2024; originally announced October 2024.

  8. arXiv:2410.13368  [pdf, other]

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Utilizing 4.5 fb$^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  9. arXiv:2410.13367  [pdf, other]

    astro-ph.HE astro-ph.SR

    Wavelet analysis of low-frequency quasi-periodic oscillations in MAXI J1803$-$298 observed with Insight-HXMT and NICER

    Authors: Y. J. Jin, X. Chen, H. F. Zhu, Z. J. Jiang, L. Zhang, W. Wang

    Abstract: With data observed by the Hard X-ray Modulation Telescope (Insight-HXMT) and the Neutron star Interior Composition Explorer (NICER), we study low-frequency quasi-periodic oscillations (LFQPOs) of the black hole candidate MAXI J1803$-$298 during the 2021 outburst. Based on hardness intensity diagram and difference of the QPOs properties, Type-C and Type-B QPOs are found in the lo…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 11 pages, 11 figures, 2 tables, MNRAS in press

  10. arXiv:2410.13187  [pdf, other]

    cs.CL cs.AI cs.SE

    aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Completion

    Authors: Siyuan Jiang, Jia Li, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Ge Li

    Abstract: Large Language Models (LLMs) have been widely used in code completion, and researchers are focusing on scaling up LLMs to improve their accuracy. However, larger LLMs will increase the response time of code completion and decrease the developers' productivity. In this paper, we propose a lightweight and effective LLM for code completion named aiXcoder-7B. Compared to existing LLMs, aiXcoder-7B ach…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: aiXcoder-7B is available at https://github.com/aixcoder-plugin/aiXcoder-7B/tree/main

  11. arXiv:2410.12620  [pdf, other]

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  12. arXiv:2410.11607  [pdf, other]

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  13. arXiv:2410.11076  [pdf, other]

    cs.CL cs.AI

    PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable Queries

    Authors: Mingwen Dong, Nischal Ashok Kumar, Yiqun Hu, Anuj Chauhan, Chung-Wei Hang, Shuaichen Chang, Lin Pan, Wuwei Lan, Henghui Zhu, Jiarong Jiang, Patrick Ng, Zhiguo Wang

    Abstract: Previous text-to-SQL datasets and systems have primarily focused on user questions with clear intentions that can be answered. However, real user questions can often be ambiguous with multiple interpretations or unanswerable due to a lack of relevant data. In this work, we construct a practical conversational text-to-SQL dataset called PRACTIQ, consisting of ambiguous and unanswerable questions in…

    Submitted 14 October, 2024; originally announced October 2024.

  14. arXiv:2410.10873  [pdf, other]

    cs.CL cs.AI cs.CY

    AuditWen:An Open-Source Large Language Model for Audit

    Authors: Jiajia Huang, Haoran Zhu, Chao Xu, Tianming Zhan, Qianqian Xie, Jimin Huang

    Abstract: Intelligent auditing represents a crucial advancement in modern audit practices, enhancing both the quality and efficiency of audits within the realm of artificial intelligence. With the rise of large language model (LLM), there is enormous potential for intelligent models to contribute to audit domain. However, general LLMs applied in audit domain face the challenges of lacking specialized knowle…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 18 pages,1 figures

  15. arXiv:2410.10093  [pdf, other]

    cs.CL cs.LG

    How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective

    Authors: Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar

    Abstract: This paper introduces a novel generalized self-imitation learning (GSIL) framework, which effectively and efficiently aligns large language models with offline demonstration data. We develop GSIL by deriving a surrogate objective of imitation learning with density ratio estimates, facilitating the use of self-generated data and optimizing the imitation learning objective with…

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: EMNLP 2024 Main

  16. arXiv:2410.09958  [pdf, ps, other]

    cs.HC

    Beyond the "Industry Standard": Focusing Gender-Affirming Voice Training Technologies on Individualized Goal Exploration

    Authors: Kassie Povinelli, Hanxiu "Hazel" Zhu, Yuhang Zhao

    Abstract: Gender-affirming voice training is critical for the transition process for many transgender individuals, enabling their voice to align with their gender identity. Individualized voice goals guide and motivate the voice training journey, but existing voice training technologies fail to define clear goals. We interviewed six voice experts and ten transgender individuals with voice training experienc…

    Submitted 17 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

    Comments: 17 pages, 0 figures, 2 tables (main text), 2 tables (appendix)

    MSC Class: 68U35

    ACM Class: J.4; J.3; H.5.2

  17. arXiv:2410.09444  [pdf]

    eess.IV cs.CV

    Diabetic retinopathy image classification method based on GreenBen data augmentation

    Authors: Yutong Liu, Jie Gao, Haijiang Zhu

    Abstract: For the diagnosis of diabetes retinopathy (DR) images, this paper proposes a classification method based on artificial intelligence. The core lies in a new data augmentation method, GreenBen, which first extracts the green channel grayscale image from the retinal image and then performs Ben enhancement. Considering that diabetes macular edema (DME) is a complication closely related to DR, this pap…

    Submitted 12 October, 2024; originally announced October 2024.

  18. arXiv:2410.09103  [pdf, other]

    cs.LG cs.AI

    Parameter-Efficient Fine-Tuning via Selective Discrete Cosine Transform

    Authors: Yixian Shen, Qi Bi, Jia-Hong Huang, Hongyi Zhu, Anuj Pathania

    Abstract: In the era of large language models, parameter-efficient fine-tuning (PEFT) has been extensively studied. However, these approaches usually rely on the space domain, which encounters storage challenges especially when handling extensive adaptations or larger models. The frequency domain, in contrast, is more effective in compressing trainable parameters while maintaining the expressive capability.…

    Submitted 9 October, 2024; originally announced October 2024.

  19. arXiv:2410.08603  [pdf, other]

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773 GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and…

    Submitted 11 October, 2024; originally announced October 2024.

  20. arXiv:2410.08208  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

    Authors: Haoyi Zhu, Honghui Yang, Yating Wang, Jiange Yang, Limin Wang, Tong He

    Abstract: In this paper, we introduce SPA, a novel representation learning framework that emphasizes the importance of 3D spatial awareness in embodied AI. Our approach leverages differentiable neural rendering on multi-view images to endow a vanilla Vision Transformer (ViT) with intrinsic spatial understanding. We present the most comprehensive evaluation of embodied representation learning to date, coveri…

    Submitted 11 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: Project Page: https://haoyizhu.github.io/spa/

  21. arXiv:2410.07725  [pdf]

    cs.LG cs.NE

    Towards Trustworthy Web Attack Detection: An Uncertainty-Aware Ensemble Deep Kernel Learning Model

    Authors: Yonghang Zhou, Hongyi Zhu, Yidong Chai, Yuanchun Jiang, Yezheng Liu

    Abstract: Web attacks are one of the major and most persistent forms of cyber threats, which bring huge costs and losses to web application-based businesses. Various detection methods, such as signature-based, machine learning-based, and deep learning-based, have been proposed to identify web attacks. However, these methods either (1) heavily rely on accurate and complete rule design and feature engineering…

    Submitted 10 October, 2024; originally announced October 2024.

  22. arXiv:2410.07718  [pdf, other]

    cs.CV

    Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation

    Authors: Jiahao Cui, Hui Li, Yao Yao, Hao Zhu, Hanlin Shang, Kaihui Cheng, Hang Zhou, Siyu Zhu, Jingdong Wang

    Abstract: Recent advances in latent diffusion-based generative models for portrait image animation, such as Hallo, have achieved impressive results in short-duration video synthesis. In this paper, we present updates to Hallo, introducing several design enhancements to extend its capabilities. First, we extend the method to produce long-duration videos. To address substantial challenges such as appearance d…

    Submitted 14 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

  23. arXiv:2410.07671  [pdf, other]

    cs.IR cs.AI

    DISCO: A Hierarchical Disentangled Cognitive Diagnosis Framework for Interpretable Job Recommendation

    Authors: Xiaoshan Yu, Chuan Qin, Qi Zhang, Chen Zhu, Haiping Ma, Xingyi Zhang, Hengshu Zhu

    Abstract: The rapid development of online recruitment platforms has created unprecedented opportunities for job seekers while concurrently posing the significant challenge of quickly and accurately pinpointing positions that align with their skills and preferences. Job recommendation systems have significantly alleviated the extensive search burden for job seekers by optimizing user engagement metrics, such…

    Submitted 15 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted by ICDM 2024. 10 pages

  24. arXiv:2410.07626  [pdf, other]

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  25. arXiv:2410.07273  [pdf, other]

    cs.CV cs.LG

    BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models

    Authors: Fangyikang Wang, Hubery Yin, Yuejiang Dong, Huminhao Zhu, Chao Zhang, Hanbin Zhao, Hui Qian, Chen Li

    Abstract: The inversion of diffusion model sampling, which aims to find the corresponding initial noise of a sample, plays a critical role in various tasks. Recently, several heuristic exact inversion samplers have been proposed to address the inexact inversion issue in a training-free manner. However, the theoretical properties of these heuristic samplers remain unknown and they often exhibit mediocre samp…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: accepted paper by NeurIPS

  26. arXiv:2410.06500  [pdf, other]

    hep-ex

    Search for the radiative decays $D^+\toγρ^+$ and $D^+\toγK^{*+}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (648 additional authors not shown)

    Abstract: We search for the radiative decays $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ using 20.3 fb$^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and the upper limits on the branching fractions of $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ at 90% confidence level ar…

    Submitted 8 October, 2024; originally announced October 2024.

  27. arXiv:2410.05736  [pdf, ps, other]

    hep-ex

    Observation of an axial-vector state in the study of $ψ(3686) \to φηη'$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (625 additional authors not shown)

    Abstract: Using (2712.4 $\pm$ 14.3)$\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, a partial wave analysis of the decay $ψ(3686) \to φηη' $ is performed with the covariant tensor approach. An axial-vector state with a mass near 2.3 $\rm GeV/c^2$ is observed for the first time. Its mass and width are measured to be 2316…

    Submitted 8 October, 2024; originally announced October 2024.

  28. arXiv:2410.05101  [pdf, other]

    eess.AS cs.LG cs.SD

    CR-CTC: Consistency regularization on CTC for improved speech recognition

    Authors: Zengwei Yao, Wei Kang, Xiaoyu Yang, Fangjun Kuang, Liyong Guo, Han Zhu, Zengrui Jin, Zhaoqing Li, Long Lin, Daniel Povey

    Abstract: Connectionist Temporal Classification (CTC) is a widely used method for automatic speech recognition (ASR), renowned for its simplicity and computational efficiency. However, it often falls short in recognition performance compared to transducer or systems combining CTC and attention-based encoder-decoder (CTC/AED). In this work, we propose the Consistency-Regularized CTC (CR-CTC), which enforces…

    Submitted 13 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

  29. arXiv:2410.04478  [pdf, other]

    cs.SD cs.CL eess.AS

    Configurable Multilingual ASR with Speech Summary Representations

    Authors: Harrison Zhu, Ivan Fung, Yingke Zhu, Lahiru Samarakoon

    Abstract: Approximately half of the world's population is multilingual, making multilingual ASR (MASR) essential. Deploying multiple monolingual models is challenging when the ground-truth language is unknown in advance. This motivates research efforts on configurable multilingual MASR models that can be prompted manually or adapted automatically to recognise specific languages. In this paper, we present th…

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: A preprint

  30. arXiv:2410.04425  [pdf, other]

    astro-ph.HE

    LHAASO detection of very-high-energy gamma-ray emission surrounding PSR J0248+6021

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: We report the detection of an extended very-high-energy (VHE) gamma-ray source coincident with the locations of middle-aged (62.4 kyr) pulsar PSR J0248+6021, by using the LHAASO-WCDA data of live 796 days and LHAASO-KM2A data of live 1216 days. A significant excess of gamma-ray induced showers is observed both by WCDA in energy bands of 1-25 TeV and KM2A in energy bands of $>$ 25 TeV with…

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 12 pages, 10 figures, Accepted by Sci. China-Phys. Mech. Astron

  31. arXiv:2410.03351  [pdf, other]

    cs.CL cs.PL cs.SE

    Generating Equivalent Representations of Code By A Self-Reflection Approach

    Authors: Jia Li, Ge Li, Lecheng Wang, Hao Zhu, Zhi Jin

    Abstract: Equivalent Representations (ERs) of code are textual representations that preserve the same semantics as the code itself, e.g., natural language comments and pseudocode. ERs play a critical role in software development and maintenance. However, how to automatically generate ERs of code remains an open challenge. In this paper, we propose a self-reflection approach to generating ERs of code. It ena…

    Submitted 4 October, 2024; originally announced October 2024.

  32. arXiv:2410.02639  [pdf, other]

    cs.LG

    Labor Migration Modeling through Large-scale Job Query Data

    Authors: Zhuoning Guo, Le Zhang, Hengshu Zhu, Weijia Zhang, Hui Xiong, Hao Liu

    Abstract: Accurate and timely modeling of labor migration is crucial for various urban governance and commercial tasks, such as local policy-making and business site selection. However, existing studies on labor migration largely rely on limited survey data with statistical methods, which fail to deliver timely and fine-grained insights for time-varying regional trends. To this end, we propose a deep learni…

    Submitted 3 October, 2024; originally announced October 2024.

  33. arXiv:2410.02421  [pdf, other]

    hep-ex

    Search for lepton number violating decays of $D_s^+\to h^-h^0e^+e^+$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (650 additional authors not shown)

    Abstract: Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector operating at the BEPCII collider at center-of-mass energies from 4.128 to 4.226 GeV, a search for the Majorana neutrino $ν_m$ is conducted in the lepton-number-violating decays of $D_s^+\to h^-h^0e^+e^+$. Here, $h^-$ represents a $K^-$ or $π^-$, and $h^0$ represents a $π^0$, $K_S^0$ or $φ$. No significant signal is…

    Submitted 3 October, 2024; originally announced October 2024.

  34. arXiv:2410.02378  [pdf, other]

    cs.CL cs.AI

    Towards Comprehensive Detection of Chinese Harmful Memes

    Authors: Junyu Lu, Bo Xu, Xiaokun Zhang, Hongbo Wang, Haohao Zhu, Dongyu Zhang, Liang Yang, Hongfei Lin

    Abstract: This paper has been accepted in the NeurIPS 2024 D & B Track. Harmful memes have proliferated on the Chinese Internet, while research on detecting Chinese harmful memes significantly lags behind due to the absence of reliable datasets and effective detectors. To this end, we focus on the comprehensive detection of Chinese harmful memes. We construct ToxiCN MM, the first Chinese harmful meme datase…

    Submitted 3 October, 2024; originally announced October 2024.

  35. arXiv:2410.02133  [pdf, other]

    cs.LG

    TrajGPT: Irregular Time-Series Representation Learning for Health Trajectory Analysis

    Authors: Ziyang Song, Qingcheng Lu, He Zhu, David Buckeridge, Yue Li

    Abstract: In many domains, such as healthcare, time-series data is often irregularly sampled with varying intervals between observations. This poses challenges for classical time-series models that require equally spaced data. To address this, we propose a novel time-series Transformer called Trajectory Generative Pre-trained Transformer (TrajGPT). TrajGPT employs a novel Selective Recurrent Attention (SRA)…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: 9 pages

  36. arXiv:2410.01946  [pdf, other]

    cs.CL

    SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics

    Authors: Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludäscher, Jana Diesner

    Abstract: Prompt-based fine-tuning has become an essential method for eliciting information encoded in pre-trained language models for a variety of tasks, including text classification. For multi-class classification tasks, prompt-based fine-tuning under low-resource scenarios has resulted in performance levels comparable to those of fully fine-tuning methods. Previous studies have used crafted prompt templ…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: EMNLP 2024 Main

  37. arXiv:2410.01672  [pdf, other]

    cs.HC

    Practicing Stress Relief for the Everyday: Designing Social Simulation Using VR, AR, and LLMs

    Authors: Anna Fang, Hriday Chhabria, Alekhya Maram, Haiyi Zhu

    Abstract: Stress is an inevitable part of day-to-day life yet many find themselves unable to manage it themselves, particularly when professional or peer support are not always readily available. As self-care becomes increasingly vital for mental well-being, this paper explores the potential of social simulation as a safe, virtual environment for practicing stress relief for everyday situations. Leveraging…

    Submitted 3 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

  38. Facial Action Unit Detection by Adaptively Constraining Self-Attention and Causally Deconfounding Sample

    Authors: Zhiwen Shao, Hancheng Zhu, Yong Zhou, Xiang Xiang, Bing Liu, Rui Yao, Lizhuang Ma

    Abstract: Facial action unit (AU) detection remains a challenging task, due to the subtlety, dynamics, and diversity of AUs. Recently, the prevailing techniques of self-attention and causal inference have been introduced to AU detection. However, most existing methods directly learn self-attention guided by AU detection, or employ common patterns for all AUs during causal intervention. The former often capt…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: This paper is accepted by International Journal of Computer Vision

  39. arXiv:2410.01226  [pdf, other]

    cs.CV

    Towards Native Generative Model for 3D Head Avatar

    Authors: Yiyu Zhuang, Yuxiao He, Jiawei Zhang, Yanwen Wang, Jiahe Zhu, Yao Yao, Siyu Zhu, Xun Cao, Hao Zhu

    Abstract: Creating 3D head avatars is a significant yet challenging task for many applicated scenarios. Previous studies have set out to learn 3D human head generative models using massive 2D image data. Although these models are highly generalizable for human appearance, their result models are not 360$^\circ$-renderable, and the predicted 3D geometry is unreliable. Therefore, such results cannot be used i…

    Submitted 2 October, 2024; originally announced October 2024.

  40. arXiv:2410.00372  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci quant-ph

    Direct writing of high temperature superconducting Josephson junctions using a thermal scanning probe

    Authors: Ngoc My Hanh Duong, Amanuel M. Berhane, Dave Mitchell, Rifat Ullah, Ting Zhang, He Zhu, Jia Du, Simon K. H. Lam, Emma E. Mitchell, Avi Bendavid

    Abstract: In this letter, we demonstrate for the first time the creation of Josephson-like superconducting nanojunctions using a thermal scanning probe to directly inscribe weak links into microstrips of YBa2Cu3O7-x (YBCO). Our method effectively reduces the critical current (Ic) over an order of magnitude. The resulting nanobridges exhibit clear evidence of Josephson effects, of SNS-type junctions, as show…

    Submitted 30 September, 2024; originally announced October 2024.

    Comments: 14 pages, 4 figures

  41. arXiv:2409.19672  [pdf, other]

    cs.CL cs.MM

    Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding

    Authors: Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui

    Abstract: Modeling and leveraging layout reading order in visually-rich documents (VrDs) is critical in document intelligence as it captures the rich structure semantics within documents. Previous works typically formulated layout reading order as a permutation of layout elements, i.e. a sequence containing all the layout elements. However, we argue that this formulation does not adequately convey the compl…

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: Accepted as a long paper in the main conference of EMNLP 2024

  42. arXiv:2409.19648  [pdf, other]

    cs.CV

    OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images

    Authors: Jiaqi Zhao, Zeyu Ding, Yong Zhou, Hancheng Zhu, Wen-Liang Du, Rui Yao, Abdulmotaleb El Saddik

    Abstract: Oriented object detection in remote sensing images is a challenging task due to objects being distributed in multi-orientation. Recently, end-to-end transformer-based methods have achieved success by eliminating the need for post-processing operators compared to traditional CNN-based methods. However, directly extending transformers to oriented object detection presents three main issues: 1) objec…

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: The paper is accepted by IEEE Transactions on Geoscience and Remote Sensing (TGRS)

  43. arXiv:2409.19545  [pdf, other]

    cs.LG

    Convergence-aware Clustered Federated Graph Learning Framework for Collaborative Inter-company Labor Market Forecasting

    Authors: Zhuoning Guo, Hao Liu, Le Zhang, Qi Zhang, Hengshu Zhu, Hui Xiong

    Abstract: Labor market forecasting on talent demand and supply is essential for business management and economic development. With accurate and timely forecasts, employers can adapt their recruitment strategies to align with the evolving labor market, and employees can have proactive career path planning according to future demand and supply. However, previous studies ignore the interconnection between dema…

    Submitted 29 September, 2024; originally announced September 2024.

  44. arXiv:2409.19301  [pdf, other]

    cs.CR cs.AI

    Privacy Attack in Federated Learning is Not Easy: An Experimental Study

    Authors: Hangyu Zhu, Liyuan Huang, Zhenping Xie

    Abstract: Federated learning (FL) is an emerging distributed machine learning paradigm proposed for privacy preservation. Unlike traditional centralized learning approaches, FL enables multiple users to collaboratively train a shared global model without disclosing their own data, thereby significantly reducing the potential risk of privacy leakage. However, recent studies have indicated that FL cannot enti…

    Submitted 28 September, 2024; originally announced September 2024.

  45. arXiv:2409.18785  [pdf, other]

    cs.CV

    Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation

    Authors: Chaomin Shen, Yaomin Huang, Haokun Zhu, Jinsong Fan, Guixu Zhang

    Abstract: Knowledge distillation has become widely recognized for its ability to transfer knowledge from a large teacher network to a compact and more streamlined student network. Traditional knowledge distillation methods primarily follow a teacher-oriented paradigm that imposes the task of learning the teacher's complex knowledge onto the student network. However, significant disparities in model capacity…

    Submitted 27 September, 2024; originally announced September 2024.

  46. arXiv:2409.16427  [pdf, other]

    cs.AI

    HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions

    Authors: Xuhui Zhou, Hyunwoo Kim, Faeze Brahman, Liwei Jiang, Hao Zhu, Ximing Lu, Frank Xu, Bill Yuchen Lin, Yejin Choi, Niloofar Mireshghallah, Ronan Le Bras, Maarten Sap

    Abstract: AI agents are increasingly autonomous in their interactions with human users and tools, leading to increased interactional safety risks. We present HAICOSYSTEM, a framework examining AI agent safety within diverse and complex social interactions. HAICOSYSTEM features a modular sandbox environment that simulates multi-turn interactions between human users and AI agents, where the AI agents are equi…

    Submitted 26 September, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: Both the second and third authors contributed equally

  47. arXiv:2409.15644  [pdf, other]

    cs.HC

    PolicyCraft: Supporting Collaborative and Participatory Policy Design through Case-Grounded Deliberation

    Authors: Tzu-Sheng Kuo, Quan Ze Chen, Amy X. Zhang, Jane Hsieh, Haiyi Zhu, Kenneth Holstein

    Abstract: Community and organizational policies are typically designed in a top-down, centralized fashion, with limited input from impacted stakeholders. This can result in policies that are misaligned with community needs or perceived as illegitimate. How can we support more collaborative, participatory approaches to policy design? In this paper, we present PolicyCraft, a system that structures collaborati…

    Submitted 23 September, 2024; originally announced September 2024.

  48. arXiv:2409.15044  [pdf, ps, other]

    hep-ex

    Search for $D^0\to K^-\eta e^+\nu_e$, $D^+\to K_S^0 \eta e^+\nu_e$ and $D^+\to \eta\eta e^+\nu_e$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 7.93 fb$^{-1}$, collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, we search for the semileptonic decays $D^0\to K^-\eta e^+\nu_e$, $D^+\to K_S^0 \eta e^+\nu_e$ and $D^+\to \eta\eta e^+\nu_e$ for the first time. We present evidence for $D^0\to K^-\eta e^+\nu_e$ with a significance of $3.3\sigma$. The branching fraction…

    Submitted 24 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: 10 pages, 4 figures

  49. arXiv:2409.14550  [pdf, ps, other]

    cs.NI

    Interpretable Nonroutine Network Traffic Prediction with a Case Study

    Authors: Liangzhi Wang, Haoyuan Zhu, Jiliang Zhang, Zitian Zhang, Jie Zhang

    Abstract: This paper pioneers a nonroutine network traffic prediction (NNTP) method to prospectively provide a theoretical basis for avoiding large-scale network disruption by accurately predicting bursty traffic. Certain events that impact user behavior subsequently trigger nonroutine traffic, which significantly constrains the performance of network traffic prediction (NTP) models. By analyzing nonroutine…

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  50. arXiv:2409.14454  [pdf, other]

    eess.SY cs.LG

    A Unified Approach for Learning the Dynamics of Power System Generators and Inverter-based Resources

    Authors: Shaohui Liu, Weiqian Cai, Hao Zhu, Brian Johnson

    Abstract: The growing prevalence of inverter-based resources (IBRs) for renewable energy integration and electrification greatly challenges power system dynamic analysis. To account for both synchronous generators (SGs) and IBRs, this work presents an approach for learning the model of an individual dynamic component. The recurrent neural network (RNN) model is used to match the recursive structure in predi…

    Submitted 22 September, 2024; originally announced September 2024.