Skip to main content

Showing 1–50 of 2,544 results for author: Guo, J

  1. arXiv:2410.16033  [pdf, other

    cs.CL cs.AI cs.LG

    TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

    Authors: Jiahao Qiu, Yifu Lu, Yifan Zeng, Jiacheng Guo, Jiayi Geng, Huazheng Wang, Kaixuan Huang, Yue Wu, Mengdi Wang

    Abstract: Inference-time alignment enhances the performance of large language models without requiring additional training or fine-tuning but presents challenges due to balancing computational efficiency with high-quality output. Best-of-N (BoN) sampling, as a simple yet powerful approach, generates multiple responses and selects the best one, achieving improved performance but with a high computational cos… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  2. arXiv:2410.16020  [pdf, other

    cs.CV

    START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation

    Authors: Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao

    Abstract: Domain Generalization (DG) aims to enable models to generalize to unseen target domains by learning from multiple source domains. Existing DG methods primarily rely on convolutional neural networks (CNNs), which inherently learn texture biases due to their limited receptive fields, making them prone to overfitting source domains. While some works have introduced transformer-based methods (ViTs) fo… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS2024. The code is available at https://github.com/lingeringlight/START

  3. arXiv:2410.15847  [pdf, other

    cs.CV cs.AI cs.LG

    Random Token Fusion for Multi-View Medical Diagnosis

    Authors: Jingyu Guo, Christos Matsoukas, Fredrik Strand, Kevin Smith

    Abstract: In multi-view medical diagnosis, deep learning-based models often fuse information from different imaging perspectives to improve diagnostic performance. However, existing approaches are prone to overfitting and rely heavily on view-specific features, which can lead to trivial solutions. In this work, we introduce Random Token Fusion (RTF), a novel technique designed to enhance multi-view medical… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Originally published at the NeurIPS 2024 Workshop on Advancements In Medical Foundation Models: Explainability, Robustness, Security, and Beyond (AIM-FM)

  4. arXiv:2410.14196  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Quantum-Confined Tunable Ferromagnetism on the Surface of a van der Waals Antiferromagnet NaCrTe2

    Authors: Yidian Li, Xian Du, Junjie Wang, Runzhe Xu, Wenxuan Zhao, Kaiyi Zhai, Jieyi Liu, Houke Chen, Yiheng Yang, Nicolas C. Plumb, Sailong Ju, Ming Shi, Zhongkai Liu, Jiangang Guo, Xiaolong Chen, Yulin Chen, Lexian Yang

    Abstract: The surface of three-dimensional materials provides an ideal and versatile platform to explore quantum-confined physics. Here, we systematically investigate the electronic structure of Na-intercalated CrTe2, a van der Waals antiferromagnet, using angle-resolved photoemission spectroscopy and ab-initio calculations. The measured band structure deviates from the calculation of bulk NaCrTe2 but agree… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Journal ref: Nano Lett. 24, 9832-9838 (2024)

  5. arXiv:2410.13808  [pdf, other

    cs.CL

    De-mark: Watermark Removal in Large Language Models

    Authors: Ruibo Chen, Yihan Wu, Junfeng Guo, Heng Huang

    Abstract: Watermarking techniques offer a promising way to identify machine-generated content via embedding covert information into the contents generated from language models (LMs). However, the robustness of the watermarking schemes has not been well explored. In this paper, we present De-mark, an advanced framework designed to remove n-gram-based watermarks effectively. Our method utilizes a novel queryi… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  6. arXiv:2410.13805  [pdf, other

    cs.CL

    A Watermark for Order-Agnostic Language Models

    Authors: Ruibo Chen, Yihan Wu, Yanshuo Chen, Chenxi Liu, Junfeng Guo, Heng Huang

    Abstract: Statistical watermarking techniques are well-established for sequentially decoded language models (LMs). However, these techniques cannot be directly applied to order-agnostic LMs, as the tokens in order-agnostic LMs are not generated sequentially. In this work, we introduce Pattern-mark, a pattern-based watermarking framework specifically designed for order-agnostic LMs. We develop a Markov-chain… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  7. arXiv:2410.13515  [pdf, other

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  8. arXiv:2410.13478  [pdf, other

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  9. arXiv:2410.13368  [pdf, other

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Utilizing 4.5${~\rm{fb}}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  10. arXiv:2410.13349  [pdf, other

    cs.CV

    GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting

    Authors: Shuichang Lai, Letian Huang, Jie Guo, Kai Cheng, Bowen Pan, Xiaoxiao Long, Jiangjing Lyu, Chengfei Lv, Yanwen Guo

    Abstract: Reconstructing objects from posed images is a crucial and complex task in computer graphics and computer vision. While NeRF-based neural reconstruction methods have exhibited impressive reconstruction ability, they tend to be time-comsuming. Recent strategies have adopted 3D Gaussian Splatting (3D-GS) for inverse rendering, which have led to quick and effective outcomes. However, these techniques… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  11. arXiv:2410.13186  [pdf, other

    astro-ph.SR physics.space-ph

    A statistical study on the peak and fluence spectra of Solar Energetic Particles observed over 4 solar cycles

    Authors: Yubao Wang, Jingnan Guo

    Abstract: Solar energetic particles (SEPs) are an important space radiation source, especially for the space weather environment in the inner heliosphere. The energy spectrum of SEP events is crucial both for evaluating their radiation effects and for understanding their acceleration process at the source region and their propagation mechanism. In this work, we investigate the properties of the SEP peak flu… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  12. arXiv:2410.13185  [pdf, other

    cs.AI cs.CL

    Chain of Ideas: Revolutionizing Research in Novel Idea Development with LLM Agents

    Authors: Long Li, Weiwen Xu, Jiayan Guo, Ruochen Zhao, Xinxuan Li, Yuqian Yuan, Boqiang Zhang, Yuming Jiang, Yifei Xin, Ronghao Dang, Deli Zhao, Yu Rong, Tian Feng, Lidong Bing

    Abstract: Effective research ideation is a critical step for scientific research. However, the exponential increase in scientific literature makes it challenging for researchers to stay current with recent advances and identify meaningful research directions. Recent developments in large language models~(LLMs) suggest a promising avenue for automating the generation of novel research ideas. However, existin… ▽ More

    Submitted 20 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: 10 pages,5 figures, conference

  13. arXiv:2410.12841  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    UniAutoML: A Human-Centered Framework for Unified Discriminative and Generative AutoML with Large Language Models

    Authors: Jiayi Guo, Zan Chen, Yingrui Ji, Liyun Zhang, Daqin Luo, Zhigang Li, Yiqin Shen

    Abstract: Automated Machine Learning (AutoML) has simplified complex ML processes such as data pre-processing, model selection, and hyper-parameter searching. However, traditional AutoML frameworks focus solely on discriminative tasks, often falling short in tackling AutoML for generative models. Additionally, these frameworks lack interpretability and user engagement during the training process, primarily… ▽ More

    Submitted 17 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  14. arXiv:2410.12620  [pdf, other

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  15. arXiv:2410.12558  [pdf, other

    cs.CL cs.AI

    A Claim Decomposition Benchmark for Long-form Answer Verification

    Authors: Zhihao Zhang, Yixing Fan, Ruqing Zhang, Jiafeng Guo

    Abstract: The advancement of LLMs has significantly boosted the performance of complex long-form question answering tasks. However, one prominent issue of LLMs is the generated "hallucination" responses that are not factual. Consequently, attribution for each claim in responses becomes a common solution to improve the factuality and verifiability. Existing researches mainly focus on how to provide accurate… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: Accepted by CCIR 2024

  16. arXiv:2410.11607  [pdf, other

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  17. arXiv:2410.11527  [pdf, other

    q-bio.BM cs.LG

    It Takes Two to Tango: Directly Optimizing for Constrained Synthesizability in Generative Molecular Design

    Authors: Jeff Guo, Philippe Schwaller

    Abstract: Constrained synthesizability is an unaddressed challenge in generative molecular design. In particular, designing molecules satisfying multi-parameter optimization objectives, while simultaneously being synthesizable and enforcing the presence of specific commercial building blocks in the synthesis. This is practically important for molecule re-purposing, sustainability, and efficiency. In this wo… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  18. arXiv:2410.11400  [pdf, other

    eess.SP cs.LG

    RSSI-Assisted CSI-Based Passenger Counting with Multiple Wi-Fi Receivers

    Authors: Jingtao Guo, Wenhao Zhuang, Yuyi Mao, Ivan Wang-Hei Ho

    Abstract: Passenger counting is crucial for public transport vehicle scheduling and traffic capacity evaluation. However, most existing methods are either costly or with low counting accuracy, leading to the recent use of Wi-Fi signals for this purpose. In this paper, we develop an efficient edge computing-based passenger counting system consists of multiple Wi-Fi receivers and an edge server. It leverages… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 6 pages, 9 figures, this article was submitted to IEEE for possible publication

  19. arXiv:2410.11217  [pdf, ps, other

    cs.CL cs.AI cs.IR

    On the Capacity of Citation Generation by Large Language Models

    Authors: Haosheng Qian, Yixing Fan, Ruqing Zhang, Jiafeng Guo

    Abstract: Retrieval-augmented generation (RAG) appears as a promising method to alleviate the "hallucination" problem in large language models (LLMs), since it can incorporate external traceable resources for response generation. The essence of RAG in combating the hallucination issue lies in accurately attributing claims in responses to the corresponding retrieved documents. However, most of existing works… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted by CCIR 2024

  20. arXiv:2410.10819  [pdf, other

    cs.CL

    DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

    Authors: Guangxuan Xiao, Jiaming Tang, Jingwei Zuo, Junxian Guo, Shang Yang, Haotian Tang, Yao Fu, Song Han

    Abstract: Deploying long-context large language models (LLMs) is essential but poses significant computational and memory challenges. Caching all Key and Value (KV) states across all attention heads consumes substantial memory. Existing KV cache pruning methods either damage the long-context capabilities of LLMs or offer only limited efficiency improvements. In this paper, we identify that only a fraction o… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  21. arXiv:2410.10481  [pdf, other

    cs.LG cs.AI cs.CR

    Model-Based Differentially Private Knowledge Transfer for Large Language Models

    Authors: Zhaomin Wu, Jizhou Guo, Junyi Hou, Bingsheng He, Lixin Fan, Qiang Yang

    Abstract: As large language models (LLMs) become increasingly prevalent in web services, effectively leveraging domain-specific knowledge while ensuring privacy has become critical. Existing methods, such as retrieval-augmented generation (RAG) and differentially private data synthesis, often compromise either the utility of domain knowledge or the privacy of sensitive data, limiting their applicability in… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  22. arXiv:2410.10441  [pdf, other

    cs.CV cs.AI

    Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs

    Authors: Kai Han, Jianyuan Guo, Yehui Tang, Wei He, Enhua Wu, Yunhe Wang

    Abstract: Vision-language large models have achieved remarkable success in various multi-modal tasks, yet applying them to video understanding remains challenging due to the inherent complexity and computational demands of video data. While training-based video-LLMs deliver high performance, they often require substantial resources for training and inference. Conversely, training-free approaches offer a mor… ▽ More

    Submitted 16 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: Tech report

  23. arXiv:2410.10399  [pdf, other

    cs.CV

    Parameterize Structure with Differentiable Template for 3D Shape Generation

    Authors: Changfeng Ma, Pengxiao Guo, Shuangyu Yang, Yinuo Chen, Jie Guo, Chongjun Wang, Yanwen Guo, Wenping Wang

    Abstract: Structural representation is crucial for reconstructing and generating editable 3D shapes with part semantics. Recent 3D shape generation works employ complicated networks and structure definitions relying on hierarchical annotations and pay less attention to the details inside parts. In this paper, we propose the method that parameterizes the shared structure in the same category using a differen… ▽ More

    Submitted 15 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

  24. arXiv:2410.10157  [pdf, other

    eess.SP

    Caching Content Placement and Beamforming Co-design for IRS-Aided MIMO Systems with Imperfect CSI

    Authors: Meng Gao, Yang Wang, Huafu Li, Junqi Guo

    Abstract: When offloading links encounter deep fading and obstruction, edge caching cannot fully enhance wireless network performance and improve the QoS of edge nodes, as it fails to effectively reduce backhaul burden. The emerging technology of intelligent reflecting surfaces (IRS) compensates for this disadvantage by creating a smart and reconfigurable wireless environment. Subsequently, we jointly desig… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  25. arXiv:2410.08603  [pdf, other

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773\,GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  26. arXiv:2410.07626  [pdf, other

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  27. arXiv:2410.06500  [pdf, other

    hep-ex

    Search for the radiative decays $D^+\toγρ^+$ and $D^+\toγK^{*+}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (648 additional authors not shown)

    Abstract: We search for the radiative decays $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ using 20.3~fb$^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and the upper limits on the branching fractions of $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ at 90\% confidence level ar… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  28. arXiv:2410.06497  [pdf, other

    cs.IR cs.AI cs.DC cs.LG

    ERCache: An Efficient and Reliable Caching Framework for Large-Scale User Representations in Meta's Ads System

    Authors: Fang Zhou, Yaning Huang, Dong Liang, Dai Li, Zhongke Zhang, Kai Wang, Xiao Xin, Abdallah Aboelela, Zheliang Jiang, Yang Wang, Jeff Song, Wei Zhang, Chen Liang, Huayu Li, ChongLin Sun, Hang Yang, Lei Qu, Zhan Shu, Mindi Yuan, Emanuele Maccherani, Taha Hayat, John Guo, Varna Puvvada, Uladzimir Pashkevich

    Abstract: The increasing complexity of deep learning models used for calculating user representations presents significant challenges, particularly with limited computational resources and strict service-level agreements (SLAs). Previous research efforts have focused on optimizing model inference but have overlooked a critical question: is it necessary to perform user model inference for every ad request in… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  29. arXiv:2410.06157  [pdf, other

    cs.CR cs.SE

    Detecting Android Malware by Visualizing App Behaviors from Multiple Complementary Views

    Authors: Zhaoyi Meng, Jiale Zhang, Jiaqi Guo, Wansen Wang, Wenchao Huang, Jie Cui, Hong Zhong, Yan Xiong

    Abstract: Deep learning has emerged as a promising technology for achieving Android malware detection. To further unleash its detection potentials, software visualization can be integrated for analyzing the details of app behaviors clearly. However, facing increasingly sophisticated malware, existing visualization-based methods, analyzing from one or randomly-selected few views, can only detect limited atta… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: Submitted to TIFS

  30. arXiv:2410.05736  [pdf, ps, other

    hep-ex

    Observation of an axial-vector state in the study of $ψ(3686) \to φηη'$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (625 additional authors not shown)

    Abstract: Using (2712.4 $\pm$ 14.3)$\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, a partial wave analysis of the decay $ψ(3686) \to φηη' $ is performed with the covariant tensor approach. An axial-vector state with a mass near 2.3 $\rm GeV/c^2$ is observed for the first time. Its mass and width are measured to be 2316… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  31. arXiv:2410.05398  [pdf, other

    hep-ph

    Tracing the bottom electroweak dipole operators at future lepton colliders

    Authors: Jiayin Gu, Jiayu Guo, Xiao-Ze Tan

    Abstract: While often omitted in the SMEFT analyses of electroweak measurements, the electroweak dipole operators of the bottom quark have been found to be important in some cases, and are also related to processes involving the top quark. In this paper, we further investigate their effects, focusing on the measurements of the $e^+e^-\to b\bar{b}$ process at a future lepton collider. Their linear contributi… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 22 pages including references, 20 figures

    Report number: DESY-24-146

  32. arXiv:2410.04407  [pdf, other

    cs.CL

    Lens: Rethinking Multilingual Enhancement for Large Language Models

    Authors: Weixiang Zhao, Yulin Hu, Jiahe Guo, Xingyu Sui, Tongtong Wu, Yang Deng, Yanyan Zhao, Bing Qin, Wanxiang Che, Ting Liu

    Abstract: Despite the growing global demand for large language models (LLMs) that serve users from diverse linguistic backgrounds, most cutting-edge LLMs remain predominantly English-centric. This creates a performance gap across languages, restricting access to advanced AI services for non-English speakers. Current methods to enhance multilingual capabilities largely rely on data-driven post-training techn… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 21 pages, 9 figures, 5 tables

  33. arXiv:2410.04341  [pdf, ps, other

    math.GR math.CO

    On multivalued groups of order 3

    Authors: Jin Guo, Ilia Ponomarenko, Andrey V. Vasil'ev

    Abstract: A complete classification of the multivalued coset groups of order $3$ is given. The proof is based on the classification of rank $3$ groups having regular normal subgroups.

    Submitted 5 October, 2024; originally announced October 2024.

    MSC Class: 20N20; 20B25; 05E30

  34. arXiv:2410.03956  [pdf, other

    cond-mat.supr-con

    Charge Density Fluctuations with Enhanced Superconductivity at the Proposed Nematic Quantum Critical Point

    Authors: Youzhe Chen, Nathan Giles-Donovan, Jiayu Guo, Ruihan Chen, Hiroshi Fukui, Taishun Manjo, Daisuke Ishikawa, Alfred Q. R. Baron, Yu Song, Robert J Birgeneau

    Abstract: A quantum critical point (QCP) represents a continuous phase transition at absolute zero. At the QCP of an unconventional superconductor, enhanced superconducting transition temperature and magnetic fluctuations strength are often observed together, indicating magnetism-mediated superconductivity. This raises the question of whether quantum fluctuations in other degrees of freedom, such as charge,… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 7pages, 4 figures

  35. arXiv:2410.02421  [pdf, other

    hep-ex

    Search for lepton number violating decays of $D_s^+\to h^-h^0e^+e^+$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (650 additional authors not shown)

    Abstract: Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector operating at the BEPCII collider at center-of-mass energies from 4.128 to 4.226 GeV, a search for the Majorana neutrino $ν_m$ is conducted in the lepton-number-violating decays of $D_s^+\to h^-h^0e^+e^+$. Here, $h^-$ represents a $K^-$ or $π^-$, and $h^0$ represents a $π^0$, $K_S^0$ or $φ$. No significant signal is… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  36. arXiv:2410.01702  [pdf, other

    cs.RO

    D(R, O) Grasp: A Unified Representation of Robot and Object Interaction for Cross-Embodiment Dexterous Grasping

    Authors: Zhenyu Wei, Zhixuan Xu, Jingxiang Guo, Yiwen Hou, Chongkai Gao, Zhehao Cai, Jiayu Luo, Lin Shao

    Abstract: Dexterous grasping is a fundamental yet challenging skill in robotic manipulation, requiring precise interaction between robotic hands and objects. In this paper, we present D(R,O) Grasp, a novel framework that models the interaction between the robotic hand in its grasping pose and the object, enabling broad generalization across various robot hands and object geometries. Our model takes the robo… ▽ More

    Submitted 8 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

  37. arXiv:2410.01095  [pdf, other

    physics.optics

    Harnessing micro-Fabry-Perot reference cavities in photonic integrated circuits

    Authors: Haotian Cheng, Chao Xiang, Naijun Jin, Igor Kudelin, Joel Guo, Matthew Heyrich, Yifan Liu, Jonathan Peters, Qing-Xin Ji, Yishu Zhou, Kerry J. Vahala, Franklyn Quinlan, Scott A. Diddams, John E. Bowers, Peter T. Rakich

    Abstract: Compact photonic systems that offer high frequency stability and low noise are of increasing importance to applications in precision metrology, quantum computing, communication, and advanced sensing technologies. However, on-chip resonators comprised of dielectrics cannot match the frequency stability and noise characteristics of Fabry-Perot cavities, whose electromagnetic modes live almost entire… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  38. arXiv:2410.00508  [pdf, other

    cs.CL cs.AI

    FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization

    Authors: Mingye Zhu, Yi Liu, Quan Wang, Junbo Guo, Zhendong Mao

    Abstract: Recent breakthroughs in preference alignment have significantly improved Large Language Models' ability to generate texts that align with human preferences and values. However, current alignment metrics typically emphasize the post-hoc overall improvement, while overlooking a critical aspect: regression, which refers to the backsliding on previously correctly-handled data after updates. This poten… ▽ More

    Submitted 14 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: Accepted by EMNLP 2024 Main track

  39. arXiv:2409.19754  [pdf, other

    cs.CV

    Offline Signature Verification Based on Feature Disentangling Aided Variational Autoencoder

    Authors: Hansong Zhang, Jiangjian Guo, Kun Li, Yang Zhang, Yimei Zhao

    Abstract: Offline handwritten signature verification systems are used to verify the identity of individuals, through recognizing their handwritten signature image as genuine signatures or forgeries. The main tasks of signature verification systems include extracting features from signature images and training a classifier for classification. The challenges of these tasks are twofold. First, genuine signatur… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

  40. arXiv:2409.19205  [pdf, other

    astro-ph.GA

    A Search for $z=5$ H$α$ and H$β+$[O III] Dual-Line Emitting Galaxies in the JWST CEERS Field: Implications for the AGN Abundance

    Authors: Jingsong Guo, Masafusa Onoue, Kohei Inayoshi, Dale D. Kocevski, Steven L. Finkelstein, Micaela B. Bagley, Elizabeth J. McGrath

    Abstract: The James Webb Space Telescope (JWST) has enabled us to uncover faint galaxies and active galactic nuclei (AGNs) in the early universe. Taking advantage of the unique filter combination used in the Cosmic Evolution Early Release Science Survey (CEERS) program, we perform an extensive photometric search of galaxies emitting strong H$β+$[O III] and H$α$ lines. The redshift range of the galaxies is l… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: 24 pages, 16 figures, 5 tables. submitted to ApJ

  41. arXiv:2409.18463  [pdf, other

    nucl-ex

    First Measurement of Near- and Sub-Threshold $J/ψ$ Photoproduction off Nuclei

    Authors: J. R. Pybus, L. Ehinger, T. Kolar, B. Devkota, P. Sharp, B. Yu, M. M. Dalton, D. Dutta, H. Gao, O. Hen, E. Piasetzky, S. N. Santiesteban, A. Schmidt, A. Somov, H. Szumila-Vance, S. Adhikari, A. Asaturyan, A. Austregesilo, C. Ayerbe Gayoso, J. Barlow, V. V. Berdnikov, H. D. Bhatt, Deepak Bhetuwal, T. Black, W. J. Briscoe , et al. (42 additional authors not shown)

    Abstract: We report on the first measurement of $J/ψ$ photoproduction from nuclei in the photon energy range of $7$ to $10.8$ GeV, extending above and below the photoproduction threshold in the free proton of $\sim8.2$ GeV. The experiment used a tagged photon beam incident on deuterium, helium, and carbon, and the GlueX detector at Jefferson Lab to measure the semi-inclusive $A(γ,e^+e^-p)$ reaction with a d… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  42. arXiv:2409.18409  [pdf, ps, other

    cs.IR

    Generative Retrieval Meets Multi-Graded Relevance

    Authors: Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Wei Chen, Xueqi Cheng

    Abstract: Generative retrieval represents a novel approach to information retrieval. It uses an encoder-decoder architecture to directly produce relevant document identifiers (docids) for queries. While this method offers benefits, current approaches are limited to scenarios with binary relevance data, overlooking the potential for documents to have multi-graded relevance. Extending generative retrieval to… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted by the NeurIPS 2024 (Spotlight)

  43. arXiv:2409.17891  [pdf, other

    quant-ph

    Quantum entanglement in phase space

    Authors: Shuheng Liu, Jiajie Guo, Qiongyi He, Matteo Fadel

    Abstract: While commonly used entanglement criteria for continuous variable systems are based on quadrature measurements, here we study entanglement detection from measurements of the Wigner function. These are routinely performed in platforms such as trapped ions and circuit QED, where homodyne measurements are difficult to be implemented. We provide complementary criteria which we show to be tight for a v… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Comments are welcomed!

  44. arXiv:2409.17243  [pdf, other

    hep-ph

    Long-lived Sterile Neutrino Searches at Future Muon Colliders

    Authors: Qi Bi, Jinhui Guo, Jia Liu, Yan Luo, Xiao-Ping Wang

    Abstract: We explore the potential of studying sterile neutrinos at a future high-energy muon collider, where these particles can generate small active neutrino masses via the seesaw mechanism and exhibit long-lived particle signatures. A Dirac sterile neutrino model with ${\rm U(1)}_{L_μ-L_τ}$ symmetry is introduced, where the heavy right-handed neutrino ($N_R$) produces tiny active neutrino masses, and th… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 20 pages, 6 figures

  45. arXiv:2409.17199  [pdf

    physics.optics

    Optical Multilayer Thin Film Structure Inverse Design: From Optimization to Deep Learning

    Authors: Taigao Ma, Mingqian Ma, L. Jay Guo

    Abstract: Optical multilayer thin film structures have been widely used in numerous photonic domains and applications. The key component to enable these applications is the inverse design. Different from other photonic structures such as metasurface or waveguide, multilayer thin film is a one-dimensional structure, which deserves its own treatment for the design process. Optimization has always been the sta… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 30 Pages, 9 Figures

  46. arXiv:2409.16719  [pdf, other

    nlin.CD

    Multi-functional reservoir computing

    Authors: Yao Du, Haibo Luo, Jianmin Guo, Jinghua Xiao, Yizhen Yu, Xingang Wang

    Abstract: Whereas the power of reservoir computing (RC) in inferring chaotic systems has been well established in the literature, the studies are mostly restricted to mono-functional machines where the training and testing data are acquired from the same attractor. Here, using the strategies of attractor labeling and trajectory separation, we propose a new scheme of RC capable of learning multiple attractor… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 14 pages, 7 figures

  47. arXiv:2409.16694  [pdf, other

    cs.AI cs.CL cs.LG

    A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

    Authors: Ruihao Gong, Yifu Ding, Zining Wang, Chengtao Lv, Xingyu Zheng, Jinyang Du, Haotong Qin, Jinyang Guo, Michele Magno, Xianglong Liu

    Abstract: Large language models (LLMs) have achieved remarkable advancements in natural language processing, showcasing exceptional performance across various tasks. However, the expensive memory and computational requirements present significant challenges for their practical deployment. Low-bit quantization has emerged as a critical approach to mitigate these challenges by reducing the bit-width of model… ▽ More

    Submitted 30 September, 2024; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: Ruihao Gong leads the overall organization of the survey, with Yifu Ding and Jinyang Du contributing to Sections 2 and 3. Xingyu Zheng is responsible for authoring Section 4, while Chengtao Lv and Zining Wang collaborate on Section 5. Haotong Qin, Jinyang Guo, Michele Magno, and Xianglong Liu provide guidance during the whole process and assist in refining the final manuscript

  48. arXiv:2409.16539  [pdf, other

    cs.AI

    Context-aware and Style-related Incremental Decoding framework for Discourse-Level Literary Translation

    Authors: Yuanchang Luo, Jiaxin Guo, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhanglin Wu, Zhiqiang Rao, Shaojun Li, Jinlong Yang, Hao Yang

    Abstract: This report outlines our approach for the WMT24 Discourse-Level Literary Translation Task, focusing on the Chinese-English language pair in the Constrained Track. Translating literary texts poses significant challenges due to the nuanced meanings, idiomatic expressions, and intricate narrative structures inherent in such works. To address these challenges, we leveraged the Chinese-Llama2 model, sp… ▽ More

    Submitted 29 September, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 7 pages, 2 figures, wmt24

  49. arXiv:2409.16331  [pdf, other

    cs.CL cs.AI

    Exploring the traditional NMT model and Large Language Model for chat translation

    Authors: Jinlong Yang, Hengchao Shang, Daimeng Wei, Jiaxin Guo, Zongyao Li, Zhanglin Wu, Zhiqiang Rao, Shaojun Li, Yuhao Xie, Yuanchang Luo, Jiawei Zheng, Bin Wei, Hao Yang

    Abstract: This paper describes the submissions of Huawei Translation Services Center(HW-TSC) to WMT24 chat translation shared task on English$\leftrightarrow$Germany (en-de) bidirection. The experiments involved fine-tuning models using chat data and exploring various strategies, including Minimum Bayesian Risk (MBR) decoding and self-training. The results show significant performance improvements in certai… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 7 pages, 6 Tables, WMT24

  50. arXiv:2409.16146  [pdf, other

    cs.CL

    Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework

    Authors: Lu Chen, Ruqing Zhang, Jiafeng Guo, Yixing Fan, Xueqi Cheng

    Abstract: Retrieval-augmented generation (RAG) has emerged as a popular solution to mitigate the hallucination issues of large language models. However, existing studies on RAG seldom address the issue of predictive uncertainty, i.e., how likely it is that a RAG model's prediction is incorrect, resulting in uncontrollable risks in real-world applications. In this work, we emphasize the importance of risk co… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.