Skip to main content

Showing 1–50 of 1,104 results for author: Xia, C

  1. arXiv:2410.16078  [pdf, other

    hep-ph hep-ex

    The $Λ_c^+\toηπ^+Λ$ reaction and the $Λa_0^+(980)$ and $π^+Λ(1670)$ contributions

    Authors: Man-Yu Duan, Wen-Tao Lyu, Chu-Wen Xiao, En Wang, Ju-Jun Xie, Dian-Yong Chen, Dian-Yong Chen

    Abstract: We study from the theoretical point of view the $Λ_c^+\to π^+ ηΛ$ reaction, recently measured by the Belle and BESIII Collaborations, where clear signals are observed for $a_0(980)$, $Λ(1670)$, and $Σ(1385)$ excitation. By considering the $a_0(980)$ and $Λ(1670)$ as dynamically generated resonances from the meson meson and meson baryon interaction, respectively, we are able to determine their rela… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2410.15342  [pdf, other

    cs.SD cs.LG eess.AS

    ConSinger: Efficient High-Fidelity Singing Voice Generation with Minimal Steps

    Authors: Yulin Song, Guorui Sang, Jing Yu, Chuangbai Xiao

    Abstract: Singing voice synthesis (SVS) system is expected to generate high-fidelity singing voice from given music scores (lyrics, duration and pitch). Recently, diffusion models have performed well in this field. However, sacrificing inference speed to exchange with high-quality sample generation limits its application scenarios. In order to obtain high quality synthetic singing voice more efficiently, we… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: Singing voice synthesis, Consistency models, diffusion models

  3. arXiv:2410.15283  [pdf

    cs.LG eess.SY

    TRIZ Method for Urban Building Energy Optimization: GWO-SARIMA-LSTM Forecasting model

    Authors: Shirong Zheng, Shaobo Liu, Zhenhong Zhang, Dian Gu, Chunqiu Xia, Huadong Pang, Enock Mintah Ampaw

    Abstract: With the advancement of global climate change and sustainable development goals, urban building energy consumption optimization and carbon emission reduction have become the focus of research. Traditional energy consumption prediction methods often lack accuracy and adaptability due to their inability to fully consider complex energy consumption patterns, especially in dealing with seasonal fluctu… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 29 pages

  4. arXiv:2410.15136  [pdf, other

    cs.CL

    CAST: Corpus-Aware Self-similarity Enhanced Topic modelling

    Authors: Yanan Ma, Chenghao Xiao, Chenhan Yuan, Sabine N van der Veer, Lamiece Hassan, Chenghua Lin, Goran Nenadic

    Abstract: Topic modelling is a pivotal unsupervised machine learning technique for extracting valuable insights from large document collections. Existing neural topic modelling methods often encode contextual information of documents, while ignoring contextual details of candidate centroid words, leading to the inaccurate selection of topic words due to the contextualization gap. In parallel, it is found th… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

  5. arXiv:2410.15091  [pdf, other

    cs.CV

    Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion

    Authors: Chaodong Xiao, Minghan Li, Zhengqiang Zhang, Deyu Meng, Lei Zhang

    Abstract: Selective state space models (SSMs), such as Mamba, highly excel at capturing long-range dependencies in 1D sequential data, while their applications to 2D vision tasks still face challenges. Current visual SSMs often convert images into 1D sequences and employ various scanning patterns to incorporate local spatial dependencies. However, these methods are limited in effectively capturing the compl… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 16 pages, 8 figures, 5 tables

  6. arXiv:2410.14676  [pdf, other

    cs.CL cs.AI

    SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment

    Authors: Qin Liu, Fei Wang, Chaowei Xiao, Muhao Chen

    Abstract: Existing preference alignment is a one-size-fits-all alignment mechanism, where the part of the large language model (LLM) parametric knowledge with non-preferred features is uniformly blocked to all the users. However, this part of knowledge can be useful to advanced users whose expertise qualifies them to handle these information. The one-size-fits-all alignment mechanism undermines LLM's utilit… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  7. arXiv:2410.14629  [pdf, other

    cs.LG cs.DB cs.IR

    SIMformer: Single-Layer Vanilla Transformer Can Learn Free-Space Trajectory Similarity

    Authors: Chuang Yang, Renhe Jiang, Xiaohang Xu, Chuan Xiao, Kaoru Sezaki

    Abstract: Free-space trajectory similarity calculation, e.g., DTW, Hausdorff, and Frechet, often incur quadratic time complexity, thus learning-based methods have been proposed to accelerate the computation. The core idea is to train an encoder to transform trajectories into representation vectors and then compute vector similarity to approximate the ground truth. However, existing methods face dual challen… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  8. arXiv:2410.13601  [pdf, ps, other

    math.AP math.DG

    The Logarithmic Sobolev inequality on non-compact self-shrinkers

    Authors: Guofang Wang, Chao Xia, Xiqiang Zhang

    Abstract: In the paper we establish an optimal logarithmic Sobolev inequality for complete, non-compact, properly embedded self-shrinkers in the Euclidean space, which generalizes a recent result of Brendle \cite{Brendle22} for closed self-shrinkers. We first provide a proof for the logarithmic Sobolev inequality in the Euclidean space by using the Alexandrov-Bakelman-Pucci (ABP) method. Then we use this ap… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 16 pages

  9. arXiv:2410.13437  [pdf, other

    cs.CV

    Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation

    Authors: Changcheng Xiao, Qiong Cao, Yujie Zhong, Xiang Zhang, Tao Wang, Canqun Yang, Long Lan

    Abstract: Referring multi-object tracking (RMOT) is an emerging cross-modal task that aims to locate an arbitrary number of target objects and maintain their identities referred by a language expression in a video. This intricate task involves the reasoning of linguistic and visual modalities, along with the temporal association of target objects. However, the seminal work employs only loose feature fusion… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  10. arXiv:2410.12831  [pdf, other

    eess.IV cs.AI cs.CV

    Segment as You Wish -- Free-Form Language-Based Segmentation for Medical Images

    Authors: Longchao Da, Rui Wang, Xiaojian Xu, Parminder Bhatia, Taha Kass-Hout, Hua Wei, Cao Xiao

    Abstract: Medical imaging is crucial for diagnosing a patient's health condition, and accurate segmentation of these images is essential for isolating regions of interest to ensure precise diagnosis and treatment planning. Existing methods primarily rely on bounding boxes or point-based prompts, while few have explored text-related prompts, despite clinicians often describing their observations and instruct… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  11. arXiv:2410.11734  [pdf

    cond-mat.mes-hall

    Anomalously Enhanced Diffusivity of Moiré Excitons via Manipulating the Interplay with Correlated Electrons

    Authors: Li Yan, Lei Ma, Yuze Meng, Chengxin Xiao, Bo Chen, Qiran Wu, Jingyuan Cui, Qingrui Cao, Rounak Banerjee, Takashi Taniguchi, Kenji Watanabe, Seth Ariel Tongay, Benjamin Hunt, Yong-Tao Cui, Wang Yao, Su-Fei Shi

    Abstract: Semiconducting transitional metal dichalcogenides (TMDCs) moiré superlattice provides an exciting platform for manipulating excitons. The in-situ control of moiré potential confined exciton would usher in unprecedented functions of excitonic devices but remains challenging. Meanwhile, as a dipolar composite boson, interlayer exciton in the type-II aligned TMDC moiré superlattice strongly interacts… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  12. arXiv:2410.10353  [pdf, other

    cs.RO

    HumanFT: A Human-like Fingertip Multimodal Visuo-Tactile Sensor

    Authors: Yifan Wu, Yuzhou Chen, Zhengying Zhu, Xuhao Qin, Chenxi Xiao

    Abstract: Tactile sensors play a crucial role in enabling robots to interact effectively and safely with objects in everyday tasks. In particular, visuotactile sensors have seen increasing usage in two and three-fingered grippers due to their high-quality feedback. However, a significant gap remains in the development of sensors suitable for humanoid robots, especially five-fingered dexterous hands. One rea… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  13. arXiv:2410.10214  [pdf

    physics.app-ph cond-mat.mes-hall cond-mat.mtrl-sci

    Gaseous Scissor-mediated Electrochemical Exfoliation of Halogenated MXenes and its Boosting in Wear-Resisting Tribovoltaic Devices

    Authors: Qi Fan, Minghua Chen, Longyi Li, Minghui Li, Chuanxiao Xiao, Tianci Zhao, Long Pan, Ningning Liang, Qing Huang, Laipan Zhu, Michael Naguib, Kun Liang

    Abstract: Two-dimensional transition metal carbides (MXenes), especially their few-layered nanosheets, have triggered burgeoning research attentions owing to their superiorities including extraordinary conductivity, accessible active surface, and adjustable processability. Molten salts etching route further achieves their controllable surface chemistry. However, the method encounters challenges in achieving… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  14. arXiv:2410.09079  [pdf, other

    cs.CL cs.AI cs.LG

    BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models

    Authors: Aofei Chang, Jiaqi Wang, Han Liu, Parminder Bhatia, Cao Xiao, Ting Wang, Fenglong Ma

    Abstract: Parameter Efficient Fine-Tuning (PEFT) offers an efficient solution for fine-tuning large pretrained language models for downstream tasks. However, most PEFT strategies are manually designed, often resulting in suboptimal performance. Recent automatic PEFT approaches aim to address this but face challenges such as search space entanglement, inefficiency, and lack of integration between parameter b… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP 2024 (Findings)

  15. arXiv:2410.08660  [pdf, other

    cs.CR cs.AI

    RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process

    Authors: Peiran Wang, Xiaogeng Liu, Chaowei Xiao

    Abstract: In this study, we introduce RePD, an innovative attack Retrieval-based Prompt Decomposition framework designed to mitigate the risk of jailbreak attacks on large language models (LLMs). Despite rigorous pretraining and finetuning focused on ethical alignment, LLMs are still susceptible to jailbreak exploits. RePD operates on a one-shot learning model, wherein it accesses a database of pre-collecte… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.04783 by other authors

  16. arXiv:2410.07165  [pdf, other

    cs.AI cs.LG

    Complex Logical Query Answering by Calibrating Knowledge Graph Completion Models

    Authors: Changyi Xiao, Yixin Cao

    Abstract: Complex logical query answering (CLQA) is a challenging task that involves finding answer entities for complex logical queries over incomplete knowledge graphs (KGs). Previous research has explored the use of pre-trained knowledge graph completion (KGC) models, which can predict the missing facts in KGs, to answer complex logical queries. However, KGC models are typically evaluated using ranking e… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

  17. arXiv:2410.06862  [pdf, ps, other

    math.RT

    Representations of non-finitely graded Heisenberg-Virasoro type Lie algebras

    Authors: Chunguang Xia, Tianyu Ma, Wei Wang, Mingjing Zhang

    Abstract: We construct and study non-finitely graded Lie algebras $\mathcal{HV}(a,b;ε)$ related to Heisenberg-Virasoro type Lie algebras, where $a,b$ are complex numbers, and $ε= \pm 1$. Using combinatorial techniques, we completely classify the free $\mathcal{U}(\mathfrak h)$-modules of rank one over $\mathcal{HV}(a,b;ε)$. It turns out that these modules are more varied and complex than those over non-fini… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 19 pages

  18. arXiv:2410.06581  [pdf, other

    cs.IR

    Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs

    Authors: Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun

    Abstract: Legal case retrieval (LCR) aims to provide similar cases as references for a given fact description. This task is crucial for promoting consistent judgments in similar cases, effectively enhancing judicial fairness and improving work efficiency for judges. However, existing works face two main challenges for real-world applications: existing works mainly focus on case-to-case retrieval using lengt… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 15 pages, 3 figures, accepted by EMNLP 2024

  19. arXiv:2410.06209  [pdf, other

    cs.LG cs.AI cs.LO

    LeanAgent: Lifelong Learning for Formal Theorem Proving

    Authors: Adarsh Kumarappan, Mo Tiwari, Peiyang Song, Robert Joseph George, Chaowei Xiao, Anima Anandkumar

    Abstract: Large Language Models (LLMs) have been successful in mathematical reasoning tasks such as formal theorem proving when integrated with interactive proof assistants like Lean. Existing approaches involve training or fine-tuning an LLM on a specific dataset to perform well on particular domains, such as undergraduate-level mathematics. These methods struggle with generalizability to advanced mathemat… ▽ More

    Submitted 17 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

  20. arXiv:2410.06040  [pdf, other

    cs.LG

    QERA: an Analytical Framework for Quantization Error Reconstruction

    Authors: Cheng Zhang, Jeffrey T. H. Wong, Can Xiao, George A. Constantinides, Yiren Zhao

    Abstract: he growing number of parameters and computational demands of large language models (LLMs) present significant challenges for their efficient deployment. Recently, there is an increasing interest in quantizing weights to extremely low precision while offsetting the resulting error with low-rank, high-precision error reconstruction terms. The combination of quantization and low-rank approximation is… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  21. arXiv:2410.05749  [pdf, other

    hep-ph hep-ex

    The $X(4500)$ state considered as the mixture of hadronic molecule and diquark-antidiquark within effective field theory

    Authors: De-Shun Zhang, Wei He, Chu-Wen Xiao, Zhi-Feng Sun

    Abstract: In the present work, we construct the Lagrangians including three-meson, meson-diquark-antidiquark vertices, such that the diquark-antidiquark component as well as the molecular component are introduced within the effective field theory. With the obtained effective potentials projecting to spin 0, 1 and 2, we solve the Bethe-Salpeter equation with the on-shell approximation, and find that… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 6 pages, 2 figures

  22. arXiv:2410.05578  [pdf, other

    cs.LG cs.AI

    Swift Sampler: Efficient Learning of Sampler by 10 Parameters

    Authors: Jiawei Yao, Chuming Li, Canran Xiao

    Abstract: Data selection is essential for training deep learning models. An effective data sampler assigns proper sampling probability for training data and helps the model converge to a good local minimum with high performance. Previous studies in data sampling are mainly based on heuristic rules or learning through a huge amount of time-consuming trials. In this paper, we propose an automatic \textbf{swif… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024. Project page: https://github.com/Alexander-Yao/Swift-Sampler

  23. arXiv:2410.05295  [pdf, other

    cs.CR cs.AI cs.LG

    AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

    Authors: Xiaogeng Liu, Peiran Li, Edward Suh, Yevgeniy Vorobeychik, Zhuoqing Mao, Somesh Jha, Patrick McDaniel, Huan Sun, Bo Li, Chaowei Xiao

    Abstract: In this paper, we propose AutoDAN-Turbo, a black-box jailbreak method that can automatically discover as many jailbreak strategies as possible from scratch, without any human intervention or predefined scopes (e.g., specified candidate strategies), and use them for red-teaming. As a result, AutoDAN-Turbo can significantly outperform baseline methods, achieving a 74.3% higher average attack success… ▽ More

    Submitted 13 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Pre-print. Project Page: https://autodans.github.io/AutoDAN-Turbo Code: https://github.com/SaFoLab-WISC/AutoDAN-Turbo

  24. arXiv:2410.04981  [pdf, other

    cs.CL

    On the Rigour of Scientific Writing: Criteria, Analysis, and Insights

    Authors: Joseph James, Chenghao Xiao, Yucheng Li, Chenghua Lin

    Abstract: Rigour is crucial for scientific research as it ensures the reproducibility and validity of results and findings. Despite its importance, little work exists on modelling rigour computationally, and there is a lack of analysis on whether these criteria can effectively signal or measure the rigour of scientific papers in practice. In this paper, we introduce a bottom-up, data-driven framework to aut… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted Findings at EMNLP 2024

  25. arXiv:2410.04727  [pdf, other

    cs.CL

    Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context Models

    Authors: Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, Jingbo Zhu

    Abstract: Numerous recent works target to extend effective context length for language models and various methods, tasks and benchmarks exist to measure model's effective memorization length. However, through thorough investigations, we find limitations for currently existing evaluations on model's memorization capability. We provide an extensive survey for limitations in this work and propose a new method… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

  26. arXiv:2410.04585  [pdf, other

    cs.CL

    Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval

    Authors: Pengcheng Jiang, Cao Xiao, Minhao Jiang, Parminder Bhatia, Taha Kass-Hout, Jimeng Sun, Jiawei Han

    Abstract: Large language models (LLMs) have demonstrated significant potential in clinical decision support. Yet LLMs still suffer from hallucinations and lack fine-grained contextual medical knowledge, limiting their high-stake healthcare applications such as clinical diagnosis. Traditional retrieval-augmented generation (RAG) methods attempt to address these limitations but frequently retrieve sparse or i… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: under review

  27. arXiv:2410.04385  [pdf, other

    math.NA

    HaTT: Hadamard avoiding TT recompression

    Authors: Zhonghao Sun, Jizu Huang, Chuanfu Xiao, Chao Yang

    Abstract: The Hadamard product of tensor train (TT) tensors is one of the most fundamental nonlinear operations in scientific computing and data analysis. Due to its tendency to significantly increase TT ranks, the Hadamard product presents a major computational challenge in TT tensor-based algorithms. Therefore, it is essential to develop recompression algorithms that mitigate the effects of this rank incr… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 17 pages, 9 figures

  28. arXiv:2410.03440  [pdf, other

    cs.CL cs.AI

    Exploring the Benefit of Activation Sparsity in Pre-training

    Authors: Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin, Yankai Lin, Zhiyuan Zeng, Xu Han, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Jie Zhou

    Abstract: Pre-trained Transformers inherently possess the characteristic of sparse activation, where only a small fraction of the neurons are activated for each token. While sparse activation has been explored through post-training methods, its potential in pre-training remains untapped. In this work, we first study how activation properties change during pre-training. Our examination reveals that Transform… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: ICML 2024

  29. arXiv:2410.02829  [pdf, other

    cs.AI cs.HC cs.LG

    LLMs May Not Be Human-Level Players, But They Can Be Testers: Measuring Game Difficulty with LLM Agents

    Authors: Chang Xiao, Brenda Z. Yang

    Abstract: Recent advances in Large Language Models (LLMs) have demonstrated their potential as autonomous agents across various tasks. One emerging application is the use of LLMs in playing games. In this work, we explore a practical problem for the gaming industry: Can LLMs be used to measure game difficulty? We propose a general game-testing framework using LLM agents and test it on two widely played stra… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  30. arXiv:2410.02108  [pdf, other

    cs.CL

    ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement

    Authors: Xiangyu Peng, Congying Xia, Xinyi Yang, Caiming Xiong, Chien-Sheng Wu, Chen Xing

    Abstract: Post-training Large Language Models (LLMs) with explicit reasoning trajectories can enhance their reasoning abilities. However, acquiring such high-quality trajectory data typically demands meticulous supervision from humans or superior models, which can be either expensive or license-constrained. In this paper, we explore how far an LLM can improve its reasoning by self-synthesizing reasoning pat… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  31. arXiv:2410.01805  [pdf, other

    cs.CL

    Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads

    Authors: Yuxiang Huang, Binhang Yuan, Xu Han, Chaojun Xiao, Zhiyuan Liu

    Abstract: Large language models (LLMs) have shown remarkable advances in supporting long-context comprehension and processing tasks. However, scaling the generation inference of LLMs to such long contexts incurs significant additional computation load, and demands a substantial GPU memory footprint to maintain the key-value (KV) cache of transformer-based LLMs. Existing KV cache compression methods, such as… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: Preprints

  32. arXiv:2410.01701  [pdf, ps, other

    hep-ph hep-ex

    Possible signal of an exotic $I=1$, $J=2$ state in the $B \to D^{*-}D^+K^+$ reaction

    Authors: Wen-Tao Lyu, Man-Yu Duan, Chu-Wen Xiao, En Wang, Eulogio Oset

    Abstract: We study the \proc reaction, showing that a peak in the \dk mass distribution around 2834 MeV reported by LHCb could be associated with a theoretical exotic state with that mass, a width of 19 MeV and $J^P=2^+$, stemming from the interaction of the $D^{*+}K^{*+}$ and $D^{*+}_s ρ^+$ channels, which is a partner of the $0^+$ $T_{c\bar{s}}(2900)$. We show that the data is compatible with this assumpt… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  33. arXiv:2409.19993  [pdf, other

    cs.CR cs.AI cs.CL cs.LG eess.SY

    Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges

    Authors: Qin Liu, Wenjie Mo, Terry Tong, Jiashu Xu, Fei Wang, Chaowei Xiao, Muhao Chen

    Abstract: The advancement of Large Language Models (LLMs) has significantly impacted various domains, including Web search, healthcare, and software development. However, as these models scale, they become more vulnerable to cybersecurity risks, particularly backdoor attacks. By exploiting the potent memorization capacity of LLMs, adversaries can easily inject backdoors into LLMs by manipulating a small por… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: The 60th Annual Allerton Conference (Invited Paper). The arXiv version is a pre-IEEE Press publication version

  34. arXiv:2409.19977  [pdf, other

    cs.LG cs.AI

    Knowledge Graph Embedding by Normalizing Flows

    Authors: Changyi Xiao, Xiangnan He, Yixin Cao

    Abstract: A key to knowledge graph embedding (KGE) is to choose a proper representation space, e.g., point-wise Euclidean space and complex vector space. In this paper, we propose a unified perspective of embedding and introduce uncertainty into KGE from the view of group theory. Our model can incorporate existing models (i.e., generality), ensure the computation is tractable (i.e., efficiency) and enjoy th… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

  35. arXiv:2409.19170  [pdf, other

    cs.RO

    An Interactive Hands-Free Controller for a Riding Ballbot to Enable Simple Shared Control Tasks

    Authors: Chenzhang Xiao, Seung Yun Song, Yu Chen, Mahshid Mansouri, Joao Ramos, William R. Norris, Elizabeth T. Hsiao-Wecksler

    Abstract: Our team developed a riding ballbot (called PURE) that is dynamically stable, omnidirectional, and driven by lean-to-steer control. A hands-free admittance control scheme (HACS) was previously integrated to allow riders with different torso functions to control the robot's movements via torso leaning and twisting. Such an interface requires motor coordination skills and could result in collisions… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  36. arXiv:2409.19091  [pdf, other

    cs.CR

    System-Level Defense against Indirect Prompt Injection Attacks: An Information Flow Control Perspective

    Authors: Fangzhou Wu, Ethan Cecchetti, Chaowei Xiao

    Abstract: Large Language Model-based systems (LLM systems) are information and query processing systems that use LLMs to plan operations from natural-language prompts and feed the output of each successive step into the LLM to plan the next. This structure results in powerful tools that can process complex information from diverse sources but raises critical security concerns. Malicious information from any… ▽ More

    Submitted 10 October, 2024; v1 submitted 27 September, 2024; originally announced September 2024.

    Comments: 23 pages

  37. arXiv:2409.18452  [pdf, other

    cs.RO

    Exploiting Physical Human-Robot Interaction to Provide a Unique Rolling Experience with a Riding Ballbot

    Authors: Chenzhang Xiao, Seung Yun Song, Yu Chen, Mahshid Mansouri, João Ramos, Adam W. Bleakney, William R. Norris, Elizabeth T. Hsiao-Wecksler

    Abstract: This study introduces the development of hands-free control schemes for a riding ballbot, designed to allow riders including manual wheelchair users to control its movement through torso leaning and twisting. The hardware platform, Personal Unique Rolling Experience (PURE), utilizes a ballbot drivetrain, a dynamically stable mobile robot that uses a ball as its wheel to provide omnidirectional man… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  38. arXiv:2409.18093  [pdf

    cond-mat.soft

    Identifying Bridges from Asymmetric Load-Bearing Structures in Tapped Granular Packings

    Authors: Chijin Zhou, Shuyang Zhang, Xueliang Dai, Yixin Cao, Ye Yuan, Chengjie Xia, Zhikun Zeng, Yujie Wang

    Abstract: Using high-resolution x-ray tomography, we experimentally investigate the bridge structures in tapped granular packings composed of particles with varying friction coefficients. We find that gravity can induce subtle structural changes on the load-bearing contacts, allowing us to identify the correct load-bearing contacts based on structural information alone. Using these identified load-bearing c… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 21 pages, 5 figures

  39. arXiv:2409.17504  [pdf, other

    cs.LG cs.CL

    HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection

    Authors: Xuefeng Du, Chaowei Xiao, Yixuan Li

    Abstract: The surge in applications of large language models (LLMs) has prompted concerns about the generation of misleading or fabricated information, known as hallucinations. Therefore, detecting hallucinations has become critical to maintaining trust in LLM-generated content. A primary challenge in learning a truthfulness classifier is the lack of a large amount of labeled truthful and hallucinated data.… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: NeurIPS 2024 Spotlight

  40. arXiv:2409.14955  [pdf, other

    cs.RO

    Efficient Collision Detection Framework for Enhancing Collision-Free Robot Motion

    Authors: Xiankun Zhu, Yucheng Xin, Shoujie Li, Houde Liu, Chongkun Xia, Bin Liang

    Abstract: Fast and efficient collision detection is essential for motion generation in robotics. In this paper, we propose an efficient collision detection framework based on the Signed Distance Field (SDF) of robots, seamlessly integrated with a self-collision detection module. Firstly, we decompose the robot's SDF using forward kinematics and leverage multiple extremely lightweight networks in parallel to… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  41. arXiv:2409.14775  [pdf, other

    cs.RO

    Like a Martial Arts Dodge: Safe Expeditious Whole-Body Control of Mobile Manipulators for Collision Avoidance

    Authors: Bingjie Chen, Houde Liu, Chongkun Xia, Liang Han, Xueqian Wang, Bin Liang

    Abstract: In the control task of mobile manipulators(MM), achieving efficient and agile obstacle avoidance in dynamic environments is challenging. In this letter, we present a safe expeditious whole-body(SEWB) control for MMs that ensures both external and internal collision-free. SEWB is constructed by a two-layer optimization structure. Firstly, control barrier functions(CBFs) are employed for a MM to est… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  42. arXiv:2409.14754  [pdf, other

    cs.RO

    CushionCatch: Compliant Catching Mechanism for Mobile Manipulators via Combined Optimization and Learning

    Authors: Bingjie Chen, Keyu Fan, Houde Liu, Chongkun Xia, Liang Han, Bin Liang

    Abstract: This paper presents a framework to achieve compliant catching with cushioning mechanism(CCCM) for mobile manipulators. First, we introduce a two-level motion optimization scheme, comprising a high-level capture planner and a low-level joint planner. The low-level joint planner consists of two distinct components: Pre-Catching (PRC) planner and Post-Catching (POC) planner. Next, we propose a networ… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  43. arXiv:2409.14364  [pdf, other

    cs.CL

    More Effective LLM Compressed Tokens with Uniformly Spread Position Identifiers and Compression Loss

    Authors: Runsong Zhao, Pengcheng Huang, Xinyu Liu, Chunyang Xiao, Tong Xiao, Jingbo Zhu

    Abstract: Compressing Transformer inputs into compressd tokens allows running LLMs with improved speed and cost efficiency. Based on the compression method ICAE, we carefully examine the position identifier choices for compressed tokens and also propose a new compression loss. We demonstrate empirically that our proposed methods achieve significantly higher compression ratios (15x compared to 4x for ICAE),… ▽ More

    Submitted 27 September, 2024; v1 submitted 22 September, 2024; originally announced September 2024.

  44. arXiv:2409.13853  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Unlocking Memorization in Large Language Models with Dynamic Soft Prompting

    Authors: Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang

    Abstract: Pretrained large language models (LLMs) have revolutionized natural language processing (NLP) tasks such as summarization, question answering, and translation. However, LLMs pose significant security risks due to their tendency to memorize training data, leading to potential privacy breaches and copyright infringement. Accurate measurement of this memorization is essential to evaluate and mitigate… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  45. arXiv:2409.12489  [pdf, other

    nucl-th hep-ph

    Mixed phases of compact star matter in a unified mean-field approach

    Authors: Cheng-Jun Xia, Toshiki Maruyama, Nobutoshi Yasutake, Toshitaka Tatsumi

    Abstract: Based on an extended NJL model that treats baryons as clusters of quarks, we investigate the properties and microscopic structures of mixed phases for various types of first-order phase transitions in a unified manner, where the model parameters are fixed by reproducing nuclear matter properties and the binding energies of finite nuclei. In particular, based on the Thomas-Fermi approximation, we i… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

  46. arXiv:2409.12455  [pdf, other

    cs.RO

    MuxHand: A Cable-driven Dexterous Robotic Hand Using Time-division Multiplexing Motors

    Authors: Jianle Xu, Shoujie Li, Hong Luo, Houde Liu, Xueqian Wang, Wenbo Ding, Chongkun Xia

    Abstract: The robotic dexterous hand is responsible for both grasping and dexterous manipulation. The number of motors directly influences both the dexterity and the cost of such systems. In this paper, we present MuxHand, a robotic hand that employs a time-division multiplexing motor (TDMM) mechanism. This system allows 9 cables to be independently controlled by just 4 motors, significantly reducing cost w… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 7 pages

  47. arXiv:2409.11295  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage

    Authors: Zeyi Liao, Lingbo Mo, Chejian Xu, Mintong Kang, Jiawei Zhang, Chaowei Xiao, Yuan Tian, Bo Li, Huan Sun

    Abstract: Generalist web agents have demonstrated remarkable potential in autonomously completing a wide range of tasks on real websites, significantly boosting human productivity. However, web tasks, such as booking flights, usually involve users' PII, which may be exposed to potential privacy risks if web agents accidentally interact with compromised websites, a scenario that remains largely unexplored in… ▽ More

    Submitted 3 October, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: 29 pages

  48. arXiv:2409.09669  [pdf, other

    cond-mat.mes-hall

    Intrinsic Dynamic Generation of Spin Polarization by Time-Varying Electric Field

    Authors: Xukun Feng, Jin Cao, Zhi-Fan Zhang, Lay Kee Ang, Shen Lai, Hua Jiang, Cong Xiao, Shengyuan A. Yang

    Abstract: Electric control of spin in insulators is desired for low-consumption and ultrafast spintronics, but the underlying mechanism remains largely unexplored. Here, we propose an intrinsic effect of dynamic spin generation driven by time-varying electric field. In the intraband response regime, it can be nicely formulated as a Berry curvature effect and leads to two phenomena that are forbidden in the… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

  49. arXiv:2409.08615  [pdf, other

    cs.GR

    DrawingSpinUp: 3D Animation from Single Character Drawings

    Authors: Jie Zhou, Chufeng Xiao, Miu-Ling Lam, Hongbo Fu

    Abstract: Animating various character drawings is an engaging visual content creation task. Given a single character drawing, existing animation methods are limited to flat 2D motions and thus lack 3D effects. An alternative solution is to reconstruct a 3D model from a character drawing as a proxy and then retarget 3D motion data onto it. However, the existing image-to-3D methods could not work well for ama… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 10 pages, 15 figures

  50. arXiv:2409.08475  [pdf, other

    cs.CV

    RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision

    Authors: Shuo Wang, Chunlong Xia, Feng Lv, Yifeng Shi

    Abstract: RT-DETR is the first real-time end-to-end transformer-based object detector. Its efficiency comes from the framework design and the Hungarian matching. However, compared to dense supervision detectors like the YOLO series, the Hungarian matching provides much sparser supervision, leading to insufficient model training and difficult to achieve optimal results. To address these issues, we proposed a… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.