Skip to main content

Showing 1–50 of 299 results for author: Xing, X

  1. arXiv:2410.14894  [pdf, other

    cs.AI cs.CR cs.LG

    Soft-Label Integration for Robust Toxicity Classification

    Authors: Zelei Cheng, Xian Wu, Jiahao Yu, Shuo Han, Xin-Qiang Cai, Xinyu Xing

    Abstract: Toxicity classification in textual content remains a significant problem. Data with labels from a single annotator fall short of capturing the diversity of human perspectives. Therefore, there is a growing need to incorporate crowdsourced annotations for training an effective toxicity classifier. Additionally, the standard approach to training a classifier using empirical risk minimization (ERM) m… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: Accepted by Neurips 24

  2. arXiv:2410.13823  [pdf, other

    cs.CV

    Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning

    Authors: Xiaodan Xing, Junzhi Ning, Yang Nan, Guang Yang

    Abstract: Deep generative models have significantly advanced medical imaging analysis by enhancing dataset size and quality. Beyond mere data augmentation, our research in this paper highlights an additional, significant capacity of deep generative models: their ability to reveal and demonstrate patterns in medical images. We employ a generative structure with hybrid conditions, combining clinical data and… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: Accepted by AIM-FM Workshop of NeurIPS2024

  3. arXiv:2410.12318  [pdf, other

    cs.CR cs.AI

    UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification

    Authors: Jiacheng Cai, Jiahao Yu, Yangguang Shao, Yuhang Wu, Xinyu Xing

    Abstract: Fingerprinting large language models (LLMs) is essential for verifying model ownership, ensuring authenticity, and preventing misuse. Traditional fingerprinting methods often require significant computational overhead or white-box verification access. In this paper, we introduce UTF, a novel and efficient approach to fingerprinting LLMs by leveraging under-trained tokens. Under-trained tokens are… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  4. arXiv:2410.10551  [pdf, other

    eess.IV cs.CV

    Preserving Cardiac Integrity: A Topology-Infused Approach to Whole Heart Segmentation

    Authors: Chenyu Zhang, Wenxue Guan, Xiaodan Xing, Guang Yang

    Abstract: Whole heart segmentation (WHS) supports cardiovascular disease (CVD) diagnosis, disease monitoring, treatment planning, and prognosis. Deep learning has become the most widely used method for WHS applications in recent years. However, segmentation of whole-heart structures faces numerous challenges including heart shape variability during the cardiac cycle, clinical artifacts like motion and poor… ▽ More

    Submitted 17 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

  5. arXiv:2410.04039  [pdf, other

    cs.CR cs.AI

    BlockFound: Customized blockchain foundation model for anomaly detection

    Authors: Jiahao Yu, Xian Wu, Hao Liu, Wenbo Guo, Xinyu Xing

    Abstract: We propose BlockFound, a customized foundation model for anomaly blockchain transaction detection. Unlike existing methods that rely on rule-based systems or directly apply off-the-shelf large language models, BlockFound introduces a series of customized designs to model the unique data structure of blockchain transactions. First, a blockchain transaction is multi-modal, containing blockchain-spec… ▽ More

    Submitted 18 October, 2024; v1 submitted 5 October, 2024; originally announced October 2024.

  6. arXiv:2410.03743  [pdf, other

    cs.CL cs.AI cs.LG

    Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging

    Authors: Yiming Ju, Ziyi Ni, Xingrun Xing, Zhixiong Zeng, hanyu Zhao, Siqi Fan, Zheng Zhang

    Abstract: Supervised fine-tuning (SFT) is crucial for adapting Large Language Models (LLMs) to specific tasks. In this work, we demonstrate that the order of training data can lead to significant training imbalances, potentially resulting in performance degradation. Consequently, we propose to mitigate this imbalance by merging SFT models fine-tuned with different data orders, thereby enhancing the overall… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: EMNLP 2024

  7. arXiv:2410.02155  [pdf, other

    cs.AI cs.CL cs.CV

    From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities

    Authors: Wanpeng Zhang, Zilong Xie, Yicheng Feng, Yijiang Li, Xingrun Xing, Sipeng Zheng, Zongqing Lu

    Abstract: Multimodal Large Language Models have made significant strides in integrating visual and textual information, yet they often struggle with effectively aligning these modalities. We introduce a novel image tokenizer that bridges this gap by applying the principle of Byte-Pair Encoding (BPE) to visual data. Unlike conventional approaches that rely on separate visual encoders, our method directly inc… ▽ More

    Submitted 4 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

  8. arXiv:2410.02010  [pdf, other

    eess.IV cs.CV

    MONICA: Benchmarking on Long-tailed Medical Image Classification

    Authors: Lie Ju, Siyuan Yan, Yukun Zhou, Yang Nan, Xiaodan Xing, Peibo Duan, Zongyuan Ge

    Abstract: Long-tailed learning is considered to be an extremely challenging problem in data imbalance learning. It aims to train well-generalized models from a large number of images that follow a long-tailed class distribution. In the medical field, many diagnostic imaging exams such as dermoscopy and chest radiography yield a long-tailed distribution of complex clinical findings. Recently, long-tailed lea… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  9. arXiv:2410.00390  [pdf, other

    eess.AS

    Multi-Scale Temporal Transformer For Speech Emotion Recognition

    Authors: Zhipeng Li, Xiaofen Xing, Yuanbo Fang, Weibin Zhang, Hengsheng Fan, Xiangmin Xu

    Abstract: Speech emotion recognition plays a crucial role in human-machine interaction systems. Recently various optimized Transformers have been successfully applied to speech emotion recognition. However, the existing Transformer architectures focus more on global information and require large computation. On the other hand, abundant speech emotional representations exist locally on different parts of the… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  10. arXiv:2409.14729  [pdf, other

    cs.CR cs.AI

    PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs

    Authors: Jiahao Yu, Yangguang Shao, Hanwen Miao, Junzheng Shi, Xinyu Xing

    Abstract: Large Language Models (LLMs) have gained widespread use in various applications due to their powerful capability to generate human-like text. However, prompt injection attacks, which involve overwriting a model's original instructions with malicious prompts to manipulate the generated text, have raised significant concerns about the security and reliability of LLMs. Ensuring that LLMs are robust a… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  11. arXiv:2409.14113  [pdf, other

    eess.IV cs.CV

    Accelerated Multi-Contrast MRI Reconstruction via Frequency and Spatial Mutual Learning

    Authors: Qi Chen, Xiaohan Xing, Zhen Chen, Zhiwei Xiong

    Abstract: To accelerate Magnetic Resonance (MR) imaging procedures, Multi-Contrast MR Reconstruction (MCMR) has become a prevalent trend that utilizes an easily obtainable modality as an auxiliary to support high-quality reconstruction of the target modality with under-sampled k-space measurements. The exploration of global dependency and complementary information across different modalities is essential fo… ▽ More

    Submitted 21 September, 2024; originally announced September 2024.

    Comments: Accepted as a poster by Medical Image Computing and Computer Assisted Intervention (MICCAI) 2024

  12. arXiv:2409.11340  [pdf, other

    cs.CV cs.AI

    OmniGen: Unified Image Generation

    Authors: Shitao Xiao, Yueze Wang, Junjie Zhou, Huaying Yuan, Xingrun Xing, Ruiran Yan, Shuting Wang, Tiejun Huang, Zheng Liu

    Abstract: In this work, we introduce OmniGen, a new diffusion model for unified image generation. Unlike popular diffusion models (e.g., Stable Diffusion), OmniGen no longer requires additional modules such as ControlNet or IP-Adapter to process diverse control conditions. OmniGenis characterized by the following features: 1) Unification: OmniGen not only demonstrates text-to-image generation capabilities b… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

  13. arXiv:2409.05730  [pdf, other

    eess.AS

    AS-Speech: Adaptive Style For Speech Synthesis

    Authors: Zhipeng Li, Xiaofen Xing, Jun Wang, Shuaiqi Chen, Guoqiao Yu, Guanglu Wan, Xiangmin Xu

    Abstract: In recent years, there has been significant progress in Text-to-Speech (TTS) synthesis technology, enabling the high-quality synthesis of voices in common scenarios. In unseen situations, adaptive TTS requires a strong generalization capability to speaker style characteristics. However, the existing adaptive methods can only extract and integrate coarse-grained timbre or mixed rhythm attributes se… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: Accepted by SLT 2024

  14. arXiv:2409.03087  [pdf, other

    eess.IV cs.CV

    Coupling AI and Citizen Science in Creation of Enhanced Training Dataset for Medical Image Segmentation

    Authors: Amir Syahmi, Xiangrong Lu, Yinxuan Li, Haoxuan Yao, Hanjun Jiang, Ishita Acharya, Shiyi Wang, Yang Nan, Xiaodan Xing, Guang Yang

    Abstract: Recent advancements in medical imaging and artificial intelligence (AI) have greatly enhanced diagnostic capabilities, but the development of effective deep learning (DL) models is still constrained by the lack of high-quality annotated datasets. The traditional manual annotation process by medical experts is time- and resource-intensive, limiting the scalability of these datasets. In this work, w… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  15. arXiv:2408.13156  [pdf

    physics.space-ph

    Ultrafast measurement of field-particle energy transfer during chorus emissions in space

    Authors: C. M. Liu, B. N. Zhao, J. B. Cao, C. J. Pollock, C. T. Russell, Y. Y. Liu, X. N. Xing, P. A. Linqvist, J. L. Burch

    Abstract: Chorus is one of the strongest electromagnetic emissions naturally occurring in space, and can cause hazardous radiations to humans and satellites1-3. Although chorus has attracted extreme interest and been intensively studied for decades4-7, its generation and evolution remain highly debated, due to the complexity of the underlying physics and the limited capacity of previous spacecraft missions7… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: under review; comments and suggestions are welcomed

  16. A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification

    Authors: Xujiang Xing, Mingxing Xu, Thomas Fang Zheng

    Abstract: Automatic Speaker Verification (ASV) suffers from performance degradation in noisy conditions. To address this issue, we propose a novel adversarial learning framework that incorporates noise-disentanglement to establish a noise-independent speaker invariant embedding space. Specifically, the disentanglement module includes two encoders for separating speaker related and irrelevant information, re… ▽ More

    Submitted 22 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: 5 pages, accepted by Interspeech2024

    Report number: 707-711

    Journal ref: Interspeech2024

  17. arXiv:2408.09859  [pdf, other

    cs.CV

    OccMamba: Semantic Occupancy Prediction with State Space Models

    Authors: Heng Li, Yuenan Hou, Xiaohan Xing, Xiao Sun, Yanyong Zhang

    Abstract: Training deep learning models for semantic occupancy prediction is challenging due to factors such as a large number of occupancy cells, severe occlusion, limited visual cues, complicated driving scenarios, etc. Recent methods often adopt transformer-based architectures given their strong capability in learning input-conditioned weights and long-range relationships. However, transformer-based netw… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 figures

  18. arXiv:2408.08704  [pdf, other

    cs.CV cs.AI

    Beyond the Hype: A dispassionate look at vision-language models in medical scenario

    Authors: Yang Nan, Huichi Zhou, Xiaodan Xing, Guang Yang

    Abstract: Recent advancements in Large Vision-Language Models (LVLMs) have demonstrated remarkable capabilities across diverse tasks, garnering significant attention in AI communities. However, their performance and reliability in specialized domains such as medicine remain insufficiently assessed. In particular, most assessments over-concentrate in evaluating VLMs based on simple Visual Question Answering… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 10 pages

  19. arXiv:2408.08527  [pdf, other

    cs.CV cs.AI

    Focus on Focus: Focus-oriented Representation Learning and Multi-view Cross-modal Alignment for Glioma Grading

    Authors: Li Pan, Yupei Zhang, Qiushi Yang, Tan Li, Xiaohan Xing, Maximus C. F. Yeung, Zhen Chen

    Abstract: Recently, multimodal deep learning, which integrates histopathology slides and molecular biomarkers, has achieved a promising performance in glioma grading. Despite great progress, due to the intra-modality complexity and inter-modality heterogeneity, existing studies suffer from inadequate histopathology representation learning and inefficient molecular-pathology knowledge alignment. These two is… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  20. arXiv:2408.00940  [pdf, other

    eess.IV cs.CV

    A dual-task mutual learning framework for predicting post-thrombectomy cerebral hemorrhage

    Authors: Caiwen Jiang, Tianyu Wang, Xiaodan Xing, Mianxin Liu, Guang Yang, Zhongxiang Ding, Dinggang Shen

    Abstract: Ischemic stroke is a severe condition caused by the blockage of brain blood vessels, and can lead to the death of brain tissue due to oxygen deprivation. Thrombectomy has become a common treatment choice for ischemic stroke due to its immediate effectiveness. But, it carries the risk of postoperative cerebral hemorrhage. Clinically, multiple CT scans within 0-72 hours post-surgery are used to moni… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  21. arXiv:2408.00938  [pdf, other

    eess.IV cs.AI cs.CV

    CIResDiff: A Clinically-Informed Residual Diffusion Model for Predicting Idiopathic Pulmonary Fibrosis Progression

    Authors: Caiwen Jiang, Xiaodan Xing, Zaixin Ou, Mianxin Liu, Walsh Simon, Guang Yang, Dinggang Shen

    Abstract: The progression of Idiopathic Pulmonary Fibrosis (IPF) significantly correlates with higher patient mortality rates. Early detection of IPF progression is critical for initiating timely treatment, which can effectively slow down the advancement of the disease. However, the current clinical criteria define disease progression requiring two CT scans with a one-year interval, presenting a dilemma: a… ▽ More

    Submitted 5 August, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

  22. arXiv:2407.20785  [pdf, other

    cs.CV

    Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory

    Authors: Xiaoyan Xing, Vincent Tao Hu, Jan Hendrik Metzen, Konrad Groh, Sezer Karaoglu, Theo Gevers

    Abstract: This paper introduces a novel approach to illumination manipulation in diffusion models, addressing the gap in conditional image generation with a focus on lighting conditions. We conceptualize the diffusion model as a black-box image render and strategically decompose its energy function in alignment with the image formation model. Our method effectively separates and controls illumination-relate… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  23. arXiv:2407.17882  [pdf, other

    eess.IV

    Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion

    Authors: Xiaodan Xing, Chunling Tang, Siofra Murdoch, Giorgos Papanastasiou, Yunzhe Guo, Xianglu Xiao, Jan Cross-Zamirski, Carola-Bibiane Schönlieb, Kristina Xiao Liang, Zhangming Niu, Evandro Fei Fang, Yinhai Wang, Guang Yang

    Abstract: Immunofluorescent (IF) imaging is crucial for visualizing biomarker expressions, cell morphology and assessing the effects of drug treatments on sub-cellular components. IF imaging needs extra staining process and often requiring cell fixation, therefore it may also introduce artefects and alter endogenouous cell morphology. Some IF stains are expensive or not readily available hence hindering exp… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  24. arXiv:2407.09672  [pdf, other

    cs.CV

    Mixed-View Panorama Synthesis using Geospatially Guided Diffusion

    Authors: Zhexiao Xiong, Xin Xing, Scott Workman, Subash Khanal, Nathan Jacobs

    Abstract: We introduce the task of mixed-view panorama synthesis, where the goal is to synthesize a novel panorama given a small set of input panoramas and a satellite image of the area. This contrasts with previous work which only uses input panoramas (same-view synthesis), or an input satellite image (cross-view synthesis). We argue that the mixed-view setting is the most natural to support panorama synth… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  25. arXiv:2407.09507  [pdf, other

    eess.IV

    Can Generative AI Replace Immunofluorescent Staining Processes? A Comparison Study of Synthetically Generated CellPainting Images from Brightfield

    Authors: Xiaodan Xing, Siofra Murdoch, Chunling Tang, Giorgos Papanastasiou, Jan Cross-Zamirski, Yunzhe Guo, Xianglu Xiao, Carola-Bibiane Schönlieb, Yinhai Wang, Guang Yang

    Abstract: Cell imaging assays utilizing fluorescence stains are essential for observing sub-cellular organelles and their responses to perturbations. Immunofluorescent staining process is routinely in labs, however the recent innovations in generative AI is challenging the idea of IF staining are required. This is especially true when the availability and cost of specific fluorescence dyes is a problem to s… ▽ More

    Submitted 16 July, 2024; v1 submitted 15 June, 2024; originally announced July 2024.

  26. arXiv:2407.05355  [pdf, other

    cs.CV cs.CL

    VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool

    Authors: Yan Wang, Yawen Zeng, Jingsheng Zheng, Xiaofen Xing, Jin Xu, Xiangmin Xu

    Abstract: Multimodal large language models (MLLMs) are flourishing, but mainly focus on images with less attention than videos, especially in sub-fields such as prompt engineering, video chain-of-thought (CoT), and instruction tuning on videos. Therefore, we try to explore the collection of CoT datasets in videos to lead to video OpenQA and improve the reasoning ability of MLLMs. Unfortunately, making such… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: ACL 2024 Workshop

  27. arXiv:2407.04752  [pdf, other

    cs.LG cs.CL cs.NE

    SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

    Authors: Xingrun Xing, Boyan Gao, Zheng Zhang, David A. Clifton, Shitao Xiao, Li Du, Guoqi Li, Jiajun Zhang

    Abstract: The recent advancements in large language models (LLMs) with billions of parameters have significantly boosted their performance across various real-world applications. However, the inference processes for these models require substantial energy and computational resources, presenting considerable deployment challenges. In contrast, human brains, which contain approximately 86 billion biological n… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  28. arXiv:2407.03542  [pdf

    eess.IV cs.CV cs.LG

    Probing Perfection: The Relentless Art of Meddling for Pulmonary Airway Segmentation from HRCT via a Human-AI Collaboration Based Active Learning Method

    Authors: Shiyi Wang, Yang Nan, Sheng Zhang, Federico Felder, Xiaodan Xing, Yingying Fang, Javier Del Ser, Simon L F Walsh, Guang Yang

    Abstract: In pulmonary tracheal segmentation, the scarcity of annotated data is a prevalent issue in medical segmentation. Additionally, Deep Learning (DL) methods face challenges: the opacity of 'black box' models and the need for performance enhancement. Our Human-Computer Interaction (HCI) based models (RS_UNet, LC_UNet, UUNet, and WD_UNet) address these challenges by combining diverse query strategies w… ▽ More

    Submitted 23 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

  29. arXiv:2407.02751  [pdf, other

    cs.CL cs.AI

    Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset

    Authors: Rui Liu, Haolin Zuo, Zheng Lian, Xiaofen Xing, Björn W. Schuller, Haizhou Li

    Abstract: Emotion and Intent Joint Understanding in Multimodal Conversation (MC-EIU) aims to decode the semantic information manifested in a multimodal conversational history, while inferring the emotions and intents simultaneously for the current utterance. MC-EIU is enabling technology for many human-computer interfaces. However, there is a lack of available datasets in terms of annotation, modality, lang… ▽ More

    Submitted 4 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 26 pages, 8 figures, 12 tables, NeurIPS 2024 Dataset and Benchmark Track

  30. arXiv:2407.01358  [pdf, other

    cs.CL

    Evaluating Knowledge-based Cross-lingual Inconsistency in Large Language Models

    Authors: Xiaolin Xing, Zhiwei He, Haoyu Xu, Xing Wang, Rui Wang, Yu Hong

    Abstract: This paper investigates the cross-lingual inconsistencies observed in Large Language Models (LLMs), such as ChatGPT, Llama, and Baichuan, which have shown exceptional performance in various Natural Language Processing (NLP) tasks. Despite their successes, these models often exhibit significant inconsistencies when processing the same concepts across different languages. This study focuses on three… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  31. arXiv:2406.18950  [pdf, other

    eess.IV cs.CV

    MMR-Mamba: Multi-Modal MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion

    Authors: Jing Zou, Lanqing Liu, Qi Chen, Shujun Wang, Zhanli Hu, Xiaohan Xing, Jing Qin

    Abstract: Multi-modal MRI offers valuable complementary information for diagnosis and treatment; however, its utility is limited by prolonged scanning times. To accelerate the acquisition process, a practical approach is to reconstruct images of the target modality, which requires longer scanning times, from under-sampled k-space data using the fully-sampled reference modality with shorter scanning times as… ▽ More

    Submitted 7 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figure

  32. arXiv:2406.18552  [pdf, other

    cs.CV cs.AI

    Decoding Decision Reasoning: A Counterfactual-Powered Model for Knowledge Discovery

    Authors: Yingying Fang, Zihao Jin, Xiaodan Xing, Simon Walsh, Guang Yang

    Abstract: In medical imaging, particularly in early disease detection and prognosis tasks, discerning the rationale behind an AI model's predictions is crucial for evaluating the reliability of its decisions. Conventional explanation methods face challenges in identifying discernible decisive features in medical image classifications, where discriminative features are subtle or not immediately apparent. To… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

  33. arXiv:2406.16189  [pdf, other

    eess.IV cs.CV

    Fuzzy Attention-based Border Rendering Network for Lung Organ Segmentation

    Authors: Sheng Zhang, Yang Nan, Yingying Fang, Shiyi Wang, Xiaodan Xing, Zhifan Gao, Guang Yang

    Abstract: Automatic lung organ segmentation on CT images is crucial for lung disease diagnosis. However, the unlimited voxel values and class imbalance of lung organs can lead to false-negative/positive and leakage issues in advanced methods. Additionally, some slender lung organs are easily lost during the recycled down/up-sample procedure, e.g., bronchioles & arterioles, causing severe discontinuity issue… ▽ More

    Submitted 1 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024

  34. arXiv:2406.16017  [pdf, other

    quant-ph physics.atom-ph

    Competing excitation quenching and charge exchange in ultracold Li-Ba$^+$ collisions

    Authors: Xiaodong Xing, Pascal Weckesser, Fabian Thielemann, Tibor Jónás, Romain Vexiau, Nadia Bouloufa-Maafa, Eliane Luc-Koenig, Kirk W. Madison, Andrea Orbán, Ting Xie, Tobias Schaetz, Olivier Dulieu

    Abstract: Hybrid atom-ion systems are a rich and powerful platform for studying chemical reactions, as they feature both excellent control over the electronic state preparation and readout as well as a versatile tunability over the scattering energy, ranging from the few-partial wave regime to the quantum regime. In this work, we make use of these excellent control knobs, and present a joint experimental an… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 17 pages, 15 figures, 4 tables

  35. arXiv:2406.07006  [pdf, other

    cs.CV

    MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results

    Authors: Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan , et al. (17 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Few-shot RAWImage Denoising Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  36. arXiv:2406.03287  [pdf, other

    cs.NE cs.CL cs.LG

    SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

    Authors: Xingrun Xing, Zheng Zhang, Ziyi Ni, Shitao Xiao, Yiming Ju, Siqi Fan, Yequan Wang, Jiajun Zhang, Guoqi Li

    Abstract: Towards energy-efficient artificial intelligence similar to the human brain, the bio-inspired spiking neural networks (SNNs) have advantages of biological plausibility, event-driven sparsity, and binary activation. Recently, large-scale language models exhibit promising generalization capability, making it a valuable issue to explore more general spike-driven models. However, the binary spikes in… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  37. arXiv:2406.02737  [pdf, other

    cs.CR cs.SE

    CAMP: Compiler and Allocator-based Heap Memory Protection

    Authors: Zhenpeng Lin, Zheng Yu, Ziyi Guo, Simone Campanoni, Peter Dinda, Xinyu Xing

    Abstract: The heap is a critical and widely used component of many applications. Due to its dynamic nature, combined with the complexity of heap management algorithms, it is also a frequent target for security exploits. To enhance the heap's security, various heap protection techniques have been introduced, but they either introduce significant runtime overhead or have limited protection. We present CAMP,… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  38. arXiv:2406.02624  [pdf, other

    cs.CR cs.SE

    Take a Step Further: Understanding Page Spray in Linux Kernel Exploitation

    Authors: Ziyi Guo, Dang K Le, Zhenpeng Lin, Kyle Zeng, Ruoyu Wang, Tiffany Bao, Yan Shoshitaishvili, Adam Doupé, Xinyu Xing

    Abstract: Recently, a novel method known as Page Spray emerges, focusing on page-level exploitation for kernel vulnerabilities. Despite the advantages it offers in terms of exploitability, stability, and compatibility, comprehensive research on Page Spray remains scarce. Questions regarding its root causes, exploitation model, comparative benefits over other exploitation techniques, and possible mitigation… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  39. arXiv:2406.02220  [pdf, other

    cond-mat.stat-mech cond-mat.mes-hall cond-mat.soft

    Stochastic Thermodynamics of Micromagnetics with Spin Torque

    Authors: Mingnan Ding, Jun Wu, Xiangjun Xing

    Abstract: In this work, we study the stochastic dynamics of micro-magnetics interacting with a spin-current torque. We extend the previously constructed stochastic Landau-Lifshitz equation to the case with spin-current torque, and verify the conditions of detailed balance. Then we construct various thermodynamics quantities such as work and heat, and prove the second law of thermodynamics. Due to the existe… ▽ More

    Submitted 5 August, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 7 pages. arXiv admin note: text overlap with arXiv:2404.13612

  40. arXiv:2406.02023  [pdf, other

    cs.CR

    ShadowBound: Efficient Heap Memory Protection Through Advanced Metadata Management and Customized Compiler Optimization

    Authors: Zheng Yu, Ganxiang Yang, Xinyu Xing

    Abstract: In software development, the prevalence of unsafe languages such as C and C++ introduces potential vulnerabilities, especially within the heap, a pivotal component for dynamic memory allocation. Despite its significance, heap management complexities have made heap corruption pervasive, posing severe threats to system security. While prior solutions aiming for temporal and spatial memory safety exh… ▽ More

    Submitted 23 September, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by USENIX Security 2024

  41. arXiv:2406.01514  [pdf, other

    cs.CL cs.AI cs.CR

    Decoupled Alignment for Robust Plug-and-Play Adaptation

    Authors: Haozheng Luo, Jiahao Yu, Wenxin Zhang, Jialong Li, Jerry Yao-Chieh Hu, Xinyu Xing, Han Liu

    Abstract: We introduce a low-resource safety enhancement method for aligning large language models (LLMs) without the need for supervised fine-tuning (SFT) or reinforcement learning from human feedback (RLHF). Our main idea is to exploit knowledge distillation to extract the alignment information from existing well-aligned LLMs and integrate it into unaligned LLMs in a plug-and-play fashion. Methodology, we… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  42. arXiv:2405.20653  [pdf, other

    cs.AI

    Enhancing Jailbreak Attack Against Large Language Models through Silent Tokens

    Authors: Jiahao Yu, Haozheng Luo, Jerry Yao-Chieh Hu, Wenbo Guo, Han Liu, Xinyu Xing

    Abstract: Along with the remarkable successes of Language language models, recent research also started to explore the security threats of LLMs, including jailbreaking attacks. Attackers carefully craft jailbreaking prompts such that a target LLM will respond to the harmful question. Existing jailbreaking attacks require either human experts or leveraging complicated algorithms to craft jailbreaking prompts… ▽ More

    Submitted 4 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  43. arXiv:2405.09597  [pdf, other

    cs.LG cs.AI

    When AI Eats Itself: On the Caveats of Data Pollution in the Era of Generative AI

    Authors: Xiaodan Xing, Fadong Shi, Jiahao Huang, Yinzhe Wu, Yang Nan, Sheng Zhang, Yingying Fang, Mike Roberts, Carola-Bibiane Schönlieb, Javier Del Ser, Guang Yang

    Abstract: Generative artificial intelligence (AI) technologies and large models are producing realistic outputs across various domains, such as images, text, speech, and music. Creating these advanced generative models requires significant resources, particularly large and high-quality datasets. To minimize training expenses, many algorithm developers use data created by the models themselves as a cost-effe… ▽ More

    Submitted 25 July, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  44. arXiv:2405.03064  [pdf, other

    cs.LG cs.AI cs.CR

    RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

    Authors: Zelei Cheng, Xian Wu, Jiahao Yu, Sabrina Yang, Gang Wang, Xinyu Xing

    Abstract: Deep reinforcement learning (DRL) is playing an increasingly important role in real-world applications. However, obtaining an optimally performing DRL agent for complex tasks, especially with sparse rewards, remains a significant challenge. The training of a DRL agent can be often trapped in a bottleneck without further progress. In this paper, we propose RICE, an innovative refining scheme for re… ▽ More

    Submitted 5 June, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  45. arXiv:2405.02962  [pdf, other

    cs.CV

    VectorPainter: A Novel Approach to Stylized Vector Graphics Synthesis with Vectorized Strokes

    Authors: Juncheng Hu, Ximing Xing, Zhengqi Zhang, Jing Zhang, Qian Yu

    Abstract: We propose a novel method, VectorPainter, for the task of stylized vector graphics synthesis. Given a text prompt and a reference style image, VectorPainter generates a vector graphic that aligns in content with the text prompt and remains faithful in style to the reference image. We recognize that the key to this task lies in fully leveraging the intrinsic properties of vector graphics. Innovativ… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  46. arXiv:2404.13845  [pdf, other

    cond-mat.stat-mech

    Stochastic thermodynamics of Brownian motion in a flowing fluid

    Authors: Jun Wu, Mingnan Ding, Xiangjun Xing

    Abstract: We study stochastic thermodynamics of over-damped Brownian motion in a flowing fluid. Unlike some previous works, we treat the effects of the flow field as a non-conservational driving force acting on the Brownian particle. This allows us to apply the theoretical formalism developed in a recent work for general non-conservative Langevin dynamics. We define heat and work both at the trajectory leve… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 20 pages, 13 figures

  47. arXiv:2404.13612  [pdf, other

    cond-mat.stat-mech cond-mat.mes-hall cond-mat.soft

    Stochastic Thermodynamics of Micromagnetics

    Authors: Mingnan Ding, Jun Wu, Xiangjun Xing

    Abstract: In this work, we study the stochastic thermodynamics of micro-magnetic systems. We first formulate the stochastic dynamics of micro-magnetic systems by incorporating noises into Landau-Lifshitz (LL) equation, which describes the irreversible and deterministic dynamics of magnetic moments. The resulting stochastic Landau-Lifshitz (sLL) equation obeys detailed balance, which guarantees that, with th… ▽ More

    Submitted 4 August, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: 8 pages

  48. arXiv:2404.10249  [pdf

    cond-mat.mtrl-sci physics.chem-ph

    Picturing the Gap Between the Performance and US-DOE's Hydrogen Storage Target: A Data-Driven Model for MgH2 Dehydrogenation

    Authors: Chaoqun Li, Weijie Yang, Hao Liu, Xinyuan Liu, Xiujing Xing, Zhengyang Gao, Shuai Dong, Hao Li

    Abstract: Developing solid-state hydrogen storage materials is as pressing as ever, which requires a comprehensive understanding of the dehydrogenation chemistry of a solid-state hydride. Transition state search and kinetics calculations are essential to understanding and designing high-performance solid-state hydrogen storage materials by filling in the knowledge gap that current experimental techniques ca… ▽ More

    Submitted 29 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  49. arXiv:2403.11777  [pdf

    cond-mat.mtrl-sci

    Ultralarge polarization in ferroelectric hafnia-based thin films

    Authors: Han Wu, Kun Lin, Qinghua Zhang, Qian Yu, Xiaoqian Fu, Qiang Li, Meera Cheviri, Oswaldo Dieguez, Shuai Xu, Lin Gu, Yili Cao, Jiaou Wang, Zhen Wang, Yu Chen, Huanhua Wang, Jinxia Deng, Jun Miao, Xianran Xing

    Abstract: Hafnia-based ferroelectrics have become a valuable class of electronic functional materials at the nanoscale, showing great potential for next-generation memory and logic devices. However, more robust ferroelectric properties and better understanding of the polarization mechanisms are currently needed both in technology and science. Herein, we report the properties of oxygen-deficient Hf0.5Zr0.5O2… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  50. arXiv:2403.07246  [pdf, other

    cs.CV

    Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration

    Authors: Weiying Xue, Qi Liu, Qiwei Xiong, Yuxiao Wang, Zhenao Wei, Xiaofen Xing, Xiangmin Xu

    Abstract: Human-object interaction (HOI) detection aims to locate human-object pairs and identify their interaction categories in images. Most existing methods primarily focus on supervised learning, which relies on extensive manual HOI annotations. In this paper, we propose a novel framework, termed Knowledge Integration to HOI (KI2HOI), that effectively integrates the knowledge of visual-language model to… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.