Skip to main content

Showing 1–50 of 494 results for author: Yang, R

  1. arXiv:2410.14309  [pdf, other

    cs.CL cs.AI

    LoGU: Long-form Generation with Uncertainty Expressions

    Authors: Ruihan Yang, Caiqi Zhang, Zhisong Zhang, Xinting Huang, Sen Yang, Nigel Collier, Dong Yu, Deqing Yang

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities, they still struggle with generating factually incorrect content (i.e., hallucinations). A promising approach to mitigate this issue is enabling models to express uncertainty when unsure. Previous research on uncertainty modeling has primarily focused on short-form QA, but realworld applications often require much longer respon… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  2. arXiv:2410.13246  [pdf, other

    cs.CL cs.AI

    Atomic Calibration of LLMs in Long-Form Generations

    Authors: Caiqi Zhang, Ruihan Yang, Zhisong Zhang, Xinting Huang, Sen Yang, Dong Yu, Nigel Collier

    Abstract: Large language models (LLMs) often suffer from hallucinations, posing significant challenges for real-world applications. Confidence calibration, which estimates the underlying uncertainty of model predictions, is essential to enhance the LLMs' trustworthiness. Existing research on LLM calibration has primarily focused on short-form tasks, providing a single confidence score at the response level… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  3. arXiv:2410.12051  [pdf, other

    cs.HC cs.AI cs.ET cs.MM

    Enabling Data-Driven and Empathetic Interactions: A Context-Aware 3D Virtual Agent in Mixed Reality for Enhanced Financial Customer Experience

    Authors: Cindy Xu, Mengyu Chen, Pranav Deshpande, Elvir Azanli, Runqing Yang, Joseph Ligman

    Abstract: In this paper, we introduce a novel system designed to enhance customer service in the financial and retail sectors through a context-aware 3D virtual agent, utilizing Mixed Reality (MR) and Vision Language Models (VLMs). Our approach focuses on enabling data-driven and empathetic interactions that ensure customer satisfaction by introducing situational awareness of the physical location, personal… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: to appear at 1st Workshop on Intelligent XR: Harnessing AI for Next-Generation XR User Experiences at International Symposium on Mixed and Augmented Reality (ISMAR) 2024

    ACM Class: H.5.1; K.4.3

  4. arXiv:2410.11853  [pdf, other

    cs.DB cs.IR cs.LG cs.SI

    GeoLife+: Large-Scale Simulated Trajectory Datasets Calibrated to the GeoLife Dataset

    Authors: Hossein Amiri, Richard Yang, Andreas Zufle

    Abstract: Analyzing individual human trajectory data helps our understanding of human mobility and finds many commercial and academic applications. There are two main approaches to accessing trajectory data for research: one involves using real-world datasets like GeoLife, while the other employs simulations to synthesize data. Real-world data provides insights from real human activities, but such data is g… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted paper at https://geosim.org/

  5. arXiv:2410.11531  [pdf, other

    cs.AI

    AGENTiGraph: An Interactive Knowledge Graph Platform for LLM-based Chatbots Utilizing Private Data

    Authors: Xinjie Zhao, Moritz Blum, Rui Yang, Boming Yang, Luis Márquez Carpintero, Mónica Pina-Navarro, Tony Wang, Xin Li, Huitao Li, Yanran Fu, Rongrong Wang, Juntao Zhang, Irene Li

    Abstract: Large Language Models~(LLMs) have demonstrated capabilities across various applications but face challenges such as hallucination, limited reasoning abilities, and factual inconsistencies, especially when tackling complex, domain-specific tasks like question answering~(QA). While Knowledge Graphs~(KGs) have been shown to help mitigate these issues, research on the integration of LLMs with backgrou… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 30 pages, 7 figures; Submitted to COLING 2025 System Demonstrations Track

  6. arXiv:2410.11345  [pdf, other

    cs.RO

    Visual Manipulation with Legs

    Authors: Xialin He, Chengjing Yuan, Wenxuan Zhou, Ruihan Yang, David Held, Xiaolong Wang

    Abstract: Animals use limbs for both locomotion and manipulation. We aim to equip quadruped robots with similar versatility. This work introduces a system that enables quadruped robots to interact with objects using their legs, inspired by non-prehensile manipulation. The system has two main components: a visual manipulation policy module and a loco-manipulator module. The visual manipulation policy, traine… ▽ More

    Submitted 16 October, 2024; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: More details can be found on our project page: https://legged-manipulation.github.io/

  7. arXiv:2410.10178  [pdf, other

    cs.LG cs.MM

    GUISE: Graph GaUssIan Shading watErmark

    Authors: Renyi Yang

    Abstract: In the expanding field of generative artificial intelligence, integrating robust watermarking technologies is essential to protect intellectual property and maintain content authenticity. Traditionally, watermarking techniques have been developed primarily for rich information media such as images and audio. However, these methods have not been adequately adapted for graph-based data, particularly… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  8. arXiv:2410.09318  [pdf, other

    cs.CL cs.CY cs.SE

    Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation

    Authors: Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman

    Abstract: While Large language model (LLM)-based programming assistants such as CoPilot and ChatGPT can help improve the productivity of professional software developers, they can also facilitate cheating in introductory computer programming courses. Assuming instructors have limited control over the industrial-strength models, this paper investigates the baseline performance of 5 widely used LLMs on a coll… ▽ More

    Submitted 15 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

  9. arXiv:2410.08656  [pdf, ps, other

    eess.SP cs.AI

    radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction

    Authors: Yuanyuan Zhang, Rui Yang, Yutao Yue, Eng Gee Lim

    Abstract: Millimeter-wave radar is promising to provide robust and accurate vital sign monitoring in an unobtrusive manner. However, the radar signal might be distorted in propagation by ambient noise or random body movement, ruining the subtle cardiac activities and destroying the vital sign recovery. In particular, the recovery of electrocardiogram (ECG) signal heavily relies on the deep-learning model an… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  10. arXiv:2410.08557  [pdf, other

    cs.LG

    MUSO: Achieving Exact Machine Unlearning in Over-Parameterized Regimes

    Authors: Ruikai Yang, Mingzhen He, Zhengbao He, Youmei Qiu, Xiaolin Huang

    Abstract: Machine unlearning (MU) is to make a well-trained model behave as if it had never been trained on specific data. In today's over-parameterized models, dominated by neural networks, a common approach is to manually relabel data and fine-tune the well-trained model. It can approximate the MU model in the output space, but the question remains whether it can achieve exact MU, i.e., in the parameter s… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  11. arXiv:2410.07961  [pdf, other

    quant-ph cs.DS cs.LG

    QCircuitNet: A Large-Scale Hierarchical Dataset for Quantum Algorithm Design

    Authors: Rui Yang, Yuntian Gu, Ziruo Wang, Yitao Liang, Tongyang Li

    Abstract: Quantum computing is an emerging field recognized for the significant speedup it offers over classical computing through quantum algorithms. However, designing and implementing quantum algorithms pose challenges due to the complex nature of quantum mechanics and the necessity for precise control over quantum states. Despite the significant advancements in AI, there has been a lack of datasets spec… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 35 pages, 7 figures, 4 tables, GitHub repository: https://github.com/EstelYang/QCircuitNet_Dataset

  12. arXiv:2410.07493  [pdf, other

    cs.RO eess.SY

    Autonomous Robotic System with Optical Coherence Tomography Guidance for Vascular Anastomosis

    Authors: Jesse Haworth, Rishi Biswas, Justin Opfermann, Michael Kam, Yaning Wang, Desire Pantalone, Francis X. Creighton, Robin Yang, Jin U. Kang, Axel Krieger

    Abstract: Vascular anastomosis, the surgical connection of blood vessels, is essential in procedures such as organ transplants and reconstructive surgeries. The precision required limits accessibility due to the extensive training needed, with manual suturing leading to variable outcomes and revision rates up to 7.9%. Existing robotic systems, while promising, are either fully teleoperated or lack the capab… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: This paper was submitted to IEEE TMRB and is currently under review. There are 9 pages, 9 figures, and 2 tables

    MSC Class: 68T40: Robotics

  13. arXiv:2410.07196  [pdf, other

    eess.SP cs.LG

    EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Towards Large-Scale EEG Model

    Authors: Chengxuan Qin, Rui Yang, Wenlong You, Zhige Chen, Longsheng Zhu, Mengjie Huang, Zidong Wang

    Abstract: The increasing number of dispersed EEG dataset publications and the advancement of large-scale Electroencephalogram (EEG) models have increased the demand for practical tools to manage diverse EEG datasets. However, the inherent complexity of EEG data, characterized by variability in content data, metadata, and data formats, poses challenges for integrating multiple datasets and conducting large-s… ▽ More

    Submitted 24 September, 2024; originally announced October 2024.

  14. arXiv:2410.02534  [pdf, other

    cs.CV

    Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo Matching

    Authors: Ruizhi Yang, Xingqiang Li, Jiajun Bai, Jinsong Du

    Abstract: Self-supervised stereo matching holds great promise for application and research due to its independence from expensive labeled data. However, direct self-supervised stereo matching paradigms based on photometric loss functions have consistently struggled with performance issues due to the occlusion challenge. The crux of the occlusion challenge lies in the fact that the positions of occluded pixe… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Submitted to IEEE Transactions on Image Processing (TIP)

  15. arXiv:2410.01481  [pdf, other

    cs.SD cs.AI eess.AS

    SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

    Authors: Kai Li, Wendi Sang, Chang Zeng, Runxuan Yang, Guo Chen, Xiaolin Hu

    Abstract: The systematic evaluation of speech separation and enhancement models under moving sound source conditions typically requires extensive data comprising diverse scenarios. However, real-world datasets often contain insufficient data to meet the training and evaluation requirements of models. Although synthetic datasets offer a larger volume of data, their acoustic simulations lack realism. Conseque… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: Technical report

  16. arXiv:2409.20306  [pdf, other

    cs.NI

    Diagnosing and Repairing Distributed Routing Configurations Using Selective Symbolic Simulation

    Authors: Rulan Yang, Hanyang Shao, Gao Han, Ziyi Wang, Xing Fang, Lizhao You, Qiao Xiang, Linghe Kong, Ruiting Zhou, Jiwu Shu

    Abstract: Although substantial progress has been made in automatically verifying whether distributed routing configurations conform to certain requirements, diagnosing and repairing configuration errors remains manual and time-consuming. To fill this gap, we propose S^2Sim, a novel system for automatic routing configuration diagnosis and repair. Our key insight is that by selectively simulating variants of… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

  17. arXiv:2409.18333  [pdf, other

    q-bio.NC cs.LG

    A Framework for Standardizing Similarity Measures in a Rapidly Evolving Field

    Authors: Nathan Cloos, Guangyu Robert Yang, Christopher J. Cueva

    Abstract: Similarity measures are fundamental tools for quantifying the alignment between artificial and biological systems. However, the diversity of similarity measures and their varied naming and implementation conventions makes it challenging to compare across studies. To facilitate comparisons and make explicit the implementation choices underlying a given code package, we have created and are continui… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 11 pages, 9 figures

  18. arXiv:2409.17385  [pdf, other

    cs.LG cs.AI cs.CV

    Data-efficient Trajectory Prediction via Coreset Selection

    Authors: Ruining Yang, Lili Su

    Abstract: Modern vehicles are equipped with multiple information-collection devices such as sensors and cameras, continuously generating a large volume of raw data. Accurately predicting the trajectories of neighboring vehicles is a vital component in understanding the complex driving environment. Yet, training trajectory prediction models is challenging in two ways. Processing the large-scale data is compu… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

  19. arXiv:2409.16429  [pdf, other

    cs.CV cs.AI cs.LG

    Leveraging Local Structure for Improving Model Explanations: An Information Propagation Approach

    Authors: Ruo Yang, Binghui Wang, Mustafa Bilgic

    Abstract: Numerous explanation methods have been recently developed to interpret the decisions made by deep neural network (DNN) models. For image classifiers, these methods typically provide an attribution score to each pixel in the image to quantify its contribution to the prediction. However, most of these explanation methods appropriate attribution scores to pixels independently, even though both humans… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  20. arXiv:2409.15461  [pdf, other

    cs.AI cs.CL

    RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert Collaboration

    Authors: Haoyu Huang, Tong Niu, Rui Yang, Luping Shi

    Abstract: Recently, many studies focus on utilizing large language models (LLMs) into educational dialogues. Especially, within liberal arts dialogues, educators must balance \textbf{H}umanized communication, \textbf{T}eaching expertise, and \textbf{S}afety-ethics (\textbf{HTS}), besides the subject knowledge itself. However, due to collecting massive amounts of HTS-compliant teaching dialogues from real wo… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  21. arXiv:2409.14837  [pdf, other

    cs.AR

    MESC: Re-thinking Algorithmic Priority and/or Criticality Inversions for Heterogeneous MCSs

    Authors: Jiapeng Guan, Ran Wei, Dean You, Yingquan Wang, Ruizhe Yang, Hui Wang, Zhe Jiang

    Abstract: Modern Mixed-Criticality Systems (MCSs) rely on hardware heterogeneity to satisfy ever-increasing computational demands. However, most of the heterogeneous co-processors are designed to achieve high throughput, with their micro-architectures executing the workloads in a streaming manner. This streaming execution is often non-preemptive or limited-preemptive, preventing tasks' prioritisation based… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: Accepted at the 2024 IEEE Real-Time Systems Symposium (RTSS)

    ACM Class: C.3; D.4.7

  22. arXiv:2409.10310  [pdf, other

    cs.RO eess.SY

    Safe and Real-Time Consistent Planning for Autonomous Vehicles in Partially Observed Environments via Parallel Consensus Optimization

    Authors: Lei Zheng, Rui Yang, Minzhe Zheng, Michael Yu Wang, Jun Ma

    Abstract: Ensuring safety and driving consistency is a significant challenge for autonomous vehicles operating in partially observed environments. This work introduces a consistent parallel trajectory optimization (CPTO) approach to enable safe and consistent driving in dense obstacle environments with perception uncertainties. Utilizing discrete-time barrier function theory, we develop a consensus safety b… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  23. arXiv:2409.08147  [pdf, other

    cs.CL

    LLM-POTUS Score: A Framework of Analyzing Presidential Debates with Large Language Models

    Authors: Zhengliang Liu, Yiwei Li, Oleksandra Zolotarevych, Rongwei Yang, Tianming Liu

    Abstract: Large language models have demonstrated remarkable capabilities in natural language processing, yet their application to political discourse analysis remains underexplored. This paper introduces a novel approach to evaluating presidential debate performances using LLMs, addressing the longstanding challenge of objectively assessing debate outcomes. We propose a framework that analyzes candidates'… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  24. arXiv:2409.04778  [pdf, other

    cs.CL cs.LG

    LoCa: Logit Calibration for Knowledge Distillation

    Authors: Runming Yang, Taiqiang Wu, Yujiu Yang

    Abstract: Knowledge Distillation (KD), aiming to train a better student model by mimicking the teacher model, plays an important role in model compression. One typical way is to align the output logits. However, we find a common issue named mis-instruction, that the student would be misled when the predictions based on teacher logits do not follow the labels. Meanwhile, there is other useful dark knowledge… ▽ More

    Submitted 7 September, 2024; originally announced September 2024.

    Comments: Accepted by ECAI 2024

  25. arXiv:2409.02938  [pdf, other

    cs.LG cs.AI

    CortexCompile: Harnessing Cortical-Inspired Architectures for Enhanced Multi-Agent NLP Code Synthesis

    Authors: Gautham Ramachandran, Rick Yang

    Abstract: Current approaches to automated code generation often rely on monolithic models that lack real-time adaptability and scalability. This limitation is particularly evident in complex programming tasks that require dynamic adjustment and efficiency. The integration of neuroscience principles into Natural Language Processing (NLP) has the potential to revolutionize automated code generation. This pape… ▽ More

    Submitted 23 August, 2024; originally announced September 2024.

    Comments: 17 pages, 6 figures

    ACM Class: I.2.2; I.2.7

  26. arXiv:2408.13915  [pdf, other

    cs.CL cs.AI

    LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback

    Authors: Tanushree Banerjee, Richard Zhu, Runzhe Yang, Karthik Narasimhan

    Abstract: Large Language Models (LLMs) excel at generating human-like dialogues and comprehending text. However, understanding the subtleties of complex exchanges in language remains a challenge. We propose a bootstrapping framework that leverages self-generated feedback to enhance LLM reasoning capabilities for lie detection. The framework consists of three stages: suggestion, feedback collection, and modi… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: 19 pages, 18 figures

  27. arXiv:2408.11805  [pdf, other

    cs.RO cs.CV cs.LG

    ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation

    Authors: Shiqi Yang, Minghuan Liu, Yuzhe Qin, Runyu Ding, Jialong Li, Xuxin Cheng, Ruihan Yang, Sha Yi, Xiaolong Wang

    Abstract: Learning from demonstrations has shown to be an effective approach to robotic manipulation, especially with the recently collected large-scale robot data with teleoperation systems. Building an efficient teleoperation system across diverse robot platforms has become more crucial than ever. However, there is a notable lack of cost-effective and user-friendly teleoperation systems for different end-… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Webpage: https://ace-teleop.github.io/

  28. arXiv:2408.11187  [pdf, other

    cs.RO cs.AI cs.MA

    Optimization of Multi-Agent Flying Sidekick Traveling Salesman Problem over Road Networks

    Authors: Ruixiao Yang, Chuchu Fan

    Abstract: The mixed truck-drone delivery systems have attracted increasing attention for last-mile logistics, but real-world complexities demand a shift from single-agent, fully connected graph models to multi-agent systems operating on actual road networks. We introduce the multi-agent flying sidekick traveling salesman problem (MA-FSTSP) on road networks, extending the single truck-drone model to multiple… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  29. arXiv:2408.10819  [pdf, other

    cs.CL cs.AI

    Exploiting Large Language Models Capabilities for Question Answer-Driven Knowledge Graph Completion Across Static and Temporal Domains

    Authors: Rui Yang, Jiahao Zhu, Jianping Man, Li Fang, Yi Zhou

    Abstract: Knowledge graph completion (KGC) aims to identify missing triples in a knowledge graph (KG). This is typically achieved through tasks such as link prediction and instance completion. However, these methods often focus on either static knowledge graphs (SKGs) or temporal knowledge graphs (TKGs), addressing only within-scope triples. This paper introduces a new generative completion framework called… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  30. A Versatile Framework for Attributed Network Clustering via K-Nearest Neighbor Augmentation

    Authors: Yiran Li, Gongyao Guo, Jieming Shi, Renchi Yang, Shiqi Shen, Qing Li, Jun Luo

    Abstract: Attributed networks containing entity-specific information in node attributes are ubiquitous in modeling social networks, e-commerce, bioinformatics, etc. Their inherent network topology ranges from simple graphs to hypergraphs with high-order interactions and multiplex graphs with separate layers. An important graph mining task is node clustering, aiming to partition the nodes of an attributed ne… ▽ More

    Submitted 5 October, 2024; v1 submitted 10 August, 2024; originally announced August 2024.

    Comments: 25 pages, 15 figures

    Journal ref: The VLDB Journal (2024) 1-31

  31. arXiv:2408.03468  [pdf, other

    cs.MM cs.AI cs.CV

    MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili

    Authors: Han Wang, Tan Rui Yang, Usman Naseem, Roy Ka-Wei Lee

    Abstract: Hate speech is a pressing issue in modern society, with significant effects both online and offline. Recent research in hate speech detection has primarily centered on text-based media, largely overlooking multimodal content such as videos. Existing studies on hateful video datasets have predominantly focused on English content within a Western context and have been limited to binary labels (hatef… ▽ More

    Submitted 12 August, 2024; v1 submitted 28 July, 2024; originally announced August 2024.

    Comments: 10 pages, 3 figures, ACM Multimedia 2024

    ACM Class: I.2.0

  32. arXiv:2408.01680  [pdf, ps, other

    cs.IT

    Service Placement and Trajectory Design for Heterogeneous Tasks in Multi-UAV Cooperative Computing Networks

    Authors: Bin Li, Rongrong Yang, Lei Liu, Celimuge Wu

    Abstract: In this paper, we consider deploying multiple Unmanned Aerial Vehicles (UAVs) to enhance the computation service of Mobile Edge Computing (MEC) through collaborative computation among UAVs. In particular, the tasks of different types and service requirements in MEC network are offloaded from one UAV to another. To pursue the goal of low-carbon edge computing, we study the problem of minimizing sys… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: 11 pages, 10 figures

  33. arXiv:2408.01672  [pdf, ps, other

    eess.SP cs.AI

    radarODE: An ODE-Embedded Deep Learning Model for Contactless ECG Reconstruction from Millimeter-Wave Radar

    Authors: Yuanyuan Zhang, Runwei Guan, Lingxiao Li, Rui Yang, Yutao Yue, Eng Gee Lim

    Abstract: Radar-based contactless cardiac monitoring has become a popular research direction recently, but the fine-grained electrocardiogram (ECG) signal is still hard to reconstruct from millimeter-wave radar signal. The key obstacle is to decouple the cardiac activities in the electrical domain (i.e., ECG) from that in the mechanical domain (i.e., heartbeat), and most existing research only uses pure dat… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

  34. arXiv:2408.01413  [pdf, other

    cs.GT

    A Game Theoretic Analysis of High Occupancy Toll Lane Design

    Authors: Zhanhao Zhang, Ruifan Yang, Manxi Wu

    Abstract: In this article, we study the optimal design of High Occupancy Toll (HOT) lanes. The traffic authority determines the road capacity allocation between HOT lanes and ordinary lanes, as well as the toll price charged for travelers using HOT lanes who do not meet the high-occupancy eligibility criteria. We develop a game-theoretic model to analyze the decisions of travelers with heterogeneous prefere… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  35. arXiv:2407.20099  [pdf, other

    cs.CV

    RSC-SNN: Exploring the Trade-off Between Adversarial Robustness and Accuracy in Spiking Neural Networks via Randomized Smoothing Coding

    Authors: Keming Wu, Man Yao, Yuhong Chou, Xuerui Qiu, Rui Yang, Bo Xu, Guoqi Li

    Abstract: Spiking Neural Networks (SNNs) have received widespread attention due to their unique neuronal dynamics and low-power nature. Previous research empirically shows that SNNs with Poisson coding are more robust than Artificial Neural Networks (ANNs) on small-scale datasets. However, it is still unclear in theory how the adversarial robustness of SNNs is derived, and whether SNNs can still maintain it… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM MM 2024

  36. arXiv:2407.19639  [pdf, other

    cs.CR

    Segmented Private Data Aggregation in the Multi-message Shuffle Model

    Authors: Shaowei Wang, Ruilin Yang, Sufen Zeng, Kaiqi Yu, Rundong Mei, Shaozheng Huang, Wei Yang

    Abstract: The shuffle model of differential privacy (DP) offers compelling privacy-utility trade-offs in decentralized settings (e.g., internet of things, mobile edge networks). Particularly, the multi-message shuffle model, where each user may contribute multiple messages, has shown that accuracy can approach that of the central model of DP. However, existing studies typically assume a uniform privacy prot… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  37. arXiv:2407.17466  [pdf, other

    cs.LG math.OC stat.ML

    Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning

    Authors: Shuang Qiu, Dake Zhang, Rui Yang, Boxiang Lyu, Tong Zhang

    Abstract: This paper investigates multi-objective reinforcement learning (MORL), which focuses on learning Pareto optimal policies in the presence of multiple reward functions. Despite MORL's significant empirical success, there is still a lack of satisfactory understanding of various MORL optimization targets and efficient learning algorithms. Our work offers a systematic analysis of several optimization t… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: Initially submitted in May 2024

  38. arXiv:2407.17356  [pdf, other

    cs.LG cs.NE

    Gradient-based inference of abstract task representations for generalization in neural networks

    Authors: Ali Hummos, Felipe del Río, Brabeeba Mien Wang, Julio Hurtado, Cristian B. Calderon, Guangyu Robert Yang

    Abstract: Humans and many animals show remarkably adaptive behavior and can respond differently to the same input depending on their internal goals. The brain not only represents the intermediate abstractions needed to perform a computation but also actively maintains a representation of the computation itself (task abstraction). Such separation of the computation and its abstraction is associated with fast… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  39. arXiv:2407.16554  [pdf, other

    cs.MM cs.CV cs.SD eess.AS

    Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization

    Authors: Junyan Wu, Wei Lu, Xiangyang Luo, Rui Yang, Qian Wang, Xiaochun Cao

    Abstract: Recently, a novel form of audio partial forgery has posed challenges to its forensics, requiring advanced countermeasures to detect subtle forgery manipulations within long-duration audio. However, existing countermeasures still serve a classification purpose and fail to perform meaningful analysis of the start and end timestamps of partial forgery segments. To address this challenge, we introduce… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 9pages, 3figures. This paper has been accepted for ACM MM 2024

    MSC Class: 68T07; 68T10 ACM Class: I.2; I.5

  40. arXiv:2407.14007  [pdf, other

    cs.CV cs.AI

    Multi-modal Relation Distillation for Unified 3D Representation Learning

    Authors: Huiqun Wang, Yiping Bao, Panwang Pan, Zeming Li, Xiao Liu, Ruijie Yang, Di Huang

    Abstract: Recent advancements in multi-modal pre-training for 3D point clouds have demonstrated promising results by aligning heterogeneous features across 3D shapes and their corresponding 2D images and language descriptions. However, current straightforward solutions often overlook intricate structural relations among samples, potentially limiting the full capabilities of multi-modal learning. To address… ▽ More

    Submitted 18 September, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  41. arXiv:2407.11401  [pdf, other

    cs.CV cs.IR

    EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis

    Authors: Ruijie Yang, Yan Zhu, Peiyao Fu, Yizhe Zhang, Zhihua Wang, Quanlin Li, Pinghong Zhou, Xian Yang, Shuo Wang

    Abstract: Determining the necessity of resecting malignant polyps during colonoscopy screen is crucial for patient outcomes, yet challenging due to the time-consuming and costly nature of histopathology examination. While deep learning-based classification models have shown promise in achieving optical biopsy with endoscopic images, they often suffer from a lack of explainability. To overcome this limitatio… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024

  42. arXiv:2407.10794  [pdf, other

    cs.CL cs.AI

    Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: Knowledge graphs (KGs) are crucial in the field of artificial intelligence and are widely applied in downstream tasks, such as enhancing Question Answering (QA) systems. The construction of KGs typically requires significant effort from domain experts. Recently, Large Language Models (LLMs) have been used for knowledge graph construction (KGC), however, most existing approaches focus on a local pe… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 24 pages, 11 figures, 13 tables. arXiv admin note: substantial text overlap with arXiv:2402.14293

  43. arXiv:2407.07913  [pdf, other

    cs.IR cs.AI

    CaseGPT: a case reasoning framework based on language models and retrieval-augmented generation

    Authors: Rui Yang

    Abstract: This paper presents CaseGPT, an innovative approach that combines Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) technology to enhance case-based reasoning in the healthcare and legal sectors. The system addresses the challenges of traditional database queries by enabling fuzzy searches based on imprecise descriptions, thereby improving data searchability and usability. Case… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Submitted to ICCBR

  44. arXiv:2407.07059  [pdf, other

    q-bio.NC cs.LG

    Differentiable Optimization of Similarity Scores Between Models and Brains

    Authors: Nathan Cloos, Moufan Li, Markus Siegel, Scott L. Brincat, Earl K. Miller, Guangyu Robert Yang, Christopher J. Cueva

    Abstract: How do we know if two systems - biological or artificial - process information in a similar way? Similarity measures such as linear regression, Centered Kernel Alignment (CKA), Normalized Bures Similarity (NBS), and angular Procrustes distance, are often used to quantify this similarity. However, it is currently unclear what drives high similarity scores and even what constitutes a "good" score. H… ▽ More

    Submitted 21 October, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 19 pages, 9 figures

  45. arXiv:2407.04285  [pdf, other

    cs.LG cs.AI

    Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling

    Authors: Jiawei Xu, Rui Yang, Feng Luo, Meng Fang, Baoxiang Wang, Lei Han

    Abstract: Learning policies from offline datasets through offline reinforcement learning (RL) holds promise for scaling data-driven decision-making and avoiding unsafe and costly online interactions. However, real-world data collected from sensors or humans often contains noise and errors, posing a significant challenge for existing offline RL methods. Our study indicates that traditional offline RL methods… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  46. arXiv:2407.03162  [pdf, other

    cs.RO cs.CV cs.LG

    Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning

    Authors: Runyu Ding, Yuzhe Qin, Jiyue Zhu, Chengzhe Jia, Shiqi Yang, Ruihan Yang, Xiaojuan Qi, Xiaolong Wang

    Abstract: Teleoperation is a crucial tool for collecting human demonstrations, but controlling robots with bimanual dexterous hands remains a challenge. Existing teleoperation systems struggle to handle the complexity of coordinating two hands for intricate manipulations. We introduce Bunny-VisionPro, a real-time bimanual dexterous teleoperation system that leverages a VR headset. Unlike previous vision-bas… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: project page: https://dingry.github.io/projects/bunny_visionpro.html

  47. arXiv:2406.19414  [pdf, other

    q-fin.ST cs.LG q-fin.PR stat.AP stat.ML stat.OT

    Stock Volume Forecasting with Advanced Information by Conditional Variational Auto-Encoder

    Authors: Parley R Yang, Alexander Y Shestopaloff

    Abstract: We demonstrate the use of Conditional Variational Encoder (CVAE) to improve the forecasts of daily stock volume time series in both short and long term forecasting tasks, with the use of advanced information of input variables such as rebalancing dates. CVAE generates non-linear time series as out-of-sample forecasts, which have better accuracy and closer fit of correlation to the actual data, com… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  48. arXiv:2406.17624  [pdf, other

    cs.CL cs.AI

    Self-assessment, Exhibition, and Recognition: a Review of Personality in Large Language Models

    Authors: Zhiyuan Wen, Yu Yang, Jiannong Cao, Haoming Sun, Ruosong Yang, Shuaiqi Liu

    Abstract: As large language models (LLMs) appear to behave increasingly human-like in text-based interactions, more and more researchers become interested in investigating personality in LLMs. However, the diversity of psychological personality research and the rapid development of LLMs have led to a broad yet fragmented landscape of studies in this interdisciplinary field. Extensive studies across differen… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  49. arXiv:2406.17274  [pdf, other

    cs.CL cs.LG

    Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization?

    Authors: Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu

    Abstract: Text summarization, a key natural language generation (NLG) task, is vital in various domains. However, the high cost of inaccurate summaries in risk-critical applications, particularly those involving human-in-the-loop decision-making, raises concerns about the reliability of uncertainty estimation on text summarization (UE-TS) evaluation methods. This concern stems from the dependency of uncerta… ▽ More

    Submitted 9 October, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 62 pages, 41 figures, 11 tables

  50. arXiv:2406.13369  [pdf, other

    cs.LG cs.SI

    Effective Edge-wise Representation Learning in Edge-Attributed Bipartite Graphs

    Authors: Hewen Wang, Renchi Yang, Xiaokui Xiao

    Abstract: Graph representation learning (GRL) is to encode graph elements into informative vector representations, which can be used in downstream tasks for analyzing graph-structured data and has seen extensive applications in various domains. However, the majority of extant studies on GRL are geared towards generating node representations, which cannot be readily employed to perform edge-based analytics t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 11 pages. Full version of the research paper accepted to KDD 2024