Showing 1–50 of 212 results for author: Jin, W

  1. arXiv:2410.12583 [pdf, other]

    cs.CL cs.AI

    STRUX: An LLM for Decision-Making with Structured Explanations

    Authors: Yiming Lu, Yebowen Hu, Hassan Foroosh, Wei Jin, Fei Liu

    Abstract: Countless decisions shape our daily lives, and it is paramount to understand the how and why behind these choices. In this paper, we introduce a new LLM decision-making framework called STRUX, which enhances LLM decision-making by providing structured explanations. These include favorable and adverse facts related to the decision, along with their respective strengths. STRUX begins by distilling l…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 10 pages, 7 figures, submitted to NAACL 2025

  2. arXiv:2410.09543 [pdf, other]

    cs.CE cs.AI q-bio.BM

    Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

    Authors: Xiaoran Jiao, Weian Mao, Wengong Jin, Peiyuan Yang, Hao Chen, Chunhua Shen

    Abstract: Predicting the change in binding free energy ($ΔΔG$) is crucial for understanding and modulating protein-protein interactions, which are critical in drug design. Due to the scarcity of experimental $ΔΔG$ data, existing methods focus on pre-training, while neglecting the importance of alignment. In this work, we propose the Boltzmann Alignment technique to transfer knowledge from pre-trained invers…

    Submitted 12 October, 2024; originally announced October 2024.

  3. arXiv:2410.09286 [pdf, other]

    cs.RO cs.AI

    Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos

    Authors: Harsh Mahesheka, Zhixian Xie, Zhaoran Wang, Wanxin Jin

    Abstract: Learning from Demonstrations, particularly from biological experts like humans and animals, often encounters significant data acquisition challenges. While recent approaches leverage internet videos for learning, they require complex, task-specific pipelines to extract and retarget motion data for the agent. In this work, we introduce a language-model-assisted bi-level programming framework that e…

    Submitted 11 October, 2024; originally announced October 2024.

  4. arXiv:2410.06373 [pdf, other]

    cs.CV cs.LG

    Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning

    Authors: Siyuan Li, Juanxi Tian, Zedong Wang, Luyuan Zhang, Zicheng Liu, Weiyang Jin, Yang Liu, Baigui Sun, Stan Z. Li

    Abstract: This paper delves into the interplay between vision backbones and optimizers, unveiling an inter-dependent phenomenon termed backbone-optimizer coupling bias (BOCB). We observe that canonical CNNs, such as VGG and ResNet, exhibit a marked co-dependency with SGD families, while recent architectures like ViTs and ConvNeXt share a tight coupling with the a…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: Preprint V1. Online project at https://bocb-ai.github.io/

  5. arXiv:2410.06340 [pdf, other]

    cs.LG

    FedGraph: A Research Library and Benchmark for Federated Graph Learning

    Authors: Yuhang Yao, Yuan Li, Xinyi Fan, Junhao Li, Kay Liu, Weizhao Jin, Srivatsan Ravi, Philip S. Yu, Carlee Joe-Wong

    Abstract: Federated graph learning is an emerging field with significant practical challenges. While many algorithms have been proposed to enhance model accuracy, their system performance, crucial for real-world deployment, is often overlooked. To address this gap, we present FedGraph, a research library designed for practical distributed deployment and benchmarking in federated graph learning. FedGraph sup…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: https://github.com/FedGraph/fedgraph

  6. arXiv:2410.00049 [pdf, other]

    cs.LG cs.AI cs.SI

    Epidemiology-Aware Neural ODE with Continuous Disease Transmission Graph

    Authors: Guancheng Wan, Zewen Liu, Max S. Y. Lau, B. Aditya Prakash, Wei Jin

    Abstract: Effective epidemic forecasting is critical for public health strategies and efficient medical resource allocation, especially in the face of rapidly spreading infectious diseases. However, existing deep-learning methods often overlook the dynamic nature of epidemics and fail to account for the specific mechanisms of disease transmission. In response to these challenges, we introduce an innovative…

    Submitted 28 September, 2024; originally announced October 2024.

  7. arXiv:2409.18042 [pdf, other]

    cs.CV cs.CL

    EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

    Authors: Kai Chen, Yunhao Gou, Runhui Huang, Zhili Liu, Daxin Tan, Jing Xu, Chunwei Wang, Yi Zhu, Yihan Zeng, Kuo Yang, Dingdong Wang, Kun Xiang, Haoyuan Li, Haoli Bai, Jianhua Han, Xiaohui Li, Weike Jin, Nian Xie, Yu Zhang, James T. Kwok, Hengshuang Zhao, Xiaodan Liang, Dit-Yan Yeung, Xiao Chen, Zhenguo Li, et al. (5 additional authors not shown)

    Abstract: GPT-4o, an omni-modal model that enables vocal conversations with diverse emotions and tones, marks a milestone for omni-modal foundation models. However, empowering Large Language Models to perceive and generate images, text, and speech end-to-end with publicly available data remains challenging in the open-source community. Existing vision-language models rely on external tools for the speech…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Project Page: https://emova-ollm.github.io/

  8. arXiv:2409.09371 [pdf, other]

    physics.ao-ph cs.LG

    WeatherReal: A Benchmark Based on In-Situ Observations for Evaluating Weather Models

    Authors: Weixin Jin, Jonathan Weyn, Pengcheng Zhao, Siqi Xiang, Jiang Bian, Zuliang Fang, Haiyu Dong, Hongyu Sun, Kit Thambiratnam, Qi Zhang

    Abstract: In recent years, AI-based weather forecasting models have matched or even outperformed numerical weather prediction systems. However, most of these models have been trained and evaluated on reanalysis datasets like ERA5. These datasets, being products of numerical models, often diverge substantially from actual observations in some crucial variables like near-surface temperature, wind, precipitati…

    Submitted 14 September, 2024; originally announced September 2024.

  9. arXiv:2409.08487 [pdf, other]

    cs.LG cs.AI stat.ML

    Sub-graph Based Diffusion Model for Link Prediction

    Authors: Hang Li, Wei Jin, Geri Skenderi, Harry Shomer, Wenzhuo Tang, Wenqi Fan, Jiliang Tang

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) represent a contemporary class of generative models with exceptional qualities in both synthesis and maximizing the data likelihood. These models work by traversing a forward Markov Chain where data is perturbed, followed by a reverse process where a neural network learns to undo the perturbations and recover the original data. There have been incre…

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 17 pages, 3 figures

  10. arXiv:2409.05212 [pdf, other]

    eess.AS cs.SD

    SS-BRPE: Self-Supervised Blind Room Parameter Estimation Using Attention Mechanisms

    Authors: Chunxi Wang, Maoshen Jia, Meiran Li, Changchun Bao, Wenyu Jin

    Abstract: In recent years, dynamic parameterization of acoustic environments has garnered attention in audio processing. This focus includes room volume and reverberation time (RT60), which define local acoustics independent of sound source and receiver orientation. Previous studies show that purely attention-based models can achieve advanced results in room parameter estimation. However, their success reli…

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 5 pages, 3 figures, submitted to ICASSP 2025

  11. arXiv:2408.09612 [pdf, other]

    cs.RO

    ContactSDF: Signed Distance Functions as Multi-Contact Models for Dexterous Manipulation

    Authors: Wen Yang, Wanxin Jin

    Abstract: In this paper, we propose ContactSDF, a method that uses signed distance functions (SDFs) to approximate multi-contact models, including both collision detection and time-stepping routines. ContactSDF first establishes an SDF using the supporting plane representation of an object for collision detection, and then uses the generated contact dual cones to build a second SDF for time stepping predicti…

    Submitted 18 August, 2024; originally announced August 2024.

  12. arXiv:2408.09327 [pdf, other]

    cs.LG cs.CL

    Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs

    Authors: Jiancheng Dong, Lei Jiang, Wei Jin, Lu Cheng

    Abstract: Packing for Supervised Fine-Tuning (SFT) in autoregressive models involves concatenating data points of varying lengths until reaching the designed maximum length to facilitate GPU processing. However, randomly concatenating data points and feeding them into an autoregressive transformer can lead to cross-contamination of sequences due to the significant difference in their subject matter. The mai…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 13 pages, 4 figures
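
    Illustration (not from the paper): the abstract above describes plain packing as concatenating data points of varying lengths until a maximum length is reached. A minimal sketch of that baseline follows, assuming token-ID lists as input and a hypothetical helper name `pack_sequences`. It packs samples in the order given and does not implement the Threshold Filtering Packing method itself, whose details are cut off in the truncated abstract.

    ```python
    from typing import List


    def pack_sequences(samples: List[List[int]], max_len: int) -> List[List[int]]:
        """Greedily concatenate token sequences into packs of at most max_len tokens.

        Hypothetical helper for illustration only; samples longer than max_len
        are truncated so that every pack respects the length limit.
        """
        packs: List[List[int]] = []
        current: List[int] = []
        for tokens in samples:
            tokens = tokens[:max_len]  # truncate an oversized sample
            # Close the current pack if this sample would overflow it.
            if current and len(current) + len(tokens) > max_len:
                packs.append(current)
                current = []
            current = current + tokens
        if current:
            packs.append(current)
        return packs


    if __name__ == "__main__":
        print(pack_sequences([[1, 2, 3], [4, 5], [6, 7, 8, 9], [10]], max_len=5))
    ```

    Because unrelated samples end up side by side in one pack, this baseline is where the cross-contamination mentioned in the abstract can arise.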

  13. arXiv:2408.07855 [pdf, other]

    cs.RO

    Complementarity-Free Multi-Contact Modeling and Optimization for Dexterous Manipulation

    Authors: Wanxin Jin

    Abstract: A significant barrier preventing model-based methods from matching the high performance of reinforcement learning in dexterous manipulation is the inherent complexity of multi-contact dynamics. Traditionally formulated using complementarity models, multi-contact dynamics introduces combinatorial complexity and non-smoothness, complicating contact-rich planning and control. In this paper, we circum…

    Submitted 17 August, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: Video demo: https://youtu.be/NsL4hbSXvFg

  14. arXiv:2407.19902 [pdf, other]

    cs.RO eess.SY math.OC

    A Differential Dynamic Programming Framework for Inverse Reinforcement Learning

    Authors: Kun Cao, Xinhang Xu, Wanxin Jin, Karl H. Johansson, Lihua Xie

    Abstract: A differential dynamic programming (DDP)-based framework for inverse reinforcement learning (IRL) is introduced to recover the parameters in the cost function, system dynamics, and constraints from demonstrations. Different from existing work, where DDP was used for the inner forward problem with inequality constraints, our proposed framework uses it for efficient computation of the gradient requi…

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 20 pages, 15 figures; submitted to IEEE for potential publication

  15. arXiv:2407.12068 [pdf, other]

    cs.LG cs.AI

    Learning on Graphs with Large Language Models(LLMs): A Deep Dive into Model Robustness

    Authors: Kai Guo, Zewen Liu, Zhikai Chen, Hongzhi Wen, Wei Jin, Jiliang Tang, Yi Chang

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various natural language processing tasks. Recently, several LLM-based pipelines have been developed to enhance learning on graphs with text attributes, showcasing promising performance. However, graphs are well known to be susceptible to adversarial attacks and it remains unclear whether LLMs exhibit robustness in learn…

    Submitted 28 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  16. arXiv:2407.04216 [pdf, other]

    cs.RO

    Safe MPC Alignment with Human Directional Feedback

    Authors: Zhixian Xie, Wenlong Zhang, Yi Ren, Zhaoran Wang, George J. Pappas, Wanxin Jin

    Abstract: In safety-critical robot planning or control, manually specifying safety constraints or learning them from demonstrations can be challenging. In this paper, we propose a certifiable alignment method for a robot to learn a safety constraint in its model predictive control (MPC) policy with human online directional feedback. To our knowledge, it is the first method to learn safety constraints from h…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 18 pages, submission to T-RO

  17. arXiv:2406.19632 [pdf, other]

    cs.CV

    PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation

    Authors: Deyi Ji, Wenwei Jin, Hongtao Lu, Feng Zhao

    Abstract: The ascension of Unmanned Aerial Vehicles (UAVs) in various fields necessitates effective UAV image segmentation, which faces challenges due to the dynamic perspectives of UAV-captured images. Traditional segmentation algorithms falter as they cannot accurately mimic the complexity of UAV perspectives, and the cost of obtaining multi-perspective labeled datasets is prohibitive. To address these is…

    Submitted 11 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: IJCAI 2024

  18. arXiv:2406.18379 [pdf, other]

    cs.CR cs.AI cs.SE

    MALSIGHT: Exploring Malicious Source Code and Benign Pseudocode for Iterative Binary Malware Summarization

    Authors: Haolang Lu, Hongrui Peng, Guoshun Nan, Jiaoyang Cui, Cheng Wang, Weifei Jin

    Abstract: Binary malware summarization aims to automatically generate human-readable descriptions of malware behaviors from executable files, facilitating tasks like malware cracking and detection. Previous methods based on Large Language Models (LLMs) have shown great promise. However, they still face significant issues, including poor usability, inaccurate explanations, and incomplete summaries, primarily…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 17 pages, 14 figures

  19. arXiv:2406.16715 [pdf, other]

    cs.LG

    GC4NC: A Benchmark Framework for Graph Condensation on Node Classification with New Insights

    Authors: Shengbo Gong, Juntong Ni, Noveen Sachdeva, Carl Yang, Wei Jin

    Abstract: Graph condensation (GC) is an emerging technique designed to learn a significantly smaller graph that retains the essential information of the original graph. This condensed graph has shown promise in accelerating graph neural networks while preserving performance comparable to those achieved with the original, larger graphs. Additionally, this technique facilitates downstream applications like ne…

    Submitted 6 October, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 22 pages

  20. arXiv:2406.16042 [pdf, other]

    cs.CV

    Pose-dIVE: Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification

    Authors: Inès Hyeonsu Kim, JoungBin Lee, Woojeong Jin, Soowon Son, Kyusun Cho, Junyoung Seo, Min-Seop Kwak, Seokju Cho, JeongYeol Baek, Byeongwon Lee, Seungryong Kim

    Abstract: Person re-identification (Re-ID) often faces challenges due to variations in human poses and camera viewpoints, which significantly affect the appearance of individuals across images. Existing datasets frequently lack diversity and scalability in these aspects, hindering the generalization of Re-ID models to new camera systems. We propose Pose-dIVE, a novel data augmentation approach that incorpor…

    Submitted 15 October, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  21. arXiv:2406.13873 [pdf, other]

    cs.AI

    A Pure Transformer Pretraining Framework on Text-attributed Graphs

    Authors: Yu Song, Haitao Mao, Jiachen Xiao, Jingzhe Liu, Zhikai Chen, Wei Jin, Carl Yang, Jiliang Tang, Hui Liu

    Abstract: Pretraining plays a pivotal role in acquiring generalized knowledge from large-scale data, achieving remarkable successes as evidenced by large models in CV and NLP. However, progress in the graph domain remains limited due to fundamental challenges such as feature heterogeneity and structural heterogeneity. Recently, increasing efforts have been made to enhance node feature quality with Large Lan…

    Submitted 19 June, 2024; originally announced June 2024.

  22. arXiv:2406.10727 [pdf, other]

    cs.LG

    Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights

    Authors: Zhikai Chen, Haitao Mao, Jingzhe Liu, Yu Song, Bingheng Li, Wei Jin, Bahare Fatemi, Anton Tsitsulin, Bryan Perozzi, Hui Liu, Jiliang Tang

    Abstract: Given the ubiquity of graph data and its applications in diverse domains, building a Graph Foundation Model (GFM) that can work well across different graphs and tasks with a unified backbone has recently garnered significant interest. A major obstacle to achieving this goal stems from the fact that graphs from different domains often exhibit diverse node features. Inspired by multi-modal models t…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Preliminary version: if you find any mistakes regarding the evaluation, feel free to contact the first author

  23. arXiv:2406.10475 [pdf, other]

    cs.CV

    Discrete Latent Perspective Learning for Segmentation and Detection

    Authors: Deyi Ji, Feng Zhao, Lanyun Zhu, Wenwei Jin, Hongtao Lu, Jieping Ye

    Abstract: In this paper, we address the challenge of Perspective-Invariant Learning in machine learning and computer vision, which involves enabling a network to understand images from varying perspectives to achieve consistent semantic interpretation. While standard approaches rely on the labor-intensive collection of multi-view images or limited data augmentation techniques, we propose a novel framework,…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: ICML 2024 Spotlight

  24. arXiv:2406.09681 [pdf, other]

    cs.CV

    Asymmetrical Siamese Network for Point Clouds Normal Estimation

    Authors: Wei Jin, Jun Zhou, Nannan Li, Haba Madeline, Xiuping Liu

    Abstract: In recent years, deep learning-based point cloud normal estimation has made great progress. However, existing methods mainly rely on the PCPNet dataset, leading to overfitting. In addition, the correlation between point clouds with different noise scales remains unexplored, resulting in poor performance in cross-domain scenarios. In this paper, we explore the consistency of intrinsic features lear…

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  25. arXiv:2406.06016 [pdf, other]

    cs.LG

    EpiLearn: A Python Library for Machine Learning in Epidemic Modeling

    Authors: Zewen Liu, Yunxiao Li, Mingyang Wei, Guancheng Wan, Max S. Y. Lau, Wei Jin

    Abstract: EpiLearn is a Python toolkit developed for modeling, simulating, and analyzing epidemic data. Although there exist several packages that also deal with epidemic modeling, they are often restricted to mechanistic models or traditional statistical tools. As machine learning continues to shape the world, the gap between these packages and the latest models has become larger. To bridge the gap and ins…

    Submitted 9 September, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  26. arXiv:2405.18768 [pdf, other]

    q-bio.BM cs.LG

    RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching

    Authors: Divya Nori, Wengong Jin

    Abstract: The growing significance of RNA engineering in diverse biological applications has spurred interest in developing AI methods for structure-based RNA design. While diffusion models have excelled in protein design, adapting them for RNA presents new challenges due to RNA's conformational flexibility and the computational cost of fine-tuning large structure prediction models. To this end, we propose…

    Submitted 9 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  27. arXiv:2405.16113 [pdf, other]

    cs.LG

    Enabling On-Device Learning via Experience Replay with Efficient Dataset Condensation

    Authors: Gelei Xu, Ningzhi Tang, Jun Xia, Wei Jin, Yiyu Shi

    Abstract: Upon deployment to edge devices, it is often desirable for a model to further learn from streaming data to improve accuracy. However, extracting representative features from such data is challenging because it is typically unlabeled, non-independent and identically distributed (non-i.i.d.), and is seen only once. To mitigate this issue, a common strategy is to maintain a small data buffer on the ed…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 9 pages, 10 figures

  28. arXiv:2405.09470 [pdf, other]

    cs.SD cs.CR cs.LG eess.AS

    Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer

    Authors: Weifei Jin, Yuxin Cao, Junjie Su, Qi Shen, Kai Ye, Derui Wang, Jie Hao, Ziyao Liu

    Abstract: In light of the widespread application of Automatic Speech Recognition (ASR) systems, their security concerns have received much more attention than ever before, primarily due to the susceptibility of Deep Neural Networks. Previous studies have illustrated that surreptitiously crafting adversarial perturbations enables the manipulation of speech recognition systems, resulting in the production of…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted to SecTL (AsiaCCS Workshop) 2024

  29. arXiv:2405.08205 [pdf, other]

    cs.LG

    Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates

    Authors: Zhenqiao Song, Yunlong Zhao, Wenxian Shi, Wengong Jin, Yang Yang, Lei Li

    Abstract: Enzymes are genetically encoded biocatalysts capable of accelerating chemical reactions. How can we automatically design functional enzymes? In this paper, we propose EnzyGen, an approach to learn a unified model to design enzymes across all functional families. Our key idea is to generate an enzyme's amino acid sequence and its three-dimensional (3D) coordinates based on functionally important…

    Submitted 17 July, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  30. arXiv:2405.06693 [pdf, other]

    q-bio.BM cs.LG

    SurfPro: Functional Protein Design Based on Continuous Surface

    Authors: Zhenqiao Song, Tinglin Huang, Lei Li, Wengong Jin

    Abstract: How can we design proteins with desired functions? We are motivated by a chemical intuition that both geometric structure and biochemical properties are critical to a protein's function. In this paper, we propose SurfPro, a new method to generate functional proteins given a desired surface and its associated biochemical properties. SurfPro comprises a hierarchical encoder that progressively models…

    Submitted 17 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  31. arXiv:2404.16970 [pdf, other]

    cs.NI cs.PF

    CarbonCP: Carbon-Aware DNN Partitioning with Conformal Prediction for Sustainable Edge Intelligence

    Authors: Hongyu Ke, Wanxin Jin, Haoxin Wang

    Abstract: This paper presents a solution to address carbon emission mitigation for end-to-end edge computing systems, including the computing at battery-powered edge devices and servers, as well as the communications between them. We design and implement CarbonCP, a context-adaptive, carbon-aware, and uncertainty-aware AI inference framework built upon conformal prediction theory, which balances operationa…

    Submitted 25 April, 2024; originally announced April 2024.

  32. arXiv:2404.13541 [pdf, other]

    cs.CV

    Generalizable Novel-View Synthesis using a Stereo Camera

    Authors: Haechan Lee, Wonjoon Jin, Seung-Hwan Baek, Sunghyun Cho

    Abstract: In this paper, we propose the first generalizable view synthesis approach that specifically targets multi-view stereo-camera images. Since recent stereo matching has demonstrated accurate geometry prediction, we introduce stereo matching into novel-view synthesis for high-quality geometry reconstruction. To this end, this paper proposes a novel framework, dubbed StereoNeRF, which integrates stereo…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Project page URL: https://jinwonjoon.github.io/stereonerf/

  33. arXiv:2403.19852 [pdf, other]

    cs.LG cs.SI physics.soc-ph q-bio.PE

    A Review of Graph Neural Networks in Epidemic Modeling

    Authors: Zewen Liu, Guancheng Wan, B. Aditya Prakash, Max S. Y. Lau, Wei Jin

    Abstract: Since the onset of the COVID-19 pandemic, there has been a growing interest in studying epidemiological models. Traditional mechanistic models mathematically describe the transmission mechanisms of infectious diseases. However, they often suffer from limitations of oversimplified or fixed assumptions, which could cause sub-optimal predictive power and inefficiency in capturing complex relation inf…

    Submitted 9 September, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  34. arXiv:2403.07932 [pdf, other]

    cs.GT cs.AI

    Feint in Multi-Player Games

    Authors: Junyu Liu, Wangkai Jin, Xiangjun Peng

    Abstract: This paper introduces the first formalization, implementation and quantitative evaluation of Feint in Multi-Player Games. Our work first formalizes Feint from the perspective of Multi-Player Games, in terms of its temporal, spatial, and collective impacts. The formalization is built upon the Non-transitive Active Markov Game Model, where Feint can have a considerable impact. Then, our…

    Submitted 3 March, 2024; originally announced March 2024.

  35. arXiv:2403.07931 [pdf, other]

    cs.GT cs.GR

    Formalizing Feint Actions, and Example Studies in Two-Player Games

    Authors: Junyu Liu, Wangkai Jin, Xiangjun Peng

    Abstract: Feint actions refer to a set of deceptive actions, which enable players to obtain temporal advantages from their opponents. Such actions are regarded as a widely used tactic in most non-deterministic Two-Player Games (e.g. boxing and fencing). However, existing literature does not provide a comprehensive and concrete formalization of Feint actions and their implications for Two-Player Games. We argue…

    Submitted 3 March, 2024; originally announced March 2024.

  36. arXiv:2402.18934 [pdf, other]

    cs.RO

    RELEAD: Resilient Localization with Enhanced LiDAR Odometry in Adverse Environments

    Authors: Zhiqiang Chen, Hongbo Chen, Yuhua Qi, Shipeng Zhong, Dapeng Feng, Wu Jin, Weisong Wen, Ming Liu

    Abstract: LiDAR-based localization is valuable for applications like mining surveys and underground facility maintenance. However, existing methods can struggle when dealing with uninformative geometric structures in challenging scenarios. This paper presents RELEAD, a LiDAR-centric solution designed to address scan-matching degradation. Our method enables degeneracy-free point cloud registration by solving…

    Submitted 15 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Journal ref: published in ICRA 2024

  37. arXiv:2402.18127 [pdf, other]

    cs.LG

    Hierarchical Multi-Relational Graph Representation Learning for Large-Scale Prediction of Drug-Drug Interactions

    Authors: Mengying Jiang, Guizhong Liu, Yuanchao Su, Weiqiang Jin, Biao Zhao

    Abstract: Most existing methods for predicting drug-drug interactions (DDI) predominantly concentrate on capturing the explicit relationships among drugs, overlooking the valuable implicit correlations present between drug pairs (DPs), which leads to weak predictions. To address this issue, this paper introduces a hierarchical multi-relational graph representation learning (HMGRL) approach. Within the frame…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 14 pages, 10 figures

  38. arXiv:2402.16181 [pdf, other]

    cs.LG cs.AI

    How Can LLM Guide RL? A Value-Based Approach

    Authors: Shenao Zhang, Sirui Zheng, Shuqi Ke, Zhihan Liu, Wanxin Jin, Jianbo Yuan, Yingxiang Yang, Hongxia Yang, Zhaoran Wang

    Abstract: Reinforcement learning (RL) has become the de facto standard practice for sequential decision-making problems by improving future acting policies with feedback. However, RL algorithms may require extensive trial-and-error interactions to collect useful feedback for improvement. On the other hand, recent developments in large language models (LLMs) have showcased impressive capabilities in language…

    Submitted 25 February, 2024; originally announced February 2024.

  39. arXiv:2402.13584 [pdf, other]

    cs.CL

    WinoViz: Probing Visual Properties of Objects Under Different States

    Authors: Woojeong Jin, Tejas Srinivasan, Jesse Thomason, Xiang Ren

    Abstract: Humans perceive and comprehend different visual properties of an object based on specific contexts. For instance, we know that a banana turns brown "when it becomes rotten," whereas it appears green "when it is unripe." Previous studies on probing visual commonsense knowledge have primarily focused on examining language models' understanding of typical properties (e.g., colors and shapes) of o…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint

  40. arXiv:2402.09240 [pdf, other]

    cs.LG cs.CV

    Switch EMA: A Free Lunch for Better Flatness and Sharpness

    Authors: Siyuan Li, Zicheng Liu, Juanxi Tian, Ge Wang, Zedong Wang, Weiyang Jin, Di Wu, Cheng Tan, Tao Lin, Yang Liu, Baigui Sun, Stan Z. Li

    Abstract: Exponential Moving Average (EMA) is a widely used weight averaging (WA) regularization to learn flat optima for better generalization without extra cost in deep neural network (DNN) optimization. Despite achieving better flatness, existing WA methods might fall into worse final performance or require extra test-time computations. This work unveils the full potential of EMA with a single line of…

    Submitted 6 October, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Preprint V2. Source code and models at https://github.com/Westlake-AI/SEMA
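
    Illustration (not from the paper): the abstract above refers to Exponential Moving Average (EMA) weight averaging. A minimal sketch of the standard EMA update is shown below, assuming weights stored as a flat list of floats and a hypothetical helper name `ema_update`. It shows plain EMA only; the Switch EMA modification itself is cut off in the truncated abstract and is not reproduced here.

    ```python
    from typing import List


    def ema_update(ema_weights: List[float], weights: List[float],
                   decay: float = 0.99) -> List[float]:
        """Return the standard EMA of model weights after one training step.

        Hypothetical helper for illustration: each averaged weight moves a
        fraction (1 - decay) of the way toward the current weight.
        """
        if len(ema_weights) != len(weights):
            raise ValueError("weight lists must have the same length")
        return [decay * e + (1.0 - decay) * w
                for e, w in zip(ema_weights, weights)]


    if __name__ == "__main__":
        print(ema_update([1.0, 2.0], [3.0, 4.0], decay=0.5))
    ```

    In typical use the averaged copy is updated after every optimizer step and evaluated in place of the raw weights.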

  41. arXiv:2402.08921 [pdf, other]

    cs.IR cs.AI

    Enhancing ID and Text Fusion via Alternative Training in Session-based Recommendation

    Authors: Juanhui Li, Haoyu Han, Zhikai Chen, Harry Shomer, Wei Jin, Amin Javari, Jiliang Tang

    Abstract: Session-based recommendation has gained increasing attention in recent years, with its aim to offer tailored suggestions based on users' historical behaviors within sessions. To advance this field, a variety of methods have been developed, with ID-based approaches typically demonstrating promising performance. However, these methods often face challenges with long-tail items and overlook other r…

    Submitted 13 February, 2024; originally announced February 2024.

  42. arXiv:2402.08228 [pdf, other]

    cs.LG cs.AI

    Investigating Out-of-Distribution Generalization of GNNs: An Architecture Perspective

    Authors: Kai Guo, Hongzhi Wen, Wei Jin, Yaming Guo, Jiliang Tang, Yi Chang

    Abstract: Graph neural networks (GNNs) have exhibited remarkable performance under the assumption that test data comes from the same distribution as the training data. However, in real-world scenarios, this assumption may not always be valid. Consequently, there is a growing focus on exploring the Out-of-Distribution (OOD) problem in the context of graphs. Most existing efforts have primarily concentrated on im…

    Submitted 14 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  43. arXiv:2402.05011 [pdf, other]

    cs.LG

    Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching

    Authors: Yuchen Zhang, Tianle Zhang, Kai Wang, Ziyao Guo, Yuxuan Liang, Xavier Bresson, Wei Jin, Yang You

    Abstract: Graph condensation aims to reduce the size of a large-scale graph dataset by synthesizing a compact counterpart without sacrificing the performance of Graph Neural Networks (GNNs) trained on it, which has shed light on reducing the computational cost for training GNNs. Nevertheless, existing methods often fall short of accurately replicating the original graph for certain datasets, thereby failing…

    Submitted 18 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Lossless graph condensation method

  44. arXiv:2402.03578  [pdf, other]

    cs.MA cs.AI

    LLM Multi-Agent Systems: Challenges and Open Problems

    Authors: Shanshan Han, Qifan Zhang, Yuhang Yao, Weizhao Jin, Zhaozhuo Xu, Chaoyang He

    Abstract: This paper explores existing works of multi-agent systems and identifies challenges that remain inadequately addressed. By leveraging the diverse capabilities and roles of individual agents within a multi-agent system, these systems can tackle complex tasks through collaboration. We discuss optimizing task allocation, fostering robust reasoning through iterative debates, managing complex and layer…

    Submitted 5 February, 2024; originally announced February 2024.

  45. arXiv:2402.03358  [pdf, other]

    cs.SI cs.AI cs.DS cs.LG

    A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation

    Authors: Mohammad Hashemi, Shengbo Gong, Juntong Ni, Wenqi Fan, B. Aditya Prakash, Wei Jin

    Abstract: Many real-world datasets can be naturally represented as graphs, spanning a wide range of domains. However, the increasing complexity and size of graph datasets present significant challenges for analysis and computation. In response, graph reduction, or graph summarization, has gained prominence for simplifying large graphs while preserving essential properties. In this survey, we aim to provide…

    Submitted 29 June, 2024; v1 submitted 28 January, 2024; originally announced February 2024.

    Comments: Accepted by IJCAI 2024 (This ArXiv version is a long version of our IJCAI paper)

  46. arXiv:2402.01943  [pdf, other]

    cs.LG

    Precedence-Constrained Winter Value for Effective Graph Data Valuation

    Authors: Hongliang Chi, Wei Jin, Charu Aggarwal, Yao Ma

    Abstract: Data valuation is essential for quantifying data's worth, aiding in assessing data quality and determining fair compensation. While existing data valuation methods have proven effective in evaluating the value of Euclidean data, they face limitations when applied to the increasingly popular graph-structured data. Particularly, graph data valuation introduces unique challenges, primarily stemming f…

    Submitted 8 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 17 pages in total

  47. arXiv:2401.09700  [pdf, ps, other]

    cs.DS

    Fully Dynamic Min-Cut of Superconstant Size in Subpolynomial Time

    Authors: Wenyu Jin, Xiaorui Sun, Mikkel Thorup

    Abstract: We present a deterministic fully dynamic algorithm with subpolynomial worst-case time per graph update such that after processing each update of the graph, the algorithm outputs a minimum cut of the graph if the graph has a cut of size at most $c$ for some $c = (\log n)^{o(1)}$. Previously, the best update time was $\widetilde O(\sqrt{n})$ for any $c > 2$ and $c = O(\log n)$ [Thorup, Combinatorica…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: SODA 2024

  48. arXiv:2401.08345  [pdf, other]

    cs.CV

    Multi-view Distillation based on Multi-modal Fusion for Few-shot Action Recognition(CLIP-$\mathrm{M^2}$DF)

    Authors: Fei Guo, YiKang Wang, Han Qi, WenPing Jin, Li Zhu

    Abstract: In recent years, few-shot action recognition has attracted increasing attention. It generally adopts the paradigm of meta-learning. In this field, overcoming the overlapping distribution of classes and outliers is still a challenging problem based on limited samples. We believe the combination of Multi-modal and Multi-view can improve this issue depending on information complementarity. Therefore,…

    Submitted 16 January, 2024; originally announced January 2024.

  49. arXiv:2401.05739  [pdf, other]

    cs.SE cs.CR

    Cross-Inlining Binary Function Similarity Detection

    Authors: Ang Jia, Ming Fan, Xi Xu, Wuxia Jin, Haijun Wang, Ting Liu

    Abstract: Binary function similarity detection plays an important role in a wide range of security applications. Existing works usually assume that the query function and target function share equal semantics and compare their full semantics to obtain the similarity. However, we find that the function mapping is more complex, especially when function inlining happens. In this paper, we will systematically…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted at ICSE 2024 (Second Cycle). Camera-ready version

  50. arXiv:2401.02339  [pdf]

    cs.RO

    How Do Pedestrians' Perception Change toward Autonomous Vehicles during Unmarked Midblock Multilane Crossings: Role of AV Operation and Signal Indication

    Authors: Fengjiao Zou, Jennifer Harper Ogle, Patrick Gerard, Weimin Jin

    Abstract: One of the primary impediments hindering the widespread acceptance of autonomous vehicles (AVs) among pedestrians is their limited comprehension of AVs. This study employs virtual reality (VR) to provide pedestrians with an immersive environment for engaging with and comprehending AVs during unmarked midblock multilane crossings. Diverse AV driving behaviors were modeled to exhibit negotiation beh…

    Submitted 4 January, 2024; originally announced January 2024.