Skip to main content

Showing 1–24 of 24 results for author: Zhan, S

  1. arXiv:2410.12032  [pdf, other

    cs.AR cs.DC cs.LG

    MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from μWatts to MWatts for Sustainable AI

    Authors: Arya Tschand, Arun Tejusve Raghunath Rajan, Sachin Idgunji, Anirban Ghosh, Jeremy Holleman, Csaba Kiraly, Pawan Ambalkar, Ritika Borkar, Ramesh Chukka, Trevor Cockrell, Oliver Curtis, Grigori Fursin, Miro Hodak, Hiwot Kassa, Anton Lokhmotov, Dejan Miskovic, Yuechao Pan, Manu Prasad Manmathan, Liz Raymond, Tom St. John, Arjun Suresh, Rowan Taubitz, Sean Zhan, Scott Wasson, David Kanter , et al. (1 additional authors not shown)

    Abstract: Rapid adoption of machine learning (ML) technologies has led to a surge in power consumption across diverse systems, from tiny IoT devices to massive datacenter clusters. Benchmarking the energy efficiency of these systems is crucial for optimization, but presents novel challenges due to the variety of hardware platforms, workload characteristics, and system-level interactions. This paper introduc… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 14 pages, 11 figures, 1 table

  2. arXiv:2410.03847  [pdf, other

    cs.LG cs.AI

    Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments

    Authors: Simon Sinong Zhan, Qingyuan Wu, Philip Wang, Yixuan Wang, Ruochen Jiao, Chao Huang, Qi Zhu

    Abstract: In this paper, we aim to tackle the limitation of the Adversarial Inverse Reinforcement Learning (AIRL) method in stochastic environments where theoretical results cannot hold and performance is degraded. To address this issue, we propose a novel method which infuses the dynamics information into the reward shaping with the theoretical guarantee for the induced optimal policy in the stochastic env… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  3. arXiv:2409.15763  [pdf, other

    cs.IR cs.AI

    IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios

    Authors: Hai Lin, Shaoxiong Zhan, Junyou Su, Haitao Zheng, Hui Wang

    Abstract: In Retrieval-Augmented Generation (RAG) tasks using Large Language Models (LLMs), the quality of retrieved information is critical to the final output. This paper introduces the IRSC benchmark for evaluating the performance of embedding models in multilingual RAG tasks. The benchmark encompasses five retrieval tasks: query retrieval, title retrieval, part-of-paragraph retrieval, keyword retrieval,… ▽ More

    Submitted 26 September, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

  4. arXiv:2409.07202  [pdf, other

    cs.LG cs.AI

    Heterogeneity-Aware Coordination for Federated Learning via Stitching Pre-trained blocks

    Authors: Shichen Zhan, Yebo Wu, Chunlin Tian, Yan Zhao, Li Li

    Abstract: Federated learning (FL) coordinates multiple devices to collaboratively train a shared model while preserving data privacy. However, large memory footprint and high energy consumption during the training process excludes the low-end devices from contributing to the global model with their own data, which severely deteriorates the model performance in real-world scenarios. In this paper, we propose… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Journal ref: 2024 IEEE/ACM International Symposium on Quality of Service (IWQoS)

  5. arXiv:2409.04978  [pdf, other

    cs.CV

    Time-independent Spiking Neuron via Membrane Potential Estimation for Efficient Spiking Neural Networks

    Authors: Hanqi Chen, Lixing Yu, Shaojie Zhan, Penghui Yao, Jiankun Shao

    Abstract: The computational inefficiency of spiking neural networks (SNNs) is primarily due to the sequential updates of membrane potential, which becomes more pronounced during extended encoding periods compared to artificial neural networks (ANNs). This highlights the need to parallelize SNN computations effectively to leverage available hardware parallelism. To address this, we propose Membrane Potential… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

  6. arXiv:2409.04867  [pdf, other

    cs.CV

    Fine-Grained Representation Learning via Multi-Level Contrastive Learning without Class Priors

    Authors: Houwang Jiang, Zhuxian Liu, Guodong Liu, Xiaolong Liu, Shihua Zhan

    Abstract: Recent advances in unsupervised representation learning often rely on knowing the number of classes to improve feature extraction and clustering. However, this assumption raises an important question: is the number of classes always necessary, and do class labels fully capture the fine-grained features within the data? In this paper, we propose Contrastive Disentangling (CD), a framework designed… ▽ More

    Submitted 23 September, 2024; v1 submitted 7 September, 2024; originally announced September 2024.

  7. arXiv:2408.13473  [pdf, other

    cs.CL

    Why Antiwork: A RoBERTa-Based System for Work-Related Stress Identification and Leading Factor Analysis

    Authors: Tao Lu, Muzhe Wu, Xinyi Lu, Siyuan Xu, Shuyu Zhan, Anuj Tambwekar, Emily Mower Provost

    Abstract: Harsh working environments and work-related stress have been known to contribute to mental health problems such as anxiety, depression, and suicidal ideation. As such, it is paramount to create solutions that can both detect employee unhappiness and find the root cause of the problem. While prior works have examined causes of mental health using machine learning, they typically focus on general me… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 13 pages, 8 figures

  8. arXiv:2408.12569  [pdf, other

    cs.CV

    Sapiens: Foundation for Human Vision Models

    Authors: Rawal Khirodkar, Timur Bagautdinov, Julieta Martinez, Su Zhaoen, Austin James, Peter Selednik, Stuart Anderson, Shunsuke Saito

    Abstract: We present Sapiens, a family of models for four fundamental human-centric vision tasks -- 2D pose estimation, body-part segmentation, depth estimation, and surface normal prediction. Our models natively support 1K high-resolution inference and are extremely easy to adapt for individual tasks by simply fine-tuning models pretrained on over 300 million in-the-wild human images. We observe that, give… ▽ More

    Submitted 26 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

    Comments: ECCV 2024 (Oral)

  9. arXiv:2408.08592  [pdf, other

    cs.RO

    Case Study: Runtime Safety Verification of Neural Network Controlled System

    Authors: Frank Yang, Sinong Simon Zhan, Yixuan Wang, Chao Huang, Qi Zhu

    Abstract: Neural networks are increasingly used in safety-critical applications such as robotics and autonomous vehicles. However, the deployment of neural-network-controlled systems (NNCSs) raises significant safety concerns. Many recent advances overlook critical aspects of verifying control and ensuring safety in real-time scenarios. This paper presents a case study on using POLAR-Express, a state-of-the… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 15 pages, 5 figures, submitted to Runtime Verification 2024

  10. arXiv:2407.20207  [pdf, other

    cs.CL cs.AI cs.IR

    QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval

    Authors: Hongming Tan, Shaoxiong Zhan, Hai Lin, Hai-Tao Zheng, Wai Kin, Chan

    Abstract: In dense retrieval, embedding long texts into dense vectors can result in information loss, leading to inaccurate query-text matching. Additionally, low-quality texts with excessive noise or sparse key information are unlikely to align well with relevant queries. Recent studies mainly focus on improving the sentence embedding model or retrieval process. In this work, we introduce a novel text augm… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  11. arXiv:2406.16588  [pdf, other

    eess.SY cs.FL

    Switching Controller Synthesis for Hybrid Systems Against STL Formulas

    Authors: Han Su, Shenghua Feng, Sinong Zhan, Naijun Zhan

    Abstract: Switching controllers play a pivotal role in directing hybrid systems (HSs) towards the desired objective, embodying a ``correct-by-construction'' approach to HS design. Identifying these objectives is thus crucial for the synthesis of effective switching controllers. While most of existing works focus on safety and liveness, few of them consider timing constraints. In this paper, we delves into t… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  12. arXiv:2406.08785  [pdf, other

    cs.CV

    BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection

    Authors: Wenjie Wang, Yehao Lu, Guangcong Zheng, Shuigen Zhan, Xiaoqing Ye, Zichang Tan, Jingdong Wang, Gaoang Wang, Xi Li

    Abstract: Vision-based roadside 3D object detection has attracted rising attention in autonomous driving domain, since it encompasses inherent advantages in reducing blind spots and expanding perception range. While previous work mainly focuses on accurately estimating depth or height for 2D-to-3D mapping, ignoring the position approximation error in the voxel pooling process. Inspired by this insight, we p… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  13. arXiv:2405.14226  [pdf, other

    cs.LG cs.AI

    Variational Delayed Policy Optimization

    Authors: Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang

    Abstract: In environments with delayed observation, state augmentation by including actions within the delay window is adopted to retrieve Markovian property to enable reinforcement learning (RL). However, state-of-the-art (SOTA) RL techniques with Temporal-Difference (TD) learning frameworks often suffer from learning inefficiency, due to the significant expansion of the augmented state space with the dela… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  14. arXiv:2403.16132  [pdf, ps, other

    eess.SY cs.LG

    Runtime Monitoring and Fault Detection for Neural Network-Controlled Systems

    Authors: Jianglin Lan, Siyuan Zhan, Ron Patton, Xianxian Zhao

    Abstract: There is an emerging trend in applying deep learning methods to control complex nonlinear systems. This paper considers enhancing the runtime safety of nonlinear systems controlled by neural networks in the presence of disturbance and measurement noise. A robustly stable interval observer is designed to generate sound and precise lower and upper bounds for the neural network, nonlinear function, a… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to SAFEPROCESS 2024

  15. arXiv:2402.03141  [pdf, other

    cs.LG cs.AI eess.SY

    Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays

    Authors: Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Jürgen Schmidhuber, Chao Huang

    Abstract: Reinforcement learning (RL) is challenging in the common case of delays between events and their sensory perceptions. State-of-the-art (SOTA) state augmentation techniques either suffer from state space explosion or performance degeneration in stochastic environments. To address these challenges, we present a novel Auxiliary-Delayed Reinforcement Learning (AD-RL) method that leverages auxiliary ta… ▽ More

    Submitted 5 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  16. arXiv:2312.00812  [pdf, other

    cs.AI cs.LG eess.SY

    Empowering Autonomous Driving with Large Language Models: A Safety Perspective

    Authors: Yixuan Wang, Ruochen Jiao, Sinong Simon Zhan, Chengtian Lang, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu

    Abstract: Autonomous Driving (AD) encounters significant safety hurdles in long-tail unforeseen driving scenarios, largely stemming from the non-interpretability and poor generalization of the deep neural networks within the AD system, particularly in out-of-distribution and uncertain data. To this end, this paper explores the integration of Large Language Models (LLMs) into AD systems, leveraging their rob… ▽ More

    Submitted 22 March, 2024; v1 submitted 27 November, 2023; originally announced December 2023.

    Comments: Accepted to LLMAgent workshop @ICLR2024

  17. arXiv:2311.02227  [pdf, other

    cs.LG cs.AI eess.SY

    State-Wise Safe Reinforcement Learning With Pixel Observations

    Authors: Simon Sinong Zhan, Yixuan Wang, Qingyuan Wu, Ruochen Jiao, Chao Huang, Qi Zhu

    Abstract: In the context of safe exploration, Reinforcement Learning (RL) has long grappled with the challenges of balancing the tradeoff between maximizing rewards and minimizing safety violations, particularly in complex environments with contact-rich or non-smooth dynamics, and when dealing with high-dimensional pixel observations. Furthermore, incorporating state-wise safety constraints in the explorati… ▽ More

    Submitted 11 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  18. arXiv:2305.10307  [pdf, other

    cs.CL

    FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy

    Authors: Zuhao Yang, Yingfang Yuan, Yang Xu, Shuo Zhan, Huajun Bai, Kefan Chen

    Abstract: Measuring the distance between machine-produced and human language is a critical open problem. Inspired by empirical findings from psycholinguistics on the periodicity of entropy in language, we propose FACE, a set of metrics based on Fourier Analysis of the estimated Cross-Entropy of language, for measuring the similarity between model-generated and human-written languages. Based on an open-ended… ▽ More

    Submitted 25 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  19. arXiv:2209.15090  [pdf, other

    eess.SY cs.LG

    Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

    Authors: Yixuan Wang, Simon Sinong Zhan, Ruochen Jiao, Zhilu Wang, Wanxin Jin, Zhuoran Yang, Zhaoran Wang, Chao Huang, Qi Zhu

    Abstract: It is quite challenging to ensure the safety of reinforcement learning (RL) agents in an unknown and stochastic environment under hard constraints that require the system state not to reach certain specified unsafe regions. Many popular safe RL methods such as those based on the Constrained Markov Decision Process (CMDP) paradigm formulate safety violations in a cost function and try to constrain… ▽ More

    Submitted 13 June, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted to ICML 2023

  20. arXiv:2209.00798  [pdf, other

    cs.CV cs.GR

    PCDNF: Revisiting Learning-based Point Cloud Denoising via Joint Normal Filtering

    Authors: Zheng Liu, Yaowu Zhao, Sijing Zhan, Yuanyuan Liu, Renjie Chen, Ying He

    Abstract: Recovering high quality surfaces from noisy point clouds, known as point cloud denoising, is a fundamental yet challenging problem in geometry processing. Most of the existing methods either directly denoise the noisy input or filter raw normals followed by updating point positions. Motivated by the essential interplay between point cloud denoising and normal filtering, we revisit point cloud deno… ▽ More

    Submitted 3 July, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

  21. arXiv:2201.12243  [pdf, other

    cs.LG eess.SY

    Joint Differentiable Optimization and Verification for Certified Reinforcement Learning

    Authors: Yixuan Wang, Simon Zhan, Zhilu Wang, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu

    Abstract: In model-based reinforcement learning for safety-critical control systems, it is important to formally certify system properties (e.g., safety, stability) under the learned controller. However, as existing methods typically apply formal verification \emph{after} the controller has been learned, it is sometimes difficult to obtain any certificate, even after many iterations between learning and ver… ▽ More

    Submitted 21 March, 2023; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: This paper is accepted to International Conference on Cyber-Physical Systems

  22. arXiv:2001.00191  [pdf, ps, other

    cs.LG eess.SP stat.ML

    Ensemble emotion recognizing with multiple modal physiological signals

    Authors: Jing Zhang, Yong Zhang, Suhua Zhan, Cheng Cheng

    Abstract: Physiological signals that provide the objective repression of human affective states are attracted increasing attention in the emotion recognition field. However, the single signal is difficult to obtain completely and accurately description for emotion. Multiple physiological signals fusing models, building the uniform classification model by means of consistent and complementary information fro… ▽ More

    Submitted 1 January, 2020; originally announced January 2020.

    Comments: under review for Multimedia tools and applications

  23. PCS: Predictive Component-level Scheduling for Reducing Tail Latency in Cloud Online Services

    Authors: Rui Han, Junwei Wang, Siguang Huang, Chenrong Shao, Shulin Zhan, Jianfeng Zhan, Jose Luis Vazquez-Poletti

    Abstract: Modern latency-critical online services often rely on composing results from a large number of server components. Hence the tail latency (e.g. the 99th percentile of response time), rather than the average, of these components determines the overall service performance. When hosted on a cloud environment, the components of a service typically co-locate with short batch jobs to increase machine uti… ▽ More

    Submitted 9 November, 2015; originally announced November 2015.

    Comments: 10 pages, 9 figures, ICPP conference

  24. arXiv:1504.02205  [pdf, other

    cs.DC

    BigDataBench-MT: A Benchmark Tool for Generating Realistic Mixed Data Center Workloads

    Authors: Rui Han, Shulin Zhan, Chenrong Shao, Junwei Wang, Lizy K. John, Jiangtao Xu, Gang Lu, Lei Wang

    Abstract: Long-running service workloads (e.g. web search engine) and short-term data analysis workloads (e.g. Hadoop MapReduce jobs) co-locate in today's data centers. Developing realistic benchmarks to reflect such practical scenario of mixed workload is a key problem to produce trustworthy results when evaluating and comparing data center systems. This requires using actual workloads as well as guarantee… ▽ More

    Submitted 4 December, 2015; v1 submitted 9 April, 2015; originally announced April 2015.

    Comments: 12 pages, 5 figures