Skip to main content

Showing 1–50 of 252 results for author: Wei, P

  1. arXiv:2410.15748  [pdf, other

    cs.AI

    Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation

    Authors: Shaonan Wu, Shuai Lu, Yeyun Gong, Nan Duan, Ping Wei

    Abstract: Formal proofs are challenging to write even for experienced experts. Recent progress in Neural Theorem Proving (NTP) shows promise in expediting this process. However, the formal corpora available on the Internet are limited compared to the general text, posing a significant data scarcity challenge for NTP. To address this issue, this work proposes Alchemy, a general framework for data synthesis t… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2410.12634  [pdf, other

    hep-ph quant-ph

    Exploring Quantum Aspects of Dark Matter Axions and Dark Photons Transitions within a Resonant Cavity

    Authors: Ruifeng Zheng, Puxian Wei, Qiaoli Yang

    Abstract: When axionic dark matter interacts with a static magnetic field, it can convert into photons with energy near the axion's mass. Classical analysis shows that incorporating a resonant cavity significantly enhances this conversion rate, forming the basis for many experiments aimed at detecting dark matter axions. However, one question remains: Does the axion-photon conversion rate increase for a sin… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 7 pages, 1 figure

  3. arXiv:2409.13485  [pdf, ps, other

    math.NA math.AP

    Analysis of any order Runge-Kutta Spectral Volume Schemes for 1D Hyperbolic Equations

    Authors: Ping Wei, Qing-Song Zou

    Abstract: In this paper, we analyze any-order Runge-Kutta spectral volume schemes (RKSV(s,k)) for solving the one-dimensional scalar hyperbolic equation. The RKSV(s,k) was constructed by using the $s$-th explicit Runge-Kutta method in time-discretization which has {\it strong-stability-preserving} (SSP) property, and by letting a piecewise $k-$th degree($k\geq 1 $ is an arbitrary integer) polynomial satisfy… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  4. arXiv:2409.13271  [pdf

    cond-mat.mtrl-sci

    Effects of residual stress on the isothermal tensile behavior of nanocrystalline superelastic NiTi shape memory alloy

    Authors: Kai Yan, Pengbo Wei, Weifeng He, Qingping Sun

    Abstract: The residual stress greatly affects the mechanical behavior of a material. In this work, the effect of residual stress on the isothermal tensile behavior of a NiTi shape memory alloy is studied. The focused ion beam and digital image correlation are combined to measure the two-dimensional residual stress in nanocrystalline NiTi plates processed with prestrain laser shock peening. A four-point bend… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: 43 pages, 20 figures

  5. arXiv:2409.02416  [pdf, other

    cs.LG stat.ML

    Relative-Translation Invariant Wasserstein Distance

    Authors: Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei

    Abstract: We introduce a new family of distances, relative-translation invariant Wasserstein distances ($RW_p$), for measuring the similarity of two probability distributions under distribution shift. Generalizing it from the classical optimal transport model, we show that $RW_p$ distances are also real distance metrics defined on the quotient set $\mathcal{P}_p(\mathbb{R}^n)/\sim$ and invariant to distribu… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  6. arXiv:2409.01478  [pdf, other

    q-fin.MF

    Irreversible investment under weighted discounting: effects of decreasing impatience

    Authors: Pengyu Wei, Wei Wei

    Abstract: This paper employs an intra-personal game-theoretic framework to investigate how decreasing impatience influences irreversible investment behaviors in a continuous-time setting. We consider a capacity expansion problem under weighted discount functions, a class of nonexponential functions that exhibit decreasing impatience, including the hyperbolic discount function as a special case. By deriving… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  7. arXiv:2408.16146  [pdf

    cond-mat.supr-con quant-ph

    Signatures of a Spin-Active Interface and Locally Enhanced Zeeman field in a Superconductor-Chiral Material Heterostructure

    Authors: Cliff Chen, Jason Tran, Anthony McFadden, Raymond Simmonds, Keisuke Saito, En-De Chu, Daniel Morales, Varrick Suezaki, Yasen Hou, Joe Aumentado, Patrick A. Lee, Jagadeesh S. Moodera, Peng Wei

    Abstract: A localized Zeeman field, intensified at heterostructure interfaces, could play a crucial role in a broad area including spintronics and unconventional superconductors. Conventionally, the generation of a local Zeeman field is achieved through magnetic exchange coupling with a magnetic material. However, magnetic elements often introduce defects, which could weaken or destroy superconductivity. Al… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 27 pages, 11 figures

    Journal ref: Science Advances 10, eado4875 (2024)

  8. The Velocity Aberration Effect of the CSST Main Survey Camera

    Authors: Hui-Mei Feng, Zi-Huang Cao, Man I Lam, Ran Li, Hao Tian, Xin Zhang, Peng Wei, Xin-Feng Li, Wei Wang, Hugh R. A. Jones, Mao-Yuan Liu, Chao Liu

    Abstract: In this study, we conducted simulations to find the geometric aberrations expected for images taken by the Main Survey Camera (MSC) of the Chinese Space Station Telescope (CSST) due to its motion. As anticipated by previous work, our findings indicate that the geometric distortion of light impacts the focal plane's apparent scale, with a more pronounced influence as the size of the focal plane inc… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: 13 pages, 8 figures; accepted by RAA

  9. arXiv:2408.12579  [pdf, other

    cs.CL cs.AI cs.HC cs.IR cs.LG

    RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment

    Authors: Xiaohan Wang, Xiaoyan Yang, Yuqi Zhu, Yue Shen, Jian Wang, Peng Wei, Lei Liang, Jinjie Gu, Huajun Chen, Ningyu Zhang

    Abstract: Large Language Models (LLMs) like GPT-4, MedPaLM-2, and Med-Gemini achieve performance competitively with human experts across various medical benchmarks. However, they still face challenges in making professional diagnoses akin to physicians, particularly in efficiently gathering patient information and reasoning the final diagnosis. To this end, we introduce the RuleAlign framework, designed to… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Ongoing work

  10. arXiv:2408.10760  [pdf, other

    cs.CV cs.AI

    SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection

    Authors: Huafeng Chen, Pengxu Wei, Guangqian Guo, Shan Gao

    Abstract: Most Camouflaged Object Detection (COD) methods heavily rely on mask annotations, which are time-consuming and labor-intensive to acquire. Existing weakly-supervised COD approaches exhibit significantly inferior performance compared to fully-supervised methods and struggle to simultaneously support all the existing types of camouflaged object labels, including scribbles, bounding boxes, and points… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted by ECCV2024

  11. arXiv:2408.10710  [pdf, other

    cs.CV cs.AI

    Coarse-to-Fine Detection of Multiple Seams for Robotic Welding

    Authors: Pengkun Wei, Shuo Cheng, Dayou Li, Ran Song, Yipeng Zhang, Wei Zhang

    Abstract: Efficiently detecting target weld seams while ensuring sub-millimeter accuracy has always been an important challenge in autonomous welding, which has significant application in industrial practice. Previous works mostly focused on recognizing and localizing welding seams one by one, leading to inferior efficiency in modeling the workpiece. This paper proposes a novel framework capable of multiple… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  12. arXiv:2408.08833  [pdf, other

    eess.SP

    Intra-symbol Differential Amplitude Shift Keying-aided Blind Detector for Ambient Backscatter Communication Systems

    Authors: Shuaijun Ma, Peng Wei, Sa Xiao, Jianquan Wang, Wanbin Tang, Wei Xiang

    Abstract: Ambient backscatter communications (AmBC) are a promising technology for addressing the energy consumption challenge in wireless communications through the reflection or absorption of surrounding radio frequency (RF) signals. However, it grapples with the intricacies of ambient RF signal and the round-trip path loss. For traditional detectors, the incorporation of pilot sequences results in a redu… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  13. arXiv:2408.07037  [pdf, other

    cs.CV cs.AI

    PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology

    Authors: Xiaomin Wu, Rui Xu, Pengchen Wei, Wenkang Qin, Peixiang Huang, Ziheng Li, Lin Luo

    Abstract: Pathological diagnosis remains the definitive standard for identifying tumors. The rise of multimodal large models has simplified the process of integrating image analysis with textual descriptions. Despite this advancement, the substantial costs associated with training and deploying these complex multimodal models, together with a scarcity of high-quality training datasets, create a significant… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 10 pages, 2 figures

  14. arXiv:2407.16600  [pdf, other

    cs.CV

    DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene

    Authors: Xi Shi, Lingli Chen, Peng Wei, Xi Wu, Tian Jiang, Yonggang Luo, Lecheng Xie

    Abstract: Existing Gaussian splatting methods often fall short in achieving satisfactory novel view synthesis in driving scenes, primarily due to the absence of crafty designs and geometric constraints for the involved elements. This paper introduces a novel neural rendering method termed Decoupled Hybrid Gaussian Splatting (DHGS), targeting at promoting the rendering quality of novel view synthesis for sta… ▽ More

    Submitted 17 August, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: 13 pages, 14 figures, conference

  15. arXiv:2407.11699  [pdf, other

    cs.CV

    Relation DETR: Exploring Explicit Position Relation Prior for Object Detection

    Authors: Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen, Xuguang Lan

    Abstract: This paper presents a general scheme for enhancing the convergence and performance of DETR (DEtection TRansformer). We investigate the slow convergence problem in transformers from a new perspective, suggesting that it arises from the self-attention that introduces no structural bias over inputs. To address this issue, we explore incorporating position relation prior as attention bias to augment o… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  16. Sampling and active learning methods for network reliability estimation using K-terminal spanning tree

    Authors: Chen Ding, Pengfei Wei, Yan Shi, Jinxing Liu, Matteo Broggi, Michael Beer

    Abstract: Network reliability analysis remains a challenge due to the increasing size and complexity of networks. This paper presents a novel sampling method and an active learning method for efficient and accurate network reliability estimation under node failure and edge failure scenarios. The proposed sampling method adopts Monte Carlo technique to sample component lifetimes and the K-terminal spanning t… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Journal ref: Reliability Engineering & System Safety (2024) 110309

  17. arXiv:2406.16564  [pdf, other

    cs.CV

    FASTC: A Fast Attentional Framework for Semantic Traversability Classification Using Point Cloud

    Authors: Yirui Chen, Pengjin Wei, Zhenhuan Liu, Bingchao Wang, Jie Yang, Wei Liu

    Abstract: Producing traversability maps and understanding the surroundings are crucial prerequisites for autonomous navigation. In this paper, we address the problem of traversability assessment using point clouds. We propose a novel pillar feature extraction module that utilizes PointNet to capture features from point clouds organized in vertical volume and a 2D encoder-decoder structure to conduct travers… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted to ECAI2023 Our code is publicly available at [this](https://github.com/chenyirui/FASTC)

  18. arXiv:2406.14282  [pdf, other

    cs.CL cs.AI

    Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

    Authors: Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, Jinjie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen

    Abstract: Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fin… ▽ More

    Submitted 9 October, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: EMNLP2024 Findings

  19. arXiv:2406.12012  [pdf, other

    cond-mat.supr-con

    Highly Efficient Superconducting Diodes and Rectifiers for Quantum Circuitry

    Authors: Josep Ingla-Aynés, Yasen Hou, Sarah Wang, En-De Chu, Oleg A. Mukhanov, Peng Wei, Jagadeesh S. Moodera

    Abstract: Superconducting electronics is essential for energy-efficient quantum and classical high-end computing applications. Towards this goal, non-reciprocal superconducting circuit elements, such as superconducting diodes (SDs) can fulfill many critical needs. SDs have been the subject of multiple studies, but integrating several SDs in a superconducting circuit remains a challenge. Here we implement th… ▽ More

    Submitted 21 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 3 figures

  20. arXiv:2406.03712  [pdf, other

    cs.CL cs.LG

    A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions

    Authors: Lei Liu, Xiaoyan Yang, Junchi Lei, Xiaoyang Liu, Yue Shen, Zhiqiang Zhang, Peng Wei, Jinjie Gu, Zhixuan Chu, Zhan Qin, Kui Ren

    Abstract: Large language models (LLMs), such as GPT series models, have received substantial attention due to their impressive capabilities for generating and understanding human-level language. More recently, LLMs have emerged as an innovative and powerful adjunct in the medical field, transforming traditional practices and heralding a new era of enhanced healthcare services. This survey provides a compreh… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  21. arXiv:2406.00632  [pdf, other

    cs.CV

    Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

    Authors: Yukai Shi, Yupei Lin, Pengxu Wei, Xiaoyu Xian, Tianshui Chen, Liang Lin

    Abstract: Recently, researchers have proposed various deep learning methods to accurately detect infrared targets with the characteristics of indistinct shape and texture. Due to the limited variety of infrared datasets, training deep learning models with good generalization poses a challenge. To augment the infrared dataset, researchers employ data augmentation techniques, which often involve generating ne… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  22. arXiv:2405.11459  [pdf, other

    eess.SP cs.CL q-bio.NC

    Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals

    Authors: Hui Zheng, Hai-Teng Wang, Wei-Bang Jiang, Zhong-Tao Chen, Li He, Pei-Yang Lin, Peng-Hu Wei, Guo-Guang Zhao, Yun-Zhe Liu

    Abstract: Invasive brain-computer interfaces with Electrocorticography (ECoG) have shown promise for high-performance speech decoding in medical applications, but less damaging methods like intracranial stereo-electroencephalography (sEEG) remain underexplored. With rapid advances in representation learning, leveraging abundant recordings to enhance speech decoding is increasingly attractive. However, popul… ▽ More

    Submitted 21 October, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

  23. arXiv:2405.11392  [pdf, ps, other

    q-fin.MF q-fin.CP

    Deep Penalty Methods: A Class of Deep Learning Algorithms for Solving High Dimensional Optimal Stopping Problems

    Authors: Yunfei Peng, Pengyu Wei, Wei Wei

    Abstract: We propose a deep learning algorithm for high dimensional optimal stopping problems. Our method is inspired by the penalty method for solving free boundary PDEs. Within our approach, the penalized PDE is approximated using the Deep BSDE framework proposed by \cite{weinan2017deep}, which leads us to coin the term "Deep Penalty Method (DPM)" to refer to our algorithm. We show that the error of the D… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  24. arXiv:2405.04336  [pdf, other

    cs.AI

    Temporal and Heterogeneous Graph Neural Network for Remaining Useful Life Prediction

    Authors: Zhihao Wen, Yuan Fang, Pengcheng Wei, Fayao Liu, Zhenghua Chen, Min Wu

    Abstract: Predicting Remaining Useful Life (RUL) plays a crucial role in the prognostics and health management of industrial systems that involve a variety of interrelated sensors. Given a constant stream of time series sensory data from such systems, deep learning models have risen to prominence at identifying complex, nonlinear temporal dependencies in these data. In addition to the temporal dependencies… ▽ More

    Submitted 1 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 12 pages

  25. arXiv:2405.03967  [pdf, other

    cs.LG cs.AI cs.AR

    SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems

    Authors: Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani

    Abstract: Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To overcome this, SwiftRL explores Processing-In-Memory (PIM) architectures to accelerate RL workloads. We achieve near-linear performance scaling by implementing… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  26. arXiv:2405.00542  [pdf, other

    eess.IV cs.CV

    UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement

    Authors: Ruiquan Ge, Zhaojie Fang, Pengxue Wei, Zhanghao Chen, Hongyang Jiang, Ahmed Elazab, Wangting Li, Xiang Wan, Shaochong Zhang, Changmiao Wang

    Abstract: Fundus photography, in combination with the ultra-wide-angle fundus (UWF) techniques, becomes an indispensable diagnostic tool in clinical settings by offering a more comprehensive view of the retina. Nonetheless, UWF fluorescein angiography (UWF-FA) necessitates the administration of a fluorescent dye via injection into the patient's hand or elbow unlike UWF scanning laser ophthalmoscopy (UWF-SLO… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  27. arXiv:2404.14309  [pdf, other

    cs.CV

    Towards Understanding the Robustness of Diffusion-Based Purification: A Stochastic Perspective

    Authors: Yiming Liu, Kezhao Liu, Yao Xiao, Ziyi Dong, Xiaogang Xu, Pengxu Wei, Liang Lin

    Abstract: Diffusion-Based Purification (DBP) has emerged as an effective defense mechanism against adversarial attacks. The efficacy of DBP has been attributed to the forward diffusion process, which narrows the distribution gap between clean and adversarial images through the addition of Gaussian noise. Although this explanation has some theoretical support, the significance of its contribution to robustne… ▽ More

    Submitted 2 October, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  28. arXiv:2404.10264  [pdf, other

    hep-ex hep-ph quant-ph

    Calibration of the Cryogenic Measurement System of a Resonant Haloscope Cavity

    Authors: Dong He, Jie Fan, Xin Gao, Yu Gao, Nick Houston, Zhongqing Ji, Yirong Jin, Chuang Li, Jinmian Li, Tianjun Li, Shi-hang Liu, Jia-Shu Niu, Zhihui Peng, Liang Sun, Zheng Sun, Jia Wang, Puxian Wei, Lina Wu, Zhongchen Xiang, Qiaoli Yang, Chi Zhang, Wenxing Zhang, Xin Zhang, Dongning Zheng, Ruifeng Zheng , et al. (1 additional authors not shown)

    Abstract: Possible light bosonic dark matter interactions with the Standard Model photon have been searched by microwave resonant cavities. In this paper, we demonstrate the cryogenic readout system calibration of a 7.138 GHz copper cavity with a loaded quality factor $Q_l=10^4$, operated at 22 mK temperature based on a dilution refrigerator. Our readout system consists of High Electron Mobility Transistors… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 7 pages, 5 figures, version to appear in CPC

  29. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  30. arXiv:2404.09263  [pdf, other

    cs.CV cs.AI

    Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection

    Authors: Jin Yang, Ping Wei, Huan Li, Ziyang Ren

    Abstract: Video moment retrieval and highlight detection are two highly valuable tasks in video understanding, but until recently they have been jointly studied. Although existing studies have made impressive advancement recently, they predominantly follow the data-driven bottom-up paradigm. Such paradigm overlooks task-specific and inter-task effects, resulting in poor model performance. In this paper, we… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  31. Dark photon constraints from a 7.139 GHz cavity haloscope experiment

    Authors: Dong He, Jie Fan, Xin Gao, Yu Gao, Nick Houston, Zhongqing Ji, Yirong Jin, Chuang Li, Jinmian Li, Tianjun Li, Shi-hang Liu, Jia-Shu Niu, Zhihui Peng, Liang Sun, Zheng Sun, Jia Wang, Puxian Wei, Lina Wu, Zhongchen Xiang, Qiaoli Yang, Chi Zhang, Wenxing Zhang, Xin Zhang, Dongning Zheng, Ruifeng Zheng , et al. (1 additional authors not shown)

    Abstract: The dark photon is a promising candidate for the dark matter which comprises most of the matter in our visible Universe. Via kinetic mixing with the Standard Model it can also be resonantly converted to photons in an electromagnetic cavity, offering novel experimental possibilities for the discovery and study of dark matter. We report the results of a pathfinder dark photon dark matter cavity sear… ▽ More

    Submitted 18 July, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 5 pages, 4 figures. Updated to match journal version

    Journal ref: Phys.Rev.D 110 (2024) 2, L021101

  32. arXiv:2403.16131  [pdf, other

    cs.CV

    Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement

    Authors: Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen

    Abstract: DETR-like methods have significantly increased detection performance in an end-to-end manner. The mainstream two-stage frameworks of them perform dense self-attention and select a fraction of queries for sparse cross-attention, which is proven effective for improving performance but also introduces a heavy computational burden and high dependence on stable query selection. This paper demonstrates… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  33. IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images

    Authors: Meilin Wang, Yexing Song, Pengxu Wei, Xiaoyu Xian, Yukai Shi, Liang Lin

    Abstract: Deep learning technologies have demonstrated their effectiveness in removing cloud cover from optical remote-sensing images. Convolutional Neural Networks (CNNs) exert dominance in the cloud removal tasks. However, constrained by the inherent limitations of convolutional operations, CNNs can address only a modest fraction of cloud occlusion. In recent years, diffusion models have achieved state-of… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE TGRS, we first present an iterative diffusion process for cloud removal, the code is available at: https://github.com/SongYxing/IDF-CR

  34. arXiv:2403.11852  [pdf, other

    cs.RO cs.AI

    Reinforcement Learning with Latent State Inference for Autonomous On-ramp Merging under Observation Delay

    Authors: Amin Tabrizian, Zhitong Huang, Peng Wei

    Abstract: This paper presents a novel approach to address the challenging problem of autonomous on-ramp merging, where a self-driving vehicle needs to seamlessly integrate into a flow of vehicles on a multi-lane highway. We introduce the Lane-keeping, Lane-changing with Latent-state Inference and Safety Controller (L3IS) agent, designed to perform the on-ramp merging task safely without comprehensive knowle… ▽ More

    Submitted 21 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  35. arXiv:2403.06579  [pdf, other

    eess.SY

    Edge Information Hub: Orchestrating Satellites, UAVs, MEC, Sensing and Communications for 6G Closed-Loop Controls

    Authors: Chengleyang Lei, Wei Feng, Peng Wei, Yunfei Chen, Ning Ge, Shiwen Mao

    Abstract: An increasing number of field robots would be used for mission-critical tasks in remote or post-disaster areas. Due to the limited individual abilities, these robots usually require an edge information hub (EIH), with not only communication but also sensing and computing functions. Such EIH could be deployed on a flexibly-dispatched unmanned aerial vehicle (UAV). Different from traditional aerial… ▽ More

    Submitted 24 August, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 16pages, 11 figures

  36. arXiv:2402.13769  [pdf, other

    cs.IR

    General Debiasing for Graph-based Collaborative Filtering via Adversarial Graph Dropout

    Authors: An Zhang, Wenchang Ma, Pengbo Wei, Leheng Sheng, Xiang Wang

    Abstract: Graph neural networks (GNNs) have shown impressive performance in recommender systems, particularly in collaborative filtering (CF). The key lies in aggregating neighborhood information on a user-item interaction graph to enhance user/item representations. However, we have discovered that this aggregation mechanism comes with a drawback, which amplifies biases present in the interaction graph. For… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to WWW 2024

  37. arXiv:2401.06992  [pdf, other

    cs.CV cs.AI

    Progressive Feature Fusion Network for Enhancing Image Quality Assessment

    Authors: Kaiqun Wu, Xiaoling Jiang, Rui Yu, Yonggang Luo, Tian Jiang, Xi Wu, Peng Wei

    Abstract: Image compression has been applied in the fields of image storage and video broadcasting. However, it's formidably tough to distinguish the subtle quality differences between those distorted images generated by different algorithms. In this paper, we propose a new image quality assessment framework to decide which image is better in an image group. To capture the subtle differences, a fine-grained… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Data Compression Conference

  38. arXiv:2312.10299  [pdf, other

    cs.CV cs.AI cs.LG

    Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge

    Authors: Conghan Yue, Zhengwei Peng, Junlong Ma, Shiyan Du, Pengxu Wei, Dongyu Zhang

    Abstract: Diffusion models exhibit powerful generative capabilities enabling noise mapping to data via reverse stochastic differential equations. However, in image restoration, the focus is on the mapping relationship from low-quality to high-quality images. Regarding this issue, we introduce the Generalized Ornstein-Uhlenbeck Bridge (GOUB) model. By leveraging the natural mean-reverting property of the gen… ▽ More

    Submitted 17 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: ICML 2024

  39. arXiv:2311.05374  [pdf, other

    cs.CL cs.AI

    TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs

    Authors: Shuyi Xie, Wenlin Yao, Yong Dai, Shaobo Wang, Donlin Zhou, Lifeng Jin, Xinhua Feng, Pengzhi Wei, Yujie Lin, Zhichao Hu, Dong Yu, Zhengyou Zhang, Jing Nie, Yuhong Liu

    Abstract: Large language models (LLMs) have shown impressive capabilities across various natural language tasks. However, evaluating their alignment with human preferences remains a challenge. To this end, we propose a comprehensive human evaluation framework to assess LLMs' proficiency in following instructions on diverse real-world tasks. We construct a hierarchical task tree encompassing 7 major areas co… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  40. arXiv:2310.18670  [pdf, other

    eess.SP

    Two-stage space construction for real-time modeling of distributed parameter systems under sparse sensing

    Authors: Peng Wei

    Abstract: Numerous industrial processes can be defined using distributed parameter systems (DPSs). This study introduces a two-stage spatial construction approach for real-time modeling of DPSs in cases of limited sensors. Initially, a discrete space-completion approach is created to recuperate the spatiotemporal patterns of non-monitored locations under sparse sensing. The high-dimensional space constructi… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  41. arXiv:2310.15138  [pdf, other

    cs.RO cs.CV

    Fusion-Driven Tree Reconstruction and Fruit Localization: Advancing Precision in Agriculture

    Authors: Kaiming Fu, Peng Wei, Juan Villacres, Zhaodan Kong, Stavros G. Vougioukas, Brian N. Bailey

    Abstract: Fruit distribution is pivotal in shaping the future of both agriculture and agricultural robotics, paving the way for a streamlined supply chain. This study introduces an innovative methodology that harnesses the synergy of RGB imagery, LiDAR, and IMU data, to achieve intricate tree reconstructions and the pinpoint localization of fruits. Such integration not only offers insights into the fruit di… ▽ More

    Submitted 14 October, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: This work was presented at IEEE/RSI International Conference on Intelligent Robots and Systems (IROS) Workshop

  42. arXiv:2310.08606  [pdf, other

    eess.SP

    Multiscale Fusion for Abnormality Detection and Localization of Distributed Parameter Systems

    Authors: Peng Wei, Han-Xiong Li

    Abstract: Numerous industrial thermal processes and fluid processes can be described by distributed parameter systems (DPSs), wherein many process parameters and variables vary in space and time. Early internal abnormalities in the DPS may develop into uncontrollable thermal failures, causing serious safety incidents. In this study, the multiscale information fusion is proposed for internal abnormality dete… ▽ More

    Submitted 1 December, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  43. arXiv:2309.04803  [pdf, other

    cs.CV cs.AI

    Towards Real-World Burst Image Super-Resolution: Benchmark and Method

    Authors: Pengxu Wei, Yujing Sun, Xingbei Guo, Chang Liu, Jie Chen, Xiangyang Ji, Liang Lin

    Abstract: Despite substantial advances, single-image super-resolution (SISR) is always in a dilemma to reconstruct high-quality images with limited information from one input image, especially in realistic scenarios. In this paper, we establish a large-scale real-world burst super-resolution dataset, i.e., RealBSR, to explore the faithful reconstruction of image details from multiple frames. Furthermore, we… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV2023

  44. arXiv:2309.02906  [pdf, other

    math.PR

    Well-posedness and averaging principle for Lévy-type McKean-Vlasov stochastic differential equations under local Lipschitz conditions

    Authors: Ying Chao, Jinqiao Duan, Ting Gao, Pingyuan Wei

    Abstract: In this paper, we investigate a class of McKean-Vlasov stochastic differential equations under Lévy-type perturbations. We first establish the existence and uniqueness theorem for solutions of the McKean-Vlasov stochastic differential equations by utilizing the Euler-like approximation. Then under some suitable conditions, we show that the solutions of McKean-Vlasov stochastic differential equatio… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 29 pages, 7 figures

    MSC Class: 60H10; 60G51; 34C29; 35Q83

  45. arXiv:2308.15016  [pdf, other

    cs.CV

    C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model

    Authors: Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin

    Abstract: Co-speech gesture generation is crucial for automatic digital avatar animation. However, existing methods suffer from issues such as unstable training and temporal inconsistency, particularly in generating high-fidelity and comprehensive gestures. Additionally, these methods lack effective control over speaker identity and temporal editing of the generated gestures. Focusing on capturing temporal… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 12 pages, 6 figures, 7 tables

  46. arXiv:2308.02263  [pdf, other

    cs.SD cs.CL eess.AS

    Efficient Monaural Speech Enhancement using Spectrum Attention Fusion

    Authors: Jinyu Long, Jetic Gū, Binhao Bai, Zhibo Yang, Ping Wei, Junli Li

    Abstract: Speech enhancement is a demanding task in automated speech processing pipelines, focusing on separating clean speech from noisy channels. Transformer based models have recently bested RNN and CNN models in speech enhancement, however at the same time they are much more computationally expensive and require much more high quality training data, which is always hard to come by. In this paper, we pre… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  47. arXiv:2308.01117  [pdf

    cs.RO eess.SY

    Optimization-Based Motion Planning for Autonomous Agricultural Vehicles Turning in Constrained Headlands

    Authors: Chen Peng, Peng Wei, Zhenghao Fei, Yuankai Zhu, Stavros G. Vougioukas

    Abstract: Headland maneuvering is a crucial aspect of unmanned field operations for autonomous agricultural vehicles (AAVs). While motion planning for headland turning in open fields has been extensively studied and integrated into commercial auto-guidance systems, the existing methods primarily address scenarios with ample headland space and thus may not work in more constrained headland geometries. Commer… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  48. arXiv:2307.16242  [pdf, other

    cs.CV

    SR-R$^2$KAC: Improving Single Image Defocus Deblurring

    Authors: Peng Tang, Zhiqiang Xu, Pengfei Wei, Xiaobin Hu, Peilin Zhao, Xin Cao, Chunlai Zhou, Tobias Lasser

    Abstract: We propose an efficient deep learning method for single image defocus deblurring (SIDD) by further exploring inverse kernel properties. Although the current inverse kernel method, i.e., kernel-sharing parallel atrous convolution (KPAC), can address spatially varying defocus blurs, it has difficulty in handling large blurs of this kind. To tackle this issue, we propose a Residual and Recursive Ke… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Submitted to IEEE Transactions on Cybernetics on 2023-July-24

  49. arXiv:2307.11530  [pdf, other

    eess.IV cs.CV

    UWAT-GAN: Fundus Fluorescein Angiography Synthesis via Ultra-wide-angle Transformation Multi-scale GAN

    Authors: Zhaojie Fang, Zhanghao Chen, Pengxue Wei, Wangting Li, Shaochong Zhang, Ahmed Elazab, Gangyong Jia, Ruiquan Ge, Changmiao Wang

    Abstract: Fundus photography is an essential examination for clinical and differential diagnosis of fundus diseases. Recently, Ultra-Wide-angle Fundus (UWF) techniques, UWF Fluorescein Angiography (UWF-FA) and UWF Scanning Laser Ophthalmoscopy (UWF-SLO) have been gradually put into use. However, Fluorescein Angiography (FA) and UWF-FA require injecting sodium fluorescein which may have detrimental influence… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 26th International Conference on Medical Image Computing and Computer Assisted Intervention

  50. arXiv:2307.07218  [pdf, other

    eess.AS cs.SD

    Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis

    Authors: Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

    Abstract: Zero-shot text-to-speech (TTS) aims to synthesize voices with unseen speech prompts, which significantly reduces the data and computation requirements for voice cloning by skipping the fine-tuning process. However, the prompting mechanisms of zero-shot TTS still face challenges in the following aspects: 1) previous works of zero-shot TTS are typically trained with single-sentence prompts, which si… ▽ More

    Submitted 10 April, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted by ICLR 2024