Skip to main content

Showing 1–50 of 2,759 results for author: Chen, P

  1. arXiv:2410.15895  [pdf, other

    quant-ph eess.SY

    Cryogenic Control and Readout Integrated Circuits for Solid-State Quantum Computing

    Authors: Lingxiao Lei, Heng Huang, Pingxing Chen, Mingtang Deng

    Abstract: In the pursuit of quantum computing, solid-state quantum systems, particularly superconducting ones, have made remarkable advancements over the past two decades. However, achieving fault-tolerant quantum computing for next-generation applications necessitates the integration of several million qubits, which presents significant challenges in terms of interconnection complexity and latency that are… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2410.15257  [pdf, other

    cs.LG cs.DS math.OC

    Learning-Augmented Algorithms for the Bahncard Problem

    Authors: Hailiang Zhao, Xueyan Tang, Peng Chen, Shuiguang Deng

    Abstract: In this paper, we study learning-augmented algorithms for the Bahncard problem. The Bahncard problem is a generalization of the ski-rental problem, where a traveler needs to irrevocably and repeatedly decide between a cheap short-term solution and an expensive long-term one with an unknown future. Even though the problem is canonical, only a primal-dual-based learning-augmented algorithm was expli… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: This paper has been accepted by the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  3. arXiv:2410.14195  [pdf, other

    cs.CV

    Rethinking Transformer for Long Contextual Histopathology Whole Slide Image Analysis

    Authors: Honglin Li, Yunlong Zhang, Pingyi Chen, Zhongyi Shui, Chenglu Zhu, Lin Yang

    Abstract: Histopathology Whole Slide Image (WSI) analysis serves as the gold standard for clinical cancer diagnosis in the daily routines of doctors. To develop computer-aided diagnosis model for WSIs, previous methods typically employ Multi-Instance Learning to enable slide-level prediction given only slide-level labels. Among these models, vanilla attention mechanisms without pairwise interactions have tr… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: NeurIPS-2024. arXiv admin note: text overlap with arXiv:2311.12885

  4. arXiv:2410.14182  [pdf, other

    cs.CL cs.LG

    LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs

    Authors: Yujun Zhou, Jingdong Yang, Kehan Guo, Pin-Yu Chen, Tian Gao, Werner Geyer, Nuno Moniz, Nitesh V Chawla, Xiangliang Zhang

    Abstract: Laboratory accidents pose significant risks to human life and property, underscoring the importance of robust safety protocols. Despite advancements in safety training, laboratory personnel may still unknowingly engage in unsafe practices. With the increasing reliance on large language models (LLMs) for guidance in various fields, including laboratory settings, there is a growing concern about the… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 50 pages, 19 figures

  5. arXiv:2410.13907  [pdf, other

    cs.CR cs.AI cs.CL

    NSmark: Null Space Based Black-box Watermarking Defense Framework for Pre-trained Language Models

    Authors: Haodong Zhao, Jinming Hu, Peixuan Li, Fangqi Li, Jinrui Sha, Peixuan Chen, Zhuosheng Zhang, Gongshen Liu

    Abstract: Pre-trained language models (PLMs) have emerged as critical intellectual property (IP) assets that necessitate protection. Although various watermarking strategies have been proposed, they remain vulnerable to Linear Functionality Equivalence Attacks (LFEA), which can invalidate most existing white-box watermarks without prior knowledge of the watermarking scheme or training data. This paper furth… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  6. arXiv:2410.13178  [pdf, other

    cs.LG cs.AI

    GeSubNet: Gene Interaction Inference for Disease Subtype Network Generation

    Authors: Ziwei Yang, Zheng Chen, Xin Liu, Rikuto Kotoge, Peng Chen, Yasuko Matsubara, Yasushi Sakurai, Jimeng Sun

    Abstract: Retrieving gene functional networks from knowledge databases presents a challenge due to the mismatch between disease networks and subtype-specific variations. Current solutions, including statistical and deep learning methods, often fail to effectively integrate gene interaction knowledge from databases or explicitly learn subtype-specific interactions. To address this mismatch, we propose GeSubN… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: Under review as a conference paper at ICLR 2025

  7. arXiv:2410.12655  [pdf, other

    cs.LG

    Position Specific Scoring Is All You Need? Revisiting Protein Sequence Classification Tasks

    Authors: Sarwan Ali, Taslim Murad, Prakash Chourasia, Haris Mansoor, Imdad Ullah Khan, Pin-Yu Chen, Murray Patterson

    Abstract: Understanding the structural and functional characteristics of proteins are crucial for developing preventative and curative strategies that impact fields from drug discovery to policy development. An important and popular technique for examining how amino acids make up these characteristics of the protein sequences with position-specific scoring (PSS). While the string kernel is crucial in natura… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  8. arXiv:2410.12056  [pdf

    cs.DB cs.CY

    Utilizing Spatiotemporal Data Analytics to Pinpoint Outage Location

    Authors: Reddy Mandati, Po-Chen Chen, Vladyslav Anderson, Bishwa Sapkota, Michael Jarrell Warren, Bobby Besharati, Ankush Agarwal, Samuel Johnston III

    Abstract: Understanding the exact fault location in the post-event analysis is the key to improving the accuracy of outage management. Unfortunately, the fault location is not generally well documented during the restoration process, creating a big challenge for post-event analysis. By utilizing various data source systems, including outage management system (OMS) data, asset geospatial information system (… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  9. arXiv:2410.11967  [pdf

    cs.CV cs.LG

    Integrating Artificial Intelligence Models and Synthetic Image Data for Enhanced Asset Inspection and Defect Identification

    Authors: Reddy Mandati, Vladyslav Anderson, Po-chen Chen, Ankush Agarwal, Tatjana Dokic, David Barnard, Michael Finn, Jesse Cromer, Andrew Mccauley, Clay Tutaj, Neha Dave, Bobby Besharati, Jamie Barnett, Timothy Krall

    Abstract: In the past utilities relied on in-field inspections to identify asset defects. Recently, utilities have started using drone-based inspections to enhance the field-inspection process. We consider a vast repository of drone images, providing a wealth of information about asset health and potential issues. However, making the collected imagery data useful for automated defect detection requires sign… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  10. arXiv:2410.11802  [pdf, other

    cs.LG

    FoundTS: Comprehensive and Unified Benchmarking of Foundation Models for Time Series Forecasting

    Authors: Zhe Li, Xiangfei Qiu, Peng Chen, Yihang Wang, Hanyin Cheng, Yang Shu, Jilin Hu, Chenjuan Guo, Aoying Zhou, Qingsong Wen, Christian S. Jensen, Bin Yang

    Abstract: Time Series Forecasting (TSF) is key functionality in numerous fields, including in finance, weather services, and energy management. While TSF methods are emerging these days, many of them require domain-specific data collection and model training and struggle with poor generalization performance on new domains. Foundation models aim to overcome this limitation. Pre-trained on large-scale languag… ▽ More

    Submitted 21 October, 2024; v1 submitted 15 October, 2024; originally announced October 2024.

  11. arXiv:2410.11290  [pdf, other

    cs.LG cs.AI cs.CR

    Backdoor Attack on Vertical Federated Graph Neural Network Learning

    Authors: Jirui Yang, Peng Chen, Zhihui Lu, Ruijun Deng, Qiang Duan, Jianping Zeng

    Abstract: Federated Graph Neural Network (FedGNN) is a privacy-preserving machine learning technology that combines federated learning (FL) and graph neural networks (GNNs). It offers a privacy-preserving solution for training GNNs using isolated graph data. Vertical Federated Graph Neural Network (VFGNN) is an important branch of FedGNN, where data features and labels are distributed among participants, an… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  12. arXiv:2410.10280  [pdf, other

    physics.ins-det astro-ph.IM hep-ex quant-ph

    Dual-Mode Calorimetric Superconducting Nanowire Single Photon Detectors

    Authors: Hsin-Yeh Wu, Marc Besançon, Jia-Wern Chen, Pisin Chen, Jean-François Glicenstein, Shu-Xiao Liu, Yu-Jung Lu, Xavier-François Navick, Stathes Paganis, Boris Tuchming, Dimitra Tsionou, Feng-Yang Tsai

    Abstract: A dual-operation mode SNSPD is demonstrated. In the conventional Geiger SNSPD mode the sensor operates at temperatures well below the critical temperature, Tc, working as an event counter without sensitivity to the number of photons impinging the sensor. In the calorimetric mode, the detector is operated at temperatures just below Tc and displays photon-number sensitivity for wavelengths in the op… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Manuscript prepared for APL

  13. arXiv:2410.07471  [pdf, other

    cs.LG cs.AI cs.CL

    SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection

    Authors: Han Shen, Pin-Yu Chen, Payel Das, Tianyi Chen

    Abstract: Fine-tuning on task-specific data to boost downstream performance is a crucial step for leveraging Large Language Models (LLMs). However, previous studies have demonstrated that fine-tuning the models on several adversarial samples or even benign data can greatly comprise the model's pre-equipped alignment and safety capabilities. In this work, we propose SEAL, a novel framework to enhance safety… ▽ More

    Submitted 10 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  14. arXiv:2410.06755  [pdf, other

    quant-ph

    Magnetic field dependence of $V_B^-$ Defects in hexagonal boron nitride

    Authors: Mulin Zheng, Shizhuo Ale, Peiqin Chen, Jingpu Tu, Qiang Zhou, Haizhi Song, You Wang, Junfeng Wang, Guangcan Guo, Guangwei Deng

    Abstract: The interface with spin defects in hexagonal boron nitride has recently become a promising platform and has shown great potential in a wide range of quantum technologies. Varieties of spin properties of $V_B^-$ defects in hexagonal boron nitride (hBN) have been researched widely and deeply, like their structure and coherent control. However, little is known about the influence of off-axis magnetic… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 5pages, 4 figures

  15. arXiv:2410.05255  [pdf, other

    cs.CV cs.LG

    SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

    Authors: Daoan Zhang, Guangchen Lan, Dong-Jun Han, Wenlin Yao, Xiaoman Pan, Hongming Zhang, Mingxiao Li, Pengcheng Chen, Yu Dong, Christopher Brinton, Jiebo Luo

    Abstract: Reinforcement learning from human feedback (RLHF) methods are emerging as a way to fine-tune diffusion models (DMs) for visual generation. However, commonly used on-policy strategies are limited by the generalization capability of the reward model, while off-policy approaches require large amounts of difficult-to-obtain paired human-annotated data, particularly in visual generation tasks. To addre… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

  16. arXiv:2410.05137  [pdf, other

    cond-mat.supr-con

    Field-angle evolution of the superconducting and magnetic phases of UTe$_2$ around the $b$ axis

    Authors: Sylvia K. Lewin, Josephine J. Yu, Corey E. Frank, David Graf, Patrick Chen, Sheng Ran, Yun Suk Eo, Johnpierre Paglione, S. Raghu, Nicholas P. Butch

    Abstract: We experimentally determine the bounds of the magnetic-field-induced superconducting and magnetic phases near the crystalline $b$ axis of uranium ditelluride (UTe$_2$). By measuring the magnetoresistance as a function of rotation angle and field strength in magnetic fields as large as 41.5 T, we have studied these boundaries in three dimensions of magnetic field direction. The phase boundaries in… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 17 pages, 16 figures

  17. arXiv:2410.05111  [pdf, other

    cs.CV

    LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting

    Authors: Qifeng Chen, Sheng Yang, Sicong Du, Tao Tang, Peng Chen, Yuchi Huo

    Abstract: LiDAR simulation plays a crucial role in closed-loop simulation for autonomous driving. Although recent advancements, such as the use of reconstructed mesh and Neural Radiance Fields (NeRF), have made progress in simulating the physical properties of LiDAR, these methods have struggled to achieve satisfactory frame rates and rendering quality. To address these limitations, we present LiDAR-GS, the… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

  18. arXiv:2410.04324  [pdf, other

    cs.SD cs.AI eess.AS

    SONAR: A Synthetic AI-Audio Detection Framework and Benchmark

    Authors: Xiang Li, Pin-Yu Chen, Wenqi Wei

    Abstract: Recent advances in Text-to-Speech (TTS) and Voice-Conversion (VC) using generative Artificial Intelligence (AI) technology have made it possible to generate high-quality and realistic human-like audio. This introduces significant challenges to distinguishing AI-synthesized speech from the authentic human voice and could raise potential issues of misuse for malicious purposes such as impersonation… ▽ More

    Submitted 10 October, 2024; v1 submitted 5 October, 2024; originally announced October 2024.

  19. arXiv:2410.04041  [pdf, other

    eess.IV cs.CV

    Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy

    Authors: Pengcheng Chen, Wenhao Li, Nicole Gunderson, Jeremy Ruthberg, Randall Bly, Waleed M. Abuzeid, Zhenglong Sun, Eric J. Seibel

    Abstract: The 3D reconstruction of the surgical field in minimally invasive endoscopic surgery has posed a formidable challenge when using conventional monocular endoscopes. Existing 3D reconstruction methodologies are frequently encumbered by suboptimal accuracy and limited generalization capabilities. In this study, we introduce an innovative pipeline using Neural Radiance Fields (NeRF) for 3D reconstruct… ▽ More

    Submitted 10 October, 2024; v1 submitted 5 October, 2024; originally announced October 2024.

  20. arXiv:2410.03920  [pdf, other

    cs.RO cs.AI cs.CE cs.CV physics.comp-ph

    Learning Object Properties Using Robot Proprioception via Differentiable Robot-Object Interaction

    Authors: Peter Yichen Chen, Chao Liu, Pingchuan Ma, John Eastman, Daniela Rus, Dylan Randle, Yuri Ivanov, Wojciech Matusik

    Abstract: Differentiable simulation has become a powerful tool for system identification. While prior work has focused on identifying robot properties using robot-specific data or object properties using object-specific data, our approach calibrates object properties by using information from the robot, without relying on data from the object itself. Specifically, we utilize robot joint encoder information,… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  21. arXiv:2410.03818  [pdf, other

    cs.LG cs.AI cs.CL

    Large Language Models can be Strong Self-Detoxifiers

    Authors: Ching-Yun Ko, Pin-Yu Chen, Payel Das, Youssef Mroueh, Soham Dan, Georgios Kollias, Subhajit Chaudhury, Tejaswini Pedapati, Luca Daniel

    Abstract: Reducing the likelihood of generating harmful and toxic output is an essential task when aligning large language models (LLMs). Existing methods mainly rely on training an external reward model (i.e., another language model) or fine-tuning the LLM using self-generated data to influence the outcome. In this paper, we show that LLMs have the capability of self-detoxification without the use of an ad… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 20 pages

  22. arXiv:2410.03312  [pdf, other

    cs.CL eess.AS

    Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models

    Authors: Pavel Stepachev, Pinzhen Chen, Barry Haddow

    Abstract: Large language models (LLMs) have started to play a vital role in modelling speech and text. To explore the best use of context and multiple systems' outputs for post-ASR speech emotion prediction, we study LLM prompting on a recent task named GenSEC. Our techniques include ASR transcript ranking, variable conversation context, and system output fusion. We show that the conversation context has di… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  23. arXiv:2410.03128  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.other

    Spontaneously formed phonon frequency combs in van der Waals solid CrXTe$_3$ (X=Ge,Si)

    Authors: Lebing Chen, Gaihua Ye, Cynthia Nnokwe, Xing-Chen Pan, Katsumi Tanigaki, Guanghui Cheng, Yong P. Chen, Jiaqiang Yan, David G. Mandrus, Andres E. Llacsahuanga Allcca, Nathan Giles-Donovan, Robert J. Birgeneau, Rui He

    Abstract: Optical phonon engineering through nonlinear effects has been utilized in ultrafast control of material properties. However, nonlinear optical phonons typically exhibit rapid decay due to strong mode-mode couplings, limiting their effectiveness in temperature or frequency sensitive applications. In this study, we report the observation of long-lived nonlinear optical phonons through the spontaneou… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 22 pages, 10 figures

  24. arXiv:2410.02736  [pdf, other

    cs.CL cs.AI

    Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

    Authors: Jiayi Ye, Yanbo Wang, Yue Huang, Dongping Chen, Qihui Zhang, Nuno Moniz, Tian Gao, Werner Geyer, Chao Huang, Pin-Yu Chen, Nitesh V Chawla, Xiangliang Zhang

    Abstract: LLM-as-a-Judge has been widely utilized as an evaluation method in various benchmarks and served as supervised rewards in model training. However, despite their excellence in many domains, potential issues are under-explored, undermining their reliability and the scope of their utility. Therefore, we identify 12 key potential biases and propose a new automated bias quantification framework-CALM-wh… ▽ More

    Submitted 3 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

  25. arXiv:2410.02167  [pdf, other

    cs.LG cs.CL

    Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis

    Authors: Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen

    Abstract: Chain-of-Thought (CoT) is an efficient prompting method that enables the reasoning ability of large language models by augmenting the query using multiple examples with multiple intermediate steps. Despite the empirical success, the theoretical understanding of how to train a Transformer to achieve the CoT ability remains less explored. This is primarily due to the technical challenges involved in… ▽ More

    Submitted 5 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

  26. arXiv:2410.01164  [pdf, ps, other

    math.CA

    On maximal functions generated by Hörmander-type spectral multipliers

    Authors: Peng Chen, Xixi Lin, Liangchuan Wu, Lixin Yan

    Abstract: Let $(X,d,μ)$ be a metric space with doubling measure and $L$ be a nonnegative self-adjoint operator on $L^2(X)$ whose heat kernel satisfies the Gaussian upper bound. We assume that there exists an $L$-harmonic function $h$ such that the semigroup $\exp(-tL)$, after applying the Doob transform related to $h$, satisfies the upper and lower Gaussian estimates. In this paper we apply the Doob transfo… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 37 pages

    MSC Class: 42B15; 42B25; 47F10

  27. arXiv:2410.00938  [pdf, other

    cs.LG

    MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards

    Authors: Sheng Wang, Liheng Chen, Pengan Chen, Jingwei Dong, Boyang Xue, Jiyue Jiang, Lingpeng Kong, Chuan Wu

    Abstract: The rapid scaling of large language models necessitates more lightweight finetuning methods to reduce the explosive GPU memory overhead when numerous customized models are served simultaneously. Targeting more parameter-efficient low-rank adaptation (LoRA), parameter sharing presents a promising solution. Empirically, our research into high-level sharing principles highlights the indispensable rol… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  28. arXiv:2409.18296  [pdf, other

    astro-ph.SR astro-ph.EP

    An Eccentric Binary with a Misaligned Circumbinary Disk

    Authors: Zhecheng Hu, Wei Zhu, Fei Dai, Ping Chen, Yang Huang, Min Fang, Richard S. Post

    Abstract: We present spectroscopic and photometric observations of Bernhard-2, which was previously identified as a candidate system to host a misaligned circumbinary disk. Our spectroscopic measurements confirm that Bernhard-2 indeed contains an eccentric ($e=0.69 \pm 0.08$) binary and thus that the periodic variability in the photometric light curve is best explained by the occultation by the misaligned c… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 10 pages, 4 figures, submitted to AAS Journals, comments welcome

  29. arXiv:2409.17892  [pdf, other

    cs.CL

    EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

    Authors: Shaoxiong Ji, Zihao Li, Indraneil Paul, Jaakko Paavola, Peiqin Lin, Pinzhen Chen, Dayyán O'Brien, Hengyu Luo, Hinrich Schütze, Jörg Tiedemann, Barry Haddow

    Abstract: In this work, we introduce EMMA-500, a large-scale multilingual language model continue-trained on texts across 546 languages designed for enhanced multilingual performance, focusing on improving language coverage for low-resource languages. To facilitate continual pre-training, we compile the MaLA corpus, a comprehensive multilingual dataset enriched with curated datasets across diverse domains.… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  30. arXiv:2409.15398  [pdf, other

    cs.CR cs.AI cs.LG

    Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI

    Authors: Ambrish Rawat, Stefan Schoepf, Giulio Zizzo, Giandomenico Cornacchia, Muhammad Zaid Hameed, Kieran Fraser, Erik Miehling, Beat Buesser, Elizabeth M. Daly, Mark Purcell, Prasanna Sattigeri, Pin-Yu Chen, Kush R. Varshney

    Abstract: As generative AI, particularly large language models (LLMs), become increasingly integrated into production applications, new attack surfaces and vulnerabilities emerge and put a focus on adversarial threats in natural language and multi-modal systems. Red-teaming has gained importance in proactively identifying weaknesses in these systems, while blue-teaming works to protect against such adversar… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  31. arXiv:2409.14757  [pdf

    physics.optics

    Giant and Flexible Toroidal Circular Dichroism from Planar Chiral Metasurface

    Authors: Shijie Kang, Haitao Li, Jiayu Fan, Jiusi Yu, Boyang Qu, Peng Chen, Xiaoxiao Wu

    Abstract: Chirality, a fundamental concept describing an object cannot superpose with its mirror image, is crucial in optics and photonics and leads to various exotic phenomena, such as circular dichroism, and optical activity. Recent findings reveal that, besides electric and magnetic dipoles, toroidal dipoles, an elusive part of dynamic multipoles, can also contribute significantly to chirality. However,… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  32. arXiv:2409.13058  [pdf, other

    cs.HC cs.RO

    Mixed Reality Tele-ultrasound over 750 km: a Clinical Study

    Authors: Ryan Yeung, David Black, Patrick B. Chen, Victoria Lessoway, Janice Reid, Sergio Rangel-Suarez, Silvia D. Chang, Septimiu E. Salcudean

    Abstract: Ultrasound is a hand-held, low-cost, non-invasive medical imaging modality which plays a vital role in diagnosing various diseases. Despite this, many rural and remote communities do not have access to ultrasound scans due to the lack of local experts trained to perform them. To address this challenge, we built a mixed reality and haptics-based tele-ultrasound system to enable an expert to precise… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 8 pages, 10 figures, submitted to IEEE VR 2025

  33. arXiv:2409.12889  [pdf, other

    cs.AI

    Can VLMs Play Action Role-Playing Games? Take Black Myth Wukong as a Study Case

    Authors: Peng Chen, Pi Bu, Jun Song, Yuan Gao, Bo Zheng

    Abstract: Recently, large language model (LLM)-based agents have made significant advances across various fields. One of the most popular research areas involves applying these agents to video games. Traditionally, these methods have relied on game APIs to access in-game environmental and action data. However, this approach is limited by the availability of APIs and does not reflect how humans play games. W… ▽ More

    Submitted 22 September, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  34. arXiv:2409.12814  [pdf

    physics.optics physics.app-ph

    GeSn 320 \times 256 Focal Plane Array for Silicon-Based Short-wave Infrared Imaging

    Authors: Guoyin Xu, Hui Cong, Yue Li, Zhengjie Wu, Fenghe Fu, Ping Chen, Chao Zhao, Chi Xu, Chunlai Xue

    Abstract: Short-wave infrared (SWIR) imaging arrays have demonstrated great potential in applications spanning from military to civilian consumer electronics. However, the current focal plane arrays (FPAs), which are based on compound semiconductors, have limited applications in civilian circumstances due to elevated manufacturing costs and prolonged fabrication cycle time. To address this, a high-performan… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

  35. arXiv:2409.11905  [pdf, other

    cs.RO cs.AI cs.IR

    AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots

    Authors: Zhaxizhuoma, Pengan Chen, Ziniu Wu, Jiawei Sun, Dong Wang, Peng Zhou, Nieqing Cao, Yan Ding, Bin Zhao, Xuelong Li

    Abstract: This paper presents AlignBot, a novel framework designed to optimize VLM-powered customized task planning for household robots by effectively aligning with user reminders. In domestic settings, aligning task planning with user reminders poses significant challenges due to the limited quantity, diversity, and multimodal nature of the reminders. To address these challenges, AlignBot employs a fine-t… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  36. arXiv:2409.09668  [pdf, other

    cs.CV

    EditBoard: Towards A Comprehensive Evaluation Benchmark for Text-based Video Editing Models

    Authors: Yupeng Chen, Penglin Chen, Xiaoyu Zhang, Yixian Huang, Qian Xie

    Abstract: The rapid development of diffusion models has significantly advanced AI-generated content (AIGC), particularly in Text-to-Image (T2I) and Text-to-Video (T2V) generation. Text-based video editing, leveraging these generative capabilities, has emerged as a promising field, enabling precise modifications to videos based on text prompts. Despite the proliferation of innovative video editing models, th… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

  37. arXiv:2409.09601  [pdf, other

    cs.SD cs.AI cs.MM eess.AS

    A Survey of Foundation Models for Music Understanding

    Authors: Wenjun Li, Ying Cai, Ziyang Wu, Wenyi Zhang, Yifan Chen, Rundong Qi, Mengqi Dong, Peigen Chen, Xiao Dong, Fenghao Shi, Lei Guo, Junwei Han, Bao Ge, Tianming Liu, Lin Gan, Tuo Zhang

    Abstract: Music is essential in daily life, fulfilling emotional and entertainment needs, and connecting us personally, socially, and culturally. A better understanding of music can enhance our emotions, cognitive skills, and cultural connections. The rapid advancement of artificial intelligence (AI) has introduced new ways to analyze music, aiming to replicate human understanding of music and provide relat… ▽ More

    Submitted 14 September, 2024; originally announced September 2024.

    Comments: 20 pages, 2 figures

  38. arXiv:2409.09572  [pdf, other

    cs.RO

    A Novel Aerial-Aquatic Locomotion Robot with Variable Stiffness Propulsion Module

    Authors: Junzhe Hu, Pengyu Chen, Tianxiang Feng, Yuxuan Wen, Ke Wu, Janet Dong

    Abstract: In recent years, the development of robots capable of operating in both aerial and aquatic environments has gained significant attention. This study presents the design and fabrication of a novel aerial-aquatic locomotion robot (AALR). Inspired by the diving beetle, the AALR incorporates a biomimetic propulsion mechanism with power and recovery strokes. The variable stiffness propulsion module (VS… ▽ More

    Submitted 14 September, 2024; originally announced September 2024.

    Comments: 8 pages, 10 figures, ICRA

  39. arXiv:2409.09141  [pdf, other

    cs.CE

    Sequential infinite-dimensional Bayesian optimal experimental design with derivative-informed latent attention neural operator

    Authors: Jinwoo Go, Peng Chen

    Abstract: We develop a new computational framework to solve sequential Bayesian optimal experimental design (SBOED) problems constrained by large-scale partial differential equations with infinite-dimensional random parameters. We propose an adaptive terminal formulation of the optimality criteria for SBOED to achieve adaptive global optimality. We also establish an equivalent optimization formulation to ac… ▽ More

    Submitted 2 October, 2024; v1 submitted 13 September, 2024; originally announced September 2024.

  40. arXiv:2409.08365  [pdf, other

    nucl-ex

    Measurement of the nucleon spin structure functions for $0.01<Q^2<1$~GeV$^2$ using CLAS

    Authors: A. Deur, S. E. Kuhn, M. Ripani, X. Zheng, A. G. Acar, P. Achenbach, K. P. Adhikari, J. S. Alvarado, M. J. Amaryan, W. R. Armstrong, H. Atac, H. Avakian, L. Baashen, N. A. Baltzell, L. Barion, M. Bashkanov, M. Battaglieri, B. Benkel, F. Benmokhtar, A. Bianconi, A. S. Biselli, W. A. Booth, F. B ossu, P. Bosted, S. Boiarinov , et al. (124 additional authors not shown)

    Abstract: The spin structure functions of the proton and the deuteron were measured during the EG4 experiment at Jefferson Lab in 2006. Data were collected for longitudinally polarized electron scattering off longitudinally polarized NH$_3$ and ND$_3$ targets, for $Q^2$ values as small as 0.012 and 0.02 GeV$^2$, respectively, using the CEBAF Large Acceptance Spectrometer (CLAS). This is the archival paper o… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 33 pages. 26 figures. Data table provided in supplementary material (30 pages)

    Report number: JLAB-PHY-24-4184, DOE/OR/23177-7672

  41. arXiv:2409.06577  [pdf, other

    eess.SP

    Compressed Sensing based Detection Schemes for Differential Spatial Modulation in Visible Light Communication Systems

    Authors: Zichun Shi, Pu Miao, Peng Chen, Lei Xue, Li-Yang Zheng, Laiyuan Wang, Gaojie Chen

    Abstract: Differential spatial modulation (DSM) exploits the time dimension to facilitate the differential modulation, which can perfectly avoid the challenge in acquiring of heavily entangled channel state information of visible light communication (VLC) system. However, it has huge search space and high complexity for large number of transmitters. In this paper, a novel vector correction (VC)-based orthog… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: This paper has been accepted by 2024 IEEE 24th International Conference on Communication Technology (ICCT 2024)

  42. arXiv:2409.05987  [pdf, other

    astro-ph.IM astro-ph.EP

    Simulated performance of energy-resolving detectors towards exoplanet imaging with the Habitable Worlds Observatory

    Authors: Sarah Steiger, Laurent Pueyo, Emiel H. Por, Pin Chen, Rémi Soummer, Raphaël Pourcelot, Iva Laginja, Vanessa P. Bailey

    Abstract: One of the primary science goals of the Habitable Worlds Observatory (HWO) as defined by the Astro2020 decadal survey is the imaging of the first Earth-like planet around a Sun-like star. A key technology gap towards reaching this goal are the development of ultra-low-noise photon counting detectors capable of measuring the incredibly low count rates coming from these planets which are at contrast… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: 13 pages, 7 figures

    Journal ref: Proc. SPIE 13092, Space Telescopes and Instrumentation 2024: Optical, Infrared, and Millimeter Wave; 130921W

  43. arXiv:2409.05337  [pdf, other

    hep-ph hep-ex

    Masses and radiative decay widths of the $D_{s0}^*(2317)$ and $D_{s1}^{\prime}(2460)$ and their bottom analogs

    Authors: Zi-Le Zhang, Zhan-Wei Liu, Si-Qiang Luo, Ping Chen, Zhi-Hui Guo

    Abstract: We study the mass spectra and radiative decays of $D_{s0}^*(2317)$ and $D_{s1}^{\prime}(2460)$ in an unquenched framework. In addition to coupled channel effects between the $c\bar{s}$ cores and $D^{(*)}K$ channels, $D^{(*)}K$-$D^{(*)}K$ self interactions are also considered in this work and we succeed to reproduce their mass spectra. Furthermore, we study the radiative decays of the… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: 18 pages, 3 tables, 7 figures

  44. arXiv:2409.05064  [pdf, other

    astro-ph.GA

    Environmental effects as a key factor in shaping star-forming S0 galaxies

    Authors: Pei-Bin Chen, Junfeng Wang, Yan-Mei Chen, Xiao-Yu Xu, Tian-Wen Cao

    Abstract: The origins of lenticular galaxies (S0s) can be classified into two main categories: ``minor mergers" in low-density environments (LDEs) and ``faded spirals" in high-density environments (HDEs). The transitional phase in the evolution of S0s, namely, star-forming lenticular galaxies (SFS0s), can serve as an important probe for analyzing the complex processes involved in the transformation between… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 15 pages, 7 figures, 4 tables. Accepted for publication in A&A

  45. arXiv:2409.04363  [pdf, other

    cs.CV

    RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement

    Authors: Hao Luo, Baoliang Chen, Lingyu Zhu, Peilin Chen, Shiqi Wang

    Abstract: Scene observation from multiple perspectives would bring a more comprehensive visual experience. However, in the context of acquiring multiple views in the dark, the highly correlated views are seriously alienated, making it challenging to improve scene understanding with auxiliary views. Recent single image-based enhancement methods may not be able to provide consistently desirable restoration pe… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 14 Pages, 10 Figures, Under Review

  46. arXiv:2409.04200  [pdf, other

    astro-ph.HE

    ZTF SN Ia DR2: The diversity and relative rates of the thermonuclear SN population

    Authors: G. Dimitriadis, U. Burgaz, M. Deckers, K. Maguire, J. Johansson, M. Smith, M. Rigault, C. Frohmaier, J. Sollerman, L. Galbany, Y. -L. Kim, C. Liu, A. A. Miller, P. E. Nugent, A. Alburai, P. Chen, S. Dhawan, M. Ginolin, A. Goobar, S. L. Groom, L. Harvey, W. D. Kenworthy, S. R. Kulkarni, B. Popovic, R. L. Riddle , et al. (5 additional authors not shown)

    Abstract: The Zwicky Transient Facility SN Ia Data Release 2 (ZTF SN Ia DR2) contains more than 3,000 Type Ia supernovae (SNe Ia), providing the largest homogeneous low-redshift sample of SNe Ia. Having at least one spectrum per event, this data collection is ideal for large-scale statistical studies of the photometric, spectroscopic and host-galaxy properties of SNe Ia, particularly of the more rare "pecul… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 19 pages, 13 figures, submitted to Astronomy and Astrophysics

  47. arXiv:2409.02054  [pdf, other

    astro-ph.HE

    A cosmic formation site of silicon and sulphur revealed by a new type of supernova explosion

    Authors: Steve Schulze, Avishay Gal-Yam, Luc Dessart, Adam A. Miller, Stan E. Woosley, Yi Yang, Mattia Bulla, Ofer Yaron, Jesper Sollerman, Alexei V. Filippenko, K-Ryan Hinds, Daniel A. Perley, Daichi Tsuna, Ragnhild Lunnan, Nikhil Sarin, Sean J. Brennan, Thomas G. Brink, Rachel J. Bruch, Ping Chen, Kaustav K. Das, Suhail Dhawan, Claes Fransson, Christoffer Fremling, Anjasha Gangopadhyay, Ido Irani , et al. (25 additional authors not shown)

    Abstract: The cores of stars are the cosmic furnaces where light elements are fused into heavier nuclei. The fusion of hydrogen to helium initially powers all stars. The ashes of the fusion reactions are then predicted to serve as fuel in a series of stages, eventually transforming massive stars into a structure of concentric shells. These are composed of natal hydrogen on the outside, and consecutively hea… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 48 pages, 12 figures and 10 tables. Submitted to a high-impact journal. The reduced spectra and photometry will be made available via the journal webpage and the WISeREP archive after the acceptance of the paper

  48. arXiv:2409.02038  [pdf, other

    cs.CL cs.AI cs.DB

    BEAVER: An Enterprise Benchmark for Text-to-SQL

    Authors: Peter Baile Chen, Fabian Wenz, Yi Zhang, Moe Kayali, Nesime Tatbul, Michael Cafarella, Çağatay Demiralp, Michael Stonebraker

    Abstract: Existing text-to-SQL benchmarks have largely been constructed using publicly available tables from the web with human-generated tests containing question and SQL statement pairs. They typically show very good results and lead people to think that LLMs are effective at text-to-SQL tasks. In this paper, we apply off-the-shelf LLMs to a benchmark containing enterprise data warehouse data. In this env… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  49. arXiv:2409.01821  [pdf, other

    cs.CV cs.LG

    When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective

    Authors: Hsi-Ai Tsao, Lei Hsiung, Pin-Yu Chen, Tsung-Yi Ho

    Abstract: Adapting pre-trained models to new tasks can exhibit varying effectiveness across datasets. Visual prompting, a state-of-the-art parameter-efficient transfer learning method, can significantly improve the performance of out-of-distribution tasks. On the other hand, linear probing, a standard transfer learning method, can sometimes become the best approach. We propose a log-likelihood ratio (LLR) a… ▽ More

    Submitted 4 September, 2024; v1 submitted 3 September, 2024; originally announced September 2024.

  50. Learning to Discover Forgery Cues for Face Forgery Detection

    Authors: Jiahe Tian, Peng Chen, Cai Yu, Xiaomeng Fu, Xi Wang, Jiao Dai, Jizhong Han

    Abstract: Locating manipulation maps, i.e., pixel-level annotation of forgery cues, is crucial for providing interpretable detection results in face forgery detection. Related learning objects have also been widely adopted as auxiliary tasks to improve the classification performance of detectors whereas they require comparisons between paired real and forged faces to obtain manipulation maps as supervision.… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: TIFS 2024