Showing 1–50 of 1,783 results for author: Zhao, C

  1. arXiv:2410.15959  [pdf, other]

    cs.RO cs.CV

    Diffusion Transformer Policy

    Authors: Zhi Hou, Tianyi Zhang, Yuwen Xiong, Hengjun Pu, Chengyang Zhao, Ronglei Tong, Yu Qiao, Jifeng Dai, Yuntao Chen

    Abstract: Recent large visual-language action models pretrained on diverse robot datasets have demonstrated the potential for generalizing to new environments with a few in-domain data. However, those approaches usually predict discretized or continuous actions by a small action head, which limits the ability in handling diverse action spaces. In contrast, we model the continuous action with a large multi-m…

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Preprint

  2. arXiv:2410.15885  [pdf, other]

    cs.AI

    How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?

    Authors: Zuojin Tang, Bin Hu, Chenyang Zhao, De Ma, Gang Pan, Bin Liu

    Abstract: Existing large pre-trained models typically map text input to text output in an end-to-end manner, such as ChatGPT, or map a segment of text input to a hierarchy of action decisions, such as OpenVLA. However, humans can simultaneously generate text and actions when receiving specific input signals. For example, a driver can make precise driving decisions while conversing with a friend in the passe…

    Submitted 21 October, 2024; originally announced October 2024.

  3. arXiv:2410.15529  [pdf, other]

    physics.ins-det hep-ex

    Measurement of gas properties for the ion-TPC of N$ν$DEx experiment

    Authors: Tianyu Liang, Meiqiang Zhan, Hulin Wang, Xianglun Wei, Dongliang Zhang, Jun Liu, Chengui Lu, Qiang Hu, Yichen Yang, Chaosong Gao, Le Xiao, Xiangming Sun, Feng Liu, Chengxin Zhao, Hao Qiu, Kai Chen

    Abstract: In the N$ν$DEx collaboration, a high-pressure gas TPC is being developed to search for the neutrinoless double beta decay. The use of electronegative $\mathrm{^{82}SeF_{6}}$ gas mandates an ion-TPC. The reconstruction of $z$ coordinate is to be realized exploiting the feature of multiple species of charge carriers. As the initial stage of the development, we studied the properties of the…

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 10 pages, 8 figures

  4. arXiv:2410.13955  [pdf, other]

    physics.ins-det cond-mat.other

    A multi-detector neutral helium atom microscope

    Authors: Chenyang Zhao, Sam M Lambrick, Nick A von Jeinsen, Yanke Yuan, Xiaolong Zhang, Aleksandar Radić, David J Ward, John Ellis, Andrew P Jardine

    Abstract: Scanning helium microscopy (SHeM) is an emerging technique that uses a beam of neutral atoms to image and analyse surfaces. The low energies ($\sim$64 meV) and completely non-destructive nature of the probe particles provide exceptional sensitivity for studying delicate samples and thin devices, including 2D materials. To date, around five such instruments have been constructed and are described i…

    Submitted 17 October, 2024; originally announced October 2024.

  5. arXiv:2410.11046  [pdf]

    cs.IR cs.LG q-bio.QM

    SGUQ: Staged Graph Convolution Neural Network for Alzheimer's Disease Diagnosis using Multi-Omics Data

    Authors: Liang Tao, Yixin Xie, Jeffrey D Deng, Hui Shen, Hong-Wen Deng, Weihua Zhou, Chen Zhao

    Abstract: Alzheimer's disease (AD) is a chronic neurodegenerative disorder and the leading cause of dementia, significantly impacting cost, mortality, and burden worldwide. The advent of high-throughput omics technologies, such as genomics, transcriptomics, proteomics, and epigenomics, has revolutionized the molecular understanding of AD. Conventional AI approaches typically require the completion of all om…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 20 pages, 2 figures

  6. arXiv:2410.10934  [pdf, other]

    cs.AI

    Agent-as-a-Judge: Evaluate Agents with Agents

    Authors: Mingchen Zhuge, Changsheng Zhao, Dylan Ashley, Wenyi Wang, Dmitrii Khizbullin, Yunyang Xiong, Zechun Liu, Ernie Chang, Raghuraman Krishnamoorthi, Yuandong Tian, Yangyang Shi, Vikas Chandra, Jürgen Schmidhuber

    Abstract: Contemporary evaluation techniques are inadequate for agentic systems. These approaches either focus exclusively on final outcomes -- ignoring the step-by-step nature of agentic systems, or require excessive manual labour. To address this, we introduce the Agent-as-a-Judge framework, wherein agentic systems are used to evaluate agentic systems. This is an organic extension of the LLM-as-a-Judge fr…

    Submitted 16 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: The project can be found at https://github.com/metauto-ai/agent-as-a-judge. The dataset is released at https://huggingface.co/DEVAI-benchmark

  7. arXiv:2410.10198  [pdf, other]

    math.CO

    Regions of Level $\ell$ of Catalan/Semiorder-Type Arrangements

    Authors: Yanru Chen, Suijie Wang, Jinxing Yang, Chengdong Zhao

    Abstract: By establishing a labeled Dyck path model for the regions of \(\mathcal{C}_{n,A}\) and \(\mathcal{C}_{n,A}^*\), this paper explores several enumerative problems related to the number of regions of level \(\ell\), denoted as \(r_{\ell}(\mathcal{C}_{n,A})\) and \(r_{\ell}(\mathcal{C}_{n,A}^*)\), which include: (1) proving a Stirling convolution relation between \(r_{\ell}(…

    Submitted 15 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: 38 pages, 16 figures

    MSC Class: 05B35; 52C35

  8. arXiv:2410.09793  [pdf, other]

    cond-mat.mes-hall

    Energy Bands of Incommensurate Systems

    Authors: Xin-Yu Guo, Jin-Rong Chen, Chen Zhao, Miao Liang, Ying-Hai Wu, Jin-Hua Gao, X. C. Xie

    Abstract: Energy band theory is a fundamental cornerstone of condensed matter physics. According to conventional wisdom, discrete translational symmetry is mandatory for defining energy bands. Here, we illustrate that, in fact, the concept of energy band can be generalized to incommensurate systems lacking such symmetry, thus transcending the traditional paradigm of energy band. The validity of our theory i…

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 8 pages, 3 figures

  9. arXiv:2410.09151  [pdf, other]

    astro-ph.HE

    A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, et al. (1758 additional authors not shown)

    Abstract: The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by…

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 15 pages of text including references, 4 figures, 5 tables

    Report number: LIGO-P2400192

  10. arXiv:2410.06534  [pdf, other]

    q-bio.NC cs.LG

    EEG-estimated functional connectivity, and not behavior, differentiates Parkinson's patients from health controls during the Simon conflict task

    Authors: Xiaoxiao Sun, Chongkun Zhao, Sharath Koorathota, Paul Sajda

    Abstract: Neural biomarkers that can classify or predict disease are of broad interest to the neurological and psychiatric communities. Such biomarkers can be informative of disease state or treatment efficacy, even before there are changes in symptoms and/or behavior. This work investigates EEG-estimated functional connectivity (FC) as a Parkinson's Disease (PD) biomarker. Specifically, we investigate FC m…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: This work is accepted at IEEE EMBC 2024. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information

  11. arXiv:2410.05640  [pdf, ps, other]

    math.DS

    Non-dense orbits on topological dynamical systems

    Authors: Cao Zhao, Jiao Yang, Xiaoyao Zhou

    Abstract: Let $(X,d,T )$ be a topological dynamical system with the specification property. We consider the non-dense orbit set $E(z_0)$ and show that for any non-transitive point $z_0\in X$, this set $E(z_0)$ is empty or carries full topological pressure.

    Submitted 7 October, 2024; originally announced October 2024.

  12. arXiv:2410.04342  [pdf, other]

    cs.CV

    Accelerating Inference of Networks in the Frequency Domain

    Authors: Chenqiu Zhao, Guanfang Dong, Anup Basu

    Abstract: It has been demonstrated that networks' parameters can be significantly reduced in the frequency domain with a very small decrease in accuracy. However, given the cost of frequency transforms, the computational complexity is not significantly decreased. In this work, we propose performing network inference in the frequency domain to speed up networks whose frequency parameters are sparse. In parti…

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: accepted by ACM Multimedia Asia 2024

  13. arXiv:2410.04232  [pdf, other]

    cs.HC

    Be There, Be Together, Be Streamed! AR Scenic Live-Streaming for an Interactive and Collective Experience

    Authors: Zeyu Huang, Zuyu Xu, Yuanhao Zhang, Chengzhong Liu, Yanwei Zhao, Chuhan Shi, Jason Chen Zhao, Xiaojuan Ma

    Abstract: Scenic Live-Streaming (SLS), capturing real-world scenic sites from fixed cameras without streamers, combines scene immersion and the social and real-time characteristics of live-streaming into a unique experience. However, existing SLS affords limited audience interactions to engage them in a collective experience compared to many other live-streaming genres. It is also difficult for SLS to recre…

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: 4 pages, 2 figures, to appear in the adjunct proceedings of ISMAR 2024 and the ISMAR 2024 conference

  14. arXiv:2410.03083  [pdf, other]

    cs.CL cs.AI

    Scaling Parameter-Constrained Language Models with Quality Data

    Authors: Ernie Chang, Matteo Paltenghi, Yang Li, Pin-Jie Lin, Changsheng Zhao, Patrick Huber, Zechun Liu, Rastislav Rabatin, Yangyang Shi, Vikas Chandra

    Abstract: Scaling laws in language modeling traditionally quantify training loss as a function of dataset size and model parameters, providing compute-optimal estimates but often neglecting the impact of data quality on model generalization. In this paper, we extend the conventional understanding of scaling law by offering a microscopic view of data quality within the original formulation -- effective train…

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP 2024 Industry Track, 18 pages, 9 figures, 4 tables

  15. arXiv:2410.00768  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci

    High Mobility SiGe/Ge 2DHG Heterostructure Quantum Wells for Semiconductor Hole Spin Qubits

    Authors: Zhenzhen Kong, Zonghu Li, Yuchen Zhou, Gang Cao, Hai-Ou Li, Jiale Su, Yiwen Zhang, Jinbiao Liu, Guo-Ping Guo, Junfeng Li, Jun Luo, Chao Zhao, Tianchun Ye, Guilei Wang

    Abstract: Strong spin-orbit coupling and relatively weak hyperfine interactions make germanium hole spin qubits a promising candidate for semiconductor quantum processors. The two-dimensional hole gas structure of strained Ge quantum wells serves as the primary material platform for spin hole qubits. A low disorder material environment is essential for this process. In this work, we fabricated a Ge/SiGe hete…

    Submitted 1 October, 2024; originally announced October 2024.

  16. arXiv:2409.20558  [pdf, other]

    cs.CV

    Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection

    Authors: Yubin Wang, Zhikang Zou, Xiaoqing Ye, Xiao Tan, Errui Ding, Cairong Zhao

    Abstract: We present Uni$^2$Det, a brand new framework for unified and universal multi-dataset training on 3D detection, enabling robust performance across diverse domains and generalization to unseen domains. Due to substantial disparities in data distribution and variations in taxonomy across diverse domains, training such a detector by simply merging datasets poses a significant challenge. Motivated by t…

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: 13 pages, 5 figures, 6 tables

  17. arXiv:2409.20461  [pdf, other]

    physics.app-ph cond-mat.mtrl-sci

    Helium atom micro-diffraction as a characterisation tool for 2D materials

    Authors: Nick von Jeinsen, Aleksandar Radic, Ke Wang, Chenyang Zhao, Vivian Perez, Yiru Zhu, Manish Chhowalla, Andrew Jardine, David Ward, Sam Lambrick

    Abstract: We present helium atom micro-diffraction as an ideal technique for characterization of 2D materials due to its ultimate surface sensitivity combined with sub-micron spatial resolution. Thermal energy neutral helium scatters from the valence electron density, 2-3 Å above the ionic cores of a surface, making the technique ideal for studying 2D materials, where other approaches can struggle due to sma…

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: Draft version, 11 pages, 6 figures, 2 tables

  18. arXiv:2409.19624  [pdf, other]

    cs.CV cs.AI

    Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection

    Authors: Yuhang Ma, Wenting Xu, Chaoyi Zhao, Keqiang Sun, Qinfeng Jin, Zeng Zhao, Changjie Fan, Zhipeng Hu

    Abstract: Recent advances in text-to-image diffusion models have spurred significant interest in continuous story image generation. In this paper, we introduce Storynizor, a model capable of generating coherent stories with strong inter-frame character consistency, effective foreground-background separation, and diverse pose variation. The core innovation of Storynizor lies in its key modules: ID-Synchroniz…

    Submitted 29 September, 2024; originally announced September 2024.

  19. arXiv:2409.17795  [pdf, other]

    cs.CE

    Physics-driven complex relaxation for multi-body systems of SPH method

    Authors: Chenxi Zhao, Yongchuan Yu, Oskar J. Haidn, Xiangyu Hu

    Abstract: In the smoothed particle hydrodynamics (SPH) method, the characteristics of a target particle are interpolated based on the information from its neighboring particles. Consequently, a uniform initial distribution of particles significantly enhances the accuracy of SPH calculations. This aspect is particularly critical in Eulerian SPH, where particles are stationary throughout the simulation. To address…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 38 pages and 25 figures

  20. arXiv:2409.16682  [pdf, other]

    cs.CL

    SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA

    Authors: Siyue Zhang, Anh Tuan Luu, Chen Zhao

    Abstract: Text-to-SQL parsing and end-to-end question answering (E2E TQA) are two main approaches for Table-based Question Answering task. Despite success on multiple benchmarks, they have yet to be compared and their synergy remains unexplored. In this paper, we identify different strengths and weaknesses through evaluating state-of-the-art models on benchmark datasets: Text-to-SQL demonstrates superiority…

    Submitted 29 September, 2024; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: EMNLP 2024

  21. arXiv:2409.16332  [pdf]

    q-bio.QM

    SOWAHA as a Cancer Suppressor Gene Influence Metabolic Reprogramming

    Authors: Xiaohong Yi, Xianwen Zhang, Claire H. Zhao, Yuhui Chen, Lijun Huang, Hua Zhong, Yumei Wang

    Abstract: SOWAHA is a protein-coding gene, also known as ANKRD43. Studies have indicated that SOWAHA can serve as a prognostic biomarker in colorectal cancer and pancreatic cancer. However, there are few reports about SOWAHA in other types of cancer and the specific mechanism of action of SOWAHA in cancer is also not clear. Based on National Center for Biotechnology Information (NCBI), The Cancer Genome Atl…

    Submitted 2 October, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 37 pages, 17 figures

  22. arXiv:2409.16280  [pdf, other]

    cs.CV

    MonoFormer: One Transformer for Both Diffusion and Autoregression

    Authors: Chuyang Zhao, Yuxing Song, Wenhao Wang, Haocheng Feng, Errui Ding, Yifan Sun, Xinyan Xiao, Jingdong Wang

    Abstract: Most existing multimodality methods use separate backbones for autoregression-based discrete text generation and diffusion-based continuous visual generation, or the same backbone by discretizing the visual data to use autoregression for both text and visual generation. In this paper, we propose to study a simple idea: share one transformer for both autoregression and diffusion. The feasibility co…

    Submitted 24 September, 2024; originally announced September 2024.

  23. arXiv:2409.16226  [pdf, other]

    physics.flu-dyn

    Liquid sloshing behaviours in an elastic tank and suppression effect of baffles

    Authors: Chenxi Zhao, Yan Wu, Yongchuan Yu, Oskar J. Haidn, Xiangyu Hu

    Abstract: In this paper, a fluid-structure interaction (FSI) framework based on the smoothed particle hydrodynamics (SPH) method is employed to investigate the forces and deformations experienced by LNG tanks during liquid sloshing. As a Lagrangian approach, the SPH method offers the advantage of accurately modelling free-surface flow. The fluid phase consisting of water and air is modelled as a multi-phase…

    Submitted 24 September, 2024; originally announced September 2024.

  24. arXiv:2409.15750  [pdf, other]

    cs.LG cs.AI cs.ET

    The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles

    Authors: Hanwen Zhang, Dusit Niyato, Wei Zhang, Changyuan Zhao, Hongyang Du, Abbas Jamalipour, Sumei Sun, Yiyang Pei

    Abstract: With the advancement of generative artificial intelligence (GenAI) models, their capability to generate content is seeing significant enhancement, leading to widespread applications in the field of data generation and forecasting. Furthermore, GenAI has strong capabilities in data modeling and analysis, which enhances Internet of electric vehicles (IoEV) applications in various aspects. In this pa…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 25 Pages

  25. arXiv:2409.15540  [pdf, ps, other]

    math.AP

    On $p(x)$-Laplacian equations in $\mathbb{R}^{N}$ with nonlinearity sublinear at zero

    Authors: Shibo Liu, Chunshan Zhao

    Abstract: Let $p,q$ be functions on $\mathbb{R}^{N}$ satisfying $1\ll q\ll p\ll N$, we consider $p(x)$-Laplacian problems of the form \[ \left\{ \begin{array}{l} -\Delta_{p(x)}u+V(x)\vert u\vert^{p(x)-2}u=\lambda\vert u\vert^{q(x)-2}u+g(x,u),\\ u\in W^{1,p(x)}(\mathbb{R}^{N}). \end{array} \right. \] To apply variational methods, we introduce a subspace $X$ of $W^{1,p(x)}(\mathbb{R}^N)$ as our wor…

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 14 pages

    MSC Class: Primary 35J60; Secondary 35D05

  26. arXiv:2409.14922  [pdf, ps, other]

    eess.SP

    Hybrid Beamforming and Waveform Design for Over-the-air Integrated Signal

    Authors: Chonghao Zhao

    Abstract: The future wireless communications are expected to provide new use scenarios with emerging techniques. This paper focuses on vehicle to everything (V2X) network, where vehicles should cooperatively implement information obtaining, data sharing, and information postprocessing. Conventionally, the above three operations are considered in different layers or separated waveforms, leading to unavoidabl…

    Submitted 23 September, 2024; originally announced September 2024.

  27. arXiv:2409.14827  [pdf, other]

    cs.CV cs.HC cs.MM

    AIM 2024 Challenge on Video Saliency Prediction: Methods and Results

    Authors: Andrey Moskalenko, Alexey Bryncev, Dmitry Vatolin, Radu Timofte, Gen Zhan, Li Yang, Yunlong Tang, Yiting Liao, Jiongzhi Lin, Baitao Huang, Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo, Yuxin Zhu, Yinan Sun, Huiyu Duan, Yuqin Cao, Ziheng Jia, Qiang Hu, Xiongkuo Min, Guangtao Zhai, Hao Fang, et al. (8 additional authors not shown)

    Abstract: This paper reviews the Challenge on Video Saliency Prediction at AIM 2024. The goal of the participants was to develop a method for predicting accurate saliency maps for the provided set of video sequences. Saliency maps are widely exploited in various applications, including video compression, quality assessment, visual perception studies, the advertising industry, etc. For this competition, a pr…

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: ECCVW 2024

    ACM Class: I.4.6; I.2.10

  28. arXiv:2409.14772  [pdf, other]

    cond-mat.mtrl-sci

    Domino-like magnetic phase transition induced by a bias voltage in FeRh thin film

    Authors: Huiliang Wu, Jianbo Wang, Chenbo Zhao, Qingfeng Zhan, Jiangtao Xue, Senfu Zhang, Jinwu Wei, Xiangqian Wang, Qingfang Liu

    Abstract: The first-order magnetic phase transition (MPT) usually happens with a very wide magnetic field range about tens of thousands Oersted which hinders its applications. In this work, we induce a domino-like MPT via introducing a bias voltage in FeRh thin film and thus realize a large narrowing of transition magnetic field range from 6*10^4 Oe to lower than 2*10^3 Oe at room temperature. Furthermore,…

    Submitted 23 September, 2024; originally announced September 2024.

  29. arXiv:2409.14705  [pdf, other]

    cs.CL cs.AI

    Target-Aware Language Modeling via Granular Data Sampling

    Authors: Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra

    Abstract: Language model pretraining generally targets a broad range of use cases and incorporates data from diverse sources. However, there are instances where we desire a model that excels in specific areas without markedly compromising performance in other areas. A cost-effective and straightforward approach is sampling with low-dimensional data features, which allows to select large-scale pretraining da…

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: Accepted to EMNLP 2024 Main Conference, 9 pages, 6 figures, 3 tables

  30. arXiv:2409.14441  [pdf, other]

    eess.SP

    BUPTCMCC-6G-CMG+: A GBSM-Based ISAC Channel Model Simulator

    Authors: Changsheng Zhao, Yuxiang Zhang, Heng Wang, Lei Tian, Jianhua Zhang, Hanyuan Jiang

    Abstract: Integrated Sensing and Communication (ISAC) is one of the key technologies in 6G, and related research and standardization efforts are progressing vigorously. Wireless channel simulation is the cornerstone for the evaluation and optimization of wireless communication technologies. This paper proposes a design and implementation method for an ISAC channel simulation based on a Geometry-Based Stocha…

    Submitted 17 October, 2024; v1 submitted 22 September, 2024; originally announced September 2024.

    Comments: 12 pages, 5 figures, 2 tables

  31. arXiv:2409.14365  [pdf, other]

    cs.RO

    D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation

    Authors: Songlin Wei, Haoran Geng, Jiayi Chen, Congyue Deng, Wenbo Cui, Chengyang Zhao, Xiaomeng Fang, Leonidas Guibas, He Wang

    Abstract: Depth sensing is an important problem for 3D vision-based robotics. Yet, a real-world active stereo or ToF depth camera often produces noisy and incomplete depth which bottlenecks robot performances. In this work, we propose D3RoMa, a learning-based depth estimation framework on stereo image pairs that predicts clean and accurate depth in diverse indoor scenes, even in the most challenging scenari…

    Submitted 24 September, 2024; v1 submitted 22 September, 2024; originally announced September 2024.

  32. arXiv:2409.14031  [pdf, other]

    eess.SP

    Signal Detection in Near-field Communication with Unknown Noise Characteristics: A Diffusion Model Method

    Authors: Changyuan Zhao, Jiacheng Wang, Ruichen Zhang, Dusit Niyato, Dong In Kim, Hongyang Du

    Abstract: In this letter, we present a diffusion model method for signal detection in near-field communication with unknown noise characteristics. We consider an uplink transmission of a near-field MIMO communication system consisting of multiple mobile terminals and one base station with multiple antennas. Then, we propose a Maximum Likelihood Estimation Diffusion Detector (MLEDD) aiming at learning the d…

    Submitted 21 September, 2024; originally announced September 2024.

    Comments: 5 pages, 3 figures

  33. arXiv:2409.13763  [pdf]

    physics.gen-ph

    Can thermal nonreciprocity improve the radiative cooling efficiency?

    Authors: Mengqi Liu, Shenghao Jin, Chenglong Zhou, Boxiang Wang, Changying Zhao, Cheng-Wei Qiu

    Abstract: Can thermal nonreciprocity improve the radiative cooling efficiency? Probably not.

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 12 pages, 3 figures

  34. arXiv:2409.12814  [pdf]

    physics.optics physics.app-ph

    GeSn 320 $\times$ 256 Focal Plane Array for Silicon-Based Short-wave Infrared Imaging

    Authors: Guoyin Xu, Hui Cong, Yue Li, Zhengjie Wu, Fenghe Fu, Ping Chen, Chao Zhao, Chi Xu, Chunlai Xue

    Abstract: Short-wave infrared (SWIR) imaging arrays have demonstrated great potential in applications spanning from military to civilian consumer electronics. However, the current focal plane arrays (FPAs), which are based on compound semiconductors, have limited applications in civilian circumstances due to elevated manufacturing costs and prolonged fabrication cycle time. To address this, a high-performan…

    Submitted 19 September, 2024; originally announced September 2024.

  35. arXiv:2409.11927  [pdf, other]

    astro-ph.HE

    A Revised Spin of the Black Hole in GRS 1716-249 with a New Distance

    Authors: S. J. Zhao, L. Tao, Q. Q. Yin, S. N. Zhang, R. C. Ma, P. P. Li, Q. C. Zhao, M. Y. Ge, L. Zhang, J. L. Qu, S. Zhang, X. Ma, Y. Huang, J. Q. Peng, Y. X. Xiao

    Abstract: GRS 1716-249 is a stellar-mass black hole in a low-mass X-ray binary that underwent a giant outburst in 2016/17. In this paper we use simultaneous observations of Insight-HXMT and NuSTAR to determine its basic parameters. The observations were performed during the softest part of the outburst, and the spectra show clear thermal disk emission and reflection features. We have fitted the X-ray energy…

    Submitted 18 September, 2024; originally announced September 2024.

  36. arXiv:2409.11234  [pdf, other]

    cs.CV

    STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking

    Authors: Jianbo Ma, Chuanming Tang, Fei Wu, Can Zhao, Jianlin Zhang, Zhiyong Xu

    Abstract: Multiple object tracking (MOT) in Unmanned Aerial Vehicle (UAV) videos is important for diverse applications in computer vision. Current MOT trackers rely on accurate object detection results and precise matching of target reidentification (ReID). These methods focus on optimizing target spatial attributes while overlooking temporal cues in modelling object relationships, especially for challengin…

    Submitted 17 September, 2024; originally announced September 2024.

  37. arXiv:2409.11169  [pdf, other]

    eess.IV cs.AI cs.CV

    MAISI: Medical AI for Synthetic Imaging

    Authors: Pengfei Guo, Can Zhao, Dong Yang, Ziyue Xu, Vishwesh Nath, Yucheng Tang, Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, Daguang Xu

    Abstract: Medical imaging analysis faces challenges such as data scarcity, high annotation costs, and privacy concerns. This paper introduces the Medical AI for Synthetic Imaging (MAISI), an innovative approach using the diffusion model to generate synthetic 3D computed tomography (CT) images to address those challenges. MAISI leverages the foundation volume compression network and the latent diffusion mode…

    Submitted 13 September, 2024; originally announced September 2024.

  38. arXiv:2409.09250  [pdf, ps, other]

    math.OC

    Optimal Adaptive Control of Linear Stochastic Systems with Quadratic Cost Function

    Authors: Nian Liu, Cheng Zhao, Shaolin Tan, Jinhu Lü

    Abstract: In this paper, we consider the adaptive linear quadratic Gaussian control problem, where both the linear transformation matrix of the state $A$ and the control gain matrix $B$ are unknown. The proposed adaptive optimal control only assumes that $(A, B)$ is stabilizable and $(A, Q^{1/2})$ is detectable, where $Q$ is the weighting matrix of the state in the quadratic cost function. This condition si…

    Submitted 13 September, 2024; originally announced September 2024.

  39. arXiv:2409.05493  [pdf, other]

    cs.RO

    DexDiff: Towards Extrinsic Dexterity Manipulation of Ungraspable Objects in Unrestricted Environments

    Authors: Chengzhong Ma, Houxue Yang, Hanbo Zhang, Zeyang Liu, Chao Zhao, Jian Tang, Xuguang Lan, Nanning Zheng

    Abstract: Grasping large and flat objects (e.g. a book or a pan) is often regarded as an ungraspable task, which poses significant challenges due to the unreachable grasping poses. Previous works leverage Extrinsic Dexterity like walls or table edges to grasp such objects. However, they are limited to task-specific policies and lack task planning to find pre-grasp conditions. This makes it difficult to adap…

    Submitted 9 September, 2024; originally announced September 2024.

  40. arXiv:2409.04754  [pdf, ps, other]

    math.AG

    Minimal extension property of direct images

    Authors: Chen Zhao

    Abstract: Given a projective morphism $f:X\to Y$ from a complex space to a complex manifold, we prove the Griffiths semi-positivity and minimal extension property of the direct image sheaf $f_\ast(\mathscr{F})$. Here, $\mathscr{F}$ is a coherent sheaf on $X$, which consists of the Grauert-Riemenschneider dualizing sheaf, a multiplier ideal sheaf, and a variation of Hodge structure (or more generally, a tame…

    Submitted 7 September, 2024; originally announced September 2024.

    Comments: Comments are welcome

  41. Users' Perspectives on Multimodal Menstrual Tracking Using Consumer Health Devices

    Authors: Georgianna Lin, Brenna Li, Helen Li, Chloe Zhao, Khai N Truong, Alex Mariakakis

    Abstract: Previous menstrual health literature highlights a variety of signals not included in existing menstrual trackers because they are either difficult to gather or are not typically associated with menstrual health. Since it has become increasingly convenient to collect biomarkers through wearables and other consumer-grade devices, our work examines how people incorporate unconventional signals (e.g.,…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 25 pages, 4 figures, 2 tables. The paper was accepted by IMWUT/Ubicomp 2024

  42. arXiv:2409.02877  [pdf, other]

    cs.AI cs.CL cs.LG

    Configurable Foundation Models: Building LLMs from a Modular Perspective

    Authors: Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun

    Abstract: Advancements in LLMs have recently unveiled challenges tied to computational efficiency and continual scalability due to their requirements of huge parameters, making the applications and evolution of these models on devices with limited computation resources and scenarios requiring various abilities increasingly cumbersome. Inspired by modularity within the human brain, there is a growing tendenc…

    Submitted 4 September, 2024; originally announced September 2024.

  43. arXiv:2409.02132  [pdf, other]

    quant-ph cs.LG

    Recognition of Schrodinger cat state based on CNN

    Authors: Tao Zhang, Chaoying Zhao

    Abstract: We applied convolutional neural networks to the classification of cat states and coherent states. Initially, we generated datasets of Schrodinger cat states and coherent states from nonlinear processes and preprocessed these datasets. Subsequently, we constructed both LeNet and ResNet network architectures, adjusting parameters such as convolution kernels and strides to optimal values. We then tra…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 6 pages, 5 figures

  44. arXiv:2409.01822  [pdf, other]

    physics.flu-dyn

    Capillary-driven migration of droplets on conical fibers

    Authors: Yixiao Mao, Chengxi Zhao, Kai Mu, Kai Li, Ting Si

    Abstract: A droplet placed on a hydrophilic conical fiber tends to move toward the end of larger radii due to capillary action. Experimental investigations are performed to explore the dynamics of droplets with varying viscosities and volumes on different fibers at the microscale. Droplets are found to accelerate initially and subsequently decelerate during migration. A dynamic model is developed to capture…

    Submitted 3 September, 2024; originally announced September 2024.

  45. arXiv:2409.01557  [pdf, other]

    cs.CV

    TASL-Net: Tri-Attention Selective Learning Network for Intelligent Diagnosis of Bimodal Ultrasound Video

    Authors: Chengqian Zhao, Zhao Yao, Zhaoyu Hu, Yuanxin Xie, Yafang Zhang, Yuanyuan Wang, Shuo Li, Jianhua Zhou, Jianqiao Zhou, Yin Wang, Jinhua Yu

    Abstract: In the intelligent diagnosis of bimodal (gray-scale and contrast-enhanced) ultrasound videos, medical domain knowledge such as the way sonographers browse videos, the particular areas they emphasize, and the features they pay special attention to, plays a decisive role in facilitating precise diagnosis. Embedding medical knowledge into the deep learning network can not only enhance performance but…

    Submitted 2 September, 2024; originally announced September 2024.

  46. arXiv:2409.00060  [pdf, other]

    cs.CL

    Understanding Literary Texts by LLMs: A Case Study of Ancient Chinese Poetry

    Authors: Cheng Zhao, Bin Wang, Zhen Wang

    Abstract: The birth and rapid development of large language models (LLMs) have caused quite a stir in the field of literature. Once considered unattainable, AI's role in literary creation is increasingly becoming a reality. In genres such as poetry, jokes, and short stories, numerous AI tools have emerged, offering refreshing new perspectives. However, it's difficult to further improve the quality of these…

    Submitted 11 September, 2024; v1 submitted 22 August, 2024; originally announced September 2024.

  47. arXiv:2408.17224  [pdf, other]

    hep-ex

    Hadronic cross section measurements with the DAMPE space mission using 20GeV-10TeV cosmic-ray protons and $^4$He

    Authors: F. Alemanno, Q. An, P. Azzarello, F. C. T. Barbato, P. Bernardini, X. J. Bi, I. Cagnoli, M. S. Cai, E. Casilli, E. Catanzani, J. Chang, D. Y. Chen, J. L. Chen, Z. F. Chen, P. Coppin, M. Y. Cui, T. S. Cui, Y. X. Cui, H. T. Dai, A. De Benedittis, I. De Mitri, F. de Palma, A. Di Giovanni, Q. Ding, T. K. Dong, et al. (126 additional authors not shown)

    Abstract: Precise direct cosmic-ray (CR) measurements provide an important probe to study the energetic particle sources in our Galaxy, and the interstellar environment through which these particles propagate. Uncertainties on hadronic models, ion-nucleon cross sections in particular, are currently the limiting factor towards obtaining more accurate CR ion flux measurements with calorimetric space-based exp…

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 17 pages, submitted to PRD

  48. arXiv:2408.17167  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Highly Efficient and Stable Perovskite Solar Cells via MultiFunctional Curcumin Modified Buried Interface

    Authors: Xianhu Wu, Jieyu Bi, Guanglei Cu, Nian Liu, Gaojie Xia, Jilong Sun, Jiaxin Jiang, Ning Lu, Ping Li, Chunyi Zhao, Zewen Zuo, Min Gu

    Abstract: The buried interface between the electron transport layer and the perovskite layer suffers from severe interface defects and imperfect energy level alignment. To address this issue, this study employs a multifunctional organic molecule, curcumin, to modify the interface between SnO2 and the perovskite layer. The functional groups on curcumin effectively passivate the defects on both sides of the i…

    Submitted 30 August, 2024; originally announced August 2024.

  49. arXiv:2408.16365  [pdf, ps, other]

    cs.IT

    Protograph-Based Batched Network Codes

    Authors: Mingyang Zhu, Ming Jiang, Chunming Zhao

    Abstract: Batched network codes (BNCs) are a low-complexity solution for communication through networks with packet loss. Although their belief propagation (BP) performance is proved to approach capacity in the asymptotic regime, there is no evidence indicating that their BP performance is as good as expected in the finite-length regime. In this paper, we propose a protograph-based construction for BNCs, re…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: submitted to IEEE for possible publication

  50. arXiv:2408.15664  [pdf, other]

    cs.LG cs.CL

    Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

    Authors: Lean Wang, Huazuo Gao, Chenggang Zhao, Xu Sun, Damai Dai

    Abstract: For Mixture-of-Experts (MoE) models, an unbalanced expert load will lead to routing collapse or increased computational overhead. Existing methods commonly employ an auxiliary loss to encourage load balance, but a large auxiliary loss will introduce non-negligible interference gradients into training and thus impair the model performance. In order to control load balance while not producing undesi…

    Submitted 28 August, 2024; originally announced August 2024.