Skip to main content

Showing 1–50 of 6,016 results for author: Li, W

  1. ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos

    Authors: Tao Tang, Hong Liu, Yingxuan You, Ti Wang, Wenhao Li

    Abstract: Although existing video-based 3D human mesh recovery methods have made significant progress, simultaneously estimating human pose and shape from low-resolution image features limits their performance. These image features lack sufficient spatial information about the human body and contain various noises (e.g., background, lighting, and clothing), which often results in inaccurate pose and inconsi… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: Accepted by ACM MM 2024. Project page: https://github.com/TangTao-PKU/ARTS

  2. arXiv:2410.15581  [pdf, other

    cs.CV cs.LG

    Multimodal Learning for Embryo Viability Prediction in Clinical IVF

    Authors: Junsik Kim, Zhiyi Shi, Davin Jeong, Johannes Knittel, Helen Y. Yang, Yonghyun Song, Wanhua Li, Yicong Li, Dalit Ben-Yosef, Daniel Needleman, Hanspeter Pfister

    Abstract: In clinical In-Vitro Fertilization (IVF), identifying the most viable embryo for transfer is important to increasing the likelihood of a successful pregnancy. Traditionally, this process involves embryologists manually assessing embryos' static morphological features at specific intervals using light microscopy. This manual evaluation is not only time-intensive and costly, due to the need for expe… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: Accepted to MICCAI 2024

  3. arXiv:2410.15403  [pdf

    cs.CV cs.AI

    MMCS: A Multimodal Medical Diagnosis System Integrating Image Analysis and Knowledge-based Departmental Consultation

    Authors: Yi Ren, HanZhi Zhang, Weibin Li, Diandong Liu, Tianyi Zhang, Jie He

    Abstract: We present MMCS, a system capable of recognizing medical images and patient facial details, and providing professional medical diagnoses. The system consists of two core components: The first component is the analysis of medical images and videos. We trained a specialized multimodal medical model capable of interpreting medical images and accurately analyzing patients' facial emotions and facial p… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  4. arXiv:2410.15262  [pdf, other

    cs.IR cs.AI

    HyQE: Ranking Contexts with Hypothetical Query Embeddings

    Authors: Weichao Zhou, Jiaxin Zhang, Hilaf Hasson, Anu Singh, Wenchao Li

    Abstract: In retrieval-augmented systems, context ranking techniques are commonly employed to reorder the retrieved contexts based on their relevance to a user query. A standard approach is to measure this relevance through the similarity between contexts and queries in the embedding space. However, such similarity often fails to capture the relevance. Alternatively, large language models (LLMs) have been u… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

  5. arXiv:2410.15121  [pdf, other

    quant-ph physics.chem-ph

    Simulating and investigating various dynamic aspects of $\rm{H}_2\rm{O}$-related hydrogen bond model

    Authors: Jiangchuan You, Ran Chen, Wanshun Li, Hui-hui Miao, Yuri Igorevich Ozhigov

    Abstract: A simple $\rm{H}_2\rm{O}$-related hydrogen bond model, modified from the Jaynes-Cummings model, is proposed and its various dynamic aspects are investigated theoretically. In this model, the formation and breaking processes of hydrogen bond are accompanied by the creation and annihilation of the thermal phonon of the medium. A number of simplifying assumptions about the dynamics of the molecules i… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 15 pages, 11 figures, 1 table

  6. arXiv:2410.15006  [pdf, other

    math.NA

    Nonconvex Robust Quaternion Matrix Completion for Imaging Processing

    Authors: Baohua Huang, Jiakai Chen, Wen Li

    Abstract: One of the tasks in color image processing and computer vision is to recover clean data from partial observations corrupted by noise. To this end, robust quaternion matrix completion (QMC) has recently attracted more attention and shown its effectiveness, whose convex relaxation is to minimize the quaternion nuclear norm plus the quaternion $L_1$-norm. However, there is still room to improve due t… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

  7. arXiv:2410.14530  [pdf, other

    nlin.CD cond-mat.dis-nn nlin.AO

    Multistable Synaptic Plasticity induces Memory Effects and Cohabitation of Chimera and Bump States in Leaky Integrate-and-Fire Networks

    Authors: Astero Provata, Yannis Almirantis, Wentian Li

    Abstract: Chimera states and bump states are collective synchronization phenomena observed independently (at different parameter regions) in networks of coupled nonlinear oscillators. And while chimera states are characterized by coexistence of coherent and incoherent domains, bump states consist of active domains operating on a silent background. Multistable plasticity in the network connections originates… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 22 pages, 7 figures

  8. arXiv:2410.13948  [pdf, other

    cs.AI

    The KnowWhereGraph Ontology

    Authors: Cogan Shimizu, Shirly Stephe, Adrita Barua, Ling Cai, Antrea Christou, Kitty Currier, Abhilekha Dalal, Colby K. Fisher, Pascal Hitzler, Krzysztof Janowicz, Wenwen Li, Zilong Liu, Mohammad Saeid Mahdavinejad, Gengchen Mai, Dean Rehberger, Mark Schildhauer, Meilin Shi, Sanaz Saki Norouzi, Yuanyuan Tian, Sizhe Wang, Zhangyu Wang, Joseph Zalewski, Lu Zhou, Rui Zhu

    Abstract: KnowWhereGraph is one of the largest fully publicly available geospatial knowledge graphs. It includes data from 30 layers on natural hazards (e.g., hurricanes, wildfires), climate variables (e.g., air temperature, precipitation), soil properties, crop and land-cover types, demographics, and human health, various place and region identifiers, among other themes. These have been leveraged through t… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  9. arXiv:2410.13912  [pdf

    cs.SI physics.soc-ph

    A spatiotemporal knowledge graph-based method for identifying individual activity locations from mobile phone data

    Authors: Jian Li, Tian Gan, Weifeng Li, Yuhang Liu

    Abstract: In recent years, mobile phone data has been widely used for human mobility analytics. Identifying individual activity locations is the fundamental step for mobile phone data processing. Current methods typically aggregate spatially adjacent location records over multiple days to identify activity locations. However, only considering spatial relationships while overlooking temporal ones may lead to… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 24 pages, 10 figures, 1 table

  10. arXiv:2410.13606  [pdf, ps, other

    math.RT math.NT

    Arthur packets for metaplectic groups

    Authors: Wen-Wei Li

    Abstract: For metaplectic groups over a local field of characteristic zero, we define the Arthur packet attached to any Arthur parameter $ψ$ as a multi-set of unitary genuine irreducible representations, characterized by endoscopic character relations. Over number fields, we obtain a multiplicity formula for the genuine discrete $L^2$-automorphic spectrum in terms of global Arthur parameters and $ε$-factors… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 90 pages

    MSC Class: 22E50 (Primary) 11F70; 11F72 (Secondary)

  11. arXiv:2410.13565  [pdf, other

    cs.CV

    SDI-Paste: Synthetic Dynamic Instance Copy-Paste for Video Instance Segmentation

    Authors: Sahir Shrestha, Weihao Li, Gao Zhu, Nick Barnes

    Abstract: Data augmentation methods such as Copy-Paste have been studied as effective ways to expand training datasets while incurring minimal costs. While such methods have been extensively implemented for image level tasks, we found no scalable implementation of Copy-Paste built specifically for video tasks. In this paper, we leverage the recent growth in video fidelity of generative models to explore eff… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  12. arXiv:2410.13515  [pdf, other

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  13. arXiv:2410.13478  [pdf, other

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  14. arXiv:2410.13368  [pdf, other

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Utilizing 4.5${~\rm{fb}}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  15. arXiv:2410.13344  [pdf, other

    cs.CL cs.AI

    Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement

    Authors: Yuxuan Liu, Wenyuan Li, Laizhong Cui, Hailiang Yang

    Abstract: Large language models (LLMs) often face a bottleneck in inference speed due to their reliance on auto-regressive decoding. Recently, parallel decoding has shown significant promise in enhancing inference efficiency. However, we have identified two key issues with existing parallel decoding frameworks: (1) decoding heads fail to balance prediction accuracy and the parallelism of execution, and (2)… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  16. arXiv:2410.13205  [pdf, ps, other

    math.AP

    On the Boltzmann equation with soft potentials: Existence, uniqueness and smoothing effect of mild solutions

    Authors: Ling-Bing He, Jie Ji, Wei-Xi Li

    Abstract: We consider the spatially inhomogeneous Boltzmann equation without angular cutoff for soft potentials. For any given initial datum such that the mass, energy and entropy densities are bounded and the mass is away from vacuum, we establish the local-in-time existence and uniqueness of mild solutions, and further provide the first result on sharp smoothing effect in analytic space or Gevrey space fo… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 60pages,0 figure

    MSC Class: 35B65; 35Q20

  17. arXiv:2410.12847  [pdf, other

    cs.CL cs.AI

    ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning

    Authors: Yu-Chen Lin, Wei-Hua Li, Jun-Cheng Chen, Chu-Song Chen

    Abstract: Prompt Tuning has been a popular Parameter-Efficient Fine-Tuning method attributed to its remarkable performance with few updated parameters on various large-scale pretrained Language Models (PLMs). Traditionally, each prompt has been considered indivisible and updated independently, leading the parameters increase proportionally as prompt length grows. To address this issue, we propose Adaptive C… ▽ More

    Submitted 17 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: EMNLP Findings 2024

  18. arXiv:2410.12620  [pdf, other

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  19. arXiv:2410.12291  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Highly anisotropic Drude-weight-reduction and enhanced linear-dichroism in van der Waals Weyl semimetal Td-MoTe2 with coherent interlayer electronic transport

    Authors: Bo Su, Weikang Wu, Jianzhou Zhao, Xiutong Deng, Wenhui Li, Shengyuan A. Yang, Youguo Shi, Qiang Li, Jianlin Luo, Genda Gu, Zhi-Guo Chen

    Abstract: Weyl semimetal (WSM) states can be achieved by breaking spatial-inversion symmetry or time reversal symmetry. However, the anisotropy of the energy reduction contributing to the emergence of WSM states has seldom been investigated by experiments. A van der Waals metal MoTe2 exhibits a type-II WSM phase below the monoclinic-to-orthorhombic-phase-transition temperature Tc ~ 250 K. Here, we report a… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: Accepted by Laser & Photonics Reviews

  20. arXiv:2410.12099  [pdf, ps, other

    nucl-ex

    The EMC Effect of Tritium and Helium-3 from the JLab MARATHON Experiment

    Authors: D. Abrams, H. Albataineh, B. S. Aljawrneh, S. Alsalmi, D. Androic, K. Aniol, W. Armstrong, J. Arrington, H. Atac, T. Averett, C. Ayerbe Gayoso, X. Bai, J. Bane, S. Barcus, A. Beck, V. Bellini, H. Bhatt, D. Bhetuwal, D. Biswas, D. Blyth, W. Boeglin, D. Bulumulla, J. Butler, A. Camsonne, M. Carmignotto , et al. (109 additional authors not shown)

    Abstract: Measurements of the EMC effect in the tritium and helium-3 mirror nuclei are reported. The data were obtained by the MARATHON Jefferson Lab experiment, which performed deep inelastic electron scattering from deuterium and the three-body nuclei, using a cryogenic gas target system and the High Resolution Spectrometers of the Hall A Facility of the Lab. The data cover the Bjorken $x$ range from 0.20… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2104.05850

  21. arXiv:2410.12075  [pdf, other

    cs.CV cs.AI

    WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation

    Authors: Chenghao Qian, Yuhu Guo, Yuhong Mo, Wenjing Li

    Abstract: In this work, we propose a novel approach, namely WeatherDG, that can generate realistic, weather-diverse, and driving-screen images based on the cooperation of two foundation models, i.e, Stable Diffusion (SD) and Large Language Model (LLM). Specifically, we first fine-tune the SD with source data, aligning the content and layout of generated samples with real-world driving scenarios. Then, we pr… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  22. arXiv:2410.11784  [pdf, other

    nlin.CD cs.DM nlin.PS

    Extending 1089 attractor to any number of digits and any number of steps

    Authors: Yannis Almirantis, Wentian Li

    Abstract: The well-known 1089 trick reflects an amazing trait of digital reversal process and reminisces of a limiting attractor in dynamical systems even though it takes only two steps. It is natural to consider the situations when the number of digits is beyond three as in the original 1089 trick, as well as situations when the number of steps is beyond two. The first part has been mostly done by Webster… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 1 figure

  23. arXiv:2410.11613  [pdf, other

    stat.ML math.NA

    Stochastic diagonal estimation with adaptive parameter selection

    Authors: Zongyuan Han, Wenhao Li, Shengxin Zhu

    Abstract: In this paper, we investigate diagonal estimation for large or implicit matrices, aiming to develop a novel and efficient stochastic algorithm that incorporates adaptive parameter selection. We explore the influence of different eigenvalue distributions on diagonal estimation and analyze the necessity of introducing the projection method and adaptive parameter optimization into the stochastic diag… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  24. arXiv:2410.11607  [pdf, other

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  25. arXiv:2410.11287  [pdf, other

    cs.CL cs.AI

    Process Reward Model with Q-Value Rankings

    Authors: Wendi Li, Yixuan Li

    Abstract: Process Reward Modeling (PRM) is critical for complex reasoning and decision-making tasks where the accuracy of intermediate steps significantly influences the overall outcome. Existing PRM approaches, primarily framed as classification problems, employ cross-entropy loss to independently evaluate each step's correctness. This method can lead to suboptimal reward distribution and does not adequate… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  26. arXiv:2410.11201  [pdf, other

    cs.CV cs.AI cs.LG

    Tree of Attributes Prompt Learning for Vision-Language Models

    Authors: Tong Ding, Wanhua Li, Zhongqi Miao, Hanspeter Pfister

    Abstract: Prompt learning has proven effective in adapting vision language models for downstream tasks. However, existing methods usually append learnable prompt tokens solely with the category names to obtain textual features, which fails to fully leverage the rich context indicated in the category name. To address this issue, we propose the Tree of Attributes Prompt learning (TAP), which first instructs L… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  27. KNN Transformer with Pyramid Prompts for Few-Shot Learning

    Authors: Wenhao Li, Qiangchang Wang, Peng Zhao, Yilong Yin

    Abstract: Few-Shot Learning (FSL) aims to recognize new classes with limited labeled data. Recent studies have attempted to address the challenge of rare samples with textual prompts to modulate visual features. However, they usually struggle to capture complex semantic relationships between textual and visual features. Moreover, vanilla self-attention is heavily affected by useless information in images, s… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 10 pages, 5 figures, accepted by ACM Multimedia 2024

    ACM Class: I.4.9

  28. arXiv:2410.10161  [pdf, other

    hep-th gr-qc

    Shear viscoelasticity in anisotropic holographic axion model

    Authors: Lei Li, Wei-Jia Li, Xiao-Mei Kuang

    Abstract: In this work, we investigate the shear viscoelasticity in a simple holographic axion model with broken translational symmetry and rotational symmetry in space via the perturbation computation. We find that, in the case of spontaneous symmetry breaking, the broken translations and anisotropy both enhance the shear elasticity of the system. While in all cases, they introduce a double suppression on… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  29. arXiv:2410.10140  [pdf, other

    cs.CV

    Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution

    Authors: Junbo Qiao, Jincheng Liao, Wei Li, Yulun Zhang, Yong Guo, Yi Wen, Zhangxizi Qiu, Jiao Xie, Jie Hu, Shaohui Lin

    Abstract: State Space Models (SSM), such as Mamba, have shown strong representation ability in modeling long-range dependency with linear complexity, achieving successful applications from high-level to low-level vision tasks. However, SSM's sequential nature necessitates multiple scans in different directions to compensate for the loss of spatial dependency when unfolding the image into a 1D sequence. This… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  30. arXiv:2410.09834  [pdf, other

    cs.CV eess.IV

    Towards Defining an Efficient and Expandable File Format for AI-Generated Contents

    Authors: Yixin Gao, Runsen Feng, Xin Li, Weiping Li, Zhibo Chen

    Abstract: Recently, AI-generated content (AIGC) has gained significant traction due to its powerful creation capability. However, the storage and transmission of large amounts of high-quality AIGC images inevitably pose new challenges for recent file formats. To overcome this, we define a new file format for AIGC images, named AIGIF, enabling ultra-low bitrate coding of AIGC images. Unlike compressing AIGC… ▽ More

    Submitted 15 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

  31. arXiv:2410.09732  [pdf, other

    cs.CV

    LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

    Authors: Junyan Ye, Baichuan Zhou, Zilong Huang, Junan Zhang, Tianyi Bai, Hengrui Kang, Jun He, Honglin Lin, Zihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li

    Abstract: With the rapid development of AI-generated content, the future internet may be inundated with synthetic data, making the discrimination of authentic and credible multimodal data increasingly challenging. Synthetic data detection has thus garnered widespread attention, and the performance of large multimodal models (LMMs) in this task has attracted significant interest. LMMs can provide natural lan… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 79 pages, 63 figures

  32. arXiv:2410.09677  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Phonon-Mediated Nonlinear Optical Responses and Quantum Geometry

    Authors: Jiaming Hu, Wenbin Li, Hua Wang, Kai Chang

    Abstract: Unraveling the complexities of nonlinear optical (NLO) responses, particularly the intricate many-body interactions among photons, electrons, and phonons, remains a significant challenge in condensed matter physics. Here, we present a diagrammatic approach to explore NLO responses with electron-phonon coupling (EPC), focusing on the phonon-mediated nonlinear optical (Ph-NLO) responses up to the se… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: 24 pages, 16 figures

  33. arXiv:2410.09614  [pdf, other

    q-bio.NC cs.CV cs.LG

    Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models

    Authors: Yule Wang, Chengrui Li, Weihan Li, Anqi Wu

    Abstract: Understanding the neural basis of behavior is a fundamental goal in neuroscience. Current research in large-scale neuro-behavioral data analysis often relies on decoding models, which quantify behavioral information in neural data but lack details on behavior encoding. This raises an intriguing scientific question: ``how can we enable in-depth exploration of neural representations in behavioral ta… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

  34. arXiv:2410.09181  [pdf, other

    cs.CR cs.AI cs.CL cs.CY cs.LG

    Can a large language model be a gaslighter?

    Authors: Wei Li, Luyao Zhu, Yang Song, Ruixi Lin, Rui Mao, Yang You

    Abstract: Large language models (LLMs) have gained human trust due to their capabilities and helpfulness. However, this in turn may allow LLMs to affect users' mindsets by manipulating language. It is termed as gaslighting, a psychological effect. In this work, we aim to investigate the vulnerability of LLMs under prompt-based and fine-tuning-based gaslighting attacks. Therefore, we propose a two-stage fram… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 10/26 (Main Body/Total), 8 figures

  35. arXiv:2410.08987  [pdf, other

    math.OC math.NA

    Gradient-adjusted underdamped Langevin dynamics for sampling

    Authors: Xinzhe Zuo, Stanley Osher, Wuchen Li

    Abstract: Sampling from a target distribution is a fundamental problem. Traditional Markov chain Monte Carlo (MCMC) algorithms, such as the unadjusted Langevin algorithm (ULA), derived from the overdamped Langevin dynamics, have been extensively studied. From an optimization perspective, the Kolmogorov forward equation of the overdamped Langevin dynamics can be treated as the gradient flow of the relative e… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  36. arXiv:2410.08622  [pdf, ps, other

    hep-ex

    Observation of time-dependent $CP$ violation and measurement of the branching fraction of $B^0 \to J/ψπ^0$ decays

    Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, J. Baudot, A. Baur, A. Beaubien, F. Becherer , et al. (369 additional authors not shown)

    Abstract: We present a measurement of the branching fraction and time-dependent charge-parity ($CP$) decay-rate asymmetries in $B^0 \to J/ψπ^0$ decays. The data sample was collected with the Belle~II detector at the SuperKEKB asymmetric $e^+e^-$ collider in 2019-2022 and contains $(387\pm 6)\times 10^6$ $B\overline{B}$ meson pairs from $Υ(4S)$ decays. We reconstruct $392\pm 24$ signal decays and fit the… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Report number: Belle II preprint: 2024-018, KEK preprint: 2024-14

  37. arXiv:2410.08603  [pdf, other

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773\,GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  38. arXiv:2410.08554  [pdf, other

    physics.optics physics.app-ph

    Integrated adaptive coherent LiDAR for 4D bionic vision

    Authors: Ruixuan Chen, Yichen Wu, Ke Zhang, Chuxin Liu, Yikun Chen, Wencan Li, Bitao Shen, Zhaoxi Chen, Hanke Feng, Zhangfeng Ge, Yan Zhou, Zihan Tao, Weihan Xu, Yimeng Wang, Pengfei Cai, Dong Pan, Haowen Shu, Linjie Zhou, Cheng Wang, Xingjun Wang

    Abstract: Light detection and ranging (LiDAR) is a ubiquitous tool to provide precise spatial awareness in various perception environments. A bionic LiDAR that can mimic human-like vision by adaptively gazing at selected regions of interest within a broad field of view is crucial to achieve high-resolution imaging in an energy-saving and cost-effective manner. However, current LiDARs based on stacking fixed… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  39. arXiv:2410.08257  [pdf, other

    cs.CV cs.GR cs.LG

    Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics

    Authors: Junyi Cao, Shanyan Guan, Yanhao Ge, Wei Li, Xiaokang Yang, Chao Ma

    Abstract: While humans effortlessly discern intrinsic dynamics and adapt to new scenarios, modern AI systems often struggle. Current methods for visual grounding of dynamics either use pure neural-network-based simulators (black box), which may violate physical laws, or traditional physical simulators (white box), which rely on expert-defined equations that may not fully capture actual dynamics. We propose… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024, the project page: https://xjay18.github.io/projects/neuma.html

  40. arXiv:2410.08192  [pdf, other

    cs.CV

    HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation

    Authors: Shanyan Guan, Yanhao Ge, Ying Tai, Jian Yang, Wei Li, Mingyu You

    Abstract: Recent advancements in text-to-image diffusion models have shown remarkable creative capabilities with textual prompts, but generating personalized instances based on specific subjects, known as subject-driven generation, remains challenging. To tackle this issue, we present a new hybrid framework called HybridBooth, which merges the benefits of optimization-based and direct-regression methods. Hy… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: ECCV 2024, the project page: https://sites.google.com/view/hybridbooth

  41. arXiv:2410.07701  [pdf, other

    cs.RO

    Autonomous Driving in Unstructured Environments: How Far Have We Come?

    Authors: Chen Min, Shubin Si, Xu Wang, Hanzhang Xue, Weizhong Jiang, Yang Liu, Juan Wang, Qingtian Zhu, Qi Zhu, Lun Luo, Fanjie Kong, Jinyu Miao, Xudong Cai, Shuai An, Wei Li, Jilin Mei, Tong Sun, Heng Zhai, Qifeng Liu, Fangzhou Zhao, Liang Chen, Shuai Wang, Erke Shang, Linzhi Shang, Kunlong Zhao , et al. (13 additional authors not shown)

    Abstract: Research on autonomous driving in unstructured outdoor environments is less advanced than in structured urban settings due to challenges like environmental diversities and scene complexity. These environments-such as rural areas and rugged terrains-pose unique obstacles that are not common in structured urban areas. Despite these difficulties, autonomous driving in unstructured outdoor environment… ▽ More

    Submitted 12 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: Survey paper; 38 pages

  42. arXiv:2410.07675  [pdf, other

    cs.LG cs.AI

    Adversarial Robustness Overestimation and Instability in TRADES

    Authors: Jonathan Weiping Li, Ren-Wei Liang, Cheng-Han Yeh, Cheng-Chang Tsai, Kuanchun Yu, Chun-Shien Lu, Shang-Tse Chen

    Abstract: This paper examines the phenomenon of probabilistic robustness overestimation in TRADES, a prominent adversarial training method. Our study reveals that TRADES sometimes yields disproportionately high PGD validation accuracy compared to the AutoAttack testing accuracy in the multiclass classification task. This discrepancy highlights a significant overestimation of robustness for these instances,… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  43. arXiv:2410.07626  [pdf, other

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  44. arXiv:2410.06916  [pdf, other

    cs.CL

    SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration

    Authors: Heming Xia, Yongqi Li, Jun Zhang, Cunxiao Du, Wenjie Li

    Abstract: Speculative decoding (SD) has emerged as a widely used paradigm to accelerate the inference of large language models (LLMs) without compromising generation quality. It works by first employing a compact model to draft multiple tokens efficiently and then using the target LLM to verify them in parallel. While this technique has achieved notable speedups, most existing approaches necessitate either… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  45. arXiv:2410.06894  [pdf, other

    cond-mat.stat-mech

    The global phase diagram of the cluster-XY spin chain with dissipation

    Authors: Wei-Lin Li, Ying-Ao Chen, Zheng-Xin Guo, Xue-Jia Yu, Zhi Li

    Abstract: We study the ground-state phase diagram of a non-Hermitian cluster-XY spin chain in the language of free fermions. By calculating the second derivative of ground-state energy density and various types of order parameters, we establish the global ground-state phase diagram of the model, exhibiting rich quantum phases and corresponding phase transitions. Specially, the results reveal that the non-He… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  46. arXiv:2410.06738  [pdf, other

    astro-ph.CO

    Optical and near-infrared photometry of 94 type II supernovae from the Carnegie Supernova Project

    Authors: J. P. Anderson, C. Contreras, M. D. Stritzinger, M. Hamuy, M. M. Phillips, N. B. Suntzeff, N. Morrell, S. Gonzalez-Gaitan, C. P. Gutierrez, C. R. Burns, E. Y. Hsiao, J. Anais, C. Ashall, C. Baltay, E. Baron, M. Bersten, L. Busta, S. Castellon, T. de Jaeger, D. DePoy, A. V. Filippenko, G. Folatelli, F. Forster, L. Galbany, C. Gall , et al. (21 additional authors not shown)

    Abstract: Type II supernovae (SNeII) mark the endpoint in the lives of hydrogen-rich massive stars. Their large explosion energies and luminosities allow us to measure distances, metallicities, and star formation rates into the distant Universe. To fully exploit their use in answering different astrophysical problems, high-quality low-redshift data sets are required. Such samples are vital to understand the… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: Accepted for publication in A&A. Photometric data will be uploaded to the CDS and the CSP website, and can also be requested from the first author

  47. arXiv:2410.06682  [pdf, other

    cs.CV cs.CL eess.IV

    Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization

    Authors: Changli Tang, Yixuan Li, Yudong Yang, Jimin Zhuang, Guangzhi Sun, Wei Li, Zujun Ma, Chao Zhang

    Abstract: Videos contain a wealth of information, and generating detailed and accurate descriptions in natural language is a key aspect of video understanding. In this paper, we present video-SALMONN 2, an advanced audio-visual large language model (LLM) with low-rank adaptation (LoRA) designed for enhanced video (with paired audio) captioning through directed preference optimization (DPO). We propose new m… ▽ More

    Submitted 10 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  48. arXiv:2410.06667  [pdf, other

    cs.CL cs.AI

    Large Language Models as Code Executors: An Exploratory Study

    Authors: Chenyang Lyu, Lecheng Yan, Rui Xing, Wenxi Li, Younes Samih, Tianbo Ji, Longyue Wang

    Abstract: The capabilities of Large Language Models (LLMs) have significantly evolved, extending from natural language processing to complex tasks like code understanding and generation. We expand the scope of LLMs' capabilities to a broader context, using LLMs to execute code snippets to obtain the output. This paper pioneers the exploration of LLMs as code executors, where code snippets are directly fed t… ▽ More

    Submitted 10 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  49. arXiv:2410.06638  [pdf, other

    cs.CL cs.AI

    Subtle Errors Matter: Preference Learning via Error-injected Self-editing

    Authors: Kaishuai Xu, Tiezheng Yu, Wenjun Hou, Yi Cheng, Chak Tou Leong, Liangyou Li, Xin Jiang, Lifeng Shang, Qun Liu, Wenjie Li

    Abstract: Large Language Models (LLMs) have exhibited strong mathematical reasoning and computational prowess, tackling tasks ranging from basic arithmetic to advanced competition-level problems. However, frequently occurring subtle errors, such as miscalculations or incorrect substitutions, limit the models' full mathematical potential. Existing studies to improve mathematical ability typically involve dis… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  50. arXiv:2410.06587  [pdf, other

    cs.CR

    Bots can Snoop: Uncovering and Mitigating Privacy Risks of Bots in Group Chats

    Authors: Kai-Hsiang Chou, Yi-Min Lin, Yi-An Wang, Jonathan Weiping Li, Tiffany Hyun-Jin Kim, Hsu-Chun Hsiao

    Abstract: New privacy concerns arise with chatbots on group messaging platforms. Chatbots may access information beyond their intended functionalities, such as messages unintended for chatbots or sender's identities. Chatbot operators may exploit such information to infer personal information and link users across groups, potentially leading to personal data breaches, pervasive tracking, and targeted advert… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 18 pages, 5 figures