Skip to main content

Showing 1–50 of 148 results for author: Fang, G

  1. arXiv:2409.17481  [pdf, other

    cs.AI cs.CL cs.LG

    MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

    Authors: Gongfan Fang, Hongxu Yin, Saurav Muralidharan, Greg Heinrich, Jeff Pool, Jan Kautz, Pavlo Molchanov, Xinchao Wang

    Abstract: Large Language Models (LLMs) are distinguished by their massive parameter counts, which typically result in significant redundancy. This work introduces MaskLLM, a learnable pruning method that establishes Semi-structured (or ``N:M'') Sparsity in LLMs, aimed at reducing computational overhead during inference. Instead of developing a new importance criterion, MaskLLM explicitly models N:M patterns… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: NeurIPS 2024 Spotlight

  2. Preparation for CSST: Star-galaxy Classification using a Rotationally Invariant Supervised Machine Learning Method

    Authors: Shiliang Zhang, Guanwen Fang, Jie Song, Ran Li, Yizhou Gu, Zesen Lin, Chichun Zhou, Yao Dai, Xu Kong

    Abstract: Most existing star-galaxy classifiers depend on the reduced information from catalogs, necessitating careful data processing and feature extraction. In this study, we employ a supervised machine learning method (GoogLeNet) to automatically classify stars and galaxies in the COSMOS field. Unlike traditional machine learning methods, we introduce several preprocessing techniques, including noise red… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: 11 pages, 9 figures, published in Research in Astronomy and Astrophysics, Volume 24, Number 9 (2024)

    Journal ref: Research in Astronomy and Astrophysics 24 (2024) 095012

  3. arXiv:2409.03807  [pdf, other

    cs.LG cs.GR

    Accelerate Neural Subspace-Based Reduced-Order Solver of Deformable Simulation by Lipschitz Optimization

    Authors: Aoran Lyu, Shixian Zhao, Chuhua Xian, Zhihao Cen, Hongmin Cai, Guoxin Fang

    Abstract: Reduced-order simulation is an emerging method for accelerating physical simulations with high DOFs, and recently developed neural-network-based methods with nonlinear subspaces have been proven effective in diverse applications as more concise subspaces can be detected. However, the complexity and landscape of simulation objectives within the subspace have not been optimized, which leaves room fo… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  4. arXiv:2408.16659  [pdf, other

    physics.med-ph cs.GR

    Motion-Driven Neural Optimizer for Prophylactic Braces Made by Distributed Microstructures

    Authors: Xingjian Han, Yu Jiang, Weiming Wang, Guoxin Fang, Simeon Gill, Zhiqiang Zhang, Shengfa Wang, Jun Saito, Deepak Kumar, Zhongxuan Luo, Emily Whiting, Charlie C. L. Wang

    Abstract: Joint injuries, and their long-term consequences, present a substantial global health burden. Wearable prophylactic braces are an attractive potential solution to reduce the incidence of joint injuries by limiting joint movements that are related to injury risk. Given human motion and ground reaction forces, we present a computational framework that enables the design of personalized braces by opt… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  5. arXiv:2408.09198  [pdf, other

    cs.RO

    Learning Based Toolpath Planner on Diverse Graphs for 3D Printing

    Authors: Yuming Huang, Yuhu Guo, Renbo Su, Xingjian Han, Junhao Ding, Tianyu Zhang, Tao Liu, Weiming Wang, Guoxin Fang, Xu Song, Emily Whiting, Charlie C. L. Wang

    Abstract: This paper presents a learning based planner for computing optimized 3D printing toolpaths on prescribed graphs, the challenges of which include the varying graph structures on different models and the large scale of nodes & edges on a graph. We adopt an on-the-fly strategy to tackle these challenges, formulating the planner as a Deep Q-Network (DQN) based optimizer to decide the next `best' node… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  6. arXiv:2407.10468  [pdf, other

    cs.SD cs.AI eess.AS

    LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis

    Authors: Zhenxiong Tan, Xinyin Ma, Gongfan Fang, Xinchao Wang

    Abstract: Latent diffusion models have shown promising results in audio generation, making notable advancements over traditional methods. However, their performance, while impressive with short audio clips, faces challenges when extended to longer audio sequences. These challenges are due to model's self-attention mechanism and training predominantly on 10-second clips, which complicates the extension to lo… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Interspeech 2024; Code: https://github.com/Yuanshi9815/LiteFocus

  7. arXiv:2407.06937  [pdf, other

    cs.CV

    HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance

    Authors: Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang

    Abstract: Text-to-image diffusion models have significantly advanced in conditional image generation. However, these models usually struggle with accurately rendering images featuring humans, resulting in distorted limbs and other anomalies. This issue primarily stems from the insufficient recognition and evaluation of limb qualities in diffusion models. To address this issue, we introduce AbHuman, the firs… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  8. arXiv:2407.04616  [pdf, other

    cs.CV cs.AI cs.LG

    Isomorphic Pruning for Vision Models

    Authors: Gongfan Fang, Xinyin Ma, Michael Bi Mi, Xinchao Wang

    Abstract: Structured pruning reduces the computational overhead of deep neural networks by removing redundant sub-structures. However, assessing the relative importance of different sub-structures remains a significant challenge, particularly in advanced vision models featuring novel mechanisms and architectures like self-attention, depth-wise convolutions, or residual connections. These heterogeneous subst… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  9. arXiv:2406.12315  [pdf, other

    cs.AI

    PruningBench: A Comprehensive Benchmark of Structural Pruning

    Authors: Haoling Li, Changhao Li, Mengqi Xue, Gongfan Fang, Sheng Zhou, Zunlei Feng, Huiqiong Wang, Yong Wang, Lechao Cheng, Mingli Song, Jie Song

    Abstract: Structural pruning has emerged as a promising approach for producing more efficient models. Nevertheless, the community suffers from a lack of standardized benchmarks and metrics, leaving the progress in this area not fully comprehended. To fill this gap, we present the first comprehensive benchmark, termed \textit{PruningBench}, for structural pruning. PruningBench showcases the following three c… ▽ More

    Submitted 20 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: This is a paper aims to present a evaluation benchmark for structural pruning. The full text is 30 pages

  10. arXiv:2406.06911  [pdf, other

    cs.CV cs.AI

    AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

    Authors: Zigeng Chen, Xinyin Ma, Gongfan Fang, Zhenxiong Tan, Xinchao Wang

    Abstract: Diffusion models have garnered significant interest from the community for their great generative ability across various applications. However, their typical multi-step sequential-denoising nature gives rise to high cumulative latency, thereby precluding the possibilities of parallel computation. To address this, we introduce AsyncDiff, a universal and plug-and-play acceleration scheme that enable… ▽ More

    Submitted 26 September, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted by NeurIPS 2024

  11. arXiv:2406.06861  [pdf, other

    hep-ph

    Exploring flavour space of an economical SU(5) GUT in future proton decay measurements

    Authors: Gao-Xiang Fang, Ye-Ling Zhou

    Abstract: We discuss the potential of future proton decay experiments on the exploration of the flavour space of grand unification. We focus on an economical SU(5) grand unified model (GUT) with the fermion sector extended by including only one copy of 24-plet. Neutrino masses are generated via type-(I+III) seesaw mechanism with the lightest neutrino massless. Gauge unification requires masses of fermions i… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 21 pages, 8 figures

  12. arXiv:2406.01733  [pdf, other

    cs.LG cs.CV

    Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching

    Authors: Xinyin Ma, Gongfan Fang, Michael Bi Mi, Xinchao Wang

    Abstract: Diffusion Transformers have recently demonstrated unprecedented generative capabilities for various tasks. The encouraging results, however, come with the cost of slow inference, since each denoising step requires inference on a transformer model with a large scale of parameters. In this study, we make an interesting and somehow surprising observation: the computation of a large proportion of laye… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Code is available at https://github.com/horseee/learning-to-cache

  13. arXiv:2406.00630  [pdf, other

    stat.ML cs.LG

    On Non-asymptotic Theory of Recurrent Neural Networks in Temporal Point Processes

    Authors: Zhiheng Chen, Guanhua Fang, Wen Yu

    Abstract: Temporal point process (TPP) is an important tool for modeling and predicting irregularly timed events across various domains. Recently, the recurrent neural network (RNN)-based TPPs have shown practical advantages over traditional parametric TPP models. However, in the current literature, it remains nascent in understanding neural TPPs from theoretical viewpoints. In this paper, we establish the… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  14. arXiv:2405.17828  [pdf, other

    stat.ME

    On Robust Clustering of Temporal Point Process

    Authors: Yuecheng Zhang, Guanhua Fang, Wen Yu

    Abstract: Clustering of event stream data is of great importance in many application scenarios, including but not limited to, e-commerce, electronic health, online testing, mobile music service, etc. Existing clustering algorithms fail to take outlier data into consideration and are implemented without theoretical guarantees. In this paper, we propose a robust temporal point processes clustering framework w… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  15. arXiv:2405.08935  [pdf, other

    cs.RO

    Function based sim-to-real learning for shape control of deformable free-form surfaces

    Authors: Yingjun Tian, Guoxin Fang, Renbo Su, Weiming Wang, Simeon Gill, Andrew Weightman, Charlie C. L. Wang

    Abstract: For the shape control of deformable free-form surfaces, simulation plays a crucial role in establishing the mapping between the actuation parameters and the deformed shapes. The differentiation of this forward kinematic mapping is usually employed to solve the inverse kinematic problem for determining the actuation parameters that can realize a target shape. However, the free-form surfaces obtaine… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  16. arXiv:2405.01882  [pdf, other

    cs.RO cs.AI eess.SP

    Millimeter Wave Radar-based Human Activity Recognition for Healthcare Monitoring Robot

    Authors: Zhanzhong Gu, Xiangjian He, Gengfa Fang, Chengpei Xu, Feng Xia, Wenjing Jia

    Abstract: Healthcare monitoring is crucial, especially for the daily care of elderly individuals living alone. It can detect dangerous occurrences, such as falls, and provide timely alerts to save lives. Non-invasive millimeter wave (mmWave) radar-based healthcare monitoring systems using advanced human activity recognition (HAR) models have recently gained significant attention. However, they encounter cha… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  17. arXiv:2404.15701  [pdf, other

    astro-ph.GA

    USmorph: An Updated Framework of Automatic Classification of Galaxy Morphologies and Its Application to Galaxies in the COSMOS Field

    Authors: Jie Song, GuanWen Fang, Shuo Ba, Zesen Lin, Yizhou Gu, Chichun Zhou, Tao Wang, Cai-Na Hao, Guilin Liu, Hongxin Zhang, Yao Yao, Xu Kong

    Abstract: Morphological classification conveys abundant information on the formation, evolution, and environment of galaxies. In this work, we refine the two-step galaxy morphological classification framework ({\tt\string USmorph}), which employs a combination of unsupervised machine learning (UML) and supervised machine learning (SML) techniques, along with a self-consistent and robust data preprocessing s… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted by ApJS, 16 pages, 12 figures

  18. arXiv:2404.06103  [pdf, other

    cs.SD cs.IR eess.AS

    Exploring Diverse Sounds: Identifying Outliers in a Music Corpus

    Authors: Le Cai, Sam Ferguson, Gengfa Fang, Hani Alshamrani

    Abstract: Existing research on music recommendation systems primarily focuses on recommending similar music, thereby often neglecting diverse and distinctive musical recordings. Musical outliers can provide valuable insights due to the inherent diversity of music itself. In this paper, we explore music outliers, investigating their potential usefulness for music discovery and recommendation systems. We argu… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Journal ref: The 16th International Symposium on Computer Music Multidisciplinary Research,2023

  19. arXiv:2403.14166  [pdf, other

    cs.CV

    Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians

    Authors: Guangchi Fang, Bing Wang

    Abstract: In this study, we explore the challenge of efficiently representing scenes with a constrained number of Gaussians. Our analysis shifts from traditional graphics and 2D computer vision to the perspective of point clouds, highlighting the inefficient spatial distribution of Gaussian representation as a key limitation in model performance. To address this, we introduce strategies for densification in… ▽ More

    Submitted 16 October, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  20. arXiv:2403.11236  [pdf, other

    cs.CL

    ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization

    Authors: Mengsha Liu, Daoyuan Chen, Yaliang Li, Guian Fang, Ying Shen

    Abstract: Data visualization serves as a critical means for presenting data and mining its valuable insights. The task of chart summarization, through natural language processing techniques, facilitates in-depth data analysis of charts. However, there still are notable deficiencies in terms of visual-language matching and reasoning ability for existing approaches. To address these limitations, this study co… ▽ More

    Submitted 24 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  21. arXiv:2402.17126  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph physics.optics

    Design principles of nonlinear optical materials for Terahertz lasers

    Authors: Juan Han, Yiwei Sun, Xiamin Huang, Wenjun Shuai, Guangyou Fang, Zhou Li

    Abstract: We have investigated both inter-band and intra-band second order nonlinear optical conductivity based on the velocity correlation formalism and the spectral expansion technique. We propose a scenario in which the second order intra-band process is nonzero while the inter-band process is zero. This occurs for a band structure with momentum asymmetry in the Brillouin zone. Very low-energy photons ar… ▽ More

    Submitted 4 July, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 11 pages, 10 figures

  22. arXiv:2402.04033  [pdf, other

    cs.LG

    On provable privacy vulnerabilities of graph representations

    Authors: Ruofan Wu, Guanhua Fang, Qiying Pan, Mingyang Zhang, Tengfei Liu, Weiqiang Wang

    Abstract: Graph representation learning (GRL) is critical for extracting insights from complex network structures, but it also raises security concerns due to potential privacy vulnerabilities in these representations. This paper investigates the structural vulnerabilities in graph neural models where sensitive topological information can be inferred through edge reconstruction attacks. Our research primari… ▽ More

    Submitted 23 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  23. arXiv:2401.11106  [pdf, ps, other

    hep-ph hep-ex

    Study on the mixing of $Ξ_c$ and $Ξ'_c$ by the transition $Ξ_{b}\toΞ^{(')}_c$

    Authors: Hong-Wei Ke, Gang-Yang Fang, Yan-Liang Shi

    Abstract: Recently, the LHCb collaboration has observed the decays $Ξ^0_{b}\toΞ^+_{c}D^-_s$ and $Ξ^-_{b}\toΞ^0_{c}D^-_s$. They measured the relative branching fractions times the ratio of beauty-baryon production cross-sections $\mathcal{R}(\frac{Ξ^0_b}{Λ_b})\equiv\frac{σ(Ξ_b^0)}{σ(Λ^0_b)}\times\frac{B(Ξ^0_{b}\toΞ^+_{c}D^-_s)}{B(Λ^0_{b}\toΛ^+_{c}D^-_s)}$ and… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 12 pages, 4 figures and 8 tables

  24. arXiv:2401.07781  [pdf, other

    cs.CV

    Towards A Better Metric for Text-to-Video Generation

    Authors: Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

    Abstract: Generative models have demonstrated remarkable capability in synthesizing high-quality text, images, and videos. For video generation, contemporary text-to-video models exhibit impressive capabilities, crafting visually stunning videos. Nonetheless, evaluating such videos poses significant challenges. Current research predominantly employs automated metrics such as FVD, IS, and CLIP Score. However… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Project page: https://showlab.github.io/T2VScore/

  25. arXiv:2312.06345  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM

    The Hubble Deep Hydrogen Alpha (HDH$α$) Project: I. Catalog of Emission-line Galaxies

    Authors: Shuairu Zhu, Zhen-Ya Zheng, James Rhoads, Junxian Wang, Linhua Jiang, Chunyan Jiang, Fang-Ting Yuan, P. T. Rahna, Weida Hu, Ruqiu Lin, Huanyuan Shan, Chun Xu, Leopoldo Infante, L. Felipe Barrientos, Xianzhong Zheng, Guanwen Fang, Zhixiong Liang

    Abstract: We present the first results of the Hubble Deep Hydrogen Alpha (HDH$α$) project, which analyzes the space-borne deep H$α$ narrowband imaging data in the GOODS-S region. The HDH$α$ data comprises 72 orbits' images taken with the HST ACS/WFC F658N filter. The exposure time varies across a total area of $\sim$76.1 $\rm{arcmin}^2$, adding up to a total exposure time of 195.7 ks, among which 68.8 ks ar… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 27 pages, 14 figures, 9 tables, accepted by ApJS

  26. arXiv:2312.05284  [pdf, other

    cs.CV

    SlimSAM: 0.1% Data Makes Segment Anything Slim

    Authors: Zigeng Chen, Gongfan Fang, Xinyin Ma, Xinchao Wang

    Abstract: Current approaches for compressing the Segment Anything Model (SAM) yield commendable results, yet necessitate extensive data to train a new network from scratch. Employing conventional pruning techniques can remarkably reduce data requirements but would suffer from a degradation in performance. To address this challenging trade-off, we introduce SlimSAM, a novel data-efficient SAM compression met… ▽ More

    Submitted 26 September, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted by NeurIPS 2024

  27. arXiv:2312.00858  [pdf, other

    cs.CV cs.AI

    DeepCache: Accelerating Diffusion Models for Free

    Authors: Xinyin Ma, Gongfan Fang, Xinchao Wang

    Abstract: Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities. Notwithstanding their prowess, these models often incur substantial computational costs, primarily attributed to the sequential denoising process and cumbersome model size. Traditional methods for compressing diffusion models typically involve extensive ret… ▽ More

    Submitted 7 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Work in progress. Project Page: https://horseee.github.io/Diffusion_DeepCache/

  28. arXiv:2311.17265  [pdf, other

    cs.CG

    Exceptional Mechanical Performance by Spatial Printing with Continuous Fiber: Curved Slicing, Toolpath Generation and Physical Verification

    Authors: Guoxin Fang, Tianyu Zhang, Yuming Huang, Zhizhou Zhang, Kunal Masania, Charlie C. L. Wang

    Abstract: This work explores a spatial printing method to fabricate continuous fiber-reinforced thermoplastic composites (CFRTPCs), which can achieve exceptional mechanical performance. For models giving complex 3D stress distribution under loads, typical planar-layer based fiber placement usually fails to provide sufficient reinforcement due to their orientations being constrained to planes. The effectiven… ▽ More

    Submitted 25 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  29. arXiv:2311.12842  [pdf, other

    eess.IV cs.CV

    Multimodal Identification of Alzheimer's Disease: A Review

    Authors: Guian Fang, Mengsha Liu, Yi Zhong, Zhuolin Zhang, Jiehui Huang, Zhenchao Tang, Calvin Yu-Chian Chen

    Abstract: Alzheimer's disease is a progressive neurological disorder characterized by cognitive impairment and memory loss. With the increasing aging population, the incidence of AD is continuously rising, making early diagnosis and intervention an urgent need. In recent years, a considerable number of teams have applied computer-aided diagnostic techniques to early classification research of AD. Most studi… ▽ More

    Submitted 6 October, 2023; originally announced November 2023.

  30. arXiv:2311.08791  [pdf, other

    cs.DC

    A Direct Approach for Solving Cloud Computing Task Assignment with Soft Deadlines

    Authors: Guang Fang, Yuxiang Zhao

    Abstract: Job scheduling in cloud computing environments is a critical yet complex problem. Cloud computing user job requirements are highly dynamic and uncertain, while cloud computing resources are heterogeneous and constrained. This paper studies the online resource allocation problem for elastic computing jobs with soft deadlines in cloud computing environments. The main contributions include: 1) Intege… ▽ More

    Submitted 22 December, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 17 pages,8 figures

  31. arXiv:2310.17524  [pdf, other

    physics.optics physics.app-ph

    Real-space sampling of terahertz waveforms under scanning tunneling microscope

    Authors: Hongbo Li, Tianwu Wang, Wenyin Wei, Kai Zhang, Jing-yin Xu, Yirong Wu, Guangyou Fang

    Abstract: Terahertz scanning tunneling microscopy (THz-STM) has emerged as a potent technique for probing ultrafast nanoscale dynamics with exceptional spatiotemporal precision, whereby the acquisition of THz near-field waveforms holds paramount significance. While substantial efforts have been dedicated to retrieving the waveform utilizing the photoemission current or a molecular sensor, these methods are… ▽ More

    Submitted 20 March, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 23 pages and 4 figures for the manuscript; 15 pages and 7 figures for the Supporting Information

  32. Solution to the conflict between the resolved and unresolved galaxy stellar mass estimation from the perspective of JWST

    Authors: Jie Song, GuanWen Fang, Zesen Lin, Yizhou Gu, Xu Kong

    Abstract: By utilizing the spatially-resolved photometry of galaxies at $0.2<z<3.0$ in the CEERS field, we estimate the resolved and unresolved stellar mass via spectral energy distribution (SED) fitting to study the discrepancy between them. We first compare $M_{\ast}$ derived from photometry with and without the JWST wavelength coverage and find that $M_{\ast}$ can be overestimated by up to 0.2 dex when l… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures, accepted by ApJ

  33. arXiv:2310.06311  [pdf, other

    cs.CV cs.MM

    Improving Compositional Text-to-image Generation with Large Vision-Language Models

    Authors: Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris Metaxas

    Abstract: Recent advancements in text-to-image models, particularly diffusion models, have shown significant promise. However, compositional text-to-image models frequently encounter difficulties in generating high-quality images that accurately align with input texts describing multiple objects, variable attributes, and intricate spatial relationships. To address this limitation, we employ large vision-lan… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  34. arXiv:2309.14279  [pdf, other

    cs.RO

    Spring-IMU Fusion Based Proprioception for Feedback Control of Soft Manipulators

    Authors: Yinan Meng, Guoxin Fang, Jiong Yang, Yuhu Guo, Charlie C. L. Wang

    Abstract: This paper presents a novel framework to realize proprioception and closed-loop control for soft manipulators. Deformations with large elongation and large bending can be precisely predicted using geometry-based sensor signals obtained from the inductive springs and the inertial measurement units (IMUs) with the help of machine learning techniques. Multiple geometric signals are fused into robust… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  35. arXiv:2309.03818  [pdf, other

    stat.ML cs.LG

    Empirical Risk Minimization for Losses without Variance

    Authors: Guanhua Fang, Ping Li, Gennady Samorodnitsky

    Abstract: This paper considers an empirical risk minimization problem under heavy-tailed settings, where data does not have finite variance, but only has $p$-th moment with $p \in (1,2)$. Instead of using estimation procedure based on truncated observed data, we choose the optimizer by minimizing the risk value. Those risk values can be robustly estimated via using the remarkable Catoni's method (Catoni, 20… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  36. arXiv:2308.01537  [pdf, other

    cs.MM

    Learning Causality-inspired Representation Consistency for Video Anomaly Detection

    Authors: Yang Liu, Zhaoyang Xia, Mengyang Zhao, Donglai Wei, Yuzheng Wang, Liu Siao, Bobo Ju, Gaoyun Fang, Jing Liu, Liang Song

    Abstract: Video anomaly detection is an essential yet challenging task in the multimedia community, with promising applications in smart cities and secure communities. Existing methods attempt to learn abstract representations of regular events with statistical dependence to model the endogenous normality, which discriminates anomalies by measuring the deviations to the learned distribution. However, conven… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM MM 2023

  37. arXiv:2308.00549  [pdf, other

    cs.LG cs.AI cs.CV

    Copula for Instance-wise Feature Selection and Ranking

    Authors: Hanyu Peng, Guanhua Fang, Ping Li

    Abstract: Instance-wise feature selection and ranking methods can achieve a good selection of task-friendly features for each sample in the context of neural networks. However, existing approaches that assume feature subsets to be independent are imperfect when considering the dependency between features. To address this limitation, we propose to incorporate the Gaussian copula, a powerful mathematical tech… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 15 pages, UAI poster

  38. arXiv:2308.00472  [pdf, other

    cs.CG

    Vector Field Based Volume Peeling for Multi-Axis Machining

    Authors: Neelotpal Dutta, Tianyu Zhang, Guoxin Fang, Ismail E. Yigit, Charlie C. L. Wang

    Abstract: This paper presents an easy-to-control volume peeling method for multi-axis machining based on the computation taken on vector fields. The current scalar field based methods are not flexible and the vector-field based methods do not guarantee the satisfaction of the constraints in the final results. We first conduct an optimization formulation to compute an initial vector field that is well aligne… ▽ More

    Submitted 4 October, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: ASME Journal of Computing and Information Science in Engineering, accepted, October 2023

  39. arXiv:2307.13975  [pdf, other

    astro-ph.GA

    Evolution of Non-parametric Morphology of Galaxies in the JWST CEERS Field at $z\simeq$0.8-3.0

    Authors: Yao Yao, Jie Song, Xu Kong, Guanwen Fang, Hong-Xin Zhang, Xinkai Chen

    Abstract: Galaxy morphology is one of the most fundamental ways to describe galaxy properties, but the morphology we observe may be affected by wavelength and spatial resolution, which may introduce systematic bias when comparing galaxies at different redshift. Taking advantage of the broad wavelength coverage from optical to near-IR and high resolution NIRCam instrument of JWST, we measure the non-parametr… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 11 pages, 5 figures, accepted in ApJ

  40. arXiv:2307.02335  [pdf, other

    astro-ph.GA

    The Classification of Galaxy Morphology in H-band of COSMOS-DASH Field: a combination-based machine learning clustering model

    Authors: Yao Dai, Jun Xu, Jie Song, Guanwen Fang, Chichun Zhou, Shuo Ba, Yizhou Gu, Zesen Lin, Xu Kong

    Abstract: By applying our previously developed two-step scheme for galaxy morphology classification, we present a catalog of galaxy morphology for H-band selected massive galaxies in the COSMOS-DASH field, which includes 17292 galaxies with stellar mass $M_{\star}>10^{10}~M_{\odot}$ at $0.5<z<2.5$. The classification scheme is designed to provide a complete morphology classification for galaxies via a combi… ▽ More

    Submitted 6 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 13 pages, 10 figures, accepted by ApJS

  41. A comparison of cosmological models with high-redshift quasars

    Authors: Liuyuan Fan, Guanwen Fang, Jian Hu

    Abstract: The non-linear relationship between the monochromatic X-ray and UV luminosities in quasars offers the possibility of using high-z quasars as standard candles for cosmological testing. In this paper, we use a high-quality catalog of 1598 quasars extending to redshift 6, to compare the flat and uniformly expanding cosmological model, $R_h$ = ct and $Λ$CDM cosmological models which are the most debat… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 10 pages, 5 figures, Accepted for publication in APSS

  42. arXiv:2306.07223  [pdf, other

    cs.CY

    Wise in Vaccine Allocation

    Authors: Baiqiao Yin, Jiaqing Yuan, Weichen Lv, Jiehui Huang, Guian Fang

    Abstract: The paper uses machine learning and mathematical modeling to predict future vaccine distribution and solve the problem of allocating vaccines to different types of hospitals. They collected data and analyzed it, finding factors such as nearby residents, transportation, and medical personnel that impact distribution. They used the results to create a model and allocate vaccines to central and commu… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  43. arXiv:2306.07201  [pdf, other

    cs.CL

    LTCR: Long-Text Chinese Rumor Detection Dataset

    Authors: Ziyang Ma, Mengsha Liu, Guian Fang, Ying Shen

    Abstract: False information can spread quickly on social media, negatively influencing the citizens' behaviors and responses to social events. To better detect all of the fake news, especially long texts which are harder to find completely, a Long-Text Chinese Rumor detection dataset named LTCR is proposed. The LTCR dataset provides a valuable resource for accurately detecting misinformation, especially in… ▽ More

    Submitted 13 June, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

  44. arXiv:2306.04902  [pdf, other

    cs.DS cs.LG math.ST

    A Cover Time Study of a non-Markovian Algorithm

    Authors: Guanhua Fang, Gennady Samorodnitsky, Zhiqiang Xu

    Abstract: Given a traversal algorithm, cover time is the expected number of steps needed to visit all nodes in a given graph. A smaller cover time means a higher exploration efficiency of traversal algorithm. Although random walk algorithms have been studied extensively in the existing literature, there has been no cover time result for any non-Markovian method. In this work, we stand on a theoretical persp… ▽ More

    Submitted 11 August, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 25 pages

  45. arXiv:2305.19599  [pdf, other

    cs.CV cs.AI

    RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment

    Authors: Guian Fang, Zutao Jiang, Jianhua Han, Guansong Lu, Hang Xu, Shengcai Liao, Xiaodan Liang

    Abstract: Recent advances in text-to-image diffusion models have achieved remarkable success in generating high-quality, realistic images from textual descriptions. However, these approaches have faced challenges in precisely aligning the generated visual content with the textual concepts described in the prompts. In this paper, we propose a two-stage coarse-to-fine semantic re-alignment method, named Reali… ▽ More

    Submitted 27 November, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  46. arXiv:2305.11627  [pdf, other

    cs.CL

    LLM-Pruner: On the Structural Pruning of Large Language Models

    Authors: Xinyin Ma, Gongfan Fang, Xinchao Wang

    Abstract: Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges in both the deployment, inference, and training stages. With LLM being a general-purpose task solver, we explore its compression in a task-agnostic manner, which aims to pre… ▽ More

    Submitted 27 September, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  47. arXiv:2305.10924  [pdf, other

    cs.LG cs.AI cs.CV

    Structural Pruning for Diffusion Models

    Authors: Gongfan Fang, Xinyin Ma, Xinchao Wang

    Abstract: Generative modeling has recently undergone remarkable advancements, primarily propelled by the transformative implications of Diffusion Probabilistic Models (DPMs). The impressive capability of these models, however, often entails significant computational overhead during both training and inference. To tackle this challenge, we present Diff-Pruning, an efficient compression method tailored for le… ▽ More

    Submitted 30 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Preprint version

  48. The effect of environment on the properties of the most massive galaxies at $0.5<z<2.5$ in the cosmos-dash field

    Authors: Jie Song, Guanwen Fang, Yizhou Gu, Zesen Lin, Xu Kong

    Abstract: How the environment influences the most massive galaxies is still unclear. To explore the environmental effects on morphology and star formation in the most massive galaxies at high redshift, we select galaxies with stellar mass $\log(M_{\star}/M_{\odot})>11$ at $0.5<z<2.5$ in the COSMOS-DASH field, which is the largest field with near-infrared photometrical observations using HST/WFC3 to date. Co… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 13 pages, 7 figures, To be published in ApJ

  49. arXiv:2304.11633  [pdf, other

    cs.CL

    Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

    Authors: Bo Li, Gexiang Fang, Yang Yang, Quansen Wang, Wei Ye, Wen Zhao, Shikun Zhang

    Abstract: The capability of Large Language Models (LLMs) like ChatGPT to comprehend user intent and provide reasonable responses has made them extremely popular lately. In this paper, we focus on assessing the overall ability of ChatGPT using 7 fine-grained information extraction (IE) tasks. Specially, we present the systematically analysis by measuring ChatGPT's performance, explainability, calibration, an… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  50. arXiv:2302.12007  [pdf, other

    cs.CV

    DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition

    Authors: Shannan Guan, Xin Yu, Wei Huang, Gengfa Fang, Haiyan Lu

    Abstract: In this work, we propose a new Dual Min-Max Games (DMMG) based self-supervised skeleton action recognition method by augmenting unlabeled data in a contrastive learning framework. Our DMMG consists of a viewpoint variation min-max game and an edge perturbation min-max game. These two min-max games adopt an adversarial paradigm to perform data augmentation on the skeleton sequences and graph-struct… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.