Skip to main content

Showing 1–50 of 128 results for author: Xia, A

  1. arXiv:2410.16602  [pdf, other

    cs.CV

    Foundation Models for Remote Sensing and Earth Observation: A Survey

    Authors: Aoran Xiao, Weihao Xuan, Junjue Wang, Jiaxing Huang, Dacheng Tao, Shijian Lu, Naoto Yokoya

    Abstract: Remote Sensing (RS) is a crucial technology for observing, monitoring, and interpreting our planet, with broad applications across geoscience, economics, humanitarian fields, etc. While artificial intelligence (AI), particularly deep learning, has achieved significant advances in RS, unique challenges persist in developing more intelligent RS systems, including the complexity of Earth's environmen… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2410.05326  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Early-Cycle Internal Impedance Enables ML-Based Battery Cycle Life Predictions Across Manufacturers

    Authors: Tyler Sours, Shivang Agarwal, Marc Cormier, Jordan Crivelli-Decker, Steffen Ridderbusch, Stephen L. Glazier, Connor P. Aiken, Aayush R. Singh, Ang Xiao, Omar Allam

    Abstract: Predicting the end-of-life (EOL) of lithium-ion batteries across different manufacturers presents significant challenges due to variations in electrode materials, manufacturing processes, cell formats, and a lack of generally available data. Methods that construct features solely on voltage-capacity profile data typically fail to generalize across cell chemistries. This study introduces a methodol… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: 17 pages, 7 figures

  3. arXiv:2409.20548  [pdf, other

    cs.RO cs.AI cs.HC

    Robi Butler: Remote Multimodal Interactions with Household Robot Assistant

    Authors: Anxing Xiao, Nuwan Janaka, Tianrun Hu, Anshul Gupta, Kaixin Li, Cunjun Yu, David Hsu

    Abstract: In this paper, we introduce Robi Butler, a novel household robotic system that enables multimodal interactions with remote users. Building on the advanced communication interfaces, Robi Butler allows users to monitor the robot's status, send text or voice instructions, and select target objects by hand pointing. At the core of our system is a high-level behavior module, powered by Large Language M… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

  4. arXiv:2409.18084  [pdf, other

    cs.RO cs.AI

    GSON: A Group-based Social Navigation Framework with Large Multimodal Model

    Authors: Shangyi Luo, Ji Zhu, Peng Sun, Yuhong Deng, Cunjun Yu, Anxing Xiao, Xueqian Wang

    Abstract: As the number of service robots and autonomous vehicles in human-centered environments grows, their requirements go beyond simply navigating to a destination. They must also take into account dynamic social contexts and ensure respect and comfort for others in shared spaces, which poses significant challenges for perception and planning. In this paper, we present a group-based social navigation fr… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  5. arXiv:2409.17604  [pdf, other

    cs.LG

    RmGPT: Rotating Machinery Generative Pretrained Model

    Authors: Yilin Wang, Yifei Yu, Kong Sun, Peixuan Lei, Yuxuan Zhang, Enrico Zio, Aiguo Xia, Yuanxiang Li

    Abstract: In industry, the reliability of rotating machinery is critical for production efficiency and safety. Current methods of Prognostics and Health Management (PHM) often rely on task-specific models, which face significant challenges in handling diverse datasets with varying signal characteristics, fault modes and operating conditions. Inspired by advancements in generative pretrained models, we propo… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  6. arXiv:2409.02244  [pdf, other

    cs.HC cs.CL

    Therapy as an NLP Task: Psychologists' Comparison of LLMs and Human Peers in CBT

    Authors: Zainab Iftikhar, Sean Ransom, Amy Xiao, Jeff Huang

    Abstract: Wider access to therapeutic care is one of the biggest challenges in mental health treatment. Due to institutional barriers, some people seeking mental health support have turned to large language models (LLMs) for personalized therapy, even though these models are largely unsanctioned and untested. We investigate the potential and limitations of using LLMs as providers of evidence-based therapy b… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    ACM Class: I.2.7; J.4

  7. arXiv:2409.01491  [pdf, other

    cs.CV cs.AI

    EarthGen: Generating the World from Top-Down Views

    Authors: Ansh Sharma, Albert Xiao, Praneet Rathi, Rohit Kundu, Albert Zhai, Yuan Shen, Shenlong Wang

    Abstract: In this work, we present a novel method for extensive multi-scale generative terrain modeling. At the core of our model is a cascade of superresolution diffusion models that can be combined to produce consistent images across multiple resolutions. Pairing this concept with a tiled generation method yields a scalable system that can generate thousands of square kilometers of realistic Earth surface… ▽ More

    Submitted 7 September, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    ACM Class: J.2; I.4.8

  8. arXiv:2408.09085  [pdf, other

    cs.CV

    Segment Anything with Multiple Modalities

    Authors: Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Naoto Yokoya, Shijian Lu

    Abstract: Robust and accurate segmentation of scenes has become one core functionality in various visual recognition and navigation tasks. This has inspired the recent development of Segment Anything Model (SAM), a foundation model for general mask segmentation. However, SAM is largely tailored for single-modal RGB images, limiting its applicability to multi-modal data captured with widely-adopted sensor su… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: Project page: https://xiaoaoran.github.io/projects/MM-SAM

  9. arXiv:2406.16112  [pdf, ps, other

    math.NA

    Greedy randomized Bregman-Kaczmarz method for constrained nonlinear systems of equations

    Authors: Aqin Xiao, Junfeng Yin

    Abstract: A greedy randomized nonlinear Bregman-Kaczmarz method by sampling the working index with residual information is developed for the solution of the constrained nonlinear system of equations. Theoretical analyses prove the convergence of the greedy randomized nonlinear Bregman-Kaczmarz method and its relaxed version. Numerical experiments verify the effectiveness of the proposed method,which converg… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  10. arXiv:2406.09813  [pdf, other

    astro-ph.IM astro-ph.HE

    Diffuse X-ray Explorer: a high-resolution X-ray spectroscopic sky surveyor on the China Space Station

    Authors: Hai Jin, Junjie Mao, Liubiao Chen, Naihui Chen, Wei Cui, Bo Gao, Jinjin Li, Xinfeng Li, Jiejia Liu, Jia Quan, Chunyang Jiang, Guole Wang, Le Wang, Qian Wang, Sifan Wang, Aimin Xiao, Shuo Zhang

    Abstract: DIffuse X-ray Explorer (DIXE) is a proposed high-resolution X-ray spectroscopic sky surveyor on the China Space Station (CSS). DIXE will focus on studying hot baryons in the Milky Way. Galactic hot baryons like the X-ray emitting Milky Way halo and eROSITA bubbles are best observed in the sky survey mode with a large field of view. DIXE will take advantage of the orbital motion of the CSS to scan… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, the full version is published by Journal of Low Temperature Physics

  11. arXiv:2405.02794  [pdf, other

    cs.RO

    Octopi: Object Property Reasoning with Large Tactile-Language Models

    Authors: Samson Yu, Kelvin Lin, Anxing Xiao, Jiafei Duan, Harold Soh

    Abstract: Physical reasoning is important for effective robot manipulation. Recent work has investigated both vision and language modalities for physical reasoning; vision can reveal information about objects in the environment and language serves as an abstraction and communication medium for additional context. Although these works have demonstrated success on a variety of physical reasoning tasks, they a… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted at Robotics: Science and Systems (R:SS 2024)

  12. arXiv:2404.14953  [pdf, other

    cs.LG

    Dynamic pricing with Bayesian updates from online reviews

    Authors: José Correa, Mathieu Mari, Andrew Xia

    Abstract: When launching new products, firms face uncertainty about market reception. Online reviews provide valuable information not only to consumers but also to firms, allowing firms to adjust the product characteristics, including its selling price. In this paper, we consider a pricing model with online reviews in which the quality of the product is uncertain, and both the seller and the buyers Bayesian… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  13. arXiv:2402.03631  [pdf, other

    cs.CV

    CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model

    Authors: Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Ling Shao, Shijian Lu

    Abstract: The recent Segment Anything Model (SAM) has demonstrated remarkable zero-shot capability and flexible geometric prompting in general image segmentation. However, SAM often struggles when handling various unconventional images, such as aerial, medical, and non-RGB images. This paper presents CAT-SAM, a ConditionAl Tuning network that adapts SAM toward various unconventional target tasks with just f… ▽ More

    Submitted 15 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ECCV 2024

  14. arXiv:2401.08407  [pdf, other

    cs.CV

    Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining

    Authors: Jiahao Nie, Yun Xing, Gongjie Zhang, Pei Yan, Aoran Xiao, Yap-Peng Tan, Alex C. Kot, Shijian Lu

    Abstract: Cross-Domain Few-Shot Segmentation (CD-FSS) poses the challenge of segmenting novel categories from a distinct domain using only limited exemplars. In this paper, we undertake a comprehensive study of CD-FSS and uncover two crucial insights: (i) the necessity of a fine-tuning stage to effectively transfer the learned meta-knowledge across domains, and (ii) the overfitting risk during the naïve fin… ▽ More

    Submitted 13 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by CVPR 2024

  15. arXiv:2401.08344  [pdf, other

    math.PR

    Large-population asymptotics for the maximum of diffusive particles with mean-field interaction in the noises

    Authors: Nikolaos Kolliopoulos, David Sanchez, Amy Xiao

    Abstract: We study the $N \to \infty$ limit of the normalized largest component in some systems of $N$ diffusive particles with mean-field interaction. By applying a universal time change, the interaction in noises is transferred to the drift terms, and the asymptotic behavior of the maximum becomes well-understood due to existing results in the literature. We expect that the normalized maximum in the origi… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 12 pages

    MSC Class: 60K35; 60H10; 60F05; 60G70

  16. arXiv:2311.17406  [pdf, other

    cs.RO cs.AI

    LLM-State: Open World State Representation for Long-horizon Task Planning with Large Language Model

    Authors: Siwei Chen, Anxing Xiao, David Hsu

    Abstract: This work addresses the problem of long-horizon task planning with the Large Language Model (LLM) in an open-world household environment. Existing works fail to explicitly track key objects and attributes, leading to erroneous decisions in long-horizon tasks, or rely on highly engineered state features and feedback, which is not generalizable. We propose an open state representation that provides… ▽ More

    Submitted 22 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  17. arXiv:2311.06711  [pdf, ps, other

    math.NA

    Optimal $L^\infty(L^2)$ and $L^1(L^2)$ a posteriori error estimates for the fully discrete approximations of time fractional parabolic differential equations

    Authors: Jiliang Cao, Wansheng Wang, Aiguo Xiao

    Abstract: We derive optimal order a posteriori error estimates in the $L^\infty(L^2)$ and $L^1(L^2)$-norms for the fully discrete approximations of time fractional parabolic differential equations. For the discretization in time, we use the $L1$ methods, while for the spatial discretization, we use standard conforming finite element methods. The linear and quadratic space-time reconstructions are introduced… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 22 pages

  18. Parking Spot Classification based on surround view camera system

    Authors: Andy Xiao, Deep Doshi, Lihao Wang, Harsha Gorantla, Thomas Heitzmann, Peter Groth

    Abstract: Surround-view fisheye cameras are commonly used for near-field sensing in automated driving scenarios, including urban driving and auto valet parking. Four fisheye cameras, one on each side, are sufficient to cover 360° around the vehicle capturing the entire near-field region. Based on surround view cameras, there has been much research on parking slot detection with main focus on the occupancy s… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: SPIE Optical Engineering + Applications, 2023, San Diego, California, United States. Proc. SPIE 12675, Applications of Machine Learning 2023

  19. arXiv:2310.12141  [pdf, other

    math.PR

    A phase transition and critical phenomenon for the two-dimensional random field Ising model

    Authors: Jian Ding, Fenglin Huang, Aoteng Xia

    Abstract: We study the random field Ising model in a two-dimensional box with side length $N$ where the external field is given by independent normal variables with mean $0$ and variance $ε^2$. Our primary result is the following phase transition at $T = T_c$: for $ε\ll N^{-7/8}$ the boundary influence (i.e., the difference between the spin averages at the center of the box with the plus and the minus bound… ▽ More

    Submitted 4 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 65 pages; minor revision throughout over previous version

    MSC Class: 60K35; 82B44

  20. arXiv:2310.09078  [pdf, other

    cs.NI eess.SP

    DNFS-VNE: Deep Neuro Fuzzy System Driven Virtual Network Embedding

    Authors: Ailing Xiao, Ning Chen, Sheng Wu, Peiying Zhang, Linling Kuang, Chunxiao Jiang

    Abstract: By decoupling substrate resources, network virtualization (NV) is a promising solution for meeting diverse demands and ensuring differentiated quality of service (QoS). In particular, virtual network embedding (VNE) is a critical enabling technology that enhances the flexibility and scalability of network deployment by addressing the coupling of Internet processes and services. However, in the exi… ▽ More

    Submitted 3 July, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  21. arXiv:2309.13505  [pdf, other

    cs.CV

    Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation

    Authors: Yun Xing, Jian Kang, Aoran Xiao, Jiahao Nie, Ling Shao, Shijian Lu

    Abstract: Vision-Language Pre-training has demonstrated its remarkable zero-shot recognition ability and potential to learn generalizable visual representations from language supervision. Taking a step ahead, language-supervised semantic segmentation enables spatial localization of textual inputs by learning pixel grouping solely from image-text pairs. Nevertheless, the state-of-the-art suffers from clear s… ▽ More

    Submitted 4 January, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023. Code is available at https://github.com/xing0047/rewrite

  22. arXiv:2309.06041  [pdf, other

    cs.RO

    GVD-Exploration: An Efficient Autonomous Robot Exploration Framework Based on Fast Generalized Voronoi Diagram Extraction

    Authors: Dingfeng Chen, Anxing Xiao, Meiyuan Zou, Wenzheng Chi, Jiankun Wang, Lining Sun

    Abstract: Rapidly-exploring Random Trees (RRTs) are a popular technique for autonomous exploration of mobile robots. However, the random sampling used by RRTs can result in inefficient and inaccurate frontiers extraction, which affects the exploration performance. To address the issues of slow path planning and high path cost, we propose a framework that uses a generalized Voronoi diagram (GVD) based multi-… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 11 pages, 10 figures

  23. arXiv:2309.03005  [pdf, ps, other

    math.NA

    On multi-step extended maximum residual Kaczmarz method for solving large inconsistent linear systems

    Authors: Aqin Xiao, Junfeng Yin, Ning Zheng

    Abstract: A multi-step extended maximum residual Kaczmarz method is presented for the solution of the large inconsistent linear system of equations by using the multi-step iterations technique. Theoretical analysis proves the proposed method is convergent and gives an upper bound on its convergence rate. Numerical experiments show that the proposed method is effective and outperforms the existing extended K… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  24. arXiv:2309.02780  [pdf, other

    cs.CL cs.SD eess.AS

    GRASS: Unified Generation Model for Speech-to-Semantic Tasks

    Authors: Aobo Xia, Shuyu Lei, Yushu Yang, Xiang Guo, Hua Chai

    Abstract: This paper explores the instruction fine-tuning technique for speech-to-semantic tasks by introducing a unified end-to-end (E2E) framework that generates target text conditioned on a task-related prompt for audio data. We pre-train the model using large and diverse data, where instruction-speech pairs are constructed via a text-to-speech (TTS) system. Extensive experiments demonstrate that our pro… ▽ More

    Submitted 11 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  25. arXiv:2307.15283  [pdf, ps, other

    math.NA

    On averaging block Kaczmarz methods for solving nonlinear systems of equations

    Authors: Aqin Xiao, Junfeng Yin

    Abstract: A class of averaging block nonlinear Kaczmarz methods is developed for the solution of the nonlinear system of equations. The convergence theory of the proposed method is established under suitable assumptions and the upper bounds of the convergence rate for the proposed method with both constant stepsize and adaptive stepsize are derived. Numerical experiments are presented to verify the efficien… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  26. arXiv:2305.19812  [pdf, other

    cs.CV

    A Survey of Label-Efficient Deep Learning for 3D Point Clouds

    Authors: Aoran Xiao, Xiaoqin Zhang, Ling Shao, Shijian Lu

    Abstract: In the past decade, deep neural networks have achieved significant progress in point cloud learning. However, collecting large-scale precisely-annotated training data is extremely laborious and expensive, which hinders the scalability of existing point cloud datasets and poses a bottleneck for efficient exploration of point cloud data in various tasks and applications. Label-efficient learning off… ▽ More

    Submitted 17 June, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  27. arXiv:2304.00690  [pdf, other

    cs.CV

    3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds

    Authors: Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu, Eric Xing

    Abstract: Robust point cloud parsing under all-weather conditions is crucial to level-5 autonomy in autonomous driving. However, how to learn a universal 3D semantic segmentation (3DSS) model is largely neglected as most existing benchmarks are dominated by point clouds captured under normal weather. We introduce SemanticSTF, an adverse-weather point cloud dataset that provides dense point-level annotations… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: CVPR2023

  28. Designing the pressure-dependent shear modulus using tessellated granular metamaterials

    Authors: Jerry Zhang, Dong Wang, Weiwei Jin, Annie Xia, Nidhi Pashine, Rebecca Kramer-Bottiglio, Mark D. Shattuck, Corey S. O'Hern

    Abstract: Jammed packings of granular materials display complex mechanical response. For example, the ensemble-averaged shear modulus $\left\langle G \right\rangle$ increases as a power-law in pressure $p$ for static packings of soft spherical particles that can rearrange during compression. We seek to design granular materials with shear moduli that can either increase {\it or} decrease with pressure witho… ▽ More

    Submitted 10 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Journal ref: Phys. Rev. E 108, 034901 (2023)

  29. arXiv:2303.06624  [pdf, other

    cs.RO

    Collaborative Trolley Transportation System with Autonomous Nonholonomic Robots

    Authors: Bingyi Xia, Hao Luan, Ziqi Zhao, Xuheng Gao, Peijia Xie, Anxing Xiao, Jiankun Wang, Max Q. -H. Meng

    Abstract: Cooperative object transportation using multiple robots has been intensively studied in the control and robotics literature, but most approaches are either only applicable to omnidirectional robots or lack a complete navigation and decision-making framework that operates in real time. This paper presents an autonomous nonholonomic multi-robot system and an end-to-end hierarchical autonomy framewor… ▽ More

    Submitted 21 July, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

  30. arXiv:2303.05223  [pdf, other

    stat.ME

    LEAP: The latent exchangeability prior for borrowing information from historical data

    Authors: Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, Joseph G. Ibrahim

    Abstract: It is becoming increasingly popular to elicit informative priors on the basis of historical data. Popular existing priors, including the power prior, commensurate prior, and robust meta-analytic prior provide blanket discounting. Thus, if only a subset of participants in the historical data are exchangeable with the current data, these priors may not be appropriate. In order to combat this issue,… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  31. arXiv:2302.10654  [pdf, ps, other

    math.PR

    On the rate of normal approximation for Poisson continuum percolation

    Authors: Tiffany Y. Y. Lo, Aihua Xia

    Abstract: It is known that the number of points in the largest cluster of a percolating Poisson process restricted to a large finite box is asymptotically normal. In this note, we establish a rate of convergence for the statement. As each point in the largest cluster is determined by points as far as the diameter of the box, known results in the literature of normal approximation for Poisson functionals can… ▽ More

    Submitted 7 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 10 pages. This version contains a correction to an error in Lemma 2.2 in the previous versions

    MSC Class: primary 60K35; 60F05; secondary 60D05; 60G57; 82B43; 62E20

  32. The Digital Foundation Platform -- A Multi-layered SOA Architecture for Intelligent Connected Vehicle Operating System

    Authors: David Yu, Andy Xiao

    Abstract: Legacy AD/ADAS development from OEMs centers around developing functions on ECUs using services provided by AUTOSAR Classic Platform (CP) to meet automotive-grade and mass-production requirements. The AUTOSAR CP couples hardware and software components statically and encounters challenges to provide sufficient capacities for the processing of high-level intelligent driving functions, whereas the n… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: WCX SAE World Congress Experience 2022

  33. arXiv:2210.05128  [pdf, ps, other

    math.NA

    On fast greedy block Kaczmarz methods for solving large consistent linear systems

    Authors: Aqin Xiao, Junfeng Yin, Ning Zheng

    Abstract: A class of fast greedy block Kaczmarz methods combined with general greedy strategy and average technique are proposed for solving large consistent linear systems. Theoretical analysis of the convergence of the proposed method is given in detail. Numerical experiments show that the proposed methods are efficient and faster than the existing methods.

    Submitted 16 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: 11 pages, 1 figure

  34. arXiv:2209.13998  [pdf, other

    math.PR

    Long range order for three-dimensional random field Ising model throughout the entire low temperature regime

    Authors: Jian Ding, Yu Liu, Aoteng Xia

    Abstract: For $d\geq 3$, we study the Ising model on $\mathbb Z^d$ with random field given by $\{εh_v: v\in \mathbb Z^d\}$ where $h_v$'s are independent normal variables with mean 0 and variance 1. We show that for any $T < T_c$ (here $T_c$ is the critical temperature without disorder), long range order exists as long as $ε$ is sufficiently small depending on $T$. Our work extends previous results of Imbrie… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 36 pages

    MSC Class: 60K35; 82B44

  35. arXiv:2208.00223  [pdf, other

    cs.CV cs.AI cs.LG

    PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds

    Authors: Aoran Xiao, Jiaxing Huang, Dayan Guan, Kaiwen Cui, Shijian Lu, Ling Shao

    Abstract: LiDAR point clouds, which are usually scanned by rotating LiDAR sensors continuously, capture precise geometry of the surrounding environment and are crucial to many autonomous detection and navigation tasks. Though many 3D deep architectures have been developed, efficient collection and annotation of large amounts of point clouds remain one major challenge in the analytic and understanding of poi… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

  36. arXiv:2205.13211  [pdf, ps, other

    math.PR

    Convergence rate for geometric statistics of point processes with fast decay dependence

    Authors: Tianshu Cong, Aihua Xia

    Abstract: [Błaszczyszyn, Yogeshwaran and Yukich (2019)] established central limit theorems for geometric statistics of point processes having fast decay dependence. As limit theorems are of limited use unless we understand their errors involved in the approximation, in this paper, we consider the rates of a normal approximation in terms of the Wasserstein distance for statistics of point processes on… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: 42 pages

    MSC Class: primary 60F05; secondary 60D05; 60G55; 62E20; 05C80

  37. arXiv:2205.03967  [pdf, other

    stat.ME math.ST

    The saturated pairwise interaction Gibbs point process as a joint species distribution model

    Authors: Ian Flint, Nick Golding, Peter Vesk, Yan Wang, Aihua Xia

    Abstract: In an effort to effectively model observed patterns in the spatial configuration of individuals of multiple species in nature, we introduce the saturated pairwise interaction Gibbs point process. Its main strength lies in its ability to model both attraction and repulsion within and between species, over different scales. As such, it is particularly well-suited to the study of associations in… ▽ More

    Submitted 20 August, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: 36 pages, 14 figures

    Journal ref: Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), 2022, pages 1721-1752

  38. arXiv:2204.06456  [pdf, other

    cond-mat.quant-gas hep-ph quant-ph

    Non-equilibrium dynamics of fluctuations in an ultra-cold atomic mixture

    Authors: Apoorva Hegde, Robert Ott, Andy Xia, Valentin Kasper, Jürgen Berges, Fred Jendrzejewski

    Abstract: We investigate an ultra-cold mixture of Bose gases interacting via spin-changing collisions by studying the dynamics of spin fluctuations. The experimental implementation employs $^{23}$Na and $^{7}$Li atoms, which are prepared out of equilibrium across a wide range of initial conditions. We identify three regimes in the dynamics of the system for different initial states: a long-lived metastable… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: 9 pages, 5 figures

  39. arXiv:2204.03875  [pdf, other

    cs.DS cs.CG

    Deterministic, Near-Linear $\varepsilon$-Approximation Algorithm for Geometric Bipartite Matching

    Authors: Pankaj K. Agarwal, Hsien-Chih Chang, Sharath Raghvendra, Allen Xiao

    Abstract: Given point sets $A$ and $B$ in $\mathbb{R}^d$ where $A$ and $B$ have equal size $n$ for some constant dimension $d$ and a parameter $\varepsilon>0$, we present the first deterministic algorithm that computes, in $n\cdot(\varepsilon^{-1} \log n)^{O(d)}$ time, a perfect matching between $A$ and $B$ whose cost is within a $(1+\varepsilon)$ factor of the optimal under any $\smash{\ell_p}$-norm. Altho… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: The conference version of the paper is accepted to STOC 2022

  40. arXiv:2203.10026  [pdf, other

    cs.CV

    Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation

    Authors: Dayan Guan, Jiaxing Huang, Aoran Xiao, Shijian Lu

    Abstract: Semi-supervised semantic segmentation learns from small amounts of labelled images and large amounts of unlabelled images, which has witnessed impressive progress with the recent advance of deep neural networks. However, it often suffers from severe class-bias problem while exploring the unlabelled images, largely due to the clear pixel-wise class imbalance in the labelled images. This paper prese… ▽ More

    Submitted 26 March, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022. Code is available at https://github.com/Dayan-Guan/USRN

  41. arXiv:2203.04541  [pdf, other

    cs.RO

    PUTN: A Plane-fitting based Uneven Terrain Navigation Framework

    Authors: Zhuozhu Jian, Zihong Lu, Xiao Zhou, Bin Lan, Anxing Xiao, Xueqian Wang, Bin Liang

    Abstract: Autonomous navigation of ground robots has been widely used in indoor structured 2D environments, but there are still many challenges in outdoor 3D unstructured environments, especially in rough, uneven terrains. This paper proposed a plane-fitting based uneven terrain navigation framework (PUTN) to solve this problem. The implementation of PUTN is divided into three steps. First, based on Rapidly… ▽ More

    Submitted 27 September, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

  42. arXiv:2203.03927  [pdf, other

    cs.RO eess.SY

    Quadruped Guidance Robot for the Visually Impaired: A Comfort-Based Approach

    Authors: Yanbo Chen, Zhengzhe Xu, Zhuozhu Jian, Gengpan Tang, Yunong Yangli, Anxing Xiao, Xueqian Wang, Bin Liang

    Abstract: Guidance robots that can guide people and avoid various obstacles, could potentially be owned by more visually impaired people at a fairly low cost. Most of the previous guidance robots for the visually impaired ignored the human response behavior and comfort, treating the human as an appendage dragged by the robot, which can lead to imprecise guidance of the human and sudden changes in the tracti… ▽ More

    Submitted 23 June, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2023

  43. Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey

    Authors: Aoran Xiao, Jiaxing Huang, Dayan Guan, Xiaoqin Zhang, Shijian Lu, Ling Shao

    Abstract: Point cloud data have been widely explored due to its superior accuracy and robustness under various adverse situations. Meanwhile, deep neural networks (DNNs) have achieved very impressive success in various applications such as surveillance and autonomous driving. The convergence of point cloud and DNNs has led to many deep point cloud models, largely trained under the supervision of large-scale… ▽ More

    Submitted 26 March, 2023; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

  44. arXiv:2111.09983  [pdf, other

    eess.AS cs.SD

    Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

    Authors: Chunxi Liu, Michael Picheny, Leda Sarı, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf

    Abstract: It is well known that many machine learning systems demonstrate bias towards specific groups of individuals. This problem has been studied extensively in the Facial Recognition area, but much less so in Automatic Speech Recognition (ASR). This paper presents initial Speech Recognition results on "Casual Conversations" -- a publicly released 846 hour corpus designed to help researchers evaluate the… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: Submitted to ICASSP 2022. Our dataset will be publicly available at (https://ai.facebook.com/datasets/casual-conversations-downloads) for general use. We also would like to note that considering the limitations of our dataset, we limit the use of it for only evaluation purposes (see license agreement)

  45. arXiv:2111.05948  [pdf, other

    cs.CL cs.SD eess.AS

    Scaling ASR Improves Zero and Few Shot Learning

    Authors: Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed

    Abstract: With 4.5 million hours of English speech from 10 different sources across 120 countries and models of up to 10 billion parameters, we explore the frontiers of scale for automatic speech recognition. We propose data selection techniques to efficiently scale training data to find the most valuable samples in massive datasets. To efficiently scale model sizes, we leverage various optimizations such a… ▽ More

    Submitted 29 November, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

  46. arXiv:2110.06648  [pdf, other

    cs.RO eess.SY

    Robotic Autonomous Trolley Collection with Progressive Perception and Nonlinear Model Predictive Control

    Authors: Anxing Xiao, Hao Luan, Ziqi Zhao, Yue Hong, Jieting Zhao, Weinan Chen, Jiankun Wang, Max Q. -H. Meng

    Abstract: Autonomous mobile manipulation robots that can collect trolleys are widely used to liberate human resources and fight epidemics. Most prior robotic trolley collection solutions only detect trolleys with 2D poses or are merely based on specific marks and lack the formal design of planning algorithms. In this paper, we present a novel mobile manipulation system with applications in luggage trolley c… ▽ More

    Submitted 1 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted to the 2022 International Conference on Robotics and Automation (ICRA 2022)

  47. arXiv:2110.05241  [pdf, other

    eess.AS cs.CL cs.LG

    Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

    Authors: Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer

    Abstract: This paper improves the streaming transformer transducer for speech recognition by using non-causal convolution. Many works apply the causal convolution to improve streaming transformer ignoring the lookahead context. We propose to use non-causal convolution to process the center block and lookahead context separately. This method leverages the lookahead context in convolution and maintains simila… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 5 pages, 3 figures, submit to ICASSP 2022

  48. arXiv:2110.03374  [pdf, other

    cs.CV

    Model Adaptation: Historical Contrastive Learning for Unsupervised Domain Adaptation without Source Data

    Authors: Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu

    Abstract: Unsupervised domain adaptation aims to align a labeled source domain and an unlabeled target domain, but it requires to access the source data which often raises concerns in data privacy, data portability and data transmission efficiency. We study unsupervised model adaptation (UMA), or called Unsupervised Domain Adaptation without Source Data, an alternative setting that aims to adapt source-trai… ▽ More

    Submitted 4 June, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted to Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  49. arXiv:2110.03174  [pdf, other

    cs.SD cs.AI eess.AS

    Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study

    Authors: Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer

    Abstract: Detection of common events and scenes from audio is useful for extracting and understanding human contexts in daily life. Prior studies have shown that leveraging knowledge from a relevant domain is beneficial for a target acoustic event detection (AED) process. Inspired by the observation that many human-centered acoustic events in daily life involve voice elements, this paper investigates the po… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Submitted to ICASSP 2022

  50. arXiv:2108.00177  [pdf, other

    cs.CV cs.AI

    Greedy Network Enlarging

    Authors: Chuanjian Liu, Kai Han, An Xiao, Yiping Deng, Wei Zhang, Chunjing Xu, Yunhe Wang

    Abstract: Recent studies on deep convolutional neural networks present a simple paradigm of architecture design, i.e., models with more MACs typically achieve better accuracy, such as EfficientNet and RegNet. These works try to enlarge all the stages in the model with one unified rule by sampling and statistical methods. However, we observe that some network architectures have similar MACs and accuracies, b… ▽ More

    Submitted 25 November, 2021; v1 submitted 31 July, 2021; originally announced August 2021.