Skip to main content

Showing 1–28 of 28 results for author: Pei, H

  1. arXiv:2409.15822  [pdf, other

    cs.RO

    A Ducted Fan UAV for Safe Aerial Grabbing and Transfer of Multiple Loads Using Electromagnets

    Authors: Zhong Yin, Hailong Pei

    Abstract: In recent years, research on aerial grasping, manipulation, and transportation of objects has garnered significant attention. These tasks often require UAVs to operate safely close to environments or objects and to efficiently grasp payloads. However, current widely adopted flying platforms pose safety hazards: unprotected high-speed rotating propellers can cause harm to the surroundings. Addition… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 8pages, 13figures,accepted by IROS2024 This work has been submitted to the IEEE for possible publication

  2. arXiv:2409.03155  [pdf, other

    cs.CL cs.AI

    Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language Models

    Authors: Jie Ma, Zhitao Gao, Qi Chai, Wangchun Sun, Pinghui Wang, Hongbin Pei, Jing Tao, Lingyun Song, Jun Liu, Chen Zhang, Lizhen Cui

    Abstract: Large Language Models (LLMs) may suffer from hallucinations in real-world applications due to the lack of relevant knowledge. In contrast, knowledge graphs encompass extensive, multi-relational structures that store a vast array of symbolic facts. Consequently, integrating LLMs with knowledge graphs has been extensively explored, with Knowledge Graph Question Answering (KGQA) serving as a critical… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: 12 pages

    ACM Class: I.2.4

  3. arXiv:2404.12020  [pdf, other

    cs.CV

    Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering

    Authors: Jie Ma, Min Hu, Pinghui Wang, Wangchun Sun, Lingyun Song, Hongbin Pei, Jun Liu, Youtian Du

    Abstract: Audio-Visual Question Answering (AVQA) is a complex multi-modal reasoning task, demanding intelligent systems to accurately respond to natural language queries based on audio-video input pairs. Nevertheless, prevalent AVQA approaches are prone to overlearning dataset biases, resulting in poor robustness. Furthermore, current datasets may not provide a precise diagnostic for these methods. To tackl… ▽ More

    Submitted 21 October, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted by NeurIPS 2024

    ACM Class: I.2.10

  4. arXiv:2402.01253  [pdf, other

    cs.IR

    RimiRec: Modeling Refined Multi-interest in Hierarchical Structure for Recommendation

    Authors: Haolei Pei, Yuanyuan Xu, Yangping Zhu, Yuan Nie

    Abstract: Industrial recommender systems usually consist of the retrieval stage and the ranking stage, to handle the billion-scale of users and items. The retrieval stage retrieves candidate items relevant to user interests for recommendations and has attracted much attention. Frequently, a user shows refined multi-interests in a hierarchical structure. For example, a user likes Conan and Kuroba Kaito, whic… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 4 pages, 4 figures

  5. arXiv:2311.16522  [pdf

    cs.LG cs.CL eess.SP

    Dynamic Fault Characteristics Evaluation in Power Grid

    Authors: Hao Pei, Si Lin, Chuanfu Li, Che Wang, Haoming Chen, Sizhe Li

    Abstract: To enhance the intelligence degree in operation and maintenance, a novel method for fault detection in power grids is proposed. The proposed GNN-based approach first identifies fault nodes through a specialized feature extraction method coupled with a knowledge graph. By incorporating temporal data, the method leverages the status of nodes from preceding and subsequent time periods to help current… ▽ More

    Submitted 27 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  6. arXiv:2311.11225  [pdf, other

    cs.LG cs.CR

    TextGuard: Provable Defense against Backdoor Attacks on Text Classification

    Authors: Hengzhi Pei, Jinyuan Jia, Wenbo Guo, Bo Li, Dawn Song

    Abstract: Backdoor attacks have become a major security threat for deploying machine learning models in security-critical applications. Existing research endeavors have proposed many defenses against backdoor attacks. Despite demonstrating certain empirical defense efficacy, none of these techniques could provide a formal and provable security guarantee against arbitrary attacks. As a result, they can be ea… ▽ More

    Submitted 24 November, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: Accepted by NDSS Symposium 2024

  7. arXiv:2308.07723  [pdf, other

    cs.RO cs.MA

    Extended Preintegration for Relative State Estimation of Leader-Follower Platform

    Authors: Ruican Xia, Hailong Pei

    Abstract: Relative state estimation using exteroceptive sensors suffers from limitations of the field of view (FOV) and false detection, that the proprioceptive sensor (IMU) data are usually engaged to compensate. Recently ego-motion constraint obtained by Inertial measurement unit (IMU) preintegration has been extensively used in simultaneous localization and mapping (SLAM) to alleviate the computation bur… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  8. arXiv:2307.11471  [pdf, other

    cs.CV cs.AI

    Robust Visual Question Answering: Datasets, Methods, and Future Challenges

    Authors: Jie Ma, Pinghui Wang, Dechen Kong, Zewei Wang, Jun Liu, Hongbin Pei, Junzhou Zhao

    Abstract: Visual question answering requires a system to provide an accurate natural language answer given an image and a natural language question. However, it is widely recognized that previous generic VQA methods often exhibit a tendency to memorize biases present in the training data rather than learning proper behaviors, such as grounding images before predicting answers. Therefore, these methods usual… ▽ More

    Submitted 18 February, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE TPAMI

    ACM Class: I.2.10

  9. arXiv:2306.11698  [pdf, other

    cs.CL cs.AI cs.CR

    DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

    Authors: Boxin Wang, Weixin Chen, Hengzhi Pei, Chulin Xie, Mintong Kang, Chenhui Zhang, Chejian Xu, Zidi Xiong, Ritik Dutta, Rylan Schaeffer, Sang T. Truong, Simran Arora, Mantas Mazeika, Dan Hendrycks, Zinan Lin, Yu Cheng, Sanmi Koyejo, Dawn Song, Bo Li

    Abstract: Generative Pre-trained Transformer (GPT) models have exhibited exciting progress in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the literature on the trustworthiness of GPT models remains limited, practitioners have proposed employing capable GPT models for sensitive applications such as healthcare and finance -- where mistakes can be costly. To thi… ▽ More

    Submitted 26 February, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Outstanding Paper (Datasets and Benchmarks Track)

  10. arXiv:2306.00381  [pdf, other

    cs.SE cs.LG

    Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion

    Authors: Hengzhi Pei, Jinman Zhao, Leonard Lausen, Sheng Zha, George Karypis

    Abstract: Pretrained code language models have enabled great progress towards program synthesis. However, common approaches only consider in-file local context and thus miss information and constraints imposed by other parts of the codebase and its external dependencies. Existing code completion benchmarks also lack such context. To resolve these restrictions we curate a new dataset of permissively licensed… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 12 pages. Accepted to AAAI 2023

    ACM Class: I.2.2; I.2.7

  11. Synthetic Datasets for Autonomous Driving: A Survey

    Authors: Zhihang Song, Zimin He, Xingyu Li, Qiming Ma, Ruibo Ming, Zhiqi Mao, Huaxin Pei, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

    Abstract: Autonomous driving techniques have been flourishing in recent years while thirsting for huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up with the pace of changing requirements due to their expensive and time-consuming experimental and labeling costs. Therefore, more and more researchers are turning to synthetic datasets to easily generate rich and chan… ▽ More

    Submitted 27 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 19 pages, 5 figures

    Journal ref: in IEEE Transactions on Intelligent Vehicles, vol. 9, no. 1, pp. 1847-1864, Jan. 2024

  12. arXiv:2304.06966  [pdf

    cs.CV cs.AI cs.LG cs.RO

    Self-Supervised Learning based Depth Estimation from Monocular Images

    Authors: Mayank Poddar, Akash Mishra, Mohit Kewlani, Haoyang Pei

    Abstract: Depth Estimation has wide reaching applications in the field of Computer vision such as target tracking, augmented reality, and self-driving cars. The goal of Monocular Depth Estimation is to predict the depth map, given a 2D monocular RGB image as input. The traditional depth estimation methods are based on depth cues and used concepts like epipolar geometry. With the evolution of Convolutional N… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  13. arXiv:2211.03252  [pdf, other

    cs.CL

    Zero-Shot Classification by Logical Reasoning on Natural Language Explanations

    Authors: Chi Han, Hengzhi Pei, Xinya Du, Heng Ji

    Abstract: Humans can classify data of an unseen category by reasoning on its language explanations. This ability is owing to the compositional nature of language: we can combine previously seen attributes to describe the new category. For example, we might describe a sage thrasher as "it has a slim straight relatively short bill, yellow eyes and a long tail", so that others can use their knowledge of attrib… ▽ More

    Submitted 25 May, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: 9 pages, 8 figures. Accepted in the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023) Findings

  14. arXiv:2209.08237  [pdf, other

    cs.CV

    Understanding the Impact of Image Quality and Distance of Objects to Object Detection Performance

    Authors: Yu Hao, Haoyang Pei, Yixuan Lyu, Zhongzheng Yuan, John-Ross Rizzo, Yao Wang, Yi Fang

    Abstract: Deep learning has made great strides for object detection in images. The detection accuracy and computational cost of object detection depend on the spatial resolution of an image, which may be constrained by both the camera and storage considerations. Compression is often achieved by reducing either spatial or amplitude resolution or, at times, both, both of which have well-known effects on perfo… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

  15. arXiv:2206.12840  [pdf, other

    cs.LG cs.CL

    Your Autoregressive Generative Model Can be Better If You Treat It as an Energy-Based One

    Authors: Yezhen Wang, Tong Che, Bo Li, Kaitao Song, Hengzhi Pei, Yoshua Bengio, Dongsheng Li

    Abstract: Autoregressive generative models are commonly used, especially for those tasks involving sequential data. They have, however, been plagued by a slew of inherent flaws due to the intrinsic characteristics of chain-style conditional modeling (e.g., exposure bias or lack of long-range coherence), severely limiting their ability to model distributions properly. In this paper, we propose a unique metho… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Preprint version

  16. arXiv:2203.02677  [pdf

    cs.RO

    Tightly Coupled Optimization-based GPS-Visual-Inertial Odometry with Online Calibration and Initialization

    Authors: Shihao Han, Feiyang Deng, Tao Li, Hailong Pei

    Abstract: In this paper, we present a tightly coupled optimization-based GPS-Visual-Inertial odometry system to solve the trajectory drift of the visual-inertial odometry especially over long-term runs. Visual reprojection residuals, IMU residuals, and GPS measurement residuals are jointly minimized within a local bundle adjustment, in which we apply GPS measurements and IMU preintegration used for the IMU… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

    Comments: 7 pages, 10 figures

  17. arXiv:2112.13194  [pdf, other

    eess.IV cs.CV eess.SP

    Network-Aware 5G Edge Computing for Object Detection: Augmenting Wearables to "See" More, Farther and Faster

    Authors: Zhongzheng Yuan, Tommy Azzino, Yu Hao, Yixuan Lyu, Haoyang Pei, Alain Boldini, Marco Mezzavilla, Mahya Beheshti, Maurizio Porfiri, Todd Hudson, William Seiple, Yi Fang, Sundeep Rangan, Yao Wang, J. R. Rizzo

    Abstract: Advanced wearable devices are increasingly incorporating high-resolution multi-camera systems. As state-of-the-art neural networks for processing the resulting image data are computationally demanding, there has been growing interest in leveraging fifth generation (5G) wireless connectivity and mobile edge computing for offloading this processing to the cloud. To assess this possibility, this pape… ▽ More

    Submitted 15 April, 2022; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: Published in: IEEE Access ( Volume: 10)

  18. arXiv:2111.08386  [pdf, other

    cs.LG cs.AI

    Towards Generating Real-World Time Series Data

    Authors: Hengzhi Pei, Kan Ren, Yuqing Yang, Chang Liu, Tao Qin, Dongsheng Li

    Abstract: Time series data generation has drawn increasing attention in recent years. Several generative adversarial network (GAN) based methods have been proposed to tackle the problem usually with the assumption that the targeted time series data are well-formatted and complete. However, real-world time series (RTS) data are far away from this utopia, e.g., long sequences with variable lengths and informa… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted in 21th IEEE International Conference on Data Mining (ICDM 2021). Code is available at https://seqml.github.io/rtsgan

  19. arXiv:2011.14211  [pdf, other

    cs.LG cs.CV stat.ML

    Curvature Regularization to Prevent Distortion in Graph Embedding

    Authors: Hongbin Pei, Bingzhe Wei, Kevin Chen-Chuan Chang, Chunxu Zhang, Bo Yang

    Abstract: Recent research on graph embedding has achieved success in various applications. Most graph embedding methods preserve the proximity in a graph into a manifold in an embedding space. We argue an important but neglected problem about this proximity-preserving strategy: Graph topology patterns, while preserved well into an embedding manifold by preserving proximity, may distort in the ambient embedd… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

    Comments: Published as a conference paper at NeurIPS 2020

  20. arXiv:2003.00120  [pdf, other

    cs.LG cs.CR stat.ML

    Improving Certified Robustness via Statistical Learning with Logical Reasoning

    Authors: Zhuolin Yang, Zhikuan Zhao, Boxin Wang, Jiawei Zhang, Linyi Li, Hengzhi Pei, Bojan Karlas, Ji Liu, Heng Guo, Ce Zhang, Bo Li

    Abstract: Intensive algorithmic efforts have been made to enable the rapid improvements of certificated robustness for complex ML models recently. However, current robustness certification methods are only able to certify under a limited perturbation radius. Given that existing pure data-driven statistical approaches have reached a bottleneck, in this paper, we propose to integrate statistical ML models wit… ▽ More

    Submitted 12 April, 2023; v1 submitted 28 February, 2020; originally announced March 2020.

    Comments: Accepted by 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  21. arXiv:2002.05780  [pdf, other

    q-fin.PM cs.LG stat.ML

    Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States

    Authors: Yunan Ye, Hengzhi Pei, Boxin Wang, Pin-Yu Chen, Yada Zhu, Jun Xiao, Bo Li

    Abstract: Portfolio management (PM) is a fundamental financial planning task that aims to achieve investment goals such as maximal profits or minimal risks. Its decision process involves continuous derivation of valuable information from various data sources and sequential decision optimization, which is a prospective research direction for reinforcement learning (RL). In this paper, we propose SARL, a nove… ▽ More

    Submitted 9 February, 2020; originally announced February 2020.

    Comments: AAAI 2020

  22. arXiv:2002.05287  [pdf, other

    cs.LG cs.CV stat.ML

    Geom-GCN: Geometric Graph Convolutional Networks

    Authors: Hongbin Pei, Bingzhe Wei, Kevin Chen-Chuan Chang, Yu Lei, Bo Yang

    Abstract: Message-passing neural networks (MPNNs) have been successfully applied to representation learning on graphs in a variety of real-world applications. However, two fundamental weaknesses of MPNNs' aggregators limit their ability to represent graph-structured data: losing the structural information of nodes in neighborhoods and lacking the ability to capture long-range dependencies in disassortative… ▽ More

    Submitted 13 February, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper at ICLR 2020

  23. arXiv:1912.10375  [pdf, other

    cs.CL cs.LG

    T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack

    Authors: Boxin Wang, Hengzhi Pei, Boyuan Pan, Qian Chen, Shuohang Wang, Bo Li

    Abstract: Adversarial attacks against natural language processing systems, which perform seemingly innocuous modifications to inputs, can induce arbitrary mistakes to the target models. Though raised great concerns, such adversarial attacks can be leveraged to estimate the robustness of NLP models. Compared with the adversarial example generation in continuous data domain (e.g., image), generating adversari… ▽ More

    Submitted 5 October, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: Accepted to EMNLP 2020 as a long paper. 17 pages, 4 figures

  24. arXiv:1911.07135  [pdf, other

    cs.LG stat.ML

    The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks

    Authors: Yuheng Zhang, Ruoxi Jia, Hengzhi Pei, Wenxiao Wang, Bo Li, Dawn Song

    Abstract: This paper studies model-inversion attacks, in which the access to a model is abused to infer information about the training data. Since its first introduction, such attacks have raised serious concerns given that training data usually contain privacy-sensitive information. Thus far, successful model-inversion attacks have only been demonstrated on simple models, such as linear regression and logi… ▽ More

    Submitted 17 April, 2020; v1 submitted 16 November, 2019; originally announced November 2019.

  25. arXiv:1906.12035  [pdf, other

    cs.CL cs.AI

    A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer Encoder

    Authors: Xipeng Qiu, Hengzhi Pei, Hang Yan, Xuanjing Huang

    Abstract: Multi-criteria Chinese word segmentation (MCCWS) aims to exploit the relations among the multiple heterogeneous segmentation criteria and further improve the performance of each single criterion. Previous work usually regards MCCWS as different tasks, which are learned together under the multi-task learning framework. In this paper, we propose a concise but effective unified model for MCCWS, which… ▽ More

    Submitted 5 October, 2020; v1 submitted 28 June, 2019; originally announced June 2019.

    Comments: Findings of EMNLP 2020

  26. arXiv:1712.00328  [pdf, other

    stat.ML cs.LG

    Group Sparse Bayesian Learning for Active Surveillance on Epidemic Dynamics

    Authors: Hongbin Pei, Bo Yang, Jiming Liu, Lei Dong

    Abstract: Predicting epidemic dynamics is of great value in understanding and controlling diffusion processes, such as infectious disease spread and information propagation. This task is intractable, especially when surveillance resources are very limited. To address the challenge, we study the problem of active surveillance, i.e., how to identify a small portion of system components as sentinels to effect… ▽ More

    Submitted 21 November, 2017; originally announced December 2017.

  27. arXiv:1603.06780  [pdf, other

    cs.SI

    Early Warning of Human Crowds Based on Query Data from Baidu Map: Analysis Based on Shanghai Stampede

    Authors: Jingbo Zhou, Hongbin Pei, Haishan Wu

    Abstract: Without sufficient preparation and on-site management, the mass scale unexpected huge human crowd is a serious threat to public safety. A recent impressive tragedy is the 2014 Shanghai Stampede, where 36 people were killed and 49 were injured in celebration of the New Year's Eve on December 31th 2014 in the Shanghai Bund. Due to the innately stochastic and complicated individual movement, it is no… ▽ More

    Submitted 22 March, 2016; originally announced March 2016.

  28. Graph Regularized Low Rank Representation for Aerosol Optical Depth Retrieval

    Authors: Yubao Sun, Renlong Hang, Qingshan Liu, Fuping Zhu, Hucheng Pei

    Abstract: In this paper, we propose a novel data-driven regression model for aerosol optical depth (AOD) retrieval. First, we adopt a low rank representation (LRR) model to learn a powerful representation of the spectral response. Then, graph regularization is incorporated into the LRR model to capture the local structure information and the nonlinear property of the remote-sensing data. Since it is easy to… ▽ More

    Submitted 7 March, 2016; v1 submitted 22 February, 2016; originally announced February 2016.

    Comments: 16 pages, 6 figures