Skip to main content

Showing 1–50 of 101 results for author: Tan, Q

  1. arXiv:2410.15556  [pdf, other

    cs.LG

    Gradient Rewiring for Editable Graph Neural Network Training

    Authors: Zhimeng Jiang, Zirui Liu, Xiaotian Han, Qizhang Feng, Hongye Jin, Qiaoyu Tan, Kaixiong Zhou, Na Zou, Xia Hu

    Abstract: Deep neural networks are ubiquitously adopted in many applications, such as computer vision, natural language processing, and graph analytics. However, well-trained neural networks can make prediction errors after deployment as the world changes. \textit{Model editing} involves updating the base model to correct prediction errors with less accessible training data and computational resources. Desp… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  2. arXiv:2408.07246  [pdf, other

    cs.LG cs.CV

    ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

    Authors: Junxian Li, Di Zhang, Xunzhi Wang, Zeying Hao, Jingdi Lei, Qian Tan, Cai Zhou, Wei Liu, Yaotian Yang, Xinrui Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Wei Li, Shufei Zhang, Mao Su, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou

    Abstract: Large Language Models (LLMs) have achieved remarkable success and have been applied across various scientific fields, including chemistry. However, many chemical tasks require the processing of visual information, which cannot be successfully handled by existing chemical LLMs. This brings a growing need for models capable of integrating multimodal information in the chemical domain. In this paper,… ▽ More

    Submitted 16 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: 11 pages, updated version

  3. arXiv:2407.12232  [pdf, other

    cs.AR

    RTL Verification for Secure Speculation Using Contract Shadow Logic

    Authors: Qinhan Tan, Yuheng Yang, Thomas Bourgeat, Sharad Malik, Mengjia Yan

    Abstract: Modern out-of-order processors face speculative execution attacks. Despite various proposed software and hardware mitigations to prevent such attacks, new attacks keep arising from unknown vulnerabilities. Thus, a formal and rigorous evaluation of the ability of hardware designs to deal with speculative execution attacks is urgently desired. This paper proposes a formal verification technique call… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted to ASPLOS 2025

  4. arXiv:2407.07958  [pdf, other

    cs.CV

    Bayesian Detector Combination for Object Detection with Crowdsourced Annotations

    Authors: Zhi Qin Tan, Olga Isupova, Gustavo Carneiro, Xiatian Zhu, Yunpeng Li

    Abstract: Acquiring fine-grained object detection annotations in unconstrained images is time-consuming, expensive, and prone to noise, especially in crowdsourcing scenarios. Most prior object detection methods assume accurate annotations; A few recent works have studied object detection with noisy crowdsourced annotations, with evaluation on distinct synthetic crowdsourced datasets of varying setups under… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  5. arXiv:2407.01284  [pdf, other

    cs.AI cs.CL cs.CV cs.LG cs.SC

    We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?

    Authors: Runqi Qiao, Qiuna Tan, Guanting Dong, Minhui Wu, Chong Sun, Xiaoshuai Song, Zhuoma GongQue, Shanglin Lei, Zhe Wei, Miaoxuan Zhang, Runfeng Qiao, Yifan Zhang, Xiao Zong, Yida Xu, Muxi Diao, Zhimin Bao, Chen Li, Honggang Zhang

    Abstract: Visual mathematical reasoning, as a fundamental visual reasoning ability, has received widespread attention from the Large Multimodal Models (LMMs) community. Existing benchmarks, such as MathVista and MathVerse, focus more on the result-oriented performance but neglect the underlying principles in knowledge acquisition and generalization. Inspired by human-like mathematical reasoning, we introduc… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Work in progress

  6. arXiv:2406.14045  [pdf, other

    cs.LG cs.AI

    Understanding Different Design Choices in Training Large Time Series Models

    Authors: Yu-Neng Chuang, Songchen Li, Jiayi Yuan, Guanchu Wang, Kwei-Herng Lai, Leisheng Yu, Sirui Ding, Chia-Yuan Chang, Qiaoyu Tan, Daochen Zha, Xia Hu

    Abstract: Inspired by Large Language Models (LLMs), Time Series Forecasting (TSF), a long-standing task in time series analysis, is undergoing a transition towards Large Time Series Models (LTSMs), aiming to train universal transformer-based models for TSF. However, training LTSMs on heterogeneous time series data poses unique challenges, including diverse frequencies, dimensions, and patterns across datase… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2406.13934  [pdf, other

    cs.CL cs.AI

    Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment

    Authors: Kaishuai Xu, Yi Cheng, Wenjun Hou, Qiaoyu Tan, Wenjie Li

    Abstract: Medical dialogue systems have attracted significant attention for their potential to act as medical assistants. Enabling these medical systems to emulate clinicians' diagnostic reasoning process has been the long-standing research focus. Previous studies rudimentarily realized the simulation of clinicians' diagnostic process by fine-tuning language models on high-quality dialogue datasets. Nonethe… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 Findings

  8. arXiv:2406.13705  [pdf, other

    eess.IV cs.AI cs.CV

    EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy

    Authors: Long Bai, Tong Chen, Qiaozhi Tan, Wan Jun Nah, Yanheng Li, Zhicheng He, Sishen Yuan, Zhen Chen, Jinlin Wu, Mobarakol Islam, Zhen Li, Hongbin Liu, Hongliang Ren

    Abstract: Wireless Capsule Endoscopy (WCE) is highly valued for its non-invasive and painless approach, though its effectiveness is compromised by uneven illumination from hardware constraints and complex internal dynamics, leading to overexposed or underexposed images. While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting for different exposure levels rema… ▽ More

    Submitted 8 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: To appear in MICCAI 2024. Code and dataset availability: https://github.com/longbai1006/EndoUIC

  9. arXiv:2406.12950  [pdf, other

    q-bio.QM cs.AI cs.CE cs.CL cs.LG

    MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction

    Authors: Yuyan Liu, Sirui Ding, Sheng Zhou, Wenqi Fan, Qiaoyu Tan

    Abstract: Molecular property prediction (MPP) is a fundamental and crucial task in drug discovery. However, prior methods are limited by the requirement for a large number of labeled molecules and their restricted ability to generalize for unseen and new tasks, both of which are essential for real-world applications. To address these challenges, we present MolecularGPT for few-shot MPP. From a perspective o… ▽ More

    Submitted 18 October, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  10. arXiv:2406.12052  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    UniGLM: Training One Unified Language Model for Text-Attributed Graphs

    Authors: Yi Fang, Dongzhe Fan, Sirui Ding, Ninghao Liu, Qiaoyu Tan

    Abstract: Representation learning on text-attributed graphs (TAGs), where nodes are represented by textual descriptions, is crucial for textual and relational knowledge systems and recommendation systems. Currently, state-of-the-art embedding methods for TAGs primarily focus on fine-tuning language models (e.g., BERT) using structure-aware training signals. While effective, these methods are tailored for in… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2406.11945  [pdf, other

    cs.LG cs.AI cs.IR

    GAugLLM: Improving Graph Contrastive Learning for Text-Attributed Graphs with Large Language Models

    Authors: Yi Fang, Dongzhe Fan, Daochen Zha, Qiaoyu Tan

    Abstract: This work studies self-supervised graph learning for text-attributed graphs (TAGs) where nodes are represented by textual attributes. Unlike traditional graph contrastive methods that perturb the numerical feature space and alter the graph's topological structure, we aim to improve view generation through language supervision. This is driven by the prevalence of textual attributes in real applicat… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  12. arXiv:2406.08587  [pdf, other

    cs.CL cs.AI cs.LG

    CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

    Authors: Xiaoshuai Song, Muxi Diao, Guanting Dong, Zhengyang Wang, Yujia Fu, Runqi Qiao, Zhexu Wang, Dayuan Fu, Huangxuan Wu, Bin Liang, Weihao Zeng, Yejie Wang, Zhuoma GongQue, Jianing Yu, Qiuna Tan, Weiran Xu

    Abstract: Computer Science (CS) stands as a testament to the intricacies of human intelligence, profoundly advancing the development of artificial intelligence and modern society. However, the current community of large language models (LLMs) overly focuses on benchmarks for analyzing specific foundational skills (e.g. mathematics and code generation), neglecting an all-round evaluation of the computer scie… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Work in progress

  13. arXiv:2406.08310  [pdf, other

    cs.LG

    GraphFM: A Comprehensive Benchmark for Graph Foundation Model

    Authors: Yuhao Xu, Xinqi Liu, Keyu Duan, Yi Fang, Yu-Neng Chuang, Daochen Zha, Qiaoyu Tan

    Abstract: Foundation Models (FMs) serve as a general class for the development of artificial intelligence systems, offering broad potential for generalization across a spectrum of downstream tasks. Despite extensive research into self-supervised learning as the cornerstone of FMs, several outstanding issues persist in Graph Foundation Models that rely on graph self-supervised learning, namely: 1) Homogeniza… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  14. arXiv:2406.04553  [pdf, other

    cs.IR cs.AI

    Better Late Than Never: Formulating and Benchmarking Recommendation Editing

    Authors: Chengyu Lai, Sheng Zhou, Zhimeng Jiang, Qiaoyu Tan, Yuanchen Bei, Jiawei Chen, Ningyu Zhang, Jiajun Bu

    Abstract: Recommendation systems play a pivotal role in suggesting items to users based on their preferences. However, in online platforms, these systems inevitably offer unsuitable recommendations due to limited model capacity, poor data quality, or evolving user interests. Enhancing user experience necessitates efficiently rectify such unsuitable recommendation behaviors. This paper introduces a novel and… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  15. arXiv:2406.00452  [pdf, other

    cs.LG cs.AI

    Towards a Unified Framework of Clustering-based Anomaly Detection

    Authors: Zeyu Fang, Ming Gu, Sheng Zhou, Jiawei Chen, Qiaoyu Tan, Haishuai Wang, Jiajun Bu

    Abstract: Unsupervised Anomaly Detection (UAD) plays a crucial role in identifying abnormal patterns within data without labeled examples, holding significant practical implications across various domains. Although the individual contributions of representation learning and clustering to anomaly detection are well-established, their interdependencies remain under-explored due to the absence of a unified the… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  16. arXiv:2405.20701  [pdf, other

    cs.CL cs.AI

    Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement

    Authors: Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie

    Abstract: Large language models (LLMs) demonstrate exceptional instruct-following ability to complete various downstream tasks. Although this impressive ability makes LLMs flexible task solvers, their performance in solving tasks also heavily relies on instructions. In this paper, we reveal that LLMs are over-sensitive to lexical variations in task instructions, even when the variations are imperceptible to… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  17. arXiv:2405.03401  [pdf, other

    cs.LG cs.AI

    E2GNN: Efficient Graph Neural Network Ensembles for Semi-Supervised Classification

    Authors: Xin Zhang, Daochen Zha, Qiaoyu Tan

    Abstract: This work studies ensemble learning for graph neural networks (GNNs) under the popular semi-supervised setting. Ensemble learning has shown superiority in improving the accuracy and robustness of traditional machine learning by combining the outputs of multiple weak learners. However, adopting a similar idea to integrate different GNN models is challenging because of two reasons. First, GNN is not… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  18. arXiv:2404.14135  [pdf, other

    cs.CV

    Text in the Dark: Extremely Low-Light Text Image Enhancement

    Authors: Che-Tsung Lin, Chun Chet Ng, Zhi Qin Tan, Wan Jun Nah, Xinyu Wang, Jie Long Kew, Pohao Hsu, Shang Hong Lai, Chee Seng Chan, Christopher Zach

    Abstract: Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The first two authors contributed equally to this work

  19. arXiv:2403.19631  [pdf, other

    cs.CL cs.AI cs.LG

    Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering

    Authors: Yucheng Shi, Qiaoyu Tan, Xuansheng Wu, Shaochen Zhong, Kaixiong Zhou, Ninghao Liu

    Abstract: Large Language Models (LLMs) have shown proficiency in question-answering tasks but often struggle to integrate real-time knowledge, leading to potentially outdated or inaccurate responses. This problem becomes even more challenging when dealing with multi-hop questions, since they require LLMs to update and integrate multiple knowledge pieces relevant to the questions. To tackle the problem, we p… ▽ More

    Submitted 13 August, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted by CIKM 2024

  20. arXiv:2403.16149  [pdf, other

    cs.CR cs.AI cs.LG

    A Survey on Consumer IoT Traffic: Security and Privacy

    Authors: Yan Jia, Yuxin Song, Zihou Liu, Qingyin Tan, Yang Song, Yu Zhang, Zheli Liu

    Abstract: Although CIoT has improved the convenience of daily activities, it also introduces new security and privacy concerns. Network traffic analysis, a common technique employed by the security community, has been extensively utilized to investigate security and privacy concerns, and it has also been applied to CIoT. However, compared to network traffic analysis in other fields such as mobile apps and w… ▽ More

    Submitted 15 July, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  21. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  22. arXiv:2403.01268  [pdf, other

    cs.LG cs.CR cs.DC

    Defending Against Data Reconstruction Attacks in Federated Learning: An Information Theory Approach

    Authors: Qi Tan, Qi Li, Yi Zhao, Zhuotao Liu, Xiaobing Guo, Ke Xu

    Abstract: Federated Learning (FL) trains a black-box and high-dimensional model among different clients by exchanging parameters instead of direct data sharing, which mitigates the privacy leak incurred by machine learning. However, FL still suffers from membership inference attacks (MIA) or data reconstruction attacks (DRA). In particular, an attacker can extract the information from local datasets by cons… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: Accepted by USENIX Security '24

  23. arXiv:2402.11476  [pdf, other

    cs.CV

    EndoOOD: Uncertainty-aware Out-of-distribution Detection in Capsule Endoscopy Diagnosis

    Authors: Qiaozhi Tan, Long Bai, Guankun Wang, Mobarakol Islam, Hongliang Ren

    Abstract: Wireless capsule endoscopy (WCE) is a non-invasive diagnostic procedure that enables visualization of the gastrointestinal (GI) tract. Deep learning-based methods have shown effectiveness in disease screening using WCE data, alleviating the burden on healthcare professionals. However, existing capsule endoscopy classification methods mostly rely on pre-defined categories, making it challenging to… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: To appear in IEEE ISBI 2024

  24. arXiv:2402.06852  [pdf

    cs.AI cs.CL

    ChemLLM: A Chemical Large Language Model

    Authors: Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Wanli Ouyang, Dongzhan Zhou, Shufei Zhang, Mao Su, Han-Sen Zhong, Yuqiang Li

    Abstract: Large language models (LLMs) have made impressive progress in chemistry applications. However, the community lacks an LLM specifically designed for chemistry. The main challenges are two-fold: firstly, most chemical data and scientific knowledge are stored in structured databases, which limits the model's ability to sustain coherent dialogue when used directly. Secondly, there is an absence of obj… ▽ More

    Submitted 25 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures

  25. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  26. Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model

    Authors: Junbo Shen, Qinze Yu, Shenyang Chen, Qingxiong Tan, Jingcheng Li, Yu Li

    Abstract: Signal peptide (SP) is a short peptide located in the N-terminus of proteins. It is essential to target and transfer transmembrane and secreted proteins to correct positions. Compared with traditional experimental methods to identify signal peptides, computational methods are faster and more efficient, which are more practical for analyzing thousands or even millions of protein sequences, especial… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 23 pages 5 figures. Nat Comput Sci (2023)

  27. arXiv:2312.05526  [pdf, other

    cs.LG cs.AI

    Reinforcement Neighborhood Selection for Unsupervised Graph Anomaly Detection

    Authors: Yuanchen Bei, Sheng Zhou, Qiaoyu Tan, Hao Xu, Hao Chen, Zhao Li, Jiajun Bu

    Abstract: Unsupervised graph anomaly detection is crucial for various practical applications as it aims to identify anomalies in a graph that exhibit rare patterns deviating significantly from the majority of nodes. Recent advancements have utilized Graph Neural Networks (GNNs) to learn high-quality node representations for anomaly detection by aggregating information from neighborhoods. However, the presen… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 1O pages, 7 figures, accepted by ICDM2023

  28. arXiv:2312.01431  [pdf, other

    cs.CV

    D$^2$ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for Few-shot Action Recognition

    Authors: Wenjie Pei, Qizhong Tan, Guangming Lu, Jiandong Tian

    Abstract: Adapting large pre-trained image models to few-shot action recognition has proven to be an effective and efficient strategy for learning robust feature extractors, which is essential for few-shot learning. Typical fine-tuning based adaptation paradigm is prone to overfitting in the few-shot learning scenarios and offers little modeling flexibility for learning temporal features in video data. In t… ▽ More

    Submitted 20 April, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

  29. arXiv:2312.00738  [pdf, other

    cs.CL

    SeaLLMs -- Large Language Models for Southeast Asia

    Authors: Xuan-Phi Nguyen, Wenxuan Zhang, Xin Li, Mahani Aljunied, Zhiqiang Hu, Chenhui Shen, Yew Ken Chia, Xingxuan Li, Jianyu Wang, Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen Yang, Chaoqun Liu, Hang Zhang, Lidong Bing

    Abstract: Despite the remarkable achievements of large language models (LLMs) in various tasks, there remains a linguistic bias that favors high-resource languages, such as English, often at the expense of low-resource and regional languages. To address this imbalance, we introduce SeaLLMs, an innovative series of language models that specifically focuses on Southeast Asian (SEA) languages. SeaLLMs are buil… ▽ More

    Submitted 1 July, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Technical report, ACL 2024 DEMO TRACK

  30. arXiv:2311.09821  [pdf, other

    cs.CL

    Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Knowledge in the real world is being updated constantly. However, it is costly to frequently update large language models (LLMs). Therefore, it is crucial for LLMs to understand the concept of temporal knowledge. However, prior works on temporal question answering (TQA) did not emphasize multi-answer and multi-hop types of temporal reasoning. In this paper, we propose a complex temporal question-a… ▽ More

    Submitted 12 July, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: To appear in Findings of ACL 2024

  31. arXiv:2310.16523  [pdf, other

    cs.CL cs.AI

    Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting

    Authors: Preethi Lahoti, Nicholas Blumm, Xiao Ma, Raghavendra Kotikalapudi, Sahitya Potluri, Qijun Tan, Hansa Srinivasan, Ben Packer, Ahmad Beirami, Alex Beutel, Jilin Chen

    Abstract: A crucial challenge for generative large language models (LLMs) is diversity: when a user's prompt is under-specified, models may follow implicit assumptions while generating a response, which may result in homogenization of the responses, as well as certain demographic groups being under-represented or even erased from the generated responses. In this paper, we formalize diversity of representati… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: To appear at EMNLP 2023 main conference

  32. Security Verification of Low-Trust Architectures

    Authors: Qinhan Tan, Yonathan Fisseha, Shibo Chen, Lauren Biernacki, Jean-Baptiste Jeannin, Sharad Malik, Todd Austin

    Abstract: Low-trust architectures work on, from the viewpoint of software, always-encrypted data, and significantly reduce the amount of hardware trust to a small software-free enclave component. In this paper, we perform a complete formal verification of a specific low-trust architecture, the Sequestered Encryption (SE) architecture, to show that the design is secure against direct data disclosures and dig… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 19 pages with appendix

  33. arXiv:2308.09663  [pdf, other

    cs.LG cs.AI

    GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent Space Reconstruction

    Authors: Yucheng Shi, Yushun Dong, Qiaoyu Tan, Jundong Li, Ninghao Liu

    Abstract: Self-supervised learning with masked autoencoders has recently gained popularity for its ability to produce effective image or textual representations, which can be applied to various downstream tasks without retraining. However, we observe that the current masked autoencoder models lack good generalization ability on graph data. To tackle this issue, we propose a novel graph masked autoencoder fr… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted by CIKM 2023

  34. arXiv:2308.05309  [pdf, other

    cs.LG cs.AI cs.SI

    Homophily-enhanced Structure Learning for Graph Clustering

    Authors: Ming Gu, Gaoming Yang, Sheng Zhou, Ning Ma, Jiawei Chen, Qiaoyu Tan, Meihan Liu, Jiajun Bu

    Abstract: Graph clustering is a fundamental task in graph analysis, and recent advances in utilizing graph neural networks (GNNs) have shown impressive results. Despite the success of existing GNN-based graph clustering methods, they often overlook the quality of graph structure, which is inherent in real-world graphs due to their sparse and multifarious nature, leading to subpar performance. Graph structur… ▽ More

    Submitted 30 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: 11 pages with 7 figures. Accepted by CIKM'23

  35. arXiv:2307.11981  [pdf, other

    cs.LG cs.IR cs.SI

    Collaborative Graph Neural Networks for Attributed Network Embedding

    Authors: Qiaoyu Tan, Xin Zhang, Xiao Huang, Hao Chen, Jundong Li, Xia Hu

    Abstract: Graph neural networks (GNNs) have shown prominent performance on attributed network embedding. However, existing efforts mainly focus on exploiting network structures, while the exploitation of node attributes is rather limited as they only serve as node features at the initial layer. This simple strategy impedes the potential of node attributes in augmenting node connections, leading to limited r… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  36. arXiv:2306.10280  [pdf, other

    cs.LG cs.SI

    OpenGSL: A Comprehensive Benchmark for Graph Structure Learning

    Authors: Zhiyao Zhou, Sheng Zhou, Bochao Mao, Xuanyi Zhou, Jiawei Chen, Qiaoyu Tan, Daochen Zha, Yan Feng, Chun Chen, Can Wang

    Abstract: Graph Neural Networks (GNNs) have emerged as the de facto standard for representation learning on graphs, owing to their ability to effectively integrate graph topology and node attributes. However, the inherent suboptimal nature of node connections, resulting from the complex and contingent formation process of graphs, presents significant challenges in modeling them effectively. To tackle this i… ▽ More

    Submitted 23 December, 2023; v1 submitted 17 June, 2023; originally announced June 2023.

    Comments: 25 pages, 21 figures. Camera-ready version for NeurIPS Datasets and Benchmarks Track 2023

  37. arXiv:2306.10265  [pdf, other

    cs.CV cs.RO

    NBMOD: Find It and Grasp It in Noisy Background

    Authors: Boyuan Cao, Xinyu Zhou, Congmin Guo, Baohua Zhang, Yuchen Liu, Qianqiu Tan

    Abstract: Grasping objects is a fundamental yet important capability of robots, and many tasks such as sorting and picking rely on this skill. The prerequisite for stable grasping is the ability to correctly identify suitable grasping positions. However, finding appropriate grasping points is challenging due to the diverse shapes, varying density distributions, and significant differences between the baryce… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

  38. arXiv:2306.09697  [pdf, other

    cs.CL

    Class-Adaptive Self-Training for Relation Extraction with Incompletely Annotated Training Data

    Authors: Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng

    Abstract: Relation extraction (RE) aims to extract relations from sentences and documents. Existing relation extraction models typically rely on supervised machine learning. However, recent studies showed that many RE datasets are incompletely annotated. This is known as the false negative problem in which valid relations are falsely annotated as 'no_relation'. Models trained with such data inevitably make… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Findings

  39. arXiv:2306.08952  [pdf, other

    cs.CL cs.AI

    Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Reasoning about time is of fundamental importance. Many facts are time-dependent. For example, athletes change teams from time to time, and different government officials are elected periodically. Previous time-dependent question answering (QA) datasets tend to be biased in either their coverage of time spans or question types. In this paper, we introduce a comprehensive probing dataset \tempreaso… ▽ More

    Submitted 27 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  40. arXiv:2306.05303  [pdf, other

    cs.CV

    Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields

    Authors: Qianqiu Tan, Tao Liu, Yinling Xie, Shuwan Yu, Baohua Zhang

    Abstract: The quality of three-dimensional reconstruction is a key factor affecting the effectiveness of its application in areas such as virtual reality (VR) and augmented reality (AR) technologies. Neural Radiance Fields (NeRF) can generate realistic images from any viewpoint. It simultaneously reconstructs the shape, lighting, and materials of objects, and without surface defects, which breaks down the b… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  41. arXiv:2305.15014  [pdf, other

    cs.CL

    Unlocking Temporal Question Answering for Large Language Models Using Code Execution

    Authors: Xingxuan Li, Liying Cheng, Qingyu Tan, Hwee Tou Ng, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) have made significant progress in natural language processing (NLP), and are utilized extensively in various applications. Recent works, such as chain-of-thought (CoT), have shown that intermediate reasoning steps can improve the performance of LLMs for complex reasoning tasks, such as math problems and symbolic question-answering tasks. However, we notice the challeng… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  42. arXiv:2305.03513  [pdf, other

    cs.CL cs.AI cs.LG

    ChatGraph: Interpretable Text Classification by Converting ChatGPT Knowledge to Graphs

    Authors: Yucheng Shi, Hehuan Ma, Wenliang Zhong, Qiaoyu Tan, Gengchen Mai, Xiang Li, Tianming Liu, Junzhou Huang

    Abstract: ChatGPT, as a recently launched large language model (LLM), has shown superior performance in various natural language processing (NLP) tasks. However, two major limitations hinder its potential applications: (1) the inflexibility of finetuning on downstream tasks and (2) the lack of interpretability in the decision-making process. To tackle these limitations, we propose a novel framework that lev… ▽ More

    Submitted 19 September, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: 6 pages, 2 figures

  43. arXiv:2304.10543  [pdf, other

    cs.HC

    Repurposing Text-Generating AI into a Thought-Provoking Writing Tutor

    Authors: Tae Wook Kim, Quan Tan

    Abstract: Text-generating AI technology has the potential to revolutionize writing education. However, current AI writing-support tools are limited to providing linear feedback to users. In this work, we demonstrate how text-generating AI can be repurposed into a thought-provoking writing tutor with the addition of recursive feedback mechanisms. Concretely, we developed a prototype AI writing-support tool c… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  44. arXiv:2304.00012  [pdf, other

    cs.LG cs.CY

    Multi-Task Learning for Post-transplant Cause of Death Analysis: A Case Study on Liver Transplant

    Authors: Sirui Ding, Qiaoyu Tan, Chia-yuan Chang, Na Zou, Kai Zhang, Nathan R. Hoot, Xiaoqian Jiang, Xia Hu

    Abstract: Organ transplant is the essential treatment method for some end-stage diseases, such as liver failure. Analyzing the post-transplant cause of death (CoD) after organ transplant provides a powerful tool for clinical decision making, including personalized treatment and organ allocation. However, traditional methods like Model for End-stage Liver Disease (MELD) score and conventional machine learnin… ▽ More

    Submitted 5 October, 2023; v1 submitted 29 March, 2023; originally announced April 2023.

    Comments: AMIA Annual Symposium 2023 Best Student Paper Finalist

  45. arXiv:2303.13790  [pdf, other

    cs.LG cs.AI cs.CY

    Towards Fair Patient-Trial Matching via Patient-Criterion Level Fairness Constraint

    Authors: Chia-Yuan Chang, Jiayi Yuan, Sirui Ding, Qiaoyu Tan, Kai Zhang, Xiaoqian Jiang, Xia Hu, Na Zou

    Abstract: Clinical trials are indispensable in developing new treatments, but they face obstacles in patient recruitment and retention, hindering the enrollment of necessary participants. To tackle these challenges, deep learning frameworks have been created to match patients to trials. These frameworks calculate the similarity between patients and clinical trial eligibility criteria, considering the discre… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  46. arXiv:2302.14395  [pdf, other

    cs.IR cs.AI cs.LG

    Item Cold Start Recommendation via Adversarial Variational Auto-encoder Warm-up

    Authors: Shenzheng Zhang, Qi Tan, Xinzhi Zheng, Yi Ren, Xu Zhao

    Abstract: The gap between the randomly initialized item ID embedding and the well-trained warm item ID embedding makes the cold items hard to suit the recommendation system, which is trained on the data of historical warm items. To alleviate the performance decline of new items recommendation, the distribution of the new item ID embedding should be close to that of the historical warm items. To achieve this… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  47. arXiv:2302.14329  [pdf, other

    cs.LG

    Towards Personalized Preprocessing Pipeline Search

    Authors: Diego Martinez, Daochen Zha, Qiaoyu Tan, Xia Hu

    Abstract: Feature preprocessing, which transforms raw input features into numerical representations, is a crucial step in automated machine learning (AutoML) systems. However, the existing systems often have a very small search space for feature preprocessing with the same preprocessing pipeline applied to all the numerical features. This may result in sub-optimal performance since different datasets often… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  48. arXiv:2301.08895  [pdf, other

    cs.DC

    ABS: Adaptive Bounded Staleness Converges Faster and Communicates Less

    Authors: Qiao Tan, Feng Zhu, Jingjing Zhang

    Abstract: Wall-clock convergence time and communication rounds are critical performance metrics in distributed learning with parameter-server setting. While synchronous methods converge fast but are not robust to stragglers; and asynchronous ones can reduce the wall-clock time per round but suffers from degraded convergence rate due to the staleness of gradients, it is natural to combine the two methods to… ▽ More

    Submitted 19 January, 2024; v1 submitted 21 January, 2023; originally announced January 2023.

  49. Bring Your Own View: Graph Neural Networks for Link Prediction with Personalized Subgraph Selection

    Authors: Qiaoyu Tan, Xin Zhang, Ninghao Liu, Daochen Zha, Li Li, Rui Chen, Soo-Hyun Choi, Xia Hu

    Abstract: Graph neural networks (GNNs) have received remarkable success in link prediction (GNNLP) tasks. Existing efforts first predefine the subgraph for the whole dataset and then apply GNNs to encode edge representations by leveraging the neighborhood structure induced by the fixed subgraph. The prominence of GNNLP methods significantly relies on the adhoc subgraph. Since node connectivity in real-world… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

  50. arXiv:2212.05299  [pdf

    cs.SI physics.soc-ph

    Characterizing human collective behaviours of COVID-19 in Hong Kong

    Authors: Zhanwei Du, Xiao Zhang, Lin Wang, Sidan Yao, Yuan Bai, Qi Tan, Xiaoke Xu, Sen Pei, Jingyi Xiao, Tim K. Tsang, Qiuyan Liao, Eric Lau, Peng Wu, Chao Gao, Benjamin J Cowling

    Abstract: People are likely to engage in collective behaviour online during extreme events, such as the COVID-19 crisis, to express their awareness, actions and concerns. Hong Kong has implemented stringent public health and social measures (PHSMs) to curb COVID-19 epidemic waves since the first COVID-19 case was confirmed on 22 January 2020. People are likely to engage in collective behaviour online during… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.