Skip to main content

Showing 1–50 of 356 results for author: Xie, T

  1. arXiv:2410.16165  [pdf, other

    cs.CL cs.DB

    From Tokens to Materials: Leveraging Language Models for Scientific Discovery

    Authors: Yuwei Wan, Tong Xie, Nan Wu, Wenjie Zhang, Chunyu Kit, Bram Hoex

    Abstract: Exploring the predictive capabilities of language models in material science is an ongoing interest. This study investigates the application of language model embeddings to enhance material property prediction in materials science. By evaluating various contextual embedding methods and pre-trained models, including Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-t… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2410.15756  [pdf, other

    cs.SE cs.AI

    Automated Proof Generation for Rust Code via Self-Evolution

    Authors: Tianyu Chen, Shuai Lu, Shan Lu, Yeyun Gong, Chenyuan Yang, Xuheng Li, Md Rakib Hossain Misu, Hao Yu, Nan Duan, Peng Cheng, Fan Yang, Shuvendu K Lahiri, Tao Xie, Lidong Zhou

    Abstract: Ensuring correctness is crucial for code generation. Formal verification offers a definitive assurance of correctness, but demands substantial human effort in proof construction and hence raises a pressing need for automation. The primary obstacle lies in the severe lack of data - there is much less proof than code for LLMs to train upon. In this paper, we introduce SAFE, a novel framework that ov… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  3. arXiv:2410.15336  [pdf, other

    stat.ML cs.LG

    Diffusion-PINN Sampler

    Authors: Zhekun Shi, Longlin Yu, Tianyu Xie, Cheng Zhang

    Abstract: Recent success of diffusion models has inspired a surge of interest in developing sampling techniques using reverse diffusion processes. However, accurately estimating the drift term in the reverse stochastic differential equation (SDE) solely from the unnormalized target density poses significant challenges, hindering existing methods from achieving state-of-the-art performance. In this paper, we… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 33 pages, 7 figures

  4. arXiv:2410.15332  [pdf, other

    cs.LG cs.CL cs.DC cs.PF

    EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models

    Authors: Junhao Hu, Wenrui Huang, Haoyi Wang, Weidong Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie

    Abstract: Large Language Models (LLMs) are critical for a wide range of applications, but serving them efficiently becomes increasingly challenging as inputs become more complex. Context caching improves serving performance by exploiting inter-request dependency and reusing key-value (KV) cache across requests, thus improving time-to-first-token (TTFT). However, existing prefix-based context caching require… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  5. Continuous agent-based modeling of adult-child pairs based on a pseudo-energy: Relevance for public safety and egress efficiency

    Authors: Chuan-Zhi Thomas Xie, Tie-Qiao Tang, Alexandre Nicolas

    Abstract: Pushes, falls, stampedes, and crushes are safety hazards that emerge from the collective motion of crowds, but might be avoided by better design and guidance. While pedestrian dynamics are now getting better understood on the whole, complex heterogeneous flows involvinge.g. adult-child pairs, though widely found at e.g. crowded Chinese training schools, still defy the current understanding and cap… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Journal ref: Safety Science, 2024, 177, pp.106576

  6. arXiv:2410.11765  [pdf, other

    cs.LG

    ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification

    Authors: Bishal Thapaliya, Anh Nguyen, Yao Lu, Tian Xie, Igor Grudetskyi, Fudong Lin, Antonios Valkanas, Jingyu Liu, Deepayan Chakraborty, Bilel Fehri

    Abstract: Classifying nodes in a graph is a common problem. The ideal classifier must adapt to any imbalances in the class distribution. It must also use information in the clustering structure of real-world graphs. Existing Graph Neural Networks (GNNs) have not addressed both problems together. We propose the Enhanced Cluster-aware Graph Network (ECGN), a novel method that addresses these issues by integra… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 17 pages, 3 figures

  7. arXiv:2410.08511  [pdf, other

    cs.LG

    Distributionally robust self-supervised learning for tabular data

    Authors: Shantanu Ghosh, Tiankang Xie, Mikhail Kuznetsov

    Abstract: Machine learning (ML) models trained using Empirical Risk Minimization (ERM) often exhibit systematic errors on specific subpopulations of tabular data, known as error slices. Learning robust representation in presence of error slices is challenging, especially in self-supervised settings during the feature reconstruction phase, due to high cardinality features and the complexity of constructing e… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: TRL Workshop@NeurIPS2024

  8. arXiv:2410.08343  [pdf, other

    math.NA math.SP

    Computing Generalized Eigenfunctions in Rigged Hilbert Spaces

    Authors: Matthew J. Colbrook, Andrew Horning, Tianyiwa Xie

    Abstract: We introduce a simple, general, and convergent scheme to compute generalized eigenfunctions of self-adjoint operators with continuous spectra on rigged Hilbert spaces. Our approach does not require prior knowledge about the eigenfunctions, such as asymptotics or other analytic properties. Instead, we carefully sample the range of the resolvent operator to construct smooth and accurate wave packet… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  9. arXiv:2410.07588  [pdf, other

    cs.CR cs.CY

    Careful About What App Promotion Ads Recommend! Detecting and Explaining Malware Promotion via App Promotion Graph

    Authors: Shang Ma, Chaoran Chen, Shao Yang, Shifu Hou, Toby Jia-Jun Li, Xusheng Xiao, Tao Xie, Yanfang Ye

    Abstract: In Android apps, their developers frequently place app promotion ads, namely advertisements to promote other apps. Unfortunately, the inadequate vetting of ad content allows malicious developers to exploit app promotion ads as a new distribution channel for malware. To help detect malware distributed via app promotion ads, in this paper, we propose a novel approach, named ADGPE, that synergistical… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: NDSS Symposium 2025 Accepted Papers

  10. arXiv:2410.07076  [pdf, other

    cs.CL cs.AI cs.LG

    MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

    Authors: Zonglin Yang, Wanhao Liu, Ben Gao, Tong Xie, Yuqiang Li, Wanli Ouyang, Soujanya Poria, Erik Cambria, Dongzhan Zhou

    Abstract: Scientific discovery contributes largely to human society's prosperity, and recent progress shows that LLMs could potentially catalyze this process. However, it is still unclear whether LLMs can discover novel and valid hypotheses in chemistry. In this work, we investigate this central research question: Can LLMs automatically discover novel and valid chemistry research hypotheses given only a che… ▽ More

    Submitted 12 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: Code and Benchmark are available at https://github.com/ZonglinY/MOOSE-Chem.git

  11. arXiv:2410.02884  [pdf, other

    cs.AI cs.CL

    LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

    Authors: Di Zhang, Jianbo Wu, Jingdi Lei, Tong Che, Jiatong Li, Tong Xie, Xiaoshui Huang, Shufei Zhang, Marco Pavone, Yuqiang Li, Wanli Ouyang, Dongzhan Zhou

    Abstract: This paper presents an advanced mathematical problem-solving framework, LLaMA-Berry, for enhancing the mathematical reasoning ability of Large Language Models (LLMs). The framework combines Monte Carlo Tree Search (MCTS) with iterative Self-Refine to optimize the reasoning path and utilizes a pairwise reward model to evaluate different paths globally. By leveraging the self-critic and rewriting ca… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  12. arXiv:2410.02268  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Structural-Entropy-Based Sample Selection for Efficient and Effective Learning

    Authors: Tianchi Xie, Jiangning Zhu, Guozu Ma, Minzhi Lin, Wei Chen, Weikai Yang, Shixia Liu

    Abstract: Sample selection improves the efficiency and effectiveness of machine learning models by providing informative and representative samples. Typically, samples can be modeled as a sample graph, where nodes are samples and edges represent their similarities. Most existing methods are based on local information, such as the training difficulty of samples, thereby overlooking global information, such a… ▽ More

    Submitted 5 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Submitted to ICLR 2025

  13. arXiv:2409.17834  [pdf, other

    cs.CL

    PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification

    Authors: Tianfang Xie, Tianjing Li, Wei Zhu, Wei Han, Yi Zhao

    Abstract: Due to their substantial sizes, large language models (LLMs) are typically deployed within a single-backbone multi-tenant framework. In this setup, a single instance of an LLM backbone must cater to multiple users or tasks through the application of various parameter-efficient fine-tuning (PEFT) models. Despite the availability of numerous effective PEFT techniques such as LoRA, there remains a ne… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.18203

  14. arXiv:2409.13311  [pdf, other

    cs.SE

    Skill-Adpative Imitation Learning for UI Test Reuse

    Authors: Mengzhou Wu, Hao Wang, Jun Ren, Yuan Cao, Yuetong Li, Alex Jiang, Dezhi Ran, Yitao Hu, Wei Yang, Tao Xie

    Abstract: To alleviate the substantial cost of manually crafting user interface (UI) test cases, UI test migration aims to automatically generate test cases for a target mobile application (app) by adapting those from a source app that shares similar functionalities. Traditionally, this process has been approached as a sequential UI-event-mapping problem, where events in the source app are mapped to those i… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  15. arXiv:2409.10441  [pdf, other

    cs.RO cs.CV

    CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera

    Authors: Jingpei Lu, Zekai Liang, Tristin Xie, Florian Ritcher, Shan Lin, Sainan Liu, Michael C. Yip

    Abstract: Camera-to-robot calibration is crucial for vision-based robot control and requires effort to make it accurate. Recent advancements in markerless pose estimation methods have eliminated the need for time-consuming physical setups for camera-to-robot calibration. While the existing markerless pose estimation methods have demonstrated impressive accuracy without the need for cumbersome setups, they r… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 7 pages, 5 figures, project website: https://sites.google.com/ucsd.edu/ctrnet-x

  16. arXiv:2409.05282  [pdf, other

    q-bio.PE stat.CO stat.ML

    Improving Tree Probability Estimation with Stochastic Optimization and Variance Reduction

    Authors: Tianyu Xie, Musu Yuan, Minghua Deng, Cheng Zhang

    Abstract: Probability estimation of tree topologies is one of the fundamental tasks in phylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs) provide a powerful probabilistic graphical model for tree topology probability estimation by properly leveraging the hierarchical structure of phylogenetic trees. However, the expectation maximization (EM) method currently used for learning SB… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 23 pages, 6 figures, 7 tables

  17. arXiv:2409.02421  [pdf, other

    cs.SD eess.AS

    MusicMamba: A Dual-Feature Modeling Approach for Generating Chinese Traditional Music with Modal Precision

    Authors: Jiatao Chen, Tianming Xie, Xing Tang, Jing Wang, Wenjing Dong, Bing Shi

    Abstract: In recent years, deep learning has significantly advanced the MIDI domain, solidifying music generation as a key application of artificial intelligence. However, existing research primarily focuses on Western music and encounters challenges in generating melodies for Chinese traditional music, especially in capturing modal characteristics and emotional expression. To address these issues, we propo… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  18. arXiv:2408.16233  [pdf, other

    cs.CV

    PSE-Net: Channel Pruning for Convolutional Neural Networks with Parallel-subnets Estimator

    Authors: Shiguang Wang, Tao Xie, Haijun Liu, Xingcheng Zhang, Jian Cheng

    Abstract: Channel Pruning is one of the most widespread techniques used to compress deep neural networks while maintaining their performances. Currently, a typical pruning algorithm leverages neural architecture search to directly find networks with a configurable width, the key step of which is to identify representative subnet for various pruning ratios by training a supernet. However, current methods mai… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 10pages, Neural Networks

  19. arXiv:2408.14354  [pdf, other

    cs.SE cs.AI cs.CL

    SWE-bench-java: A GitHub Issue Resolving Benchmark for Java

    Authors: Daoguang Zan, Zhirong Huang, Ailun Yu, Shaoxin Lin, Yifan Shi, Wei Liu, Dong Chen, Zongshuai Qi, Hao Yu, Lei Yu, Dezhi Ran, Muhan Zeng, Bo Shen, Pan Bian, Guangtai Liang, Bei Guan, Pengjie Huang, Tao Xie, Yongji Wang, Qianxiang Wang

    Abstract: GitHub issue resolving is a critical task in software engineering, recently gaining significant attention in both industry and academia. Within this task, SWE-bench has been released to evaluate issue resolving capabilities of large language models (LLMs), but has so far only focused on Python version. However, supporting more programming languages is also important, as there is a strong demand in… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: This work is in progress

  20. arXiv:2408.11840  [pdf

    cs.CV cs.AI

    Joint PET-MRI Reconstruction with Diffusion Stochastic Differential Model

    Authors: Taofeng Xie, Zhuoxu Cui, Congcong Liu, Chen Luo, Huayu Wang, Yuanzhi Zhang, Xuemei Wang, Yihang Zhou, Qiyu Jin, Guoqing Chen, Dong Liang, Haifeng Wang

    Abstract: PET suffers from a low signal-to-noise ratio. Meanwhile, the k-space data acquisition process in MRI is time-consuming by PET-MRI systems. We aim to accelerate MRI and improve PET image quality. This paper proposed a novel joint reconstruction model by diffusion stochastic differential equations based on learning the joint probability distribution of PET and MRI. Compare the results underscore the… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted as ISMRM 2024 Digital poster 6575. 04-09 May 2024 Singapore

    Journal ref: ISMRM 2024 Digital poster 6575

  21. arXiv:2408.10190  [pdf, other

    cond-mat.mes-hall

    Superconductivity and spin canting in spin-orbit proximitized rhombohedral trilayer graphene

    Authors: Caitlin L. Patterson, Owen I. Sheekey, Trevor B. Arp, Ludwig F. W. Holleis, Jin Ming Koh, Youngjoon Choi, Tian Xie, Siyuan Xu, Evgeny Redekop, Grigory Babikyan, Haoxin Zhou, Xiang Cheng, Takashi Taniguchi, Kenji Watanabe, Chenhao Jin, Etienne Lantagne-Hurtubise, Jason Alicea, Andrea F. Young

    Abstract: Graphene and transition metal dichalcogenide flat-band systems show similar phase diagrams, replete with magnetic and superconducting phases. An abiding question has been whether magnetic ordering competes with superconductivity or facilitates pairing. The advent of crystalline graphene superconductors enables a new generation of controlled experiments to probe the microscopic origin of supercondu… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  22. arXiv:2408.05058  [pdf, other

    stat.ML cs.LG

    Variational Bayesian Phylogenetic Inference with Semi-implicit Branch Length Distributions

    Authors: Tianyu Xie, Frederick A. Matsen IV, Marc A. Suchard, Cheng Zhang

    Abstract: Reconstructing the evolutionary history relating a collection of molecular sequences is the main subject of modern Bayesian phylogenetic inference. However, the commonly used Markov chain Monte Carlo methods can be inefficient due to the complicated space of phylogenetic trees, especially when the number of sequences is large. An alternative approach is variational Bayesian phylogenetic inference… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 26 pages, 7 figures

  23. arXiv:2407.19213  [pdf, other

    cond-mat.mtrl-sci cond-mat.supr-con

    First-principles study on spin density wave in La$_3$Ni$_2$O$_{7-δ}$

    Authors: Xiao-Sheng Ni, Yuyang Ji, Lixin He, Tao Xie, Dao-Xin Yao, Meng Wang, Kun Cao

    Abstract: The recent discovery of high-temperature superconductivity in high-pressurized La$_3$Ni$_2$O$_{7-δ}$ has garnered significant attention. Using density functional theory, we investigate the magnetic properties of La$_3$Ni$_2$O$_7$ at ambient pressure. Our calculations suggest that with $δ=0$, the double spin stripe phase is favored as the magnetic ground state. Oxygen vacancies may effectively turn… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

  24. arXiv:2407.16958  [pdf, other

    cs.LG cs.AI

    Wonderful Matrices: More Efficient and Effective Architecture for Language Modeling Tasks

    Authors: Jingze Shi, Bingheng Wu, Ting Xie, Lu He

    Abstract: Recent studies have shown that, relative position encoding performs well in selective state space model scanning algorithms, and the architecture that balances SSM and Attention enhances the efficiency and effectiveness of the algorithm, while the sparse activation of the mixture of experts reduces the training cost. We studied the effectiveness of using different position encodings in structured… ▽ More

    Submitted 12 October, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

  25. arXiv:2407.16569  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Regulated magnetic anisotropy and charge density wave in uniformly fabricated Janus CrTeSe monolayer

    Authors: Jin-Hua Nie, Cong Wang, Mao-Peng Miao, Kang-Di Niu, Tao Xie, Ting-Fei Guo, Wen-Hao Zhang, Chao-Fei Liu, Rui-Jing Sun, Jian-Wang Zhou, Jun-Hao Lin, Wei Ji, Ying-Shuang Fu

    Abstract: Two-dimensional materials with Janus structure host novel physical properties due to their inversional symmetry breaking. However, it remains elusive to synthesize Janus monolayer crystals with tailored long-range magnetic orders. Here, we have developed a general method to fabricate uniform Janus CrTeSe monolayers by selective selenization of preformed CrTe2 monolayers with molecular beam epitaxy… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 19 pages, 4 figures

  26. arXiv:2407.14266  [pdf, other

    cs.IR cs.LG

    L^2CL: Embarrassingly Simple Layer-to-Layer Contrastive Learning for Graph Collaborative Filtering

    Authors: Xinzhou Jin, Jintang Li, Liang Chen, Chenyun Yu, Yuanzhen Xie, Tao Xie, Chengxiang Zhuo, Zang Li, Zibin Zheng

    Abstract: Graph neural networks (GNNs) have recently emerged as an effective approach to model neighborhood signals in collaborative filtering. Towards this research line, graph contrastive learning (GCL) demonstrates robust capabilities to address the supervision label shortage issue through generating massive self-supervised signals. Despite its effectiveness, GCL for recommendation suffers seriously from… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  27. arXiv:2407.13763  [pdf, other

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci

    Isospin Pomeranchuk effect and finite temperature resistivity minimum in rhombohedral graphene

    Authors: Ludwig Holleis, Tian Xie, Siyuan Xu, Haoxin Zhou, Caitlin L. Patterson, Archisman Panigrahi, Takashi Taniguchi, Kenji Watanabe, Leonid S. Levitov, Chenhao Jin, Erez Berg, Andrea F. Young

    Abstract: Magnetism typically arises from the effect of exchange interactions on highly localized fermionic wave functions in f- and d-atomic orbitals. In rhombohedral graphene multilayers, in contrast, magnetism -- manifesting as spontaneous polarization into one or more spin and valley flavors[1-7]-originates from fully itinerant electrons near a Van Hove singularity. Here, we show that despite the absenc… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  28. arXiv:2407.13399  [pdf, other

    cs.AI cs.CL cs.LG

    Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization

    Authors: Audrey Huang, Wenhao Zhan, Tengyang Xie, Jason D. Lee, Wen Sun, Akshay Krishnamurthy, Dylan J. Foster

    Abstract: Language model alignment methods, such as reinforcement learning from human feedback (RLHF), have led to impressive advances in language model capabilities, but existing techniques are limited by a widely observed phenomenon known as overoptimization, where the quality of the language model plateaus or degrades over the course of the alignment process. Overoptimization is often attributed to overf… ▽ More

    Submitted 19 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  29. arXiv:2407.12767  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Helical Spin Dynamics in Commensurate Magnets: a Study on Brochantite, Cu$_4$SO$_4$(OH)$_6$

    Authors: S. E. Nikitin, Tao Xie, A. Gazizulina, B. Ouladdiaf, J. A. Rodríguez Velamazán, I. ~F. ~Díaz-Ortega, H. Nojiri, L. M. Anovitz, A. M. dos Santos, O. Prokhnenko, A. Podlesnyak

    Abstract: We report the direct observation of a commensurate-ordered antiferromagnetic (AFM) state but incommensurate helical spin dynamics in the natural mineral brochantite Cu$_4$SO$_4$(OH)$_6$ through neutron diffraction and neutron spectroscopy measurements. Inelastic neutron scattering measurements reveal magnon-like excitations with considerable dispersion along the c-axis and almost flat branches in… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 9 pages, 5 figures

  30. arXiv:2407.11585  [pdf, other

    cs.CV cs.AI

    QVD: Post-training Quantization for Video Diffusion Models

    Authors: Shilong Tian, Hong Chen, Chengtao Lv, Yu Liu, Jinyang Guo, Xianglong Liu, Shengxi Li, Hao Yang, Tao Xie

    Abstract: Recently, video diffusion models (VDMs) have garnered significant attention due to their notable advancements in generating coherent and realistic video content. However, processing multiple frame features concurrently, coupled with the considerable model size, results in high latency and extensive memory consumption, hindering their broader application. Post-training quantization (PTQ) is an effe… ▽ More

    Submitted 17 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: accepted by ACMMM2024

  31. arXiv:2407.11369  [pdf

    cond-mat.supr-con

    High-Resolution Spectroscopy of the Intermediate Impurity States near a Quantum Phase Transition

    Authors: Yao Zhang, Tao Xie, Zhen-Yu Liu, Rui Wang, Wenhao Zhang, Chaofei Liu, Ying-Shuang Fu

    Abstract: The intermediate behavior near a quantum phase transition is crucial for understanding the quantum criticality of various competing phases and their separate origins, yet remains unexplored for the multiple Yu-Shiba-Rusinov (YSR) states. Here, we investigated the detailed spectroscopic change of the exchange coupling-dependent YSR states near a quantum phase transition. The initially developed one… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures

  32. arXiv:2407.10956  [pdf, other

    cs.AI cs.CL

    Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

    Authors: Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu, Hongcheng Gao, Xinzhuang Xiong, Hanchong Zhang, Yuchen Mao, Wenjing Hu, Tianbao Xie, Hongshen Xu, Danyang Zhang, Sida Wang, Ruoxi Sun, Pengcheng Yin, Caiming Xiong, Ansong Ni, Qian Liu, Victor Zhong, Lu Chen, Kai Yu, Tao Yu

    Abstract: Data science and engineering workflows often span multiple stages, from warehousing to orchestration, using tools like BigQuery, dbt, and Airbyte. As vision language models (VLMs) advance in multimodal understanding and code generation, VLM-based agents could potentially automate these workflows by generating SQL queries, Python code, and GUI operations. This automation can improve the productivit… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 34 pages, 14 figures, 10 tables

  33. arXiv:2407.08879  [pdf, other

    quant-ph

    Scalable microwave-to-optical transducers at single photon level with spins

    Authors: Tian Xie, Rikuto Fukumori, Jiahui Li, Andrei Faraon

    Abstract: Microwave-to-optical transduction of single photons will play an essential role in interconnecting future superconducting quantum devices, with applications in distributed quantum computing and secure communications. Various transducers that couple microwave and optical modes via an optical drive have been developed, utilizing nonlinear phenomena such as the Pockels effect and a combination of ele… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: TX and RF contributed equally to this work

  34. arXiv:2407.08176  [pdf, other

    cs.SE cs.AI cs.LG

    Foundation Model Engineering: Engineering Foundation Models Just as Engineering Software

    Authors: Dezhi Ran, Mengzhou Wu, Wei Yang, Tao Xie

    Abstract: By treating data and models as the source code, Foundation Models (FMs) become a new type of software. Mirroring the concept of software crisis, the increasing complexity of FMs making FM crisis a tangible concern in the coming decade, appealing for new theories and methodologies from the field of software engineering. In this paper, we outline our vision of introducing Foundation Model (FM) engin… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by 2030 Software Engineering Workshop, co-located with FSE24; Invited to ACM TOSEM 2030 Roadmap for Software Engineering

  35. arXiv:2406.17565  [pdf, other

    cs.DC

    MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool

    Authors: Cunchen Hu, Heyang Huang, Junhao Hu, Jiang Xu, Xusheng Chen, Tao Xie, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan

    Abstract: Large language model (LLM) serving has transformed from stateless to stateful systems, utilizing techniques like context caching and disaggregated inference. These optimizations extend the lifespan and domain of the KV cache, necessitating a new architectural approach. We present MemServe, a unified system that integrates both inter-request and intra-request optimizations. MemServe introduces MemP… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  36. arXiv:2406.17305  [pdf, other

    cs.CL

    Retrieval Augmented Instruction Tuning for Open NER with Large Language Models

    Authors: Tingyu Xie, Jian Zhang, Yan Zhang, Yuanyuan Liang, Qi Li, Hongwei Wang

    Abstract: The strong capability of large language models (LLMs) has been applied to information extraction (IE) through either retrieval augmented prompting or instruction tuning (IT). However, the best way to incorporate information with LLMs for IE remains an open question. In this paper, we explore Retrieval Augmented Instruction Tuning (RA-IT) for IE, focusing on the task of open named entity recognitio… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  37. arXiv:2406.16756  [pdf, other

    cs.LG cs.AI cs.CY

    Addressing Polarization and Unfairness in Performative Prediction

    Authors: Kun Jin, Tian Xie, Yang Liu, Xueru Zhang

    Abstract: When machine learning (ML) models are used in applications that involve humans (e.g., online recommendation, school admission, hiring, lending), the model itself may trigger changes in the distribution of targeted data it aims to predict. Performative prediction (PP) is a framework that explicitly considers such model-dependent distribution shifts when learning ML models. While significant efforts… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  38. arXiv:2406.16495  [pdf, other

    cs.CL cs.AI

    OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser

    Authors: Jingze Shi, Ting Xie, Bingheng Wu, Chunjun Zheng, Kai Wang

    Abstract: Recent research has shown that combining Mamba with Transformer architecture, which has selective state space and quadratic self-attention mechanism, outperforms using Mamba or Transformer architecture alone in language modeling tasks. The quadratic self-attention mechanism effectively alleviates the shortcomings of selective state space in handling long-term dependencies of any element in the seq… ▽ More

    Submitted 19 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  39. arXiv:2406.16017  [pdf, other

    quant-ph physics.atom-ph

    Competing excitation quenching and charge exchange in ultracold Li-Ba$^+$ collisions

    Authors: Xiaodong Xing, Pascal Weckesser, Fabian Thielemann, Tibor Jónás, Romain Vexiau, Nadia Bouloufa-Maafa, Eliane Luc-Koenig, Kirk W. Madison, Andrea Orbán, Ting Xie, Tobias Schaetz, Olivier Dulieu

    Abstract: Hybrid atom-ion systems are a rich and powerful platform for studying chemical reactions, as they feature both excellent control over the electronic state preparation and readout as well as a versatile tunability over the scattering energy, ranging from the few-partial wave regime to the quantum regime. In this work, we make use of these excellent control knobs, and present a joint experimental an… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 17 pages, 15 figures, 4 tables

  40. arXiv:2406.14598  [pdf, other

    cs.AI

    SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

    Authors: Tinghao Xie, Xiangyu Qi, Yi Zeng, Yangsibo Huang, Udari Madhushani Sehwag, Kaixuan Huang, Luxi He, Boyi Wei, Dacheng Li, Ying Sheng, Ruoxi Jia, Bo Li, Kai Li, Danqi Chen, Peter Henderson, Prateek Mittal

    Abstract: Evaluating aligned large language models' (LLMs) ability to recognize and reject unsafe user requests is crucial for safe, policy-compliant deployments. Existing evaluation efforts, however, face three limitations that we address with SORRY-Bench, our proposed benchmark. First, existing methods often use coarse-grained taxonomies of unsafe topics, and are over-representing some fine-grained topics… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  41. arXiv:2406.14526  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    Fantastic Copyrighted Beasts and How (Not) to Generate Them

    Authors: Luxi He, Yangsibo Huang, Weijia Shi, Tinghao Xie, Haotian Liu, Yue Wang, Luke Zettlemoyer, Chiyuan Zhang, Danqi Chen, Peter Henderson

    Abstract: Recent studies show that image and video generation models can be prompted to reproduce copyrighted content from their training data, raising serious legal concerns around copyright infringement. Copyrighted characters, in particular, pose a difficult challenge for image generation services, with at least one lawsuit already awarding damages based on the generation of these characters. Yet, little… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  42. arXiv:2406.12845  [pdf, other

    cs.LG cs.CL

    Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts

    Authors: Haoxiang Wang, Wei Xiong, Tengyang Xie, Han Zhao, Tong Zhang

    Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the primary method for aligning large language models (LLMs) with human preferences. The RLHF process typically starts by training a reward model (RM) using human preference data. Conventional RMs are trained on pairwise responses to the same user request, with relative ratings indicating which response humans prefer. The trained RM… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Technical report v1. Code and model are released at https://github.com/RLHFlow/RLHF-Reward-Modeling/

  43. arXiv:2406.04274  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models

    Authors: Xiang Ji, Sanjeev Kulkarni, Mengdi Wang, Tengyang Xie

    Abstract: This work studies the challenge of aligning large language models (LLMs) with offline preference data. We focus on alignment by Reinforcement Learning from Human Feedback (RLHF) in particular. While popular preference optimization methods exhibit good empirical performance in practice, they are not theoretically guaranteed to converge to the optimal policy and can provably fail when the data cover… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  44. arXiv:2406.03520  [pdf, other

    cs.CV cs.AI cs.LG

    VideoPhy: Evaluating Physical Commonsense for Video Generation

    Authors: Hritik Bansal, Zongyu Lin, Tianyi Xie, Zeshun Zong, Michal Yarom, Yonatan Bitton, Chenfanfu Jiang, Yizhou Sun, Kai-Wei Chang, Aditya Grover

    Abstract: Recent advances in internet-scale video data pretraining have led to the development of text-to-video generative models that can create high-quality videos across a broad range of visual concepts, synthesize realistic motions and render complex objects. Hence, these generative models have the potential to become general-purpose simulators of the physical world. However, it is unclear how far we ar… ▽ More

    Submitted 3 October, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 43 pages, 29 figures, 12 tables. Added CogVideo and Dream Machine in v2

  45. arXiv:2406.01304  [pdf, other

    cs.CL cs.AI cs.SE

    CodeR: Issue Resolving with Multi-Agent and Task Graphs

    Authors: Dong Chen, Shaoxin Lin, Muhan Zeng, Daoguang Zan, Jian-Gang Wang, Anton Cheshkov, Jun Sun, Hao Yu, Guoliang Dong, Artem Aliev, Jie Wang, Xiao Cheng, Guangtai Liang, Yuchi Ma, Pan Bian, Tao Xie, Qianxiang Wang

    Abstract: GitHub issue resolving recently has attracted significant attention from academia and industry. SWE-bench is proposed to measure the performance in resolving issues. In this paper, we propose CodeR, which adopts a multi-agent framework and pre-defined task graphs to Repair & Resolve reported bugs and add new features within code Repository. On SWE-bench lite, CodeR is able to solve 28.33% of issue… ▽ More

    Submitted 10 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: https://github.com/NL2Code/CodeR

  46. arXiv:2405.21046  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

    Authors: Tengyang Xie, Dylan J. Foster, Akshay Krishnamurthy, Corby Rosset, Ahmed Awadallah, Alexander Rakhlin

    Abstract: Reinforcement learning from human feedback (RLHF) has emerged as a central tool for language model alignment. We consider online exploration in RLHF, which exploits interactive access to human or AI feedback by deliberately encouraging the model to produce diverse, maximally informative responses. By allowing RLHF to confidently stray from the pre-trained model, online exploration offers the possi… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  47. arXiv:2405.19524  [pdf, other

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  48. arXiv:2405.18997  [pdf, other

    stat.ML cs.LG

    Kernel Semi-Implicit Variational Inference

    Authors: Ziheng Cheng, Longlin Yu, Tianyu Xie, Shiyue Zhang, Cheng Zhang

    Abstract: Semi-implicit variational inference (SIVI) extends traditional variational families with semi-implicit distributions defined in a hierarchical manner. Due to the intractable densities of semi-implicit distributions, classical SIVI often resorts to surrogates of evidence lower bound (ELBO) that would introduce biases for training. A recent advancement in SIVI, named SIVI-SM, utilizes an alternative… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICML 2024 camera ready

  49. arXiv:2405.18515  [pdf, other

    cs.LG

    Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

    Authors: Yunuo Chen, Tianyi Xie, Zeshun Zong, Xuan Li, Feng Gao, Yin Yang, Ying Nian Wu, Chenfanfu Jiang

    Abstract: Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to maintain balance when placed in physics-based simulations or 3D printed. This balance is crucial for satisfying user design intentions in interactive gaming, embod… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  50. arXiv:2405.16577  [pdf, other

    stat.ML cs.LG

    Reflected Flow Matching

    Authors: Tianyu Xie, Yu Zhu, Longlin Yu, Tong Yang, Ziheng Cheng, Shiyue Zhang, Xiangyu Zhang, Cheng Zhang

    Abstract: Continuous normalizing flows (CNFs) learn an ordinary differential equation to transform prior samples into data. Flow matching (FM) has recently emerged as a simulation-free approach for training CNFs by regressing a velocity model towards the conditional velocity field. However, on constrained domains, the learned velocity model may lead to undesirable flows that result in highly unnatural sampl… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: ICML 2024 camera-ready