Skip to main content

Showing 1–50 of 58 results for author: Xie, A

  1. arXiv:2409.14914   

    econ.GN

    Impact of the Three-Child Policy and Delayed Retirement on the Transfer of Surplus Rural Labor under Xi Jinping's New Population Vision: A Re-examination of China's Lewis Turning Point

    Authors: Jun Dai, Guanqing Shi, Xiaoke Xie, Aitong Xie

    Abstract: Chinese-style modernization involves the modernization of a large population, requiring top-level design in terms of scale and structure. The population perspective in Xi Jinping's Thought on Socialism with Chinese Characteristics for a New Era serves as the fundamental guide for population policies. The three-child policy and delayed retirement will affect the supply of labor in China and challen… ▽ More

    Submitted 17 October, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: Due to unresolved issues among co-authors, we have decided to withdraw the manuscript

  2. arXiv:2408.17355  [pdf, other

    cs.RO cs.AI cs.LG

    Bidirectional Decoding: Improving Action Chunking via Closed-Loop Resampling

    Authors: Yuejiang Liu, Jubayer Ibn Hamid, Annie Xie, Yoonho Lee, Maximilian Du, Chelsea Finn

    Abstract: Predicting and executing a sequence of actions without intermediate replanning, known as action chunking, is increasingly used in robot learning from human demonstrations. Yet, its reported effects on the learned policy are inconsistent: some studies find it crucial for achieving strong results, while others observe decreased performance. In this paper, we first dissect how action chunking impacts… ▽ More

    Submitted 21 October, 2024; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: Project website: https://bid-robot.github.io/

  3. arXiv:2408.16944  [pdf, other

    cs.RO cs.LG

    FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning

    Authors: Li-Heng Lin, Yuchen Cui, Amber Xie, Tianyu Hua, Dorsa Sadigh

    Abstract: Few-shot imitation learning relies on only a small amount of task-specific demonstrations to efficiently adapt a policy for a given downstream tasks. Retrieval-based methods come with a promise of retrieving relevant past experiences to augment this target data when learning policies. However, existing data retrieval methods fall under two extremes: they either rely on the existence of exact behav… ▽ More

    Submitted 11 October, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

  4. arXiv:2408.14363  [pdf

    physics.app-ph

    Selective-injection GaN Heterojunction Bipolar Transistors with 275 kA/cm$^2$ Current Density

    Authors: Zhanbo Xia, Chandan Joishi, Shahadat H. Sohel, Andy Xie, Edward Beam, Yu Cao, Siddharth Rajan

    Abstract: We design and demonstrate selective injection GaN heterojunction bipolar transistors that utilize a patterned base for selective injection of electrons from the emitter. The design maneuvers minority carrier injection through a thin p-GaN base region, while the majority carrier holes for base current are injected from thick p-GaN regions adjacent to the thin p-GaN base. The design is realized usin… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  5. arXiv:2408.08084  [pdf, other

    cs.LG cs.AI

    An Efficient Replay for Class-Incremental Learning with Pre-trained Models

    Authors: Weimin Yin, Bin Chen adn Chunzhao Xie, Zhenhao Tan

    Abstract: In general class-incremental learning, researchers typically use sample sets as a tool to avoid catastrophic forgetting during continuous learning. At the same time, researchers have also noted the differences between class-incremental learning and Oracle training and have attempted to make corrections. In recent years, researchers have begun to develop class-incremental learning algorithms utiliz… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  6. arXiv:2407.18014  [pdf, other

    quant-ph

    Data-driven approach to mixed-state multipartite entanglement characterisation

    Authors: Eric Brunner, Aaron Xie, Gabriel Dufour, Andreas Buchleitner

    Abstract: We develop a statistical framework, based on a manifold learning embedding, to extract relevant features of multipartite entanglement structures of mixed quantum states from the measurable correlation data of a quantum computer. We show that the statistics of the measured correlators contains sufficient information to characterise the entanglement, and to quantify the mixedness of the state of the… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  7. arXiv:2407.10341  [pdf, other

    cs.RO cs.AI cs.LG

    Affordance-Guided Reinforcement Learning via Visual Prompting

    Authors: Olivia Y. Lee, Annie Xie, Kuan Fang, Karl Pertsch, Chelsea Finn

    Abstract: Robots equipped with reinforcement learning (RL) have the potential to learn a wide range of skills solely from a reward signal. However, obtaining a robust and dense reward signal for general manipulation tasks remains a challenge. Existing learning-based approaches require significant data, such as human demonstrations of success and failure, to learn task-specific reward functions. Recently, th… ▽ More

    Submitted 1 October, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures. Robotics: Science and Systems (RSS) 2024, Task Specification for General-Purpose Intelligent Robots & Lifelong Robot Learning Workshops

  8. arXiv:2406.18559  [pdf, other

    cs.HC cs.AI cs.CV cs.LG

    Revision Matters: Generative Design Guided by Revision Edits

    Authors: Tao Li, Chin-Yi Cheng, Amber Xie, Gang Li, Yang Li

    Abstract: Layout design, such as user interface or graphical layout in general, is fundamentally an iterative revision process. Through revising a design repeatedly, the designer converges on an ideal layout. In this paper, we investigate how revision edits from human designer can benefit a multimodal generative model. To do so, we curate an expert dataset that traces how human designers iteratively edit an… ▽ More

    Submitted 27 May, 2024; originally announced June 2024.

  9. arXiv:2406.16838  [pdf, other

    cs.CL cs.LG

    From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

    Authors: Sean Welleck, Amanda Bertsch, Matthew Finlayson, Hailey Schoelkopf, Alex Xie, Graham Neubig, Ilia Kulikov, Zaid Harchaoui

    Abstract: One of the most striking findings in modern research on large language models (LLMs) is that scaling up compute during training leads to better results. However, less attention has been given to the benefits of scaling compute during inference. This survey focuses on these inference-time approaches. We explore three areas under a unified mathematical formalism: token-level generation algorithms, m… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  10. arXiv:2406.15313  [pdf, other

    cs.IR cs.CL

    STARD: A Chinese Statute Retrieval Dataset with Real Queries Issued by Non-professionals

    Authors: Weihang Su, Yiran Hu, Anzhe Xie, Qingyao Ai, Zibing Que, Ning Zheng, Yun Liu, Weixing Shen, Yiqun Liu

    Abstract: Statute retrieval aims to find relevant statutory articles for specific queries. This process is the basis of a wide range of legal applications such as legal advice, automated judicial decisions, legal document drafting, etc. Existing statute retrieval benchmarks focus on formal and professional queries from sources like bar exams and legal case documents, thereby neglecting non-professional quer… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  11. arXiv:2406.11698  [pdf, other

    cs.CL

    Meta Reasoning for Large Language Models

    Authors: Peizhong Gao, Ao Xie, Shaoguang Mao, Wenshan Wu, Yan Xia, Haipeng Mi, Furu Wei

    Abstract: We introduce Meta-Reasoning Prompting (MRP), a novel and efficient system prompting method for large language models (LLMs) inspired by human meta-reasoning. Traditional in-context learning-based reasoning techniques, such as Tree-of-Thoughts, show promise but lack consistent state-of-the-art performance across diverse tasks due to their specialized nature. MRP addresses this limitation by guiding… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  12. arXiv:2406.07768  [pdf, other

    physics.app-ph

    Selective Undercut of Undoped Optical Membranes for Spin-Active Color Centers in 4H-SiC

    Authors: Jonathan R. Dietz, Aaron M. Day, Amberly Xie, Evelyn L. Hu

    Abstract: Silicon carbide (SiC) is a semiconductor used in quantum information processing, microelectromechanical systems, photonics, power electronics, and harsh environment sensors. However, its high temperature stability, high breakdown voltage, wide bandgap, and high mechanical strength are accompanied by a chemical inertness which makes complex micromachining difficult. Photoelectrochemical etching is… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  13. arXiv:2405.13026  [pdf, other

    cs.CL cs.AI

    Leveraging Human Revisions for Improving Text-to-Layout Models

    Authors: Amber Xie, Chin-Yi Cheng, Forrest Huang, Yang Li

    Abstract: Learning from human feedback has shown success in aligning large, pretrained models with human values. Prior works have mostly focused on learning from high-level labels, such as preferences between pairs of model outputs. On the other hand, many domains could benefit from more involved, detailed feedback, such as revisions, explanations, and reasoning of human users. Our work proposes using nuanc… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  14. arXiv:2404.00566  [pdf, other

    cs.SE cs.CL

    CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks

    Authors: Yiqing Xie, Alex Xie, Divyanshu Sheth, Pengfei Liu, Daniel Fried, Carolyn Rose

    Abstract: To adequately test modern code generation systems, evaluation benchmarks must execute and test the code generated by the system. However, these execution and testing requirements have largely limited benchmarks to settings where code is easily executable or has human-written tests. To facilitate evaluation of code generation systems across diverse scenarios, we present CodeBenchGen, a framework to… ▽ More

    Submitted 2 October, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  15. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  16. arXiv:2403.05110  [pdf, other

    cs.RO cs.AI cs.LG

    Efficient Data Collection for Robotic Manipulation via Compositional Generalization

    Authors: Jensen Gao, Annie Xie, Ted Xiao, Chelsea Finn, Dorsa Sadigh

    Abstract: Data collection has become an increasingly important problem in robotic manipulation, yet there still lacks much understanding of how to effectively collect data to facilitate broad generalization. Recent works on large-scale robotic data collection typically vary many environmental factors of variation (e.g., object types, table textures) during data collection, to cover a diverse range of scenar… ▽ More

    Submitted 21 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: RSS 2024

  17. arXiv:2403.02882  [pdf, other

    eess.SY cs.LG cs.RO

    Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization

    Authors: Yuan Lin, Antai Xie, Xiao Liu

    Abstract: Most of the current studies on autonomous vehicle decision-making and control tasks based on reinforcement learning are conducted in simulated environments. The training and testing of these studies are carried out under rule-based microscopic traffic flow, with little consideration of migrating them to real or near-real environments to test their performance. It may lead to a degradation in perfo… ▽ More

    Submitted 19 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  18. arXiv:2403.01322  [pdf, ps, other

    math.OC

    A Communication-Efficient Stochastic Gradient Descent Algorithm for Distributed Nonconvex Optimization

    Authors: Antai Xie, Xinlei Yi, Xiaofan Wang, Ming Cao, Xiaoqiang Ren

    Abstract: This paper studies distributed nonconvex optimization problems with stochastic gradients for a multi-agent system, in which each agent aims to minimize the sum of all agents' cost functions by using local compressed information exchange. We propose a distributed stochastic gradient descent (SGD) algorithm, suitable for a general class of compressors. We show that the proposed algorithm achieves th… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  19. arXiv:2402.07872  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

    Authors: Soroush Nasiriany, Fei Xia, Wenhao Yu, Ted Xiao, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee, Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter

    Abstract: Vision language models (VLMs) have shown impressive capabilities across a variety of tasks, from logical reasoning to visual understanding. This opens the door to richer interaction with the world, for example robotic control. However, VLMs produce only textual outputs, while robotic control and other spatial tasks require outputting continuous coordinates, actions, or trajectories. How can we ena… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  20. arXiv:2402.03761  [pdf

    eess.IV cs.LG q-bio.TO

    Deep Learning-Based Correction and Unmixing of Hyperspectral Images for Brain Tumor Surgery

    Authors: David Black, Jaidev Gill, Andrew Xie, Benoit Liquet, Antonio Di leva, Walter Stummer, Eric Suero Molina

    Abstract: Hyperspectral Imaging (HSI) for fluorescence-guided brain tumor resection enables visualization of differences between tissues that are not distinguishable to humans. This augmentation can maximize brain tumor resection, improving patient outcomes. However, much of the processing in HSI uses simplified linear methods that are unable to capture the non-linear, wavelength-dependent phenomena that mu… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 20 pages, 8 figures, 3 tables - Under Review

  21. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  22. arXiv:2310.03294  [pdf, other

    cs.LG cs.AI cs.DC

    DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training

    Authors: Dacheng Li, Rulin Shao, Anze Xie, Eric P. Xing, Xuezhe Ma, Ion Stoica, Joseph E. Gonzalez, Hao Zhang

    Abstract: FlashAttention (Dao, 2023) effectively reduces the quadratic peak memory usage to linear in training transformer-based large language models (LLMs) on a single GPU. In this paper, we introduce DISTFLASHATTN, a distributed memory-efficient attention mechanism optimized for long-context LLMs training. We propose three key techniques: token-level workload balancing, overlapping key-value communicatio… ▽ More

    Submitted 31 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  23. arXiv:2310.01387  [pdf, other

    cs.CL

    It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk

    Authors: Amanda Bertsch, Alex Xie, Graham Neubig, Matthew R. Gormley

    Abstract: Minimum Bayes Risk (MBR) decoding is a method for choosing the outputs of a machine learning system based not on the output with the highest probability, but the output with the lowest risk (expected error) among multiple candidates. It is a simple but powerful method: for an additional cost at inference time, MBR provides reliable several-point improvements across metrics for a wide variety of ta… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Under submission

  24. arXiv:2309.11206  [pdf, other

    cs.CL cs.AI

    Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering

    Authors: Yike Wu, Nan Hu, Sheng Bi, Guilin Qi, Jie Ren, Anhuan Xie, Wei Song

    Abstract: Despite their competitive performance on knowledge-intensive tasks, large language models (LLMs) still have limitations in memorizing all world knowledge especially long tail knowledge. In this paper, we study the KG-augmented language model approach for solving the knowledge graph question answering (KGQA) task that requires rich world knowledge. Existing work has shown that retrieving KG knowled… ▽ More

    Submitted 21 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  25. arXiv:2308.16893  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Language-Conditioned Path Planning

    Authors: Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James

    Abstract: Contact is at the core of robotic manipulation. At times, it is desired (e.g. manipulation and grasping), and at times, it is harmful (e.g. when avoiding obstacles). However, traditional path planning algorithms focus solely on collision-free paths, limiting their applicability in contact-rich tasks. To address this limitation, we propose the domain of Language-Conditioned Path Planning, where con… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Conference on Robot Learning, 2023

  26. arXiv:2308.12270  [pdf, other

    cs.LG cs.AI

    Language Reward Modulation for Pretraining Reinforcement Learning

    Authors: Ademi Adeniji, Amber Xie, Carmelo Sferrazza, Younggyo Seo, Stephen James, Pieter Abbeel

    Abstract: Using learned reward functions (LRFs) as a means to solve sparse-reward reinforcement learning (RL) tasks has yielded some steady progress in task-complexity through the years. In this work, we question whether today's LRFs are best-suited as a direct replacement for task rewards. Instead, we propose leveraging the capabilities of LRFs as a pretraining signal for RL. Concretely, we propose… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Code available at https://github.com/ademiadeniji/lamp

  27. arXiv:2307.16656  [pdf, ps, other

    math.OC

    Differentially Private and Communication-Efficient Distributed Nonconvex Optimization Algorithms

    Authors: Antai Xie, Xinlei Yi, Xiaofan Wang, Ming Cao, Xiaoqiang Ren

    Abstract: This paper studies the privacy-preserving distributed optimization problem under limited communication, where each agent aims to keep its cost function private while minimizing the sum of all agents' cost functions. To this end, we propose two differentially private distributed algorithms under compressed communication. We show that the proposed algorithms achieve sublinear convergence for smooth… ▽ More

    Submitted 1 May, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 51 pages

  28. arXiv:2307.03659  [pdf, other

    cs.RO cs.AI

    Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation

    Authors: Annie Xie, Lisa Lee, Ted Xiao, Chelsea Finn

    Abstract: What makes generalization hard for imitation learning in visual robotic manipulation? This question is difficult to approach at face value, but the environment from the perspective of a robot can often be decomposed into enumerable factors of variation, such as the lighting conditions or the placement of the camera. Empirically, generalization to some of these factors have presented a greater obst… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Project webpage at https://sites.google.com/view/generalization-gap

  29. arXiv:2306.14892  [pdf, other

    cs.LG cs.AI

    Supervised Pretraining Can Learn In-Context Reinforcement Learning

    Authors: Jonathan N. Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill

    Abstract: Large transformer models trained on diverse datasets have shown a remarkable ability to learn in-context, achieving high few-shot performance on tasks they were not explicitly trained to solve. In this paper, we study the in-context learning capabilities of transformers in decision-making problems, i.e., reinforcement learning (RL) for bandits and Markov decision processes. To do so, we introduce… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  30. arXiv:2304.01779  [pdf, ps, other

    math.OC

    Compressed Differentially Private Distributed Optimization with Linear Convergence

    Authors: Antai Xie, Xinlei Yi, Xiaofan Wang, Ming Cao, Xiaoqiang Ren

    Abstract: This paper addresses the problem of differentially private distributed optimization under limited communication, where each agent aims to keep their cost function private while minimizing the sum of all agents' cost functions. In response, we propose a novel Compressed differentially Private distributed Gradient Tracking algorithm (CPGT). We demonstrate that CPGT achieves linear convergence for sm… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: To appear in IFAC WC 2023

  31. arXiv:2303.06902  [pdf, other

    q-bio.BM cs.LG

    Molecular Property Prediction by Semantic-invariant Contrastive Learning

    Authors: Ziqiao Zhang, Ailin Xie, Jihong Guan, Shuigeng Zhou

    Abstract: Contrastive learning have been widely used as pretext tasks for self-supervised pre-trained molecular representation learning models in AI-aided drug design and discovery. However, exiting methods that generate molecular views by noise-adding operations for contrastive learning may face the semantic inconsistency problem, which leads to false positive pairs and consequently poor prediction perform… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

  32. arXiv:2302.07541  [pdf, other

    q-bio.BM cs.LG

    Activity Cliff Prediction: Dataset and Benchmark

    Authors: Ziqiao Zhang, Bangyi Zhao, Ailin Xie, Yatao Bian, Shuigeng Zhou

    Abstract: Activity cliffs (ACs), which are generally defined as pairs of structurally similar molecules that are active against the same bio-target but significantly different in the binding potency, are of great importance to drug discovery. Up to date, the AC prediction problem, i.e., to predict whether a pair of molecules exhibit the AC relationship, has not yet been fully explored. In this paper, we fir… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  33. arXiv:2211.11319  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

    Authors: Ajay Jain, Amber Xie, Pieter Abbeel

    Abstract: Diffusion models have shown impressive results in text-to-image synthesis. Using massive datasets of captioned images, diffusion models learn to generate raster images of highly diverse objects and scenes. However, designers frequently use vector representations of images like Scalable Vector Graphics (SVGs) for digital icons or art. Vector graphics can be scaled to any size, and are compact. We s… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Project webpage: https://ajayj.com/vectorfusion

  34. arXiv:2210.14721  [pdf, other

    cs.LG cs.AI

    Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data

    Authors: John So, Amber Xie, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali Agha-mohammadi, Pieter Abbeel, Stephen James

    Abstract: Autonomous driving is complex, requiring sophisticated 3D scene understanding, localization, mapping, and control. Rather than explicitly modelling and fusing each of these components, we instead consider an end-to-end approach via reinforcement learning (RL). However, collecting exploration driving data in the real world is impractical and dangerous. While training in simulation and deploying vis… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: CoRL 2022 Paper

  35. arXiv:2210.13446  [pdf, other

    cs.RO

    Flying Trot Control Method for Quadruped Robot Based on Trajectory Planning

    Authors: Hongge Wang, Hui Chai, Bin Chen, Aizhen Xie, Rui Song, Bo Su

    Abstract: An intuitive control method for the flying trot, which combines offline trajectory planning with real-time balance control, is presented. The motion features of running animals in the vertical direction were analysed using the spring-load-inverted-pendulum (SLIP) model, and the foot trajectory of the robot was planned, so the robot could run similar to an animal capable of vertical flight, accordi… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 30 pages, 20 figures, journal

  36. arXiv:2210.10765  [pdf, other

    cs.LG

    When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning

    Authors: Annie Xie, Fahim Tajwar, Archit Sharma, Chelsea Finn

    Abstract: A long-term goal of reinforcement learning is to design agents that can autonomously interact and learn in the world. A critical challenge to such autonomy is the presence of irreversible states which require external assistance to recover from, such as when a robot arm has pushed an object off of a table. While standard agents require constant monitoring to decide when to intervene, we aim to des… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  37. arXiv:2210.07426  [pdf, other

    cs.LG cs.AI cs.RO

    Skill-Based Reinforcement Learning with Intrinsic Reward Matching

    Authors: Ademi Adeniji, Amber Xie, Pieter Abbeel

    Abstract: While unsupervised skill discovery has shown promise in autonomously acquiring behavioral primitives, there is still a large methodological disconnect between task-agnostic skill pretraining and downstream, task-aware finetuning. We present Intrinsic Reward Matching (IRM), which unifies these two phases of learning via the $\textit{skill discriminator}$, a pretraining model component often discard… ▽ More

    Submitted 25 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 16 pages

  38. arXiv:2209.07423  [pdf, other

    q-bio.BM cs.LG

    Can Pre-trained Models Really Learn Better Molecular Representations for AI-aided Drug Discovery?

    Authors: Ziqiao Zhang, Yatao Bian, Ailin Xie, Pengju Han, Long-Kai Huang, Shuigeng Zhou

    Abstract: Self-supervised pre-training is gaining increasingly more popularity in AI-aided drug discovery, leading to more and more pre-trained models with the promise that they can extract better feature representations for molecules. Yet, the quality of learned representations have not been fully explored. In this work, inspired by the two phenomena of Activity Cliffs (ACs) and Scaffold Hopping (SH) in tr… ▽ More

    Submitted 21 August, 2022; originally announced September 2022.

  39. arXiv:2207.03037  [pdf

    cs.CL

    Sensitivity Analysis on Transferred Neural Architectures of BERT and GPT-2 for Financial Sentiment Analysis

    Authors: Tracy Qian, Andy Xie, Camille Bruckmann

    Abstract: The explosion in novel NLP word embedding and deep learning techniques has induced significant endeavors into potential applications. One of these directions is in the financial sector. Although there is a lot of work done in state-of-the-art models like GPT and BERT, there are relatively few works on how well these methods perform through fine-tuning after being pre-trained, as well as info on ho… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  40. arXiv:2205.03758  [pdf, other

    cond-mat.mes-hall physics.optics

    Nonparaxiality-triggered Landau-Zener transition in topological photonic waveguides

    Authors: An Xie, Shaodong Zhou, Kelei Xi, Li Ding, Yiming Pan, Yongguan Ke, Huaiqiang Wang, Songlin Zhuang, Qingqing Cheng

    Abstract: Photonic lattices have been widely used for simulating quantum physics, owing to the similar evolutions of paraxial waves and quantum particles. However, nonparaxial wave propagations in photonic lattices break the paradigm of the quantum-optical analogy. Here, we reveal that nonparaxiality exerts stretched and compressed forces on the energy spectrum in the celebrated Aubry-Andre-Harper model. By… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

    Comments: 17 pages, 4 figures

  41. arXiv:2202.07013  [pdf, other

    cs.LG cs.AI cs.RO

    Robust Policy Learning over Multiple Uncertainty Sets

    Authors: Annie Xie, Shagun Sodhani, Chelsea Finn, Joelle Pineau, Amy Zhang

    Abstract: Reinforcement learning (RL) agents need to be robust to variations in safety-critical environments. While system identification methods provide a way to infer the variation from online experience, they can fail in settings where fast identification is not possible. Another dominant approach is robust RL which produces a policy that can handle worst-case scenarios, but these methods are generally d… ▽ More

    Submitted 4 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Project webpage at https://sites.google.com/view/sirsa-public/home

  42. arXiv:2111.14059  [pdf, other

    cs.CV cs.CY cs.LG

    NoFADE: Analyzing Diminishing Returns on CO2 Investment

    Authors: Andre Fu, Justin Tran, Andy Xie, Jonathan Spraggett, Elisa Ding, Chang-Won Lee, Kanav Singla, Mahdi S. Hosseini, Konstantinos N. Plataniotis

    Abstract: Climate change continues to be a pressing issue that currently affects society at-large. It is important that we as a society, including the Computer Vision (CV) community take steps to limit our impact on the environment. In this paper, we (a) analyze the effect of diminishing returns on CV methods, and (b) propose a \textit{``NoFADE''}: a novel entropy-based metric to quantify model--dataset--co… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: Climate Change with Machine Learning workshop at 35th Conference on Neural Information Processing Systems (NeurIPS2021-CCAI)

  43. arXiv:2110.08229  [pdf, other

    cs.RO cs.AI cs.LG cs.MA

    Influencing Towards Stable Multi-Agent Interactions

    Authors: Woodrow Z. Wang, Andy Shih, Annie Xie, Dorsa Sadigh

    Abstract: Learning in multi-agent environments is difficult due to the non-stationarity introduced by an opponent's or partner's changing behaviors. Instead of reactively adapting to the other agent's (opponent or partner) behavior, we propose an algorithm to proactively influence the other agent's strategy to stabilize -- which can restrain the non-stationarity caused by the other agent. We learn a low-dim… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: 15 pages, 5 figures, Published as an Oral at Conference on Robot Learning (CoRL) 2021

  44. arXiv:2109.09180  [pdf, other

    cs.LG cs.AI cs.RO

    Lifelong Robotic Reinforcement Learning by Retaining Experiences

    Authors: Annie Xie, Chelsea Finn

    Abstract: Multi-task learning ideally allows robots to acquire a diverse repertoire of useful skills. However, many multi-task reinforcement learning efforts assume the robot can collect data from all tasks at all times. In reality, the tasks that the robot learns arrive sequentially, depending on the user and the robot's current environment. In this work, we study a practical sequential multi-task RL probl… ▽ More

    Submitted 6 April, 2022; v1 submitted 19 September, 2021; originally announced September 2021.

    Comments: Supplementary website at https://sites.google.com/view/retain-experience/

  45. arXiv:2109.00115  [pdf, other

    eess.IV cs.CV cs.LG

    Uncertainty Quantified Deep Learning for Predicting Dice Coefficient of Digital Histopathology Image Segmentation

    Authors: Sambuddha Ghosal, Audrey Xie, Pratik Shah

    Abstract: Deep learning models (DLMs) can achieve state of the art performance in medical image segmentation and classification tasks. However, DLMs that do not provide feedback for their predictions such as Dice coefficients (Dice) have limited deployment potential in real world clinical settings. Uncertainty estimates can increase the trust of these automated systems by identifying predictions that need f… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: Submitted to the 2022 IEEE International Symposium on Biomedical Imaging (ISBI) scientific conference

    MSC Class: 68T07; 54H30 ACM Class: I.2.1; G.3

  46. arXiv:2011.06646  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Investigation of Edge States in Artificial Graphene Nano-Flakes

    Authors: Qiushi Zhang, Tszchun Wu, Guowen Kuang, Ayu Xie, Nian Lin

    Abstract: Graphene nano-flakes (GNFs) are predicted to host spin-polarized metallic edge states, which are envisioned for exploration of spintronics at the nanometer scale. To date, experimental realization of GNFs is only in its infancy because of the limitation of precise cutting or synthesizing methods at the nanometer scale. Here, we use low temperature scanning tunneling microscope (STM) to manipulate… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  47. arXiv:2011.06619  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Latent Representations to Influence Multi-Agent Interaction

    Authors: Annie Xie, Dylan P. Losey, Ryan Tolsma, Chelsea Finn, Dorsa Sadigh

    Abstract: Seamlessly interacting with humans or robots is hard because these agents are non-stationary. They update their policy in response to the ego agent's behavior, and the ego agent must anticipate these changes to co-adapt. Inspired by humans, we recognize that robots do not need to explicitly model every low-level action another agent will make; instead, we can capture the latent strategy of other a… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Conference on Robot Learning (CoRL) 2020. Supplementary website at https://sites.google.com/view/latent-strategies/

  48. Efficient Learning of Control Policies for Robust Quadruped Bounding using Pretrained Neural Networks

    Authors: Zhicheng Wang, Anqiao Li, Yixiao Zheng, Anhuan Xie, Zhibin Li, Jun Wu, Qiuguo Zhu

    Abstract: Bounding is one of the important gaits in quadrupedal locomotion for negotiating obstacles. The authors proposed an effective approach that can learn robust bounding gaits more efficiently despite its large variation in dynamic body movements. The authors first pretrained the neural network (NN) based on data from a robot operated by conventional model based controllers, and then further optimised… ▽ More

    Submitted 29 October, 2023; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: 12 pages

    Journal ref: IET Cyber-Systems and Robotics 2022 4(4):331-338

  49. arXiv:2006.10701  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Deep Reinforcement Learning amidst Lifelong Non-Stationarity

    Authors: Annie Xie, James Harrison, Chelsea Finn

    Abstract: As humans, our goals and our environment are persistently changing throughout our lifetime based on our experiences, actions, and internal and external drives. In contrast, typical reinforcement learning problem set-ups consider decision processes that are stationary across episodes. Can we develop reinforcement learning algorithms that can cope with the persistent change in the former, more reali… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: supplementary website at https://sites.google.com/stanford.edu/lilac/

  50. arXiv:1912.12773  [pdf, other

    cs.LG cs.RO stat.ML

    Learning Predictive Models From Observation and Interaction

    Authors: Karl Schmeckpeper, Annie Xie, Oleh Rybkin, Stephen Tian, Kostas Daniilidis, Sergey Levine, Chelsea Finn

    Abstract: Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes. However, learning a model that captures the dynamics of complex skills represents a major challenge: if the agent needs a good model to perform these skills, it migh… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.