Skip to main content

Showing 1–50 of 80 results for author: Liu, C K

  1. arXiv:2410.08464  [pdf, other

    cs.RO cs.AI

    ARCap: Collecting High-quality Human Demonstrations for Robot Learning with Augmented Reality Feedback

    Authors: Sirui Chen, Chen Wang, Kaden Nguyen, Li Fei-Fei, C. Karen Liu

    Abstract: Recent progress in imitation learning from human demonstrations has shown promising results in teaching robots manipulation skills. To further scale up training datasets, recent works start to use portable data collection devices without the need for physical robot hardware. However, due to the absence of on-robot feedback during data collection, the data quality depends heavily on user expertise,… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 8 pages, 8 Figures, submitted to ICRA 2025

  2. arXiv:2410.05791  [pdf, other

    cs.GR cs.AI cs.SD eess.AS

    FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance

    Authors: Ruocheng Wang, Pei Xu, Haochen Shi, Elizabeth Schumann, C. Karen Liu

    Abstract: Piano playing requires agile, precise, and coordinated hand control that stretches the limits of dexterity. Hand motion models with the sophistication to accurately recreate piano playing have a wide range of applications in character animation, embodied AI, biomechanics, and VR/AR. In this paper, we construct a first-of-its-kind large-scale dataset that contains approximately 10 hours of 3D hand… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: SIGGRAPH Asia 2024. Project page: https://for-elise.github.io/

  3. arXiv:2409.13426  [pdf, other

    cs.CV

    HMD$^2$: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device

    Authors: Vladimir Guzov, Yifeng Jiang, Fangzhou Hong, Gerard Pons-Moll, Richard Newcombe, C. Karen Liu, Yuting Ye, Lingni Ma

    Abstract: This paper investigates the online generation of realistic full-body human motion using a single head-mounted device with an outward-facing color camera and the ability to perform visual SLAM. Given the inherent ambiguity of this setup, we introduce a novel system, HMD$^2$, designed to balance between motion reconstruction and generation. From a reconstruction standpoint, our system aims to maxima… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  4. arXiv:2406.18537  [pdf, other

    cs.CV cs.AI cs.GR cs.RO

    AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

    Authors: Keenon Werling, Janelle Kaneda, Alan Tan, Rishi Agarwal, Six Skov, Tom Van Wouwe, Scott Uhlrich, Nicholas Bianco, Carmichael Ong, Antoine Falisse, Shardul Sapkota, Aidan Chandra, Joshua Carter, Ezio Preatoni, Benjamin Fregly, Jennifer Hicks, Scott Delp, C. Karen Liu

    Abstract: While reconstructing human poses in 3D from inexpensive sensors has advanced significantly in recent years, quantifying the dynamics of human motion, including the muscle-generated joint torques and external forces, remains a challenge. Prior attempts to estimate physics from reconstructed human poses have been hampered by a lack of datasets with high-quality pose and force data for a variety of m… ▽ More

    Submitted 16 May, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures, 4 tables

  5. arXiv:2406.17840  [pdf, other

    cs.AI cs.CV

    Human-Object Interaction from Human-Level Instructions

    Authors: Zhen Wu, Jiaman Li, C. Karen Liu

    Abstract: Intelligent agents need to autonomously navigate and interact within contextual environments to perform a wide range of daily tasks based on human-level instructions. These agents require a foundational understanding of the world, incorporating common sense and knowledge, to interpret such instructions. Moreover, they must possess precise low-level skills for movement and interaction to execute th… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 10 pages

  6. arXiv:2406.09905  [pdf, other

    cs.CV cs.GR

    Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild

    Authors: Lingni Ma, Yuting Ye, Fangzhou Hong, Vladimir Guzov, Yifeng Jiang, Rowan Postyeni, Luis Pesqueira, Alexander Gamino, Vijay Baiyya, Hyo Jin Kim, Kevin Bailey, David Soriano Fosas, C. Karen Liu, Ziwei Liu, Jakob Engel, Renzo De Nardi, Richard Newcombe

    Abstract: We introduce Nymeria - a large-scale, diverse, richly annotated human motion dataset collected in the wild with multiple multimodal egocentric devices. The dataset comes with a) full-body ground-truth motion; b) multiple multimodal egocentric data from Project Aria devices with videos, eye tracking, IMUs and etc; and c) a third-person perspective by an additional observer. All devices are precisel… ▽ More

    Submitted 19 September, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  7. arXiv:2406.00960  [pdf, other

    cs.GR cs.RO

    PDP: Physics-Based Character Animation via Diffusion Policy

    Authors: Takara E. Truong, Michael Piseno, Zhaoming Xie, C. Karen Liu

    Abstract: Generating diverse and realistic human motion that can physically interact with an environment remains a challenging research area in character animation. Meanwhile, diffusion-based methods, as proposed by the robotics community, have demonstrated the ability to capture highly diverse and multi-modal skills. However, naively training a diffusion policy often results in unstable motions for high-fr… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  8. arXiv:2404.13532  [pdf, other

    cs.RO

    SpringGrasp: Synthesizing Compliant, Dexterous Grasps under Shape Uncertainty

    Authors: Sirui Chen, Jeannette Bohg, C. Karen Liu

    Abstract: Generating stable and robust grasps on arbitrary objects is critical for dexterous robotic hands, marking a significant step towards advanced dexterous manipulation. Previous studies have mostly focused on improving differentiable grasping metrics with the assumption of precisely known object geometry. However, shape uncertainty is ubiquitous due to noisy and partial shape observations, which intr… ▽ More

    Submitted 25 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  9. arXiv:2404.07468  [pdf, other

    cs.RO

    One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

    Authors: Albert Wu, Ruocheng Wang, Sirui Chen, Clemens Eppner, C. Karen Liu

    Abstract: Extrinsic manipulation, the use of environment contacts to achieve manipulation objectives, enables strategies that are otherwise impossible with a parallel jaw gripper. However, orchestrating a long-horizon sequence of contact interactions between the robot, object, and environment is notoriously challenging due to the scene diversity, large action space, and difficult contact dynamics. We observ… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 8 pages, 6 figures

  10. arXiv:2403.19026  [pdf, other

    cs.CV

    EgoNav: Egocentric Scene-aware Human Trajectory Prediction

    Authors: Weizhuo Wang, C. Karen Liu, Monroe Kennedy III

    Abstract: Wearable collaborative robots stand to assist human wearers who need fall prevention assistance or wear exoskeletons. Such a robot needs to be able to constantly adapt to the surrounding scene based on egocentric vision, and predict the ego motion of the wearer. In this work, we leveraged body-mounted cameras and sensors to anticipate the trajectory of human wearers through complex surroundings. T… ▽ More

    Submitted 7 August, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures

  11. arXiv:2403.09227  [pdf, other

    cs.RO cs.AI

    BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation

    Authors: Chengshu Li, Ruohan Zhang, Josiah Wong, Cem Gokmen, Sanjana Srivastava, Roberto Martín-Martín, Chen Wang, Gabrael Levine, Wensi Ai, Benjamin Martinez, Hang Yin, Michael Lingelbach, Minjune Hwang, Ayano Hiranaka, Sujay Garlanka, Arman Aydin, Sharon Lee, Jiankai Sun, Mona Anvari, Manasi Sharma, Dhruva Bansal, Samuel Hunter, Kyu-Young Kim, Alan Lou, Caleb R Matthews , et al. (10 additional authors not shown)

    Abstract: We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an extensive survey on "what do you want robots to do for you?". The first is the definition of 1,000 everyday activities, grounded in 50 scenes (houses, gardens, restaurants, offices, etc.) with more than 9,000 objects annotated with… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: A preliminary version was published at 6th Conference on Robot Learning (CoRL 2022)

  12. arXiv:2403.07788  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation

    Authors: Chen Wang, Haochen Shi, Weizhuo Wang, Ruohan Zhang, Li Fei-Fei, C. Karen Liu

    Abstract: Imitation learning from human hand motion data presents a promising avenue for imbuing robots with human-like dexterity in real-world manipulation tasks. Despite this potential, substantial challenges persist, particularly with the portability of existing hand motion capture (mocap) systems and the complexity of translating mocap data into effective robotic policies. To tackle these issues, we int… ▽ More

    Submitted 4 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  13. Iterative Motion Editing with Natural Language

    Authors: Purvi Goel, Kuan-Chieh Wang, C. Karen Liu, Kayvon Fatahalian

    Abstract: Text-to-motion diffusion models can generate realistic animations from text prompts, but do not support fine-grained motion editing controls. In this paper, we present a method for using natural language to iteratively specify local edits to existing character animations, a task that is common in most computer animation workflows. Our key idea is to represent a space of motion edits using a set of… ▽ More

    Submitted 3 June, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  14. arXiv:2312.03913  [pdf, other

    cs.CV

    Controllable Human-Object Interaction Synthesis

    Authors: Jiaman Li, Alexander Clegg, Roozbeh Mottaghi, Jiajun Wu, Xavier Puig, C. Karen Liu

    Abstract: Synthesizing semantic-aware, long-horizon, human-object interaction is critical to simulate realistic human behaviors. In this work, we address the challenging problem of generating synchronized object motion and human motion guided by language descriptions in 3D scenes. We propose Controllable Human-Object Interaction Synthesis (CHOIS), an approach that generates object motion and human motion si… ▽ More

    Submitted 14 July, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: ECCV 2024, project webpage: https://lijiaman.github.io/projects/chois/

  15. arXiv:2311.00754  [pdf, other

    cs.RO cs.AI cs.LG

    Learning to Design and Use Tools for Robotic Manipulation

    Authors: Ziang Liu, Stephen Tian, Michelle Guo, C. Karen Liu, Jiajun Wu

    Abstract: When limited by their own morphologies, humans and some species of animals have the remarkable ability to use objects from the environment toward accomplishing otherwise impossible tasks. Robots might similarly unlock a range of additional capabilities through tool use. Recent techniques for jointly optimizing morphology and control via deep learning are effective at designing locomotion agents. B… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: First two authors contributed equally. Accepted at CoRL 2023

  16. arXiv:2310.07204  [pdf, other

    cs.AI cs.CV cs.GR cs.LG

    State of the Art on Diffusion Models for Visual Computing

    Authors: Ryan Po, Wang Yifan, Vladislav Golyanik, Kfir Aberman, Jonathan T. Barron, Amit H. Bermano, Eric Ryan Chan, Tali Dekel, Aleksander Holynski, Angjoo Kanazawa, C. Karen Liu, Lingjie Liu, Ben Mildenhall, Matthias Nießner, Björn Ommer, Christian Theobalt, Peter Wonka, Gordon Wetzstein

    Abstract: The field of visual computing is rapidly advancing due to the emergence of generative artificial intelligence (AI), which unlocks unprecedented capabilities for the generation, editing, and reconstruction of images, videos, and 3D scenes. In these domains, diffusion models are the generative AI architecture of choice. Within the last year alone, the literature on diffusion-based tools and applicat… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  17. arXiv:2309.16237  [pdf, other

    cs.CV

    Object Motion Guided Human Motion Synthesis

    Authors: Jiaman Li, Jiajun Wu, C. Karen Liu

    Abstract: Modeling human behaviors in contextual environments has a wide range of applications in character animation, embodied AI, VR/AR, and robotics. In real-world scenarios, humans frequently interact with the environment and manipulate various objects to complete daily tasks. In this work, we study the problem of full-body human motion synthesis for the manipulation of large-sized objects. We propose O… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: SIGGRAPH Asia 2023

  18. arXiv:2309.13742  [pdf, other

    cs.GR cs.CV

    DROP: Dynamics Responses from Human Motion Prior and Projective Dynamics

    Authors: Yifeng Jiang, Jungdam Won, Yuting Ye, C. Karen Liu

    Abstract: Synthesizing realistic human movements, dynamically responsive to the environment, is a long-standing objective in character animation, with applications in computer vision, sports, and healthcare, for motion prediction and data augmentation. Recent kinematics-based generative motion models offer impressive scalability in modeling extensive motion data, albeit without an interface to reason about… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: SIGGRAPH Asia 2023, Video https://youtu.be/tF5WW7qNMLI, Website: https://stanford-tml.github.io/drop/

  19. arXiv:2309.00987  [pdf, other

    cs.RO cs.AI cs.LG

    Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation

    Authors: Yuanpei Chen, Chen Wang, Li Fei-Fei, C. Karen Liu

    Abstract: Many real-world manipulation tasks consist of a series of subtasks that are significantly different from one another. Such long-horizon, complex tasks highlight the potential of dexterous hands, which possess adaptability and versatility, capable of seamlessly transitioning between different modes of functionality without the need for re-grasping or external tools. However, the challenges arise du… ▽ More

    Submitted 16 October, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: 7th Conference on Robot Learning (CoRL 2023)

  20. arXiv:2308.16682  [pdf, other

    cs.CV

    DiffusionPoser: Real-time Human Motion Reconstruction From Arbitrary Sparse Sensors Using Autoregressive Diffusion

    Authors: Tom Van Wouwe, Seunghwan Lee, Antoine Falisse, Scott Delp, C. Karen Liu

    Abstract: Motion capture from a limited number of body-worn sensors, such as inertial measurement units (IMUs) and pressure insoles, has important applications in health, human performance, and entertainment. Recent work has focused on accurately reconstructing whole-body motion from a specific sensor configuration using six IMUs. While a common goal across applications is to use the minimal number of senso… ▽ More

    Submitted 28 March, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: accepted at CVPR2024

  21. arXiv:2306.09532  [pdf, other

    cs.RO cs.GR

    Hierarchical Planning and Control for Box Loco-Manipulation

    Authors: Zhaoming Xie, Jonathan Tseng, Sebastian Starke, Michiel van de Panne, C. Karen Liu

    Abstract: Humans perform everyday tasks using a combination of locomotion and manipulation skills. Building a system that can handle both skills is essential to creating virtual humans. We present a physically-simulated human capable of solving box rearrangement tasks, which requires a combination of both skills. We propose a hierarchical control architecture, where each level solves the task at a different… ▽ More

    Submitted 8 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  22. Anatomically Detailed Simulation of Human Torso

    Authors: Seunghwan Lee, Yifeng Jiang, C. Karen Liu

    Abstract: Existing digital human models approximate the human skeletal system using rigid bodies connected by rotational joints. While the simplification is considered acceptable for legs and arms, it significantly lacks fidelity to model rich torso movements in common activities such as dancing, Yoga, and various sports. Research from biomechanics provides more detailed modeling for parts of the torso, but… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 9 pages, 11 figures, SIGGRAPH 2023, ACM Transactions on Graphics

    Journal ref: ACM Transaction on Graphics (SIGGPRAPH 2023), volume 42

  23. Synthesize Dexterous Nonprehensile Pregrasp for Ungraspable Objects

    Authors: Sirui Chen, Albert Wu, C. Karen Liu

    Abstract: Daily objects embedded in a contextual environment are often ungraspable initially. Whether it is a book sandwiched by other books on a fully packed bookshelf or a piece of paper lying flat on the desk, a series of nonprehensile pregrasp maneuvers is required to manipulate the object into a graspable state. Humans are proficient at utilizing environmental contacts to achieve manipulation tasks tha… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 11 pages, 9 figures, SIGGRAPH Conference Proceedings 2023

    Journal ref: ACM SIGGRAPH Conference Proceedings 2023

  24. arXiv:2303.17912  [pdf, other

    cs.CV cs.GR

    CIRCLE: Capture In Rich Contextual Environments

    Authors: Joao Pedro Araujo, Jiaman Li, Karthik Vetrivel, Rishi Agarwal, Deepak Gopinath, Jiajun Wu, Alexander Clegg, C. Karen Liu

    Abstract: Synthesizing 3D human motion in a contextual, ecological environment is important for simulating realistic activities people perform in the real world. However, conventional optics-based motion capture systems are not suited for simultaneously capturing human movements and complex scenes. The lack of rich contextual 3D human motion datasets presents a roadblock to creating high-quality generative… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

  25. arXiv:2303.13390  [pdf, other

    cs.RO

    On Designing a Learning Robot: Improving Morphology for Enhanced Task Performance and Learning

    Authors: Maks Sorokin, Chuyuan Fu, Jie Tan, C. Karen Liu, Yunfei Bai, Wenlong Lu, Sehoon Ha, Mohi Khansari

    Abstract: As robots become more prevalent, optimizing their design for better performance and efficiency is becoming increasingly important. However, current robot design practices overlook the impact of perception and design choices on a robot's learning capabilities. To address this gap, we propose a comprehensive methodology that accounts for the interplay between the robot's perception, hardware charact… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  26. Scene Synthesis from Human Motion

    Authors: Sifan Ye, Yixing Wang, Jiaman Li, Dennis Park, C. Karen Liu, Huazhe Xu, Jiajun Wu

    Abstract: Large-scale capture of human motion with diverse, complex scenes, while immensely useful, is often considered prohibitively costly. Meanwhile, human motion alone contains rich information about the scene they reside in and interact with. For example, a sitting human suggests the existence of a chair, and their leg position further implies the chair's pose. In this paper, we propose to synthesize d… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: 9 pages, 8 figures. Published in SIGGRAPH Asia 2022. Sifan Ye and Yixing Wang share equal contribution. Huazhe Xu and Jiajun Wu share equal contribution

  27. arXiv:2212.13660  [pdf, other

    cs.CV

    NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action

    Authors: Kuan-Chieh Wang, Zhenzhen Weng, Maria Xenochristou, Joao Pedro Araujo, Jeffrey Gu, C. Karen Liu, Serena Yeung

    Abstract: The task of reconstructing 3D human motion has wideranging applications. The gold standard Motion capture (MoCap) systems are accurate but inaccessible to the general public due to their cost, hardware and space constraints. In contrast, monocular human mesh recovery (HMR) methods are much more accessible than MoCap as they take single-view videos as inputs. Replacing the multi-view Mo- Cap system… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

  28. arXiv:2212.04741  [pdf, other

    cs.CV cs.AI cs.GR cs.RO

    Physically Plausible Animation of Human Upper Body from a Single Image

    Authors: Ziyuan Huang, Zhengping Zhou, Yung-Yu Chuang, Jiajun Wu, C. Karen Liu

    Abstract: We present a new method for generating controllable, dynamically responsive, and photorealistic human animations. Given an image of a person, our system allows the user to generate Physically plausible Upper Body Animation (PUBA) using interaction in the image space, such as dragging their hand to various locations. We formulate a reinforcement learning problem to train a dynamic model that predic… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: WACV 2023

  29. arXiv:2212.04636  [pdf, other

    cs.CV cs.GR

    Ego-Body Pose Estimation via Ego-Head Pose Estimation

    Authors: Jiaman Li, C. Karen Liu, Jiajun Wu

    Abstract: Estimating 3D human motion from an egocentric video sequence plays a critical role in human behavior understanding and has various applications in VR/AR. However, naively learning a mapping between egocentric videos and human motions is challenging, because the user's body is often unobserved by the front-facing camera placed on the head of the user. In addition, collecting large-scale, high-quali… ▽ More

    Submitted 27 August, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: CVPR 2023 (Award Candidate)

  30. arXiv:2211.10658  [pdf, other

    cs.SD cs.CV cs.GR eess.AS

    EDGE: Editable Dance Generation From Music

    Authors: Jonathan Tseng, Rodrigo Castellon, C. Karen Liu

    Abstract: Dance is an important human art form, but creating new dances can be difficult and time-consuming. In this work, we introduce Editable Dance GEneration (EDGE), a state-of-the-art method for editable dance generation that is capable of creating realistic, physically-plausible dances while remaining faithful to the input music. EDGE uses a transformer-based diffusion model paired with Jukebox, a str… ▽ More

    Submitted 27 November, 2022; v1 submitted 19 November, 2022; originally announced November 2022.

    Comments: Project website: https://edge-dance.github.io

  31. arXiv:2209.11886  [pdf, other

    cs.RO

    Trajectory and Sway Prediction Towards Fall Prevention

    Authors: Weizhuo Wang, Michael Raitor, Steve Collins, C. Karen Liu, Monroe Kennedy III

    Abstract: Falls are the leading cause of fatal and non-fatal injuries, particularly for older persons. Imbalance can result from the body's internal causes (illness), or external causes (active or passive perturbation). Active perturbation results from applying an external force to a person, while passive perturbation results from human motion interacting with a static obstacle. This work proposes a metric… ▽ More

    Submitted 3 March, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 6 pages + 1 page reference, 11 figures. Accepted by ICRA 2023

  32. arXiv:2207.00195  [pdf, other

    cs.RO

    Learning Diverse and Physically Feasible Dexterous Grasps with Generative Model and Bilevel Optimization

    Authors: Albert Wu, Michelle Guo, C. Karen Liu

    Abstract: To fully utilize the versatility of a multi-fingered dexterous robotic hand for executing diverse object grasps, one must consider the rich physical constraints introduced by hand-object interaction and object geometry. We propose an integrative approach of combining a generative model and a bilevel optimization (BO) to plan diverse grasp configurations on novel objects. First, a conditional varia… ▽ More

    Submitted 24 December, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

  33. arXiv:2204.09443  [pdf, other

    cs.CV

    GIMO: Gaze-Informed Human Motion Prediction in Context

    Authors: Yang Zheng, Yanchao Yang, Kaichun Mo, Jiaman Li, Tao Yu, Yebin Liu, C. Karen Liu, Leonidas J. Guibas

    Abstract: Predicting human motion is critical for assistive robots and AR/VR applications, where the interaction with humans needs to be safe and comfortable. Meanwhile, an accurate prediction depends on understanding both the scene context and human intentions. Even though many works study scene-aware human motion prediction, the latter is largely underexplored due to the lack of ego-centric views that dis… ▽ More

    Submitted 19 July, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  34. Transformer Inertial Poser: Real-time Human Motion Reconstruction from Sparse IMUs with Simultaneous Terrain Generation

    Authors: Yifeng Jiang, Yuting Ye, Deepak Gopinath, Jungdam Won, Alexander W. Winkler, C. Karen Liu

    Abstract: Real-time human motion reconstruction from a sparse set of (e.g. six) wearable IMUs provides a non-intrusive and economic approach to motion capture. Without the ability to acquire position information directly from IMUs, recent works took data-driven approaches that utilize large human motion datasets to tackle this under-determined problem. Still, challenges remain such as temporal consistency,… ▽ More

    Submitted 8 December, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: SIGGRAPH Asia 2022. Video: https://youtu.be/rXb6SaXsnc0. Code: https://github.com/jyf588/transformer-inertial-poser

  35. A Survey on Reinforcement Learning Methods in Character Animation

    Authors: Ariel Kwiatkowski, Eduardo Alvarado, Vicky Kalogeiton, C. Karen Liu, Julien Pettré, Michiel van de Panne, Marie-Paule Cani

    Abstract: Reinforcement Learning is an area of Machine Learning focused on how agents can be trained to make sequential decisions, and achieve a particular goal within an arbitrary environment. While learning, they repeatedly take actions based on their observation of the environment, and receive appropriate rewards which define the objective. This experience is then used to progressively improve the policy… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 27 pages, 6 figures, Eurographics STAR, Computer Graphics Forum

  36. arXiv:2202.09834  [pdf, other

    cs.RO cs.GR

    Real-time Model Predictive Control and System Identification Using Differentiable Physics Simulation

    Authors: Sirui Chen, Keenon Werling, Albert Wu, C. Karen Liu

    Abstract: Developing robot controllers in a simulated environment is advantageous but transferring the controllers to the target environment presents challenges, often referred to as the "sim-to-real gap". We present a method for continuous improvement of modeling and control after deploying the robot to a dynamically-changing target environment. We develop a differentiable physics simulation framework that… ▽ More

    Submitted 22 November, 2022; v1 submitted 20 February, 2022; originally announced February 2022.

  37. arXiv:2109.05603  [pdf, other

    cs.RO

    Learning to Navigate Sidewalks in Outdoor Environments

    Authors: Maks Sorokin, Jie Tan, C. Karen Liu, Sehoon Ha

    Abstract: Outdoor navigation on sidewalks in urban environments is the key technology behind important human assistive applications, such as last-mile delivery or neighborhood patrol. This paper aims to develop a quadruped robot that follows a route plan generated by public map services, while remaining on sidewalks and avoiding collisions with obstacles and pedestrians. We devise a two-staged learning fram… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: Submitted to IEEE Robotics and Automation Letters (RA-L)

  38. arXiv:2108.12536  [pdf, other

    cs.GR cs.AI cs.RO

    DASH: Modularized Human Manipulation Simulation with Vision and Language for Embodied AI

    Authors: Yifeng Jiang, Michelle Guo, Jiangshan Li, Ioannis Exarchos, Jiajun Wu, C. Karen Liu

    Abstract: Creating virtual humans with embodied, human-like perceptual and actuation constraints has the promise to provide an integrated simulation platform for many scientific and engineering applications. We present Dynamic and Autonomous Simulated Human (DASH), an embodied virtual human that, given natural language commands, performs grasp-and-stack tasks in a physically-simulated cluttered environment… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: SCA'2021

    Journal ref: In The ACM SIGGRAPH / Eurographics Symposium on Computer Animation (SCA 21), September 6~9, 2021, Virtual Event, USA. ACM, New York, NY, USA, 12 pages

  39. arXiv:2108.06038  [pdf, other

    cs.RO cs.AI

    Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

    Authors: Chen Wang, Claudia Pérez-D'Arpino, Danfei Xu, Li Fei-Fei, C. Karen Liu, Silvio Savarese

    Abstract: We present a method for learning a human-robot collaboration policy from human-human collaboration demonstrations. An effective robot assistant must learn to handle diverse human behaviors shown in the demonstrations and be robust when the humans adjust their strategies during online task execution. Our method co-optimizes a human policy and a robot policy in an interactive learning process: the h… ▽ More

    Submitted 20 September, 2023; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: CoRL 2021

  40. arXiv:2108.03332  [pdf, other

    cs.RO cs.AI cs.CV

    BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments

    Authors: Sanjana Srivastava, Chengshu Li, Michael Lingelbach, Roberto Martín-Martín, Fei Xia, Kent Vainio, Zheng Lian, Cem Gokmen, Shyamal Buch, C. Karen Liu, Silvio Savarese, Hyowon Gweon, Jiajun Wu, Li Fei-Fei

    Abstract: We introduce BEHAVIOR, a benchmark for embodied AI with 100 activities in simulation, spanning a range of everyday household chores such as cleaning, maintenance, and food preparation. These activities are designed to be realistic, diverse, and complex, aiming to reproduce the challenges that agents must face in the real world. Building such a benchmark poses three fundamental difficulties for eac… ▽ More

    Submitted 6 August, 2021; originally announced August 2021.

  41. arXiv:2108.03272  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks

    Authors: Chengshu Li, Fei Xia, Roberto Martín-Martín, Michael Lingelbach, Sanjana Srivastava, Bokui Shen, Kent Vainio, Cem Gokmen, Gokul Dharan, Tanish Jain, Andrey Kurenkov, C. Karen Liu, Hyowon Gweon, Jiajun Wu, Li Fei-Fei, Silvio Savarese

    Abstract: Recent research in embodied AI has been boosted by the use of simulation environments to develop and train robot learning approaches. However, the use of simulation has skewed the attention to tasks that only require what robotics simulators can simulate: motion and physical contact. We present iGibson 2.0, an open-source simulation environment that supports the simulation of a more diverse set of… ▽ More

    Submitted 3 November, 2021; v1 submitted 6 August, 2021; originally announced August 2021.

    Comments: Accepted at Conference on Robot Learning (CoRL) 2021. Project website: http://svl.stanford.edu/igibson/

  42. arXiv:2107.14285  [pdf, other

    cs.CV

    ADeLA: Automatic Dense Labeling with Attention for Viewpoint Adaptation in Semantic Segmentation

    Authors: Yanchao Yang, Hanxiang Ren, He Wang, Bokui Shen, Qingnan Fan, Youyi Zheng, C. Karen Liu, Leonidas Guibas

    Abstract: We describe an unsupervised domain adaptation method for image content shift caused by viewpoint changes for a semantic segmentation task. Most existing methods perform domain alignment in a shared space and assume that the mapping from the aligned space to the output is transferable. However, the novel content induced by viewpoint changes may nullify such a space for effective alignments, thus re… ▽ More

    Submitted 29 July, 2021; originally announced July 2021.

  43. DCL: Differential Contrastive Learning for Geometry-Aware Depth Synthesis

    Authors: Yuefan Shen, Yanchao Yang, Youyi Zheng, C. Karen Liu, Leonidas Guibas

    Abstract: We describe a method for unpaired realistic depth synthesis that learns diverse variations from the real-world depth scans and ensures geometric consistency between the synthetic and synthesized depth. The synthesized realistic depth can then be used to train task-specific networks facilitating label transfer from the synthetic domain. Unlike existing image synthesis pipelines, where geometries ar… ▽ More

    Submitted 28 February, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted by International Conference on Robotics and Automation (ICRA) 2022 and RA-L 2022

  44. arXiv:2105.11582  [pdf, other

    cs.RO cs.HC

    Characterizing Multidimensional Capacitive Servoing for Physical Human-Robot Interaction

    Authors: Zackory Erickson, Henry M. Clever, Vamsee Gangaram, Eliot Xing, Greg Turk, C. Karen Liu, Charles C. Kemp

    Abstract: Towards the goal of robots performing robust and intelligent physical interactions with people, it is crucial that robots are able to accurately sense the human body, follow trajectories around the body, and track human motion. This study introduces a capacitive servoing control scheme that allows a robot to sense and navigate around human limbs during close physical interactions. Capacitive servo… ▽ More

    Submitted 27 August, 2021; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: 17 pages, 22 figures, 4 tables, 2 algorithms

  45. arXiv:2103.16021  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Fast and Feature-Complete Differentiable Physics for Articulated Rigid Bodies with Contact

    Authors: Keenon Werling, Dalton Omens, Jeongseok Lee, Ioannis Exarchos, C. Karen Liu

    Abstract: We present a fast and feature-complete differentiable physics engine, Nimble (nimblephysics.org), that supports Lagrangian dynamics and hard contact constraints for articulated rigid body simulation. Our differentiable physics engine offers a complete set of features that are typically only available in non-differentiable physics simulators commonly used by robotics applications. We solve contact… ▽ More

    Submitted 22 June, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

  46. arXiv:2103.07732  [pdf, other

    cs.RO cs.LG

    Error-Aware Policy Learning: Zero-Shot Generalization in Partially Observable Dynamic Environments

    Authors: Visak Kumar, Sehoon Ha, C. Karen Liu

    Abstract: Simulation provides a safe and efficient way to generate useful data for learning complex robotic tasks. However, matching simulation and real-world dynamics can be quite challenging, especially for systems that have a large number of unobserved or unmeasurable parameters, which may lie in the robot dynamics itself or in the environment with which the robot interacts. We introduce a novel approach… ▽ More

    Submitted 13 March, 2021; originally announced March 2021.

  47. arXiv:2103.04942  [pdf, other

    cs.RO

    Task-Specific Design Optimization and Fabrication for Inflated-Beam Soft Robots with Growable Discrete Joints

    Authors: Ioannis Exarchos, Karen Wang, Brian H. Do, Fabio Stroppa, Margaret M. Coad, Allison M. Okamura, C. Karen Liu

    Abstract: Soft robot serial chain manipulators with the capability for growth, stiffness control, and discrete joints have the potential to approach the dexterity of traditional robot arms, while improving safety, lowering cost, and providing an increased workspace, with potential application in home environments. This paper presents an approach for design optimization of such robots to reach specified targ… ▽ More

    Submitted 22 September, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

  48. arXiv:2103.02533  [pdf, other

    cs.GR cs.LG

    Learning to Manipulate Amorphous Materials

    Authors: Yunbo Zhang, Wenhao Yu, C. Karen Liu, Charles C. Kemp, Greg Turk

    Abstract: We present a method of training character manipulation of amorphous materials such as those often used in cooking. Common examples of amorphous materials include granular materials (salt, uncooked rice), fluids (honey), and visco-plastic materials (sticky rice, softened butter). A typical task is to spread a given material out across a flat surface using a tool such as a scraper or knife. We use r… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

  49. arXiv:2101.06005  [pdf, other

    cs.RO

    SimGAN: Hybrid Simulator Identification for Domain Adaptation via Adversarial Reinforcement Learning

    Authors: Yifeng Jiang, Tingnan Zhang, Daniel Ho, Yunfei Bai, C. Karen Liu, Sergey Levine, Jie Tan

    Abstract: As learning-based approaches progress towards automating robot controllers design, transferring learned policies to new domains with different dynamics (e.g. sim-to-real transfer) still demands manual effort. This paper introduces SimGAN, a framework to tackle domain adaptation by identifying a hybrid physics simulator to match the simulated trajectories to the ones from the target domain, using a… ▽ More

    Submitted 31 May, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: ICRA 2021, Code Available at: https://github.com/jyf588/SimGAN ; Accompanying Video: https://youtu.be/McKOGllO7nc

  50. arXiv:2012.06662  [pdf, other

    cs.RO cs.LG

    Protective Policy Transfer

    Authors: Wenhao Yu, C. Karen Liu, Greg Turk

    Abstract: Being able to transfer existing skills to new situations is a key capability when training robots to operate in unpredictable real-world environments. A successful transfer algorithm should not only minimize the number of samples that the robot needs to collect in the new environment, but also prevent the robot from damaging itself or the surrounding environment during the transfer process. In thi… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.