
Showing 1–6 of 6 results for author: Kelestemur, T

  1. arXiv:2407.20179

    cs.RO cs.AI cs.CV cs.LG

    Theia: Distilling Diverse Vision Foundation Models for Robot Learning

    Authors: Jinghuan Shang, Karl Schmeckpeper, Brandon B. May, Maria Vittoria Minniti, Tarik Kelestemur, David Watkins, Laura Herlant

    Abstract: Vision-based robot policy learning, which maps visual inputs to actions, necessitates a holistic understanding of diverse visual tasks beyond single-task needs like classification or segmentation. Inspired by this, we introduce Theia, a vision foundation model for robot learning that distills multiple off-the-shelf vision foundation models trained on varied vision tasks. Theia's rich visual repres…

    Submitted 10 October, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: CoRL 2024

  2. arXiv:2407.01812

    cs.RO cs.LG

    Equivariant Diffusion Policy

    Authors: Dian Wang, Stephen Hart, David Surovik, Tarik Kelestemur, Haojie Huang, Haibo Zhao, Mark Yeatman, Jiuguang Wang, Robin Walters, Robert Platt

    Abstract: Recent work has shown diffusion models are an effective approach to learning the multimodal distributions arising from demonstration data in behavior cloning. However, a drawback of this approach is the need to learn a denoising function, which is significantly more complex than learning an explicit policy. In this work, we propose Equivariant Diffusion Policy, a novel diffusion policy learning me…

    Submitted 15 October, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: Conference on Robot Learning 2024, Oral Presentation

  3. arXiv:2309.16118

    cs.RO cs.CV cs.LG

    D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement

    Authors: Yixuan Wang, Mingtong Zhang, Zhuoran Li, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li

    Abstract: Scene representation is a crucial design choice in robotic manipulation systems. An ideal representation is expected to be 3D, dynamic, and semantic to meet the demands of diverse manipulation tasks. However, previous works often lack all three properties simultaneously. In this work, we introduce D$^3$Fields -- dynamic 3D descriptor fields. These fields are implicit 3D representations that take i…

    Submitted 16 October, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to Conference on Robot Learning (CoRL 2024) as Oral Presentation. The first three authors contributed equally. Project Page: https://robopil.github.io/d3fields/

  4. arXiv:2207.00942

    cs.RO

    Pregrasp Object Material Classification by a Novel Gripper Design with Integrated Spectroscopy

    Authors: Nathaniel Hanson, Tarik Kelestemur, Deniz Erdogmus, Taskin Padir

    Abstract: Robots benefit from being able to classify objects they interact with or manipulate based on their material properties. This capability ensures fine manipulation of complex objects through proper grasp pose and force selection. Prior work has focused on haptic or visual processing to determine material type at grasp time. In this work, we introduce a novel parallel robot gripper design and a metho…

    Submitted 2 July, 2022; originally announced July 2022.

  5. arXiv:2203.10685

    cs.RO

    Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation

    Authors: Tarik Kelestemur, Robert Platt, Taskin Padir

    Abstract: Object pose estimation methods allow finding locations of objects in unstructured environments. This is a highly desired skill for autonomous robot manipulation as robots need to estimate the precise poses of the objects in order to manipulate them. In this paper, we investigate the problems of tactile pose estimation and manipulation for category-level objects. Our proposed method uses a Bayes fi…

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted at the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022)

  6. arXiv:2011.05559

    cs.RO

    Learning Bayes Filter Models for Tactile Localization

    Authors: Tarik Kelestemur, Colin Keil, John P. Whitney, Robert Platt, Taskin Padir

    Abstract: Localizing and tracking the pose of robotic grippers are necessary skills for manipulation tasks. However, the manipulators with imprecise kinematic models (e.g. low-cost arms) or manipulators with unknown world coordinates (e.g. poor camera-arm calibration) cannot locate the gripper with respect to the world. In these circumstances, we can leverage tactile feedback between the gripper and the env…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted at IROS 2020