Skip to main content

Showing 1–23 of 23 results for author: Thrun, S

  1. arXiv:2410.06424  [pdf, other

    cs.LG cs.CV

    Restructuring Vector Quantization with the Rotation Trick

    Authors: Christopher Fifty, Ronald G. Junkins, Dennis Duan, Aniketh Iger, Jerry W. Liu, Ehsan Amid, Sebastian Thrun, Christopher Ré

    Abstract: Vector Quantized Variational AutoEncoders (VQ-VAEs) are designed to compress a continuous input to a discrete latent space and reconstruct it with minimal distortion. They operate by maintaining a set of vectors -- often referred to as the codebook -- and quantizing each encoder output to the nearest vector in the codebook. However, as vector quantization is non-differentiable, the gradient to the… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  2. arXiv:2310.18844  [pdf, other

    cs.LG cs.AI

    BanditPAM++: Faster $k$-medoids Clustering

    Authors: Mo Tiwari, Ryan Kang, Donghyun Lee, Sebastian Thrun, Chris Piech, Ilan Shomorony, Martin Jinye Zhang

    Abstract: Clustering is a fundamental task in data science with wide-ranging applications. In $k$-medoids clustering, cluster centers must be actual datapoints and arbitrary distance metrics may be used; these features allow for greater interpretability of the cluster centers and the clustering of exotic objects in $k$-medoids clustering, respectively. $k$-medoids clustering has recently grown in popularity… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

    MSC Class: 68 ACM Class: I.m; I.2.0; I.2.6; K.3.2; I.2.m

  3. arXiv:2310.10971  [pdf, other

    cs.LG cs.CV

    Context-Aware Meta-Learning

    Authors: Christopher Fifty, Dennis Duan, Ronald G. Junkins, Ehsan Amid, Jure Leskovec, Christopher Re, Sebastian Thrun

    Abstract: Large Language Models like ChatGPT demonstrate a remarkable capacity to learn new concepts during inference without any fine-tuning. However, visual models trained to detect new objects during inference have been unable to replicate this ability, and instead either perform poorly or require meta-training and/or fine-tuning on similar objects. In this work, we propose a meta-learning algorithm that… ▽ More

    Submitted 25 March, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  4. arXiv:2310.08863  [pdf, other

    cs.LG

    In-Context Learning for Few-Shot Molecular Property Prediction

    Authors: Christopher Fifty, Jure Leskovec, Sebastian Thrun

    Abstract: In-context learning has become an important approach for few-shot learning in Large Language Models because of its ability to rapidly adapt to new tasks without fine-tuning model parameters. However, it is restricted to applications in natural language and inapplicable to other domains. In this paper, we adapt the concepts underpinning in-context learning to develop a new algorithm for few-shot mo… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  5. arXiv:2309.15312  [pdf, other

    cs.LG cs.AI

    MAPTree: Beating "Optimal" Decision Trees with Bayesian Decision Trees

    Authors: Colin Sullivan, Mo Tiwari, Sebastian Thrun

    Abstract: Decision trees remain one of the most popular machine learning models today, largely due to their out-of-the-box performance and interpretability. In this work, we present a Bayesian approach to decision tree induction via maximum a posteriori inference of a posterior distribution over trees. We first demonstrate a connection between maximum a posteriori inference of decision trees and AND/OR sear… ▽ More

    Submitted 19 December, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 19 pages

    ACM Class: I.2.0; I.2.6; I.2.m

  6. arXiv:2302.07407  [pdf, ps, other

    cs.LG cs.AI

    Bayesian Decision Trees via Tractable Priors and Probabilistic Context-Free Grammars

    Authors: Colin Sullivan, Mo Tiwari, Sebastian Thrun, Chris Piech

    Abstract: Decision Trees are some of the most popular machine learning models today due to their out-of-the-box performance and interpretability. Often, Decision Trees models are constructed greedily in a top-down fashion via heuristic search criteria, such as Gini impurity or entropy. However, trees constructed in this manner are sensitive to minor fluctuations in training data and are prone to overfitting… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 10 pages, 1 figure

    ACM Class: I.2.m; I.2.6; I.2.0

  7. arXiv:2212.07551  [pdf, ps, other

    cs.LG cs.AI

    Faster Maximum Inner Product Search in High Dimensions

    Authors: Mo Tiwari, Ryan Kang, Je-Yong Lee, Donghyun Lee, Chris Piech, Sebastian Thrun, Ilan Shomorony, Martin Jinye Zhang

    Abstract: Maximum Inner Product Search (MIPS) is a ubiquitous task in machine learning applications such as recommendation systems. Given a query vector and $n$ atom vectors in $d$-dimensional space, the goal of MIPS is to find the atom that has the highest inner product with the query vector. Existing MIPS algorithms scale at least as $O(\sqrt{d})$, which becomes computationally prohibitive in high-dimensi… ▽ More

    Submitted 26 June, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: 24 pages

  8. arXiv:2212.07473  [pdf, ps, other

    cs.LG cs.DS

    MABSplit: Faster Forest Training Using Multi-Armed Bandits

    Authors: Mo Tiwari, Ryan Kang, Je-Yong Lee, Sebastian Thrun, Chris Piech, Ilan Shomorony, Martin Jinye Zhang

    Abstract: Random forests are some of the most widely used machine learning models today, especially in domains that necessitate interpretability. We present an algorithm that accelerates the training of random forests and other popular tree-based learning methods. At the core of our algorithm is a novel node-splitting subroutine, dubbed MABSplit, used to efficiently find split points when constructing decis… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Published at NeurIPS 2022, 30 pages

    ACM Class: I.2.8

  9. arXiv:2006.06856  [pdf, other

    cs.LG cs.AI stat.ML

    BanditPAM: Almost Linear Time $k$-Medoids Clustering via Multi-Armed Bandits

    Authors: Mo Tiwari, Martin Jinye Zhang, James Mayclin, Sebastian Thrun, Chris Piech, Ilan Shomorony

    Abstract: Clustering is a ubiquitous task in data science. Compared to the commonly used $k$-means clustering, $k$-medoids clustering requires the cluster centers to be actual data points and support arbitrary distance metrics, which permits greater interpretability and the clustering of structured objects. Current state-of-the-art $k$-medoids clustering algorithms, such as Partitioning Around Medoids (PAM)… ▽ More

    Submitted 6 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 21 pages, NeurIPS 2020

  10. arXiv:1612.01074  [pdf, other

    cs.CV

    Skin Cancer Detection and Tracking using Data Synthesis and Deep Learning

    Authors: Yunzhu Li, Andre Esteva, Brett Kuprel, Rob Novoa, Justin Ko, Sebastian Thrun

    Abstract: Dense object detection and temporal tracking are needed across applications domains ranging from people-tracking to analysis of satellite imagery over time. The detection and tracking of malignant skin cancers and benign moles poses a particularly challenging problem due to the general uniformity of large skin patches, the fact that skin lesions vary little in their appearance, and the relatively… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

    Comments: 4 pages, 5 figures, Yunzhu Li and Andre Esteva contributed equally to this work

  11. arXiv:1604.01802  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Learning to Track at 100 FPS with Deep Regression Networks

    Authors: David Held, Sebastian Thrun, Silvio Savarese

    Abstract: Machine learning techniques are often used in computer vision due to their ability to leverage large amounts of training data to improve performance. Unfortunately, most generic object trackers are still trained from scratch online and do not benefit from the large number of videos that are readily available for offline training. We propose a method for offline training of neural networks that can… ▽ More

    Submitted 15 August, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: To appear in European Conference on Computer Vision (ECCV) 2016

  12. arXiv:1507.08286  [pdf, other

    cs.CV cs.LG cs.NE cs.RO

    Deep Learning for Single-View Instance Recognition

    Authors: David Held, Sebastian Thrun, Silvio Savarese

    Abstract: Deep learning methods have typically been trained on large datasets in which many training examples are available. However, many real-world product datasets have only a small number of images available for each product. We explore the use of deep learning methods for recognizing object instances when we have only a single training example per class. We show that feedforward neural networks outperf… ▽ More

    Submitted 29 July, 2015; originally announced July 2015.

    Comments: 16 pages, 15 figures

  13. arXiv:1301.2292  [pdf

    cs.AI cs.LG

    A Bayesian Multiresolution Independence Test for Continuous Variables

    Authors: Dimitris Margaritis, Sebastian Thrun

    Abstract: In this paper we present a method ofcomputing the posterior probability ofconditional independence of two or morecontinuous variables from data,examined at several resolutions. Ourapproach is motivated by theobservation that the appearance ofcontinuous data varies widely atvarious resolutions, producing verydifferent independence estimatesbetween the variablesinvolved. Therefore, it is difficult… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-346-353

  14. arXiv:1301.0607  [pdf

    cs.RO cs.AI

    Particle Filters in Robotics (Invited Talk)

    Authors: Sebastian Thrun

    Abstract: This presentation will introduce the audience to a new, emerging body of research on sequential Monte Carlo techniques in robotics. In recent years, particle filters have solved several hard perceptual robotic problems. Early successes were limited to low-dimensional problems, such as the problem of robot localization in environments with known maps. More recently, researchers have begun exploiti… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-511-518

  15. arXiv:1301.0551  [pdf

    cs.LG cs.RO stat.ML

    Learning Hierarchical Object Maps Of Non-Stationary Environments with mobile robots

    Authors: Dragomir Anguelov, Rahul Biswas, Daphne Koller, Benson Limketkai, Sebastian Thrun

    Abstract: Building models, or maps, of robot environments is a highly active research area; however, most existing techniques construct unstructured maps and assume static environments. In this paper, we present an algorithm for learning object models of non-stationary objects found in office-type environments. Our algorithm exploits the fact that many objects found in office environments look alike (e.g.,… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-10-17

  16. arXiv:1212.2495  [pdf

    cs.RO cs.AI eess.SY

    Policy-contingent abstraction for robust robot control

    Authors: Joelle Pineau, Geoffrey Gordon, Sebastian Thrun

    Abstract: This paper presents a scalable control algorithm that enables a deployed mobile robot system to make high-level decisions under full consideration of its probabilistic belief. Our approach is based on insights from the rich literature of hierarchical controllers and hierarchical MDPs. The resulting controller has been successfully deployed in a nursing facility near Pittsburgh,… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-477-484

  17. arXiv:1212.2493  [pdf

    cs.AI cs.RO

    Decentralized Sensor Fusion With Distributed Particle Filters

    Authors: Matthew Rosencrantz, Geoffrey Gordon, Sebastian Thrun

    Abstract: This paper presents a scalable Bayesian technique for decentralized state estimation from multiple platforms in dynamic environments. As has long been recognized, centralized architectures impose severe scaling limitations for distributed systems due to the enormous communication overheads. We propose a strictly decentralized approach in which only nearby platforms exchange… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-493-500

  18. arXiv:1207.4129  [pdf

    cs.CV

    Recovering Articulated Object Models from 3D Range Data

    Authors: Dragomir Anguelov, Daphne Koller, Hoi-Cheung Pang, Praveen Srinivasan, Sebastian Thrun

    Abstract: We address the problem of unsupervised learning of complex articulated object models from 3D range data. We describe an algorithm whose input is a set of meshes corresponding to different configurations of an articulated object. The algorithm automatically recovers a decomposition of the object into approximately rigid parts, the location of the parts in the different object instances, and the art… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-18-26

  19. arXiv:1207.1399  [pdf

    cs.RO cs.AI

    Robotic Mapping with Polygonal Random Fields

    Authors: Mark Paskin, Sebastian Thrun

    Abstract: Two types of probabilistic maps are popular in the mobile robotics literature: occupancy grids and geometric maps. Occupancy grids have the advantages of simplicity and speed, but they represent only a restricted class of maps and they make incorrect independence assumptions. On the other hand, current geometric approaches, which characterize the environment by features such as line segments, can… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-450-458

  20. arXiv:1206.6872  [pdf

    cs.CV cs.LG cs.RO

    A Self-Supervised Terrain Roughness Estimator for Off-Road Autonomous Driving

    Authors: David Stavens, Sebastian Thrun

    Abstract: We present a machine learning approach for estimating the second derivative of a drivable surface, its roughness. Robot perception generally focuses on the first derivative, obstacle detection. However, the second derivative is also important due to its direct relation (with speed) to the shock the vehicle experiences. Knowing the second derivative allows a vehicle to slow down in advance of rough… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-469-476

  21. Anytime Point-Based Approximations for Large POMDPs

    Authors: J. Pineau, G. Gordon, S. Thrun

    Abstract: The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact solutions in this framework are typically computationally intractable for all but the smallest problems. A well-known technique for speeding up POMDP solving involves performing value backups at specific belief points, rat… ▽ More

    Submitted 4 October, 2011; v1 submitted 30 September, 2011; originally announced October 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 27, pages 335-380, 2006

  22. Finding Approximate POMDP solutions Through Belief Compression

    Authors: N. Roy, G. Gordon, S. Thrun

    Abstract: Standard value function approaches to finding policies for Partially Observable Markov Decision Processes (POMDPs) are generally considered to be intractable for large models. The intractability of these algorithms is to a large extent a consequence of computing an exact, optimal policy over the entire belief space. However, in real-world POMDP problems, computing the optimal policy for the full b… ▽ More

    Submitted 4 October, 2011; v1 submitted 30 June, 2011; originally announced July 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 23, pages 1-40, 2005

  23. arXiv:1106.0222  [pdf, ps

    cs.AI cs.RO

    Markov Localization for Mobile Robots in Dynamic Environments

    Authors: W. Burgard, D. Fox, S. Thrun

    Abstract: Localization, that is the estimation of a robot's location from sensor data, is a fundamental problem in mobile robotics. This papers presents a version of Markov localization which provides accurate position estimates and which is tailored towards dynamic environments. The key idea of Markov localization is to maintain a probability density over the space of all locations of a rob… ▽ More

    Submitted 1 June, 2011; originally announced June 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 11, pages 391-427, 1999