Showing 1–3 of 3 results for author: Klett, P

  1. arXiv:2410.01793  [pdf, other]

    cond-mat.stat-mech cs.ET cs.LG

    Thermodynamic Bayesian Inference

    Authors: Maxwell Aifer, Samuel Duffield, Kaelan Donatella, Denis Melanson, Phoebe Klett, Zach Belateche, Gavin Crooks, Antonio J. Martinez, Patrick J. Coles

    Abstract: A fully Bayesian treatment of complicated predictive models (such as deep neural networks) would enable rigorous uncertainty quantification and the automation of higher-level tasks including model selection. However, the intractability of sampling Bayesian posteriors over many parameters inhibits the use of Bayesian methods where they are most needed. Thermodynamic computing has emerged as a parad…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: 20 pages, 8 figures

  2. arXiv:2406.02332  [pdf, other]

    cs.LG cs.CL

    Extended Mind Transformers

    Authors: Phoebe Klett, Thomas Ahle

    Abstract: Pre-trained language models demonstrate general intelligence and common sense, but long inputs quickly become a bottleneck for memorizing information at inference time. We resurface a simple method, Memorizing Transformers (Wu et al., 2022), that gives the model access to a bank of pre-computed memories. We show that it is possible to fix many of the shortcomings of the original method, such as th…

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2406.00104  [pdf, other]

    cs.LG stat.ML

    Scalable Bayesian Learning with posteriors

    Authors: Samuel Duffield, Kaelan Donatella, Johnathan Chiu, Phoebe Klett, Daniel Simpson

    Abstract: Although theoretically compelling, Bayesian learning with modern machine learning models is computationally challenging since it requires approximating a high dimensional posterior distribution. In this work, we (i) introduce posteriors, an easily extensible PyTorch library hosting general-purpose implementations making Bayesian learning accessible and scalable to large data and parameter regimes;…

    Submitted 31 May, 2024; originally announced June 2024.