Skip to main content

Showing 1–13 of 13 results for author: Pettee, M

  1. arXiv:2406.02585  [pdf, other

    cs.LG cs.AI stat.ML

    Contextual Counting: A Mechanistic Study of Transformers on a Quantitative Task

    Authors: Siavash Golkar, Alberto Bietti, Mariel Pettee, Michael Eickenberg, Miles Cranmer, Keiya Hirashima, Geraud Krawezik, Nicholas Lourie, Michael McCabe, Rudy Morel, Ruben Ohana, Liam Holden Parker, Bruno Régaldo-Saint Blancard, Kyunghyun Cho, Shirley Ho

    Abstract: Transformers have revolutionized machine learning across diverse domains, yet understanding their behavior remains crucial, particularly in high-stakes applications. This paper introduces the contextual counting task, a novel toy problem aimed at enhancing our understanding of Transformers in quantitative and scientific contexts. This task requires precise localization and computation within datas… ▽ More

    Submitted 30 May, 2024; originally announced June 2024.

  2. arXiv:2310.03024  [pdf, other

    astro-ph.IM cs.AI cs.LG

    AstroCLIP: A Cross-Modal Foundation Model for Galaxies

    Authors: Liam Parker, Francois Lanusse, Siavash Golkar, Leopoldo Sarra, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Geraud Krawezik, Michael McCabe, Ruben Ohana, Mariel Pettee, Bruno Regaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used - without any model fine-tuning - for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation fro… ▽ More

    Submitted 14 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, accepted in Monthly Notices of the Royal Astronomical Society, Presented at the NeurIPS 2023 AI4Science Workshop

  3. arXiv:2310.02994  [pdf, other

    cs.LG cs.AI stat.ML

    Multiple Physics Pretraining for Physical Surrogate Models

    Authors: Michael McCabe, Bruno Régaldo-Saint Blancard, Liam Holden Parker, Ruben Ohana, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Siavash Golkar, Geraud Krawezik, Francois Lanusse, Mariel Pettee, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: We introduce multiple physics pretraining (MPP), an autoregressive task-agnostic pretraining approach for physical surrogate modeling. MPP involves training large surrogate models to predict the dynamics of multiple heterogeneous physical systems simultaneously by learning features that are broadly useful across diverse physical tasks. In order to learn effectively in this setting, we introduce a… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  4. arXiv:2310.02989  [pdf, other

    stat.ML cs.AI cs.CL cs.LG

    xVal: A Continuous Number Encoding for Large Language Models

    Authors: Siavash Golkar, Mariel Pettee, Michael Eickenberg, Alberto Bietti, Miles Cranmer, Geraud Krawezik, Francois Lanusse, Michael McCabe, Ruben Ohana, Liam Parker, Bruno Régaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: Large Language Models have not yet been broadly adapted for the analysis of scientific datasets due in part to the unique difficulties of tokenizing numbers. We propose xVal, a numerical encoding scheme that represents any real number using just a single token. xVal represents a given real number by scaling a dedicated embedding vector by the number value. Combined with a modified number-inference… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 10 pages 7 figures. Supplementary: 5 pages 2 figures

  5. arXiv:2305.10500  [pdf, other

    hep-ph physics.data-an stat.AP stat.ML

    Learning Likelihood Ratios with Neural Network Classifiers

    Authors: Shahzar Rizvi, Mariel Pettee, Benjamin Nachman

    Abstract: The likelihood ratio is a crucial quantity for statistical inference in science that enables hypothesis testing, construction of confidence intervals, reweighting of distributions, and more. Many modern scientific applications, however, make use of data- or simulation-driven models for which computing the likelihood ratio can be very difficult or even impossible. By applying the so-called ``likeli… ▽ More

    Submitted 8 January, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

  6. arXiv:2305.03761  [pdf, other

    astro-ph.GA cs.LG hep-ph physics.data-an

    Weakly-Supervised Anomaly Detection in the Milky Way

    Authors: Mariel Pettee, Sowmya Thanvantri, Benjamin Nachman, David Shih, Matthew R. Buckley, Jack H. Collins

    Abstract: Large-scale astrophysics datasets present an opportunity for new machine learning techniques to identify regions of interest that might otherwise be overlooked by traditional searches. To this end, we use Classification Without Labels (CWoLa), a weakly-supervised anomaly detection method, to identify cold stellar streams within the more than one billion Milky Way stars observed by the Gaia satelli… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  7. Fast Point Cloud Generation with Diffusion Models in High Energy Physics

    Authors: Vinicius Mikuni, Benjamin Nachman, Mariel Pettee

    Abstract: Many particle physics datasets like those generated at colliders are described by continuous coordinates (in contrast to grid points like in an image), respect a number of symmetries (like permutation invariance), and have a stochastic dimensionality. For this reason, standard deep generative models that produce images or at least a fixed set of features are limiting. We introduce a new neural net… ▽ More

    Submitted 17 July, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 11 pages, 8 figures

  8. arXiv:2301.00501  [pdf, other

    physics.ins-det hep-ex hep-ph

    Heterogeneous Graph Neural Network for Identifying Hadronically Decayed Tau Leptons at the High Luminosity LHC

    Authors: Andris Huang, Xiangyang Ju, Jacob Lyons, Daniel Murnane, Mariel Pettee, Landon Reed

    Abstract: We present a new algorithm that identifies reconstructed jets originating from hadronic decays of tau leptons against those from quarks or gluons. No tau lepton reconstruction algorithm is used. Instead, the algorithm represents jets as heterogeneous graphs with tracks and energy clusters as nodes and trains a Graph Neural Network to identify tau jets from other jets. Different attributed graph re… ▽ More

    Submitted 27 June, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

    Comments: 14 pages, 10 figures, 4 tables

  9. arXiv:2209.10010  [pdf, other

    cs.LG

    Intentional Choreography with Semi-Supervised Recurrent VAEs

    Authors: Mathilde Papillon, Mariel Pettee, Nina Miolane

    Abstract: We summarize the model and results of PirouNet, a semi-supervised recurrent variational autoencoder. Given a small amount of dance sequences labeled with qualitative choreographic annotations, PirouNet conditionally generates dance sequences in the style of the choreographer.

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2207.12126

  10. arXiv:2209.06225  [pdf, other

    hep-ph hep-ex physics.data-an

    Anomaly Detection under Coordinate Transformations

    Authors: Gregor Kasieczka, Radha Mastandrea, Vinicius Mikuni, Benjamin Nachman, Mariel Pettee, David Shih

    Abstract: There is a growing need for machine learning-based anomaly detection strategies to broaden the search for Beyond-the-Standard-Model (BSM) physics at the Large Hadron Collider (LHC) and elsewhere. The first step of any anomaly detection approach is to specify observables and then use them to decide on a set of anomalous events. One common choice is to select events that have low probability density… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 10 pages, 6 figures

  11. arXiv:2207.12126  [pdf, other

    cs.LG

    PirouNet: Creating Dance through Artist-Centric Deep Learning

    Authors: Mathilde Papillon, Mariel Pettee, Nina Miolane

    Abstract: Using Artificial Intelligence (AI) to create dance choreography with intention is still at an early stage. Methods that conditionally generate dance sequences remain limited in their ability to follow choreographer-specific creative direction, often relying on external prompts or supervised learning. In the same vein, fully annotated dance datasets are rare and labor intensive. To fill this gap an… ▽ More

    Submitted 14 October, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: 18 pages, 6 figures

  12. arXiv:2203.06153  [pdf, other

    cs.LG astro-ph.IM cs.AI hep-ex hep-ph

    Symmetry Group Equivariant Architectures for Physics

    Authors: Alexander Bogatskiy, Sanmay Ganguly, Thomas Kipf, Risi Kondor, David W. Miller, Daniel Murnane, Jan T. Offermann, Mariel Pettee, Phiala Shanahan, Chase Shimmin, Savannah Thais

    Abstract: Physical theories grounded in mathematical symmetries are an essential component of our understanding of a wide range of properties of the universe. Similarly, in the domain of machine learning, an awareness of symmetries such as rotation or permutation invariance has driven impressive performance breakthroughs in computer vision, natural language processing, and other important applications. In t… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: Contribution to Snowmass 2021

  13. arXiv:1907.05297  [pdf, other

    cs.LG cs.MM stat.ML

    Beyond Imitation: Generative and Variational Choreography via Machine Learning

    Authors: Mariel Pettee, Chase Shimmin, Douglas Duhaime, Ilya Vidrin

    Abstract: Our team of dance artists, physicists, and machine learning researchers has collectively developed several original, configurable machine-learning tools to generate novel sequences of choreography as well as tunable variations on input choreographic sequences. We use recurrent neural network and autoencoder architectures from a training dataset of movements captured as 53 three-dimensional points… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: 8 pages, 11 figures, presented at the 10th International Conference on Computational Creativity (ICCC 2019)