Skip to main content

Showing 1–50 of 210 results for author: Roth, A

  1. arXiv:2410.03461  [pdf, other

    cs.CL cs.LG

    Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval Augmented Generation

    Authors: Tobias Leemann, Periklis Petridis, Giuseppe Vietri, Dionysis Manousakas, Aaron Roth, Sergul Aydore

    Abstract: While retrieval augmented generation (RAG) has been shown to enhance factuality of large language model (LLM) outputs, LLMs still suffer from hallucination, generating incorrect or irrelevant information. One common detection strategy involves prompting the LLM again to assess whether its response is grounded in the retrieved evidence, but this approach is costly. Alternatively, lightweight natura… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  2. arXiv:2409.20070  [pdf, other

    physics.bio-ph cond-mat.soft nlin.PS

    Deciphering the Interface Laws of Turing Mixtures and Foams

    Authors: Henrik Weyer, Tobias A. Roth, Erwin Frey

    Abstract: For cellular functions like division and polarization, protein pattern formation driven by NTPase cycles is a central spatial control strategy. Operating far from equilibrium, no general theory links microscopic reaction networks and parameters to the pattern type and dynamics. We discover a generic mechanism giving rise to an effective interfacial tension organizing the macroscopic structure of n… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: 11 pages main text, 5 pages Methods, 64 pages Supplementary Information; 12 figures

  3. arXiv:2409.14513  [pdf, other

    cs.LG cs.CR stat.ML

    Order of Magnitude Speedups for LLM Membership Inference

    Authors: Rongting Zhang, Martin Bertran, Aaron Roth

    Abstract: Large Language Models (LLMs) have the promise to revolutionize computing broadly, but their complexity and extensive training data also expose significant privacy vulnerabilities. One of the simplest privacy risks associated with LLMs is their susceptibility to membership inference attacks (MIAs), wherein an adversary aims to determine whether a specific data point was part of the model's training… ▽ More

    Submitted 24 September, 2024; v1 submitted 22 September, 2024; originally announced September 2024.

  4. arXiv:2409.11504  [pdf, other

    cs.LG

    Preventing Representational Rank Collapse in MPNNs by Splitting the Computational Graph

    Authors: Andreas Roth, Franka Bause, Nils M. Kriege, Thomas Liebig

    Abstract: The ability of message-passing neural networks (MPNNs) to fit complex functions over graphs is limited each iteration of message-passing over a simple makes representations more similar, a phenomenon known as rank collapse, and over-smoothing as a special case. Most approaches to mitigate over-smoothing extend common message-passing schemes, e.g., the graph convolutional network, by utilizing resi… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

  5. arXiv:2409.07437  [pdf, other

    cs.SD cs.CL eess.AS

    A Suite for Acoustic Language Model Evaluation

    Authors: Gallil Maimon, Amit Roth, Yossi Adi

    Abstract: Speech language models have recently demonstrated great potential as universal speech processing systems. Such models have the ability to model the rich acoustic information existing in audio signals, beyond spoken content, such as emotion, background noise, etc. Despite this, evaluation benchmarks which evaluate awareness to a wide range of acoustic aspects, are lacking. To help bridge this gap,… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

  6. arXiv:2409.05608  [pdf, other

    cs.GT

    The Value of Ambiguous Commitments in Multi-Follower Games

    Authors: Natalie Collina, Rabanus Derr, Aaron Roth

    Abstract: We study games in which a leader makes a single commitment, and then multiple followers (each with a different utility function) respond. In particular, we study ambiguous commitment strategies in these games, in which the leader may commit to a set of mixed strategies, and ambiguity-averse followers respond to maximize their worst-case utility over the set of leader strategies. Special cases of t… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  7. arXiv:2409.03956  [pdf, other

    cs.GT cs.LG econ.TH

    Algorithmic Collusion Without Threats

    Authors: Eshwar Ram Arunachaleswaran, Natalie Collina, Sampath Kannan, Aaron Roth, Juba Ziani

    Abstract: There has been substantial recent concern that pricing algorithms might learn to ``collude.'' Supra-competitive prices can emerge as a Nash equilibrium of repeated pricing games, in which sellers play strategies which threaten to punish their competitors who refuse to support high prices, and these strategies can be automatically learned. In fact, a standard economic intuition is that supra-compet… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  8. arXiv:2408.13430  [pdf, other

    stat.AP cs.DL cs.GT cs.LG stat.ML

    Analysis of the ICML 2023 Ranking Data: Can Authors' Opinions of Their Own Papers Assist Peer Review in Machine Learning?

    Authors: Buxin Su, Jiayao Zhang, Natalie Collina, Yuling Yan, Didong Li, Kyunghyun Cho, Jianqing Fan, Aaron Roth, Weijie J. Su

    Abstract: We conducted an experiment during the review process of the 2023 International Conference on Machine Learning (ICML) that requested authors with multiple submissions to rank their own papers based on perceived quality. We received 1,342 rankings, each from a distinct author, pertaining to 2,592 submissions. In this paper, we present an empirical analysis of how author-provided rankings could be le… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: See more details about the experiment at https://openrank.cc/

  9. arXiv:2407.12206  [pdf, other

    cs.CL cs.SD eess.AS

    A Language Modeling Approach to Diacritic-Free Hebrew TTS

    Authors: Amit Roth, Arnon Turetzky, Yossi Adi

    Abstract: We tackle the task of text-to-speech (TTS) in Hebrew. Traditional Hebrew contains Diacritics, which dictate the way individuals should pronounce given words, however, modern Hebrew rarely uses them. The lack of diacritics in modern Hebrew results in readers expected to conclude the correct pronunciation and understand which phonemes to use based on the context. This imposes a fundamental challenge… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted at Interspeech24

  10. arXiv:2407.11876  [pdf, other

    cs.LG

    Simplifying the Theory on Over-Smoothing

    Authors: Andreas Roth

    Abstract: Graph convolutions have gained popularity due to their ability to efficiently operate on data with an irregular geometric structure. However, graph convolutions cause over-smoothing, which refers to representations becoming more similar with increased depth. However, many different definitions and intuitions currently coexist, leading to research efforts focusing on incompatible directions. This p… ▽ More

    Submitted 2 September, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  11. arXiv:2407.10339  [pdf, other

    hep-ex astro-ph.HE astro-ph.IM astro-ph.SR nucl-ex physics.ins-det

    Supernova Pointing Capabilities of DUNE

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1340 additional authors not shown)

    Abstract: The determination of the direction of a stellar core collapse via its neutrino emission is crucial for the identification of the progenitor for a multimessenger follow-up. A highly effective method of reconstructing supernova directions within the Deep Underground Neutrino Experiment (DUNE) is introduced. The supernova neutrino pointing resolution is studied by simulating and reconstructing electr… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 25 pages, 16 figures

    Report number: FERMILAB-PUB-24-0319-LBNF

  12. arXiv:2407.07566  [pdf, other

    cs.CL cs.SD eess.AS

    HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing

    Authors: Arnon Turetzky, Or Tal, Yael Segal-Feldman, Yehoshua Dissen, Ella Zeldes, Amit Roth, Eyal Cohen, Yosi Shrem, Bronya R. Chernyak, Olga Seleznova, Joseph Keshet, Yossi Adi

    Abstract: We present HebDB, a weakly supervised dataset for spoken language processing in the Hebrew language. HebDB offers roughly 2500 hours of natural and spontaneous speech recordings in the Hebrew language, consisting of a large variety of speakers and topics. We provide raw recordings together with a pre-processed, weakly supervised, and filtered version. The goal of HebDB is to further enhance resear… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted at Interspeech2024

  13. arXiv:2405.20272  [pdf, other

    cs.LG cs.CR

    Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

    Authors: Martin Bertran, Shuai Tang, Michael Kearns, Jamie Morgenstern, Aaron Roth, Zhiwei Steven Wu

    Abstract: Machine unlearning is motivated by desire for data autonomy: a person can request to have their data's influence removed from deployed models, and those models should be updated as if they were retrained without the person's data. We show that, counter-intuitively, these updates expose individuals to high-accuracy reconstruction attacks which allow the attacker to recover their data in its entiret… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  14. arXiv:2405.16752  [pdf, other

    cs.LG cs.AI

    Model Ensembling for Constrained Optimization

    Authors: Ira Globus-Harris, Varun Gupta, Michael Kearns, Aaron Roth

    Abstract: There is a long history in machine learning of model ensembling, beginning with boosting and bagging and continuing to the present day. Much of this history has focused on combining models for classification and regression, but recently there is interest in more complex settings such as ensembling policies in reinforcement learning. Strong connections have also emerged between ensembling and multi… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  15. arXiv:2405.16739  [pdf, other

    cs.LG cs.AI eess.SY

    Oracle-Efficient Reinforcement Learning for Max Value Ensembles

    Authors: Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell

    Abstract: Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardinality) and experimentally (where function approximation and policy gradient techniques often scale poorly and suffer from instability and high variance). One line of research attempting to address thes… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  16. arXiv:2405.02225  [pdf, other

    stat.ML cs.AI cs.CY cs.LG stat.ME

    Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

    Authors: Lujing Zhang, Aaron Roth, Linjun Zhang

    Abstract: This paper introduces a framework for post-processing machine learning models so that their predictions satisfy multi-group fairness guarantees. Based on the celebrated notion of multicalibration, we introduce $(\mathbf{s},\mathcal{G}, α)-$GMC (Generalized Multi-Dimensional Multicalibration) for multi-dimensional mappings $\mathbf{s}$, constraint set $\mathcal{G}$, and a pre-specified threshold le… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 28 pages, 8 figures, accepted by ICML2024

  17. Hot Jupiter Diversity and the Onset of TiO/VO Revealed by a Large Grid of Non-Grey Global Circulation Models

    Authors: Alexander Roth, Vivien Parmentier, Mark Hammond

    Abstract: The population of hot Jupiters is extremely diverse, with large variations in their irradiation, period, gravity and chemical composition. To understand the intrinsic planet diversity through the observed population level trends, we explore the a-priori scatter in the population created by the different responses of atmospheric circulation to planetary parameters. We use the SPARC/MITgcm 3D global… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 28 pages, 25 figures, accepted in MNRAS

  18. arXiv:2404.04689  [pdf, other

    stat.ML cs.CL cs.LG

    Multicalibration for Confidence Scoring in LLMs

    Authors: Gianluca Detommaso, Martin Bertran, Riccardo Fogliato, Aaron Roth

    Abstract: This paper proposes the use of "multicalibration" to yield interpretable and reliable confidence scores for outputs generated by large language models (LLMs). Multicalibration asks for calibration not just marginally, but simultaneously across various intersecting groupings of the data. We show how to form groupings for prompt/completion pairs that are correlated with the probability of correctnes… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  19. arXiv:2402.17108  [pdf, ps, other

    cs.GT cs.DS cs.LG

    Repeated Contracting with Multiple Non-Myopic Agents: Policy Regret and Limited Liability

    Authors: Natalie Collina, Varun Gupta, Aaron Roth

    Abstract: We study a repeated contracting setting in which a Principal adaptively chooses amongst $k$ Agents at each of $T$ rounds. The Agents are non-myopic, and so a mechanism for the Principal induces a $T$-round extensive form game amongst the Agents. We give several results aimed at understanding an under-explored aspect of contract theory -- the game induced when choosing an Agent to contract with. Fi… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  20. arXiv:2402.11410  [pdf, ps, other

    cs.LG cs.DS stat.ML

    An Elementary Predictor Obtaining $2\sqrt{T}+1$ Distance to Calibration

    Authors: Eshwar Ram Arunachaleswaran, Natalie Collina, Aaron Roth, Mirah Shi

    Abstract: Blasiok et al. [2023] proposed distance to calibration as a natural measure of calibration error that unlike expected calibration error (ECE) is continuous. Recently, Qiao and Zheng [2024] gave a non-constructive argument establishing the existence of an online predictor that can obtain $O(\sqrt{T})$ distance to calibration in the adversarial setting, which is known to be impossible for ECE. They… ▽ More

    Submitted 7 October, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  21. arXiv:2402.10795  [pdf, other

    cs.LG cs.CY cs.HC

    Diversified Ensembling: An Experiment in Crowdsourced Machine Learning

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Pietro Perona, Aaron Roth

    Abstract: Crowdsourced machine learning on competition platforms such as Kaggle is a popular and often effective method for generating accurate models. Typically, teams vie for the most accurate model, as measured by overall error on a holdout set, and it is common towards the end of such competitions for teams at the top of the leaderboard to ensemble or average their models outside the platform mechanism… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  22. arXiv:2402.08753  [pdf, ps, other

    cs.GT cs.LG

    Forecasting for Swap Regret for All Downstream Agents

    Authors: Aaron Roth, Mirah Shi

    Abstract: We study the problem of making predictions so that downstream agents who best respond to them will be guaranteed diminishing swap regret, no matter what their utility functions are. It has been known since Foster and Vohra (1997) that agents who best-respond to calibrated forecasts have no swap regret. Unfortunately, the best known algorithms for guaranteeing calibrated forecasts in sequential adv… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  23. arXiv:2312.06589  [pdf, other

    econ.GN

    Power sector impacts of a simultaneous European heat pump rollout

    Authors: Alexander Roth

    Abstract: The decarbonization of buildings requires the phase-out of fossil fuel heating systems. Heat pumps are considered a crucial technology to supply a substantial part of heating energy for buildings. Yet, their introduction is not without challenges, as heat pumps generate additional electricity demand as well as peak loads. To better understand these challenges, an ambitious simultaneous heat pump r… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  24. arXiv:2312.05140  [pdf, other

    cs.LG cs.CR

    Membership Inference Attacks on Diffusion Models via Quantile Regression

    Authors: Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth

    Abstract: Recently, diffusion models have become popular tools for image synthesis because of their high-quality outputs. However, like other large-scale models, they may leak private information about their training data. Here, we demonstrate a privacy vulnerability of diffusion models through a \emph{membership inference (MI) attack}, which aims to identify whether a target example belongs to the training… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  25. arXiv:2311.07754  [pdf, other

    cs.GT cs.DS econ.TH

    Efficient Prior-Free Mechanisms for No-Regret Agents

    Authors: Natalie Collina, Aaron Roth, Han Shao

    Abstract: We study a repeated Principal Agent problem between a long lived Principal and Agent pair in a prior free setting. In our setting, the sequence of realized states of nature may be adversarially chosen, the Agent is non-myopic, and the Principal aims for a strong form of policy regret. Following Camara, Hartline, and Johnson, we model the Agent's long-run behavior with behavioral assumptions that r… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  26. arXiv:2310.17651  [pdf, other

    cs.LG

    High-Dimensional Prediction for Sequential Decision Making

    Authors: Georgy Noarov, Ramya Ramalingam, Aaron Roth, Stephan Xie

    Abstract: We study the problem of making predictions of an adversarially chosen high-dimensional state that are unbiased subject to an arbitrary collection of conditioning events, with the goal of tailoring these events to downstream decision makers. We give efficient algorithms for solving this problem, as well as a number of applications that stem from choosing an appropriate set of conditioning events.… ▽ More

    Submitted 27 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Added references, Arxiv abstract edited

  27. arXiv:2310.05693  [pdf, other

    astro-ph.HE astro-ph.GA

    CONGRuENTS (COsmic-ray, Neutrino, Gamma-ray and Radio Non-Thermal Spectra). II. Population-level correlations between galactic infrared, radio, and γ-ray emission

    Authors: Matt A. Roth, Mark R. Krumholz, Roland M. Crocker, Todd A. Thompson

    Abstract: Galaxies obey a number of empirical correlations between their radio, γ-ray, and infrared emission, but the physical origins of these correlations remain uncertain. Here we use the CONGRuENTS model for broadband non-thermal emission from star-forming galaxies, which self-consistently calculates energy-dependent transport and non-thermal emission from cosmic ray hadrons and leptons, to predict radi… ▽ More

    Submitted 15 June, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 17 pages, 14 figures

    Journal ref: MNRAS, Volume 530, Issue 2, May 2024, Pages 1849-1865

  28. arXiv:2310.04652  [pdf, other

    cs.LG

    Oracle Efficient Algorithms for Groupwise Regret

    Authors: Krishna Acharya, Eshwar Ram Arunachaleswaran, Sampath Kannan, Aaron Roth, Juba Ziani

    Abstract: We study the problem of online prediction, in which at each time step $t$, an individual $x_t$ arrives, whose label we must predict. Each individual is associated with various groups, defined based on their features such as age, sex, race etc., which may intersect. Our goal is to make predictions that have regret guarantees not just overall but also simultaneously on each sub-sequence comprised of… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  29. arXiv:2310.00946  [pdf, other

    cs.LG cs.AI

    Distilling Influences to Mitigate Prediction Churn in Graph Neural Networks

    Authors: Andreas Roth, Thomas Liebig

    Abstract: Models with similar performances exhibit significant disagreement in the predictions of individual samples, referred to as prediction churn. Our work explores this phenomenon in graph neural networks by investigating differences between models differing only in their initializations in their utilized features for predictions. We propose a novel metric called Influence Difference (ID) to quantify t… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted at ACML 2023

  30. arXiv:2309.06000  [pdf, other

    cs.RO

    Gait Design of a Novel Arboreal Concertina Locomotion for Snake-like Robots

    Authors: Shuoqi Chen, Aaron Roth

    Abstract: In this paper, we propose a novel strategy for a snake robot to move straight up a cylindrical surface. Prior works on pole-climbing for a snake robot mainly utilized a rolling helix gait, and although proven to be efficient, it does not reassemble movements made by a natural snake. We take inspiration from nature and seek to imitate the Arboreal Concertina Locomotion (ACL) from real-life serpents… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 4 pages, 3 figures

  31. arXiv:2308.16800  [pdf, other

    cs.LG cs.AI

    Rank Collapse Causes Over-Smoothing and Over-Correlation in Graph Neural Networks

    Authors: Andreas Roth, Thomas Liebig

    Abstract: Our study reveals new theoretical insights into over-smoothing and feature over-correlation in graph neural networks. Specifically, we demonstrate that with increased depth, node representations become dominated by a low-dimensional subspace that depends on the aggregation function but not on the feature transformations. For all aggregation functions, the rank of the node representations collapses… ▽ More

    Submitted 17 September, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: LoG 2023

  32. arXiv:2308.16516  [pdf, other

    cs.LG cs.AI

    Curvature-based Pooling within Graph Neural Networks

    Authors: Cedric Sanders, Andreas Roth, Thomas Liebig

    Abstract: Over-squashing and over-smoothing are two critical issues, that limit the capabilities of graph neural networks (GNNs). While over-smoothing eliminates the differences between nodes making them indistinguishable, over-squashing refers to the inability of GNNs to propagate information over long distances, as exponentially many node states are squashed into fixed-size representations. Both phenomena… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: ECMLPKDD 2023 - Workshop on Mining and Learning with Graphs

  33. arXiv:2307.12918  [pdf, other

    econ.GN

    Power sector benefits of flexible heat pumps

    Authors: Alexander Roth, Carlos Gaete-Morales, Dana Kirchem, Wolf-Peter Schill

    Abstract: Heat pumps play a major role in decreasing fossil fuel use in heating. They increase electricity demand, but could also foster the system integration of variable renewable energy sources. We analyze three scenarios for expanding decentralized heat pumps in Germany by 2030, focusing on the role of buffer heat storage. Using an open-source power sector model, we assess costs, capacity investments, a… ▽ More

    Submitted 15 October, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

  34. arXiv:2307.08999  [pdf, ps, other

    cs.LG stat.ML

    Oracle Efficient Online Multicalibration and Omniprediction

    Authors: Sumegha Garg, Christopher Jung, Omer Reingold, Aaron Roth

    Abstract: A recent line of work has shown a surprising connection between multicalibration, a multi-group fairness notion, and omniprediction, a learning paradigm that provides simultaneous loss minimization guarantees for a large family of loss functions. Prior work studies omniprediction in the batch setting. We initiate the study of omniprediction in the online adversarial setting. Although there exist a… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  35. arXiv:2307.03694  [pdf, other

    cs.LG cs.AI cs.CR

    Scalable Membership Inference Attacks via Quantile Regression

    Authors: Martin Bertran, Shuai Tang, Michael Kearns, Jamie Morgenstern, Aaron Roth, Zhiwei Steven Wu

    Abstract: Membership inference attacks are designed to determine, using black box access to trained models, whether a particular example was used in training or not. Membership inference can be formalized as a hypothesis testing problem. The most effective existing attacks estimate the distribution of some test statistic (usually the model's confidence on the true label) on points that were (and were not) u… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  36. Balanced Filtering via Disclosure-Controlled Proxies

    Authors: Siqi Deng, Emily Diana, Michael Kearns, Aaron Roth

    Abstract: We study the problem of collecting a cohort or set that is balanced with respect to sensitive groups when group membership is unavailable or prohibited from use at deployment time. Specifically, our deployment-time collection mechanism does not reveal significantly more about the group membership of any individual sample than can be ascertained from base rates alone. To do this, we study a learner… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Journal ref: 5th Symposium on Foundations of Responsible Computing (FORC 2024)

  37. Awesome SOSS: Atmospheric Characterisation of WASP-96 b using the JWST Early Release Observations

    Authors: Jake Taylor, Michael Radica, Luis Welbanks, Ryan J. MacDonald, Jasmina Blecic, Maria Zamyatina, Alexander Roth, Jacob L. Bean, Vivien Parmentier, Louis-Philippe Coulombe, Adina D. Feinstein, Néstor Espinoza, Björn Benneke, David Lafrenière, René Doyon, Eva-Maria Ahrer

    Abstract: The newly operational JWST offers the potential to study the atmospheres of distant worlds with precision that has not been achieved before. One of the first exoplanets observed by JWST in the summer of 2022 was WASP-96 b, a hot-Saturn orbiting a G8 star. As part of the Early Release Observations program, one transit of WASP-96 b was observed with NIRISS/SOSS to capture its transmission spectrum f… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 12 pages, 5 Figures. Accepted for publication in MNRAS. Companion paper to Radica et al., 2023

  38. arXiv:2303.03451  [pdf, other

    cs.LG cs.CR

    Improved Differentially Private Regression via Gradient Boosting

    Authors: Shuai Tang, Sergul Aydore, Michael Kearns, Saeyoung Rho, Aaron Roth, Yichen Wang, Yu-Xiang Wang, Zhiwei Steven Wu

    Abstract: We revisit the problem of differentially private squared error linear regression. We observe that existing state-of-the-art methods are sensitive to the choice of hyperparameters -- including the ``clipping threshold'' that cannot be set optimally in a data-independent way. We give a new algorithm for private linear regression based on gradient boosting. We show that our method consistently improv… ▽ More

    Submitted 20 May, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  39. arXiv:2302.08507  [pdf, ps, other

    cs.LG cs.DS math.ST

    The Scope of Multicalibration: Characterizing Multicalibration via Property Elicitation

    Authors: Georgy Noarov, Aaron Roth

    Abstract: We make a connection between multicalibration and property elicitation and show that (under mild technical conditions) it is possible to produce a multicalibrated predictor for a continuous scalar distributional property $Γ$ if and only if $Γ$ is elicitable. On the negative side, we show that for non-elicitable continuous properties there exist simple data distributions on which even the true di… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  40. arXiv:2301.13767  [pdf, other

    cs.LG cs.DS

    Multicalibration as Boosting for Regression

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Aaron Roth, Jessica Sorrell

    Abstract: We study the connection between multicalibration and boosting for squared error regression. First we prove a useful characterization of multicalibration in terms of a ``swap regret'' like condition on squared error. Using this characterization, we give an exceedingly simple algorithm that can be analyzed both as a boosting algorithm for regression and as a multicalibration algorithm for a class H… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Code available here: https://github.com/Declancharrison/Level-Set-Boosting

  41. arXiv:2212.09428  [pdf, other

    astro-ph.HE astro-ph.GA

    CONGRuENTS (COsmic-ray, Neutrino, Gamma-ray and Radio Non-Thermal Spectra). I. A predictive model for galactic non-thermal emission

    Authors: Matt A. Roth, Mark R. Krumholz, Roland M. Crocker, Todd A. Thompson

    Abstract: The total luminosity and spectral shape of the non-thermal emission produced by cosmic rays depends on their interstellar environment, a dependence that gives rise to correlations between galaxies' bulk properties -- star formation rate, stellar mass, and others -- and their non-thermal spectra. Understanding the physical mechanisms of cosmic ray transport, loss, and emission is key to understandi… ▽ More

    Submitted 16 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 23 pages, 14 figures, 1 table, accepted for publication in MNRAS

  42. Geographical balancing of wind power decreases storage needs in a 100% renewable European power sector

    Authors: Alexander Roth, Wolf-Peter Schill

    Abstract: To reduce greenhouse gas emissions, many countries plan to massively expand wind power and solar photovoltaic capacities. These variable renewable energy sources require additional flexibility in the power sector. Both geographical balancing enabled by interconnection and electricity storage can provide such flexibility. In a 100% renewable energy scenario of twelve central European countries, we… ▽ More

    Submitted 21 June, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  43. arXiv:2211.11596  [pdf, other

    cs.LG

    Forecasting Unobserved Node States with spatio-temporal Graph Neural Networks

    Authors: Andreas Roth, Thomas Liebig

    Abstract: Forecasting future states of sensors is key to solving tasks like weather prediction, route planning, and many others when dealing with networks of sensors. But complete spatial coverage of sensors is generally unavailable and would practically be infeasible due to limitations in budget and other resources during deployment and maintenance. Currently existing approaches using machine learning are… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  44. arXiv:2211.03128  [pdf, other

    cs.CY cs.CR cs.LG

    Confidence-Ranked Reconstruction of Census Microdata from Published Statistics

    Authors: Travis Dick, Cynthia Dwork, Michael Kearns, Terrance Liu, Aaron Roth, Giuseppe Vietri, Zhiwei Steven Wu

    Abstract: A reconstruction attack on a private dataset $D$ takes as input some publicly accessible information about the dataset and produces a list of candidate elements of $D$. We introduce a new class of data reconstruction attacks based on randomized methods for non-convex optimization. We empirically demonstrate that our attacks can not only reconstruct full rows of $D$ from aggregate query statistics… ▽ More

    Submitted 6 February, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

  45. arXiv:2209.15145  [pdf, other

    cs.LG math.ST

    Batch Multivalid Conformal Prediction

    Authors: Christopher Jung, Georgy Noarov, Ramya Ramalingam, Aaron Roth

    Abstract: We develop fast distribution-free conformal prediction algorithms for obtaining multivalid coverage on exchangeable data in the batch setting. Multivalid coverage guarantees are stronger than marginal coverage guarantees in two ways: (1) They hold even conditional on group membership -- that is, the target coverage level $1-α$ holds conditionally on membership in each of an arbitrary (potentially… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Code to replicate all of our experiments can be found at https://github.com/ProgBelarus/BatchMultivalidConformal

  46. arXiv:2209.09079  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based Robot Navigation

    Authors: Aaron M. Roth, Jing Liang, Ram Sriram, Elham Tabassi, Dinesh Manocha

    Abstract: We present Multiple Scenario Verifiable Reinforcement Learning via Policy Extraction (MSVIPER), a new method for policy distillation to decision trees for improved robot navigation. MSVIPER learns an "expert" policy using any Reinforcement Learning (RL) technique involving learning a state-action mapping and then uses imitation learning to learn a decision-tree policy from it. We demonstrate that… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 6 pages main paper, 2 pages of references, 5 page appendix (13 pages total) 5 tables, 9 algorithms, 4 figures

  47. arXiv:2209.07400  [pdf, other

    cs.LG

    Private Synthetic Data for Multitask Learning and Marginal Queries

    Authors: Giuseppe Vietri, Cedric Archambeau, Sergul Aydore, William Brown, Michael Kearns, Aaron Roth, Ankit Siva, Shuai Tang, Zhiwei Steven Wu

    Abstract: We provide a differentially private algorithm for producing synthetic data simultaneously useful for multiple tasks: marginal queries and multitask machine learning (ML). A key innovation in our algorithm is the ability to directly handle numerical features, in contrast to a number of related prior approaches which require numerical features to be first converted into {high cardinality} categorica… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: The short version of this paper appears in the proceedings of NeurIPS-22

  48. arXiv:2209.07375  [pdf, other

    cs.GT

    Wealth Dynamics Over Generations: Analysis and Interventions

    Authors: Krishna Acharya, Eshwar Ram Arunachaleswaran, Sampath Kannan, Aaron Roth, Juba Ziani

    Abstract: We present a stylized model with feedback loops for the evolution of a population's wealth over generations. Individuals have both talent and wealth: talent is a random variable distributed identically for everyone, but wealth is a random variable that is dependent on the population one is born into. Individuals then apply to a downstream agent, which we treat as a university throughout the paper… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  49. arXiv:2209.07312  [pdf, other

    cs.LG cs.DS

    Multicalibrated Regression for Downstream Fairness

    Authors: Ira Globus-Harris, Varun Gupta, Christopher Jung, Michael Kearns, Jamie Morgenstern, Aaron Roth

    Abstract: We show how to take a regression function $\hat{f}$ that is appropriately ``multicalibrated'' and efficiently post-process it into an approximately error minimizing classifier satisfying a large variety of fairness constraints. The post-processing requires no labeled data, and only a modest amount of unlabeled data and computation. The computational and sample complexity requirements of computing… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  50. arXiv:2209.01687  [pdf, ps, other

    cs.LG cs.DS math.ST

    Reconciling Individual Probability Forecasts

    Authors: Aaron Roth, Alexander Tolbert, Scott Weinstein

    Abstract: Individual probabilities refer to the probabilities of outcomes that are realized only once: the probability that it will rain tomorrow, the probability that Alice will die within the next 12 months, the probability that Bob will be arrested for a violent crime in the next 18 months, etc. Individual probabilities are fundamentally unknowable. Nevertheless, we show that two parties who agree on the… ▽ More

    Submitted 6 May, 2023; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: This is the full version of a paper that appears in the proceedings of FAccT 2023: The Sixth Annual ACM Conference on Fairness, Accountability, and Transparency, 2023