-
Altogether: Image Captioning via Re-aligning Alt-text
Authors:
Hu Xu,
Po-Yao Huang,
Xiaoqing Ellen Tan,
Ching-Feng Yeh,
Jacob Kahn,
Christine Jou,
Gargi Ghosh,
Omer Levy,
Luke Zettlemoyer,
Wen-tau Yih,
Shang-Wen Li,
Saining Xie,
Christoph Feichtenhofer
Abstract:
This paper focuses on creating synthetic data to improve the quality of image captions. Existing works typically have two shortcomings. First, they caption images from scratch, ignoring existing alt-text metadata, and second, lack transparency if the captioners' training data (e.g. GPT) is unknown. In this paper, we study a principled approach Altogether based on the key idea to edit and re-align…
▽ More
This paper focuses on creating synthetic data to improve the quality of image captions. Existing works typically have two shortcomings. First, they caption images from scratch, ignoring existing alt-text metadata, and second, lack transparency if the captioners' training data (e.g. GPT) is unknown. In this paper, we study a principled approach Altogether based on the key idea to edit and re-align existing alt-texts associated with the images. To generate training data, we perform human annotation where annotators start with the existing alt-text and re-align it to the image content in multiple rounds, consequently constructing captions with rich visual concepts. This differs from prior work that carries out human annotation as a one-time description task solely based on images and annotator knowledge. We train a captioner on this data that generalizes the process of re-aligning alt-texts at scale. Our results show our Altogether approach leads to richer image captions that also improve text-to-image generation and zero-shot image classification tasks.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes
Authors:
Cheng-De Fan,
Chen-Wei Chang,
Yi-Ruei Liu,
Jie-Ying Lee,
Jiun-Long Huang,
Yu-Chee Tseng,
Yu-Lun Liu
Abstract:
We present SpectroMotion, a novel approach that combines 3D Gaussian Splatting (3DGS) with physically-based rendering (PBR) and deformation fields to reconstruct dynamic specular scenes. Previous methods extending 3DGS to model dynamic scenes have struggled to accurately represent specular surfaces. Our method addresses this limitation by introducing a residual correction technique for accurate su…
▽ More
We present SpectroMotion, a novel approach that combines 3D Gaussian Splatting (3DGS) with physically-based rendering (PBR) and deformation fields to reconstruct dynamic specular scenes. Previous methods extending 3DGS to model dynamic scenes have struggled to accurately represent specular surfaces. Our method addresses this limitation by introducing a residual correction technique for accurate surface normal computation during deformation, complemented by a deformable environment map that adapts to time-varying lighting conditions. We implement a coarse-to-fine training strategy that significantly enhances both scene geometry and specular color prediction. We demonstrate that our model outperforms prior methods for view synthesis of scenes containing dynamic specular objects and that it is the only existing 3DGS method capable of synthesizing photorealistic real-world dynamic specular scenes, outperforming state-of-the-art methods in rendering complex, dynamic, and specular scenes.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
Authors:
Long Xing,
Qidong Huang,
Xiaoyi Dong,
Jiajie Lu,
Pan Zhang,
Yuhang Zang,
Yuhang Cao,
Conghui He,
Jiaqi Wang,
Feng Wu,
Dahua Lin
Abstract:
In large vision-language models (LVLMs), images serve as inputs that carry a wealth of information. As the idiom "A picture is worth a thousand words" implies, representing a single image in current LVLMs can require hundreds or even thousands of tokens. This results in significant computational costs, which grow quadratically as input image resolution increases, thereby severely impacting the eff…
▽ More
In large vision-language models (LVLMs), images serve as inputs that carry a wealth of information. As the idiom "A picture is worth a thousand words" implies, representing a single image in current LVLMs can require hundreds or even thousands of tokens. This results in significant computational costs, which grow quadratically as input image resolution increases, thereby severely impacting the efficiency of both training and inference. Previous approaches have attempted to reduce the number of image tokens either before or within the early layers of LVLMs. However, these strategies inevitably result in the loss of crucial image information, ultimately diminishing model performance. To address this challenge, we conduct an empirical study revealing that all visual tokens are necessary for LVLMs in the shallow layers, and token redundancy progressively increases in the deeper layers of the model. To this end, we propose PyramidDrop, a visual redundancy reduction strategy for LVLMs to boost their efficiency in both training and inference with neglectable performance loss. Specifically, we partition the LVLM into several stages and drop part of the image tokens at the end of each stage with a pre-defined ratio, creating pyramid-like visual tokens across model layers. The dropping is based on a lightweight similarity calculation with a negligible time overhead. Extensive experiments demonstrate that PyramidDrop can achieve a 40% training time and 55% inference FLOPs acceleration of LLaVA-NeXT with comparable performance. Besides, the PyramidDrop could also serve as a plug-and-play strategy for inference acceleration without training, with better performance and lower inference cost than counterparts. We hope that the insights and approach introduced by PyramidDrop will inspire future research to further investigate the role of image tokens in LVLMs.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning
Authors:
Yizhou Chi,
Yizhang Lin,
Sirui Hong,
Duyi Pan,
Yaying Fei,
Guanghao Mei,
Bangbang Liu,
Tianqi Pang,
Jacky Kwok,
Ceyao Zhang,
Bang Liu,
Chenglin Wu
Abstract:
Automated Machine Learning (AutoML) approaches encompass traditional methods that optimize fixed pipelines for model selection and ensembling, as well as newer LLM-based frameworks that autonomously build pipelines. While LLM-based agents have shown promise in automating machine learning tasks, they often generate low-diversity and suboptimal code, even after multiple iterations. To overcome these…
▽ More
Automated Machine Learning (AutoML) approaches encompass traditional methods that optimize fixed pipelines for model selection and ensembling, as well as newer LLM-based frameworks that autonomously build pipelines. While LLM-based agents have shown promise in automating machine learning tasks, they often generate low-diversity and suboptimal code, even after multiple iterations. To overcome these limitations, we introduce Tree-Search Enhanced LLM Agents (SELA), an innovative agent-based system that leverages Monte Carlo Tree Search (MCTS) to optimize the AutoML process. By representing pipeline configurations as trees, our framework enables agents to conduct experiments intelligently and iteratively refine their strategies, facilitating a more effective exploration of the machine learning solution space. This novel approach allows SELA to discover optimal pathways based on experimental feedback, improving the overall quality of the solutions. In an extensive evaluation across 20 machine learning datasets, we compare the performance of traditional and agent-based AutoML methods, demonstrating that SELA achieves a win rate of 65% to 80% against each baseline across all datasets. These results underscore the significant potential of agent-based strategies in AutoML, offering a fresh perspective on tackling complex machine learning challenges.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Few-shot In-Context Preference Learning Using Large Language Models
Authors:
Chao Yu,
Hong Lu,
Jiaxuan Gao,
Qixin Tan,
Xinting Yang,
Yu Wang,
Yi Wu,
Eugene Vinitsky
Abstract:
Designing reward functions is a core component of reinforcement learning but can be challenging for truly complex behavior. Reinforcement Learning from Human Feedback (RLHF) has been used to alleviate this challenge by replacing a hand-coded reward function with a reward function learned from preferences. However, it can be exceedingly inefficient to learn these rewards as they are often learned t…
▽ More
Designing reward functions is a core component of reinforcement learning but can be challenging for truly complex behavior. Reinforcement Learning from Human Feedback (RLHF) has been used to alleviate this challenge by replacing a hand-coded reward function with a reward function learned from preferences. However, it can be exceedingly inefficient to learn these rewards as they are often learned tabula rasa. We investigate whether Large Language Models (LLMs) can reduce this query inefficiency by converting an iterative series of human preferences into code representing the rewards. We propose In-Context Preference Learning (ICPL), a method that uses the grounding of an LLM to accelerate learning reward functions from preferences. ICPL takes the environment context and task description, synthesizes a set of reward functions, and then repeatedly updates the reward functions using human rankings of videos of the resultant policies. Using synthetic preferences, we demonstrate that ICPL is orders of magnitude more efficient than RLHF and is even competitive with methods that use ground-truth reward functions instead of preferences. Finally, we perform a series of human preference-learning trials and observe that ICPL extends beyond synthetic settings and can work effectively with humans-in-the-loop. Additional information and videos are provided at https://sites.google.com/view/few-shot-icpl/home.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Solving the Independent Domination Problem by Quantum Approximate Optimization Algorithm
Authors:
Haoqian Pan,
Changhong Lu
Abstract:
In the wake of quantum computing advancements and quantum algorithmic progress, quantum algorithms are increasingly being employed to address a myriad of combinatorial optimization problems. Among these, the Independent Domination Problem (IDP), a derivative of the Domination Problem, has practical implications in various real-world scenarios. Despite this, existing classical algorithms for IDP ar…
▽ More
In the wake of quantum computing advancements and quantum algorithmic progress, quantum algorithms are increasingly being employed to address a myriad of combinatorial optimization problems. Among these, the Independent Domination Problem (IDP), a derivative of the Domination Problem, has practical implications in various real-world scenarios. Despite this, existing classical algorithms for IDP are plagued by high computational complexity, and quantum algorithms have yet to tackle this challenge. This paper introduces a Quantum Approximate Optimization Algorithm (QAOA)-based approach to address the IDP. Utilizing IBM's qasm_simulator, we have demonstrated the efficacy of QAOA in solving IDP under specific parameter settings, with a computational complexity that surpasses that of classical methods. Our findings offer a novel avenue for the resolution of IDP.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods
Authors:
Tsachi Blau,
Moshe Kimhi,
Yonatan Belinkov,
Alexander Bronstein,
Chaim Baskin
Abstract:
Fine-tuning Large Language Models (LLMs) typically involves updating at least a few billions of parameters. A more parameter-efficient approach is Prompt Tuning (PT), which updates only a few learnable tokens, and differently, In-Context Learning (ICL) adapts the model to a new task by simply including examples in the input without any training. When applying optimization-based methods, such as fi…
▽ More
Fine-tuning Large Language Models (LLMs) typically involves updating at least a few billions of parameters. A more parameter-efficient approach is Prompt Tuning (PT), which updates only a few learnable tokens, and differently, In-Context Learning (ICL) adapts the model to a new task by simply including examples in the input without any training. When applying optimization-based methods, such as fine-tuning and PT for few-shot learning, the model is specifically adapted to the small set of training examples, whereas ICL leaves the model unchanged. This distinction makes traditional learning methods more prone to overfitting; in contrast, ICL is less sensitive to the few-shot scenario. While ICL is not prone to overfitting, it does not fully extract the information that exists in the training examples. This work introduces Context-aware Prompt Tuning (CPT), a method inspired by ICL, PT, and adversarial attacks. We build on the ICL strategy of concatenating examples before the input, but we extend this by PT-like learning, refining the context embedding through iterative optimization to extract deeper insights from the training examples. We carefully modify specific context tokens, considering the unique structure of input and output formats. Inspired by adversarial attacks, we adjust the input based on the labels present in the context, focusing on minimizing, rather than maximizing, the loss. Moreover, we apply a projected gradient descent algorithm to keep token embeddings close to their original values, under the assumption that the user-provided data is inherently valuable. Our method has been shown to achieve superior accuracy across multiple classification tasks using various LLM models.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Paracomposition Operators and Paradifferential Reducibility
Authors:
Thomas Alazard,
Chengyang Shao
Abstract:
Reducibility methods, aiming to simplify systems by conjugating them to those with constant coefficients, are crucial for studying the existence of quasiperiodic solutions. In KAM theory for PDEs, these methods help address the invertibility of linearized operators that arise in a Nash-Moser/KAM type scheme. The goal of this paper is to prove paradifferential reducibility results, enabling the red…
▽ More
Reducibility methods, aiming to simplify systems by conjugating them to those with constant coefficients, are crucial for studying the existence of quasiperiodic solutions. In KAM theory for PDEs, these methods help address the invertibility of linearized operators that arise in a Nash-Moser/KAM type scheme. The goal of this paper is to prove paradifferential reducibility results, enabling the reduction of nonlinear equations themselves, rather than just their linearizations, to constant coefficient form, modulo smoothing terms. As an initial application, we demonstrate the existence of quasiperiodic solutions for certain hyperbolic systems. Despite the small denominator problem, our proof does not rely on traditional Nash-Moser/KAM-type schemes. To achieve this, we develop two key toolsets. The first focuses on the calculus of paracomposition operators introduced by Alinhac, interpreted as the flow map of a paraproduct vector field. We refine this approach to establish new estimates that precisely capture the dependence on the diffeomorphism in question. The second toolset addresses two classical reducibility problems: one for matrix differential operators and another for nearly parallel vector fields on the torus. We resolve these problems by paralinearizing the conjugacy equation and exploiting, at the paradifferential level, the specific algebraic structure of conjugacy problems, akin to Zehnder's approximate Nash-Moser approach.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Data-efficient 4D-STEM in SEM: Beyond 2D Materials to Metallic Materials
Authors:
Ujjval Bansal,
Amit Sharma,
Barbara Putz,
Christoph Kirchlechner,
Subin Lee
Abstract:
Four-dimensional scanning transmission electron microscopy (4D-STEM) is a powerful tool that allows for the simultaneous acquisition of spatial and diffraction information, driven by recent advancements in direct electron detector technology. Although 4D-STEM has been predominantly developed for and used in conventional TEM and STEM, efforts are being made to implement the technique in scanning el…
▽ More
Four-dimensional scanning transmission electron microscopy (4D-STEM) is a powerful tool that allows for the simultaneous acquisition of spatial and diffraction information, driven by recent advancements in direct electron detector technology. Although 4D-STEM has been predominantly developed for and used in conventional TEM and STEM, efforts are being made to implement the technique in scanning electron microscopy (SEM). In this paper, we push the boundaries of 4D-STEM in SEM and extend its capabilities in three key aspects: (1) faster acquisition rate with reduced data size, (2) higher angular resolution, and (3) application to various materials including conventional alloys and focused ion beam (FIB) lamella. Specifically, operating the MiniPIX Timepix3 detector in the event-driven mode significantly improves the acquisition rate by a factor of a few tenths compared to conventional frame-based mode, thereby opening up possibilities for integrating 4D-STEM into various in situ SEM testing. Furthermore, with a novel stage-detector geometry, a camera length of 160 mm is achieved which improves the angular resolution amplifying its utility, for example, magnetic or electric field imaging. Lastly, we successfully imaged a nanostructured platinum-copper thin film with a grain size of 16 nm and a thickness of 20 nm, and identified annealing twins in FIB-prepared polycrystalline copper using virtual darkfield imaging and orientation mapping. This work demonstrates the potential of synergetic combination of 4D-STEM with in situ experiments, and broadening its applications across a wide range of materials.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Vulnerability anti-patterns in Solidity: Increasing smart contracts security by reducing false alarms
Authors:
Tommaso Oss,
Carlos E. Budde
Abstract:
Turing completeness has made Ethereum smart contracts attractive to blockchain developers and attackers alike. To increase code security, many tools can now spot most known vulnerabilities$-$at the cost of production efficiency. Recent studies show false-positive ratios over 99% in state-of-the-art technologies: this makes them impractical for use in industry and have raised questions on the direc…
▽ More
Turing completeness has made Ethereum smart contracts attractive to blockchain developers and attackers alike. To increase code security, many tools can now spot most known vulnerabilities$-$at the cost of production efficiency. Recent studies show false-positive ratios over 99% in state-of-the-art technologies: this makes them impractical for use in industry and have raised questions on the direction of academic research. In this work we show how integrating and extending current analyses is not only feasible, but also a next logical step in smart-contract security. We propose light-weight static checks on the morphology and dynamics of Solidity code, stemming from a developer-centric notion of vulnerability, that we use to verify the output of other tools, flag potential false alarms, and suggest verifications. Besides technical details we implemented an open-source prototype. For three top-10 vulnerabilities it flags 324 warnings of other tools as false-positives, in 60 verified de-duplicated smart contracts selected from the blockchain by the presence of true (and false) vulnerabilities. This amounts to a 92%- to 100%-reduction in the number of false-positives for these vulnerabilities.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Dipolar Attraction of Superparamagnetic Nanoparticles
Authors:
Frederik L. Durhuus,
Marco Beleggia,
Cathrine Frandsen
Abstract:
For superparamagnetic nanoparticles (SMNPs), it is often claimed that the rapid thermal fluctuations of their magnetic moments negates the magnetic dipolar attraction, hence preventing aggregation in liquid suspension. However we find that this is a misconception. Using Langevin dynamics, we simulate SMNP pairs and the dimer clusters they form which is the simplest case of aggregation. To quantify…
▽ More
For superparamagnetic nanoparticles (SMNPs), it is often claimed that the rapid thermal fluctuations of their magnetic moments negates the magnetic dipolar attraction, hence preventing aggregation in liquid suspension. However we find that this is a misconception. Using Langevin dynamics, we simulate SMNP pairs and the dimer clusters they form which is the simplest case of aggregation. To quantify the tendency to aggregate, we introduce the dimer debonding time and calculate the average magnetic force of attraction which results from correlations in the fluctuating moments. Neither quantity has any dependence on the magnetocrystalline anisotropy, which determines the rate of superparamagnetic reversals, and comparing with computed Néel relaxation times we show that this holds for both blocked and superparamagnetic particles. These results imply that the phenomenon of superparamagnetism does not affect aggregation. Because the key dimensionless parameter for the Néel relaxation of a lone SMNP and the one for magnetic attraction have the same size and temperature scaling, there is a strong correlation between superparamagnetism and colloidal stability, as observed experimentally, but no causal relation.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
On the control of recurrent neural networks using constant inputs
Authors:
Cyprien Tamekue,
Ruiqi Chen,
ShiNung Ching
Abstract:
This paper investigates the controllability properties of a general class of recurrent neural networks that are widely used for hypothesis generation in theoretical neuroscience, including the modeling of large-scale human brain dynamics. Our study focuses on the control synthesis of such networks using constant and piecewise constant inputs, motivated by emerging applications in non-invasive neur…
▽ More
This paper investigates the controllability properties of a general class of recurrent neural networks that are widely used for hypothesis generation in theoretical neuroscience, including the modeling of large-scale human brain dynamics. Our study focuses on the control synthesis of such networks using constant and piecewise constant inputs, motivated by emerging applications in non-invasive neurostimulation such as transcranial direct current stimulation (tDCS). The neural network model considered is a continuous Hopfield-type system with nonlinear activation functions and arbitrary input matrices representing interactions among multiple brain regions. Our main contribution is the formulation and solution of a control synthesis problem for these nonlinear systems. We provide a proper generalization of the variation of the constants formula that constitutes a novel representation of the system's state trajectory. This representation admits a verifiable condition on the existence of the constant control input to solve a short-time two-point boundary value problem in the state space. This formulation admits a synthesis for the input in question, which can be realized using modern algorithmic optimization tools. In the case of linear activation functions, this analysis and synthesis reduces to the verification of algebraic conditions on the system matrices. Simulation results are presented to illustrate the theoretical findings and demonstrate the efficacy of the proposed control strategies. These results offer a novel control synthesis for an important class of neural network models that may, in turn, enable the design of brain stimulation protocols to modulate whole-brain activity in therapeutic and cognitive enhancement applications.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
VoiceBench: Benchmarking LLM-Based Voice Assistants
Authors:
Yiming Chen,
Xianghu Yue,
Chen Zhang,
Xiaoxue Gao,
Robby T. Tan,
Haizhou Li
Abstract:
Building on the success of large language models (LLMs), recent advancements such as GPT-4o have enabled real-time speech interactions through LLM-based voice assistants, offering a significantly improved user experience compared to traditional text-based interactions. However, the absence of benchmarks designed to evaluate these speech interaction capabilities has hindered progress of LLM-based v…
▽ More
Building on the success of large language models (LLMs), recent advancements such as GPT-4o have enabled real-time speech interactions through LLM-based voice assistants, offering a significantly improved user experience compared to traditional text-based interactions. However, the absence of benchmarks designed to evaluate these speech interaction capabilities has hindered progress of LLM-based voice assistants development. Current evaluations focus primarily on automatic speech recognition (ASR) or general knowledge evaluation with clean speeches, neglecting the more intricate, real-world scenarios that involve diverse speaker characteristics, environmental and content factors. To address this, we introduce VoiceBench, the first benchmark designed to provide a multi-faceted evaluation of LLM-based voice assistants. VoiceBench also includes both real and synthetic spoken instructions that incorporate the above three key real-world variations. Extensive experiments reveal the limitations of current LLM-based voice assistant models and offer valuable insights for future research and development in this field.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Language Model Non-myopic Generation for Reasoning and Planning
Authors:
Chang Ma,
Haiteng Zhao,
Junlei Zhang,
Junxian He,
Lingpeng Kong
Abstract:
Large Language Models have demonstrated remarkable abilities in reasoning and planning by breaking down complex problems into sequential steps. Despite their success in various domains like mathematical problem-solving and coding, LLMs face challenges in ensuring reliable and optimal planning due to their inherent myopic nature of autoregressive decoding. This paper revisits LLM reasoning from an…
▽ More
Large Language Models have demonstrated remarkable abilities in reasoning and planning by breaking down complex problems into sequential steps. Despite their success in various domains like mathematical problem-solving and coding, LLMs face challenges in ensuring reliable and optimal planning due to their inherent myopic nature of autoregressive decoding. This paper revisits LLM reasoning from an optimal-control perspective, proposing a novel method, Predictive-Decoding, that leverages Model Predictive Control to enhance planning accuracy. By re-weighting LLM distributions based on foresight trajectories, Predictive-Decoding aims to mitigate early errors and promote non-myopic planning. Our experiments show significant improvements in a wide range of tasks for math, coding, and agents. Furthermore, Predictive-Decoding demonstrates computational efficiency, outperforming search baselines with reduced computational resources. This study provides insights into optimizing LLM planning capabilities.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
A deterministic optimization algorithm for nonconvex and combinatorial bi-objective programming
Authors:
Ye Seol Lee,
George Jackson,
Amparo Galindo,
Claire S. Adjiman
Abstract:
any practical multiobjective optimization (MOO) problems include discrete decision variables and/or nonlinear model equations and exhibit disconnected or smooth but nonconvex Pareto surfaces. Scalarization methods, such as the weighted-sum and sandwich (SD) algorithms, are common approaches to solving MOO problems but may fail on nonconvex or discontinuous Pareto fronts. In the current work, motiv…
▽ More
any practical multiobjective optimization (MOO) problems include discrete decision variables and/or nonlinear model equations and exhibit disconnected or smooth but nonconvex Pareto surfaces. Scalarization methods, such as the weighted-sum and sandwich (SD) algorithms, are common approaches to solving MOO problems but may fail on nonconvex or discontinuous Pareto fronts. In the current work, motivated by the well-known normal boundary intersection (NBI) method and the SD algorithm, we present SDNBI, a new algorithm for bi-objective optimization (BOO) designed to address the theoretical and numerical challenges associated with the reliable solution of general nonconvex and discrete BOO problems. The main improvements in the algorithm are the effective exploration of the nonconvex regions of the Pareto front and, uniquely, the early identification of regions where no additional Pareto solutions exist. The performance of the SDNBI algorithm is assessed based on the accuracy of the approximation of the Pareto front constructed over the disconnected nonconvex objective domains. The new algorithm is compared with two MOO approaches, the modified NBI method and the SD algorithm, using published benchmark problems. The results indicate that the SDNBI algorithm outperforms the modified NBI and SD algorithms in solving convex, nonconvex-continuous, and combinatorial problems, both in terms of computational cost and of the overall quality of the Pareto-optimal set, suggesting that the SDNBI algorithm is a promising alternative for solving BOO problems.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Temporal and Spectral Analysis of the Unique and Second Brightest Gamma-Ray Burst GRB 230307A: Insights from GECAM and Fermi/GBM Observations
Authors:
R. Moradi,
C. W. Wang,
B. Zhang,
Y. Wang,
S. -L. Xiong,
S. -X. Yi,
W. -J. Tan,
M. Karlica,
S. -N. Zhang
Abstract:
In this study, we present the pulse profile of the unique and the second brightest gamma-ray burst GRB 230307A, and analyze its temporal behavior using a joint GECAM--Fermi/GBM time-resolved spectral analysis. The utilization of GECAM data is advantageous as it successfully captured significant data during the pile-up period of the Fermi/GBM. We investigate the evolution of its flux, photon fluenc…
▽ More
In this study, we present the pulse profile of the unique and the second brightest gamma-ray burst GRB 230307A, and analyze its temporal behavior using a joint GECAM--Fermi/GBM time-resolved spectral analysis. The utilization of GECAM data is advantageous as it successfully captured significant data during the pile-up period of the Fermi/GBM. We investigate the evolution of its flux, photon fluence, photon flux, peak energy, and the corresponding hardness-intensity and hardness-flux correlations. The findings within the first 27 seconds exhibit consistent patterns reported previously, providing valuable insights for comparing observations with predictions from the synchrotron radiation model invoking an expanding shell. Beyond the initial 27 seconds, we observe a notable transition in the emitted radiation, attributed to high latitude emission (HLE), influenced by the geometric properties of the shells and the relativistic Doppler effects. By modeling the data within the framework of the large-radius internal shock model, we discuss the required parameters as well as the limitations of the model. We conclude that a more complicated synchrotron emission model is needed to fully describe the observational data of GRB 230307A.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
From Attention to Activation: Unravelling the Enigmas of Large Language Models
Authors:
Prannay Kaul,
Chengcheng Ma,
Ismail Elezi,
Jiankang Deng
Abstract:
We study two strange phenomena in auto-regressive Transformers: (1) the dominance of the first token in attention heads; (2) the occurrence of large outlier activations in the hidden states. We find that popular large language models, such as Llama attend maximally to the first token in 98% of attention heads, a behaviour we attribute to the softmax function. To mitigate this issue, we propose a r…
▽ More
We study two strange phenomena in auto-regressive Transformers: (1) the dominance of the first token in attention heads; (2) the occurrence of large outlier activations in the hidden states. We find that popular large language models, such as Llama attend maximally to the first token in 98% of attention heads, a behaviour we attribute to the softmax function. To mitigate this issue, we propose a reformulation of softmax to softmax-1. Furthermore, we identify adaptive optimisers, e.g. Adam, as the primary contributor to the large outlier activations and introduce OrthoAdam, a novel optimiser that utilises orthogonal matrices to transform gradients, to address this issue. Finally, not only do our methods prevent these phenomena from occurring, but additionally, they enable Transformers to sustain their performance when quantised using basic algorithms, something that standard methods are unable to do. In summary, our methods reduce the attention proportion on the first token from 65% to 3.3%, the activation kurtosis in the hidden states from 1657 to 3.1, and perplexity penalty under 4-bit weight quantisation from 3565 to 0.3.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study
Authors:
J. Jorge,
T. Barros,
C. Premebida,
M. Aleksandrov,
D. Goehring,
U. J. Nunes
Abstract:
Simultaneous Localization and Mapping (SLAM) is a key component of autonomous systems operating in environments that require a consistent map for reliable localization. SLAM has been a widely studied topic for decades with most of the solutions being camera or LiDAR based. Early LiDAR-based approaches primarily relied on 2D data, whereas more recent frameworks use 3D data. In this work, we survey…
▽ More
Simultaneous Localization and Mapping (SLAM) is a key component of autonomous systems operating in environments that require a consistent map for reliable localization. SLAM has been a widely studied topic for decades with most of the solutions being camera or LiDAR based. Early LiDAR-based approaches primarily relied on 2D data, whereas more recent frameworks use 3D data. In this work, we survey recent 3D LiDAR-based Graph-SLAM methods in urban environments, aiming to compare their strengths, weaknesses, and limitations. Additionally, we evaluate their robustness regarding the LiDAR resolution namely 64 $vs$ 128 channels. Regarding SLAM methods, we evaluate SC-LeGO-LOAM, SC-LIO-SAM, Cartographer, and HDL-Graph on real-world urban environments using the KITTI odometry dataset (a LiDAR with 64-channels only) and a new dataset (AUTONOMOS-LABS). The latter dataset, collected using instrumented vehicles driving in Berlin suburban area, comprises both 64 and 128 LiDARs. The experimental results are reported in terms of quantitative `metrics' and complemented by qualitative maps.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Towards Map-Agnostic Policies for Adaptive Informative Path Planning
Authors:
Julius Rückin,
David Morilla-Cabello,
Cyrill Stachniss,
Eduardo Montijano,
Marija Popović
Abstract:
Robots are frequently tasked to gather relevant sensor data in unknown terrains. A key challenge for classical path planning algorithms used for autonomous information gathering is adaptively replanning paths online as the terrain is explored given limited onboard compute resources. Recently, learning-based approaches emerged that train planning policies offline and enable computationally efficien…
▽ More
Robots are frequently tasked to gather relevant sensor data in unknown terrains. A key challenge for classical path planning algorithms used for autonomous information gathering is adaptively replanning paths online as the terrain is explored given limited onboard compute resources. Recently, learning-based approaches emerged that train planning policies offline and enable computationally efficient online replanning performing policy inference. These approaches are designed and trained for terrain monitoring missions assuming a single specific map representation, which limits their applicability to different terrains. To address these issues, we propose a novel formulation of the adaptive informative path planning problem unified across different map representations, enabling training and deploying planning policies in a larger variety of monitoring missions. Experimental results validate that our novel formulation easily integrates with classical non-learning-based planning approaches while maintaining their performance. Our trained planning policy performs similarly to state-of-the-art map-specifically trained policies. We validate our learned policy on unseen real-world terrain datasets.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Evidence for a Lattice Supersolid of Subradiant Dipolar Excitons
Authors:
Camille Lagoin,
Corentin Morin,
Kirk Baldwin,
Loren Pfeiffer,
Francois Dubin
Abstract:
In condensed-matter physics, supersolids refer to many-body quantum states breaking translational symmetry while exhibiting off-diagonal long-range order. This combination is debated for commensurate crystals, however it is accessible in lattice potentials fractionally filled by bosons with extended interactions. Here, we report such lattice supersolid with dipolar excitons confined in a sub-wavel…
▽ More
In condensed-matter physics, supersolids refer to many-body quantum states breaking translational symmetry while exhibiting off-diagonal long-range order. This combination is debated for commensurate crystals, however it is accessible in lattice potentials fractionally filled by bosons with extended interactions. Here, we report such lattice supersolid with dipolar excitons confined in a sub-wavelength period potential. Excitons implement the Dicke-Hubbard Hamiltonian controlled by spatially extended dipolar and Dicke correlations. At half lattice-filling, these induce both a condensation in a single sub-radiant state and a dipolar quantum order spontaneously breaking the lattice symmetry. This combination signals a lattice supersolid dissipatively prepared across 8x8 sites. Our study underlines that nanoscopic exciton arrays open a route to explore new frontiers of quantum matter, e.g. many-body entanglement, at the interface with quantum nano-photonics.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Modes of convergence of sequences of holomorphic functions: a linear point of view
Authors:
L. Bernal-González,
M. C. Calderón-Moreno,
J. López-Salazar,
J. A. Prado-Bassas
Abstract:
In this paper, pointwise convergence, uniform convergence and compact convergence of sequences of holomorphic functions on an open subset of the complex plane are compared from a linear point of view. In fact, it is proved the existence of large linear algebras consisting, except for zero, of sequences of holomorphic functions tending to zero compactly but not uniformly on the open set or of seque…
▽ More
In this paper, pointwise convergence, uniform convergence and compact convergence of sequences of holomorphic functions on an open subset of the complex plane are compared from a linear point of view. In fact, it is proved the existence of large linear algebras consisting, except for zero, of sequences of holomorphic functions tending to zero compactly but not uniformly on the open set or of sequences of holomorphic functions tending pointwisely to zero but not compactly. Also dense linear subspaces in an appropriate Fréchet space as well as infinite dimensional Banach spaces of sequences converging to zero in the mentioned modes are shown to exist.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Interpreting tunneling spectroscopic maps of a dinuclear Co(II) complex on gold
Authors:
Roberto Robles,
Chao Li,
Sara Realista,
Paulo Nuno Martinho,
Manuel Gruber,
Alexander Weismann,
Nicolás Lorente,
Richard Berndt
Abstract:
Scanning tunneling microscope data from a dinuclear Co(II) complex adsorbed on Au(111) are analysed using density functional theory calculations. We find that the interaction with the substrate substantially changes the geometry of the non-planar molecule. Its electronic states, however, remain fairly similar to those calculated for a gas-phase molecule. The calculations reproduce intriguing contr…
▽ More
Scanning tunneling microscope data from a dinuclear Co(II) complex adsorbed on Au(111) are analysed using density functional theory calculations. We find that the interaction with the substrate substantially changes the geometry of the non-planar molecule. Its electronic states, however, remain fairly similar to those calculated for a gas-phase molecule. The calculations reproduce intriguing contrasts observed in experimental maps of the differential conductance dI/dV and reveal the relative importance of geometric and electronic factors that impinge on the image contrasts. For a meaningful comparison, it is important that the calculations closely mimic the experimental mode of measurement.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
AI Future Envisioning with PLACARD
Authors:
Mary C. Tedeschi,
Paola Ricaurte,
Sridevi Ayloo,
Joseph Corneli,
Charles Jeffrey Danoff,
Sergio Belich
Abstract:
At EuroPLoP 2024 Mary Tedeschi led the "AI Future Envisioning with PLACARD" focus group in Germany. Three conference attendees joined in the room while Sridevi, Paola, and Charles co-facilitated remotely via a web conference. The participants were introduced to a Futures Studies technique with the goal of capturing envisionments of Artificial Intelligence (AI) going forward. To set an atmosphere a…
▽ More
At EuroPLoP 2024 Mary Tedeschi led the "AI Future Envisioning with PLACARD" focus group in Germany. Three conference attendees joined in the room while Sridevi, Paola, and Charles co-facilitated remotely via a web conference. The participants were introduced to a Futures Studies technique with the goal of capturing envisionments of Artificial Intelligence (AI) going forward. To set an atmosphere a technology focused card game was used to make the session more interactive. To close everyone co-created a Project Action Review to recap of the event to capture learnings that has been summarized in this paper. The Focus Group was structured based on lessons learned over six earlier iterations.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
A Bayesian Perspective on the Maximum Score Problem
Authors:
Christopher D. Walker
Abstract:
This paper presents a Bayesian inference framework for a linear index threshold-crossing binary choice model that satisfies a median independence restriction. The key idea is that the model is observationally equivalent to a probit model with nonparametric heteroskedasticity. Consequently, Gibbs sampling techniques from Albert and Chib (1993) and Chib and Greenberg (2013) lead to a computationally…
▽ More
This paper presents a Bayesian inference framework for a linear index threshold-crossing binary choice model that satisfies a median independence restriction. The key idea is that the model is observationally equivalent to a probit model with nonparametric heteroskedasticity. Consequently, Gibbs sampling techniques from Albert and Chib (1993) and Chib and Greenberg (2013) lead to a computationally attractive Bayesian inference procedure in which a Gaussian process forms a conditionally conjugate prior for the natural logarithm of the skedastic function.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Recovering the cluster picture of a polynomial over a discretely valued field
Authors:
Lilybelle Cowland Kellock
Abstract:
For $f(x)$ a separable polynomial of degree $d$ over a discretely valued field $K$, we describe how the cluster picture of $f(x)$ over $K$, in other words the set of tuples $\{(\mathrm{ord}(x_i-x_j),i,j) : 1\leq i< j \leq d \}$ where $x_1,\dots,x_d$ are the roots of $f(x)$, can be recovered without knowing the roots of $f(x)$ over $\bar{K}$. We construct an explicit list of polynomials…
▽ More
For $f(x)$ a separable polynomial of degree $d$ over a discretely valued field $K$, we describe how the cluster picture of $f(x)$ over $K$, in other words the set of tuples $\{(\mathrm{ord}(x_i-x_j),i,j) : 1\leq i< j \leq d \}$ where $x_1,\dots,x_d$ are the roots of $f(x)$, can be recovered without knowing the roots of $f(x)$ over $\bar{K}$. We construct an explicit list of polynomials $g_d^{(1)},\dots,g_d^{(t_d)}\in\mathbb{Z}[A_0,\dots,A_{d-1}]$ such that the valuations $\mathrm{ord}(g_{d}^{(i)}(a_0,\dots,a_{d-1}))$ for $i=1,\dots,t_d$ uniquely determine this set of distances for the polynomial $f(x)=c_f(x^d+a_{d-1}x^{d-1}+\dots+a_0)$, and we describe the process by which they do so. We use this to deduce that if $C:y^2=f(x)$ is a hyperelliptic curve over a local field $K$, this list of valuations of polynomials in the coefficients of $f(x)$ uniquely determines the dual graph of the special fibre of the minimal strict normal crossings model of $C/K^{\mathrm{unr}}$, the inertia action on the Tate module and the conductor exponent. This provides a hyperelliptic curves analogue to a corollary of Tate's algorithm, that in residue characteristic $p\geq 5$ the dual graph of special fibre of the the minimal regular model of an elliptic curve $E/K^{\mathrm{unr}}$ is uniquely determined by the valuation of $j_E$ and $Δ_E$.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
The XLZD Design Book: Towards the Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics
Authors:
XLZD Collaboration,
J. Aalbers,
K. Abe,
M. Adrover,
S. Ahmed Maouloud,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
L. Althueser,
D. W. P. Amaral,
C. S. Amarasinghe,
A. Ames,
B. Andrieu,
N. Angelides,
E. Angelino,
B. Antunovic,
E. Aprile,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
M. Babicz,
D. Bajpai,
A. Baker,
M. Balzer,
J. Bang
, et al. (419 additional authors not shown)
Abstract:
This report describes the experimental strategy and technologies for a next-generation xenon observatory sensitive to dark matter and neutrino physics. The detector will have an active liquid xenon target mass of 60-80 tonnes and is proposed by the XENON-LUX-ZEPLIN-DARWIN (XLZD) collaboration. The design is based on the mature liquid xenon time projection chamber technology of the current-generati…
▽ More
This report describes the experimental strategy and technologies for a next-generation xenon observatory sensitive to dark matter and neutrino physics. The detector will have an active liquid xenon target mass of 60-80 tonnes and is proposed by the XENON-LUX-ZEPLIN-DARWIN (XLZD) collaboration. The design is based on the mature liquid xenon time projection chamber technology of the current-generation experiments, LZ and XENONnT. A baseline design and opportunities for further optimization of the individual detector components are discussed. The experiment envisaged here has the capability to explore parameter space for Weakly Interacting Massive Particle (WIMP) dark matter down to the neutrino fog, with a 3$σ$ evidence potential for the spin-independent WIMP-nucleon cross sections as low as $3\times10^{-49}\rm cm^2$ (at 40 GeV/c$^2$ WIMP mass). The observatory is also projected to have a 3$σ$ observation potential of neutrinoless double-beta decay of $^{136}$Xe at a half-life of up to $5.7\times 10^{27}$ years. Additionally, it is sensitive to astrophysical neutrinos from the atmosphere, sun, and galactic supernovae.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
Authors:
Li Siyan,
Vethavikashini Chithrra Raghuram,
Omar Khattab,
Julia Hirschberg,
Zhou Yu
Abstract:
Users can divulge sensitive information to proprietary LLM providers, raising significant privacy concerns. While open-source models, hosted locally on the user's machine, alleviate some concerns, models that users can host locally are often less capable than proprietary frontier models. Toward preserving user privacy while retaining the best quality, we propose Privacy-Conscious Delegation, a nov…
▽ More
Users can divulge sensitive information to proprietary LLM providers, raising significant privacy concerns. While open-source models, hosted locally on the user's machine, alleviate some concerns, models that users can host locally are often less capable than proprietary frontier models. Toward preserving user privacy while retaining the best quality, we propose Privacy-Conscious Delegation, a novel task for chaining API-based and local models. We utilize recent public collections of user-LLM interactions to construct a natural benchmark called PUPA, which contains personally identifiable information (PII). To study potential approaches, we devise PAPILLON, a multi-stage LLM pipeline that uses prompt optimization to address a simpler version of our task. Our best pipeline maintains high response quality for 85.5% of user queries while restricting privacy leakage to only 7.5%. We still leave a large margin to the generation quality of proprietary LLMs for future work. Our data and code will be available at https://github.com/siyan-sylvia-li/PAPILLON.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Dynamic Glucose Enhanced Imaging using Direct Water Saturation
Authors:
Linda Knutsson,
Nirbhay N. Yadav,
Sajad Mohammed Ali,
David Olayinka Kamson,
Eleni Demetriou,
Anina Seidemo,
Lindsay Blair,
Doris D. Lin,
John Laterra,
Peter C. M. van Zijl
Abstract:
Purpose: Dynamic glucose enhanced (DGE) MRI studies employ chemical exchange saturation transfer (CEST) or spin lock (CESL) to study glucose uptake. Currently, these methods are hampered by low effect size and sensitivity to motion. To overcome this, we propose to utilize exchange-based linewidth (LW) broadening of the direct water saturation (DS) curve of the water saturation spectrum (Z-spectrum…
▽ More
Purpose: Dynamic glucose enhanced (DGE) MRI studies employ chemical exchange saturation transfer (CEST) or spin lock (CESL) to study glucose uptake. Currently, these methods are hampered by low effect size and sensitivity to motion. To overcome this, we propose to utilize exchange-based linewidth (LW) broadening of the direct water saturation (DS) curve of the water saturation spectrum (Z-spectrum) during and after glucose infusion (DS-DGE MRI). Methods: To estimate the glucose-infusion-induced LW changes ($Δ$LW), Bloch-McConnell simulations were performed for normoglycemia and hyperglycemia in blood, gray matter (GM), white matter (WM), CSF, and malignant tumor tissue. Whole-brain DS-DGE imaging was implemented at 3 tesla using dynamic Z-spectral acquisitions (1.2 s per offset frequency, 38 s per spectrum) and assessed on four brain tumor patients using infusion of 35 g of D-glucose. To assess $Δ$LW, a deep learning-based Lorentzian fitting approach was employed on voxel-based DS spectra acquired before, during, and post-infusion. Area-under-the-curve (AUC) images, obtained from the dynamic $Δ$LW time curves, were compared qualitatively to perfusion-weighted imaging (PWI). Results: In simulations, $Δ$LW was 1.3%, 0.30%, 0.29/0.34%, 7.5%, and 13% in arterial blood, venous blood, GM/WM, malignant tumor tissue, and CSF, respectively. In vivo, $Δ$LW was approximately 1% in GM/WM, 5-20% for different tumor types, and 40% in CSF. The resulting DS-DGE AUC maps clearly outlined lesion areas. Conclusions: DS-DGE MRI is highly promising for assessing D-glucose uptake. Initial results in brain tumor patients show high-quality AUC maps of glucose-induced line broadening and DGE-based lesion enhancement similar and/or complementary to PWI.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Learning Load Balancing with GNN in MPTCP-Enabled Heterogeneous Networks
Authors:
Han Ji,
Xiping Wu,
Zhihong Zeng,
Chen Chen
Abstract:
Hybrid light fidelity (LiFi) and wireless fidelity (WiFi) networks are a promising paradigm of heterogeneous network (HetNet), attributed to the complementary physical properties of optical spectra and radio frequency. However, the current development of such HetNets is mostly bottlenecked by the existing transmission control protocol (TCP), which restricts the user equipment (UE) to connecting on…
▽ More
Hybrid light fidelity (LiFi) and wireless fidelity (WiFi) networks are a promising paradigm of heterogeneous network (HetNet), attributed to the complementary physical properties of optical spectra and radio frequency. However, the current development of such HetNets is mostly bottlenecked by the existing transmission control protocol (TCP), which restricts the user equipment (UE) to connecting one access point (AP) at a time. While the ongoing investigation on multipath TCP (MPTCP) can bring significant benefits, it complicates the network topology of HetNets, making the existing load balancing (LB) learning models less effective. Driven by this, we propose a graph neural network (GNN)-based model to tackle the LB problem for MPTCP-enabled HetNets, which results in a partial mesh topology. Such a topology can be modeled as a graph, with the channel state information and data rate requirement embedded as node features, while the LB solutions are deemed as edge labels. Compared to the conventional deep neural network (DNN), the proposed GNN-based model exhibits two key strengths: i) it can better interpret a complex network topology; and ii) it can handle various numbers of APs and UEs with a single trained model. Simulation results show that against the traditional optimisation method, the proposed learning model can achieve near-optimal throughput within a gap of 11.5%, while reducing the inference time by 4 orders of magnitude. In contrast to the DNN model, the new method can improve the network throughput by up to 21.7%, at a similar inference time level.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Dynamic Massive Star Formation: Radio Flux Variability in UCHII Regions
Authors:
A. Y. Yang,
M. A. Thompson,
J. S. Urquhart,
A. Brunthaler,
K. M. Menten,
Y. Gong,
Chao-Wei Tsai,
A. L. Patel,
D. Li,
W. D. Cotton
Abstract:
Context:
Theoretical models of early accretion during the formation process of massive stars have predicted that HII regions exhibit radio variability on timescales of decades. However, large-scale searches for such temporal variations with sufficient sensitivity have not yet been carried out.
Aims:
We aim to identify HII regions with variable radio wavelength fluxes and to investigate the p…
▽ More
Context:
Theoretical models of early accretion during the formation process of massive stars have predicted that HII regions exhibit radio variability on timescales of decades. However, large-scale searches for such temporal variations with sufficient sensitivity have not yet been carried out.
Aims:
We aim to identify HII regions with variable radio wavelength fluxes and to investigate the properties of the identified objects, especially those with the highest level of variability.
Methods:
We compared the peak flux densities of 86 ultracompact HII (UCHII) regions measured by the GLOSTAR and CORNISH surveys and identified variables that show flux variations higher than 30% over ~8 yr timespan between these surveys.
Results:
We found a sample of 38 variable UCHII regions, which is the largest sample identified to date. The overall occurrence of variability is 44$\pm$5%, suggesting that variation in UCHII regions is significantly more common than prediction.
The variable UCHII regions are found to be younger than non-variable UCHII regions, all of them meeting the size criterion of hypercompact (HC) HII regions. We studied the 7 UCHII regions (the ``Top7'') that show the highest variability with variations > 100%.
The Top7 variable UCHII regions are optically thick at 4--8 GHz and compact, suggesting they are in a very early evolutionary stage of HCHII or UCHII regions. There is a significant correlation between variability and the spectral index of the radio emission. No dependence is observed between the variations and the properties of the sources' natal clumps traced by submillimeter continuum emission from dust, although variable HII regions are found in clumps at an earlier evolutionary stage.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Security and RAS in the Computing Continuum
Authors:
Martí Alonso,
David Andreu,
Ramon Canal,
Stefano Di Carlo,
Odysseas Chatzopoulos,
Cristiano Chenet,
Juanjo Costa,
Andreu Girones,
Dimitris Gizopoulos,
George Papadimitriou,
Enric Morancho,
Beatriz Otero,
Alessandro Savino
Abstract:
Security and RAS are two non-functional requirements under focus for current systems developed for the computing continuum. Due to the increased number of interconnected computer systems across the continuum, security becomes especially pervasive at all levels, from the smallest edge device to the high-performance cloud at the other end. Similarly, RAS (Reliability, Availability, and Serviceabilit…
▽ More
Security and RAS are two non-functional requirements under focus for current systems developed for the computing continuum. Due to the increased number of interconnected computer systems across the continuum, security becomes especially pervasive at all levels, from the smallest edge device to the high-performance cloud at the other end. Similarly, RAS (Reliability, Availability, and Serviceability) ensures the robustness of a system towards hardware defects. Namely, making them reliable, with high availability and design for easy service. In this paper and as a result of the Vitamin-V EU project, the authors detail the comprehensive approach to malware and hardware attack detection; as well as, the RAS features envisioned for future systems across the computing continuum.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
The FLAMINGO project: Baryon effects on the matter power spectrum
Authors:
Matthieu Schaller,
Joop Schaye,
Roi Kugel,
Jeger C. Broxterman,
Marcel P. van Daalen
Abstract:
The effect of baryon physics associated with galaxy formation onto the large-scale matter distribution of the Universe is a key uncertainty in the theoretical modelling required for the interpretation of Stage IV cosmology surveys. We use the FLAMINGO suite of simulations to study the baryon response due to galaxy formation of the total matter power spectrum. We find that it is only well converged…
▽ More
The effect of baryon physics associated with galaxy formation onto the large-scale matter distribution of the Universe is a key uncertainty in the theoretical modelling required for the interpretation of Stage IV cosmology surveys. We use the FLAMINGO suite of simulations to study the baryon response due to galaxy formation of the total matter power spectrum. We find that it is only well converged for simulation volumes in excess of $(200~Mpc)^3$. We report results for simulations of varying feedback intensity, which either match the X-ray inferred gas fractions in clusters and the z=0 stellar mass function, or shifted versions of the data, as well as for different implementations of AGN feedback. We package our results in the form of a Gaussian process emulator which can rapidly reproduce all the simulations' predictions to better than 1% up to the comoving wavenumber $k = 10~h/Mpc$ and up to z=2 for all the feedback models present in the FLAMINGO suite. We find that the response becomes stronger, the range of scales affected increases, and the position of the minimum of the response moves to smaller scales as the redshift decreases. We find that lower gas fractions in groups and clusters lead to a stronger response and that the use of collimated jets instead of thermally driven winds for AGN feedback enhances the effect. Lowering the stellar masses at fixed cluster gas fractions also increases the magnitude of the response. We find only a small (1% at $k<10~h/Mpc$) dependence of our results on the background cosmology, but a wider range of cosmology variations will be needed to confirm this result. The response we obtain for our strongest feedback models is compatible with some of the recent analyses combining weak lensing with external data. Such a response is, however, in strong tension with the X-ray inferred gas fractions in clusters used to calibrate the FLAMINGO model.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
A general framework for probabilistic model uncertainty
Authors:
Vik Shirvaikar,
Stephen G. Walker,
Chris Holmes
Abstract:
Existing approaches to model uncertainty typically either compare models using a quantitative model selection criterion or evaluate posterior model probabilities having set a prior. In this paper, we propose an alternative strategy which views missing observations as the source of model uncertainty, where the true model would be identified with the complete data. To quantify model uncertainty, it…
▽ More
Existing approaches to model uncertainty typically either compare models using a quantitative model selection criterion or evaluate posterior model probabilities having set a prior. In this paper, we propose an alternative strategy which views missing observations as the source of model uncertainty, where the true model would be identified with the complete data. To quantify model uncertainty, it is then necessary to provide a probability distribution for the missing observations conditional on what has been observed. This can be set sequentially using one-step-ahead predictive densities, which recursively sample from the best model according to some consistent model selection criterion. Repeated predictive sampling of the missing data, to give a complete dataset and hence a best model each time, provides our measure of model uncertainty. This approach bypasses the need for subjective prior specification or integration over parameter spaces, addressing issues with standard methods such as the Bayes factor. Predictive resampling also suggests an alternative view of hypothesis testing as a decision problem based on a population statistic, where we directly index the probabilities of competing models. In addition to hypothesis testing, we provide illustrations from density estimation and variable selection, demonstrating our approach on a range of standard problems.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
General Seemingly Unrelated Local Projections
Authors:
Florian Huber,
Christian Matthes,
Michael Pfarrhofer
Abstract:
We provide a framework for efficiently estimating impulse response functions with Local Projections (LPs). Our approach offers a Bayesian treatment for LPs with Instrumental Variables, accommodating multiple shocks and instruments per shock, accounts for autocorrelation in multi-step forecasts by jointly modeling all LPs as a seemingly unrelated system of equations, defines a flexible yet parsimon…
▽ More
We provide a framework for efficiently estimating impulse response functions with Local Projections (LPs). Our approach offers a Bayesian treatment for LPs with Instrumental Variables, accommodating multiple shocks and instruments per shock, accounts for autocorrelation in multi-step forecasts by jointly modeling all LPs as a seemingly unrelated system of equations, defines a flexible yet parsimonious joint prior for impulse responses based on a Gaussian Process, allows for joint inference about the entire vector of impulse responses, and uses all available data across horizons by imputing missing values.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
The hadronic light-by-light contribution to the muon $g{-}2$ using staggered fermions at the physical point
Authors:
Christian Zimmermann,
Antoine Gérardin
Abstract:
Hadronic contributions dominate the uncertainty of the Standard Model prediction for the anomalous magnetic moment of the muon. In this work, we present results on the hadronic light-by-light contribution obtained from the evaluation of the hadronic four-point function of electromagnetic currents using the position-space formalism developed by the Mainz group. The simulations are performed with st…
▽ More
Hadronic contributions dominate the uncertainty of the Standard Model prediction for the anomalous magnetic moment of the muon. In this work, we present results on the hadronic light-by-light contribution obtained from the evaluation of the hadronic four-point function of electromagnetic currents using the position-space formalism developed by the Mainz group. The simulations are performed with staggered fermions directly at the physical point. Several physical volumes are used to estimate finite volume effects. This direct lattice study is supplemented by considering the contribution of the light pseudoscalar pole in both finite and infinite volumes, where we reuse the pseudoscalar transition form factors that have been evaluated in previous simulations on the same ensembles.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Cohen-Macaulay, Gorenstein and complete intersection conditions by marked bases
Authors:
Cristina Bertone,
Francesca Cioffi,
Matthias Orth,
Werner M. Seiler
Abstract:
Using techniques coming from the theory of marked bases, we develop new computational methods for detection and construction of Cohen-Macaulay, Gorenstein and complete intersection homogeneous polynomial ideals. Thanks to the functorial properties of marked bases, an elementary and effective proof of the openness of arithmetically Cohen-Macaulay, arithmetically Gorenstein and strict complete inter…
▽ More
Using techniques coming from the theory of marked bases, we develop new computational methods for detection and construction of Cohen-Macaulay, Gorenstein and complete intersection homogeneous polynomial ideals. Thanks to the functorial properties of marked bases, an elementary and effective proof of the openness of arithmetically Cohen-Macaulay, arithmetically Gorenstein and strict complete intersection loci in a Hilbert scheme follows, for a non-constant Hilbert polynomial.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Impact of the North Atlantic Oscillation on the subtropical and subpolar gyres
Authors:
Dhruv Bhagtani,
Andy McC. Hogg,
Ryan M. Holmes,
Navid C. Constantinou,
Hemant Khatri
Abstract:
The North Atlantic Oscillation (NAO) is a leading mode of atmospheric variability, affecting the North Atlantic Ocean on sub-seasonal to multi-decadal timescales. The NAO changes the atmospheric forcing at the ocean's surface, including winds and surface buoyancy fluxes, both of which are known to impact large-scale gyre circulation. However, the relative role of other physical processes (such as…
▽ More
The North Atlantic Oscillation (NAO) is a leading mode of atmospheric variability, affecting the North Atlantic Ocean on sub-seasonal to multi-decadal timescales. The NAO changes the atmospheric forcing at the ocean's surface, including winds and surface buoyancy fluxes, both of which are known to impact large-scale gyre circulation. However, the relative role of other physical processes (such as mesoscale eddies and topography) in influencing gyre circulation under NAO variability is not fully understood. Here, we analyze a series of ocean--sea ice simulations using a barotropic vorticity budget to understand long-term response of the North Atlantic gyre circulation to NAO forcing. We find that for each standard deviation increase in the NAO index, the subtropical and subpolar gyres intensify by 0.90 Sv and 3.41 Sv (1 Sv = 10$^6$ m$^3$ s$^{-1}$) respectively. The NAO-induced wind stress anomalies drive approximately 90% of the change in the subtropical gyre's interior flow. However, in the subpolar gyre's interior, in addition to wind stress, flow-topography interactions, stratification (influenced by surface heat fluxes), and non-linear advection significantly influence the circulation. Along the western boundary the bottom pressure torque plays a key role in steering the flow, and the vorticity input by the bottom pressure torque is partly redistributed by non-linear advection. Our study highlights the importance of both atmospheric forcing and oceanic dynamical processes in driving long-term gyre circulation responses to the NAO.
△ Less
Submitted 18 October, 2024;
originally announced October 2024.
-
Majorana-metal transition in a disordered superconductor: percolation in a landscape of topological domain walls
Authors:
V. A. Zakharov,
I. C. Fulga,
G. Lemut,
J. Tworzydlo,
C. W. J. Beenakker
Abstract:
Most superconductors are thermal insulators. A disordered chiral \textit{p}-wave superconductor, however, can make a transition to a thermal metal phase. Because heat is then transported by Majorana fermions, this phase is referred to as a Majorana metal. Here we present numerical evidence that the mechanism for the phase transition with increasing electrostatic disorder is the percolation of boun…
▽ More
Most superconductors are thermal insulators. A disordered chiral \textit{p}-wave superconductor, however, can make a transition to a thermal metal phase. Because heat is then transported by Majorana fermions, this phase is referred to as a Majorana metal. Here we present numerical evidence that the mechanism for the phase transition with increasing electrostatic disorder is the percolation of boundaries separating domains of different Chern number. We construct the network of domain walls using the spectral localizer as a ``topological landscape function'', and obtain the thermal metal--insulator phase diagram from the percolation transition.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Combinatorial Logistic Bandits
Authors:
Xutong Liu,
Xiangxiang Dai,
Xuchuang Wang,
Mohammad Hajiesmaili,
John C. S. Lui
Abstract:
We introduce a novel framework called combinatorial logistic bandits (CLogB), where in each round, a subset of base arms (called the super arm) is selected, with the outcome of each base arm being binary and its expectation following a logistic parametric model. The feedback is governed by a general arm triggering process. Our study covers CLogB with reward functions satisfying two smoothness cond…
▽ More
We introduce a novel framework called combinatorial logistic bandits (CLogB), where in each round, a subset of base arms (called the super arm) is selected, with the outcome of each base arm being binary and its expectation following a logistic parametric model. The feedback is governed by a general arm triggering process. Our study covers CLogB with reward functions satisfying two smoothness conditions, capturing application scenarios such as online content delivery, online learning to rank, and dynamic channel allocation. We first propose a simple yet efficient algorithm, CLogUCB, utilizing a variance-agnostic exploration bonus. Under the 1-norm triggering probability modulated (TPM) smoothness condition, CLogUCB achieves a regret bound of $\tilde{O}(d\sqrt{κKT})$, where $\tilde{O}$ ignores logarithmic factors, $d$ is the dimension of the feature vector, $κ$ represents the nonlinearity of the logistic model, and $K$ is the maximum number of base arms a super arm can trigger. This result improves on prior work by a factor of $\tilde{O}(\sqrtκ)$. We then enhance CLogUCB with a variance-adaptive version, VA-CLogUCB, which attains a regret bound of $\tilde{O}(d\sqrt{KT})$ under the same 1-norm TPM condition, improving another $\tilde{O}(\sqrtκ)$ factor. VA-CLogUCB shows even greater promise under the stronger triggering probability and variance modulated (TPVM) condition, achieving a leading $\tilde{O}(d\sqrt{T})$ regret, thus removing the additional dependency on the action-size $K$. Furthermore, we enhance the computational efficiency of VA-CLogUCB by eliminating the nonconvex optimization process when the context feature map is time-invariant while maintaining the tight $\tilde{O}(d\sqrt{T})$ regret. Finally, experiments on synthetic and real-world datasets demonstrate the superior performance of our algorithms compared to benchmark algorithms.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Personalized Playback Technology: How Short Video Services Create Excellent User Experience
Authors:
Weihui Deng,
Zhiwei Fan,
Deliang Fu,
Yun Gong,
Shenglan Huang,
Xiaocheng Li,
Zheng Li,
Yiting Liao,
He Liu,
Chunyu Qiao,
Bin Wang,
Zhen Wang,
Zhengyu Xiong
Abstract:
Short-form video content has become increasingly popular and influential in recent years. Its concise yet engaging format aligns well with todays' fast-paced and on-the-go lifestyles, making it a dominating trend in the digital world. As one of the front runners in the short video platform space, ByteDance has been highly successful in delivering a one-of-a-kind short video experience and attracti…
▽ More
Short-form video content has become increasingly popular and influential in recent years. Its concise yet engaging format aligns well with todays' fast-paced and on-the-go lifestyles, making it a dominating trend in the digital world. As one of the front runners in the short video platform space, ByteDance has been highly successful in delivering a one-of-a-kind short video experience and attracting billions of users worldwide. One key contributing factor is its advanced end-to-end personalized short video playback technology, where we pioneered and developed the new technical field over the past five years to optimize user experience. This paper introduces the major concepts and methodologies of this personalized video playback technology that distinguish it from traditional multimedia technologies. More details, including goal setting, iterative process, modeling, experimental methods and required supporting systems, are also provided to encourage deeper research in this area.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Broadband long-range thermal imaging via meta-correctors
Authors:
Cameron Vo,
Owen Anderson,
Anna Wirth-Singh,
Rose Johnson,
Arka Majumdar,
Zachary Coppens
Abstract:
Long-range imaging in the thermal infrared band is critical for applications such as environmental monitoring, industrial inspections, and surveillance. To achieve high quality imaging, these systems typically require large apertures and many elements with complex shapes to correct aberrations, adding significant weight and cost. Large-area metasurface optics offer a promising solution for weight…
▽ More
Long-range imaging in the thermal infrared band is critical for applications such as environmental monitoring, industrial inspections, and surveillance. To achieve high quality imaging, these systems typically require large apertures and many elements with complex shapes to correct aberrations, adding significant weight and cost. Large-area metasurface optics offer a promising solution for weight reduction; however, their substantial chromatic aberrations limit their effectiveness in the long-wave infrared (LWIR) band where broadband imaging is typically desired. In this work, we introduce a hybrid system comprising four refractive lenses and two all-silicon metasurface correctors (meta-correctors) to achieve high-quality, broadband thermal imaging at long range. Compared to a refractive-only assembly, our system demonstrates a three-fold contrast enhancement at the detector's half-Nyquist frequency. Testing outside the laboratory reveals noticeably sharper images, with human features clearly recognizable at distances of 250 meters. The assembly utilizes off-the-shelf refractive elements and avoids the use of germanium, which poses a supply chain risk. Our findings highlight the potential of hybrid meta-corrector systems to enable long-range, lightweight, and cost-effective LWIR imaging solutions.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Evaporating sessile droplets: solutal Marangoni effects overwhelm thermal Marangoni flow
Authors:
Duarte Rocha,
Philip L. Lederer,
Pim J. Dekker,
Alvaro Marin,
Detlef Lohse,
Christian Diddens
Abstract:
When an evaporating water droplet is deposited on a thermally conductive substrate, the minimum temperature will be at the apex due to evaporative cooling. Consequently, density and surface tension gradients emerge within the droplet and at the droplet-gas interface, giving rise to competing flows from, respectively, the apex towards the contact line (thermal-buoyancy-driven flow) and the other wa…
▽ More
When an evaporating water droplet is deposited on a thermally conductive substrate, the minimum temperature will be at the apex due to evaporative cooling. Consequently, density and surface tension gradients emerge within the droplet and at the droplet-gas interface, giving rise to competing flows from, respectively, the apex towards the contact line (thermal-buoyancy-driven flow) and the other way around (thermal Marangoni flow). In small droplets with a diameter below the capillary length, the thermal Marangoni effects are expected to dominate over thermal buoyancy ("thermal Rayleigh") effects. However, contrary to these theoretical predictions, our experiments mostly show a dominant circulation from the apex towards the contact line, indicating a prevailing of thermal Rayleigh convection. Furthermore, our experiments often show an unexpected asymmetric flow that persisted for several minutes. We hypothesise that a tiny amount of contaminants, commonly encountered in experiments with water/air interfaces, act as surfactants and counteract the thermal surface tension gradients at the interface and thereby promote the dominance of Rayleigh convection. Our finite element numerical simulations demonstrate that, under our specified experimental conditions, a mere 0.5% reduction in the static surface tension caused by surfactants leads to a reversal in the flow direction, compared to the theoretical prediction without contaminants. Additionally, we investigate the linear stability of the axisymmetric solutions, revealing that the presence of surfactants also affects the axial symmetry of the flow.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
$β$ symmetry of heterotic supergravity
Authors:
Walter H. Baron,
Carmen A. Nunez,
Jesus A. Rodriguez
Abstract:
The low energy effective action describing the Kaluza-Klein reduction of string theory on a $d$-torus possesses a continuous O($d, d$) global symmetry. The non-geometric piece of this symmetry, parameterized by a bi-vector $β$, was recently shown to effectively act as a hidden symmetry on the massless RR and universal NSNS fields of the ten dimensional parent theory, fixing their couplings. Here w…
▽ More
The low energy effective action describing the Kaluza-Klein reduction of string theory on a $d$-torus possesses a continuous O($d, d$) global symmetry. The non-geometric piece of this symmetry, parameterized by a bi-vector $β$, was recently shown to effectively act as a hidden symmetry on the massless RR and universal NSNS fields of the ten dimensional parent theory, fixing their couplings. Here we extend the analysis of this symmetry to the massless gauge and fermion fields of heterotic supergravity. While the interactions of the boson fields are univocally fixed by $β$ symmetry, we find four bilinear and two quartic $β$ invariant combinations of fermions whose relative coefficients in the action must be determined by supersymmetry. Although not fully fixed, bilinear and quartic fermion couplings are strongly restricted by $β$ symmetry at leading order in $α'$.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
High-Order Dynamic Integration Method (HODIM) for Modeling Turbulent Fluid Dynamics
Authors:
Rômulo Damasclin Chaves dos Santos,
Jorge Henrique de Oliveira Sales
Abstract:
This research explores the development and application of the High-Order Dynamic Integration Method for solving integro-differential equations, with a specific focus on turbulent fluid dynamics. Traditional numerical methods, such as the Finite Difference Method and the Finite Volume Method, have been widely employed in fluid dynamics but struggle to accurately capture the complexities of turbulen…
▽ More
This research explores the development and application of the High-Order Dynamic Integration Method for solving integro-differential equations, with a specific focus on turbulent fluid dynamics. Traditional numerical methods, such as the Finite Difference Method and the Finite Volume Method, have been widely employed in fluid dynamics but struggle to accurately capture the complexities of turbulence, particularly in high Reynolds number regimes. These methods often require significant computational resources and are prone to errors in nonlinear dynamic systems. The High-Order Dynamic Integration Method addresses these challenges by integrating higher-order interpolation techniques with dynamic adaptation strategies, significantly enhancing accuracy and computational efficiency. Through rigorous numerical analysis, this method demonstrates superior performance over the Finite Difference Method and the Finite Volume Method in handling the nonlinear behaviors characteristic of turbulent flows. Furthermore, the High-Order Dynamic Integration Method achieves this without a substantial increase in computational cost, making it a highly efficient tool for simulations in computational fluid dynamics. The research validates the capabilities of the High-Order Dynamic Integration Method through a series of benchmark tests and case studies. Results indicate a marked improvement in both accuracy and stability, particularly in simulations of high-Reynolds-number flows, where traditional methods often falter. This innovative approach offers a robust and efficient alternative for solving complex fluid dynamics problems, contributing to advances in the field of numerical methods and computational fluid dynamics.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Resolved photoproduction in {\tt MadGraph5\_aMC@NLO}
Authors:
Laboni Manna,
Anton Safronov,
Carlo Flore,
Daniel Kikola,
Jean-Philippe Lansberg,
Olivier Mattelaer
Abstract:
The upcoming Electron-Ion Collider (EIC), with its high luminosity, will offer an unprecedented opportunity to explore the internal structure of atomic nucleus over an extended energy range from $\sqrt{s_{ep}} =$ 45 GeV to $\sqrt{s_{ep}} =$ 140 GeV. A particularly promising aspect of this collider is the study of the partonic structure with quasi-real photons which can also be studied in inclusive…
▽ More
The upcoming Electron-Ion Collider (EIC), with its high luminosity, will offer an unprecedented opportunity to explore the internal structure of atomic nucleus over an extended energy range from $\sqrt{s_{ep}} =$ 45 GeV to $\sqrt{s_{ep}} =$ 140 GeV. A particularly promising aspect of this collider is the study of the partonic structure with quasi-real photons which can also be studied in inclusive ultra-peripheral collisions at the Large Hadron Collider (LHC). In this work, we present our validation of resolved photoproduction at fixed order for (next-to-)leading order using \texttt{MadGraph5\_aMC@NLO}, a widely adopted framework at the LHC.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
A high-precision continuum limit study of the HVP short-distance window
Authors:
Sebastian Spiegel,
Christoph Lehner
Abstract:
The separation of the hadronic vacuum polarization (HVP) contribution to the muon anomalous magnetic moment into Euclidean windows allows for a tailored approach to address the different dominant challenges at short, intermediate, and long distances. We present a novel approach to compute the short-distance window without the need for using perturbative QCD. We combine a quenched continuum extrapo…
▽ More
The separation of the hadronic vacuum polarization (HVP) contribution to the muon anomalous magnetic moment into Euclidean windows allows for a tailored approach to address the different dominant challenges at short, intermediate, and long distances. We present a novel approach to compute the short-distance window without the need for using perturbative QCD. We combine a quenched continuum extrapolation using 18 lattice spacings ($1.6 \,\text{GeV} \lesssim a^{-1} \lesssim 6.1\,\text{GeV}$) with a separate continuum extrapolation of the sea-quark effects. This method allows for the computationally expensive sea-quark effects to be estimated using only a smaller number of ensembles at coarser lattice spacings, while largely confining the logarithmic dependency of the continuum extrapolation to the quenched component.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
A Repeating Fast Radio Burst Source in a Low-Luminosity Dwarf Galaxy
Authors:
Danté M. Hewitt,
Mohit Bhardwaj,
Alexa C. Gordon,
Aida Kirichenko,
Kenzie Nimmo,
Shivani Bhandari,
Ismaël Cognard,
Wen-fai Fong,
Armando Gil de Paz,
Akshatha Gopinath,
Jason W. T. Hessels,
Franz Kirsten,
Benito Marcote,
Vladislavs Bezrukovs,
Richard Blaauw,
Justin D. Bray,
Salvatore Buttaccio,
Tomas Cassanelli,
Pragya Chawla,
Alessandro Corongiu,
William Deng,
Hannah N. Didehbani,
Yuxin Dong,
Marcin P. Gawroński,
Marcello Giroletti
, et al. (26 additional authors not shown)
Abstract:
We present the localization and host galaxy of FRB 20190208A, a repeating source of fast radio bursts (FRBs) discovered using CHIME/FRB. As part of the PRECISE repeater localization program on the EVN, we monitored FRB 20190208A for 65.6 hours at $\sim1.4$ GHz and detected a single burst, which led to its VLBI localization with 260 mas uncertainty (2$σ$). Follow-up optical observations with the MM…
▽ More
We present the localization and host galaxy of FRB 20190208A, a repeating source of fast radio bursts (FRBs) discovered using CHIME/FRB. As part of the PRECISE repeater localization program on the EVN, we monitored FRB 20190208A for 65.6 hours at $\sim1.4$ GHz and detected a single burst, which led to its VLBI localization with 260 mas uncertainty (2$σ$). Follow-up optical observations with the MMT Observatory ($i\gtrsim 25.7$ mag (AB)) found no visible host at the FRB position. Subsequent deeper observations with the GTC, however, revealed an extremely faint galaxy ($r=27.32 \pm0.16$ mag), very likely ($99.95 \%$) associated with FRB 20190208A. Given the dispersion measure of the FRB ($\sim580$ pc cm$^{-3}$), even the most conservative redshift estimate ($z_{\mathrm{max}}\sim0.83$) implies that this is the lowest-luminosity FRB host to date ($\lesssim10^8L_{\odot}$), even less luminous than the dwarf host of FRB 20121102A. We investigate how localization precision and the depth of optical imaging affect host association, and discuss the implications of such a low-luminosity dwarf galaxy. Unlike the other repeaters with low-luminosity hosts, FRB 20190208A has a modest Faraday rotation measure of a few tens of rad m$^{-2}$, and EVN plus VLA observations reveal no associated compact persistent radio source. We also monitored FRB 20190208A for 40.4 hours over 2 years as part of the ÉCLAT repeating FRB monitoring campaign on the Nançay Radio Telescope, and detected one burst. Our results demonstrate that, in some cases, the robust association of an FRB with a host galaxy will require both high localization precision, as well as deep optical follow-up.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Performance of the CMS high-level trigger during LHC Run 2
Authors:
CMS Collaboration
Abstract:
The CERN LHC provided proton and heavy ion collisions during its Run 2 operation period from 2015 to 2018. Proton-proton collisions reached a peak instantaneous luminosity of 2.1 $\times$ 10$^{34}$ cm$^{-2}$s$^{-1}$, twice the initial design value, at $\sqrt{s}$ = 13 TeV. The CMS experiment records a subset of the collisions for further processing as part of its online selection of data for physic…
▽ More
The CERN LHC provided proton and heavy ion collisions during its Run 2 operation period from 2015 to 2018. Proton-proton collisions reached a peak instantaneous luminosity of 2.1 $\times$ 10$^{34}$ cm$^{-2}$s$^{-1}$, twice the initial design value, at $\sqrt{s}$ = 13 TeV. The CMS experiment records a subset of the collisions for further processing as part of its online selection of data for physics analyses, using a two-level trigger system: the Level-1 trigger, implemented in custom-designed electronics, and the high-level trigger, a streamlined version of the offline reconstruction software running on a large computer farm. This paper presents the performance of the CMS high-level trigger system during LHC Run 2 for physics objects, such as leptons, jets, and missing transverse momentum, which meet the broad needs of the CMS physics program and the challenge of the evolving LHC and detector conditions. Sophisticated algorithms that were originally used in offline reconstruction were deployed online. Highlights include a machine-learning b tagging algorithm and a reconstruction algorithm for tau leptons that decay hadronically.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Dark Matter Search Results from 4.2 Tonne-Years of Exposure of the LUX-ZEPLIN (LZ) Experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
D. Bauer,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger
, et al. (193 additional authors not shown)
Abstract:
We report results of a search for nuclear recoils induced by weakly interacting massive particle (WIMP) dark matter using the LUX-ZEPLIN (LZ) two-phase xenon time projection chamber. This analysis uses a total exposure of $4.2\pm0.1$ tonne-years from 280 live days of LZ operation, of which $3.3\pm0.1$ tonne-years and 220 live days are new. A technique to actively tag background electronic recoils…
▽ More
We report results of a search for nuclear recoils induced by weakly interacting massive particle (WIMP) dark matter using the LUX-ZEPLIN (LZ) two-phase xenon time projection chamber. This analysis uses a total exposure of $4.2\pm0.1$ tonne-years from 280 live days of LZ operation, of which $3.3\pm0.1$ tonne-years and 220 live days are new. A technique to actively tag background electronic recoils from $^{214}$Pb $β$ decays is featured for the first time. Enhanced electron-ion recombination is observed in two-neutrino double electron capture decays of $^{124}$Xe, representing a noteworthy new background. After removal of artificial signal-like events injected into the data set to mitigate analyzer bias, we find no evidence for an excess over expected backgrounds. World-leading constraints are placed on spin-independent (SI) and spin-dependent WIMP-nucleon cross sections for masses $\geq$9 GeV/$c^2$. The strongest SI exclusion set is $2.1\times10^{-48}$ cm$^{2}$ at the 90% confidence level at a mass of 36 GeV/$c^2$, and the best SI median sensitivity achieved is $5.0\times10^{-48}$ cm$^{2}$ for a mass of 40 GeV/$c^2$.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
DIRI: Adversarial Patient Reidentification with Large Language Models for Evaluating Clinical Text Anonymization
Authors:
John X. Morris,
Thomas R. Campion,
Sri Laasya Nutheti,
Yifan Peng,
Akhil Raj,
Ramin Zabih,
Curtis L. Cole
Abstract:
Sharing protected health information (PHI) is critical for furthering biomedical research. Before data can be distributed, practitioners often perform deidentification to remove any PHI contained in the text. Contemporary deidentification methods are evaluated on highly saturated datasets (tools achieve near-perfect accuracy) which may not reflect the full variability or complexity of real-world c…
▽ More
Sharing protected health information (PHI) is critical for furthering biomedical research. Before data can be distributed, practitioners often perform deidentification to remove any PHI contained in the text. Contemporary deidentification methods are evaluated on highly saturated datasets (tools achieve near-perfect accuracy) which may not reflect the full variability or complexity of real-world clinical text and annotating them is resource intensive, which is a barrier to real-world applications. To address this gap, we developed an adversarial approach using a large language model (LLM) to re-identify the patient corresponding to a redacted clinical note and evaluated the performance with a novel De-Identification/Re-Identification (DIRI) method. Our method uses a large language model to reidentify the patient corresponding to a redacted clinical note. We demonstrate our method on medical data from Weill Cornell Medicine anonymized with three deidentification tools: rule-based Philter and two deep-learning-based models, BiLSTM-CRF and ClinicalBERT. Although ClinicalBERT was the most effective, masking all identified PII, our tool still reidentified 9% of clinical notes Our study highlights significant weaknesses in current deidentification technologies while providing a tool for iterative development and improvement.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.