subscribe to arXiv mailings

Search for gravitational waves emitted from SN 2023ixf

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, A. Allocca , et al. (1758 additional authors not shown)

Abstract: We present the results of a search for gravitational-wave transients associated with core-collapse supernova SN 2023ixf, which was observed in the galaxy Messier 101 via optical emission on 2023 May 19th, during the LIGO-Virgo-KAGRA 15th Engineering Run. We define a five-day on-source window during which an accompanying gravitational-wave signal may have occurred. No gravitational waves have been… ▽ More We present the results of a search for gravitational-wave transients associated with core-collapse supernova SN 2023ixf, which was observed in the galaxy Messier 101 via optical emission on 2023 May 19th, during the LIGO-Virgo-KAGRA 15th Engineering Run. We define a five-day on-source window during which an accompanying gravitational-wave signal may have occurred. No gravitational waves have been identified in data when at least two gravitational-wave observatories were operating, which covered $\sim 14\%$ of this five-day window. We report the search detection efficiency for various possible gravitational-wave emission models. Considering the distance to M101 (6.7 Mpc), we derive constraints on the gravitational-wave emission mechanism of core-collapse supernovae across a broad frequency spectrum, ranging from 50 Hz to 2 kHz where we assume the GW emission occurred when coincident data are available in the on-source window. Considering an ellipsoid model for a rotating proto-neutron star, our search is sensitive to gravitational-wave energy $1 \times 10^{-5} M_{\odot} c^2$ and luminosity $4 \times 10^{-5} M_{\odot} c^2/\text{s}$ for a source emitting at 50 Hz. These constraints are around an order of magnitude more stringent than those obtained so far with gravitational-wave data. The constraint on the ellipticity of the proto-neutron star that is formed is as low as $1.04$, at frequencies above $1200$ Hz, surpassing results from SN 2019ejj. △ Less

Submitted 21 October, 2024; originally announced October 2024.

Comments: Main paper: 6 pages, 4 figures and 1 table. Total with appendices: 20 pages, 4 figures, and 1 table

Report number: LIGO-P2400125

arXiv:2410.15690 [pdf, other]

Efficient Terminology Integration for LLM-based Translation in Specialized Domains

Authors: Sejoon Kim, Mingi Sung, Jeonghwan Lee, Hyunkuk Lim, Jorge Froilan Gimenez Perez

Abstract: Traditional machine translation methods typically involve training models directly on large parallel corpora, with limited emphasis on specialized terminology. However, In specialized fields such as patent, finance, or biomedical domains, terminology is crucial for translation, with many terms that needs to be translated following agreed-upon conventions. In this paper we introduce a methodology t… ▽ More Traditional machine translation methods typically involve training models directly on large parallel corpora, with limited emphasis on specialized terminology. However, In specialized fields such as patent, finance, or biomedical domains, terminology is crucial for translation, with many terms that needs to be translated following agreed-upon conventions. In this paper we introduce a methodology that efficiently trains models with a smaller amount of data while preserving the accuracy of terminology translation. We achieve this through a systematic process of term extraction and glossary creation using the Trie Tree algorithm, followed by data reconstruction to teach the LLM how to integrate these specialized terms. This methodology enhances the model's ability to handle specialized terminology and ensures high-quality translations, particularly in fields where term consistency is crucial. Our approach has demonstrated exceptional performance, achieving the highest translation score among participants in the WMT patent task to date, showcasing its effectiveness and broad applicability in specialized translation domains where general methods often fall short. △ Less

Submitted 21 October, 2024; originally announced October 2024.

Comments: Accepted to WMT 2024

arXiv:2410.13627 [pdf, other]

Is the $w_0w_a$CDM cosmological parameterization evidence for dark energy dynamics partially caused by the excess smoothing of Planck CMB anisotropy data?

Authors: Chan-Gyung Park, Javier de Cruz Perez, Bharat Ratra

Abstract: We study the performance of the spatially-flat dynamical dark energy (DE) $w_0w_a$CDM parameterization, with redshift-dependent DE fluid equation of state parameter $w(z) = w_0 + w_a z/(1+z)$, with and without a varying CMB lensing consistency parameter $A_L$, against Planck cosmic microwave background (CMB) data (P18 and lensing) and a combination of non-CMB data composed of baryonic acoustic osc… ▽ More We study the performance of the spatially-flat dynamical dark energy (DE) $w_0w_a$CDM parameterization, with redshift-dependent DE fluid equation of state parameter $w(z) = w_0 + w_a z/(1+z)$, with and without a varying CMB lensing consistency parameter $A_L$, against Planck cosmic microwave background (CMB) data (P18 and lensing) and a combination of non-CMB data composed of baryonic acoustic oscillation (BAO) measurements that do not include DESI BAO data, Pantheon+ type Ia supernovae (SNIa) observations, Hubble parameter [$H(z)$] measurements, and growth factor ($fσ_8$) data points. From our most restrictive data set, P18+lensing+non-CMB, for the $w_0w_a$CDM+$A_L$ parameterization, we obtain $w_0=-0.879\pm 0.060$, $w_a=-0.39^{+0.26}_{-0.22}$, the asymptotic limit $w(z\to\infty) = w_0+w_a=-1.27^{+0.20}_{-0.17}$, and $A_L=1.078^{+0.036}_{-0.040}$ (all $1σ$ errors). This joint analysis of CMB and non-CMB data favors DE dynamics over a cosmological constant at $\sim 1σ$ and $A_L>1$ at $\sim 2σ$, i.e. more smoothing of the Planck CMB anisotropy data than is predicted by the best-fit model. For the $w_0w_a$CDM parameterization with $A_L=1$ the evidence in favor of DE dynamics is larger, $\sim 2σ$, suggesting that at least part of the evidence for DE dynamics comes from the excess smoothing of the Planck CMB anisotropy data. For the $w_0w_a$CDM parameterization with $A_L=1$, there is a difference of $2.8σ$ between P18 and non-CMB cosmological parameter constraints and $2.7σ$ between P18+lensing and non-CMB constraints. When $A_L$ is allowed to vary these tensions reduced to $1.9σ$ and $2.1σ$ respectively. Our P18+lensing+non-CMB data compilation positively favors the $w_0w_a$CDM parameterization without and with a varying $A_L$ parameter over the flat $Λ$CDM model, and $w_0w_a$CDM+$A_L$ is also positively favored over $w_0w_a$CDM. △ Less

Submitted 4 October, 2024; originally announced October 2024.

Comments: 14 pages, 6 figures

arXiv:2410.12174 [pdf, other]

Exploring Large Language Models for Hate Speech Detection in Rioplatense Spanish

Authors: Juan Manuel Pérez, Paula Miguel, Viviana Cotik

Abstract: Hate speech detection deals with many language variants, slang, slurs, expression modalities, and cultural nuances. This outlines the importance of working with specific corpora, when addressing hate speech within the scope of Natural Language Processing, recently revolutionized by the irruption of Large Language Models. This work presents a brief analysis of the performance of large language mode… ▽ More Hate speech detection deals with many language variants, slang, slurs, expression modalities, and cultural nuances. This outlines the importance of working with specific corpora, when addressing hate speech within the scope of Natural Language Processing, recently revolutionized by the irruption of Large Language Models. This work presents a brief analysis of the performance of large language models in the detection of Hate Speech for Rioplatense Spanish. We performed classification experiments leveraging chain-of-thought reasoning with ChatGPT 3.5, Mixtral, and Aya, comparing their results with those of a state-of-the-art BERT classifier. These experiments outline that, even if large language models show a lower precision compared to the fine-tuned BERT classifier and, in some cases, they find hard-to-get slurs or colloquialisms, they still are sensitive to highly nuanced cases (particularly, homophobic/transphobic hate speech). We make our code and models publicly available for future research. △ Less

Submitted 15 October, 2024; originally announced October 2024.

arXiv:2410.09151 [pdf, other]

A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné , et al. (1758 additional authors not shown)

Abstract: The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by… ▽ More The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by CHIME/FRB, as well as X-ray glitches and X-ray bursts detected by NICER and NuSTAR close to the time of one of the FRBs. We do not detect any significant GW emission from any of the events. Instead, using a short-duration GW search (for bursts $\leq$ 1 s) we derive 50\% (90\%) upper limits of $10^{48}$ ($10^{49}$) erg for GWs at 300 Hz and $10^{49}$ ($10^{50}$) erg at 2 kHz, and constrain the GW-to-radio energy ratio to $\leq 10^{14} - 10^{16}$. We also derive upper limits from a long-duration search for bursts with durations between 1 and 10 s. These represent the strictest upper limits on concurrent GW emission from FRBs. △ Less

Submitted 11 October, 2024; originally announced October 2024.

Comments: 15 pages of text including references, 4 figures, 5 tables

Report number: LIGO-P2400192

arXiv:2410.04855 [pdf, other]

Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation

Authors: Paul Jansonnie, Bingbing Wu, Julien Perez, Jan Peters

Abstract: Learning skills that interact with objects is of major importance for robotic manipulation. These skills can indeed serve as an efficient prior for solving various manipulation tasks. We propose a novel Skill Learning approach that discovers composable behaviors by solving a large and diverse number of autonomously generated tasks. Our method learns skills allowing the robot to consistently and ro… ▽ More Learning skills that interact with objects is of major importance for robotic manipulation. These skills can indeed serve as an efficient prior for solving various manipulation tasks. We propose a novel Skill Learning approach that discovers composable behaviors by solving a large and diverse number of autonomously generated tasks. Our method learns skills allowing the robot to consistently and robustly interact with objects in its environment. The discovered behaviors are embedded in primitives which can be composed with Hierarchical Reinforcement Learning to solve unseen manipulation tasks. In particular, we leverage Asymmetric Self-Play to discover behaviors and Multiplicative Compositional Policies to embed them. We compare our method to Skill Learning baselines and find that our skills are more interactive. Furthermore, the learned skills can be used to solve a set of unseen manipulation tasks, in simulation as well as on a real robotic platform. △ Less

Submitted 7 October, 2024; originally announced October 2024.

Comments: Accepted at the 2024 IEEE-RAS International Conference on Humanoid Robots

arXiv:2409.05994 [pdf, other]

MessIRve: A Large-Scale Spanish Information Retrieval Dataset

Authors: Francisco Valentini, Viviana Cotik, Damián Furman, Ivan Bercovich, Edgar Altszyler, Juan Manuel Pérez

Abstract: Information retrieval (IR) is the task of finding relevant documents in response to a user query. Although Spanish is the second most spoken native language, current IR benchmarks lack Spanish data, hindering the development of information access tools for Spanish speakers. We introduce MessIRve, a large-scale Spanish IR dataset with around 730 thousand queries from Google's autocomplete API and r… ▽ More Information retrieval (IR) is the task of finding relevant documents in response to a user query. Although Spanish is the second most spoken native language, current IR benchmarks lack Spanish data, hindering the development of information access tools for Spanish speakers. We introduce MessIRve, a large-scale Spanish IR dataset with around 730 thousand queries from Google's autocomplete API and relevant documents sourced from Wikipedia. MessIRve's queries reflect diverse Spanish-speaking regions, unlike other datasets that are translated from English or do not consider dialectal variations. The large size of the dataset allows it to cover a wide variety of topics, unlike smaller datasets. We provide a comprehensive description of the dataset, comparisons with existing datasets, and baseline evaluations of prominent IR models. Our contributions aim to advance Spanish IR research and improve information access for Spanish speakers. △ Less

Submitted 9 September, 2024; originally announced September 2024.

arXiv:2409.03923 [pdf, ps, other]

Model theory of Hilbert spaces expanded by a representation of a group

Authors: Alexander Berenstein, Juan Manuel Pérez

Abstract: In this paper we study expansions of infinite dimensional Hilbert spaces with a unitary representation of a group. When the group is finite, we prove the theory of the corresponding expansion is $\aleph_0$-categorical, $\aleph_0$-stable and is SFB. On the other hand, when the group involved is a product of the form $H\times \mathbb{Z}^n$, where $H$ is a finite group and $n\geq 1$, the theory of th… ▽ More In this paper we study expansions of infinite dimensional Hilbert spaces with a unitary representation of a group. When the group is finite, we prove the theory of the corresponding expansion is $\aleph_0$-categorical, $\aleph_0$-stable and is SFB. On the other hand, when the group involved is a product of the form $H\times \mathbb{Z}^n$, where $H$ is a finite group and $n\geq 1$, the theory of the Hilbert space expanded by the representation of this group is, in general, stable not $\aleph_0$-stable, not $\aleph_0$-categorical, but it is $\aleph_0$-categorical up to perturbations and $\aleph_0$-stable up to perturbations. △ Less

Submitted 10 September, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

Comments: 19 pages

arXiv:2408.14110 [pdf]

Room-temperature Optically Detected Magnetic Resonance of Telecom Single Photon Emitters in GaN

Authors: John J. H. Eng, Zhengzhi Jiang, Max Meunier, Abdullah Rasmita, Haoran Zhang, Yuzhe Yang, Feifei Zhou, Hongbing Cai, Zhaogang Dong, Jesús Zúñiga Pérez, Weibo Gao

Abstract: Solid-state defects susceptible of spin manipulation hold great promise for scalable quantum technology. To broaden their utility, operating at room temperature and emitting in the telecom wavelength range are desired, eliminating cryogenic requirements and leveraging existing optical fiber infrastructure for transmitting the quantum information. To that end, we report that telecom single photon e… ▽ More Solid-state defects susceptible of spin manipulation hold great promise for scalable quantum technology. To broaden their utility, operating at room temperature and emitting in the telecom wavelength range are desired, eliminating cryogenic requirements and leveraging existing optical fiber infrastructure for transmitting the quantum information. To that end, we report that telecom single photon emitters (SPEs) in gallium nitride (GaN) exhibit optically detected magnetic resonance (ODMR) at room temperature. The analysis of ODMR as a function of magnetic field orientation enables the determination of the orientation of the spin quantization axis with respect to the GaN crystalline lattice. The optical transitions dynamics are analyzed to gain further insight into the transition rates dominating ODMR. Our findings, coupled with GaN's mature fabrication technology, could facilitate the realization of scalable quantum technology. △ Less

Submitted 26 August, 2024; originally announced August 2024.

arXiv:2408.13135 [pdf, other]

Deep Learning at the Intersection: Certified Robustness as a Tool for 3D Vision

Authors: Gabriel Pérez S, Juan C. Pérez, Motasem Alfarra, Jesús Zarzar, Sara Rojas, Bernard Ghanem, Pablo Arbeláez

Abstract: This paper presents preliminary work on a novel connection between certified robustness in machine learning and the modeling of 3D objects. We highlight an intriguing link between the Maximal Certified Radius (MCR) of a classifier representing a space's occupancy and the space's Signed Distance Function (SDF). Leveraging this relationship, we propose to use the certification method of randomized s… ▽ More This paper presents preliminary work on a novel connection between certified robustness in machine learning and the modeling of 3D objects. We highlight an intriguing link between the Maximal Certified Radius (MCR) of a classifier representing a space's occupancy and the space's Signed Distance Function (SDF). Leveraging this relationship, we propose to use the certification method of randomized smoothing (RS) to compute SDFs. Since RS' high computational cost prevents its practical usage as a way to compute SDFs, we propose an algorithm to efficiently run RS in low-dimensional applications, such as 3D space, by expressing RS' fundamental operations as Gaussian smoothing on pre-computed voxel grids. Our approach offers an innovative and practical tool to compute SDFs, validated through proof-of-concept experiments in novel view synthesis. This paper bridges two previously disparate areas of machine learning, opening new avenues for further exploration and potential cross-domain advancements. △ Less

Submitted 23 August, 2024; originally announced August 2024.

Comments: This paper is an accepted extended abstract to the LatinX workshop at ICCV 2023. This was uploaded a year late

arXiv:2408.09223 [pdf, other]

A theoretical framework for reservoir computing on networks of organic electrochemical transistors

Authors: Nicholas W. Landry, Beckett R. Hyde, Jake C. Perez, Sean E. Shaheen, Juan G. Restrepo

Abstract: Efficient and accurate prediction of physical systems is important even when the rules of those systems cannot be easily learned. Reservoir computing, a type of recurrent neural network with fixed nonlinear units, is one such prediction method and is valued for its ease of training. Organic electrochemical transistors (OECTs) are physical devices with nonlinear transient properties that can be use… ▽ More Efficient and accurate prediction of physical systems is important even when the rules of those systems cannot be easily learned. Reservoir computing, a type of recurrent neural network with fixed nonlinear units, is one such prediction method and is valued for its ease of training. Organic electrochemical transistors (OECTs) are physical devices with nonlinear transient properties that can be used as the nonlinear units of a reservoir computer. We present a theoretical framework for simulating reservoir computers using OECTs as the non-linear units as a test bed for designing physical reservoir computers. We present a proof of concept demonstrating that such an implementation can accurately predict the Lorenz attractor with comparable performance to standard reservoir computer implementations. We explore the effect of operating parameters and find that the prediction performance strongly depends on the pinch-off voltage of the OECTs. △ Less

Submitted 17 August, 2024; originally announced August 2024.

Comments: 10 pages, 8 figures

arXiv:2407.12867 [pdf, other]

Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run

Authors: Gayathri Raman, Samuele Ronchini, James Delaunay, Aaron Tohuvavohu, Jamie A. Kennea, Tyler Parsotan, Elena Ambrosi, Maria Grazia Bernardini, Sergio Campana, Giancarlo Cusumano, Antonino D'Ai, Paolo D'Avanzo, Valerio D'Elia, Massimiliano De Pasquale, Simone Dichiara, Phil Evans, Dieter Hartmann, Paul Kuin, Andrea Melandri, Paul O'Brien, Julian P. Osborne, Kim Page, David M. Palmer, Boris Sbarufatti, Gianpiero Tagliaferri , et al. (1797 additional authors not shown)

Abstract: We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wav… ▽ More We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalogs (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum--likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW--BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers. △ Less

Submitted 13 July, 2024; originally announced July 2024.

Comments: 50 pages, 10 figures, 4 tables

arXiv:2407.10201 [pdf, other]

The PolarKID project: polarization measurements with KIDs for the next generation of CMB telescopes

Authors: Sofia Savorgnano, Julien Bounmy, Olivier Bourrion, Martino Calvo, Andrea Catalano, Olivier Choulet, Gregory Garde, Anne Gerardin, Mile Kusulja, Juan Francisco Macias Perez, Alessandro Monfardini, Damien Tourres, Francis Vezzu

Abstract: The goal of the PolarKID project is testing a new method for the measurement of polarized sources, in order to identify all the possible instrumental systematic effects that could impact the detection of CMB B-modes of polarization. It employs the KISS (KIDs Interferometer Spectrum Survey) instrument coupled to a sky simulator and to sources such as point-like black bodies (simulating planets), a… ▽ More The goal of the PolarKID project is testing a new method for the measurement of polarized sources, in order to identify all the possible instrumental systematic effects that could impact the detection of CMB B-modes of polarization. It employs the KISS (KIDs Interferometer Spectrum Survey) instrument coupled to a sky simulator and to sources such as point-like black bodies (simulating planets), a dipole (extended source) and a polarizer. We use filled-arrays Lumped Element Kinetic Inductance Detectors (LEKIDs) since they have multiple advantages when observing both in a photometry and in a polarimetry configuration △ Less

Submitted 14 July, 2024; originally announced July 2024.

Comments: 8 pages, 1 figure, Proceeding of the SPIE conference Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy XII, SPIE Astronomical Telescopes + Instrumentation 2024

arXiv:2407.07762 [pdf]

doi 10.1109/TE.2023.3241099

Learning and Motivational Impact of Game-Based Learning: Comparing Face-to-Face and Online Formats on Computer Science Education

Authors: Daniel López-Fernández, Aldo Gordillo, Jennifer Pérez, Edmundo Tovar

Abstract: Contribution: This article analyzes the learning and motivational impact of teacher-authored educational video games on computer science education and compares its effectiveness in both face-to-face and online (remote) formats. This work presents comparative data and findings obtained from 217 students who played the game in a face-to-face format (control group) and 104 students who played the gam… ▽ More Contribution: This article analyzes the learning and motivational impact of teacher-authored educational video games on computer science education and compares its effectiveness in both face-to-face and online (remote) formats. This work presents comparative data and findings obtained from 217 students who played the game in a face-to-face format (control group) and 104 students who played the game in an online format (experimental group). Background: Serious video games have been proven effective at computer science education, however, it is still unknown whether the effectiveness of these games is the same regardless of their format, face-to-face or online. Moreover, the usage of games created through authoring tools has barely been explored. Research Questions: Are teacher-authored educational video games effective in terms of learning and motivation for computer science students? Does the effectiveness of teacher-authored educational video games depend on whether they are used in a face-to-face or online format? Methodology: A quasi-experiment has been conducted by using three instruments (pre-test, post-test, and questionnaire) with the purpose of comparing the effectiveness of game-based learning in face-to-face and online formats. A total of 321 computer science students played a teacher-authored educational video game aimed to learn about software design. Findings: The results reveal that teacher-authored educational video games are highly effective in terms of knowledge acquisition and motivation both in face-to-face and online formats. The results also show that some students' perceptions were more positive when a face-to-face format was used. △ Less

Submitted 10 July, 2024; originally announced July 2024.

Comments: 10 pages, 3 figures. Accepted version of a journal article published in IEEE Transactions on Education

Journal ref: IEEE Transactions on Education, Volume 66, Issue 4, 2023

arXiv:2407.07258 [pdf, other]

Identification of emotions on Twitter during the 2022 electoral process in Colombia

Authors: Juan Jose Iguaran Fernandez, Juan Manuel Perez, German Rosati

Abstract: The study of Twitter as a means for analyzing social phenomena has gained interest in recent years due to the availability of large amounts of data in a relatively spontaneous environment. Within opinion-mining tasks, emotion detection is specially relevant, as it allows for the identification of people's subjective responses to different social events in a more granular way than traditional senti… ▽ More The study of Twitter as a means for analyzing social phenomena has gained interest in recent years due to the availability of large amounts of data in a relatively spontaneous environment. Within opinion-mining tasks, emotion detection is specially relevant, as it allows for the identification of people's subjective responses to different social events in a more granular way than traditional sentiment analysis based on polarity. In the particular case of political events, the analysis of emotions in social networks can provide valuable information on the perception of candidates, proposals, and other important aspects of the public debate. In spite of this importance, there are few studies on emotion detection in Spanish and, to the best of our knowledge, few resources are public for opinion mining in Colombian Spanish, highlighting the need for generating resources addressing the specific cultural characteristics of this variety. In this work, we present a small corpus of tweets in Spanish related to the 2022 Colombian presidential elections, manually labeled with emotions using a fine-grained taxonomy. We perform classification experiments using supervised state-of-the-art models (BERT models) and compare them with GPT-3.5 in few-shot learning settings. We make our dataset and code publicly available for research purposes. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.06391 [pdf, other]

Around Classical and Intuitionistic Linear Processes

Authors: Juan C. Jaramillo, Dan Frumin, Jorge A. Pérez

Abstract: Curry-Howard correspondences between Linear Logic (LL) and session types provide a firm foundation for concurrent processes. As the correspondences hold for intuitionistic and classic versions of LL (ILL and CLL), we obtain two different families of type systems for concurrency. An open question remains: how do these two families exactly relate to each other? Based upon a translation from CLL to I… ▽ More Curry-Howard correspondences between Linear Logic (LL) and session types provide a firm foundation for concurrent processes. As the correspondences hold for intuitionistic and classic versions of LL (ILL and CLL), we obtain two different families of type systems for concurrency. An open question remains: how do these two families exactly relate to each other? Based upon a translation from CLL to ILL due to Laurent (2018), we provide two complementary answers, in the form of full abstraction results based on a typed observational equivalence due to Atkey (2017). Our results elucidate hitherto missing formal links between seemingly related yet different type systems for concurrency. △ Less

Submitted 22 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

Comments: Full version, 19 pages + appendices

arXiv:2407.04503 [pdf, other]

When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

Authors: Jérémy Perez, Corentin Léger, Grgur Kovač, Cédric Colas, Gaia Molinaro, Maxime Derex, Pierre-Yves Oudeyer, Clément Moulin-Frier

Abstract: As large language models (LLMs) start interacting with each other and generating an increasing amount of text online, it becomes crucial to better understand how information is transformed as it passes from one LLM to the next. While significant research has examined individual LLM behaviors, existing studies have largely overlooked the collective behaviors and information distortions arising from… ▽ More As large language models (LLMs) start interacting with each other and generating an increasing amount of text online, it becomes crucial to better understand how information is transformed as it passes from one LLM to the next. While significant research has examined individual LLM behaviors, existing studies have largely overlooked the collective behaviors and information distortions arising from iterated LLM interactions. Small biases, negligible at the single output level, risk being amplified in iterated interactions, potentially leading the content to evolve towards attractor states. In a series of telephone game experiments, we apply a transmission chain design borrowed from the human cultural evolution literature: LLM agents iteratively receive, produce, and transmit texts from the previous to the next agent in the chain. By tracking the evolution of text toxicity, positivity, difficulty, and length across transmission chains, we uncover the existence of biases and attractors, and study their dependence on the initial text, the instructions, language model, and model size. For instance, we find that more open-ended instructions lead to stronger attraction effects compared to more constrained tasks. We also find that different text properties display different sensitivity to attraction effects, with toxicity leading to stronger attractors than length. These findings highlight the importance of accounting for multi-step transmission dynamics and represent a first step towards a more comprehensive understanding of LLM cultural dynamics. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: Code available at https://github.com/jeremyperez2/TelephoneGameLLM. Companion website with a Data Explorer tool at https://sites.google.com/view/telephone-game-llm

MSC Class: 68T50 ACM Class: I.2.7

arXiv:2407.03124 [pdf, other]

Linking microscopic structure to optical properties in soft plasmonic complexes

Authors: Francesco Brasili, Angela Capocefalo, Giovanni Del Monte, Rodrigo Rivas-Barbosa, Javier Pérez, Edouard Chauveau, Federico Bordi, Domenico Truzzolillo, Emanuela Zaccarelli, Simona Sennato

Abstract: The complexation of plasmonic nanoparticles (NPs) and thermoresponsive microgels is widely exploited for applications. Yet, a microscopic description of the mechanisms governing spatial organization of the NPs is still lacking. Combining small angle X-ray scattering, state-of-the-art numerical simulations and a simple toy model, we uncover how the volume phase transition of microgels drives NP-NP… ▽ More The complexation of plasmonic nanoparticles (NPs) and thermoresponsive microgels is widely exploited for applications. Yet, a microscopic description of the mechanisms governing spatial organization of the NPs is still lacking. Combining small angle X-ray scattering, state-of-the-art numerical simulations and a simple toy model, we uncover how the volume phase transition of microgels drives NP-NP interactions, inducing NP progressive rearrangement with temperature. These results are directly compared to the extinction spectra of microgel-NPs complexes, allowing us to establish for the first time a microscopic link between plasmon coupling and NP local structure. △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2406.11001 [pdf, ps, other]

Symmetries and Interactions of $\mathcal{N}=1$ SUGRA: from Constructive and BCFW to KLT formulations

Authors: Dibya Chakraborty, J. Lorenzo Díaz-Cruz, Jonathan Reyes Pérez, Pablo Ortega Ruiz

Abstract: In this paper, we study the coupling of the gravity supermultiplet (graviton and gravitino) of minimal $\mathcal{N}=1$ SUGRA following a constructive approach. Firstly, we use the master formulae that follows from considering the scaling behavior of the spinor variables under the little group. Secondly, we derive the 4-point couplings using BFCW. Then, we verify these results for the general 3-poi… ▽ More In this paper, we study the coupling of the gravity supermultiplet (graviton and gravitino) of minimal $\mathcal{N}=1$ SUGRA following a constructive approach. Firstly, we use the master formulae that follows from considering the scaling behavior of the spinor variables under the little group. Secondly, we derive the 4-point couplings using BFCW. Then, we verify these results for the general 3-point interactions that can be derived using the KLT-type relations, i.e., they can be written as the square of the coupling of the gluons and gluinos. Finally, we consider SUGRA Compton effect for graviton-gravitino. For completeness, we present in the appendix the $\mathcal{N}=1$ Sugra lagrangian in the 2-component Weyl formalism, including the proofs of SUSY and gauge invariance. △ Less

Submitted 2 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

Comments: 28 pages, 5 figures, 1 table, minor typos fixed, reference added

arXiv:2406.06474 [pdf, other]

Towards a Personal Health Large Language Model

Authors: Justin Cosentino, Anastasiya Belyaeva, Xin Liu, Nicholas A. Furlotte, Zhun Yang, Chace Lee, Erik Schenck, Yojan Patel, Jian Cui, Logan Douglas Schneider, Robby Bryant, Ryan G. Gomes, Allen Jiang, Roy Lee, Yun Liu, Javier Perez, Jameson K. Rogers, Cathy Speed, Shyam Tailor, Megan Walker, Jeffrey Yu, Tim Althoff, Conor Heneghan, John Hernandez, Mark Malhotra , et al. (9 additional authors not shown)

Abstract: In health, most large language model (LLM) research has focused on clinical tasks. However, mobile and wearable devices, which are rarely integrated into such tasks, provide rich, longitudinal data for personal health monitoring. Here we present Personal Health Large Language Model (PH-LLM), fine-tuned from Gemini for understanding and reasoning over numerical time-series personal health data. We… ▽ More In health, most large language model (LLM) research has focused on clinical tasks. However, mobile and wearable devices, which are rarely integrated into such tasks, provide rich, longitudinal data for personal health monitoring. Here we present Personal Health Large Language Model (PH-LLM), fine-tuned from Gemini for understanding and reasoning over numerical time-series personal health data. We created and curated three datasets that test 1) production of personalized insights and recommendations from sleep patterns, physical activity, and physiological responses, 2) expert domain knowledge, and 3) prediction of self-reported sleep outcomes. For the first task we designed 857 case studies in collaboration with domain experts to assess real-world scenarios in sleep and fitness. Through comprehensive evaluation of domain-specific rubrics, we observed that Gemini Ultra 1.0 and PH-LLM are not statistically different from expert performance in fitness and, while experts remain superior for sleep, fine-tuning PH-LLM provided significant improvements in using relevant domain knowledge and personalizing information for sleep insights. We evaluated PH-LLM domain knowledge using multiple choice sleep medicine and fitness examinations. PH-LLM achieved 79% on sleep and 88% on fitness, exceeding average scores from a sample of human experts. Finally, we trained PH-LLM to predict self-reported sleep quality outcomes from textual and multimodal encoding representations of wearable data, and demonstrate that multimodal encoding is required to match performance of specialized discriminative models. Although further development and evaluation are necessary in the safety-critical personal health domain, these results demonstrate both the broad knowledge and capabilities of Gemini models and the benefit of contextualizing physiological data for personal health applications as done with PH-LLM. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 72 pages

arXiv:2406.06464 [pdf, other]

Transforming Wearable Data into Health Insights using Large Language Model Agents

Authors: Mike A. Merrill, Akshay Paruchuri, Naghmeh Rezaei, Geza Kovacs, Javier Perez, Yun Liu, Erik Schenck, Nova Hammerquist, Jake Sunshine, Shyam Tailor, Kumar Ayush, Hao-Wei Su, Qian He, Cory Y. McLean, Mark Malhotra, Shwetak Patel, Jiening Zhan, Tim Althoff, Daniel McDuff, Xin Liu

Abstract: Despite the proliferation of wearable health trackers and the importance of sleep and exercise to health, deriving actionable personalized insights from wearable data remains a challenge because doing so requires non-trivial open-ended analysis of these data. The recent rise of large language model (LLM) agents, which can use tools to reason about and interact with the world, presents a promising… ▽ More Despite the proliferation of wearable health trackers and the importance of sleep and exercise to health, deriving actionable personalized insights from wearable data remains a challenge because doing so requires non-trivial open-ended analysis of these data. The recent rise of large language model (LLM) agents, which can use tools to reason about and interact with the world, presents a promising opportunity to enable such personalized analysis at scale. Yet, the application of LLM agents in analyzing personal health is still largely untapped. In this paper, we introduce the Personal Health Insights Agent (PHIA), an agent system that leverages state-of-the-art code generation and information retrieval tools to analyze and interpret behavioral health data from wearables. We curate two benchmark question-answering datasets of over 4000 health insights questions. Based on 650 hours of human and expert evaluation we find that PHIA can accurately address over 84% of factual numerical questions and more than 83% of crowd-sourced open-ended questions. This work has implications for advancing behavioral health across the population, potentially enabling individuals to interpret their own wearable data, and paving the way for a new era of accessible, personalized wellness regimens that are informed by data-driven insights. △ Less

Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

Comments: 38 pages

arXiv:2406.04255 [pdf, other]

The frequency process in a non-neutral two-type continuous-state branching process with competition and its genealogy

Authors: Imanol Nuñez, José Luis Pérez

Abstract: We consider a population growth model given by a two-type continuous-state branching process with immigration and competition, introduced by Ma. We study the relative frequency of one of the types in the population when the total mass is forced to be constant at a dense set of times. The resulting process is described as the solution to an SDE, which we call the culled frequency process, generaliz… ▽ More We consider a population growth model given by a two-type continuous-state branching process with immigration and competition, introduced by Ma. We study the relative frequency of one of the types in the population when the total mass is forced to be constant at a dense set of times. The resulting process is described as the solution to an SDE, which we call the culled frequency process, generalizing the $Λ$-asymmetric frequency process introduced by Caballero et al. We obtain conditions for the culled frequency process to have a moment dual and show that it is given by a branching-coalescing continuous-time Markov chain that describes the genealogy of the two-type CBI with competition. Finally, we obtain a large population limit of the culled frequency process, resulting in a deterministic ordinary differential equation (ODE). Two particular cases of the limiting ODE are studied to determine if general two-type branching mechanisms and general Malthusians can lead to the coexistence of the two types in the population. △ Less

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2405.18108 [pdf]

Simulation of Single-Phase Natural Circulation within the BEPU Framework: Sketching Scaling Uncertainty Principle by Multi-Scale CFD Approaches

Authors: Haifu Huang, Jorge Perez, Nicolas Alpy, Marc Medale

Abstract: In order to enhance safety, nuclear reactors in the design phase consider natural circulation as a mean to remove residual power. The simulation of this passive mechanism must be qualified between the validation range and the scope of utilization (reactor case), introducing potential physical and numerical distortion effects. In this study, we simulate the flow of liquid sodium using the TrioCFD c… ▽ More In order to enhance safety, nuclear reactors in the design phase consider natural circulation as a mean to remove residual power. The simulation of this passive mechanism must be qualified between the validation range and the scope of utilization (reactor case), introducing potential physical and numerical distortion effects. In this study, we simulate the flow of liquid sodium using the TrioCFD code, employing both higher-fidelity (HF) LES and lower-fidelity (LF) URANS models. We tackle respectively numerical uncertainties through the Grid Convergence Index method, and physical modelling uncertainties through the Polynomial Chaos Expansion method available on the URANIE platform. HF simulations are shown to exhibit a strong resilience to physical distortion effects, with numerical uncertainties being intricately correlated. Conversely, the LF approach, the only one applicable at the reactor scale, is likely to present a reduced predictability. If so, the HF approach should be effective in pinpointing the LF weaknesses: the concept of scaling uncertainty is inline introduced as the growth of the LF simulation uncertainty associated with distortion effects. Thus, the paper outlines that a specific methodology within the BEPU framework - leveraging both HF and LF approaches - could pragmatically enable correlating distortion effects with scaling uncertainty, thereby providing a metric principle. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Journal ref: Best Estimate Plus Uncertainty International Conference (BEPU 2024), May 2024, Lucca, Italy

arXiv:2405.17146 [pdf, other]

Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

Authors: Juan C. Pérez, Alejandro Pardo, Mattia Soldan, Hani Itani, Juan Leon-Alcazar, Bernard Ghanem

Abstract: This study investigates whether Compressed-Language Models (CLMs), i.e. language models operating on raw byte streams from Compressed File Formats~(CFFs), can understand files compressed by CFFs. We focus on the JPEG format as a representative CFF, given its commonality and its representativeness of key concepts in compression, such as entropy coding and run-length encoding. We test if CLMs unders… ▽ More This study investigates whether Compressed-Language Models (CLMs), i.e. language models operating on raw byte streams from Compressed File Formats~(CFFs), can understand files compressed by CFFs. We focus on the JPEG format as a representative CFF, given its commonality and its representativeness of key concepts in compression, such as entropy coding and run-length encoding. We test if CLMs understand the JPEG format by probing their capabilities to perform along three axes: recognition of inherent file properties, handling of files with anomalies, and generation of new files. Our findings demonstrate that CLMs can effectively perform these tasks. These results suggest that CLMs can understand the semantics of compressed data when directly operating on the byte streams of files produced by CFFs. The possibility to directly operate on raw compressed files offers the promise to leverage some of their remarkable characteristics, such as their ubiquity, compactness, multi-modality and segment-nature. △ Less

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.00502 [pdf, other]

Using non-DESI data to confirm and strengthen the DESI 2024 spatially-flat $w_0w_a$CDM cosmological parameterization result

Authors: Chan-Gyung Park, Javier de Cruz Perez, Bharat Ratra

Abstract: We use a combination of Planck cosmic microwave background (CMB) anisotropy data and non-CMB data that include Pantheon+ type Ia supernovae (SNIa), Hubble parameter [$H(z)$], growth factor ($fσ_8$) measurements, and a collection of baryon acoustic oscillation (BAO) data, but not recent DESI 2024 BAO measurements, to confirm the DESI 2024 (DESI+CMB+PantheonPlus) data compilation support for dynamic… ▽ More We use a combination of Planck cosmic microwave background (CMB) anisotropy data and non-CMB data that include Pantheon+ type Ia supernovae (SNIa), Hubble parameter [$H(z)$], growth factor ($fσ_8$) measurements, and a collection of baryon acoustic oscillation (BAO) data, but not recent DESI 2024 BAO measurements, to confirm the DESI 2024 (DESI+CMB+PantheonPlus) data compilation support for dynamical dark energy with an evolving equation of state parameter $w(z) = w_0 + w_a z/(1+z)$. From our joint compilation of CMB and non-CMB data, in a spatially-flat cosmological model, we obtain $w_0 = -0.850 \pm 0.059$ and $w_a = -0.59^{+0.26}_{-0.22}$ and find that this dynamical dark energy is favored over a cosmological constant by $\sim 2σ$. Our data constraints on the flat $w_0w_a$CDM parameterization are slightly more restrictive than the DESI 2024 constraints, with the DESI 2024 and our values of $w_0$ and $w_a$ differing by $-0.27σ$ and $0.44σ$, respectively. Our data compilation slightly more strongly favors the flat $w_0w_a$CDM model over the flat $Λ$CDM model than does the DESI 2024 data compilation. We note that our CMB and non-CMB data $w_0w_a$CDM parameterization cosmological constraints are discrepant at 2.7$σ$, a little larger than the 1.9$σ$ discrepancy between DESI DR1 BAO and CMB data flat $Λ$CDM model cosmological constraints. We also show that if we remove the Pantheon+ SNIa contribution from the non-CMB data, for the $w_0w_a$CDM parameterization we still find tension between P18 and non-CMB data (2.5$σ$) and P18+lensing and non-CMB data (2.4$σ$). Even after the exclusion of Pantheon+ SNIa data the $Λ$CDM model is still disfavoured at $\sim 2σ$ c.l. △ Less

Submitted 4 October, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

Comments: 13 pages, 6 figures. Revised in response to the third referee report. We now also show that the approximately $2σ$ evidence for dark energy dynamics remains even when we do not use Pantheon+ SNIa data

arXiv:2404.19194 [pdf, other]

Updated observational constraints on spatially-flat and non-flat $Λ$CDM and XCDM cosmological models

Authors: Javier de Cruz Perez, Chan-Gyung Park, Bharat Ratra

Abstract: We study 6 LCDM models, with 4 allowing for non-flat geometry and 3 allowing for a non-unity lensing consistency parameter $A_L$. We also study 6 XCDM models with a dynamical dark energy density X-fluid with equation of state $w$. For the non-flat models we use two different primordial power spectra, Planck $P(q)$ and new $P(q)$. These models are tested against: Planck 2018 CMB power spectra (P18)… ▽ More We study 6 LCDM models, with 4 allowing for non-flat geometry and 3 allowing for a non-unity lensing consistency parameter $A_L$. We also study 6 XCDM models with a dynamical dark energy density X-fluid with equation of state $w$. For the non-flat models we use two different primordial power spectra, Planck $P(q)$ and new $P(q)$. These models are tested against: Planck 2018 CMB power spectra (P18) and lensing potential power spectrum (lensing), and an updated compilation of BAO, SNIa, $H(z)$, and $fσ_8$ data [non-CMB data]. P18 data favor closed geometry for the LCDM and XCDM models and $w<-1$ (phantom-like dark energy) for the XCDM models while non-CMB data favor open geometry for the LCDM models and closed geometry and $w>-1$ (quintessence-like dark energy) for the XCDM models. When P18 and non-CMB data are jointly analyzed there is weak evidence for open geometry and moderate evidence for quintessence-like dark energy. Regardless of data used, $A_L>1$ is always favored. The XCDM model constraints obtained from CMB data and from non-CMB data are incompatible, ruling out the 3 $A_L = 1$ XCDM models at $> 3σ$. In the 9 models not ruled out, for the P18+lensing+non-CMB data set we find little deviation from flat geometry and moderate deviation from $w=-1$. In all 6 non-flat models (not ruled out), open geometry is mildly favored, and in all 3 XCDM+$A_L$ models (not ruled out) quintessence-like dark energy is moderately favored (by at most $1.6 σ$). In the $A_L = 1$ non-flat LCDM cases, we find for P18+lensing+non-CMB data $Ω_k = 0.0009 \pm 0.0017$ [$0.0008 \pm 0.0017$] for the Planck [new] $P(q)$ model, favoring open geometry. The flat LCDM model remains the simplest (largely) observationally-consistent cosmological model. Our cosmological parameter constraints obtained for the flat LCDM model (and other models) are the most restrictive results to date (Abridged). △ Less

Submitted 4 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

Comments: 71 pages, 33 figures. Accepted for publication in Physical Review D

arXiv:2404.12411 [pdf]

Analysis of the Annealing Budget of Metal Oxide Thin-Film Transistors Prepared by an Aqueous Blade-Coating Process

Authors: Tianyu Tang, Preetam Dacha, Katherina Haase, Joshua Kreß, Christian Hänisch, Jonathan Perez, Yulia Krupskaya, Alexander Tahn, Darius Pohl, Sebastian Schneider, Felix Talnack, Mike Hambsch, Sebastian Reineke, Yana Vaynzof, Stefan C. B. Mannsfeld

Abstract: Metal oxide (MO) semiconductors are widely used in electronic devices due to their high optical transmittance and promising electrical performance. This work describes the advancement toward an eco-friendly, streamlined method for preparing thin-film transistors (TFTs) via a pure water-solution blade-coating process with focus on a low thermal budget. Low temperature and rapid annealing of triple-… ▽ More Metal oxide (MO) semiconductors are widely used in electronic devices due to their high optical transmittance and promising electrical performance. This work describes the advancement toward an eco-friendly, streamlined method for preparing thin-film transistors (TFTs) via a pure water-solution blade-coating process with focus on a low thermal budget. Low temperature and rapid annealing of triple-coated indium oxide thin-film transistors (3C-TFTs) and indium oxide/zinc oxide/indium oxide thin-film transistors (IZI-TFTs) on a 300 nm SiO2 gate dielectric at 300 $^{\circ}$C for only 60 s yields devices with an average field effect mobility of 10.7 and 13.8 cm2/Vs, respectively. The devices show an excellent on/off ratio (>10^6), and a threshold voltage close to 0 V when measured in air. Flexible MO-TFTs on polyimide substrates with AlOx dielectrics fabricated by rapid annealing treatment can achieve a remarkable mobility of over 10 cm2/Vs at low operating voltage. When using a longer post-coating annealing period of 20 min, high-performance 3C-TFTs (over 18 cm2/Vs) and IZI-TFTs (over 38 cm2/Vs) using MO semiconductor layers annealed at 300 $^{\circ}$C are achieved. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.04248 [pdf, other]

doi 10.3847/2041-8213/ad5beb

Observation of Gravitational Waves from the Coalescence of a $2.5\text{-}4.5~M_\odot$ Compact Object and a Neutron Star

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, S. Akçay, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah , et al. (1771 additional authors not shown)

Abstract: We report the observation of a coalescing compact binary with component masses $2.5\text{-}4.5~M_\odot$ and $1.2\text{-}2.0~M_\odot$ (all measurements quoted at the 90% credible level). The gravitational-wave signal GW230529_181500 was observed during the fourth observing run of the LIGO-Virgo-KAGRA detector network on 2023 May 29 by the LIGO Livingston Observatory. The primary component of the so… ▽ More We report the observation of a coalescing compact binary with component masses $2.5\text{-}4.5~M_\odot$ and $1.2\text{-}2.0~M_\odot$ (all measurements quoted at the 90% credible level). The gravitational-wave signal GW230529_181500 was observed during the fourth observing run of the LIGO-Virgo-KAGRA detector network on 2023 May 29 by the LIGO Livingston Observatory. The primary component of the source has a mass less than $5~M_\odot$ at 99% credibility. We cannot definitively determine from gravitational-wave data alone whether either component of the source is a neutron star or a black hole. However, given existing estimates of the maximum neutron star mass, we find the most probable interpretation of the source to be the coalescence of a neutron star with a black hole that has a mass between the most massive neutron stars and the least massive black holes observed in the Galaxy. We provisionally estimate a merger rate density of $55^{+127}_{-47}~\text{Gpc}^{-3}\,\text{yr}^{-1}$ for compact binary coalescences with properties similar to the source of GW230529_181500; assuming that the source is a neutron star-black hole merger, GW230529_181500-like sources constitute about 60% of the total merger rate inferred for neutron star-black hole coalescences. The discovery of this system implies an increase in the expected rate of neutron star-black hole mergers with electromagnetic counterparts and provides further evidence for compact objects existing within the purported lower mass gap. △ Less

Submitted 26 July, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

Comments: 45 pages (10 pages author list, 13 pages main text, 1 page acknowledgements, 13 pages appendices, 8 pages bibliography), 17 figures, 16 tables. Update to match version published in The Astrophysical Journal Letters. Data products available from https://zenodo.org/records/10845779

Report number: LIGO-P2300352

Journal ref: ApJL 970, L34 (2024)

arXiv:2403.17352 [pdf, other]

On the Heating of the Slow Solar-Wind by Imbalanced Alfvén-Wave Turbulence from 0.06 au to 1 au: Parker Solar Probe and Solar Orbiter observations

Authors: Sofiane Bourouaine, Jean C. Perez, Benjamin D. G. Chandran, Vamsee K. Jagarlamudi, Nour E. Raouafi, Jasper S. Halekas

Abstract: In this work we analyze plasma and magnetic field data provided by the Parker Solar Probe (\emph{PSP}) and Solar Orbiter (\emph{SO}) missions to investigate the radial evolution of the heating of Alfvénic slow wind (ASW) by imbalanced Alfvén-Wave (AW) turbulent fluctuations from 0.06 au to 1 au. in our analysis we focus on slow solar-wind intervals with highly imbalanced and incompressible turbule… ▽ More In this work we analyze plasma and magnetic field data provided by the Parker Solar Probe (\emph{PSP}) and Solar Orbiter (\emph{SO}) missions to investigate the radial evolution of the heating of Alfvénic slow wind (ASW) by imbalanced Alfvén-Wave (AW) turbulent fluctuations from 0.06 au to 1 au. in our analysis we focus on slow solar-wind intervals with highly imbalanced and incompressible turbulence (i.e., magnetic compressibility $C_B=δB/B\leq 0.25$, plasma compressibility $C_n=δn/n\leq 0.25$ and normalized cross-helicity $σ_c\geq 0.65$). First, we estimate the AW turbulent dissipation rate from the wave energy equation and find that the radial profile trend is similar to the proton heating rate. Second, we find that the scaling of the empirical AW turbulent dissipation rate $Q_W$ obtained from the wave energy equation matches the scaling from the phenomenological AW turbulent dissipation rate $Q_{\rm CH09}$ (with $Q_{\rm CH09}\simeq 1.55 Q_W$) derived by~\cite{chandran09} based on the model of reflection-driven turbulence. Our results suggest that, as in the fast solar wind, AW turbulence plays a major role in the ion heating that occurs in incompressible slow-wind streams. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: This paper has been accepted for publication in the Astrophysical Journal Letters

arXiv:2403.16077 [pdf, ps, other]

On the Bailout Dividend Problem with Periodic Dividend Payments and Fixed Transaction Costs

Authors: Harold A. Moreno-Franco, Jose-Luis Pérez

Abstract: We study the optimal bailout dividend problem with transaction costs for an insurance company, where shareholder payouts align with the arrival times of an independent Poisson process. In this scenario, the underlying risk model follows a spectrally negative Lévy process. Our analysis confirms the optimality of a periodic $(b_{1},b_{2})$-barrier policy with classical reflection at zero. This strat… ▽ More We study the optimal bailout dividend problem with transaction costs for an insurance company, where shareholder payouts align with the arrival times of an independent Poisson process. In this scenario, the underlying risk model follows a spectrally negative Lévy process. Our analysis confirms the optimality of a periodic $(b_{1},b_{2})$-barrier policy with classical reflection at zero. This strategy involves reducing the surplus to $b_1$ when it exceeds $b_{2}$ at the Poisson arrival times and pushes the surplus to 0 whenever it goes below zero. △ Less

Submitted 24 March, 2024; originally announced March 2024.

MSC Class: 91B30; 60G51; 93E20

arXiv:2403.08882 [pdf, other]

Cultural evolution in populations of Large Language Models

Authors: Jérémy Perez, Corentin Léger, Marcela Ovando-Tellez, Chris Foulon, Joan Dussauld, Pierre-Yves Oudeyer, Clément Moulin-Frier

Abstract: Research in cultural evolution aims at providing causal explanations for the change of culture over time. Over the past decades, this field has generated an important body of knowledge, using experimental, historical, and computational methods. While computational models have been very successful at generating testable hypotheses about the effects of several factors, such as population structure o… ▽ More Research in cultural evolution aims at providing causal explanations for the change of culture over time. Over the past decades, this field has generated an important body of knowledge, using experimental, historical, and computational methods. While computational models have been very successful at generating testable hypotheses about the effects of several factors, such as population structure or transmission biases, some phenomena have so far been more complex to capture using agent-based and formal models. This is in particular the case for the effect of the transformations of social information induced by evolved cognitive mechanisms. We here propose that leveraging the capacity of Large Language Models (LLMs) to mimic human behavior may be fruitful to address this gap. On top of being an useful approximation of human cultural dynamics, multi-agents models featuring generative agents are also important to study for their own sake. Indeed, as artificial agents are bound to participate more and more to the evolution of culture, it is crucial to better understand the dynamics of machine-generated cultural evolution. We here present a framework for simulating cultural evolution in populations of LLMs, allowing the manipulation of variables known to be important in cultural evolution, such as network structure, personality, and the way social information is aggregated and transformed. The software we developed for conducting these simulations is open-source and features an intuitive user-interface, which we hope will help to build bridges between the fields of cultural evolution and generative artificial intelligence. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: 17 pages, 20 figures. Open-source code available at https://github.com/jeremyperez2/LLM-Culture

MSC Class: 68T50 ACM Class: I.2.7

arXiv:2403.07842 [pdf, other]

Quantifying and Mitigating Privacy Risks for Tabular Generative Models

Authors: Chaoyi Zhu, Jiayi Tang, Hans Brouwer, Juan F. Pérez, Marten van Dijk, Lydia Y. Chen

Abstract: Synthetic data from generative models emerges as the privacy-preserving data-sharing solution. Such a synthetic data set shall resemble the original data without revealing identifiable private information. The backbone technology of tabular synthesizers is rooted in image generative models, ranging from Generative Adversarial Networks (GANs) to recent diffusion models. Recent prior work sheds ligh… ▽ More Synthetic data from generative models emerges as the privacy-preserving data-sharing solution. Such a synthetic data set shall resemble the original data without revealing identifiable private information. The backbone technology of tabular synthesizers is rooted in image generative models, ranging from Generative Adversarial Networks (GANs) to recent diffusion models. Recent prior work sheds light on the utility-privacy tradeoff on tabular data, revealing and quantifying privacy risks on synthetic data. We first conduct an exhaustive empirical analysis, highlighting the utility-privacy tradeoff of five state-of-the-art tabular synthesizers, against eight privacy attacks, with a special focus on membership inference attacks. Motivated by the observation of high data quality but also high privacy risk in tabular diffusion, we propose DP-TLDM, Differentially Private Tabular Latent Diffusion Model, which is composed of an autoencoder network to encode the tabular data and a latent diffusion model to synthesize the latent tables. Following the emerging f-DP framework, we apply DP-SGD to train the auto-encoder in combination with batch clipping and use the separation value as the privacy metric to better capture the privacy gain from DP algorithms. Our empirical evaluation demonstrates that DP-TLDM is capable of achieving a meaningful theoretical privacy guarantee while also significantly enhancing the utility of synthetic data. Specifically, compared to other DP-protected tabular generative models, DP-TLDM improves the synthetic quality by an average of 35% in data resemblance, 15% in the utility for downstream tasks, and 50% in data discriminability, all while preserving a comparable level of privacy risk. △ Less

Submitted 12 March, 2024; originally announced March 2024.

arXiv:2403.07796 [pdf, other]

doi 10.1016/j.nima.2024.169480

Second gadolinium loading to Super-Kamiokande

Authors: K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, M. Shiozawa , et al. (225 additional authors not shown)

Abstract: The first loading of gadolinium (Gd) into Super-Kamiokande in 2020 was successful, and the neutron capture efficiency on Gd reached 50\%. To further increase the Gd neutron capture efficiency to 75\%, 26.1 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was additionally loaded into Super-Kamiokande (SK) from May 31 to July 4, 2022. As the amount of loaded $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was do… ▽ More The first loading of gadolinium (Gd) into Super-Kamiokande in 2020 was successful, and the neutron capture efficiency on Gd reached 50\%. To further increase the Gd neutron capture efficiency to 75\%, 26.1 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was additionally loaded into Super-Kamiokande (SK) from May 31 to July 4, 2022. As the amount of loaded $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was doubled compared to the first loading, the capacity of the powder dissolving system was doubled. We also developed new batches of gadolinium sulfate with even further reduced radioactive impurities. In addition, a more efficient screening method was devised and implemented to evaluate these new batches of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$. Following the second loading, the Gd concentration in SK was measured to be $333.5\pm2.5$ ppm via an Atomic Absorption Spectrometer (AAS). From the mean neutron capture time constant of neutrons from an Am/Be calibration source, the Gd concentration was independently measured to be 332.7 $\pm$ 6.8(sys.) $\pm$ 1.1(stat.) ppm, consistent with the AAS result. Furthermore, during the loading the Gd concentration was monitored continually using the capture time constant of each spallation neutron produced by cosmic-ray muons,and the final neutron capture efficiency was shown to become 1.5 times higher than that of the first loaded phase, as expected. △ Less

Submitted 18 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

Comments: 34 pages, 13 figures, submitted to Nuclear Inst. and Methods in Physics Research, A

Journal ref: Nuclear Inst. and Methods in Physics Research, A 1065 (2024) 169480

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2402.00823 [pdf, other]

SLIM: Skill Learning with Multiple Critics

Authors: David Emukpere, Bingbing Wu, Julien Perez, Jean-Michel Renders

Abstract: Self-supervised skill learning aims to acquire useful behaviors that leverage the underlying dynamics of the environment. Latent variable models, based on mutual information maximization, have been successful in this task but still struggle in the context of robotic manipulation. As it requires impacting a possibly large set of degrees of freedom composing the environment, mutual information maxim… ▽ More Self-supervised skill learning aims to acquire useful behaviors that leverage the underlying dynamics of the environment. Latent variable models, based on mutual information maximization, have been successful in this task but still struggle in the context of robotic manipulation. As it requires impacting a possibly large set of degrees of freedom composing the environment, mutual information maximization fails alone in producing useful and safe manipulation behaviors. Furthermore, tackling this by augmenting skill discovery rewards with additional rewards through a naive combination might fail to produce desired behaviors. To address this limitation, we introduce SLIM, a multi-critic learning approach for skill discovery with a particular focus on robotic manipulation. Our main insight is that utilizing multiple critics in an actor-critic framework to gracefully combine multiple reward functions leads to a significant improvement in latent-variable skill discovery for robotic manipulation while overcoming possible interference occurring among rewards which hinders convergence to useful skills. Furthermore, in the context of tabletop manipulation, we demonstrate the applicability of our novel skill discovery approach to acquire safe and efficient motor primitives in a hierarchical reinforcement learning fashion and leverage them through planning, significantly surpassing baseline approaches for skill discovery. △ Less

Submitted 21 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Accepted at IEEE ICRA 2024

arXiv:2401.16434 [pdf]

doi 10.1016/j.egyr.2023.01.039

A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system

Authors: Dinanath Prasad, Narendra Kumar, Rakhi Sharma, Hasmat Malik, Fausto Pedro García Márquez, Jesús María Pinar Pérez

Abstract: An adaptive control approach for a three-phase grid-interfaced solar photovoltaic system based on the new Neuro-Fuzzy Inference System with Rain Optimization Algorithm (ANROA) methodology is proposed and discussed in this manuscript. This method incorporates an Adaptive Neuro-fuzzy Inference System (ANFIS) with a Rain Optimization Algorithm (ROA). The ANFIS controller has excellent maximum trackin… ▽ More An adaptive control approach for a three-phase grid-interfaced solar photovoltaic system based on the new Neuro-Fuzzy Inference System with Rain Optimization Algorithm (ANROA) methodology is proposed and discussed in this manuscript. This method incorporates an Adaptive Neuro-fuzzy Inference System (ANFIS) with a Rain Optimization Algorithm (ROA). The ANFIS controller has excellent maximum tracking capability because it includes features of both neural and fuzzy techniques. The ROA technique is in charge of controlling the voltage source converter switching. Avoiding power quality problems including voltage fluctuations, harmonics, and flickers as well as unbalanced loads and reactive power usage is the major goal. Besides, the proposed method performs at zero voltage regulation and unity power factor modes. The suggested control approach has been modeled and simulated, and its performance has been assessed using existing alternative methods. A statistical analysis of proposed and existing techniques has been also presented and discussed. The results of the simulations demonstrate that, when compared to alternative approaches, the suggested strategy may properly and effectively identify the best global solutions. Furthermore, the system's robustness has been studied by using MATLAB/SIMULINK environment and experimentally by Field Programmable Gate Arrays Controller (FPGA)-based Hardware-in-Loop (HLL). △ Less

Submitted 26 January, 2024; originally announced January 2024.

Comments: The paper was published in Energy Reports journal (ELSEVIER). Cite as: Prasad, D., Kumar, N., Sharma, R., Malik, H., Márquez, F. P. G., & Pinar-Pérez, J. M. (2023). A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system. Energy Reports, 9, 2044-2057

Journal ref: Energy Reports (2023) Elsevier

arXiv:2401.14763 [pdf, ps, other]

Comparing Session Type Systems derived from Linear Logic

Authors: Bas van den Heuvel, Jorge A. Pérez

Abstract: Session types are a typed approach to message-passing concurrency, where types describe sequences of intended exchanges over channels. Session type systems have been given strong logical foundations via Curry-Howard correspondences with linear logic, a resource-aware logic that naturally captures structured interactions. These logical foundations provide an elegant framework to specify and (static… ▽ More Session types are a typed approach to message-passing concurrency, where types describe sequences of intended exchanges over channels. Session type systems have been given strong logical foundations via Curry-Howard correspondences with linear logic, a resource-aware logic that naturally captures structured interactions. These logical foundations provide an elegant framework to specify and (statically) verify message-passing processes. In this paper, we rigorously compare different type systems for concurrency derived from the Curry-Howard correspondence between linear logic and session types. We address the main divide between these type systems: the classical and intuitionistic presentations of linear logic. Over the years, these presentations have given rise to separate research strands on logical foundations for concurrency; the differences between their derived type systems have only been addressed informally. To formally assess these differences, we develop $π\mathsf{ULL}$, a session type system that encompasses type systems derived from classical and intuitionistic interpretations of linear logic. Based on a fragment of Girard's Logic of Unity, $π\mathsf{ULL}$ provides a basic reference framework: we compare existing session type systems by characterizing fragments of $π\mathsf{ULL}$ that coincide with classical and intuitionistic formulations. We analyze the significance of our characterizations by considering the locality principle (enforced by intuitionistic interpretations but not by classical ones) and forms of process composition induced by the interpretations. △ Less

Submitted 22 August, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: Preprint to appear in JLAMP; revised/extended version of https://doi.org/10.4204/EPTCS.314.1

arXiv:2401.08251 [pdf]

doi 10.1016/j.rser.2022.112753

A techno-economic model for avoiding conflicts of interest between owners of offshore wind farms and maintenance suppliers

Authors: Alberto Pliego Marugán, Fausto Pedro García Márquez, Jesús María Pinar Pérez

Abstract: Currently, wind energy is one of the most important sources of renewable energy. Offshore locations for wind turbines are increasingly exploited because of their numerous advantages. However, offshore wind farms require high investment in maintenance service. Due to its complexity and special requirements, maintenance service is usually outsourced by wind farm owners. In this paper, we propose a n… ▽ More Currently, wind energy is one of the most important sources of renewable energy. Offshore locations for wind turbines are increasingly exploited because of their numerous advantages. However, offshore wind farms require high investment in maintenance service. Due to its complexity and special requirements, maintenance service is usually outsourced by wind farm owners. In this paper, we propose a novel approach to determine, quantify, and reduce the possible conflicts of interest between owners and maintenance suppliers. We created a complete techno-economic model to address this problem from an impartial point of view. An iterative process was developed to obtain statistical results that can help stakeholders negotiate the terms of the contract, in which the availability of the wind farm is the reference parameter by which to determine penalisations and incentives. Moreover, a multi-objective programming problem was addressed that maximises the profits of both parties without losing the alignment of their interests. The main scientific contribution of this paper is the maintenance analysis of offshore wind farms from two perspectives: that of the owner and the maintenance supplier. This analysis evaluates the conflicts of interest of both parties. In addition, we demonstrate that proper adjustment of some parameters, such as penalisation, incentives, and resources, and adequate control of availability can help reduce this conflict of interests. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: Published in Renewable and Sustainable Energy Reviews (ELSEVIER) 10 July 2022. DOI: https://doi.org/10.1016/j.rser.2022.112753 Cite as: Marugán, A. P., Márquez, F. P. G., & Pérez, J. M. P. (2022). A techno-economic model for avoiding conflicts of interest between owners of offshore wind farms and maintenance suppliers. Renewable and Sustainable Energy Reviews, 168, 112753

arXiv:2312.17068 [pdf, ps, other]

doi 10.1007/978-3-031-43502-7_11

Orbifolds and the modular curve

Authors: Juan Martín Pérez, Florent Schaffhauser

Abstract: We provide an account of the construction of the moduli stack of elliptic curves as an analytic orbifold. While intimately linked to Thurston's point of view on the subject (discrete groups acting properly and effectively on differentiable manifolds), the construction of the modular orbi-curve and its universal family of elliptic curves ends up requiring a bit more technology, in order to allow fo… ▽ More We provide an account of the construction of the moduli stack of elliptic curves as an analytic orbifold. While intimately linked to Thurston's point of view on the subject (discrete groups acting properly and effectively on differentiable manifolds), the construction of the modular orbi-curve and its universal family of elliptic curves ends up requiring a bit more technology, in order to allow for non-effective actions. The paper is entirely expository and makes no claims to originality: its main goal is to be self-contained enough in order to be useful to young researchers who are entering the field and are interested in the interactions between differential and algebraic geometry. △ Less

Submitted 28 December, 2023; originally announced December 2023.

Comments: To appear in 'In the tradition of Thurston III. Geometry and dynamics'

Journal ref: In: Ohshika, K., Papadopoulos, A. (eds) In the Tradition of Thurston III. Springer, Cham (2024)

arXiv:2312.12487 [pdf, other]

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

Authors: Angela Castillo, Jonas Kohler, Juan C. Pérez, Juan Pablo Pérez, Albert Pumarola, Bernard Ghanem, Pablo Arbeláez, Ali Thabet

Abstract: This paper presents a comprehensive study on the role of Classifier-Free Guidance (CFG) in text-conditioned diffusion models from the perspective of inference efficiency. In particular, we relax the default choice of applying CFG in all diffusion steps and instead search for efficient guidance policies. We formulate the discovery of such policies in the differentiable Neural Architecture Search fr… ▽ More This paper presents a comprehensive study on the role of Classifier-Free Guidance (CFG) in text-conditioned diffusion models from the perspective of inference efficiency. In particular, we relax the default choice of applying CFG in all diffusion steps and instead search for efficient guidance policies. We formulate the discovery of such policies in the differentiable Neural Architecture Search framework. Our findings suggest that the denoising steps proposed by CFG become increasingly aligned with simple conditional steps, which renders the extra neural network evaluation of CFG redundant, especially in the second half of the denoising process. Building upon this insight, we propose "Adaptive Guidance" (AG), an efficient variant of CFG, that adaptively omits network evaluations when the denoising process displays convergence. Our experiments demonstrate that AG preserves CFG's image quality while reducing computation by 25%. Thus, AG constitutes a plug-and-play alternative to Guidance Distillation, achieving 50% of the speed-ups of the latter while being training-free and retaining the capacity to handle negative prompts. Finally, we uncover further redundancies of CFG in the first half of the diffusion process, showing that entire neural function evaluations can be replaced by simple affine transformations of past score estimates. This method, termed LinearAG, offers even cheaper inference at the cost of deviating from the baseline model. Our findings provide insights into the efficiency of the conditional denoising process that contribute to more practical and swift deployment of text-conditioned diffusion models. △ Less

Submitted 19 December, 2023; originally announced December 2023.

arXiv:2312.11075 [pdf, other]

doi 10.18653/v1/2024.acl-long.622

Split and Rephrase with Large Language Models

Authors: David Ponce, Thierry Etchegoyhen, Jesús Calleja Pérez, Harritxu Gete

Abstract: The Split and Rephrase (SPRP) task, which consists in splitting complex sentences into a sequence of shorter grammatical sentences, while preserving the original meaning, can facilitate the processing of complex texts for humans and machines alike. It is also a valuable testbed to evaluate natural language processing models, as it requires modelling complex grammatical aspects. In this work, we ev… ▽ More The Split and Rephrase (SPRP) task, which consists in splitting complex sentences into a sequence of shorter grammatical sentences, while preserving the original meaning, can facilitate the processing of complex texts for humans and machines alike. It is also a valuable testbed to evaluate natural language processing models, as it requires modelling complex grammatical aspects. In this work, we evaluate large language models on the task, showing that they can provide large improvements over the state of the art on the main metrics, although still lagging in terms of splitting compliance. Results from two human evaluations further support the conclusions drawn from automated metric results. We provide a comprehensive study that includes prompting variants, domain shift, fine-tuned pretrained language models of varying parameter size and training data volumes, contrasted with both zero-shot and few-shot approaches on instruction-tuned language models. Although the latter were markedly outperformed by fine-tuned models, they may constitute a reasonable off-the-shelf alternative. Our results provide a fine-grained analysis of the potential and limitations of large language models for SPRP, with significant improvements achievable using relatively small amounts of training data and model parameters overall, and remaining limitations for all models on the task. △ Less

Submitted 3 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.07430 [pdf, other]

Flavon vacuum alignment beyond SUSY

Authors: Claudia Hagedorn, M. L. López-Ibáñez, M. Jay Pérez, Moinul Hossain Rahat, Oscar Vives

Abstract: In flavor models the vacuum alignment of flavons is typically achieved via the $F$-terms of certain fields in the supersymmetric limit. We propose a method for preserving such alignments, up to a rescaling of the vacuum expectation values, even after supersymmetry (and the flavor symmetry) are softly broken, facilitating the vacuum alignment in models which are non-supersymmetric at low energies.… ▽ More In flavor models the vacuum alignment of flavons is typically achieved via the $F$-terms of certain fields in the supersymmetric limit. We propose a method for preserving such alignments, up to a rescaling of the vacuum expectation values, even after supersymmetry (and the flavor symmetry) are softly broken, facilitating the vacuum alignment in models which are non-supersymmetric at low energies. Examples of models with different flavor groups, namely $A_4$, $T_7$, $S_4$ and $Δ(27)$, are discussed. △ Less

Submitted 12 December, 2023; originally announced December 2023.

Comments: 28 pages + references

Report number: IFIC/23-25, FTUV/23-0718

arXiv:2311.18112 [pdf, ps, other]

Detection of an Arbitrary Number of Communities in a Block Spin Ising Model

Authors: Miguel Ballesteros, Ramsés H. Mena, José Luis Pérez, Gabor Toth

Abstract: We study the problem of community detection in a general version of the block spin Ising model featuring M groups, a model inspired by the Curie-Weiss model of ferromagnetism in statistical mechanics. We solve the general problem of identifying any number of groups with any possible coupling constants. Up to now, the problem was only solved for the specific situation with two groups of identical s… ▽ More We study the problem of community detection in a general version of the block spin Ising model featuring M groups, a model inspired by the Curie-Weiss model of ferromagnetism in statistical mechanics. We solve the general problem of identifying any number of groups with any possible coupling constants. Up to now, the problem was only solved for the specific situation with two groups of identical size and identical interactions. Our results can be applied to the most realistic situations, in which there are many groups of different sizes and different interactions. In addition, we give an explicit algorithm that permits the reconstruction of the structure of the model from a sample of observations based on the comparison of empirical correlations of the spin variables, thus unveiling easy applications of the model to real-world voting data and communities in biology. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: 31 pages

MSC Class: 62H22; 82B20; 60F05

arXiv:2311.16836 [pdf, other]

doi 10.1103/PhysRevD.109.023537

A unified TDiff invariant field theory for the dark sector

Authors: David Alonso-López, Javier de Cruz Pérez, Antonio L. Maroto

Abstract: In this work we present a unified model for the cosmological dark sector. The theory is based on a simple minimally coupled scalar field whose action only contains a canonical kinetic term and is invariant under transverse diffeomorphisms (TDiff). The model has the same number of free parameters as $Λ$CDM. We confront the predictions of the model at the background level with data from Planck 2018… ▽ More In this work we present a unified model for the cosmological dark sector. The theory is based on a simple minimally coupled scalar field whose action only contains a canonical kinetic term and is invariant under transverse diffeomorphisms (TDiff). The model has the same number of free parameters as $Λ$CDM. We confront the predictions of the model at the background level with data from Planck 2018 CMB distance priors, Pantheon+ and SH0ES SNIa distance moduli, BAO data points from 6dFGS, BOSS, eBOSS and DES and measurements of the Hubble parameter from cosmic chronometers. The model provides excellent results in the joint fit analysis, showing very strong evidence compared to $Λ$CDM in the deviance information criterion (DIC). We also show that the Hubble tension between Planck 2018 and SH0ES measurements can be alleviated in the unified TDiff model although further analysis is still needed. △ Less

Submitted 4 February, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

Comments: 17 pages, 8 figures. Final version published in PRD

Report number: IPARCOS-UCM-23-130

arXiv:2311.09957 [pdf, other]

The distribution amplitude of the $η_c$ meson

Authors: Miguel Teseo San José Pérez, Benoît Blossier, Mariane Mangin-Brinet, José Manuel Morgado Chávez

Abstract: We report on the first lattice determination of the pseudoscalar meson $η_c$ light-cone distribution amplitude, using a set of three CLS $N_f=2$ ensembles at a pion mass $m_π \sim 270~\text{MeV}$ and lattice spacings $a \sim 0.076~\text{fm}$, $0.066~\text{fm}$ and $0.049~\text{fm}$. Employing Short Distance Factorization, we extract the pseudo-DA on the lattice for Ioffe times $ν\leq 4.5$, and the… ▽ More We report on the first lattice determination of the pseudoscalar meson $η_c$ light-cone distribution amplitude, using a set of three CLS $N_f=2$ ensembles at a pion mass $m_π \sim 270~\text{MeV}$ and lattice spacings $a \sim 0.076~\text{fm}$, $0.066~\text{fm}$ and $0.049~\text{fm}$. Employing Short Distance Factorization, we extract the pseudo-DA on the lattice for Ioffe times $ν\leq 4.5$, and the various lattice spacings allow us to take the continuum limit. We employ a basis of Jacobi polynomials to parametrize the distribution amplitude, which allows to express the matching to the pseudo distribution in closed form, and we observe a strong effect which we attribute to the heavy charm-quark mass. △ Less

Submitted 20 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

Comments: Contribution to the 40th International Symposium on Lattice Field Theory (Lattice 2023), July 31st - August 4th, 2023. Fermi National Accelerator Laboratory

arXiv:2311.07894 [pdf]

Security in Drones

Authors: Jonathan Morgan, Julio Perez, Jordan Wade, Sundar Krishnan

Abstract: Drones are used in our everyday world for private, commercial, and government uses. It is important to establish both the cyber threats drone users face and security practices to combat those threats. Privacy will always be the main concern when using drones. Protecting information legally collected on drones and protecting people from the illegal collection of their data are topics that security… ▽ More Drones are used in our everyday world for private, commercial, and government uses. It is important to establish both the cyber threats drone users face and security practices to combat those threats. Privacy will always be the main concern when using drones. Protecting information legally collected on drones and protecting people from the illegal collection of their data are topics that security professionals should consider before their organization uses drones. In this article, the authors discuss the importance of security in drones. △ Less

Submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.02705 [pdf, other]

doi 10.1051/0004-6361/202346847

MELCHIORS: The Mercator Library of High Resolution Stellar Spectroscopy

Authors: P. Royer, T. Merle, K. Dsilva, S. Sekaran, H. Van Winckel, Y. Frémat, M. Van der Swaelmen, S. Gebruers, A. Tkachenko, M. Laverick, M. Dirickx, G. Raskin, H. Hensberge, M. Abdul-Masih, B. Acke, M. L. Alonso, S. Bandhu Mahato, P. G. Beck, N. Behara, S. Bloemen, B. Buysschaert, N. Cox, J. Debosscher, P. De Cat, P. Degroote , et al. (49 additional authors not shown)

Abstract: Over the past decades, libraries of stellar spectra have been used in a large variety of science cases, including as sources of reference spectra for a given object or a given spectral type. Despite the existence of large libraries and the increasing number of projects of large-scale spectral surveys, there is to date only one very high-resolution spectral library offering spectra from a few hundr… ▽ More Over the past decades, libraries of stellar spectra have been used in a large variety of science cases, including as sources of reference spectra for a given object or a given spectral type. Despite the existence of large libraries and the increasing number of projects of large-scale spectral surveys, there is to date only one very high-resolution spectral library offering spectra from a few hundred objects from the southern hemisphere (UVES-POP) . We aim to extend the sample, offering a finer coverage of effective temperatures and surface gravity with a uniform collection of spectra obtained in the northern hemisphere. Between 2010 and 2020, we acquired several thousand echelle spectra of bright stars with the Mercator-HERMES spectrograph located in the Roque de Los Muchachos Observatory in La Palma, whose pipeline offers high-quality data reduction products. We have also developed methods to correct for the instrumental response in order to approach the true shape of the spectral continuum. Additionally, we have devised a normalisation process to provide a homogeneous normalisation of the full spectral range for most of the objects. We present a new spectral library consisting of 3256 spectra covering 2043 stars. It combines high signal-to-noise and high spectral resolution over the entire range of effective temperatures and luminosity classes. The spectra are presented in four versions: raw, corrected from the instrumental response, with and without correction from the atmospheric molecular absorption, and normalised (including the telluric correction). △ Less

Submitted 5 November, 2023; originally announced November 2023.

Comments: 17 pages, 18 figures Preview and access to the library: https://www.royer.se/melchiors.html

Journal ref: A&A 681, A107 (2024)

arXiv:2310.19075 [pdf, other]

Bespoke Solvers for Generative Flow Models

Authors: Neta Shaul, Juan Perez, Ricky T. Q. Chen, Ali Thabet, Albert Pumarola, Yaron Lipman

Abstract: Diffusion or flow-based models are powerful generative paradigms that are notoriously hard to sample as samples are defined as solutions to high-dimensional Ordinary or Stochastic Differential Equations (ODEs/SDEs) which require a large Number of Function Evaluations (NFE) to approximate well. Existing methods to alleviate the costly sampling process include model distillation and designing dedica… ▽ More Diffusion or flow-based models are powerful generative paradigms that are notoriously hard to sample as samples are defined as solutions to high-dimensional Ordinary or Stochastic Differential Equations (ODEs/SDEs) which require a large Number of Function Evaluations (NFE) to approximate well. Existing methods to alleviate the costly sampling process include model distillation and designing dedicated ODE solvers. However, distillation is costly to train and sometimes can deteriorate quality, while dedicated solvers still require relatively large NFE to produce high quality samples. In this paper we introduce "Bespoke solvers", a novel framework for constructing custom ODE solvers tailored to the ODE of a given pre-trained flow model. Our approach optimizes an order consistent and parameter-efficient solver (e.g., with 80 learnable parameters), is trained for roughly 1% of the GPU time required for training the pre-trained model, and significantly improves approximation and generation quality compared to dedicated solvers. For example, a Bespoke solver for a CIFAR10 model produces samples with Fréchet Inception Distance (FID) of 2.73 with 10 NFE, and gets to 1% of the Ground Truth (GT) FID (2.59) for this model with only 20 NFE. On the more challenging ImageNet-64$\times$64, Bespoke samples at 2.2 FID with 10 NFE, and gets within 2% of GT FID (1.71) with 20 NFE. △ Less

Submitted 29 October, 2023; originally announced October 2023.

arXiv:2310.17226 [pdf]

In situ digestion of canola protein gel observed by synchrotron X-Ray Scattering

Authors: Maja Napieraj, Annie Brûlet, Javier Perez, François Boué, Evelyne Lutton

Abstract: We address the issue of structure changes of a canola protein gel (as a solid food model) during gastrointestinal digestion. We present a method for synchrotron Small-Angle X-ray Scattering analysis of the digestion of a gel in a capillary. Scanning the capillary allows tracking the digestion under diffusion of enzymatic juices. The fitting parameters characterizing the sizes, scattering intensiti… ▽ More We address the issue of structure changes of a canola protein gel (as a solid food model) during gastrointestinal digestion. We present a method for synchrotron Small-Angle X-ray Scattering analysis of the digestion of a gel in a capillary. Scanning the capillary allows tracking the digestion under diffusion of enzymatic juices. The fitting parameters characterizing the sizes, scattering intensities and structures allow to distinguish the compact, unfolded or aggregated states of proteins. The evolutions of these parameters enable to detail the complex changes of proteins during gel digestion, involving back-and-forth evolutions with proteins unfolding (1 st and 3 rd steps), re-compaction (2 nd step) due to gastrointestinal pH and enzyme actions, before final protein scissions (4 th step) resulting in small peptides. This complexity is related to the wide ranges of successive pH and enzyme activity acting on large and charged protein assemblies. Digestion is therefore impacted by the conditions of food preparation. △ Less

Submitted 26 October, 2023; originally announced October 2023.

arXiv:2310.12041 [pdf, other]

Simulating high-pressure surface reactions with molecular beams

Authors: Amjad Al Taleb, Frederik Schiller, Denis V. Vyalikh, José Maria Pérez, Sabine V. Auras, Daniel Farías, J. Enrique Ortega

Abstract: Using a reactive molecular beam with high kinetic energy ($E_{kin}$) it is possible to speed gas-surface reactions involving high activation barriers ($E_{act}$), which would require elevated pressures ($P_0$) if a random gas with a Maxwell-Boltzmann distribution is used. By simply computing the number of molecules that overcome the activation barrier in a random gas at $P_0$ and in a molecular be… ▽ More Using a reactive molecular beam with high kinetic energy ($E_{kin}$) it is possible to speed gas-surface reactions involving high activation barriers ($E_{act}$), which would require elevated pressures ($P_0$) if a random gas with a Maxwell-Boltzmann distribution is used. By simply computing the number of molecules that overcome the activation barrier in a random gas at $P_0$ and in a molecular beam at $E_{kin}$=$E_{act}$, we establish an $E_{kin}$-$P_0$ equivalence curve, through which we postulate that molecular beams are ideal tools to investigate gas-surface reactions that involve high activation energies. In particular, we foresee the use of molecular beams to simulate gas surface reactions within the industrial-range ($>$ 10 bar) using surface-sensitive Ultra-High Vacuum (UHV) techniques, such as X-ray photoemission spectroscopy (XPS). To test this idea, we revisit the oxidation of the Cu(111) surface combining O$_2$ molecular beams and XPS experiments. By tuning the kinetic energy of the O$_2$ beam in the range 0.24-1 eV we achieve the same sequence of surface oxides obtained in Ambient Pressure Photoemission (AP-XPS) experiments, in which the Cu(111) surface was exposed to a random O$_2$ gas up to 1 mbar. We observe the same surface oxidation kinetics as in the random gas, but with a much lower dose, close to the expected value derived from the equivalence curve. △ Less

Submitted 18 October, 2023; originally announced October 2023.

Showing 1–50 of 743 results for author: Perez, J