-
Merian: A Wide-Field Imaging Survey of Dwarf Galaxies at z~0.06-0.10
Authors:
Shany Danieli,
Erin Kado-Fong,
Song Huang,
Yifei Luo,
Ting S Li,
Lee S Kelvin,
Alexie Leauthaud,
Jenny E. Greene,
Abby Mintz,
Xiaojing Lin,
Jiaxuan Li,
Vivienne Baldassare,
Arka Banerjee,
Joy Bhattacharyya,
Diana Blanco,
Alyson Brooks,
Zheng Cai,
Xinjun Chen,
Akaxia Cruz,
Robel Geda,
Runquan Guan,
Sean Johnson,
Arun Kannawadi,
Stacy Y. Kim,
Mingyu Li
, et al. (10 additional authors not shown)
Abstract:
We present the Merian Survey, an optical imaging survey optimized for studying the physical properties of bright star-forming dwarf galaxies. Merian is carried out with two medium-band filters ($N708$ and $N540$, centered at $708$ and $540$ nm), custom-built for the Dark Energy Camera (DECam) on the Blanco telescope. Merian covers $\sim 750\,\mathrm{deg}^2$ of equatorial fields, overlapping with t…
▽ More
We present the Merian Survey, an optical imaging survey optimized for studying the physical properties of bright star-forming dwarf galaxies. Merian is carried out with two medium-band filters ($N708$ and $N540$, centered at $708$ and $540$ nm), custom-built for the Dark Energy Camera (DECam) on the Blanco telescope. Merian covers $\sim 750\,\mathrm{deg}^2$ of equatorial fields, overlapping with the Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP) wide, deep, and ultra-deep fields. When combined with the HSC-SSP imaging data ($grizy$), the new Merian DECam medium-band imaging allows for photometric redshift measurements via the detection of H$\rmα$ and [OIII] line emission flux excess in the $N708$ and $N540$ filters, respectively, at $0.06<z<0.10$. We present an overview of the survey design, observations taken to date, data reduction using the LSST Science Pipelines, including aperture-matched photometry for accurate galaxy colors, and a description of the data included in the first data release (DR1). The key science goals of Merian include: probing the dark matter halos of dwarf galaxies out to their virial radii using high signal-to-noise weak lensing profile measurements, decoupling the effects of baryonic processes from dark matter, and understanding the role of black holes in dwarf galaxy evolution. This rich dataset will also offer unique opportunities for studying extremely metal-poor galaxies via their strong [OIII] emission and H$\rmα$ lines, as well as [OIII] emitters at $z\sim 0.4$, and Ly$\rmα$ emitters at $z\sim 3.3$ and $z\sim 4.8$. Merian showcases the power of utilizing narrow and medium-band filters alongside broad-band filters for sky imaging, demonstrating their synergistic capacity to unveil astrophysical insights across diverse astrophysical phenomena.
△ Less
Submitted 8 October, 2024; v1 submitted 2 October, 2024;
originally announced October 2024.
-
Cross Correlating the Unresolved Gamma-Ray Background with Cosmic Large-Scale Structure from DESI: Implications for Astrophysics and Dark Matter
Authors:
Bei Zhou,
José Luis Bernal,
Elena Pinetti,
Hector Afonso G. Cruz,
Marc Kamionkowski
Abstract:
The unresolved gamma-ray background (UGRB) is a diffuse gamma-ray emission arising from numerous extragalactic sources below the detection threshold and is an important component of the gamma-ray sky. Studying the UGRB is crucial for understanding high-energy astrophysical processes in the universe and for probing fundamental physics, such as the nature of dark matter. In this work, we forecast th…
▽ More
The unresolved gamma-ray background (UGRB) is a diffuse gamma-ray emission arising from numerous extragalactic sources below the detection threshold and is an important component of the gamma-ray sky. Studying the UGRB is crucial for understanding high-energy astrophysical processes in the universe and for probing fundamental physics, such as the nature of dark matter. In this work, we forecast the cross-correlation between the UGRB and galaxy catalogs from the Dark Energy Spectroscopic Instrument (DESI) survey. First, we study the expected astrophysical contributions to the UGRB and their cross-correlation with DESI spectroscopic galaxies. Our calculations show that the cross-correlation signal-to-noise ratio is expected to be significant, with the highest value predicted to be 20.6 for DESI luminous red galaxies due to a higher predicted overlap in the redshift distribution with the UGRB. We consider two science cases that the UGRB-spectroscopic galaxies cross-correlation can be applied to: 1) measuring the UGRB flux as a function of redshift, achieving a precision of 10\% in some redshift bins, and 2) searching for annihilating dark matter potentially up to a mass of about 300~GeV, three times higher than the currently strongest constraints. This work underscores the importance of cross correlating the UGRB with cosmic large-scale structure tracers and highlights the multiwavelength approaches to advancing our understanding of high-energy astrophysical phenomena and fundamental physics.
△ Less
Submitted 30 September, 2024;
originally announced October 2024.
-
Data Augmentation for 3DMM-based Arousal-Valence Prediction for HRI
Authors:
Christian Arzate Cruz,
Yotam Sechayk,
Takeo Igarashi,
Randy Gomez
Abstract:
Humans use multiple communication channels to interact with each other. For instance, body gestures or facial expressions are commonly used to convey an intent. The use of such non-verbal cues has motivated the development of prediction models. One such approach is predicting arousal and valence (AV) from facial expressions. However, making these models accurate for human-robot interaction (HRI) s…
▽ More
Humans use multiple communication channels to interact with each other. For instance, body gestures or facial expressions are commonly used to convey an intent. The use of such non-verbal cues has motivated the development of prediction models. One such approach is predicting arousal and valence (AV) from facial expressions. However, making these models accurate for human-robot interaction (HRI) settings is challenging as it requires handling multiple subjects, challenging conditions, and a wide range of facial expressions. In this paper, we propose a data augmentation (DA) technique to improve the performance of AV predictors using 3D morphable models (3DMM). We then utilize this approach in an HRI setting with a mediator robot and a group of three humans. Our augmentation method creates synthetic sequences for underrepresented values in the AV space of the SEWA dataset, which is the most comprehensive dataset with continuous AV labels. Results show that using our DA method improves the accuracy and robustness of AV prediction in real-time applications. The accuracy of our models on the SEWA dataset is 0.793 for arousal and valence.
△ Less
Submitted 30 September, 2024;
originally announced October 2024.
-
Estimation and imputation of missing data in longitudinal models with Zero-Inflated Poisson response variable
Authors:
D. S. Martinez-Lobo,
O. O. Melo,
N. A. Cruz
Abstract:
This research deals with the estimation and imputation of missing data in longitudinal models with a Poisson response variable inflated with zeros. A methodology is proposed that is based on the use of maximum likelihood, assuming that data is missing at random and that there is a correlation between the response variables. In each of the times, the expectation maximization (EM) algorithm is used:…
▽ More
This research deals with the estimation and imputation of missing data in longitudinal models with a Poisson response variable inflated with zeros. A methodology is proposed that is based on the use of maximum likelihood, assuming that data is missing at random and that there is a correlation between the response variables. In each of the times, the expectation maximization (EM) algorithm is used: in step E, a weighted regression is carried out, conditioned on the previous times that are taken as covariates. In step M, the estimation and imputation of the missing data are performed. The good performance of the methodology in different loss scenarios is demonstrated in a simulation study comparing the model only with complete data, and estimating missing data using the mode of the data of each individual. Furthermore, in a study related to the growth of corn, it is tested on real data to develop the algorithm in a practical scenario.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Joint spatial modeling of mean and non-homogeneous variance combining semiparametric SAR and GAMLSS models for hedonic prices
Authors:
J. D. Toloza-Delgado,
O. O. Melo,
N. A. Cruz
Abstract:
In the context of spatial econometrics, it is very useful to have methodologies that allow modeling the spatial dependence of the observed variables and obtaining more precise predictions of both the mean and the variability of the response variable, something very useful in territorial planning and public policies. This paper proposes a new methodology that jointly models the mean and the varianc…
▽ More
In the context of spatial econometrics, it is very useful to have methodologies that allow modeling the spatial dependence of the observed variables and obtaining more precise predictions of both the mean and the variability of the response variable, something very useful in territorial planning and public policies. This paper proposes a new methodology that jointly models the mean and the variance. Also, it allows to model the spatial dependence of the dependent variable as a function of covariates and to model the semiparametric effects in both models. The algorithms developed are based on generalized additive models that allow the inclusion of non-parametric terms in both the mean and the variance, maintaining the traditional theoretical framework of spatial regression. The theoretical developments of the estimation of this model are carried out, obtaining desirable statistical properties in the estimators. A simulation study is developed to verify that the proposed method has a remarkable predictive capacity in terms of the mean square error and shows a notable improvement in the estimation of the spatial autoregressive parameter, compared to other traditional methods and some recent developments. The model is also tested on data from the construction of a hedonic price model for the city of Bogota, highlighting as the main result the ability to model the variability of housing prices, and the wealth in the analysis obtained.
△ Less
Submitted 13 September, 2024;
originally announced September 2024.
-
Machine Learning for Reducing Noise in RF Control Signals at Industrial Accelerators
Authors:
M. Henderson,
J. P. Edelen,
J. Einstein-Curtis,
C. C. Hall,
J. A. Diaz Cruz,
A. L. Edelen
Abstract:
Industrial particle accelerators typically operate in dirtier environments than research accelerators, leading to increased noise in RF and electronic systems. Furthermore, given that industrial accelerators are mass produced, less attention is given to optimizing the performance of individual systems. As a result, industrial accelerators tend to underperform their own hardware capabilities. Impro…
▽ More
Industrial particle accelerators typically operate in dirtier environments than research accelerators, leading to increased noise in RF and electronic systems. Furthermore, given that industrial accelerators are mass produced, less attention is given to optimizing the performance of individual systems. As a result, industrial accelerators tend to underperform their own hardware capabilities. Improving signal processing for these machines will improve cost and time margins for deployment, helping to meet the growing demand for accelerators for medical sterilization, food irradiation, cancer treatment, and imaging. Our work focuses on using machine learning techniques to reduce noise in RF signals used for pulse-to-pulse feedback in industrial accelerators. Here we review our algorithms and observed results for simulated RF systems, and discuss next steps with the ultimate goal of deployment on industrial systems.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
On Artin groups admitting retractions to parabolic subgroups
Authors:
Bruno Aaron Cisneros de la Cruz,
María Cumplido,
Islam Foniqi
Abstract:
We generalize the retractions to standard parabolic subgroups for even Artin groups to FC-type Artin groups and other more general families. We prove that these retractions uniquely extend to any parabolic subgroup. We use retractions to generalize the results of Antolín and Foniqi that reduce the problem of intersection of parabolic subgroups to weaker conditions. As a corollary, we characterize…
▽ More
We generalize the retractions to standard parabolic subgroups for even Artin groups to FC-type Artin groups and other more general families. We prove that these retractions uniquely extend to any parabolic subgroup. We use retractions to generalize the results of Antolín and Foniqi that reduce the problem of intersection of parabolic subgroups to weaker conditions. As a corollary, we characterize coherence for the FC case.
△ Less
Submitted 22 August, 2024;
originally announced August 2024.
-
Quantifying the Effectiveness of Student Organization Activities using Natural Language Processing
Authors:
Lyberius Ennio F. Taruc,
Arvin R. De La Cruz
Abstract:
Student extracurricular activities play an important role in enriching the students' educational experiences. With the increasing popularity of Machine Learning and Natural Language Processing, it becomes a logical step that incorporating ML-NLP in improving extracurricular activities is a potential focus of study in Artificial Intelligence (AI). This research study aims to develop a machine learn…
▽ More
Student extracurricular activities play an important role in enriching the students' educational experiences. With the increasing popularity of Machine Learning and Natural Language Processing, it becomes a logical step that incorporating ML-NLP in improving extracurricular activities is a potential focus of study in Artificial Intelligence (AI). This research study aims to develop a machine learning workflow that will quantify the effectiveness of student-organized activities based on student emotional responses using sentiment analysis. The study uses the Bidirectional Encoder Representations from Transformers (BERT) Large Language Model (LLM) called via the pysentimiento toolkit, as a Transformer pipeline in Hugging Face. A sample data set from Organization C, a Recognized Student Organization (RSO) of a higher educational institute in the Philippines, College X, was used to develop the workflow. The workflow consisted of data preprocessing, key feature selection, LLM feature processing, and score aggregation, resulting in an Event Score for each data set. The results show that the BERT LLM can also be used effectively in analyzing sentiment beyond product reviews and post comments. For the student affairs offices of educational institutions, this study can provide a practical example of how NLP can be applied to real-world scenarios, showcasing the potential impact of data-driven decision making.
△ Less
Submitted 16 August, 2024;
originally announced August 2024.
-
Narrowband-IoT (NB-IoT) and IoT Use Cases in Universities, Campuses, and Educational Institutions: A Research Analysis
Authors:
Lyberius Ennio F. Taruc,
Arvin R. De La Cruz
Abstract:
The main objective of this research paper is to analyze the available use cases of Narrowband-IoT and IoT in universities, campuses, and educational institutions. A literature review was conducted using multiple databases such as IEEE Xplore, ACM Digital Library, and Scopus. The study explores the benefits of IoT adoption in higher education. Various use cases of NB-IoT in educational institutions…
▽ More
The main objective of this research paper is to analyze the available use cases of Narrowband-IoT and IoT in universities, campuses, and educational institutions. A literature review was conducted using multiple databases such as IEEE Xplore, ACM Digital Library, and Scopus. The study explores the benefits of IoT adoption in higher education. Various use cases of NB-IoT in educational institutions were analyzed, including smart campus management, asset tracking, monitoring, and safety and security systems. Of the six use cases assessed, three focused on the deployment of IoT Things, while three focused on NB-IoT Connectivity. The research paper concludes that NB-IoT technology has significant potential to enhance various aspects of educational institutions, from smart campus management to improving safety and security systems. The study recommends further exploration and implementation of NB-IoT technology in educational settings to improve efficiency, security, and overall campus management. The research highlights the potential applications of NB-IoT in universities and educational institutions, paving the way for future studies in this area. The social implications of this research could involve enhancing the overall learning experience for students, improving campus safety, and promoting technological advancements in educational settings.
Keywords: narrowband-IoT, Internet-of-Things, smart campus, smart institutions
△ Less
Submitted 6 August, 2024;
originally announced August 2024.
-
The First Billion Years in Seconds: An Effective Model for the 21-cm Signal with Population III Stars
Authors:
Hector Afonso G. Cruz,
Julian B. Munoz,
Nashwan Sabti,
Marc Kamionkowski
Abstract:
Observations of the 21-cm signal are opening a window to the cosmic-dawn epoch, when the first stars formed. These observations are usually interpreted with semi-numerical or hydrodynamical simulations, which are often computationally intensive and inflexible to changes in cosmological or astrophysical effects. Here, we present an effective, fully analytic model for the impact of the first stars o…
▽ More
Observations of the 21-cm signal are opening a window to the cosmic-dawn epoch, when the first stars formed. These observations are usually interpreted with semi-numerical or hydrodynamical simulations, which are often computationally intensive and inflexible to changes in cosmological or astrophysical effects. Here, we present an effective, fully analytic model for the impact of the first stars on the 21-cm signal, using the modular code Zeus21. Zeus21 employs an analytic prescription of the star formation rate density (SFRD) to recover the fully nonlinear and nonlocal correlations of radiative fields that determine the 21-cm signal. We introduce the earliest Population III (Pop III) stars residing in low-mass molecular-cooling galaxies in Zeus21, with distinct spectra from later Pop II stars. We also self-consistently model feedback in the form of $H_2$-dissociating Lyman-Werner (LW) radiation, as well as dark matter-baryon relative velocities, both of which suppress star formation in the lowest-mass halos. LW feedback produces a scale-dependence on the SFRD fluctuations, due to the long mean free path of LW photons. Relative velocities give rise to "wiggles" in the spatial distribution of the 21-cm signal; we present an improved calculation of the shape of these velocity-induced acoustic oscillations, showing they remain a standard ruler at cosmic dawn. Our improved version of Zeus21 predicts the 21-cm global signal and power spectra in agreement with simulations at the $\sim 10\%$ level, yet is at least three orders of magnitude faster. This public code represents a step towards efficient and flexible parameter inference at cosmic dawn, allowing us to predict the first billion years of the universe in mere seconds.
△ Less
Submitted 25 July, 2024;
originally announced July 2024.
-
Data Poisoning: An Overlooked Threat to Power Grid Resilience
Authors:
Nora Agah,
Javad Mohammadi,
Alex Aved,
David Ferris,
Erika Ardiles Cruz,
Philip Morrone
Abstract:
As the complexities of Dynamic Data Driven Applications Systems increase, preserving their resilience becomes more challenging. For instance, maintaining power grid resilience is becoming increasingly complicated due to the growing number of stochastic variables (such as renewable outputs) and extreme weather events that add uncertainty to the grid. Current optimization methods have struggled to a…
▽ More
As the complexities of Dynamic Data Driven Applications Systems increase, preserving their resilience becomes more challenging. For instance, maintaining power grid resilience is becoming increasingly complicated due to the growing number of stochastic variables (such as renewable outputs) and extreme weather events that add uncertainty to the grid. Current optimization methods have struggled to accommodate this rise in complexity. This has fueled the growing interest in data-driven methods used to operate the grid, leading to more vulnerability to cyberattacks. One such disruption that is commonly discussed is the adversarial disruption, where the intruder attempts to add a small perturbation to input data in order to "manipulate" the system operation. During the last few years, work on adversarial training and disruptions on the power system has gained popularity. In this paper, we will first review these applications, specifically on the most common types of adversarial disruptions: evasion and poisoning disruptions. Through this review, we highlight the gap between poisoning and evasion research when applied to the power grid. This is due to the underlying assumption that model training is secure, leading to evasion disruptions being the primary type of studied disruption. Finally, we will examine the impacts of data poisoning interventions and showcase how they can endanger power grid resilience.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
Evaluating language models as risk scores
Authors:
André F. Cruz,
Moritz Hardt,
Celestine Mendler-Dünner
Abstract:
Current question-answering benchmarks predominantly focus on accuracy in realizable prediction tasks. Conditioned on a question and answer-key, does the most likely token match the ground truth? Such benchmarks necessarily fail to evaluate LLMs' ability to quantify ground-truth outcome uncertainty. In this work, we focus on the use of LLMs as risk scores for unrealizable prediction tasks. We intro…
▽ More
Current question-answering benchmarks predominantly focus on accuracy in realizable prediction tasks. Conditioned on a question and answer-key, does the most likely token match the ground truth? Such benchmarks necessarily fail to evaluate LLMs' ability to quantify ground-truth outcome uncertainty. In this work, we focus on the use of LLMs as risk scores for unrealizable prediction tasks. We introduce folktexts, a software package to systematically generate risk scores using LLMs, and evaluate them against US Census data products. A flexible API enables the use of different prompting schemes, local or web-hosted models, and diverse census columns that can be used to compose custom prediction tasks. We evaluate 17 recent LLMs across five proposed benchmark tasks. We find that zero-shot risk scores produced by multiple-choice question-answering have high predictive signal but are widely miscalibrated. Base models consistently overestimate outcome uncertainty, while instruction-tuned models underestimate uncertainty and produce over-confident risk scores. In fact, instruction-tuning polarizes answer distribution regardless of true underlying data uncertainty. This reveals a general inability of instruction-tuned LLMs to express data uncertainty using multiple-choice answers. A separate experiment using verbalized chat-style risk queries yields substantially improved calibration across instruction-tuned models. These differences in ability to quantify data uncertainty cannot be revealed in realizable settings, and highlight a blind-spot in the current evaluation ecosystem that folktexts covers.
△ Less
Submitted 23 September, 2024; v1 submitted 19 July, 2024;
originally announced July 2024.
-
Fast Bayesian Basis Selection for Functional Data Representation with Correlated Errors
Authors:
Ana Carolina da Cruz,
Camila P. E. de Souza,
Pedro H. T. O. Sousa
Abstract:
Functional data analysis finds widespread application across various fields. While functional data are intrinsically infinite-dimensional, in practice, they are observed only at a finite set of points, typically over a dense grid. As a result, smoothing techniques are often used to approximate the observed data as functions. In this work, we propose a novel Bayesian approach for selecting basis fu…
▽ More
Functional data analysis finds widespread application across various fields. While functional data are intrinsically infinite-dimensional, in practice, they are observed only at a finite set of points, typically over a dense grid. As a result, smoothing techniques are often used to approximate the observed data as functions. In this work, we propose a novel Bayesian approach for selecting basis functions for smoothing one or multiple curves simultaneously. Our method differentiates from other Bayesian approaches in two key ways: (i) by accounting for correlated errors and (ii) by developing a variational EM algorithm, which is faster than MCMC methods such as Gibbs sampling. Simulation studies demonstrate that our method effectively identifies the true underlying structure of the data across various scenarios and it is applicable to different types of functional data. Our variational EM algorithm not only recovers the basis coefficients and the correct set of basis functions but also estimates the existing within-curve correlation. When applied to the motorcycle and temperature datasets, our method demonstrates comparable, and in some cases superior, performance in terms of adjusted $R^2$ compared to regression splines, smoothing splines, Bayesian LASSO and LASSO. Our proposed method is implemented in R and codes are available at https://github.com/acarolcruz/VB-Bases-Selection.
△ Less
Submitted 8 October, 2024; v1 submitted 31 May, 2024;
originally announced May 2024.
-
Multiple sampling and interpolation in a space of polynomials
Authors:
Carlos A. Cruz,
Xavier Massaneda,
Joaquim Ortega-Cerdà
Abstract:
We study sampling and interpolation arrays with multiplicities for the spaces P_k of holomorphic polynomials of degree at most k. We find that the geometric conditions satisfied by these arrays are in accordance with the conditions satisfied by the sampling and interpolating sequences with unbounded multiplicities in the Fock space, which can be seen as a limiting case of the space P_k as k tends…
▽ More
We study sampling and interpolation arrays with multiplicities for the spaces P_k of holomorphic polynomials of degree at most k. We find that the geometric conditions satisfied by these arrays are in accordance with the conditions satisfied by the sampling and interpolating sequences with unbounded multiplicities in the Fock space, which can be seen as a limiting case of the space P_k as k tends to infinity.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Understanding crypter-as-a-service in a popular underground marketplace
Authors:
Alejandro de la Cruz,
Sergio Pastrana
Abstract:
Crypters are pieces of software whose main goal is to transform a target binary so it can avoid detection from Anti Viruses (AVs from now on) applications. They work similar to packers, by taking a malware binary and applying a series of modifications, obfuscations and encryptions to output a binary that evades one or more AVs. The goal is to remain fully undetected, or FUD in the hacking jargon,…
▽ More
Crypters are pieces of software whose main goal is to transform a target binary so it can avoid detection from Anti Viruses (AVs from now on) applications. They work similar to packers, by taking a malware binary and applying a series of modifications, obfuscations and encryptions to output a binary that evades one or more AVs. The goal is to remain fully undetected, or FUD in the hacking jargon, while maintaining its (often malicious) functionality. In line to the growth of commoditization in cybercrime, the crypter-as-a-service model has gained popularity, in response to the increased sophistication of detection mechanisms. In this business model, customers receive an initial crypter which is soon updated once becomes detected by anti-viruses. This paper provides the first study on an online underground market dedicated to crypter-as-a-service. We compare the most relevant products in sale, analyzing the existent social network on the platform and comparing the different features that they provide. We also conduct an experiment as a case study, to validate the usage of one of the most popular crypters sold in the market, and compare the results before and after crypting binaries (both benign and malware), to show its effectiveness when evading antivirus engines.
△ Less
Submitted 6 June, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
Introducing the DREAMS Project: DaRk mattEr and Astrophysics with Machine learning and Simulations
Authors:
Jonah C. Rose,
Paul Torrey,
Francisco Villaescusa-Navarro,
Mariangela Lisanti,
Tri Nguyen,
Sandip Roy,
Kassidy E. Kollmann,
Mark Vogelsberger,
Francis-Yan Cyr-Racine,
Mikhail V. Medvedev,
Shy Genel,
Daniel Anglés-Alcázar,
Nitya Kallivayalil,
Bonny Y. Wang,
Belén Costanza,
Stephanie O'Neil,
Cian Roche,
Soumyodipta Karmakar,
Alex M. Garcia,
Ryan Low,
Shurui Lin,
Olivia Mostow,
Akaxia Cruz,
Andrea Caputo,
Arya Farahi
, et al. (5 additional authors not shown)
Abstract:
We introduce the DREAMS project, an innovative approach to understanding the astrophysical implications of alternative dark matter models and their effects on galaxy formation and evolution. The DREAMS project will ultimately comprise thousands of cosmological hydrodynamic simulations that simultaneously vary over dark matter physics, astrophysics, and cosmology in modeling a range of systems -- f…
▽ More
We introduce the DREAMS project, an innovative approach to understanding the astrophysical implications of alternative dark matter models and their effects on galaxy formation and evolution. The DREAMS project will ultimately comprise thousands of cosmological hydrodynamic simulations that simultaneously vary over dark matter physics, astrophysics, and cosmology in modeling a range of systems -- from galaxy clusters to ultra-faint satellites. Such extensive simulation suites can provide adequate training sets for machine-learning-based analyses. This paper introduces two new cosmological hydrodynamical suites of Warm Dark Matter, each comprised of 1024 simulations generated using the Arepo code. One suite consists of uniform-box simulations covering a $(25~h^{-1}~{\rm M}_\odot)^3$ volume, while the other consists of Milky Way zoom-ins with sufficient resolution to capture the properties of classical satellites. For each simulation, the Warm Dark Matter particle mass is varied along with the initial density field and several parameters controlling the strength of baryonic feedback within the IllustrisTNG model. We provide two examples, separately utilizing emulators and Convolutional Neural Networks, to demonstrate how such simulation suites can be used to disentangle the effects of dark matter and baryonic physics on galactic properties. The DREAMS project can be extended further to include different dark matter models, galaxy formation physics, and astrophysical targets. In this way, it will provide an unparalleled opportunity to characterize uncertainties on predictions for small-scale observables, leading to robust predictions for testing the particle physics nature of dark matter on these scales.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Fine-Tuning Pre-trained Language Models to Detect In-Game Trash Talks
Authors:
Daniel Fesalbon,
Arvin De La Cruz,
Marvin Mallari,
Nelson Rodelas
Abstract:
Common problems in playing online mobile and computer games were related to toxic behavior and abusive communication among players. Based on different reports and studies, the study also discusses the impact of online hate speech and toxicity on players' in-game performance and overall well-being. This study investigates the capability of pre-trained language models to classify or detect trash tal…
▽ More
Common problems in playing online mobile and computer games were related to toxic behavior and abusive communication among players. Based on different reports and studies, the study also discusses the impact of online hate speech and toxicity on players' in-game performance and overall well-being. This study investigates the capability of pre-trained language models to classify or detect trash talk or toxic in-game messages The study employs and evaluates the performance of pre-trained BERT and GPT language models in detecting toxicity within in-game chats. Using publicly available APIs, in-game chat data from DOTA 2 game matches were collected, processed, reviewed, and labeled as non-toxic, mild (toxicity), and toxic. The study was able to collect around two thousand in-game chats to train and test BERT (Base-uncased), BERT (Large-uncased), and GPT-3 models. Based on the three models' state-of-the-art performance, this study concludes pre-trained language models' promising potential for addressing online hate speech and in-game insulting trash talk.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Results of the follow-up of ANTARES neutrino alerts
Authors:
A. Albert,
S. Alves,
M. André,
M. Ardid,
S. Ardid,
J. -J. Aubert,
J. Aublin,
B. Baret,
S. Basa,
Y. Becherini,
B. Belhorma,
M. Bendahman,
F. Benfenati,
V. Bertin,
S. Biagi,
M. Bissinger,
J. Boumaaza,
M. Bouta,
M. C. Bouwhuis,
H. Brânzas,
R. Bruijn,
J. Brunner,
J. Busto,
B. Caiffi,
D. Calvo
, et al. (166 additional authors not shown)
Abstract:
High-energy neutrinos could be produced in the interaction of charged cosmic rays with matter or radiation surrounding astrophysical sources. To look for transient sources associated with neutrino emission, a follow-up program of neutrino alerts has been operating within the ANTARES Collaboration since 2009. This program, named TAToO, has triggered robotic optical telescopes (MASTER, TAROT, ROTSE…
▽ More
High-energy neutrinos could be produced in the interaction of charged cosmic rays with matter or radiation surrounding astrophysical sources. To look for transient sources associated with neutrino emission, a follow-up program of neutrino alerts has been operating within the ANTARES Collaboration since 2009. This program, named TAToO, has triggered robotic optical telescopes (MASTER, TAROT, ROTSE and the SVOM ground based telescopes) immediately after the detection of any relevant neutrino candidate and scheduled several observations in the weeks following the detection. A subset of ANTARES events with highest probabilities of being of cosmic origin has also been followed by the Swift and the INTEGRAL satellites, the Murchison Widefield Array radio telescope and the H.E.S.S. high-energy gamma-ray telescope. The results of twelve years of observations are reported. No optical counterpart has been significantly associated with an ANTARES candidate neutrino signal during image analysis. Constraints on transient neutrino emission have been set. In September 2015, ANTARES issued a neutrino alert and during the follow-up, a potential transient counterpart was identified by Swift and MASTER. A multi-wavelength follow-up campaign has allowed to identify the nature of this source and has proven its fortuitous association with the neutrino. The return of experience is particularly important for the design of the alert system of KM3NeT, the next generation neutrino telescope in the Mediterranean Sea.
△ Less
Submitted 26 February, 2024;
originally announced February 2024.
-
Estimability conditions for complex carryover effects in crossover designs
Authors:
N. A. Cruz,
O. O. Melo,
C. A. Martinez
Abstract:
It has been argued for many years that models used to analyze data from crossover designs are not appropriate when simple carryover effects are assumed. Furthermore, a statistical model that could estimate complex carry-over effects in crossover designs had never been found. However, in this paper, the estimability conditions of the complex carryover effects and a theoretical result that supports…
▽ More
It has been argued for many years that models used to analyze data from crossover designs are not appropriate when simple carryover effects are assumed. Furthermore, a statistical model that could estimate complex carry-over effects in crossover designs had never been found. However, in this paper, the estimability conditions of the complex carryover effects and a theoretical result that supports them are found. In addition, a simulation example is developed in a non-linear dose-response test for a typical AB/BA crossover design with repeated measures. This simulation shows that a semiparametric model can detect complex carryover effects and that this estimation improves the precision of the estimators of the treatment effect. It is concluded that when there are at least five replicates in each observation period per individual, semiparametric statistical models provide a good estimator of the treatment effect and reduce bias with respect to models that assume the absence of carryover effects or simplex carryover effects. Furthermore, an application of the methodology is shown and the wealth of analysis gained by estimating complex carryover effects is evident.
△ Less
Submitted 11 September, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Filipino Use of Designer and Luxury Perfumes: A Pilot Study of Consumer Behavior
Authors:
John Paul P. Miranda,
Maria Anna D. Cruz,
Dina D. Gonzales,
Ma. Rebecca G. Del Rosario,
Aira May B. Canlas,
Joseph Alexander Bansil
Abstract:
This study investigates the usage patterns and purposes of designer perfumes among Filipino consumers, employing purposive and snowball sampling methods as non-probability sampling techniques. Data was collected using Google Forms, and the majority of respondents purchased full bottles of designer perfumes from retailers, wholesalers, and physical stores, with occasional "blind purchases." Daily u…
▽ More
This study investigates the usage patterns and purposes of designer perfumes among Filipino consumers, employing purposive and snowball sampling methods as non-probability sampling techniques. Data was collected using Google Forms, and the majority of respondents purchased full bottles of designer perfumes from retailers, wholesalers, and physical stores, with occasional "blind purchases." Daily usage was common, with respondents applying an average of 5.88 sprays in the morning, favoring fresh scent notes and Eau De Parfum concentration. They tended to alternate perfumes daily, selecting different scent profiles according to the Philippine climate. The study reveals that Filipino respondents primarily use designer perfumes to achieve a pleasant and fresh fragrance. Additionally, these perfumes play a role in boosting self-esteem, elevating mood, and enhancing personal presentation. Some respondents reported fewer common applications, such as using perfume to address insomnia and migraines. Overall, the research highlights the significant role of perfume in the grooming routine of Filipino consumers. This study represents the first attempt to comprehend perfume usage patterns and purposes specifically within the Filipino context. Consequently, its findings are invaluable for manufacturers and marketers targeting the Filipino market, providing insights into consumer preferences and motivations.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
Towards a zk-SNARK compiler for Wolfram language
Authors:
Armando Cruz
Abstract:
Zero-knowledge proofs (zk-Proofs) are communication protocols by which a prover can demonstrate to a verifier that it possesses a solution to a given public problem without revealing the content of the solution. Arbitrary computations can be transformed into an interactive zk-Proof so anyone is convinced that it was executed correctly without knowing what was executed on, having huge implications…
▽ More
Zero-knowledge proofs (zk-Proofs) are communication protocols by which a prover can demonstrate to a verifier that it possesses a solution to a given public problem without revealing the content of the solution. Arbitrary computations can be transformed into an interactive zk-Proof so anyone is convinced that it was executed correctly without knowing what was executed on, having huge implications for digital currency. Despite this, interactive proofs are not suited for blockchain applications but novel protocols such as zk-SNARKs have made zero-knowledge ledgers like Zcash possible. This project builds upon Wolfram's ZeroKnowledgeProofs paclet and implements a zk-SNARK compiler based on Pinocchio protocol.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
On some discrete statistics of parking functions
Authors:
Ari Cruz,
Pamela E. Harris,
Kimberly J. Harry,
Jan Kretschmann,
Matt McClinton,
Alex Moon,
John O. Museus,
Eric Redmon
Abstract:
Recall that $α=(a_1,a_2,\ldots,a_n)\in[n]^n$ is a parking function if its nondecreasing rearrangement $β=(b_1,b_2,\ldots,b_n)$ satisfies $b_i\leq i$ for all $1\leq i\leq n$. In this article, we study parking functions based on their ascents (indices at which $a_i<a_{i+1}$), descents (indices at which $a_i>a_{i+1}$), and ties (indices at which $a_i=a_{i+1}$). By utilizing multiset Eulerian polynomi…
▽ More
Recall that $α=(a_1,a_2,\ldots,a_n)\in[n]^n$ is a parking function if its nondecreasing rearrangement $β=(b_1,b_2,\ldots,b_n)$ satisfies $b_i\leq i$ for all $1\leq i\leq n$. In this article, we study parking functions based on their ascents (indices at which $a_i<a_{i+1}$), descents (indices at which $a_i>a_{i+1}$), and ties (indices at which $a_i=a_{i+1}$). By utilizing multiset Eulerian polynomials, we give a generating function for the number of parking functions of length $n$ with $i$ descents. We present a recursive formula for the number of parking functions of length $n$ with descents at a specified subset of $[n-1]$. We establish that the number of parking functions of length $n$ with descents at $I\subset[n-1]$ and descents at $J=\{n-i:i\in I\}$ are equinumerous. As a special case, we show that the number of parking functions of length $n$ with descents at the first $k$ indices is given by $f(n, n-k-1)=\frac{1}{n}\binom{n}{k}\binom{2n-k}{n-k-1}$. We prove this by bijecting to the set of standard Young tableaux of shape $((n-k)^2,1^k)$, which are enumerated by $f(n,n-k-1)$. We also study peaks of parking functions, which are indices at which $a_{i-1}<a_i>a_{i+1}$. We show that the set of parking functions with no peaks and no ties is enumerated by the Catalan numbers. We conclude our study by characterizing when a parking function is uniquely determined by their statistic encoding; a word indicating what indices in the parking function are ascents, descents, and ties. We provide open problems throughout.
△ Less
Submitted 24 May, 2024; v1 submitted 27 December, 2023;
originally announced December 2023.
-
Variational Autoencoders for Noise Reduction in Industrial LLRF Systems
Authors:
J. P. Edelen,
M. J. Henderson,
J. Einstein-Curtis,
C. C. Hall,
J. A. Diaz Cruz,
A. L. Edelen
Abstract:
Industrial particle accelerators inherently operate in much dirtier environments than typical research accelerators. This leads to an increase in noise both in the RF system and in other electronic systems. Combined with the fact that industrial accelerators are mass produced, there is less attention given to optimizing the performance of an individual system. As a result, industrial systems tend…
▽ More
Industrial particle accelerators inherently operate in much dirtier environments than typical research accelerators. This leads to an increase in noise both in the RF system and in other electronic systems. Combined with the fact that industrial accelerators are mass produced, there is less attention given to optimizing the performance of an individual system. As a result, industrial systems tend to under perform considering their hardware hardware capabilities. With the growing demand for accelerators for medical sterilization, food irradiation, cancer treatment, and imaging, improving the signal processing of these machines will increase the margin for the deployment of these systems. Our work is focusing on using machine learning techniques to reduce the noise of RF signals used for pulse-to-pulse feedback in industrial accelerators. We will review our algorithms, simulation results, and results working with measured data. We will then discuss next steps for deployment and testing on an industrial system.
△ Less
Submitted 7 November, 2023; v1 submitted 29 October, 2023;
originally announced November 2023.
-
Electric Vehicle Aggregation Review: Benefits and Vulnerabilities of Managing a Growing EV Fleet
Authors:
Kelsey Nelson,
Javad Mohammadi,
Yu Chen,
Erik Blasch,
Alex Aved,
David Ferris,
Erika Ardiles Cruz,
Philip Morrone
Abstract:
Electric vehicles (EVs) are becoming more popular within the United States, making up an increasingly large portion of the US's electricity consumption. Hence, there is much attention has been directed on how to manage EVs within the power sector. A well-investigated strategy for managing the increase in electricity demand from EV charging is aggregation, which allows for an intermediary to manage…
▽ More
Electric vehicles (EVs) are becoming more popular within the United States, making up an increasingly large portion of the US's electricity consumption. Hence, there is much attention has been directed on how to manage EVs within the power sector. A well-investigated strategy for managing the increase in electricity demand from EV charging is aggregation, which allows for an intermediary to manage electricity flow between EV owners and their utilities. When implemented effectively, EV aggregation provides key benefits to power grids by relieving electrical loads.. These benefits are aggregation's ability to shift EV loads to peak shave, which often leads to lower emissions, electricity generation prices, and consumer costs depending on the penetration levels of non-dispatchable electricity sources. This review seeks to appropriately highlight the broad vulnerabilities of EV aggregation alongside its benefits, namely those regarding battery degradation, rebound peaks, and cybersecurity. The holistic overview of EV aggregation provides comparisons that balance expectations with realistic performance.
△ Less
Submitted 27 October, 2023; v1 submitted 25 October, 2023;
originally announced October 2023.
-
On the Computational Complexities of Complex-valued Neural Networks
Authors:
Kayol Soares Mayer,
Jonathan Aguiar Soares,
Ariadne Arrais Cruz,
Dalton Soares Arantes
Abstract:
Complex-valued neural networks (CVNNs) are nonlinear filters used in the digital signal processing of complex-domain data. Compared with real-valued neural networks~(RVNNs), CVNNs can directly handle complex-valued input and output signals due to their complex domain parameters and activation functions. With the trend toward low-power systems, computational complexity analysis has become essential…
▽ More
Complex-valued neural networks (CVNNs) are nonlinear filters used in the digital signal processing of complex-domain data. Compared with real-valued neural networks~(RVNNs), CVNNs can directly handle complex-valued input and output signals due to their complex domain parameters and activation functions. With the trend toward low-power systems, computational complexity analysis has become essential for measuring an algorithm's power consumption. Therefore, this paper presents both the quantitative and asymptotic computational complexities of CVNNs. This is a crucial tool in deciding which algorithm to implement. The mathematical operations are described in terms of the number of real-valued multiplications, as these are the most demanding operations. To determine which CVNN can be implemented in a low-power system, quantitative computational complexities can be used to accurately estimate the number of floating-point operations. We have also investigated the computational complexities of CVNNs discussed in some studies presented in the literature.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Fine structure splitting cancellation in highly asymmetric InAs/InP droplet epitaxy quantum dots
Authors:
N. R. S. van Venrooij,
A. R. da Cruz,
R. S. R. Gajjella,
P. M. Koenraad,
Craig E. Pryor,
Michael E. Flatté
Abstract:
We find the single exciton's fine structure splitting (FSS), which splits its degenerate ground state manifold into singlets, nearly vanishes in highly asymmetric quantum dots due to the cancellation of splitting effects with markedly different origin. The dots simulated are those that emerge on top of etch pits through the droplet epitaxy growth process; these etch pit dots break square (…
▽ More
We find the single exciton's fine structure splitting (FSS), which splits its degenerate ground state manifold into singlets, nearly vanishes in highly asymmetric quantum dots due to the cancellation of splitting effects with markedly different origin. The dots simulated are those that emerge on top of etch pits through the droplet epitaxy growth process; these etch pit dots break square ($C_{4v}$) spatial symmetry, which has been previously associated with small FSS. Configuration interaction calculations predict a vanishing FSS at a specific finite etch pit displacement from the center of the dot, for a structure far from square symmetry. We thus predict that highly asymmetric quantum dots may still display negligible fine structure splitting, providing new avenues for high-fidelity generation of indistinguishable, polarization entangled photon pairs on demand.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
Selection of powerful radio galaxies with machine learning
Authors:
R. Carvajal,
I. Matute,
J. Afonso,
R. P. Norris,
K. J. Luken,
P. Sánchez-Sáez,
P. A. C. Cunha,
A. Humphrey,
H. Messias,
S. Amarantidis,
D. Barbosa,
H. A. Cruz,
H. Miranda,
A. Paulino-Afonso,
C. Pappalardo
Abstract:
We developed and trained a pipeline of three machine learning (ML) models than can predict which sources are more likely to be an AGN and to be detected in specific radio surveys. Also, it can estimate redshift values for predicted radio-detectable AGNs. These models, which combine predictions from tree-based and gradient-boosting algorithms, have been trained with multi-wavelength data from near-…
▽ More
We developed and trained a pipeline of three machine learning (ML) models than can predict which sources are more likely to be an AGN and to be detected in specific radio surveys. Also, it can estimate redshift values for predicted radio-detectable AGNs. These models, which combine predictions from tree-based and gradient-boosting algorithms, have been trained with multi-wavelength data from near-infrared-selected sources in the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) Spring field. Training, testing, calibration, and validation were carried out in the HETDEX field. Further validation was performed on near-infrared-selected sources in the Stripe 82 field. In the HETDEX validation subset, our pipeline recovers 96% of the initially labelled AGNs and, from AGNs candidates, we recover 50% of previously detected radio sources. For Stripe 82, these numbers are 94% and 55%. Compared to random selection, these rates are two and four times better for HETDEX, and 1.2 and 12 times better for Stripe 82. The pipeline can also recover the redshift distribution of these sources with $σ_{\mathrm{NMAD}}$ = 0.07 for HETDEX ($σ_{\mathrm{NMAD}}$ = 0.09 for Stripe 82) and an outlier fraction of 19% (25% for Stripe 82), compatible with previous results based on broad-band photometry. Feature importance analysis stresses the relevance of near- and mid-infrared colours to select AGNs and identify their radio and redshift nature. Combining different algorithms in ML models shows an improvement in the prediction power of our pipeline over a random selection of sources. Tree-based ML models (in contrast to deep learning techniques) facilitate the analysis of the impact that features have on the predictions. This prediction can give insight into the potential physical interplay between the properties of radio AGNs (e.g. mass of black hole and accretion rate).
△ Less
Submitted 1 December, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Searches for neutrinos in the direction of radio-bright blazars with the ANTARES telescope
Authors:
ANTARES Collaboration,
A. Albert,
S. Alves,
M. André,
M. Ardid,
S. Ardid,
J. J. Aubert,
J Aublin,
B. Baret,
S. Basa,
Y. Becherini,
B. Belhorma,
M. Bendahman,
F. Benfenati,
V. Bertin,
S. Biagi,
M. Bissinger,
J. Boumaaza,
M. Bouta,
M. C. Bouwhuis,
H. Brânzaş,
R. Bruijn,
J. Brunner,
J. Busto,
B. Caiffi
, et al. (140 additional authors not shown)
Abstract:
Active galaxies, especially blazars, are among the most promising neutrino source candidates. To date, ANTARES searches for these objects considered GeV-TeV $γ$-ray bright blazars. Here, a statistically complete radio-bright blazar sample is used as the target for searches of origins of neutrinos collected by the ANTARES neutrino telescope over 13 years of operation. The hypothesis of a neutrino-b…
▽ More
Active galaxies, especially blazars, are among the most promising neutrino source candidates. To date, ANTARES searches for these objects considered GeV-TeV $γ$-ray bright blazars. Here, a statistically complete radio-bright blazar sample is used as the target for searches of origins of neutrinos collected by the ANTARES neutrino telescope over 13 years of operation. The hypothesis of a neutrino-blazar directional correlation is tested by pair counting and by a complementary likelihood-based approach. The resulting post-trial $p$-value is $3.0\%$ ($2.2σ$ in the two-sided convention), possibly indicating a correlation. Additionally, a time-dependent analysis is performed to search for temporal clustering of neutrino candidates as a mean of detecting neutrino flares in blazars. None of the investigated sources alone reaches a significant flare detection level. However, the presence of 18 sources with a pre-trial significance above $3σ$ indicates a $p=1.4\%$ ($2.5σ$ in the two-sided convention) detection of a time-variable neutrino flux. An \textit{a posteriori} investigation reveals an intriguing temporal coincidence of neutrino, radio, and $γ$-ray flares of the J0242+1101 blazar at a $p=0.5\%$ ($2.9σ$ in the two-sided convention) level. Altogether, the results presented here suggest a possible connection of neutrino candidates detected by the ANTARES telescope with radio-bright blazars.
△ Less
Submitted 13 September, 2023;
originally announced September 2023.
-
Optical microcavities as platforms for entangled photon spectroscopy
Authors:
Ravyn Malatesta,
Lorenzo Uboldi,
Evan J. Kumar,
Esteban Rojas-Gatjens,
Luca Moretti,
Andy Cruz,
Vinod Menon,
Giulio Cerullo,
Ajay Ram Srimath Kandada
Abstract:
Optical microcavities are often proposed as platforms for spectroscopy in the single- and few-photon regime due to strong light-matter coupling. For classical-light spectroscopies, an empty microcavity simply acts as an optical filter. However, we find that in the single- or few-photon regime treating the empty microcavity as an optical filter does not capture the full effect on the quantum state…
▽ More
Optical microcavities are often proposed as platforms for spectroscopy in the single- and few-photon regime due to strong light-matter coupling. For classical-light spectroscopies, an empty microcavity simply acts as an optical filter. However, we find that in the single- or few-photon regime treating the empty microcavity as an optical filter does not capture the full effect on the quantum state of the transmitted photons. Focusing on the case of entangled photon-pair spectroscopy, we consider how the propagation of one photon through an optical microcavity changes the joint spectrum of a frequency-entangled photon pair. Using the input-output treatment of a Dicke model, we find that propagation through a strongly coupled microcavity above a certain coupling threshold enhances the entanglement entropy between the signal and idler photons. These results show that optical microcavities are not neutral platforms for quantum-light spectroscopies and their effects must be carefully considered when using change in entanglement entropy as an observable.
△ Less
Submitted 9 September, 2023;
originally announced September 2023.
-
Prospects for combined analyses of hadronic emission from $γ$-ray sources in the Milky Way with CTA and KM3NeT
Authors:
T. Unbehaun,
L. Mohrmann,
S. Funk,
S. Aiello,
A. Albert,
S. Alves Garre,
Z. Aly,
A. Ambrosone,
F. Ameli,
M. Andre,
E. Androutsou,
M. Anghinolfi,
M. Anguita,
L. Aphecetche,
M. Ardid,
S. Ardid,
H. Atmani,
J. Aublin,
C. Bagatelas,
L. Bailly-Salins,
Z. Bardačová,
B. Baret,
S. Basegmez du Pree,
Y. Becherini,
M. Bendahman
, et al. (249 additional authors not shown)
Abstract:
The Cherenkov Telescope Array and the KM3NeT neutrino telescopes are major upcoming facilities in the fields of $γ$-ray and neutrino astronomy, respectively. Possible simultaneous production of $γ$ rays and neutrinos in astrophysical accelerators of cosmic-ray nuclei motivates a combination of their data. We assess the potential of a combined analysis of CTA and KM3NeT data to determine the contri…
▽ More
The Cherenkov Telescope Array and the KM3NeT neutrino telescopes are major upcoming facilities in the fields of $γ$-ray and neutrino astronomy, respectively. Possible simultaneous production of $γ$ rays and neutrinos in astrophysical accelerators of cosmic-ray nuclei motivates a combination of their data. We assess the potential of a combined analysis of CTA and KM3NeT data to determine the contribution of hadronic emission processes in known Galactic $γ$-ray emitters, comparing this result to the cases of two separate analyses. In doing so, we demonstrate the capability of Gammapy, an open-source software package for the analysis of $γ$-ray data, to also process data from neutrino telescopes. For a selection of prototypical $γ$-ray sources within our Galaxy, we obtain models for primary proton and electron spectra in the hadronic and leptonic emission scenario, respectively, by fitting published $γ$-ray spectra. Using these models and instrument response functions for both detectors, we employ the Gammapy package to generate pseudo data sets, where we assume 200 hours of CTA observations and 10 years of KM3NeT detector operation. We then apply a three-dimensional binned likelihood analysis to these data sets, separately for each instrument and jointly for both. We find that the largest benefit of the combined analysis lies in the possibility of a consistent modelling of the $γ$-ray and neutrino emission. Assuming a purely leptonic scenario as input, we obtain, for the most favourable source, an average expected 68% credible interval that constrains the contribution of hadronic processes to the observed $γ$-ray emission to below 15%.
△ Less
Submitted 2 February, 2024; v1 submitted 6 September, 2023;
originally announced September 2023.
-
21-cm fluctuations from primordial magnetic fields
Authors:
Hector Afonso G. Cruz,
Tal Adi,
Jordan Flitter,
Marc Kamionkowski,
Ely D. Kovetz
Abstract:
The fluid forces associated with primordial magnetic fields (PMFs) generate small-scale fluctuations in the primordial density field, which add to the $\mathrm{ΛCDM}$ linear matter power spectrum on small scales. These enhanced small-scale fluctuations lead to earlier formation of galactic halos and stars and thus affect cosmic reionization. We study the consequences of these effects on 21 cm obse…
▽ More
The fluid forces associated with primordial magnetic fields (PMFs) generate small-scale fluctuations in the primordial density field, which add to the $\mathrm{ΛCDM}$ linear matter power spectrum on small scales. These enhanced small-scale fluctuations lead to earlier formation of galactic halos and stars and thus affect cosmic reionization. We study the consequences of these effects on 21 cm observables using the semi-numerical code 21cmFAST v3.1.3. We find the excess small-scale structure generates strong stellar radiation backgrounds in the early Universe, resulting in altered 21 cm global signals and power spectra commensurate with earlier reionization. We restrict the allowed PMF models using the CMB optical depth to reionization. Lastly, we probe parameter degeneracies and forecast experimental sensitivities with an information matrix analysis subject to the CMB optical depth bound. Our forecasts show that interferometers like HERA are sensitive to PMFs of order $\sim \mathrm{pG}$, nearly an order of magnitude stronger than existing and next-generation experiments.
△ Less
Submitted 29 March, 2024; v1 submitted 8 August, 2023;
originally announced August 2023.
-
Modeling the SED of the AGN inside NGC 4395
Authors:
Hector Afonso G. Cruz,
Andy D. Goulding,
Jenny E. Greene
Abstract:
We study the broad-band spectral energy distribution (SED) of the prototypical low-mass active galactic nucleus (AGN) in NGC 4395. We jointly model the optical through mid-infrared SED with a combination of galaxy and AGN light, and find that on arcsecond scales, the AGN dominates at most wavelengths. However, there is still some ambiguity about emission from the galaxy, owing partially to the str…
▽ More
We study the broad-band spectral energy distribution (SED) of the prototypical low-mass active galactic nucleus (AGN) in NGC 4395. We jointly model the optical through mid-infrared SED with a combination of galaxy and AGN light, and find that on arcsecond scales, the AGN dominates at most wavelengths. However, there is still some ambiguity about emission from the galaxy, owing partially to the strong short-term variability of the black hole. We investigate the use of smooth and clumpy-torus models in order to disentangle the nuclear infrared emission, as well as exploring the use of poloidal wind emission to account for the blue spectral slope observed in the near-IR. Even when simultaneously fitting the full optical-IR spectral range, we find that degeneracies still remain in the best-fit models. We conclude that high spatial resolution and wider wavelength coverage with the James Webb Space Telescope is needed to understand the mid-infrared emission in this complex highly-variable object, which is the best nearby example to provide a blueprint to finding other low-mass AGN via their mid-infrared emission in the future.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
Extending free actions of finite groups on unoriented surfaces
Authors:
Omar A. Cruz,
Gustavo Ortega,
Carlos Segovia
Abstract:
We present the unoriented version of the Schur and Bogomolov multiplier associated with a finite group $G$. We show that the unoriented Schur multiplier is isomorphic to the second cohomology group $H^2(G;\mathbb{Z}_2)$. We define the unoriented Bogomolov multiplier as the quotient of the unoriented Schur multiplier by the subgroup generated by classes over the disjoint union of tori, Klein bottle…
▽ More
We present the unoriented version of the Schur and Bogomolov multiplier associated with a finite group $G$. We show that the unoriented Schur multiplier is isomorphic to the second cohomology group $H^2(G;\mathbb{Z}_2)$. We define the unoriented Bogomolov multiplier as the quotient of the unoriented Schur multiplier by the subgroup generated by classes over the disjoint union of tori, Klein bottles, and projective spaces. Disregarding the subgroup generated by the trivial $G$-bundle over the projective space, we show that the unoriented Bogomolov multiplier is a complete obstruction. This means that a free surface action over an unoriented surface equivariantly bounds if and only if the class in the unoriented Bogomolov multiplier is trivial. We show that the unoriented Bogomolov multiplier is trivial for abelian, dihedral, and symmetric groups. Because the group cohomology $H^2(G;\mathbb{Z}_2)$ is trivial for a group of odd order, there are plenty of examples where the usual Bogomolov multiplier is not trivial while the unoriented Bogomolov multiplier is trivial. However, we show that there is a group of order $64$ where the unoriented Bogomolov multiplier is non-trivial.
△ Less
Submitted 13 September, 2024; v1 submitted 11 July, 2023;
originally announced July 2023.
-
Quantifying memory in spin glasses
Authors:
Janus Collaboration,
I. Paga,
J. He,
M. Baity-Jesi,
E. Calore,
A. Cruz,
L. A. Fernandez,
J. M. Gil-Narvion,
I. Gonzalez-Adalid Pemartin,
A. Gordillo-Guerrero,
D. Iñiguez,
A. Maiorano,
E. Marinari,
V. Martin-Mayor,
J. Moreno-Gordo,
A. Muñoz Sudupe,
D. Navarro,
R. L. Orbach,
G. Parisi,
S. Perez-Gaviro,
F. Ricci-Tersenghi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
D. L. Schlagel,
B. Seoane
, et al. (2 additional authors not shown)
Abstract:
Rejuvenation and memory, long considered the distinguishing features of spin glasses, have recently been proven to result from the growth of multiple length scales. This insight, enabled by simulations on the Janus~II supercomputer, has opened the door to a quantitative analysis. We combine numerical simulations with comparable experiments to introduce two coefficients that quantify memory. A thir…
▽ More
Rejuvenation and memory, long considered the distinguishing features of spin glasses, have recently been proven to result from the growth of multiple length scales. This insight, enabled by simulations on the Janus~II supercomputer, has opened the door to a quantitative analysis. We combine numerical simulations with comparable experiments to introduce two coefficients that quantify memory. A third coefficient has been recently presented by Freedberg et al. We show that these coefficients are physically equivalent by studying their temperature and waiting-time dependence.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
An optimization approach to study the phase changing behavior of multi-component mixtures
Authors:
Gustavo E. O. Celis,
Reza Arefidamghani,
Hamidreza Anbarlooei,
Daniel O. A. Cruz
Abstract:
The appropriate design, construction, and operation of carbon capture and storage (CCS) and enhanced oil recovery (EOR) processes require a deep understanding of the resulting phases behavior in hydrocarbons-CO_2 multi-component mixtures under reservoir conditions. To model this behavior a nonlinear system consists of the equation of states and some mixing rules (for each component) needed to be s…
▽ More
The appropriate design, construction, and operation of carbon capture and storage (CCS) and enhanced oil recovery (EOR) processes require a deep understanding of the resulting phases behavior in hydrocarbons-CO_2 multi-component mixtures under reservoir conditions. To model this behavior a nonlinear system consists of the equation of states and some mixing rules (for each component) needed to be solved simultaneously. The mixing usually requires to model the binary interaction between the components of the mixture. This work employs optimization techniques to enhance the predictions of such model by optimizing the binary interaction parameters. The results show that the optimized parameters, although obtained mathematically, are in physical ranges and can reproduce successfully the experimental observations, specially for the multi-component hydrocarbons systems containing Carbon dioxide at reservoir temperatures and pressures
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Unprocessing Seven Years of Algorithmic Fairness
Authors:
André F. Cruz,
Moritz Hardt
Abstract:
Seven years ago, researchers proposed a postprocessing method to equalize the error rates of a model across different demographic groups. The work launched hundreds of papers purporting to improve over the postprocessing baseline. We empirically evaluate these claims through thousands of model evaluations on several tabular datasets. We find that the fairness-accuracy Pareto frontier achieved by p…
▽ More
Seven years ago, researchers proposed a postprocessing method to equalize the error rates of a model across different demographic groups. The work launched hundreds of papers purporting to improve over the postprocessing baseline. We empirically evaluate these claims through thousands of model evaluations on several tabular datasets. We find that the fairness-accuracy Pareto frontier achieved by postprocessing contains all other methods we were feasibly able to evaluate. In doing so, we address two common methodological errors that have confounded previous observations. One relates to the comparison of methods with different unconstrained base models. The other concerns methods achieving different levels of constraint relaxation. At the heart of our study is a simple idea we call unprocessing that roughly corresponds to the inverse of postprocessing. Unprocessing allows for a direct comparison of methods using different underlying models and levels of relaxation.
△ Less
Submitted 15 March, 2024; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Multifractality in spin glasses
Authors:
Janus Collaboration,
M. Baity-Jesi,
E. Calore,
A. Cruz,
L. A. Fernandez,
J. M. Gil-Narvion,
I. Gonzalez-Adalid Pemartin,
A. Gordillo-Guerrero,
D. Iñiguez,
A. Maiorano,
E. Marinari,
V. Martin-Mayor,
J. Moreno-Gordo,
A. Muñoz Sudupe,
D. Navarro,
I. Paga,
G. Parisi,
S. Perez-Gaviro,
F. Ricci-Tersenghi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
B. Seoane,
A. Tarancon,
D. Yllanes
Abstract:
We unveil the multifractal behavior of Ising spin glasses in their low-temperature phase. Using the Janus II custom-built supercomputer, the spin-glass correlation function is studied locally. Dramatic fluctuations are found when pairs of sites at the same distance are compared. The scaling of these fluctuations, as the spin-glass coherence length grows with time, is characterized through the comp…
▽ More
We unveil the multifractal behavior of Ising spin glasses in their low-temperature phase. Using the Janus II custom-built supercomputer, the spin-glass correlation function is studied locally. Dramatic fluctuations are found when pairs of sites at the same distance are compared. The scaling of these fluctuations, as the spin-glass coherence length grows with time, is characterized through the computation of the singularity spectrum and its corresponding Legendre transform. A comparatively small number of site pairs controls the average correlation that governs the response to a magnetic field. We explain how this scenario of dramatic fluctuations (at length scales smaller than the coherence length) can be reconciled with the smooth, self-averaging behavior that has long been considered to describe spin-glass dynamics.
△ Less
Submitted 22 January, 2024; v1 submitted 7 June, 2023;
originally announced June 2023.
-
Gravitational waves during Higgs inflation from complex geometrical scalar-tensor theory of gravity
Authors:
José Edgar Madriz Aguilar,
A. Bernal,
F. Aceves de la Cruz,
J. A. Licea
Abstract:
In this paper we investigate tensor fluctuations of the metric at the end of a Higgs inflationary period in the context of a recently introduced complex geometrical scalar-tensor theory of gravity. In our model the Higgs field has a geometrical origin and the affine connection is determined by the Palatini's principle. Additionally, we consider an extra contribution to the tensor-fluctuations equa…
▽ More
In this paper we investigate tensor fluctuations of the metric at the end of a Higgs inflationary period in the context of a recently introduced complex geometrical scalar-tensor theory of gravity. In our model the Higgs field has a geometrical origin and the affine connection is determined by the Palatini's principle. Additionally, we consider an extra contribution to the tensor-fluctuations equation coming from the vacuum term in the energy momentum tensor associated to the Higgs field. The Higgs potential is rescaled by the non-canonicity function of the kinetic term of the field which is modified by the symmetry group of the background geometry. We obtain a nearly scale invariant spectrum and a scalar to tensor ratio in agreement with PLANCK 2018 cosmological results.
△ Less
Submitted 16 June, 2023; v1 submitted 5 June, 2023;
originally announced June 2023.
-
BayesCPclust: A Bayesian Approach for Clustering Constant-Wise Change-Point Data
Authors:
Ana Carolina da Cruz,
Camila P. E. de Souza
Abstract:
Change-point models deal with ordered data sequences. Their primary goal is to infer the locations where an aspect of the data sequence changes. In this paper, we propose and implement a nonparametric Bayesian model for clustering observations based on their constant-wise change-point profiles via Gibbs sampler. Our model incorporates a Dirichlet Process on the constant-wise change-point structure…
▽ More
Change-point models deal with ordered data sequences. Their primary goal is to infer the locations where an aspect of the data sequence changes. In this paper, we propose and implement a nonparametric Bayesian model for clustering observations based on their constant-wise change-point profiles via Gibbs sampler. Our model incorporates a Dirichlet Process on the constant-wise change-point structures to cluster observations while simultaneously performing multiple change-point estimation. Additionally, our approach controls the number of clusters in the model, not requiring the specification of the number of clusters \textit{a priori}. Satisfactory clustering and estimation results were obtained when evaluating our method under various simulated scenarios and on a real dataset from single-cell genomic sequencing. Our proposed methodology is implemented as an R package called BayesCPclust and is available at \texttt{https://github.com/acarolcruz/BayesCPclust}.
△ Less
Submitted 24 November, 2023; v1 submitted 28 May, 2023;
originally announced May 2023.
-
Constraining Primordial Magnetic Fields with Line-Intensity Mapping
Authors:
Tal Adi,
Sarah Libanore,
Hector Afonso G. Cruz,
Ely D. Kovetz
Abstract:
Primordial magnetic fields (PMFs) offer a compelling explanation for the origin of observed magnetic fields, especially on extragalactic scales. Such PMFs give rise to excess of power in small scale matter perturbations that could strongly influence structure formation. We study the impact of the magnetically enhanced matter power spectrum on the signal that will be observed by line-intensity mapp…
▽ More
Primordial magnetic fields (PMFs) offer a compelling explanation for the origin of observed magnetic fields, especially on extragalactic scales. Such PMFs give rise to excess of power in small scale matter perturbations that could strongly influence structure formation. We study the impact of the magnetically enhanced matter power spectrum on the signal that will be observed by line-intensity mapping (LIM) surveys targeting carbon monoxide (CO) emission from star-forming galaxies at high redshifts. Specifically, the voxel intensity distribution of intensity maps provides access to small-scale information, which makes it highly sensitive to signatures of PMFs on matter overdensities. We present forecasts for future LIM CO surveys, finding that they can constrain PMF strength as small as $B_{\rm 1Mpc}\sim0.006-1\,{\rm nG}$, depending on the magnetic spectral index and the targeted redshifts.
△ Less
Submitted 21 June, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies
Authors:
James Paul Mason,
Alexandra Werth,
Colin G. West,
Allison A. Youngblood,
Donald L. Woodraska,
Courtney Peck,
Kevin Lacjak,
Florian G. Frick,
Moutamen Gabir,
Reema A. Alsinan,
Thomas Jacobsen,
Mohammad Alrubaie,
Kayla M. Chizmar,
Benjamin P. Lau,
Lizbeth Montoya Dominguez,
David Price,
Dylan R. Butler,
Connor J. Biron,
Nikita Feoktistov,
Kai Dewey,
N. E. Loomis,
Michal Bodzianowski,
Connor Kuybus,
Henry Dietrick,
Aubrey M. Wolfe
, et al. (977 additional authors not shown)
Abstract:
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th…
▽ More
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$. This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
CrossCarry: An R package for the analysis of data from a crossover design with GEE
Authors:
N. A. Cruz,
O. O. Melo,
C. A. Martinez
Abstract:
Experimental crossover designs are widely used in medicine, agriculture, and other areas of the biological sciences. Due to the characteristics of the crossover design, each experimental unit has longitudinal observations and the presence of drag effects on the response variable. There is no package in {R} that clearly models data from crossover designs. The {CrossCarry} package presented in this…
▽ More
Experimental crossover designs are widely used in medicine, agriculture, and other areas of the biological sciences. Due to the characteristics of the crossover design, each experimental unit has longitudinal observations and the presence of drag effects on the response variable. There is no package in {R} that clearly models data from crossover designs. The {CrossCarry} package presented in this paper allows testing any crossover design as long as the observed response variable belongs to the exponential family, regardless of whether or not there is a washout period. It also allows modeling repeated measurements within each period and extends the correlation structures used in the generalized estimating equations. The family of correlation structures is built that takes into account the particularities of the design, that is, the correlation between and within the periods. It also includes a parametric component for modeling treatment effects and a non-parametric component for modeling time effects and carry-over effects. The non-parametric component is estimated from splines inserted into the generalized estimation equations.
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
Paraconsistent Transition Systems
Authors:
Ana Cruz,
Alexandre Madeira,
LuÂ-Ã-s Soares Barbosa
Abstract:
Often in Software Engineering, a modeling formalism has to support scenarios of inconsistency in which several requirements either reinforce or contradict each other. Paraconsistent transition systems are proposed in this paper as one such formalism: states evolve through two accessibility relations capturing weighted evidence of a transition or its absence, respectively. Their weights come from a…
▽ More
Often in Software Engineering, a modeling formalism has to support scenarios of inconsistency in which several requirements either reinforce or contradict each other. Paraconsistent transition systems are proposed in this paper as one such formalism: states evolve through two accessibility relations capturing weighted evidence of a transition or its absence, respectively. Their weights come from a specific residuated lattice. A category of these systems, and the corresponding algebra, is defined as providing a formal setting to model different application scenarios. One of them, dealing with the effect of quantum decoherence in quantum programs, is used for illustration purposes.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
Search for neutrino counterparts to the gravitational wave sources from LIGO/Virgo O3 run with the ANTARES detector
Authors:
ANTARES Collaboration,
A. Albert,
S. Alves,
M. André,
M. Ardid,
S. Ardid,
J. -J. Aubert,
J. Aublin,
B. Baret,
S. Basa,
Y. Becherini,
B. Belhorma,
M. Bendahman,
F. Benfenati,
V. Bertin,
S. Biagi,
M. Bissinger,
J. Boumaaza,
M. Bouta,
M. C. Bouwhuis,
H. Brânzaş,
R. Bruijn,
J. Brunner,
J. Busto,
B. Caiffi
, et al. (128 additional authors not shown)
Abstract:
Since 2015 the LIGO and Virgo interferometers have detected gravitational waves from almost one hundred coalescences of compact objects (black holes and neutron stars). This article presents the results of a search performed with data from the ANTARES telescope to identify neutrino counterparts to the gravitational wave sources detected during the third LIGO/Virgo observing run and reported in the…
▽ More
Since 2015 the LIGO and Virgo interferometers have detected gravitational waves from almost one hundred coalescences of compact objects (black holes and neutron stars). This article presents the results of a search performed with data from the ANTARES telescope to identify neutrino counterparts to the gravitational wave sources detected during the third LIGO/Virgo observing run and reported in the catalogues GWTC-2, GWTC-2.1, and GWTC-3. This search is sensitive to all-sky neutrinos of all flavours and of energies $>100$ GeV, thanks to the inclusion of both track-like events (mainly induced by $ν_μ$ charged-current interactions) and shower-like events (induced by other interaction types). Neutrinos are selected if they are detected within $\pm 500$ s from the GW merger and with a reconstructed direction compatible with its sky localisation. No significant excess is found for any of the 80 analysed GW events, and upper limits on the neutrino emission are derived. Using the information from the GW catalogues and assuming isotropic emission, upper limits on the total energy $E_{\rm tot, ν}$ emitted as neutrinos of all flavours and on the ratio $f_ν= E_{\rm tot, ν}/E_{\rm GW}$ between neutrino and GW emissions are also computed. Finally, a stacked analysis of all the 72 binary black hole mergers (respectively the 7 neutron star - black hole merger candidates) has been performed to constrain the typical neutrino emission within this population, leading to the limits: $E_{\rm tot, ν} < 4.0 \times 10^{53}$ erg and $f_ν< 0.15$ (respectively, $E_{\rm tot, ν} < 3.2 \times 10^{53}$ erg and $f_ν< 0.88$) for $E^{-2}$ spectrum and isotropic emission. Other assumptions including softer spectra and non-isotropic scenarios have also been tested.
△ Less
Submitted 17 April, 2023; v1 submitted 15 February, 2023;
originally announced February 2023.
-
Probing invisible neutrino decay with KM3NeT-ORCA
Authors:
KM3NeT Collaboration,
S. Aiello,
A. Albert,
S. Alves Garre,
Z. Aly,
A. Ambrosone,
F. Ameli,
M. Andre,
M. Anghinolfi,
M. Anguita,
M. Ardid,
S. Ardid,
J. Aublin,
C. Bagatelas,
L. Bailly-Salins,
B. Baret,
S. Basegmez du Pree,
Y. Becherini,
M. Bendahman,
F. Benfenati,
E. Berbee,
V. Bertin,
S. Biagi,
M. Boettcher,
M. Bou Cabo
, et al. (230 additional authors not shown)
Abstract:
In the era of precision measurements of the neutrino oscillation parameters, upcoming neutrino experiments will also be sensitive to physics beyond the Standard Model. KM3NeT/ORCA is a neutrino detector optimised for measuring atmospheric neutrinos from a few GeV to around 100 GeV. In this paper, the sensitivity of the KM3NeT/ORCA detector to neutrino decay has been explored. A three-flavour neutr…
▽ More
In the era of precision measurements of the neutrino oscillation parameters, upcoming neutrino experiments will also be sensitive to physics beyond the Standard Model. KM3NeT/ORCA is a neutrino detector optimised for measuring atmospheric neutrinos from a few GeV to around 100 GeV. In this paper, the sensitivity of the KM3NeT/ORCA detector to neutrino decay has been explored. A three-flavour neutrino oscillation scenario, where the third neutrino mass state $ν_3$ decays into an invisible state, e.g. a sterile neutrino, is considered. We find that KM3NeT/ORCA would be sensitive to invisible neutrino decays with $1/α_3=τ_3/m_3 < 180$~$\mathrm{ps/eV}$ at $90\%$ confidence level, assuming true normal ordering. Finally, the impact of neutrino decay on the precision of KM3NeT/ORCA measurements for $θ_{23}$, $Δm^2_{31}$ and mass ordering have been studied. No significant effect of neutrino decay on the sensitivity to these measurements has been found.
△ Less
Submitted 27 March, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Mapping quantum geometry and quantum phase transitions to real space by a fidelity marker
Authors:
Matheus S. M. de Sousa,
Antonio L. Cruz,
Wei Chen
Abstract:
The quantum geometry in the momentum space of semiconductors and insulators, described by the quantum metric of the valence band Bloch state, has been an intriguing issue owing to its connection to various material properties. Because the Brillouin zone is periodic, the integration of quantum metric over momentum space represents an average distance between neighboring Bloch states, of which we ca…
▽ More
The quantum geometry in the momentum space of semiconductors and insulators, described by the quantum metric of the valence band Bloch state, has been an intriguing issue owing to its connection to various material properties. Because the Brillouin zone is periodic, the integration of quantum metric over momentum space represents an average distance between neighboring Bloch states, of which we call the fidelity number. We show that this number can further be expressed in real space as a fidelity marker, which is a local quantity that can be calculated directly from diagonalizing the lattice Hamiltonian. A linear response theory is further introduced to generalize the fidelity number and marker to finite temperature, and moreover demonstrates that they can be measured from the global and local optical absorption power against linearly polarized light. In particular, the fidelity number spectral function in 2D systems can be easily measured from the opacity of the material. Based on the divergence of quantum metric, a nonlocal fidelity marker is further introduced and postulated as a universal indicator of any quantum phase transitions provided the crystalline momentum remains a good quantum number, and it may be interpreted as a Wannier state correlation function. The ubiquity of these concepts is demonstrated for a variety of topological insulators and topological phase transitions in different dimensions.
△ Less
Submitted 29 May, 2023; v1 submitted 16 January, 2023;
originally announced January 2023.
-
Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation
Authors:
Sérgio Jesus,
José Pombal,
Duarte Alves,
André Cruz,
Pedro Saleiro,
Rita P. Ribeiro,
João Gama,
Pedro Bizarro
Abstract:
Evaluating new techniques on realistic datasets plays a crucial role in the development of ML research and its broader adoption by practitioners. In recent years, there has been a significant increase of publicly available unstructured data resources for computer vision and NLP tasks. However, tabular data -- which is prevalent in many high-stakes domains -- has been lagging behind. To bridge this…
▽ More
Evaluating new techniques on realistic datasets plays a crucial role in the development of ML research and its broader adoption by practitioners. In recent years, there has been a significant increase of publicly available unstructured data resources for computer vision and NLP tasks. However, tabular data -- which is prevalent in many high-stakes domains -- has been lagging behind. To bridge this gap, we present Bank Account Fraud (BAF), the first publicly available privacy-preserving, large-scale, realistic suite of tabular datasets. The suite was generated by applying state-of-the-art tabular data generation techniques on an anonymized,real-world bank account opening fraud detection dataset. This setting carries a set of challenges that are commonplace in real-world applications, including temporal dynamics and significant class imbalance. Additionally, to allow practitioners to stress test both performance and fairness of ML methods, each dataset variant of BAF contains specific types of data bias. With this resource, we aim to provide the research community with a more realistic, complete, and robust test bed to evaluate novel and existing methods.
△ Less
Submitted 28 November, 2022; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Estimating the Long-term Behavior of Biologically Inspired Agent-based Models
Authors:
Daniel A. Cruz,
Jack Toppen,
Eunbi Park,
Melissa L. Kemp,
Elena S. Dimitrova
Abstract:
An agent-based model (ABM) is a computational model in which the local interactions of autonomous agents with each other and with their environment give rise to global properties within a given domain. As the detail and complexity of these models has grown, so too has the computational expense of running several simulations to perform sensitivity analysis and evaluate long-term model behavior. Her…
▽ More
An agent-based model (ABM) is a computational model in which the local interactions of autonomous agents with each other and with their environment give rise to global properties within a given domain. As the detail and complexity of these models has grown, so too has the computational expense of running several simulations to perform sensitivity analysis and evaluate long-term model behavior. Here, we generalize a framework for mathematically formalizing ABMs to explicitly incorporate features commonly found in biological systems: appearance of agents (birth), removal of agents (death), and locally dependent state changes. We then use our broader framework to extend an approach for estimating long-term behavior without simulations, specifically changes in population densities over time. The approach is probabilistic and relies on treating the discrete, incremental update of an ABM via "time steps" as a Markov process to generate expected values for agents at each time step. As case studies, we apply our extensions to both a simple ABM based on the Game of Life and a published ABM of rib development in vertebrates.
△ Less
Submitted 30 November, 2022; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Global city densities: re-examining urban scaling theory
Authors:
Joseph R. Burger,
Jordan G. Okie,
Ian Hatton,
Vanessa P. Weinberger,
Munik Shrestha,
Kyra J. Liedtke,
Tam Be,
Austin R. Cruz,
Xiao Feng,
Cesar Hinojo-Hinojo,
Abu S. M. G. Kibria,
Kacey C. Ernst,
Brian J. Enquist
Abstract:
Understanding scaling relations of social and environmental attributes of urban systems is necessary for effectively managing cities. Urban scaling theory (UST) has assumed that population density scales positively with city size. We present a new global analysis using a publicly available database of 933 cities from 38 countries. Our results showed that (18/38) 47% of countries analyzed supported…
▽ More
Understanding scaling relations of social and environmental attributes of urban systems is necessary for effectively managing cities. Urban scaling theory (UST) has assumed that population density scales positively with city size. We present a new global analysis using a publicly available database of 933 cities from 38 countries. Our results showed that (18/38) 47% of countries analyzed supported increasing density scaling (pop ~ area) with exponents ~5/6 as UST predicts. In contrast, 17 of 38 countries (~45%) exhibited density scalings statistically indistinguishable from constant population densities across cities of varying sizes. These results were generally consistent in years spanning four decades from 1975 to 2015. Importantly, density varies by an order of magnitude between regions and countries and decreases in more developed economies. Our results (i) point to how economic and regional differences may affect the scaling of density with city size and (ii) show how understanding country- and region-specific strategies could inform effective management of urban systems for biodiversity, public health, conservation and resiliency from local to global scales.
△ Less
Submitted 14 October, 2022;
originally announced October 2022.
-
FairGBM: Gradient Boosting with Fairness Constraints
Authors:
André F Cruz,
Catarina Belém,
Sérgio Jesus,
João Bravo,
Pedro Saleiro,
Pedro Bizarro
Abstract:
Tabular data is prevalent in many high-stakes domains, such as financial services or public policy. Gradient Boosted Decision Trees (GBDT) are popular in these settings due to their scalability, performance, and low training cost. While fairness in these domains is a foremost concern, existing in-processing Fair ML methods are either incompatible with GBDT, or incur in significant performance loss…
▽ More
Tabular data is prevalent in many high-stakes domains, such as financial services or public policy. Gradient Boosted Decision Trees (GBDT) are popular in these settings due to their scalability, performance, and low training cost. While fairness in these domains is a foremost concern, existing in-processing Fair ML methods are either incompatible with GBDT, or incur in significant performance losses while taking considerably longer to train. We present FairGBM, a dual ascent learning framework for training GBDT under fairness constraints, with little to no impact on predictive performance when compared to unconstrained GBDT. Since observational fairness metrics are non-differentiable, we propose smooth convex error rate proxies for common fairness criteria, enabling gradient-based optimization using a ``proxy-Lagrangian'' formulation. Our implementation shows an order of magnitude speedup in training time relative to related work, a pivotal aspect to foster the widespread adoption of FairGBM by real-world practitioners.
△ Less
Submitted 3 March, 2023; v1 submitted 16 September, 2022;
originally announced September 2022.