-
NL-Eye: Abductive NLI for Images
Authors:
Mor Ventura,
Michael Toker,
Nitay Calderon,
Zorik Gekhman,
Yonatan Bitton,
Roi Reichart
Abstract:
Will a Visual Language Model (VLM)-based bot warn us about slipping if it detects a wet floor? Recent VLMs have demonstrated impressive capabilities, yet their ability to infer outcomes and causes remains underexplored. To address this, we introduce NL-Eye, a benchmark designed to assess VLMs' visual abductive reasoning skills. NL-Eye adapts the abductive Natural Language Inference (NLI) task to t…
▽ More
Will a Visual Language Model (VLM)-based bot warn us about slipping if it detects a wet floor? Recent VLMs have demonstrated impressive capabilities, yet their ability to infer outcomes and causes remains underexplored. To address this, we introduce NL-Eye, a benchmark designed to assess VLMs' visual abductive reasoning skills. NL-Eye adapts the abductive Natural Language Inference (NLI) task to the visual domain, requiring models to evaluate the plausibility of hypothesis images based on a premise image and explain their decisions. NL-Eye consists of 350 carefully curated triplet examples (1,050 images) spanning diverse reasoning categories: physical, functional, logical, emotional, cultural, and social. The data curation process involved two steps - writing textual descriptions and generating images using text-to-image models, both requiring substantial human involvement to ensure high-quality and challenging scenes. Our experiments show that VLMs struggle significantly on NL-Eye, often performing at random baseline levels, while humans excel in both plausibility prediction and explanation quality. This demonstrates a deficiency in the abductive reasoning capabilities of modern VLMs. NL-Eye represents a crucial step toward developing VLMs capable of robust multimodal reasoning for real-world applications, including accident-prevention bots and generated video verification.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
The MOST Hosts Survey: spectroscopic observation of the host galaxies of ~40,000 transients using DESI
Authors:
Maayane T. Soumagnac,
Peter Nugent,
Robert A. Knop,
Anna Y. Q. Ho,
William Hohensee,
Autumn Awbrey,
Alexis Andersen,
Greg Aldering,
Matan Ventura,
Jessica N. Aguilar,
Steven Ahlen,
Segev Y. Benzvi,
David Brooks,
Dillon Brout,
Todd Claybaugh,
Tamara M. Davis,
Kyle Dawson,
Axel de la Macorra,
Arjun Dey,
Biprateep Dey,
Peter Doel,
Kelly A. Douglass,
Jaime E. Forero-Romero,
Enrique Gaztanaga,
Satya Gontcho A Gontcho
, et al. (32 additional authors not shown)
Abstract:
We present the MOST Hosts survey (Multi-Object Spectroscopy of Transient Hosts). The survey is planned to run throughout the five years of operation of the Dark Energy Spectroscopic Instrument (DESI) and will generate a spectroscopic catalog of the hosts of most transients observed to date, in particular all the supernovae observed by most public, untargeted, wide-field, optical surveys (PTF/iPTF,…
▽ More
We present the MOST Hosts survey (Multi-Object Spectroscopy of Transient Hosts). The survey is planned to run throughout the five years of operation of the Dark Energy Spectroscopic Instrument (DESI) and will generate a spectroscopic catalog of the hosts of most transients observed to date, in particular all the supernovae observed by most public, untargeted, wide-field, optical surveys (PTF/iPTF, SDSS II, ZTF, DECAT, DESIRT). Scientific questions for which the MOST Hosts survey will be useful include Type Ia supernova cosmology, fundamental plane and peculiar velocity measurements, and the understanding of the correlations between transients and their host galaxy properties. Here, we present the first release of the MOST Hosts survey: 21,931 hosts of 20,235 transients. These numbers represent 36% of the final MOST Hosts sample, consisting of 60,212 potential host galaxies of 38,603 transients (a transient can be assigned multiple potential hosts). Of these galaxies, 40% do not appear in the DESI primary target list and therefore require a specific program like MOST Hosts. Of all the transients in the MOST Hosts list, only 26.7% have existing classifications, and so the survey will provide redshifts (and luminosities) for nearly 30,000 transients. A preliminary Hubble diagram and a transient luminosity-duration diagram are shown as examples of future potential uses of the MOST Hosts survey. The survey will also provide a training sample of spectroscopically observed transients for photometry-only classifiers, as we enter an era when most newly observed transients will lack spectroscopic classification. The MOST Hosts DESI survey data will be released through the Wiserep platform on a rolling cadence and updated to match the DESI releases. Dates of future releases and updates are available through the https://mosthosts.desi.lbl.gov website.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Authors:
Michael Toker,
Hadas Orgad,
Mor Ventura,
Dana Arad,
Yonatan Belinkov
Abstract:
Text-to-image diffusion models (T2I) use a latent representation of a text prompt to guide the image generation process. However, the process by which the encoder produces the text representation is unknown. We propose the Diffusion Lens, a method for analyzing the text encoder of T2I models by generating images from its intermediate representations. Using the Diffusion Lens, we perform an extensi…
▽ More
Text-to-image diffusion models (T2I) use a latent representation of a text prompt to guide the image generation process. However, the process by which the encoder produces the text representation is unknown. We propose the Diffusion Lens, a method for analyzing the text encoder of T2I models by generating images from its intermediate representations. Using the Diffusion Lens, we perform an extensive analysis of two recent T2I models. Exploring compound prompts, we find that complex scenes describing multiple objects are composed progressively and more slowly compared to simple scenes; Exploring knowledge retrieval, we find that representation of uncommon concepts requires further computation compared to common concepts, and that knowledge retrieval is gradual across layers. Overall, our findings provide valuable insights into the text encoder component in T2I pipelines.
△ Less
Submitted 21 October, 2024; v1 submitted 9 March, 2024;
originally announced March 2024.
-
Semi-analytic modelling of Pop. III star formation and metallicity evolution -- I. Impact on the UV luminosity functions at z = 9-16
Authors:
Emanuele M. Ventura,
Yuxiang Qin,
Sreedhar Balu,
J. Stuart B. Wyithe
Abstract:
We implemented Population III (Pop. III) star formation in mini-halos within the MERAXES semi-analytic galaxy formation and reionisation model, run on top of a N-body simulation with $L = 10 h^{-1}$ cMpc with 2048$^3$ particles resolving all dark matter halos down to the mini-halos ($\sim 10^5 M_\odot$). Our modelling includes the chemical evolution of the IGM, with metals released through superno…
▽ More
We implemented Population III (Pop. III) star formation in mini-halos within the MERAXES semi-analytic galaxy formation and reionisation model, run on top of a N-body simulation with $L = 10 h^{-1}$ cMpc with 2048$^3$ particles resolving all dark matter halos down to the mini-halos ($\sim 10^5 M_\odot$). Our modelling includes the chemical evolution of the IGM, with metals released through supernova-driven bubbles that expand according to the Sedov-Taylor model. We found that SN-driven metal bubbles are generally small, with radii typically of 150 ckpc at z = 6. Hence, the majority of the first galaxies are likely enriched by their own star formation. However, as reionization progresses, the feedback effects from the UV background become more pronounced, leading to a halt in star formation in low-mass galaxies, after which external chemical enrichment becomes more relevant. We explore the sensitivity of the star formation rate density and stellar mass functions on the unknown values of free parameters. We also discuss the observability of Pop. III dominated systems with JWST, finding that the inclusion of Pop. III galaxies can have a significant effect on the total UV luminosity function at z = 12 - 16. Our results support the idea that the excess of bright galaxies detected with JWST might be explained by the presence of bright top-heavy Pop. III dominated galaxies without requiring an increased star formation efficiency.
△ Less
Submitted 21 February, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
The Initial Mass Function Based on the Full-sky 20-pc Census of $\sim$3,600 Stars and Brown Dwarfs
Authors:
J. Davy Kirkpatrick,
Federico Marocco,
Christopher R. Gelino,
Yadukrishna Raghu,
Jacqueline K. Faherty,
Daniella C. Bardalez Gagliuffi,
Steven D. Schurr,
Kevin Apps,
Adam C. Schneider,
Aaron M. Meisner,
Marc J. Kuchner,
Dan Caselden,
R. L. Smart,
S. L. Casewell,
Roberto Raddi,
Aurora Kesseli,
Nikolaj Stevnbak Andersen,
Edoardo Antonini,
Paul Beaulieu,
Thomas P. Bickle,
Martin Bilsing,
Raymond Chieng,
Guillaume Colin,
Sam Deen,
Alexandru Dereveanco
, et al. (63 additional authors not shown)
Abstract:
A complete accounting of nearby objects -- from the highest-mass white dwarf progenitors down to low-mass brown dwarfs -- is now possible, thanks to an almost complete set of trigonometric parallax determinations from Gaia, ground-based surveys, and Spitzer follow-up. We create a census of objects within a Sun-centered sphere of 20-pc radius and check published literature to decompose each binary…
▽ More
A complete accounting of nearby objects -- from the highest-mass white dwarf progenitors down to low-mass brown dwarfs -- is now possible, thanks to an almost complete set of trigonometric parallax determinations from Gaia, ground-based surveys, and Spitzer follow-up. We create a census of objects within a Sun-centered sphere of 20-pc radius and check published literature to decompose each binary or higher-order system into its separate components. The result is a volume-limited census of $\sim$3,600 individual star formation products useful in measuring the initial mass function across the stellar ($<8 M_\odot$) and substellar ($\gtrsim 5 M_{Jup}$) regimes. Comparing our resulting initial mass function to previous measurements shows good agreement above 0.8$M_\odot$ and a divergence at lower masses. Our 20-pc space densities are best fit with a quadripartite power law, $ξ(M) = dN/dM \propto M^{-α}$ with long-established values of $α= 2.3$ at high masses ($0.55 < M < 8.00 M_\odot$) and $α= 1.3$ at intermediate masses ($0.22 < M < 0.55 M_\odot$), but at lower masses we find $α= 0.25$ for $0.05 < M <0.22 M_\odot$ and $α= 0.6$ for $0.01 < M < 0.05 M_\odot$. This implies that the rate of production as a function of decreasing mass diminishes in the low-mass star/high-mass brown dwarf regime before increasing again in the low-mass brown dwarf regime. Correcting for completeness, we find a star to brown dwarf number ratio of, currently, 4:1, and an average mass per object of 0.41 $M_\odot$.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models
Authors:
Mor Ventura,
Eyal Ben-David,
Anna Korhonen,
Roi Reichart
Abstract:
Text-To-Image (TTI) models, such as DALL-E and StableDiffusion, have demonstrated remarkable prompt-based image generation capabilities. Multilingual encoders may have a substantial impact on the cultural agency of these models, as language is a conduit of culture. In this study, we explore the cultural perception embedded in TTI models by characterizing culture across three hierarchical tiers: cu…
▽ More
Text-To-Image (TTI) models, such as DALL-E and StableDiffusion, have demonstrated remarkable prompt-based image generation capabilities. Multilingual encoders may have a substantial impact on the cultural agency of these models, as language is a conduit of culture. In this study, we explore the cultural perception embedded in TTI models by characterizing culture across three hierarchical tiers: cultural dimensions, cultural domains, and cultural concepts. Based on this ontology, we derive prompt templates to unlock the cultural knowledge in TTI models, and propose a comprehensive suite of evaluation techniques, including intrinsic evaluations using the CLIP space, extrinsic evaluations with a Visual-Question-Answer (VQA) model and human assessments, to evaluate the cultural content of TTI-generated images. To bolster our research, we introduce the CulText2I dataset, derived from six diverse TTI models and spanning ten languages. Our experiments provide insights regarding Do, What, Which and How research questions about the nature of cultural encoding in TTI models, paving the way for cross-cultural applications of these models.
△ Less
Submitted 13 August, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Machine learning of twin/matrix interfaces from local stress field
Authors:
Javier F. Troncoso,
Yang Hu,
Nicolo M. della Ventura,
Amit Sharma,
Xavier Maeder,
Vladyslav Turlo
Abstract:
Twinning is an important deformation mode in plastically deformed hexagonal close-packed materials. The extremely high twin growth rates at the nanoscale make atomistic simulations an attractive method for investigating the role of individual twin/matrix interfaces such as twin boundaries and basal-prismatic interfaces in twin growth kinetics. Unfortunately, there is no single framework that allow…
▽ More
Twinning is an important deformation mode in plastically deformed hexagonal close-packed materials. The extremely high twin growth rates at the nanoscale make atomistic simulations an attractive method for investigating the role of individual twin/matrix interfaces such as twin boundaries and basal-prismatic interfaces in twin growth kinetics. Unfortunately, there is no single framework that allows researchers to differentiate such interfaces automatically, neither in experimental in-situ transmission electron microscopy analysis images nor in atomistic simulations. Moreover, the presence of alloying elements introduces substantial noise to local atomic environments, making it nearly impossible to identify which atoms belong to which interface. Here, with the help of advanced machine learning methods, we provide a proof-of-concept way of using the local stress field distribution as an indicator for the presence of interfaces and for determining their types. We apply such an analysis to the growth of twin embryos in Mg-10 at.% Al alloys under constant stress and constant strain conditions, corresponding to two extremes of high and low strain rates, respectively. We discover that the kinetics of such growth is driven by high-energy basal-prismatic interfaces, in line with our experimental observations for pure Mg.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
TRBLLmaker -- Transformer Reads Between Lyrics Lines maker
Authors:
Mor Ventura,
Michael Toker
Abstract:
Even for us, it can be challenging to comprehend the meaning of songs. As part of this project, we explore the process of generating the meaning of songs. Despite the widespread use of text-to-text models, few attempts have been made to achieve a similar objective. Songs are primarily studied in the context of sentiment analysis. This involves identifying opinions and emotions in texts, evaluating…
▽ More
Even for us, it can be challenging to comprehend the meaning of songs. As part of this project, we explore the process of generating the meaning of songs. Despite the widespread use of text-to-text models, few attempts have been made to achieve a similar objective. Songs are primarily studied in the context of sentiment analysis. This involves identifying opinions and emotions in texts, evaluating them as positive or negative, and utilizing these evaluations to make music recommendations. In this paper, we present a generative model that offers implicit meanings for several lines of a song. Our model uses a decoder Transformer architecture GPT-2, where the input is the lyrics of a song. Furthermore, we compared the performance of this architecture with that of the encoder-decoder Transformer architecture of the T5 model. We also examined the effect of different prompt types with the option of appending additional information, such as the name of the artist and the title of the song. Moreover, we tested different decoding methods with different training parameters and evaluated our results using ROUGE. In order to build our dataset, we utilized the 'Genious' API, which allowed us to acquire the lyrics of songs and their explanations, as well as their rich metadata.
△ Less
Submitted 9 December, 2022;
originally announced December 2022.
-
The role of Pop III stars and early black holes in the 21cm signal from Cosmic Dawn
Authors:
Emanuele M. Ventura,
Alessandro Trinca,
Raffaella Schneider,
Luca Graziani,
Rosa Valiante,
J. Stuart B. Wyithe
Abstract:
Modeling the 21cm global signal from the Cosmic Dawn is challenging due to the many poorly constrained physical processes that come into play. We address this problem using the semi-analytical code "Cosmic Archaeology Tool" (CAT). CAT follows the evolution of dark matter halos tracking their merger history and provides an ab initio description of their baryonic evolution, starting from the formati…
▽ More
Modeling the 21cm global signal from the Cosmic Dawn is challenging due to the many poorly constrained physical processes that come into play. We address this problem using the semi-analytical code "Cosmic Archaeology Tool" (CAT). CAT follows the evolution of dark matter halos tracking their merger history and provides an ab initio description of their baryonic evolution, starting from the formation of the first (Pop III) stars and black holes (BHs) in mini-halos at z > 20. The model is anchored to observations of galaxies and AGN at z < 6 and predicts a reionization history consistent with constraints. In this work we compute the evolution of the mean global 21cm signal between $4\leq z \leq 40$ based on the rate of formation and emission properties of stars and accreting black holes. We obtain an absorption profile with a maximum depth $δ{\rm T_b} = -95$ mK at $z \sim 26.5$ (54 MHz). This feature is quickly suppressed turning into an emission signal at $z = 20$ due to the contribution of accreting BHs that efficiently heat the IGM at $z < 27$. The high-$z$ absorption feature is caused by the early coupling between the spin and kinetic temperature of the IGM induced by Pop III star formation episodes in mini-halos. Once we account for an additional radio background from early BHs, we are able to reproduce the timing and the depth of the EDGES signal only if we consider a smaller X-ray background from accreting BHs, but not the shape.
△ Less
Submitted 18 January, 2023; v1 submitted 18 October, 2022;
originally announced October 2022.
-
Rethinking The Memory Staleness Problem In Dynamics GNN
Authors:
Mor Ventura,
Hadas Ben Atya,
Dekel Brav
Abstract:
The staleness problem is a well-known problem when working with dynamic data, due to the absence of events for a long time. Since the memory of the node is updated only when the node is involved in an event, its memory becomes stale. Usually, it refers to a lack of events such as a temporal deactivation of a social account. To overcome the memory staleness problem aggregate information from the no…
▽ More
The staleness problem is a well-known problem when working with dynamic data, due to the absence of events for a long time. Since the memory of the node is updated only when the node is involved in an event, its memory becomes stale. Usually, it refers to a lack of events such as a temporal deactivation of a social account. To overcome the memory staleness problem aggregate information from the nodes neighbors memory in addition to the nodes memory. Inspired by that, we design an updated embedding module that inserts the most similar node in addition to the nodes neighbors. Our method achieved similar results to the TGN, with a slight improvement. This could indicate a potential improvement after fine-tuning our hyper-parameters, especially the time threshold, and using a learnable similarity metric.
△ Less
Submitted 6 September, 2022;
originally announced September 2022.
-
CUBES: A Parallel Synthesizer for SQL Using Examples
Authors:
Ricardo Brancas,
Miguel Terra-Neves,
Miguel Ventura,
Vasco Manquinho,
Ruben Martins
Abstract:
In recent years, more people have seen their work depend on data manipulation tasks. However, many of these users do not have the background in programming required to write complex programs, particularly SQL queries. One way of helping these users is automatically synthesizing the SQL query given a small set of examples. Several program synthesizers for SQL have been recently proposed, but they d…
▽ More
In recent years, more people have seen their work depend on data manipulation tasks. However, many of these users do not have the background in programming required to write complex programs, particularly SQL queries. One way of helping these users is automatically synthesizing the SQL query given a small set of examples. Several program synthesizers for SQL have been recently proposed, but they do not leverage multicore architectures.
This paper proposes CUBES, a parallel program synthesizer for the domain of SQL queries using input-output examples. Since input-output examples are an under-specification of the desired SQL query, sometimes, the synthesized query does not match the user's intent. CUBES incorporates a new disambiguation procedure based on fuzzing techniques that interacts with the user and increases the confidence that the returned query matches the user intent.
We perform an extensive evaluation on around 4000 SQL queries from different domains. Experimental results show that our sequential version can solve more instances than other state-of-the-art SQL synthesizers. Moreover, the parallel approach can scale up to 16 processes with super-linear speedups for many hard instances. Our disambiguation approach is critical to achieving an accuracy of around 60%, significantly larger than other SQL synthesizers.
△ Less
Submitted 1 February, 2024; v1 submitted 9 March, 2022;
originally announced March 2022.
-
Duplicated Code Pattern Mining in Visual Programming Languages
Authors:
Miguel Terra-Neves,
João Nadkarni,
Miguel Ventura,
Pedro Resende,
Hugo Veiga,
António Alegria
Abstract:
Visual Programming Languages (VPLs), coupled with the high-level abstractions that are commonplace in visual programming environments, enable users with less technical knowledge to become proficient programmers. However, the lower skill floor required by VPLs also entails that programmers are more likely to not adhere to best practices of software development, producing systems with high technical…
▽ More
Visual Programming Languages (VPLs), coupled with the high-level abstractions that are commonplace in visual programming environments, enable users with less technical knowledge to become proficient programmers. However, the lower skill floor required by VPLs also entails that programmers are more likely to not adhere to best practices of software development, producing systems with high technical debt, and thus poor maintainability. Duplicated code is one important example of such technical debt. In fact, we observed that the amount of duplication in the OutSystems VPL code bases can reach as high as $39\%$.
Duplicated code detection in text-based programming languages is still an active area of research with important implications regarding software maintainability and evolution. However, to the best of our knowledge, the literature on duplicated code detection for VPLs is very limited. We propose a novel and scalable duplicated code pattern mining algorithm that leverages the visual structure of VPLs in order to not only detect duplicated code, but also highlight duplicated code patterns that explain the reported duplication. The performance of the proposed approach is evaluated on a wide range of real-world mobile and web applications developed using OutSystems.
△ Less
Submitted 15 July, 2021;
originally announced July 2021.
-
Modeling differential rates of aging using routine laboratory data; Implications for morbidity and health care expenditure
Authors:
Alix Jean Santos,
Xavier Eugenio Asuncion,
Camille Rivero-Co,
Maria Eloisa Ventura,
Reynaldo Geronia II,
Lauren Bangerter,
Natalie E. Sheils
Abstract:
Aging is a multidimensional process where phenotypes change at varying rates. Longitudinal studies of aging typically involve following a cohort of individuals over the course of several years. This design is hindered by cost, attrition, and subsequently small sample size. Alternative methodologies are therefore warranted. In this study, we used a variational autoencoder to estimate rates of aging…
▽ More
Aging is a multidimensional process where phenotypes change at varying rates. Longitudinal studies of aging typically involve following a cohort of individuals over the course of several years. This design is hindered by cost, attrition, and subsequently small sample size. Alternative methodologies are therefore warranted. In this study, we used a variational autoencoder to estimate rates of aging from cross-sectional data from routine laboratory tests of 1.4 million individuals collected from 2016 to 2019. By incorporating metrics that would ensure model's stability and distinctness of the dimensions, we uncovered four aging dimensions that represent the following bodily functions: 1) kidney, 2) thyroid, 3) white blood cells, and 4) liver and heart. We then examined the relationship between rates of aging on morbidity and health care expenditure. In general, faster agers along these dimensions are more likely to develop chronic diseases that are related to these bodily functions. They also had higher health care expenditures compared to the slower agers. K-means clustering of individuals based on rate of aging revealed that clusters with higher odds of developing morbidity had the highest cost across all types of health care services. Results suggest that cross-sectional laboratory data can be leveraged as an alternative methodology to understand age along the different dimensions. Moreover, rates of aging are differentially related to future costs, which can aid in the development of interventions to delay disease progression.
△ Less
Submitted 17 March, 2021;
originally announced March 2021.
-
The sooner the better: lives saved by the lockdown during the COVID-19 outbreak. The case of Italy
Authors:
Roy Cerqueti,
Raffaella Coppier,
Alessandro Girardi,
Marco Ventura
Abstract:
This paper estimates the effects of non-pharmaceutical interventions - mainly, the lockdown - on the COVID-19 mortality rate for the case of Italy, the first Western country to impose a national shelter-in-place order. We use a new estimator, the Augmented Synthetic Control Method (ASCM), that overcomes some limits of the standard Synthetic Control Method (SCM). The results are twofold. From a met…
▽ More
This paper estimates the effects of non-pharmaceutical interventions - mainly, the lockdown - on the COVID-19 mortality rate for the case of Italy, the first Western country to impose a national shelter-in-place order. We use a new estimator, the Augmented Synthetic Control Method (ASCM), that overcomes some limits of the standard Synthetic Control Method (SCM). The results are twofold. From a methodological point of view, the ASCM outperforms the SCM in that the latter cannot select a valid donor set, assigning all the weights to only one country (Spain) while placing zero weights to all the remaining. From an empirical point of view, we find strong evidence of the effectiveness of non-pharmaceutical interventions in avoiding losses of human lives in Italy: conservative estimates indicate that for each human life actually lost, in the absence of lockdown there would have been on average other 1.15, the policy saved in total 20,400 human lives.
△ Less
Submitted 28 January, 2021;
originally announced January 2021.
-
FOREST: An Interactive Multi-tree Synthesizer for Regular Expressions
Authors:
Margarida Ferreira,
Miguel Terra-Neves,
Miguel Ventura,
Inês Lynce,
Ruben Martins
Abstract:
Form validators based on regular expressions are often used on digital forms to prevent users from inserting data in the wrong format. However, writing these validators can pose a challenge to some users. We present FOREST, a regular expression synthesizer for digital form validations. FOREST produces a regular expression that matches the desired pattern for the input values and a set of condition…
▽ More
Form validators based on regular expressions are often used on digital forms to prevent users from inserting data in the wrong format. However, writing these validators can pose a challenge to some users. We present FOREST, a regular expression synthesizer for digital form validations. FOREST produces a regular expression that matches the desired pattern for the input values and a set of conditions over capturing groups that ensure the validity of integer values in the input. Our synthesis procedure is based on enumerative search and uses a Satisfiability Modulo Theories (SMT) solver to explore and prune the search space. We propose a novel representation for regular expressions synthesis, multi-tree, which induces patterns in the examples and uses them to split the problem through a divide-and-conquer approach. We also present a new SMT encoding to synthesize capture conditions for a given regular expression. To increase confidence in the synthesized regular expression, we implement user interaction based on distinguishing inputs. We evaluated FOREST on real-world form-validation instances using regular expressions. Experimental results show that FOREST successfully returns the desired regular expression in 72% of the instances and outperforms REGEL, a state-of-the-art regular expression synthesizer.
△ Less
Submitted 28 December, 2020;
originally announced December 2020.
-
Multiple populations in globular clusters and their parent galaxies
Authors:
A. P. Milone,
A. F. Marino,
G. S. Da Costa,
E. P. Lagioia,
F. D'Antona,
P. Goudfrooij,
H. Jerjen,
D. Massari,
A. Renzini,
D. Yong,
H. Baumgardt,
G. Cordoni,
E. Dondoglio,
C. Li,
M. Tailo,
R. Asa'd,
E. M. Ventura
Abstract:
The 'chromosome map' diagram (ChM) proved a successful tool to identify and characterize multiple populations (MPs) in 59 Galactic Globular Clusters (GCs). Here, we construct ChMs for 11 GCs of both Magellanic Clouds (MCs) and with different ages to compare MPs in Galactic and extra-Galactic environments, and explore whether this phenomenon is universal through 'place' and 'time'. MPs are detected…
▽ More
The 'chromosome map' diagram (ChM) proved a successful tool to identify and characterize multiple populations (MPs) in 59 Galactic Globular Clusters (GCs). Here, we construct ChMs for 11 GCs of both Magellanic Clouds (MCs) and with different ages to compare MPs in Galactic and extra-Galactic environments, and explore whether this phenomenon is universal through 'place' and 'time'. MPs are detected in five clusters. The fractions of 1G stars, ranging from about 50% to more than 80%, are significantly higher than those observed in Galactic GCs with similar present-day masses. By considering both Galactic and MC clusters, the fraction of 1G stars exhibits: (i) a strong anti-correlation with the present-day mass, and (ii) with the present-day mass of 2G stars; (iii) a mild anti-correlation with 1G present-day mass. All Galactic clusters without MPs have initial masses smaller than ~1.5 10^5 solar masses but a mass threshold governing the occurrence of MPs seems challenged by massive simple-population MC GCs; (iv) Milky Way clusters with large perigalactic distances typically host larger fractions of 1G stars, but the difference disappears when we use initial cluster masses. These facts are consistent with a scenario where the stars lost by GCs mostly belong to the 1G. By exploiting recent work based on Gaia, half of the known Type II GCs appear clustered in a distinct region of the integral of motions space, thus suggesting a common progenitor galaxy. Except for these Type II GCs,we do not find any significant difference in the MPs between clusters associated with different progenitors.
△ Less
Submitted 21 October, 2019;
originally announced October 2019.
-
Streamer studies in Resistive Plate Chambers
Authors:
A. Paoloni,
A. Mengucci,
M. Spinetti,
M. Ventura,
L. Votano
Abstract:
The present paper is meant as an update of the presentation given in a previous Resistive Plate Chamber (RPC) workshop, aimed at finding an eco-friendly gas mixture for streamer operation of RPCs. Indeed the streamer working regime is still suitable for building large RPC systems dedicated to low rate applications, such as cosmic ray and neutrino physics. In addition to other studies about gas mix…
▽ More
The present paper is meant as an update of the presentation given in a previous Resistive Plate Chamber (RPC) workshop, aimed at finding an eco-friendly gas mixture for streamer operation of RPCs. Indeed the streamer working regime is still suitable for building large RPC systems dedicated to low rate applications, such as cosmic ray and neutrino physics. In addition to other studies about gas mixtures for streamer mode operation, in this paper the replacement of R134a with CF4, a gas widely used in other gaseous detectors, has been investigated. The effect of the gas gap thickness on the discharge quenching has also been studied; this is an important check because thin gas gaps of 1 mm, one half of the typical used value, have been introduced for high rate applications. Finally preliminar results about the streamer formation timing are also reported.
△ Less
Submitted 29 May, 2019; v1 submitted 9 June, 2018;
originally announced June 2018.
-
Gas gap studies about streamer operated RPCs
Authors:
A. Paoloni,
A. Mengucci,
M. Spinetti,
M. Ventura,
L. Votano
Abstract:
The requirement of high rate capability for operation at LHC, led 20 years ago to the achievement of Resistive Plate Chambers operated in avalanche mode, thanks to the introduction of new gas mixtures and to the development of the Front-End electronics. The need for a further increase of the rate capability, in view of the upgrades of LHC, is imposing new detector geometries with thinner gas gaps…
▽ More
The requirement of high rate capability for operation at LHC, led 20 years ago to the achievement of Resistive Plate Chambers operated in avalanche mode, thanks to the introduction of new gas mixtures and to the development of the Front-End electronics. The need for a further increase of the rate capability, in view of the upgrades of LHC, is imposing new detector geometries with thinner gas gaps and electrodes. Streamer operation of RPCs may still be suitable for low rate experiments, and therefore in this paper a comparison between two different detector geometries, the old standard and the newly proposed one, is performed in streamer mode.
△ Less
Submitted 15 May, 2017; v1 submitted 15 April, 2017;
originally announced April 2017.
-
The NESSiE way to searches for sterile neutrinos at FNAL
Authors:
L. Stanco,
A. Anokhina,
A. Bagulya,
M. Benettoni,
P. Bernardini,
R. Brugnera,
M. Calabrese,
A. Cecchetti,
S. Cecchini,
M. Chernyavskiy,
P. Creti,
F. Dal Corso,
O. Dalkarov,
A. Del Prete,
G. De Robertis,
M. De Serio,
L. Degli Esposti,
D. Di Ferdinando,
S. Dusini,
T. Dzhatdoev,
C. Fanin,
R. A. Fini,
G. Fiore,
A. Garfagnini,
S. Golovanov
, et al. (44 additional authors not shown)
Abstract:
Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the mixing angle $θ_{13}$ in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, puzzling measuremen…
▽ More
Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the mixing angle $θ_{13}$ in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, puzzling measurements exist that deserve an exhaustive evaluation.
The NESSiE Collaboration has been setup to undertake conclusive experiments to clarify the muon-neutrino disappearance measurements at small $L/E$, which will be able to put severe constraints to models with more than the three-standard neutrinos, or even to robustly measure the presence of a new kind of neutrino oscillation for the first time. To this aim the use of the current FNAL-Booster neutrino beam for a Short-Baseline experiment has been carefully evaluated. Its recent proposal refers to the use of magnetic spectrometers at two different sites, Near and Far ones. Their positions have been extensively studied, together with the possible performances of two OPERA-like spectrometers. The proposal is constrained by availability of existing hardware and a time-schedule compatible with the undergoing project of a multi-site Liquid-Argon detectors at FNAL.
The experiment to be possibly setup at Booster will allow to definitively clarify the current $ν_μ$ disappearance tension with $ν_{e}$ appearance and disappearance at the eV mass scale.
△ Less
Submitted 15 October, 2014;
originally announced October 2014.
-
Prospects for the measurement of muon-neutrino disappearance at the FNAL-Booster
Authors:
A. Anokhina,
A. Bagulya,
M. Benettoni,
P. Bernardini,
R. Brugnera,
M. Calabrese,
A. Cecchetti,
S. Cecchini,
M. Chernyavskiy,
P. Creti,
F. Dal Corso,
O. Dalkarov,
A. Del Prete,
G. De Robertis,
M. De Serio,
L. Degli Esposti,
D. Di Ferdinando,
S. Dusini,
T. Dzhatdoev,
C. Fanin,
R. A. Fini,
G. Fiore,
A. Garfagnini,
S. Golovanov,
M. Guerzoni
, et al. (44 additional authors not shown)
Abstract:
Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the mixing angle $θ_{13}$ in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, puzzling measuremen…
▽ More
Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the mixing angle $θ_{13}$ in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, puzzling measurements exist that deserve an exhaustive evaluation. The NESSiE Collaboration has been setup to undertake conclusive experiments to clarify the muon-neutrino disappearance measurements at small $L/E$, which will be able to put severe constraints to models with more than the three-standard neutrinos, or even to robustly measure the presence of a new kind of neutrino oscillation for the first time. To this aim the use of the current FNAL-Booster neutrino beam for a Short-Baseline experiment has been carefully evaluated. This proposal refers to the use of magnetic spectrometers at two different sites, Near and Far. Their positions have been extensively studied, together with the possible performances of two OPERA-like spectrometers. The proposal is constrained by availability of existing hardware and a time-schedule compatible with the CERN project for a new more performant neutrino beam, which will nicely extend the physics results achievable at the Booster. The possible FNAL experiment will allow to clarify the current $ν_μ$ disappearance tension with $ν_e$ appearance and disappearance at the eV mass scale. Instead, a new CERN neutrino beam would allow a further span in the parameter space together with a refined control of systematics and, more relevant, the measurement of the antineutrino sector, by upgrading the spectrometer with detectors currently under R&D study.
△ Less
Submitted 9 April, 2014;
originally announced April 2014.
-
The NESSiE Concept for Sterile Neutrinos
Authors:
L. Stanco,
A. Anokhina,
A. Bagulya,
M. Benettoni,
P. Bernardini,
A. Bertolin,
R. Brugnera,
M. Calabrese,
A. Cecchetti,
S. Cecchini,
M. Chernyavskiy,
G. Collazuol,
P. Creti,
F. Dal Corso,
O. Dalkarov,
A. Del Prete,
I. De Mitri,
G. De Robertis,
M. De Serio,
L. Degli Esposti,
D. Di Ferdinando,
U. Dore,
S. Dusini,
T. Dzhatdoev,
C. Fanin
, et al. (56 additional authors not shown)
Abstract:
Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the third mixing angle theta13 in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, several puzzli…
▽ More
Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the third mixing angle theta13 in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, several puzzling measurements exist, which deserve an exhaustive evaluation. The NESSiE Collaboration has been setup to undertake a definitive experiment to clarify the muon disappearance measurements at small L/E, which will be able to put severe constraints to any model with more than the three-standard neutrinos, or even to robustly measure the presence of a new kind of neutrino oscillation for the first time. Within the context of the current CERN project, aimed to revitalize the neutrino field in Europe, we will illustrate the achievements that can be obtained by a double muon-spectrometer system, with emphasis on the search for sterile neutrinos.
△ Less
Submitted 4 December, 2013;
originally announced December 2013.
-
Search for anomalies in the neutrino sector with muon spectrometers and large LArTPC imaging detectors at CERN
Authors:
M. Antonello,
D. Bagliani,
B. Baibussinov,
H. Bilokon,
F. Boffelli,
M. Bonesini,
E. Calligarich,
N. Canci,
S. Centro,
A. Cesana,
K. Cieslik,
D. B. Cline,
A. G. Cocco,
D. Dequal,
A. Dermenev,
R. Dolfini,
M. De Gerone,
S. Dussoni,
C. Farnese,
A. Fava,
A. Ferrari,
G. Fiorillo,
G. T. Garvey,
F. Gatti,
D. Gibin
, et al. (114 additional authors not shown)
Abstract:
A new experiment with an intense ~2 GeV neutrino beam at CERN SPS is proposed in order to definitely clarify the possible existence of additional neutrino states, as pointed out by neutrino calibration source experiments, reactor and accelerator experiments and measure the corresponding oscillation parameters. The experiment is based on two identical LAr-TPCs complemented by magnetized spectromete…
▽ More
A new experiment with an intense ~2 GeV neutrino beam at CERN SPS is proposed in order to definitely clarify the possible existence of additional neutrino states, as pointed out by neutrino calibration source experiments, reactor and accelerator experiments and measure the corresponding oscillation parameters. The experiment is based on two identical LAr-TPCs complemented by magnetized spectrometers detecting electron and muon neutrino events at Far and Near positions, 1600 m and 300 m from the proton target, respectively. The ICARUS T600 detector, the largest LAr-TPC ever built with a size of about 600 ton of imaging mass, now running in the LNGS underground laboratory, will be moved at the CERN Far position. An additional 1/4 of the T600 detector (T150) will be constructed and located in the Near position. Two large area spectrometers will be placed downstream of the two LAr-TPC detectors to perform charge identification and muon momentum measurements from sub-GeV to several GeV energy range, greatly complementing the physics capabilities. This experiment will offer remarkable discovery potentialities, collecting a very large number of unbiased events both in the neutrino and antineutrino channels, largely adequate to definitely settle the origin of the observed neutrino-related anomalies.
△ Less
Submitted 28 September, 2012; v1 submitted 3 August, 2012;
originally announced August 2012.
-
Search for "anomalies" from neutrino and anti-neutrino oscillations at Delta_m^2 ~ 1eV^2 with muon spectrometers and large LAr-TPC imaging detectors
Authors:
M. Antonello,
D. Bagliani,
B. Baibussinov,
H. Bilokon,
F. Boffelli,
M. Bonesini,
E. Calligarich,
N. Canci,
S. Centro,
A. Cesana,
K. Cieslik,
D. B. Cline,
A. G. Cocco,
D. Dequal,
A. Dermenev,
R. Dolfini,
M. De Gerone,
S. Dussoni,
C. Farnese,
A. Fava,
A. Ferrari,
G. Fiorillo,
G. T. Garvey,
F. Gatti,
D. Gibin
, et al. (114 additional authors not shown)
Abstract:
This proposal describes an experimental search for sterile neutrinos beyond the Standard Model with a new CERN-SPS neutrino beam. The experiment is based on two identical LAr-TPC's followed by magnetized spectrometers, observing the electron and muon neutrino events at 1600 and 300 m from the proton target. This project will exploit the ICARUS T600, moved from LNGS to the CERN "Far" position. An a…
▽ More
This proposal describes an experimental search for sterile neutrinos beyond the Standard Model with a new CERN-SPS neutrino beam. The experiment is based on two identical LAr-TPC's followed by magnetized spectrometers, observing the electron and muon neutrino events at 1600 and 300 m from the proton target. This project will exploit the ICARUS T600, moved from LNGS to the CERN "Far" position. An additional 1/4 of the T600 detector will be constructed and located in the "Near" position. Two spectrometers will be placed downstream of the two LAr-TPC detectors to greatly complement the physics capabilities. Spectrometers will exploit a classical dipole magnetic field with iron slabs, and a new concept air-magnet, to perform charge identification and muon momentum measurements in a wide energy range over a large transverse area. In the two positions, the radial and energy spectra of the nu_e beam are practically identical. Comparing the two detectors, in absence of oscillations, all cross sections and experimental biases cancel out, and the two experimentally observed event distributions must be identical. Any difference of the event distributions at the locations of the two detectors might be attributed to the possible existence of ν-oscillations, presumably due to additional neutrinos with a mixing angle sin^2(2theta_new) and a larger mass difference Delta_m^2_new. The superior quality of the LAr imaging TPC, in particular its unique electron-pi_zero discrimination allows full rejection of backgrounds and offers a lossless nu_e detection capability. The determination of the muon charge with the spectrometers allows the full separation of nu_mu from anti-nu_mu and therefore controlling systematics from muon mis-identification largely at high momenta.
△ Less
Submitted 29 March, 2012; v1 submitted 15 March, 2012;
originally announced March 2012.