-
Findings of the WMT 2024 Shared Task on Chat Translation
Authors:
Wafaa Mohammed,
Sweta Agrawal,
M. Amin Farajian,
Vera Cabarrão,
Bryan Eikema,
Ana C. Farinha,
José G. C. de Souza
Abstract:
This paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language…
▽ More
This paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language pairs from previous editions: English-German, English-French, and English-Brazilian Portuguese. We received 22 primary submissions and 32 contrastive submissions from eight teams, with each language pair having participation from at least three teams. We evaluated the systems comprehensively using both automatic metrics and human judgments via a direct assessment framework. The official rankings for each language pair were determined based on human evaluation scores, considering performance in both translation directions--agent and customer. Our analysis shows that while the systems excelled at translating individual turns, there is room for improvement in overall conversation-level translation quality.
△ Less
Submitted 15 October, 2024;
originally announced October 2024.
-
Do we need more complex representations for structure? A comparison of note duration representation for Music Transformers
Authors:
Gabriel Souza,
Flavio Figueiredo,
Alexei Machado,
Deborah Guimarães
Abstract:
In recent years, deep learning has achieved formidable results in creative computing. When it comes to music, one viable model for music generation are Transformer based models. However, while transformers models are popular for music generation, they often rely on annotated structural information. In this work, we inquire if the off-the-shelf Music Transformer models perform just as well on struc…
▽ More
In recent years, deep learning has achieved formidable results in creative computing. When it comes to music, one viable model for music generation are Transformer based models. However, while transformers models are popular for music generation, they often rely on annotated structural information. In this work, we inquire if the off-the-shelf Music Transformer models perform just as well on structural similarity metrics using only unannotated MIDI information. We show that a slight tweak to the most common representation yields small but significant improvements. We also advocate that searching for better unannotated musical representations is more cost-effective than producing large amounts of curated and annotated data.
△ Less
Submitted 14 October, 2024;
originally announced October 2024.
-
Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation
Authors:
Sweta Agrawal,
José G. C. de Souza,
Ricardo Rei,
António Farinhas,
Gonçalo Faria,
Patrick Fernandes,
Nuno M Guerreiro,
Andre Martins
Abstract:
Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved quality. However, preference data based on human feedback can be very expensive to obtain and curate at a large scale. Automatic metrics, on the othe…
▽ More
Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved quality. However, preference data based on human feedback can be very expensive to obtain and curate at a large scale. Automatic metrics, on the other hand, can induce preferences, but they might not match human expectations perfectly. In this paper, we propose an approach that leverages the best of both worlds. We first collect sentence-level quality assessments from professional linguists on translations generated by multiple high-quality MT systems and evaluate the ability of current automatic metrics to recover these preferences. We then use this analysis to curate a new dataset, MT-Pref (metric induced translation preference) dataset, which comprises 18k instances covering 18 language directions, using texts sourced from multiple domains post-2022. We show that aligning TOWER models on MT-Pref significantly improves translation quality on WMT23 and FLORES benchmarks.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Optical memory in a MoSe$_2$/Clinochlore device
Authors:
Alessandra Ames,
Frederico B. Sousa,
Gabriel A. D. Souza,
Raphaela de Oliveira,
Igor R. F. Silva,
Gabriel L. Rodrigues,
Kenji Watanabe,
Takashi Taniguchi,
Gilmar E. Marques,
Ingrid D. Barcelos,
Alisson R. Cadore,
Victor López-Richard,
Marcio D. Teodoro
Abstract:
Two-dimensional heterostructures have been crucial in advancing optoelectronic devices utilizing van der Waals materials. Semiconducting transition metal dichalcogenide monolayers, known for their unique optical properties, offer extensive possibilities for light-emitting devices. Recently, a memory-driven optical device, termed a Mem-emitter, was proposed using these monolayers atop dielectric su…
▽ More
Two-dimensional heterostructures have been crucial in advancing optoelectronic devices utilizing van der Waals materials. Semiconducting transition metal dichalcogenide monolayers, known for their unique optical properties, offer extensive possibilities for light-emitting devices. Recently, a memory-driven optical device, termed a Mem-emitter, was proposed using these monolayers atop dielectric substrates. The successful realization of such devices heavily depends on selecting the optimal substrate. Here, we report a pronounced memory effect in a MoSe$_2$/clinochlore device, evidenced by electric hysteresis in the intensity and energy of MoSe$_2$ monolayer emissions. This demonstrates both population-driven and transition-rate-driven Mem-emitter abilities. Our theoretical approach correlates these memory effects with internal state variables of the substrate, emphasizing that clinochlore layered structure is crucial for a robust and rich memory response. This work introduces a novel two-dimensional device with promising applications in memory functionalities, highlighting the importance of alternative insulators in fabricating van der Waals heterostructures.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
EuroLLM: Multilingual Language Models for Europe
Authors:
Pedro Henrique Martins,
Patrick Fernandes,
João Alves,
Nuno M. Guerreiro,
Ricardo Rei,
Duarte M. Alves,
José Pombal,
Amin Farajian,
Manuel Faysse,
Mateusz Klimaszewski,
Pierre Colombo,
Barry Haddow,
José G. C. de Souza,
Alexandra Birch,
André F. T. Martins
Abstract:
The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding and generating text in all official European Union languages, as well as several additional relevant languages. We outline the progress made to date,…
▽ More
The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding and generating text in all official European Union languages, as well as several additional relevant languages. We outline the progress made to date, detailing our data collection and filtering process, the development of scaling laws, the creation of our multilingual tokenizer, and the data mix and modeling configurations. Additionally, we release our initial models: EuroLLM-1.7B and EuroLLM-1.7B-Instruct and report their performance on multilingual general benchmarks and machine translation.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques
Authors:
Davide Clode da Silva,
Marina Musse Bernardes,
Nathalia Giacomini Ceretta,
Gabriel Vaz de Souza,
Gabriel Fonseca Silva,
Rafael Heitor Bordini,
Soraia Raupp Musse
Abstract:
Machine learning has significantly advanced healthcare by aiding in disease prevention and treatment identification. However, accessing patient data can be challenging due to privacy concerns and strict regulations. Generating synthetic, realistic data offers a potential solution for overcoming these limitations, and recent studies suggest that fine-tuning foundation models can produce such data e…
▽ More
Machine learning has significantly advanced healthcare by aiding in disease prevention and treatment identification. However, accessing patient data can be challenging due to privacy concerns and strict regulations. Generating synthetic, realistic data offers a potential solution for overcoming these limitations, and recent studies suggest that fine-tuning foundation models can produce such data effectively. In this study, we explore the potential of foundation models for generating realistic medical images, particularly chest x-rays, and assess how their performance improves with fine-tuning. We propose using a Latent Diffusion Model, starting with a pre-trained foundation model and refining it through various configurations. Additionally, we performed experiments with input from a medical professional to assess the realism of the images produced by each trained model.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
On groups with at most five irrational conjugacy classes
Authors:
Gabriel de Arêa Leão Souza
Abstract:
G. Navarro and P. H. Tiep, among others, have studied groups with few rational conjugacy classes or few rational irreducible characters. In this paper we look at the opposite extreme. Let $G$ be a finite group. Given a conjugacy class $K$ of $G$, we say it is irrational if there is some $χ\in \operatorname{Irr}(G)$ such that $χ(K) \not \in \mathbb{Q}$. One of our main results shows that, when $G$…
▽ More
G. Navarro and P. H. Tiep, among others, have studied groups with few rational conjugacy classes or few rational irreducible characters. In this paper we look at the opposite extreme. Let $G$ be a finite group. Given a conjugacy class $K$ of $G$, we say it is irrational if there is some $χ\in \operatorname{Irr}(G)$ such that $χ(K) \not \in \mathbb{Q}$. One of our main results shows that, when $G$ contains at most $5$ irrational conjugacy classes, then $|\operatorname{Irr}_{\mathbb{Q}} (G)| = | \operatorname{cl}_{\mathbb{Q}} (G)|$. This suggests some duality with the known results and open questions on groups with few rational irreducible characters.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
Orbital motion of NGC 6166 (3C 338) and its impact on the jet morphology at kiloparsec scales
Authors:
A. S. R. Antas,
A. Caproni,
R. E. G. Machado,
T. F. Laganá,
G. S. Souza
Abstract:
In the central region of the galaxy cluster Abell 2199 (A2199) resides the cD galaxy NGC 6166, which spatially coincides with the 3C 338 radio source. Lobes, jets, and a more detached southern structure (similar to a jet labelled as ridge) are seen at kiloparsec-scale images of 3C 338. This unusual radio morphology has led to the proposition of different hypotheses about its physical origin in the…
▽ More
In the central region of the galaxy cluster Abell 2199 (A2199) resides the cD galaxy NGC 6166, which spatially coincides with the 3C 338 radio source. Lobes, jets, and a more detached southern structure (similar to a jet labelled as ridge) are seen at kiloparsec-scale images of 3C 338. This unusual radio morphology has led to the proposition of different hypotheses about its physical origin in the literature. In this work, we study the feasibility of a dynamical scenario where NGC 6166 moves around the X-ray inferred centre of A2199 from the point of view of three-dimensional hydrodynamic simulations. The physical characteristics of the intra-cluster medium in which the jet propagates are constrained to those derived from X-ray observations in the vicinity of NGC 6166. Possible orbits for the jet inlet region are derived from the estimated radial velocity of NGC 6166, while the jet parameters are constrained by parsec-scale interferometric radio observations and the estimated jet power of 3C 338 obtained from radio and X-ray data. Our results show that the hypothesis of NGC 6166 has been moving around the centre of A2199 during the last tens of million of years is compatible with the general radio morphology of 3C 338. Furthermore, the proposed dynamic scenario for the motion of NGC 6166 may be linked to gravitational perturbations induced by the passage of a sub-cluster of galaxies hundreds of millions of years ago.
△ Less
Submitted 27 September, 2024; v1 submitted 21 August, 2024;
originally announced August 2024.
-
The Fourth S-PLUS Data Release: 12-filter photometry covering $\sim3000$ square degrees in the southern hemisphere
Authors:
Fabio R. Herpich,
Felipe Almeida-Fernandes,
Gustavo B. Oliveira Schwarz,
Erik V. R. Lima,
Lilianne Nakazono,
Javier Alonso-García,
Marcos A. Fonseca-Faria,
Marilia J. Sartori,
Guilherme F. Bolutavicius,
Gabriel Fabiano de Souza,
Eduardo A. Hartmann,
Liana Li,
Luna Espinosa,
Antonio Kanaan,
William Schoenell,
Ariel Werle,
Eduardo Machado-Pereira,
Luis A. Gutiérrez-Soto,
Thaís Santos-Silva,
Analia V. Smith Castelli,
Eduardo A. D. Lacerda,
Cassio L. Barbosa,
Hélio D. Perottoni,
Carlos E. Ferreira Lopes,
Raquel Ruiz Valença
, et al. (46 additional authors not shown)
Abstract:
The Southern Photometric Local Universe Survey (S-PLUS) is a project to map $\sim9300$ sq deg of the sky using twelve bands (seven narrow and five broadbands). Observations are performed with the T80-South telescope, a robotic telescope located at the Cerro Tololo Observatory in Chile. The survey footprint consists of several large contiguous areas, including fields at high and low galactic latitu…
▽ More
The Southern Photometric Local Universe Survey (S-PLUS) is a project to map $\sim9300$ sq deg of the sky using twelve bands (seven narrow and five broadbands). Observations are performed with the T80-South telescope, a robotic telescope located at the Cerro Tololo Observatory in Chile. The survey footprint consists of several large contiguous areas, including fields at high and low galactic latitudes, and towards the Magellanic Clouds. S-PLUS uses fixed exposure times to reach point source depths of about $21$ mag in the $griz$ and $20$ mag in the $u$ and the narrow filters. This paper describes the S-PLUS Data Release 4 (DR4), which includes calibrated images and derived catalogues for over 3000 sq deg, covering the aforementioned area. The catalogues provide multi-band photometry performed with the tools \texttt{DoPHOT} and \texttt{SExtractor} -- point spread function (\PSF) and aperture photometry, respectively. In addition to the characterization, we also present the scientific potential of the data. We use statistical tools to present and compare the photometry obtained through different methods. Overall we find good agreement between the different methods, with a slight systematic offset of 0.05\,mag between our \PSF and aperture photometry. We show that the astrometry accuracy is equivalent to that obtained in previous S-PLUS data releases, even in very crowded fields where photometric extraction is challenging. The depths of main survey (MS) photometry for a minimum signal-to-noise ratio $S/N = 3$ reach from $\sim19.5$ for the bluer bands to $\sim21.5$ mag on the red. The range of magnitudes over which accurate \PSF photometry is obtained is shallower, reaching $\sim19$ to $\sim20.5$ mag depending on the filter. Based on these photometric data, we provide star-galaxy-quasar classification and photometric redshift for millions of objects.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
A Comprehensive Review and Taxonomy of Audio-Visual Synchronization Techniques for Realistic Speech Animation
Authors:
Jose Geraldo Fernandes,
Sinval Nascimento,
Daniel Dominguete,
André Oliveira,
Lucas Rotsen,
Gabriel Souza,
David Brochero,
Luiz Facury,
Mateus Vilela,
Hebert Costa,
Frederico Coelho,
Antônio P. Braga
Abstract:
In many applications, synchronizing audio with visuals is crucial, such as in creating graphic animations for films or games, translating movie audio into different languages, and developing metaverse applications. This review explores various methodologies for achieving realistic facial animations from audio inputs, highlighting generative and adaptive models. Addressing challenges like model tra…
▽ More
In many applications, synchronizing audio with visuals is crucial, such as in creating graphic animations for films or games, translating movie audio into different languages, and developing metaverse applications. This review explores various methodologies for achieving realistic facial animations from audio inputs, highlighting generative and adaptive models. Addressing challenges like model training costs, dataset availability, and silent moment distributions in audio data, it presents innovative solutions to enhance performance and realism. The research also introduces a new taxonomy to categorize audio-visual synchronization methods based on logistical aspects, advancing the capabilities of virtual assistants, gaming, and interactive digital media.
△ Less
Submitted 28 August, 2024; v1 submitted 24 July, 2024;
originally announced July 2024.
-
Low latency carbon budget analysis reveals a large decline of the land carbon sink in 2023
Authors:
Piyu Ke,
Philippe Ciais,
Stephen Sitch,
Wei Li,
Ana Bastos,
Zhu Liu,
Yidi Xu,
Xiaofan Gui,
Jiang Bian,
Daniel S Goll,
Yi Xi,
Wanjing Li,
Michael O'Sullivan,
Jeffeson Goncalves de Souza,
Pierre Friedlingstein,
Frederic Chevallier
Abstract:
In 2023, the CO2 growth rate was 3.37 +/- 0.11 ppm at Mauna Loa, 86% above the previous year, and hitting a record high since observations began in 1958, while global fossil fuel CO2 emissions only increased by 0.6 +/- 0.5%. This implies an unprecedented weakening of land and ocean sinks, and raises the question of where and why this reduction happened. Here we show a global net land CO2 sink of 0…
▽ More
In 2023, the CO2 growth rate was 3.37 +/- 0.11 ppm at Mauna Loa, 86% above the previous year, and hitting a record high since observations began in 1958, while global fossil fuel CO2 emissions only increased by 0.6 +/- 0.5%. This implies an unprecedented weakening of land and ocean sinks, and raises the question of where and why this reduction happened. Here we show a global net land CO2 sink of 0.44 +/- 0.21 GtC yr-1, the weakest since 2003. We used dynamic global vegetation models, satellites fire emissions, an atmospheric inversion based on OCO-2 measurements, and emulators of ocean biogeochemical and data driven models to deliver a fast-track carbon budget in 2023. Those models ensured consistency with previous carbon budgets. Regional flux anomalies from 2015-2022 are consistent between top-down and bottom-up approaches, with the largest abnormal carbon loss in the Amazon during the drought in the second half of 2023 (0.31 +/- 0.19 GtC yr-1), extreme fire emissions of 0.58 +/- 0.10 GtC yr-1 in Canada and a loss in South-East Asia (0.13 +/- 0.12 GtC yr-1). Since 2015, land CO2 uptake north of 20 degree N declined by half to 1.13 +/- 0.24 GtC yr-1 in 2023. Meanwhile, the tropics recovered from the 2015-16 El Nino carbon loss, gained carbon during the La Nina years (2020-2023), then switched to a carbon loss during the 2023 El Nino (0.56 +/- 0.23 GtC yr-1). The ocean sink was stronger than normal in the equatorial eastern Pacific due to reduced upwelling from La Nina's retreat in early 2023 and the development of El Nino later. Land regions exposed to extreme heat in 2023 contributed a gross carbon loss of 1.73 GtC yr-1, indicating that record warming in 2023 had a strong negative impact on the capacity of terrestrial ecosystems to mitigate climate change.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
First results on monolithic CMOS detector with internal gain
Authors:
U. Follo,
G. Gioachin,
C. Ferrero,
M. Mandurrino,
M. Bregant,
S. Bufalino,
F. Carnesecchi,
D. Cavazza,
M. Colocci,
T. Corradino,
M. Da Rocha Rolo,
G. Di Nicolantonio,
S. Durando,
G. Margutti,
M. Mignone,
R. Nania,
L. Pancheri,
A. Rivetti,
B. Sabiu,
G. G. A. de Souza,
S. Strazzi,
R. Wheadon
Abstract:
In this paper we report on a set of characterisations carried out on the first monolithic LGAD prototype integrated in a customised 110 nm CMOS process having a depleted active volume thickness of 48 $μ$m. This prototype is formed by a pixel array where each pixel has a total size of 100 $μ$m $\times$ 250 $μ$m and includes a high-speed front-end amplifier. After describing the sensor and the elect…
▽ More
In this paper we report on a set of characterisations carried out on the first monolithic LGAD prototype integrated in a customised 110 nm CMOS process having a depleted active volume thickness of 48 $μ$m. This prototype is formed by a pixel array where each pixel has a total size of 100 $μ$m $\times$ 250 $μ$m and includes a high-speed front-end amplifier. After describing the sensor and the electronics architecture, both laboratory and in-beam measurements are reported and described. Optical characterisations performed with an IR pulsed laser setup have shown a sensor internal gain of about 2.5. With the same experimental setup, the electronic jitter was found to be between 50 ps and 150 ps, depending on the signal amplitude. Moreover, the analysis of a test beam performed at the Proton Synchrotron (PS) T10 facility of CERN with 10 GeV/c protons and pions indicated that the overall detector time resolution is in the range of 234 ps to 244 ps. Further TCAD investigations, based on the doping profile extracted from $C(V)$ measurements, confirmed the multiplication gain measured on the test devices. Finally, TCAD simulations were used to tune the future doping concentration of the gain layer implant, targeting sensors with a higher avalanche gain. This adjustment is expected to enhance the timing performance of the sensors of the future productions, in order to cope with the high event rate expected in most of the near future high-energy and high-luminosity physics experiments, where the time resolution will be essential to disentangle overlapping events and it will also be crucial for Particle IDentification (PID).
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation
Authors:
Gonçalo R. A. Faria,
Sweta Agrawal,
António Farinhas,
Ricardo Rei,
José G. C. de Souza,
André F. T. Martins
Abstract:
An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware an…
▽ More
An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware and minimum Bayes risk decoding). However, relying on a single translation with high estimated quality increases the chances of "gaming the metric''. In this paper, we address the problem of sampling a set of high-quality and diverse translations. We provide a simple and effective way to avoid over-reliance on noisy quality estimates by using them as the energy function of a Gibbs distribution. Instead of looking for a mode in the distribution, we generate multiple samples from high-density areas through the Metropolis-Hastings algorithm, a simple Markov chain Monte Carlo approach. The results show that our proposed method leads to high-quality and diverse outputs across multiple language pairs (English$\leftrightarrow${German, Russian}) with two strong decoder-only LLMs (Alma-7b, Tower-7b).
△ Less
Submitted 15 October, 2024; v1 submitted 28 May, 2024;
originally announced June 2024.
-
High-order parallel-in-time method for the monodomain equation in cardiac electrophysiology
Authors:
Giacomo Rosilho de Souza,
Simone Pezzuto,
Rolf Krause
Abstract:
Simulation of the monodomain equation, crucial for modeling the heart's electrical activity, faces scalability limits when traditional numerical methods only parallelize in space. To optimize the use of large multi-processor computers by distributing the computational load more effectively, time parallelization is essential. We introduce a high-order parallel-in-time method addressing the substant…
▽ More
Simulation of the monodomain equation, crucial for modeling the heart's electrical activity, faces scalability limits when traditional numerical methods only parallelize in space. To optimize the use of large multi-processor computers by distributing the computational load more effectively, time parallelization is essential. We introduce a high-order parallel-in-time method addressing the substantial computational challenges posed by the stiff, multiscale, and nonlinear nature of cardiac dynamics. Our method combines the semi-implicit and exponential spectral deferred correction methods, yielding a hybrid method that is extended to parallel-in-time employing the PFASST framework. We thoroughly evaluate the stability, accuracy, and robustness of the proposed parallel-in-time method through extensive numerical experiments, using practical ionic models such as the ten-Tusscher-Panfilov. The results underscore the method's potential to significantly enhance real-time and high-fidelity simulations in biomedical research and clinical applications.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Optimization of GEM detectors for applications in X-ray fluorescence imaging
Authors:
Geovane G. A. de Souza,
Hugo Natal da Luz,
Marco Bregant
Abstract:
In this work a set of simulations that aim at the optimization of gaseous detectors for applications in X-ray fluorescence imaging in the energy range of 3 -- 30keV is presented. By studying the statistical distribution of the radiation interactions with gases, the energy resolution limits after charge multiplication for 6keV X-ray photons in Ar/CO$_2$(70/30) and Kr/CO$_2$(90/10) were calculated,…
▽ More
In this work a set of simulations that aim at the optimization of gaseous detectors for applications in X-ray fluorescence imaging in the energy range of 3 -- 30keV is presented. By studying the statistical distribution of the radiation interactions with gases, the energy resolution limits after charge multiplication for 6keV X-ray photons in Ar/CO$_2$(70/30) and Kr/CO$_2$(90/10) were calculated, obtaining energy resolutions of 15.4(4)% and 14.6(2)% respectively. The detector design was also studied to reduce the presence of escape peaks and complement a model to evaluate the inevitable X-ray fluorescence of copper generated by the conductive materials inside the detector.
△ Less
Submitted 30 September, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
CP HDR: A feature point detection and description library for LDR and HDR images
Authors:
Artur Santos Nascimento,
Valter Guilherme Silva de Souza,
Daniel Oliveira Dantas,
Beatriz Trinchão Andrade
Abstract:
In computer vision, characteristics refer to image regions with unique properties, such as corners, edges, textures, or areas with high contrast. These regions can be represented through feature points (FPs). FP detection and description are fundamental steps to many computer vision tasks. Most FP detection and description methods use low dynamic range (LDR) images, sufficient for most application…
▽ More
In computer vision, characteristics refer to image regions with unique properties, such as corners, edges, textures, or areas with high contrast. These regions can be represented through feature points (FPs). FP detection and description are fundamental steps to many computer vision tasks. Most FP detection and description methods use low dynamic range (LDR) images, sufficient for most applications involving digital images. However, LDR images may have saturated pixels in scenes with extreme light conditions, which degrade FP detection. On the other hand, high dynamic range (HDR) images usually present a greater dynamic range but FP detection algorithms do not take advantage of all the information in such images. In this study, we present a systematic review of image detection and description algorithms that use HDR images as input. We developed a library called CP_HDR that implements the Harris corner detector, SIFT detector and descriptor, and two modifications of those algorithms specialized in HDR images, called SIFT for HDR (SfHDR) and Harris for HDR (HfHDR). Previous studies investigated the use of HDR images in FP detection, but we did not find studies investigating the use of HDR images in FP description. Using uniformity, repeatability rate, mean average precision, and matching rate metrics, we compared the performance of the CP_HDR algorithms using LDR and HDR images. We observed an increase in the uniformity of the distribution of FPs among the high-light, mid-light, and low-light areas of the images. The results show that using HDR images as input to detection algorithms improves performance and that SfHDR and HfHDR enhance FP description.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
On the set of supercyclic operators
Authors:
Thiago R. Alves,
Gustavo C. Souza
Abstract:
In this article, we address a problem posed by F. Bayart regarding the existence of an infinite-dimensional closed vector subspace (excluding the null operator) within the set of supercyclic operators on Banach spaces. We resolve this problem by establishing the existence of the closed subspace. Furthermore, we prove that the set of supercyclic operators on $\ell_1$ contains, up to the null operat…
▽ More
In this article, we address a problem posed by F. Bayart regarding the existence of an infinite-dimensional closed vector subspace (excluding the null operator) within the set of supercyclic operators on Banach spaces. We resolve this problem by establishing the existence of the closed subspace. Furthermore, we prove that the set of supercyclic operators on $\ell_1$ contains, up to the null operator, an isometric copy of $\ell_1$.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Authors:
Duarte M. Alves,
José Pombal,
Nuno M. Guerreiro,
Pedro H. Martins,
João Alves,
Amin Farajian,
Ben Peters,
Ricardo Rei,
Patrick Fernandes,
Sweta Agrawal,
Pierre Colombo,
José G. C. de Souza,
André F. T. Martins
Abstract:
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa…
▽ More
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final model surpasses open alternatives on several tasks relevant to translation workflows and is competitive with general-purpose closed LLMs. To facilitate future research, we release the Tower models, our specialization dataset, an evaluation framework for LLMs focusing on the translation ecosystem, and a collection of model generations, including ours, on our benchmark.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
All dielectric integrable optical isolators
Authors:
Sevag Abadian,
Getulio Souza,
Stanislav Winkler,
Marian Bogdan Sirbu,
Michail Symeonidis,
Tolga Tekin
Abstract:
On-chip optical isolators, functioning as unidirectional gates for light, play a crucial role in maintaining signal integrity, preventing laser destabilization, and fortifying the overall performance of optical systems. In this paper, we propose a five-layered heterostructure consisting of a magneto-optic material sandwiched between parallel dielectric slab waveguides. Under TMOKE configuration, t…
▽ More
On-chip optical isolators, functioning as unidirectional gates for light, play a crucial role in maintaining signal integrity, preventing laser destabilization, and fortifying the overall performance of optical systems. In this paper, we propose a five-layered heterostructure consisting of a magneto-optic material sandwiched between parallel dielectric slab waveguides. Under TMOKE configuration, the coupled optical modes undergo an electromagnetic profile transformation that can be harnessed to confine the input mode in one waveguide during forward propagation and in the other during backward propagation. Together with radiative subwavelength gratings, such a system can provide a 20dB isolation ratio with negligible insertion losses.
△ Less
Submitted 1 March, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Multilevel lattice codes from Hurwitz quaternion integers
Authors:
Juliana G. F. Souza,
Sueli I. R. Costa,
Cong Ling
Abstract:
This work presents an extension of the Construction $π_A$ lattices proposed in \cite{huang2017construction}, to Hurwitz quaternion integers. This construction is provided by using an isomorphism from a version of the Chinese remainder theorem applied to maximal orders in contrast to natural orders in prior works. Exploiting this map, we analyze the performance of the resulting multilevel lattice c…
▽ More
This work presents an extension of the Construction $π_A$ lattices proposed in \cite{huang2017construction}, to Hurwitz quaternion integers. This construction is provided by using an isomorphism from a version of the Chinese remainder theorem applied to maximal orders in contrast to natural orders in prior works. Exploiting this map, we analyze the performance of the resulting multilevel lattice codes, highlight via computer simulations their notably reduced computational complexity provided by the multistage decoding. Moreover it is shown that this construction effectively attain the Poltyrev-limit.
△ Less
Submitted 27 February, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Explicit stabilized multirate methods for the monodomain model in cardiac electrophysiology
Authors:
Giacomo Rosilho de Souza,
Marcus J. Grote,
Simone Pezzuto,
Rolf Krause
Abstract:
Fully explicit stabilized multirate (mRKC) methods are well-suited for the numerical solution of large multiscale systems of stiff ordinary differential equations thanks to their improved stability properties. To demonstrate their efficiency for the numerical solution of stiff, multiscale, nonlinear parabolic PDE's, we apply mRKC methods to the monodomain equation from cardiac electrophysiology. I…
▽ More
Fully explicit stabilized multirate (mRKC) methods are well-suited for the numerical solution of large multiscale systems of stiff ordinary differential equations thanks to their improved stability properties. To demonstrate their efficiency for the numerical solution of stiff, multiscale, nonlinear parabolic PDE's, we apply mRKC methods to the monodomain equation from cardiac electrophysiology. In doing so, we propose an improved version, specifically tailored to the monodomain model, which leads to the explicit exponential multirate stabilized (emRKC) method. Several numerical experiments are conducted to evaluate the efficiency of both mRKC and emRKC, while taking into account different finite element meshes (structured and unstructured) and realistic ionic models. The new emRKC method typically outperforms a standard implicit-explicit baseline method for cardiac electrophysiology. Code profiling and strong scalability results further demonstrate that emRKC is faster and inherently parallel without sacrificing accuracy.
△ Less
Submitted 24 June, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
Alternative Frenkel liquid Lagrangian
Authors:
F. A. P. Alves-Júnior,
A. S. Ribeiro,
G. B. Souza,
José A. Helayël-Neto
Abstract:
Based on the Caldirola-Canai approach, we endeavor to propose a dissipative scalar field theory in Minkowski space-time. We present its free particle solutions for complex $ω^μ$ components, and we find three profiles of dispersion relations, two of them support gapped momentum states. We also present an alternative view of this model, where dissipation acts as a geometric effect, and an effective…
▽ More
Based on the Caldirola-Canai approach, we endeavor to propose a dissipative scalar field theory in Minkowski space-time. We present its free particle solutions for complex $ω^μ$ components, and we find three profiles of dispersion relations, two of them support gapped momentum states. We also present an alternative view of this model, where dissipation acts as a geometric effect, and an effective negative scalar curvature space-time emerges. Finally, we illustrate how the present model could be adapted to describe shear waves in Frenkel liquids.
△ Less
Submitted 21 November, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Ages and metallicities of stellar clusters using S-PLUS narrow-band integrated photometry: the Small Magellanic Cloud
Authors:
Gabriel Fabiano de Souza,
Pieter Westera,
Felipe Almeida-Fernandes,
Guilherme Limberg,
Bruno Dias,
José A. Hernandez-Jimenez,
Fábio R. Herpich,
Leandro O. Kerber,
Eduardo Machado-Pereira,
Hélio D. Perottoni,
Rafael Guerço,
Liana Li,
Laura Sampedro,
Antonio Kanaan,
Tiago Ribeiro,
William Schoenell,
Claudia Mendes de Oliveira
Abstract:
The Magellanic Clouds are the most massive and closest satellite galaxies of the Milky Way, with stars covering ages from a few Myr up to 13 Gyr. This makes them important for validating integrated light methods to study stellar populations and star-formation processes, which can be applied to more distant galaxies. We characterized a set of stellar clusters in the Small Magellanic Cloud (SMC), us…
▽ More
The Magellanic Clouds are the most massive and closest satellite galaxies of the Milky Way, with stars covering ages from a few Myr up to 13 Gyr. This makes them important for validating integrated light methods to study stellar populations and star-formation processes, which can be applied to more distant galaxies. We characterized a set of stellar clusters in the Small Magellanic Cloud (SMC), using the $\textit{Southern Photometric Local Universe Survey}$. This is the first age (metallicity) determination for 11 (65) clusters of this sample. Through its 7 narrow bands, centered on important spectral features, and 5 broad bands, we can retrieve detailed information about stellar populations. We obtained ages and metallicities for all stellar clusters using the Bayesian spectral energy distribution fitting code $\texttt{BAGPIPES}$. With a sample of clusters in the color range $-0.20 < r-z < +0.35$, for which our determined parameters are most reliable, we modeled the age-metallicity relation of SMC. At any given age, the metallicities of SMC clusters are lower than those of both the Gaia Sausage-Enceladus disrupted dwarf galaxy and the Milky Way. In comparison with literature values, differences are $Δ$log(age)$\approx0.31$ and $Δ$[Fe/H]$\approx0.41$, which is comparable to low-resolution spectroscopy of individual stars. Finally, we confirm a previously known gradient, with younger clusters in the center and older ones preferentially located in the outermost regions. On the other hand, we found no evidence of a significant metallicity gradient.
△ Less
Submitted 30 November, 2023; v1 submitted 23 October, 2023;
originally announced October 2023.
-
Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning
Authors:
Duarte M. Alves,
Nuno M. Guerreiro,
João Alves,
José Pombal,
Ricardo Rei,
José G. C. de Souza,
Pierre Colombo,
André F. T. Martins
Abstract:
Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capa…
▽ More
Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capabilities, due to overspecialization. In this paper, we provide a closer look at this problem. We start by showing that adapter-based finetuning with LoRA matches the performance of traditional finetuning while reducing the number of training parameters by a factor of 50. This method also outperforms few-shot prompting and eliminates the need for post-processing or in-context examples. However, we show that finetuning generally degrades few-shot performance, hindering adaptation capabilities. Finally, to obtain the best of both worlds, we propose a simple approach that incorporates few-shot examples during finetuning. Experiments on 10 language pairs show that our proposed approach recovers the original few-shot capabilities while keeping the added benefits of finetuning.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
An Empirical Study of Translation Hypothesis Ensembling with Large Language Models
Authors:
António Farinhas,
José G. C. de Souza,
André F. T. Martins
Abstract:
Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and A…
▽ More
Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and Alpaca. We provide a comprehensive study along multiple dimensions, including the method to generate hypotheses (multiple prompts, temperature-based sampling, and beam search) and the strategy to produce the final translation (instruction-based, quality-based reranking, and minimum Bayes risk (MBR) decoding). Our results show that MBR decoding is a very effective method, that translation quality can be improved using a small number of samples, and that instruction tuning has a strong impact on the relation between the diversity of the hypotheses and the sampling temperature.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task
Authors:
Ricardo Rei,
Nuno M. Guerreiro,
José Pombal,
Daan van Stigt,
Marcos Treviso,
Luisa Coheur,
José G. C. de Souza,
André F. T. Martins
Abstract:
We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks,…
▽ More
We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks, reaching state-of-the-art performance for quality estimation at word-, span- and sentence-level granularity. Compared to the previous state-of-the-art COMETKIWI-22, we show large improvements in correlation with human judgements (up to 10 Spearman points). Moreover, we surpass the second-best multilingual submission to the shared-task with up to 3.8 absolute points.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Authors:
Patrick Fernandes,
Aman Madaan,
Emmy Liu,
António Farinhas,
Pedro Henrique Martins,
Amanda Bertsch,
José G. C. de Souza,
Shuyan Zhou,
Tongshuang Wu,
Graham Neubig,
André F. T. Martins
Abstract:
Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod…
▽ More
Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving models. This survey aims to provide an overview of the recent research that has leveraged human feedback to improve natural language generation. First, we introduce an encompassing formalization of feedback, and identify and organize existing research into a taxonomy following this formalization. Next, we discuss how feedback can be described by its format and objective, and cover the two approaches proposed to use feedback (either for training or decoding): directly using the feedback or training feedback models. We also discuss existing datasets for human-feedback data collection, and concerns surrounding feedback collection. Finally, we provide an overview of the nascent field of AI feedback, which exploits large language models to make judgments based on a set of principles and minimize the need for human intervention.
△ Less
Submitted 31 May, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Human-AI Co-Creation Approach to Find Forever Chemicals Replacements
Authors:
Juliana Jansen Ferreira,
Vinícius Segura,
Joana G. R. Souza,
Gabriel D. J. Barbosa,
João Gallas,
Renato Cerqueira,
Dmitry Zubarev
Abstract:
Generative models are a powerful tool in AI for material discovery. We are designing a software framework that supports a human-AI co-creation process to accelerate finding replacements for the ``forever chemicals''-- chemicals that enable our modern lives, but are harmful to the environment and the human health. Our approach combines AI capabilities with the domain-specific tacit knowledge of sub…
▽ More
Generative models are a powerful tool in AI for material discovery. We are designing a software framework that supports a human-AI co-creation process to accelerate finding replacements for the ``forever chemicals''-- chemicals that enable our modern lives, but are harmful to the environment and the human health. Our approach combines AI capabilities with the domain-specific tacit knowledge of subject matter experts to accelerate the material discovery. Our co-creation process starts with the interaction between the subject matter experts and a generative model that can generate new molecule designs. In this position paper, we discuss our hypothesis that these subject matter experts can benefit from a more iterative interaction with the generative model, asking for smaller samples and ``guiding'' the exploration of the discovery space with their knowledge.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.
-
Impact of cross-section uncertainties on supernova neutrino spectral parameter fitting in the Deep Underground Neutrino Experiment
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1294 additional authors not shown)
Abstract:
A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics…
▽ More
A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics and astrophysics measurements. A key requirement for a correct interpretation of these measurements is a good understanding of the energy-dependent total cross section $σ(E_ν)$ for charged-current $ν_e$ absorption on argon. In the context of a simulated extraction of supernova $ν_e$ spectral parameters from a toy analysis, we investigate the impact of $σ(E_ν)$ modeling uncertainties on DUNE's supernova neutrino physics sensitivity for the first time. We find that the currently large theoretical uncertainties on $σ(E_ν)$ must be substantially reduced before the $ν_e$ flux parameters can be extracted reliably: in the absence of external constraints, a measurement of the integrated neutrino luminosity with less than 10\% bias with DUNE requires $σ(E_ν)$ to be known to about 5%. The neutrino spectral shape parameters can be known to better than 10% for a 20% uncertainty on the cross-section scale, although they will be sensitive to uncertainties on the shape of $σ(E_ν)$. A direct measurement of low-energy $ν_e$-argon scattering would be invaluable for improving the theoretical precision to the needed level.
△ Less
Submitted 7 July, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
A data acquisition and reconstruction software for SAMPA-SRS integration
Authors:
G. G. A. de Souza,
T. S. Abelha,
T. B. Saramela,
A. F. V. Cortez,
H. N. da Luz,
C. G. Penteado,
M. Bregant
Abstract:
In this work we present the latest developments in the SAMPA-SRS integration. A software was developed to improve the acquisition configuration, acquisition, and decoding of the data. The complete framework was tested using a triple GEM-based position sensitive detector for X-rays. The detector was operated in Ar/CO$_2$ (70/30) in continuous flow, at atmospheric pressure and made use of a 1 dimens…
▽ More
In this work we present the latest developments in the SAMPA-SRS integration. A software was developed to improve the acquisition configuration, acquisition, and decoding of the data. The complete framework was tested using a triple GEM-based position sensitive detector for X-rays. The detector was operated in Ar/CO$_2$ (70/30) in continuous flow, at atmospheric pressure and made use of a 1 dimension strip readout (200$μ$m wide strips at a pitch of 400$μ$m) for charge collection. With this detector a position resolution of better than 833$μ$m was obtained, with an energy resolution of 14.2% ($σ/E$) for 5.9keV.
△ Less
Submitted 8 May, 2023; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Boundary Integral Formulation of the Cell-by-Cell Model of Cardiac Electrophysiology
Authors:
Giacomo Rosilho de Souza,
Rolf Krause,
Simone Pezzuto
Abstract:
We propose a boundary element method for the accurate solution of the cell-by-cell bidomain model of electrophysiology. The cell-by-cell model, also called Extracellular-Membrane-Intracellular (EMI) model, is a system of reaction-diffusion equations describing the evolution of the electric potential within each domain: intra- and extra-cellular space and the cellular membrane. The system is parabo…
▽ More
We propose a boundary element method for the accurate solution of the cell-by-cell bidomain model of electrophysiology. The cell-by-cell model, also called Extracellular-Membrane-Intracellular (EMI) model, is a system of reaction-diffusion equations describing the evolution of the electric potential within each domain: intra- and extra-cellular space and the cellular membrane. The system is parabolic but degenerate because the time derivative is only in the membrane domain. In this work, we adopt a boundary-integral formulation for removing the degeneracy in the system and recast it to a parabolic equation on the membrane. The formulation is also numerically advantageous since the number of degrees of freedom is sensibly reduced compared to the original model. Specifically, we prove that the boundary-element discretization of the EMI model is equivalent to a system of ordinary differential equations, and we consider a time discretization based on the multirate explicit stabilized Runge-Kutta method. We numerically show that our scheme convergences exponentially in space for the single-cell case. We finally provide several numerical experiments of biological interest.
△ Less
Submitted 10 February, 2023;
originally announced February 2023.
-
Bidimensional Symplectic Maps
Authors:
Felipe G. Souza,
Gabriel C. Grime,
Iberê L. Caldas
Abstract:
Symplectic maps can provide a straightforward and accurate way to visualize and quantify the dynamics of conservative systems with two degrees of freedom. These maps can be easily iterated from the simplest computers to obtain trajectories with great accuracy. Their usage arises in many fields, including celeste mechanics, plasma physics, chemistry, and so on. In this paper we introduce two exampl…
▽ More
Symplectic maps can provide a straightforward and accurate way to visualize and quantify the dynamics of conservative systems with two degrees of freedom. These maps can be easily iterated from the simplest computers to obtain trajectories with great accuracy. Their usage arises in many fields, including celeste mechanics, plasma physics, chemistry, and so on. In this paper we introduce two examples of symplectic maps, the standard and the standard non-twist map, exploring the phase space transformation as their control parameters are varied.
△ Less
Submitted 16 January, 2023;
originally announced January 2023.
-
Highly-parallelized simulation of a pixelated LArTPC on a GPU
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1282 additional authors not shown)
Abstract:
The rapid development of general-purpose computing on graphics processing units (GPGPU) is allowing the implementation of highly-parallelized Monte Carlo simulation chains for particle physics experiments. This technique is particularly suitable for the simulation of a pixelated charge readout for time projection chambers, given the large number of channels that this technology employs. Here we pr…
▽ More
The rapid development of general-purpose computing on graphics processing units (GPGPU) is allowing the implementation of highly-parallelized Monte Carlo simulation chains for particle physics experiments. This technique is particularly suitable for the simulation of a pixelated charge readout for time projection chambers, given the large number of channels that this technology employs. Here we present the first implementation of a full microphysical simulator of a liquid argon time projection chamber (LArTPC) equipped with light readout and pixelated charge readout, developed for the DUNE Near Detector. The software is implemented with an end-to-end set of GPU-optimized algorithms. The algorithms have been written in Python and translated into CUDA kernels using Numba, a just-in-time compiler for a subset of Python and NumPy instructions. The GPU implementation achieves a speed up of four orders of magnitude compared with the equivalent CPU version. The simulation of the current induced on $10^3$ pixels takes around 1 ms on the GPU, compared with approximately 10 s on the CPU. The results of the simulation are compared against data from a pixel-readout LArTPC prototype.
△ Less
Submitted 28 February, 2023; v1 submitted 19 December, 2022;
originally announced December 2022.
-
Identification and reconstruction of low-energy electrons in the ProtoDUNE-SP detector
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1235 additional authors not shown)
Abstract:
Measurements of electrons from $ν_e$ interactions are crucial for the Deep Underground Neutrino Experiment (DUNE) neutrino oscillation program, as well as searches for physics beyond the standard model, supernova neutrino detection, and solar neutrino measurements. This article describes the selection and reconstruction of low-energy (Michel) electrons in the ProtoDUNE-SP detector. ProtoDUNE-SP is…
▽ More
Measurements of electrons from $ν_e$ interactions are crucial for the Deep Underground Neutrino Experiment (DUNE) neutrino oscillation program, as well as searches for physics beyond the standard model, supernova neutrino detection, and solar neutrino measurements. This article describes the selection and reconstruction of low-energy (Michel) electrons in the ProtoDUNE-SP detector. ProtoDUNE-SP is one of the prototypes for the DUNE far detector, built and operated at CERN as a charged particle test beam experiment. A sample of low-energy electrons produced by the decay of cosmic muons is selected with a purity of 95%. This sample is used to calibrate the low-energy electron energy scale with two techniques. An electron energy calibration based on a cosmic ray muon sample uses calibration constants derived from measured and simulated cosmic ray muon events. Another calibration technique makes use of the theoretically well-understood Michel electron energy spectrum to convert reconstructed charge to electron energy. In addition, the effects of detector response to low-energy electron energy scale and its resolution including readout electronics threshold effects are quantified. Finally, the relation between the theoretical and reconstructed low-energy electron energy spectrum is derived and the energy resolution is characterized. The low-energy electron selection presented here accounts for about 75% of the total electron deposited energy. After the addition of lost energy using a Monte Carlo simulation, the energy resolution improves from about 40% to 25% at 50~MeV. These results are used to validate the expected capabilities of the DUNE far detector to reconstruct low-energy electrons.
△ Less
Submitted 31 May, 2023; v1 submitted 2 November, 2022;
originally announced November 2022.
-
CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task
Authors:
Ricardo Rei,
Marcos Treviso,
Nuno M. Guerreiro,
Chrysoula Zerva,
Ana C. Farinha,
Christine Maroti,
José G. C. de Souza,
Taisiya Glushkova,
Duarte M. Alves,
Alon Lavie,
Luisa Coheur,
André F. T. Martins
Abstract:
We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it w…
▽ More
We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it with a word-level sequence tagger and an explanation extractor. Our results suggest that incorporating references during pretraining improves performance across several language pairs on downstream tasks, and that jointly training with sentence and word-level objectives yields a further boost. Furthermore, combining attention and gradient information proved to be the top strategy for extracting good explanations of sentence-level QE models. Overall, our submissions achieved the best results for all three tasks for almost all language pairs by a considerable margin.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
Reconstruction of interactions in the ProtoDUNE-SP detector with Pandora
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo
, et al. (1203 additional authors not shown)
Abstract:
The Pandora Software Development Kit and algorithm libraries provide pattern-recognition logic essential to the reconstruction of particle interactions in liquid argon time projection chamber detectors. Pandora is the primary event reconstruction software used at ProtoDUNE-SP, a prototype for the Deep Underground Neutrino Experiment far detector. ProtoDUNE-SP, located at CERN, is exposed to a char…
▽ More
The Pandora Software Development Kit and algorithm libraries provide pattern-recognition logic essential to the reconstruction of particle interactions in liquid argon time projection chamber detectors. Pandora is the primary event reconstruction software used at ProtoDUNE-SP, a prototype for the Deep Underground Neutrino Experiment far detector. ProtoDUNE-SP, located at CERN, is exposed to a charged-particle test beam. This paper gives an overview of the Pandora reconstruction algorithms and how they have been tailored for use at ProtoDUNE-SP. In complex events with numerous cosmic-ray and beam background particles, the simulated reconstruction and identification efficiency for triggered test-beam particles is above 80% for the majority of particle type and beam momentum combinations. Specifically, simulated 1 GeV/$c$ charged pions and protons are correctly reconstructed and identified with efficiencies of 86.1$\pm0.6$% and 84.1$\pm0.6$%, respectively. The efficiencies measured for test-beam data are shown to be within 5% of those predicted by the simulation.
△ Less
Submitted 17 July, 2023; v1 submitted 29 June, 2022;
originally announced June 2022.
-
Simulating nearly edge-on sloshing in the galaxy cluster Abell 2199
Authors:
Rubens E. G. Machado,
Tatiana F. Laganá,
Gilvan S. Souza,
Anderson Caproni,
Abraão S. R. Antas,
Elvis A. Mello-Terencio
Abstract:
Off-axis collisions between galaxy clusters may induce the phenomenon of sloshing, causing dense gas to be dragged from the cool core of a cluster, resulting in a spiral of enhanced X-ray emission. Abell 2199 displays signatures of sloshing in its core and it is possible that the orbital plane of the collision is seen nearly edge-on. We aim to evaluate whether the features of Abell 2199 can be exp…
▽ More
Off-axis collisions between galaxy clusters may induce the phenomenon of sloshing, causing dense gas to be dragged from the cool core of a cluster, resulting in a spiral of enhanced X-ray emission. Abell 2199 displays signatures of sloshing in its core and it is possible that the orbital plane of the collision is seen nearly edge-on. We aim to evaluate whether the features of Abell 2199 can be explained by a sloshing spiral seen under a large inclination angle. To address this, we perform tailored hydrodynamical $N$-body simulations of a non-frontal collision with a galaxy group of $M_{200}=1.6\times10^{13}\,{\rm M_{\odot}}$. We obtain a suitable scenario in which the group passed by the main cluster core 0.8 Gyr ago, with a pericentric separation of 292 kpc. Good agreement is obtained from the temperature maps as well as the residuals from a $β$-model fit to the simulated X-ray emission. We find that under an inclination of $i=70^{\circ}$ the simulation results remain consistent with the observations.
△ Less
Submitted 1 July, 2022; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Double-GEM based thermal neutron detector prototype
Authors:
L. A. Serra Filho,
R. Felix dos Santos,
G. G. A. de Souza,
M. M. M. Paulino,
F. A. Souza,
M. Moralles,
H. Natal da Luz,
M. Bregant,
M. G. Munhoz,
Chung-Chuan Lai,
Carina Höglund,
Per-Olof Svensson,
Linda Robinson,
Richard Hall-Wilton
Abstract:
The Helium-3 shortage and the growing interest in neutron science constitute a driving factor in developing new neutron detection technologies. In this work, we report the development of a double-GEM detector prototype that uses a $^{10}$B$_4$C layer as a neutron converter material. GEANT4 simulations were performed predicting an efficiency of 3.14(10) %, agreeing within 2.7 $σ$ with the experimen…
▽ More
The Helium-3 shortage and the growing interest in neutron science constitute a driving factor in developing new neutron detection technologies. In this work, we report the development of a double-GEM detector prototype that uses a $^{10}$B$_4$C layer as a neutron converter material. GEANT4 simulations were performed predicting an efficiency of 3.14(10) %, agreeing within 2.7 $σ$ with the experimental and analytic detection efficiencies obtained by the detector when tested in a 41.8 meV thermal neutron beam. The detector is position sensitive, equipped with a 256+256 strip readout connected to resistive chains, and achieves a spatial resolution better than 3 mm. The gain stability over time was also measured with a fluctuation of about 0.2 %h$^{-1}$ of the signal amplitude. A simple data acquisition with only 5 electronic channels is sufficient to operate this detector.
△ Less
Submitted 19 July, 2022; v1 submitted 14 May, 2022;
originally announced May 2022.
-
Quality-Aware Decoding for Neural Machine Translation
Authors:
Patrick Fernandes,
António Farinhas,
Ricardo Rei,
José G. C. de Souza,
Perez Ogayo,
Graham Neubig,
André F. T. Martins
Abstract:
Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT…
▽ More
Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT, by leveraging recent breakthroughs in reference-free and reference-based MT evaluation through various inference methods like $N$-best reranking and minimum Bayes risk decoding. We perform an extensive comparison of various possible candidate generation and ranking methods across four datasets and two model classes and find that quality-aware decoding consistently outperforms MAP-based decoding according both to state-of-the-art automatic metrics (COMET and BLEURT) and to human assessments. Our code is available at https://github.com/deep-spin/qaware-decode.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
Separation of track- and shower-like energy deposits in ProtoDUNE-SP using a convolutional neural network
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1204 additional authors not shown)
Abstract:
Liquid argon time projection chamber detector technology provides high spatial and calorimetric resolutions on the charged particles traversing liquid argon. As a result, the technology has been used in a number of recent neutrino experiments, and is the technology of choice for the Deep Underground Neutrino Experiment (DUNE). In order to perform high precision measurements of neutrinos in the det…
▽ More
Liquid argon time projection chamber detector technology provides high spatial and calorimetric resolutions on the charged particles traversing liquid argon. As a result, the technology has been used in a number of recent neutrino experiments, and is the technology of choice for the Deep Underground Neutrino Experiment (DUNE). In order to perform high precision measurements of neutrinos in the detector, final state particles need to be effectively identified, and their energy accurately reconstructed. This article proposes an algorithm based on a convolutional neural network to perform the classification of energy deposits and reconstructed particles as track-like or arising from electromagnetic cascades. Results from testing the algorithm on data from ProtoDUNE-SP, a prototype of the DUNE far detector, are presented. The network identifies track- and shower-like particles, as well as Michel electrons, with high efficiency. The performance of the algorithm is consistent between data and simulation.
△ Less
Submitted 30 June, 2022; v1 submitted 31 March, 2022;
originally announced March 2022.
-
Scintillation light detection in the 6-m drift-length ProtoDUNE Dual Phase liquid argon TPC
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1202 additional authors not shown)
Abstract:
DUNE is a dual-site experiment for long-baseline neutrino oscillation studies, neutrino astrophysics and nucleon decay searches. ProtoDUNE Dual Phase (DP) is a 6x6x6m3 liquid argon time-projection-chamber (LArTPC) that recorded cosmic-muon data at the CERN Neutrino Platform in 2019-2020 as a prototype of the DUNE Far Detector. Charged particles propagating through the LArTPC produce ionization and…
▽ More
DUNE is a dual-site experiment for long-baseline neutrino oscillation studies, neutrino astrophysics and nucleon decay searches. ProtoDUNE Dual Phase (DP) is a 6x6x6m3 liquid argon time-projection-chamber (LArTPC) that recorded cosmic-muon data at the CERN Neutrino Platform in 2019-2020 as a prototype of the DUNE Far Detector. Charged particles propagating through the LArTPC produce ionization and scintillation light. The scintillation light signal in these detectors can provide the trigger for non-beam events. In addition, it adds precise timing capabilities and improves the calorimetry measurements. In ProtoDUNE-DP, scintillation and electroluminescence light produced by cosmic muons in the LArTPC is collected by photomultiplier tubes placed up to 7 m away from the ionizing track. In this paper, the ProtoDUNE-DP photon detection system performance is evaluated with a particular focus on the different wavelength shifters, such as PEN and TPB, and the use of Xe-doped LAr, considering its future use in giant LArTPCs. The scintillation light production and propagation processes are analyzed and a comparison of simulation to data is performed, improving understanding of the liquid argon properties
△ Less
Submitted 3 June, 2022; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Application of Stabilized Explicit Runge-Kutta Methods to the Incompressible Navier-Stokes Equations by means of a Projection Method and a Differential Algebraic Approach
Authors:
Giacomo Rosilho de Souza
Abstract:
In this master thesis we have compared different second order stabilized explicit Runge-Kutta methods when applied to the incompressible Navier-Stokes equations by means of a projection method and a differential algebraic approach. We explored the stability and accuracy properties of the RKC, ROCK2 and PIROCK schemes when coupled with the projection and the differential algebraic approach. PIROCK…
▽ More
In this master thesis we have compared different second order stabilized explicit Runge-Kutta methods when applied to the incompressible Navier-Stokes equations by means of a projection method and a differential algebraic approach. We explored the stability and accuracy properties of the RKC, ROCK2 and PIROCK schemes when coupled with the projection and the differential algebraic approach. PIROCK has shown unexpected instabilities, ROCK2 resulted to be the most efficient and versatile Runge-Kutta method taken into account. The differential algebraic approach sounds computationally costly but it exhibits better accuracy and a larger stability region. These properties make it more efficient than the projection method. The theory presented in the first chapters is supported by numerical experiments.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
A Gaseous Argon-Based Near Detector to Enhance the Physics Capabilities of DUNE
Authors:
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo
, et al. (1220 additional authors not shown)
Abstract:
This document presents the concept and physics case for a magnetized gaseous argon-based detector system (ND-GAr) for the Deep Underground Neutrino Experiment (DUNE) Near Detector. This detector system is required in order for DUNE to reach its full physics potential in the measurement of CP violation and in delivering precision measurements of oscillation parameters. In addition to its critical r…
▽ More
This document presents the concept and physics case for a magnetized gaseous argon-based detector system (ND-GAr) for the Deep Underground Neutrino Experiment (DUNE) Near Detector. This detector system is required in order for DUNE to reach its full physics potential in the measurement of CP violation and in delivering precision measurements of oscillation parameters. In addition to its critical role in the long-baseline oscillation program, ND-GAr will extend the overall physics program of DUNE. The LBNF high-intensity proton beam will provide a large flux of neutrinos that is sampled by ND-GAr, enabling DUNE to discover new particles and search for new interactions and symmetries beyond those predicted in the Standard Model.
△ Less
Submitted 11 March, 2022;
originally announced March 2022.
-
Snowmass Neutrino Frontier: DUNE Physics Summary
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez
, et al. (1221 additional authors not shown)
Abstract:
The Deep Underground Neutrino Experiment (DUNE) is a next-generation long-baseline neutrino oscillation experiment with a primary physics goal of observing neutrino and antineutrino oscillation patterns to precisely measure the parameters governing long-baseline neutrino oscillation in a single experiment, and to test the three-flavor paradigm. DUNE's design has been developed by a large, internat…
▽ More
The Deep Underground Neutrino Experiment (DUNE) is a next-generation long-baseline neutrino oscillation experiment with a primary physics goal of observing neutrino and antineutrino oscillation patterns to precisely measure the parameters governing long-baseline neutrino oscillation in a single experiment, and to test the three-flavor paradigm. DUNE's design has been developed by a large, international collaboration of scientists and engineers to have unique capability to measure neutrino oscillation as a function of energy in a broadband beam, to resolve degeneracy among oscillation parameters, and to control systematic uncertainty using the exquisite imaging capability of massive LArTPC far detector modules and an argon-based near detector. DUNE's neutrino oscillation measurements will unambiguously resolve the neutrino mass ordering and provide the sensitivity to discover CP violation in neutrinos for a wide range of possible values of $δ_{CP}$. DUNE is also uniquely sensitive to electron neutrinos from a galactic supernova burst, and to a broad range of physics beyond the Standard Model (BSM), including nucleon decays. DUNE is anticipated to begin collecting physics data with Phase I, an initial experiment configuration consisting of two far detector modules and a minimal suite of near detector components, with a 1.2 MW proton beam. To realize its extensive, world-leading physics potential requires the full scope of DUNE be completed in Phase II. The three Phase II upgrades are all necessary to achieve DUNE's physics goals: (1) addition of far detector modules three and four for a total FD fiducial mass of at least 40 kt, (2) upgrade of the proton beam power from 1.2 MW to 2.4 MW, and (3) replacement of the near detector's temporary muon spectrometer with a magnetized, high-pressure gaseous argon TPC and calorimeter.
△ Less
Submitted 11 March, 2022;
originally announced March 2022.
-
Using the Energy probability distribution zeros to obtain the critical properties of the two-dimensional anisotropic Heisenberg model
Authors:
Gabriel Bruno Garcia de Souza,
Bismarck Vaz da Costa
Abstract:
In this paper we present a Monte Carlo study of the critical behavior of the easy axis anisotropic Heisenberg spin model in two dimensions. Based on the partial knowledge of the zeros of the energy probability distribution we determine with good precision the phase diagram of the model obtaining the critical temperature and exponents for several values of the anisotropy. Our results indicate that…
▽ More
In this paper we present a Monte Carlo study of the critical behavior of the easy axis anisotropic Heisenberg spin model in two dimensions. Based on the partial knowledge of the zeros of the energy probability distribution we determine with good precision the phase diagram of the model obtaining the critical temperature and exponents for several values of the anisotropy. Our results indicate that the model is in the Ising universality class for any anisotropy.
△ Less
Submitted 7 July, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Improving Image-recognition Edge Caches with a Generative Adversarial Network
Authors:
Guilherme B. Souza,
Roberto G. Pacheco,
Rodrigo S. Couto
Abstract:
Image recognition is an essential task in several mobile applications. For instance, a smartphone can process a landmark photo to gather more information about its location. If the device does not have enough computational resources available, it offloads the processing task to a cloud infrastructure. Although this approach solves resource shortages, it introduces a communication delay. Image-reco…
▽ More
Image recognition is an essential task in several mobile applications. For instance, a smartphone can process a landmark photo to gather more information about its location. If the device does not have enough computational resources available, it offloads the processing task to a cloud infrastructure. Although this approach solves resource shortages, it introduces a communication delay. Image-recognition caches on the Internet's edge can mitigate this problem. These caches run on servers close to mobile devices and stores information about previously recognized images. If the server receives a request with a photo stored in its cache, it replies to the device, avoiding cloud offloading. The main challenge for this cache is to verify if the received image matches a stored one. Furthermore, for outdoor photos, it is difficult to compare them if one was taken in the daytime and the other at nighttime. In that case, the cache might wrongly infer that they refer to different places, offloading the processing to the cloud. This work shows that a well-known generative adversarial network, called ToDayGAN, can solve this problem by generating daytime images using nighttime ones. We can thus use this translation to populate a cache with synthetic photos that can help image matching. We show that our solution reduces cloud offloading and, therefore, the application's latency.
△ Less
Submitted 11 February, 2022;
originally announced February 2022.
-
Large scalar gaps in 2D CFTs with generalized polynomials
Authors:
Renato G. F. Souza
Abstract:
We present an analytic way of writing simple crossing symmetric expressions and use them to search for unitary 4-point functions in 2D CFTs. We've applied our method for a class of functions we called generalized polynomials to achieve large gaps for operators with integer scaling dimension less or equal to 18.
We present an analytic way of writing simple crossing symmetric expressions and use them to search for unitary 4-point functions in 2D CFTs. We've applied our method for a class of functions we called generalized polynomials to achieve large gaps for operators with integer scaling dimension less or equal to 18.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
Mixed-precision explicit stabilized Runge-Kutta methods for single- and multi-scale differential equations
Authors:
Matteo Croci,
Giacomo Rosilho de Souza
Abstract:
Mixed-precision algorithms combine low- and high-precision computations in order to benefit from the performance gains of reduced-precision without sacrificing accuracy. In this work, we design mixed-precision Runge-Kutta-Chebyshev (RKC) methods, where high precision is used for accuracy, and low precision for stability. Generally speaking, RKC methods are low-order explicit schemes with a stabili…
▽ More
Mixed-precision algorithms combine low- and high-precision computations in order to benefit from the performance gains of reduced-precision without sacrificing accuracy. In this work, we design mixed-precision Runge-Kutta-Chebyshev (RKC) methods, where high precision is used for accuracy, and low precision for stability. Generally speaking, RKC methods are low-order explicit schemes with a stability domain growing quadratically with the number of function evaluations. For this reason, most of the computational effort is spent on stability rather than accuracy purposes. In this paper, we show that a naïve mixed-precision implementation of any Runge-Kutta scheme can harm the convergence order of the method and limit its accuracy, and we introduce a new class of mixed-precision RKC schemes that are instead unaffected by this limiting behaviour. We present three mixed-precision schemes: a first- and a second-order RKC method, and a first-order multirate RKC scheme for multiscale problems. These schemes perform only the few function evaluations needed for accuracy (1 or 2 for first- and second-order methods respectively) in high precision, while the rest are performed in low precision. We prove that while these methods are essentially as cheap as their fully low-precision equivalent, they retain the stability and convergence order of their high-precision counterpart. Indeed, numerical experiments confirm that these schemes are as accurate as the corresponding high-precision method.
△ Less
Submitted 6 April, 2022; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Trends on 3d Transition Metal Coordination on Monolayer MoS$_2$
Authors:
He Liu,
Walner Costa Silva,
Leonardo Santana Gonçalves de Souza,
Amanda Garcez Veiga,
Leandro Seixas,
Kazunori Fujisawa,
Ethan Kahn,
Tianyi Zhang,
Fu Zhang,
Zhuohang Yu,
Katherine Thompson,
Yu Lei,
Christiano J. S. de Matos,
Maria Luiza M. Rocco,
Mauricio Terrones,
Daniel Grasseschi
Abstract:
Two-dimensional materials (2DM) have attracted much interest due to their distinct optical, electronic, and catalytic properties. These properties can be by tuned a range of methods including substitutional doping or, as recently demonstrated, by surface functionalization with single atoms, increasing even further 2DM portfolio. Here we theoretically and experimentally describe the coordination re…
▽ More
Two-dimensional materials (2DM) have attracted much interest due to their distinct optical, electronic, and catalytic properties. These properties can be by tuned a range of methods including substitutional doping or, as recently demonstrated, by surface functionalization with single atoms, increasing even further 2DM portfolio. Here we theoretically and experimentally describe the coordination reaction between MoS$_2$ monolayers with 3d transition metals (TMs), exploring the nature and the trend of MoS$_2$-TMs interaction. Density Functional Theory calculations, X-Ray Photoelectron Spectroscopy (XPS), and Photoluminescence (PL) point to the formation of MoS$_2$-TM coordination complexes, where the adsorption energy trend for 3d TM resembles the crystal-field (CF) stabilization energy for weak-field complexes. Pearson's theory for hard-soft acid-base and Ligand-field theory were applied to discuss the periodic trends on 3d TM coordination on the MoS$_2$ surface. We found that softer acids with higher ligand field stabilization energy, such as Ni$^{2+}$, tend to form bonds with more covalent character with MoS$_2$, which can be considered a soft base. On the other hand, harder acids, such as Cr$^{3+}$, tend to form bonds with more ionic character. Additionally, we studied the trends in charge transfer and doping observed in the XPS and PL results, where metals such as Ni led to an n-type of doping, while Cu functionalization results in p-type doping. Therefore, the formation of coordination complexes on TMD's surface is demonstrated to be a promising and effective way to control and to understand the nature of the single-atom functionalization of TMD.
△ Less
Submitted 17 September, 2021;
originally announced September 2021.
-
Low exposure long-baseline neutrino oscillation sensitivity of the DUNE experiment
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
P. Amedo,
J. Anderson,
C. Andreopoulos,
M. Andreotti
, et al. (1132 additional authors not shown)
Abstract:
The Deep Underground Neutrino Experiment (DUNE) will produce world-leading neutrino oscillation measurements over the lifetime of the experiment. In this work, we explore DUNE's sensitivity to observe charge-parity violation (CPV) in the neutrino sector, and to resolve the mass ordering, for exposures of up to 100 kiloton-megawatt-years (kt-MW-yr). The analysis includes detailed uncertainties on t…
▽ More
The Deep Underground Neutrino Experiment (DUNE) will produce world-leading neutrino oscillation measurements over the lifetime of the experiment. In this work, we explore DUNE's sensitivity to observe charge-parity violation (CPV) in the neutrino sector, and to resolve the mass ordering, for exposures of up to 100 kiloton-megawatt-years (kt-MW-yr). The analysis includes detailed uncertainties on the flux prediction, the neutrino interaction model, and detector effects. We demonstrate that DUNE will be able to unambiguously resolve the neutrino mass ordering at a 3$σ$ (5$σ$) level, with a 66 (100) kt-MW-yr far detector exposure, and has the ability to make strong statements at significantly shorter exposures depending on the true value of other oscillation parameters. We also show that DUNE has the potential to make a robust measurement of CPV at a 3$σ$ level with a 100 kt-MW-yr exposure for the maximally CP-violating values $δ_{\rm CP}} = \pmπ/2$. Additionally, the dependence of DUNE's sensitivity on the exposure taken in neutrino-enhanced and antineutrino-enhanced running is discussed. An equal fraction of exposure taken in each beam mode is found to be close to optimal when considered over the entire space of interest.
△ Less
Submitted 3 September, 2021;
originally announced September 2021.