-
A Large Dataset of Spontaneous Speech with the Accent Spoken in São Paulo for Automatic Speech Recognition Evaluation
Authors:
Rodrigo Lima,
Sidney Evaldo Leal,
Arnaldo Candido Junior,
Sandra Maria Aluísio
Abstract:
We present a freely available spontaneous speech corpus for the Brazilian Portuguese language and report preliminary automatic speech recognition (ASR) results, using both the Wav2Vec2-XLSR-53 and Distil-Whisper models fine-tuned and trained on our corpus. The NURC-SP Audio Corpus comprises 401 different speakers (204 females, 197 males) with a total of 239.30 hours of transcribed audio recordings…
▽ More
We present a freely available spontaneous speech corpus for the Brazilian Portuguese language and report preliminary automatic speech recognition (ASR) results, using both the Wav2Vec2-XLSR-53 and Distil-Whisper models fine-tuned and trained on our corpus. The NURC-SP Audio Corpus comprises 401 different speakers (204 females, 197 males) with a total of 239.30 hours of transcribed audio recordings. To the best of our knowledge, this is the first large Paulistano accented spontaneous speech corpus dedicated to the ASR task in Portuguese. We first present the design and development procedures of the NURC-SP Audio Corpus, and then describe four ASR experiments in detail. The experiments demonstrated promising results for the applicability of the corpus for ASR. Specifically, we fine-tuned two versions of Wav2Vec2-XLSR-53 model, trained a Distil-Whisper model using our dataset with labels determined by Whisper Large-V3 model, and fine-tuned this Distil-Whisper model with our corpus. Our best results were the Distil-Whisper fine-tuned over NURC-SP Audio Corpus with a WER of 24.22% followed by a fine-tuned versions of Wav2Vec2-XLSR-53 model with a WER of 33.73%, that is almost 10% point worse than Distil-Whisper's. To enable experiment reproducibility, we share the NURC-SP Audio Corpus dataset, pre-trained models, and training recipes in Hugging-Face and Github repositories.
△ Less
Submitted 10 September, 2024;
originally announced September 2024.
-
Correlating $0νββ$ decays and flavor observables in leptoquark models
Authors:
S. Fajfer,
L. P. S. Leal,
O. Sumensari,
R. Zukanovich Funchal
Abstract:
In this paper, we investigate minimal scalar leptoquark models that dynamically generate neutrino Majorana masses at the one-loop level and examine their implications for low-energy processes. We show that these models can produce viable neutrino masses, consistent with neutrino oscillation and cosmological data. By using leptoquark couplings fixed by neutrino data, we predict additional contribut…
▽ More
In this paper, we investigate minimal scalar leptoquark models that dynamically generate neutrino Majorana masses at the one-loop level and examine their implications for low-energy processes. We show that these models can produce viable neutrino masses, consistent with neutrino oscillation and cosmological data. By using leptoquark couplings fixed by neutrino data, we predict additional contributions to neutrinoless double-beta decays ($0νββ$), which are chirality enhanced and compete with the standard contributions from the Majorana masses. Our analysis demonstrates that these effects are sizable for leptoquark masses as large as $\mathcal{O}(300~\mathrm{TeV})$, potentially increasing or decreasing the $0νββ$ half-life, and creating an ambiguity between the normal and inverted mass ordering scenarios. Furthermore, we explore the correlation between $0νββ$ and flavor observables, such as kaon decays and $μ\to e$ conversion in nuclei, emphasizing that the latter is complementary to $0νββ$ decays.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Disentangling left and right-handed neutrino effects in $B\rightarrow K^{(*)}νν$
Authors:
L. P. S. Leal,
S. Rosauro-Alcaraz
Abstract:
The first observation of $\mathcal{B}\left(B^+\rightarrow K^+νν\right)$ by the Belle II experiment lies almost $3σ$ away from the Standard Model expectation. In this letter we study this result in the SMEFT, extended by a light right-handed neutrino. We explore the correlations between the measured decay rate and other observables, such as $\mathcal{B}\left(B\rightarrow K^*νν\right)$ and…
▽ More
The first observation of $\mathcal{B}\left(B^+\rightarrow K^+νν\right)$ by the Belle II experiment lies almost $3σ$ away from the Standard Model expectation. In this letter we study this result in the SMEFT, extended by a light right-handed neutrino. We explore the correlations between the measured decay rate and other observables, such as $\mathcal{B}\left(B\rightarrow K^*νν\right)$ and $F_L\left(B\rightarrow K^*νν\right)$, showing that they could disentangle among scenarios involving left-handed neutrinos and those with the right-handed ones. Furthermore, we find that the high-$p_T$ tails of Drell-Yan processes studied at LHC provide important constraints that help us exclude some of the scenarios consistent with the Belle II result.
△ Less
Submitted 27 August, 2024; v1 submitted 26 April, 2024;
originally announced April 2024.
-
AirPilot: Interpretable PPO-based DRL Auto-Tuned Nonlinear PID Drone Controller for Robust Autonomous Flights
Authors:
Junyang Zhang,
Cristian Emanuel Ocampo Rivera,
Kyle Tyni,
Steven Nguyen,
Ulices Santa Cruz Leal,
Yasser Shoukry
Abstract:
Navigation precision, speed and stability are crucial for safe Unmanned Aerial Vehicle (UAV) flight maneuvers and effective flight mission executions in dynamic environments. Different flight missions may have varying objectives, such as minimizing energy consumption, achieving precise positioning, or maximizing speed. A controller that can adapt to different objectives on the fly is highly valuab…
▽ More
Navigation precision, speed and stability are crucial for safe Unmanned Aerial Vehicle (UAV) flight maneuvers and effective flight mission executions in dynamic environments. Different flight missions may have varying objectives, such as minimizing energy consumption, achieving precise positioning, or maximizing speed. A controller that can adapt to different objectives on the fly is highly valuable. Proportional Integral Derivative (PID) controllers are one of the most popular and widely used control algorithms for drones and other control systems, but their linear control algorithm fails to capture the nonlinear nature of the dynamic wind conditions and complex drone system. Manually tuning the PID gains for various missions can be time-consuming and requires significant expertise. This paper aims to revolutionize drone flight control by presenting the AirPilot, a nonlinear Deep Reinforcement Learning (DRL) - enhanced Proportional Integral Derivative (PID) drone controller using Proximal Policy Optimization (PPO). AirPilot controller combines the simplicity and effectiveness of traditional PID control with the adaptability, learning capability, and optimization potential of DRL. This makes it better suited for modern drone applications where the environment is dynamic, and mission-specific performance demands are high. We employed a COEX Clover autonomous drone for training the DRL agent within the simulator and implemented it in a real-world lab setting, which marks a significant milestone as one of the first attempts to apply a DRL-based flight controller on an actual drone. Airpilot is capable of reducing the navigation error of the default PX4 PID position controller by 90%, improving effective navigation speed of a fine-tuned PID controller by 21%, reducing settling time and overshoot by 17% and 16% respectively.
△ Less
Submitted 31 August, 2024; v1 submitted 29 March, 2024;
originally announced April 2024.
-
Certified Vision-based State Estimation for Autonomous Landing Systems using Reachability Analysis
Authors:
Ulices Santa Cruz Leal,
Yasser Shoukry
Abstract:
This paper studies the problem of designing a certified vision-based state estimator for autonomous landing systems. In such a system, a neural network (NN) processes images from a camera to estimate the aircraft relative position with respect to the runway. We propose an algorithm to design such NNs with certified properties in terms of their ability to detect runways and provide accurate state e…
▽ More
This paper studies the problem of designing a certified vision-based state estimator for autonomous landing systems. In such a system, a neural network (NN) processes images from a camera to estimate the aircraft relative position with respect to the runway. We propose an algorithm to design such NNs with certified properties in terms of their ability to detect runways and provide accurate state estimation. At the heart of our approach is the use of geometric models of perspective cameras to obtain a mathematical model that captures the relation between the aircraft states and the inputs. We show that such geometric models enjoy mixed monotonicity properties that can be used to design state estimators with certifiable error bounds. We show the effectiveness of the proposed approach using an experimental testbed on data collected from event-based cameras.
△ Less
Submitted 10 September, 2023;
originally announced September 2023.
-
New limits on $W_R$ from meson decays
Authors:
Gustavo F. S. Alves,
Chee Sheng Fong,
Luighi P. S. Leal,
Renata Zukanovich Funchal
Abstract:
In this letter we show that pseudoscalar meson leptonic decay data can be used to set stringent limits on the mass $m_{W_R}$ of a right-handed vector boson, such as the one that appears in left-right symmetric models. We have shown that for a heavy neutrino with a mass $m_N$ in the range $50<m_N/{\rm MeV} <1900$ one can constraint $m_{W_R} \lesssim (4-19)$ TeV at 90 % CL. This provides the most st…
▽ More
In this letter we show that pseudoscalar meson leptonic decay data can be used to set stringent limits on the mass $m_{W_R}$ of a right-handed vector boson, such as the one that appears in left-right symmetric models. We have shown that for a heavy neutrino with a mass $m_N$ in the range $50<m_N/{\rm MeV} <1900$ one can constraint $m_{W_R} \lesssim (4-19)$ TeV at 90 % CL. This provides the most stringent experimental limits on the $W_R$ mass to date.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Exploring the Neutrino Sector of the Minimal Left-Right Symmetric Model
Authors:
Gustavo F. S. Alves,
Chee Sheng Fong,
Luighi P. S. Leal,
Renata Zukanovich Funchal
Abstract:
We explore the neutrino sector of the minimal left-right symmetric model, with the additional charge conjugation discrete symmetry, in the tuned regime where type-I and type-II seesaw mechanisms are equally responsible for the light neutrino masses. We show that unless the charged lepton mixing matrix is the identity and the right handed neutrino mass matrix has no phases, we expect sizable lepton…
▽ More
We explore the neutrino sector of the minimal left-right symmetric model, with the additional charge conjugation discrete symmetry, in the tuned regime where type-I and type-II seesaw mechanisms are equally responsible for the light neutrino masses. We show that unless the charged lepton mixing matrix is the identity and the right handed neutrino mass matrix has no phases, we expect sizable lepton flavor violation and electron dipole moment in this region. We use results from recent neutrino oscillation fits, bounds on neutrinoless double beta decay, $μ\to e γ$, $μ\to 3 e$, $μ\to e$ conversion in nuclei, the muon anomalous magnetic moment, the electron electric dipole moment and cosmology to determine the viability of this region. We derive stringent limits on the heavy neutrino masses and mixing angles as well as on the vacuum expectation value $v_L$, which drives the type-II seesaw contribution, using the current data. We discuss the perspectives of probing the remaining parameter space by future experiments.
△ Less
Submitted 15 August, 2022;
originally announced August 2022.
-
Diagnosing Magnetic Fields in Cylindrical Implosions with Oblique Proton Radiography
Authors:
P. V. Heuer,
L. S. Leal,
J. R. Davies,
E. C. Hansen,
D. H. Barnak,
J. L. Peebles,
F. García-Rubio,
B. Pollock,
J. Moody,
A. Birkel,
F. H. Seguin
Abstract:
Two experiments on the OMEGA Laser System used oblique proton radiography to measure magnetic fields in cylindrical implosions with and without an applied axial magnetic field. Although the goal of both experiments was to measure the magnitude of the compressed axial magnetic field in the core of the implosion, this field was obfuscated by two features in the coronal plasma produced by the compres…
▽ More
Two experiments on the OMEGA Laser System used oblique proton radiography to measure magnetic fields in cylindrical implosions with and without an applied axial magnetic field. Although the goal of both experiments was to measure the magnitude of the compressed axial magnetic field in the core of the implosion, this field was obfuscated by two features in the coronal plasma produced by the compression beams: an azimuthal self-generated magnetic field and small length scale, high-amplitude structures attributed to collisionless effects. In order to understand these features, synthetic radiographs are generated using fields produced by 3-D HYDRA simulations. These synthetic radiographs reproduce the features of the experimental radiographs with the exception of the small-scale structures. A direct inversion algorithm is successfully applied to a synthetic radiograph, but is only partially able to invert the experimental radiographs in part because some protons are blocked by the field coils. The origins of the radiograph features and their dependence on various experimental parameters are explored. The results of this analysis should inform future measurements of compressed axial magnetic fields in cylindrical implosions.
△ Less
Submitted 22 July, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese
Authors:
Sidney Evaldo Leal,
Magali Sanches Duran,
Carolina Evaristo Scarton,
Nathan Siegle Hartmann,
Sandra Maria Aluísio
Abstract:
This paper presents and makes publicly available the NILC-Metrix, a computational system comprising 200 metrics proposed in studies on discourse, psycholinguistics, cognitive and computational linguistics, to assess textual complexity in Brazilian Portuguese (BP). These metrics are relevant for descriptive analysis and the creation of computational models and can be used to extract information fro…
▽ More
This paper presents and makes publicly available the NILC-Metrix, a computational system comprising 200 metrics proposed in studies on discourse, psycholinguistics, cognitive and computational linguistics, to assess textual complexity in Brazilian Portuguese (BP). These metrics are relevant for descriptive analysis and the creation of computational models and can be used to extract information from various linguistic levels of written and spoken language. The metrics in NILC-Metrix were developed during the last 13 years, starting in 2008 with Coh-Metrix-Port, a tool developed within the scope of the PorSimples project. Coh-Metrix-Port adapted some metrics to BP from the Coh-Metrix tool that computes metrics related to cohesion and coherence of texts in English. After the end of PorSimples in 2010, new metrics were added to the initial 48 metrics of Coh-Metrix-Port. Given the large number of metrics, we present them following an organisation similar to the metrics of Coh-Metrix v3.0 to facilitate comparisons made with metrics in Portuguese and English. In this paper, we illustrate the potential of NILC-Metrix by presenting three applications: (i) a descriptive analysis of the differences between children's film subtitles and texts written for Elementary School I and II (Final Years); (ii) a new predictor of textual complexity for the corpus of original and simplified texts of the PorSimples project; (iii) a complexity prediction model for school grades, using transcripts of children's story narratives told by teenagers. For each application, we evaluate which groups of metrics are more discriminative, showing their contribution for each task.
△ Less
Submitted 17 December, 2021;
originally announced January 2022.
-
Unity Perception: Generate Synthetic Data for Computer Vision
Authors:
Steve Borkman,
Adam Crespi,
Saurav Dhakad,
Sujoy Ganguly,
Jonathan Hogins,
You-Cyuan Jhang,
Mohsen Kamalzadeh,
Bowen Li,
Steven Leal,
Pete Parisi,
Cesar Romero,
Wesley Smith,
Alex Thaman,
Samuel Warren,
Nupur Yadav
Abstract:
We introduce the Unity Perception package which aims to simplify and accelerate the process of generating synthetic datasets for computer vision tasks by offering an easy-to-use and highly customizable toolset. This open-source package extends the Unity Editor and engine components to generate perfectly annotated examples for several common computer vision tasks. Additionally, it offers an extensi…
▽ More
We introduce the Unity Perception package which aims to simplify and accelerate the process of generating synthetic datasets for computer vision tasks by offering an easy-to-use and highly customizable toolset. This open-source package extends the Unity Editor and engine components to generate perfectly annotated examples for several common computer vision tasks. Additionally, it offers an extensible Randomization framework that lets the user quickly construct and configure randomized simulation parameters in order to introduce variation into the generated datasets. We provide an overview of the provided tools and how they work, and demonstrate the value of the generated synthetic datasets by training a 2D object detection model. The model trained with mostly synthetic data outperforms the model trained using only real data.
△ Less
Submitted 19 July, 2021; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Unveiling the research landscape of Sustainable Development Goals and their inclusion in Higher Education Institutions and Research Centers: major trends in 2000-2017
Authors:
Nuria Bautista-Puig,
Ana Marta Aleixo,
Susana Leal,
Ulisses Azeiteiro,
Rodrigo Costas
Abstract:
Sustainable Development Goals are the blueprint to achieve a better and more sustainable future for society. Its legacy is linked with the Millennium Development Goals, set up in 2000. A bibliometric analysis was conducted to 1) measure "core" research output from 2000-2017, with the aim to map the global research of sustainability goals, 2) describe thematic specialization based on keywords co-oc…
▽ More
Sustainable Development Goals are the blueprint to achieve a better and more sustainable future for society. Its legacy is linked with the Millennium Development Goals, set up in 2000. A bibliometric analysis was conducted to 1) measure "core" research output from 2000-2017, with the aim to map the global research of sustainability goals, 2) describe thematic specialization based on keywords co-occurrence analysis and strongest citation burst, 3) present a methodology to classify scientific output (based on an ad-hoc glossary) and assess SDGs interconnections.
Sustainability goals publications (core+expand based on direct citations) were identified in-house CWTS Web of Science by using search terms in titles, abstracts, and keywords. 25,299 bibliographic records were analyzed, from which 21,653 (85.59%) are from HEIs and research centres (RC). The purpose of this paper is to analyze the role of these organizations in sustainability research. The findings reveal the increasing participation of these organizations in this research (660 institutions in 2000-2005 to 1744 institutions involved in 2012-2017). In terms of specialization, some institutions present a higher production and specialization on the topic (e.g., London School of Hygiene & Tropical Medicine and World Health Organization); however, others present less production but higher specialization (e.g., Stockholm Environment Institute). Regarding the topics, health (especially in developing countries), women and socio-economic aspects are the most prominent ones. Moreover, it is observed the interlinked nature of SDGs between some SDGs in research output (e.g., SDG11 and SDG3). This study provides important orientation for HEIs and RCs in terms of Research, Development and Innovation (R&D+i) to respond to major societal challenges and could be useful for the policymakers in order to promote the research agenda on this topic.
△ Less
Submitted 12 February, 2020;
originally announced February 2020.
-
Pareto-optimal solution for the quantum battle of the sexes
Authors:
Adriane Consuelo da Silva Leal,
Arthur Gustavo de Araujo Ferreira,
Everton Lucas de Oliveira,
Tito Jose Bonagamba,
Ruben Auccaise Estrada
Abstract:
Quantum games have gained much popularity in the last two decades. Many of these quantum games are a redefinition of iconic classical games to fit the quantum world, and they gain many different properties and solutions in this different view. In this letter, we attempt to find a solution to an asymmetric quantum game which still troubles quantum game researchers, the quantum battle of the sexes.…
▽ More
Quantum games have gained much popularity in the last two decades. Many of these quantum games are a redefinition of iconic classical games to fit the quantum world, and they gain many different properties and solutions in this different view. In this letter, we attempt to find a solution to an asymmetric quantum game which still troubles quantum game researchers, the quantum battle of the sexes. To achieve that, we perform an analysis using the Eisert-Wilkens-Lewenstein's protocol for this asymmetric game. The protocol highlights two solutions for the game, which solve the dilemma and satisfy the Pareto-optimal definition, unlike previous reports that rely on Nash equilibrium. We perform an experimental implementation using the NMR technique in a two-qubit system. Our results eliminate dilemmas on the quantum battle of the sexes and provide us with arguments to elucidate that the Eisert-Wilkens-Lewenstein's protocol is not restricted to symmetric games when at the quantum regime.
△ Less
Submitted 3 December, 2019; v1 submitted 7 February, 2019;
originally announced February 2019.
-
Electron Paramagnetic Resonance signature of point defects in neutron irradiated hexagonal Boron Nitride
Authors:
J. R. Toledo,
D. B. de Jesus,
M. Kianinia,
A. S. Leal,
C. Fantini,
L. A. Cury,
G. M. Sáfar,
I. Aharonovich,
K. Krambrock
Abstract:
Hexagonal boron nitride (h-BN) is an attractive van der Waals material for studying fluorescent defects due to its large bandgap. In this work, we demonstrate enhanced pink color due to neutron irradiation and perform electron paramagnetic resonance (EPR) measurements. The new point defects are tentatively assigned to doubly- occupied nitrogen vacancies with (S = 1) and a zero-field splitting (D =…
▽ More
Hexagonal boron nitride (h-BN) is an attractive van der Waals material for studying fluorescent defects due to its large bandgap. In this work, we demonstrate enhanced pink color due to neutron irradiation and perform electron paramagnetic resonance (EPR) measurements. The new point defects are tentatively assigned to doubly- occupied nitrogen vacancies with (S = 1) and a zero-field splitting (D = 1.2 GHz). These defects are associated with a broad visible optical absorption band and near infrared photoluminescence band centered at ~ 490 nm and 820 nm, respectively. The EPR signal intensities are strongly affected by thermal treatments in temperature range between 600 to 800°C, where also the irradiation - induced pink color is lost. Our results are important for understanding of point defects in h-BN and their deployment for quantum and integrated photonic applications.
△ Less
Submitted 8 October, 2018; v1 submitted 14 July, 2018;
originally announced July 2018.
-
Estimating the time evolution of NMR systems via quantum speed limit-like expression
Authors:
D. V. Villamizar,
A. C. S. Leal,
R. Auccaise,
E. I. Duzzioni
Abstract:
Finding the solutions of the equations that describe the dynamics of a given physical system is crucial in order to obtain important information about its evolution. However, by using estimation theory, it is possible to obtain, under certain limitations, some information on its dynamics. The quantum-speed-limit (QSL) theory was originally used to estimate the shortest time in which a Hamiltonian…
▽ More
Finding the solutions of the equations that describe the dynamics of a given physical system is crucial in order to obtain important information about its evolution. However, by using estimation theory, it is possible to obtain, under certain limitations, some information on its dynamics. The quantum-speed-limit (QSL) theory was originally used to estimate the shortest time in which a Hamiltonian drives an initial state to a final one for a given fidelity. Using the QSL theory in a slightly different way, we are able to estimate the running time of a given quantum process. For that purpose, we impose the saturation of the Anandan-Aharonov bound in a rotating frame of reference where the state of the system travels slower than in the original frame (laboratory frame). Through this procedure it is possible to estimate the actual evolution time in the laboratory frame of reference with good accuracy when compared to previous methods. Our method is tested successfully to predict the time spent in the evolution of nuclear spins 1/2 and 3/2 in NMR systems. We find that the estimated time according to our method is better than previous approaches by up to four orders of magnitude. One disadvantage of our method is that we need to solve a number of transcendental equations, which increases with the system dimension and parameter discretization used to solve such equations numerically.
△ Less
Submitted 6 July, 2018; v1 submitted 17 May, 2017;
originally announced May 2017.
-
Rock around the Clock: An Agent-Based Model of Low- and High-Frequency Trading
Authors:
Sandrine Jacob Leal,
Mauro Napoletano,
Andrea Roventini,
Giorgio Fagiolo
Abstract:
We build an agent-based model to study how the interplay between low- and high-frequency trading affects asset price dynamics. Our main goal is to investigate whether high-frequency trading exacerbates market volatility and generates flash crashes. In the model, low-frequency agents adopt trading rules based on chronological time and can switch between fundamentalist and chartist strategies. On th…
▽ More
We build an agent-based model to study how the interplay between low- and high-frequency trading affects asset price dynamics. Our main goal is to investigate whether high-frequency trading exacerbates market volatility and generates flash crashes. In the model, low-frequency agents adopt trading rules based on chronological time and can switch between fundamentalist and chartist strategies. On the contrary, high-frequency traders activation is event-driven and depends on price fluctuations. High-frequency traders use directional strategies to exploit market information produced by low-frequency traders. Monte-Carlo simulations reveal that the model replicates the main stylized facts of financial markets. Furthermore, we find that the presence of high-frequency trading increases market volatility and plays a fundamental role in the generation of flash crashes. The emergence of flash crashes is explained by two salient characteristics of high-frequency traders, i.e. their ability to i) generate high bid-ask spreads and ii) synchronize on the sell side of the limit order book. Finally, we find that higher rates of order cancellation by high-frequency traders increase the incidence of flash crashes but reduce their duration.
△ Less
Submitted 10 February, 2014;
originally announced February 2014.