-
Privacy-Preserving Synthetically Augmented Knowledge Graphs with Semantic Utility
Authors:
Luigi Bellomarini,
Costanza Catalano,
Andrea Coletta,
Michela Iezzi,
Pierangela Samarati
Abstract:
Knowledge Graphs (KGs) have recently gained relevant attention in many application domains, from healthcare to biotechnology, from logistics to finance. Financial organisations, central banks, economic research entities, and national supervision authorities apply ontological reasoning on KGs to address crucial business tasks, such as economic policymaking, banking supervision, anti-money launderin…
▽ More
Knowledge Graphs (KGs) have recently gained relevant attention in many application domains, from healthcare to biotechnology, from logistics to finance. Financial organisations, central banks, economic research entities, and national supervision authorities apply ontological reasoning on KGs to address crucial business tasks, such as economic policymaking, banking supervision, anti-money laundering, and economic research. Reasoning allows for the generation of derived knowledge capturing complex business semantics and the set up of effective business processes. A major obstacle in KGs sharing is represented by privacy considerations since the identity of the data subjects and their sensitive or company-confidential information may be improperly exposed.
In this paper, we propose a novel framework to enable KGs sharing while ensuring that information that should remain private is not directly released nor indirectly exposed via derived knowledge, while maintaining the embedded knowledge of the KGs to support business downstream tasks. Our approach produces a privacy-preserving synthetic KG as an augmentation of the input one via the introduction of structural anonymisation. We introduce a novel privacy measure for KGs, which considers derived knowledge and a new utility metric that captures the business semantics we want to preserve, and propose two novel anonymization algorithms. Our extensive experimental evaluation, with both synthetic graphs and real-world datasets, confirms the effectiveness of our approach achieving up to a 70% improvement in the privacy of entities compared to existing methods not specifically designed for KGs.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Dynamical Accretion Flows -- ALMAGAL: Flows along filamentary structures in high-mass star-forming clusters
Authors:
M. R. A. Wells,
H. Beuther,
S. Molinari,
P. Schilke,
C. Battersby,
P. Ho,
Á. Sánchez-Monge,
B. Jones,
M. B. Scheuck,
J. Syed,
C. Gieser,
R. Kuiper,
D. Elia,
A. Coletta,
A. Traficante,
J. Wallace,
A. J. Rigby,
R. S. Klessen,
Q. Zhang,
S. Walch,
M. T. Beltrán,
Y. Tang,
G. A. Fuller,
D. C. Lis,
T. Möller
, et al. (25 additional authors not shown)
Abstract:
We use data from the ALMA Evolutionary Study of High Mass Protocluster Formation in the Galaxy (ALMAGAL) survey to study 100 ALMAGAL regions at $\sim$ 1 arsecond resolution located between $\sim$ 2 and 6 kpc distance. Using ALMAGAL $\sim$ 1.3mm line and continuum data we estimate flow rates onto individual cores. We focus specifically on flow rates along filamentary structures associated with thes…
▽ More
We use data from the ALMA Evolutionary Study of High Mass Protocluster Formation in the Galaxy (ALMAGAL) survey to study 100 ALMAGAL regions at $\sim$ 1 arsecond resolution located between $\sim$ 2 and 6 kpc distance. Using ALMAGAL $\sim$ 1.3mm line and continuum data we estimate flow rates onto individual cores. We focus specifically on flow rates along filamentary structures associated with these cores. Our primary analysis is centered around position velocity cuts in H$_2$CO (3$_{0,3}$ - 2$_{0,2}$) which allow us to measure the velocity fields, surrounding these cores. Combining this work with column density estimates we derive the flow rates along the extended filamentary structures associated with cores in these regions. We select a sample of 100 ALMAGAL regions covering four evolutionary stages from quiescent to protostellar, Young Stellar Objects (YSOs), and HII regions (25 each). Using dendrogram and line analysis, we identify a final sample of 182 cores in 87 regions. In this paper, we present 728 flow rates for our sample (4 per core), analysed in the context of evolutionary stage, distance from the core, and core mass. On average, for the whole sample, we derive flow rates on the order of $\sim$10$^{-4}$ M$_{sun}$yr$^{-1}$ with estimated uncertainties of $\pm$50%. We see increasing differences in the values among evolutionary stages, most notably between the less evolved (quiescent/protostellar) and more evolved (YSO/HII region) sources. We also see an increasing trend as we move further away from the centre of these cores. We also find a clear relationship between the flow rates and core masses $\sim$M$^{2/3}$ which is in line with the result expected from the tidal-lobe accretion mechanism. Overall, we see increasing trends in the relationships between the flow rate and the three investigated parameters; evolutionary stage, distance from the core, and core mass.
△ Less
Submitted 16 August, 2024; v1 submitted 15 August, 2024;
originally announced August 2024.
-
Simulating the Economic Impact of Rationality through Reinforcement Learning and Agent-Based Modelling
Authors:
Simone Brusatin,
Tommaso Padoan,
Andrea Coletta,
Domenico Delli Gatti,
Aldo Glielmo
Abstract:
Agent-based models (ABMs) are simulation models used in economics to overcome some of the limitations of traditional frameworks based on general equilibrium assumptions. However, agents within an ABM follow predetermined 'bounded rational' behavioural rules which can be cumbersome to design and difficult to justify. Here we leverage multi-agent reinforcement learning (RL) to expand the capabilitie…
▽ More
Agent-based models (ABMs) are simulation models used in economics to overcome some of the limitations of traditional frameworks based on general equilibrium assumptions. However, agents within an ABM follow predetermined 'bounded rational' behavioural rules which can be cumbersome to design and difficult to justify. Here we leverage multi-agent reinforcement learning (RL) to expand the capabilities of ABMs with the introduction of 'fully rational' agents that learn their policy by interacting with the environment and maximising a reward function. Specifically, we propose a 'Rational macro ABM' (R-MABM) framework by extending a paradigmatic macro ABM from the economic literature. We show that gradually substituting ABM firms in the model with RL agents, trained to maximise profits, allows for studying the impact of rationality on the economy. We find that RL agents spontaneously learn three distinct strategies for maximising profits, with the optimal strategy depending on the level of market competition and rationality. We also find that RL agents with independent policies, and without the ability to communicate with each other, spontaneously learn to segregate into different strategic groups, thus increasing market power and overall profits. Finally, we find that a higher number of rational (RL) agents in the economy always improves the macroeconomic environment as measured by total output. Depending on the specific rational policy, this can come at the cost of higher instability. Our R-MABM framework allows for stable multi-agent learning, is available in open source, and represents a principled and robust direction to extend economic simulators.
△ Less
Submitted 21 October, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
LLM-driven Imitation of Subrational Behavior : Illusion or Reality?
Authors:
Andrea Coletta,
Kshama Dwarakanath,
Penghang Liu,
Svitlana Vyetrenko,
Tucker Balch
Abstract:
Modeling subrational agents, such as humans or economic households, is inherently challenging due to the difficulty in calibrating reinforcement learning models or collecting data that involves human subjects. Existing work highlights the ability of Large Language Models (LLMs) to address complex reasoning tasks and mimic human communication, while simulation using LLMs as agents shows emergent so…
▽ More
Modeling subrational agents, such as humans or economic households, is inherently challenging due to the difficulty in calibrating reinforcement learning models or collecting data that involves human subjects. Existing work highlights the ability of Large Language Models (LLMs) to address complex reasoning tasks and mimic human communication, while simulation using LLMs as agents shows emergent social behaviors, potentially improving our comprehension of human conduct. In this paper, we propose to investigate the use of LLMs to generate synthetic human demonstrations, which are then used to learn subrational agent policies though Imitation Learning. We make an assumption that LLMs can be used as implicit computational models of humans, and propose a framework to use synthetic demonstrations derived from LLMs to model subrational behaviors that are characteristic of humans (e.g., myopic behavior or preference for risk aversion). We experimentally evaluate the ability of our framework to model sub-rationality through four simple scenarios, including the well-researched ultimatum game and marshmallow experiment. To gain confidence in our framework, we are able to replicate well-established findings from prior human studies associated with the above scenarios. We conclude by discussing the potential benefits, challenges and limitations of our framework.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Synthetic Data Applications in Finance
Authors:
Vamsi K. Potluru,
Daniel Borrajo,
Andrea Coletta,
Niccolò Dalmasso,
Yousef El-Laham,
Elizabeth Fons,
Mohsen Ghassemi,
Sriram Gopalakrishnan,
Vikesh Gosai,
Eleonora Kreačić,
Ganapathy Mani,
Saheed Obitayo,
Deepak Paramanand,
Natraj Raman,
Mikhail Solonin,
Srijan Sood,
Svitlana Vyetrenko,
Haibei Zhu,
Manuela Veloso,
Tucker Balch
Abstract:
Synthetic data has made tremendous strides in various commercial settings including finance, healthcare, and virtual reality. We present a broad overview of prototypical applications of synthetic data in the financial sector and in particular provide richer details for a few select ones. These cover a wide variety of data modalities including tabular, time-series, event-series, and unstructured ar…
▽ More
Synthetic data has made tremendous strides in various commercial settings including finance, healthcare, and virtual reality. We present a broad overview of prototypical applications of synthetic data in the financial sector and in particular provide richer details for a few select ones. These cover a wide variety of data modalities including tabular, time-series, event-series, and unstructured arising from both markets and retail financial applications. Since finance is a highly regulated industry, synthetic data is a potential approach for dealing with issues related to privacy, fairness, and explainability. Various metrics are utilized in evaluating the quality and effectiveness of our approaches in these applications. We conclude with open directions in synthetic data in the context of the financial domain.
△ Less
Submitted 20 March, 2024; v1 submitted 29 December, 2023;
originally announced January 2024.
-
Multi-Modal Financial Time-Series Retrieval Through Latent Space Projections
Authors:
Tom Bamford,
Andrea Coletta,
Elizabeth Fons,
Sriram Gopalakrishnan,
Svitlana Vyetrenko,
Tucker Balch,
Manuela Veloso
Abstract:
Financial firms commonly process and store billions of time-series data, generated continuously and at a high frequency. To support efficient data storage and retrieval, specialized time-series databases and systems have emerged. These databases support indexing and querying of time-series by a constrained Structured Query Language(SQL)-like format to enable queries like "Stocks with monthly price…
▽ More
Financial firms commonly process and store billions of time-series data, generated continuously and at a high frequency. To support efficient data storage and retrieval, specialized time-series databases and systems have emerged. These databases support indexing and querying of time-series by a constrained Structured Query Language(SQL)-like format to enable queries like "Stocks with monthly price returns greater than 5%", and expressed in rigid formats. However, such queries do not capture the intrinsic complexity of high dimensional time-series data, which can often be better described by images or language (e.g., "A stock in low volatility regime"). Moreover, the required storage, computational time, and retrieval complexity to search in the time-series space are often non-trivial. In this paper, we propose and demonstrate a framework to store multi-modal data for financial time-series in a lower-dimensional latent space using deep encoders, such that the latent space projections capture not only the time series trends but also other desirable information or properties of the financial time-series data (such as price volatility). Moreover, our approach allows user-friendly query interfaces, enabling natural language text or sketches of time-series, for which we have developed intuitive interfaces. We demonstrate the advantages of our method in terms of computational efficiency and accuracy on real historical data as well as synthetic data, and highlight the utility of latent-space projections in the storage and retrieval of financial time-series data with intuitive query modalities.
△ Less
Submitted 2 January, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
INTAGS: Interactive Agent-Guided Simulation
Authors:
Song Wei,
Andrea Coletta,
Svitlana Vyetrenko,
Tucker Balch
Abstract:
In many applications involving multi-agent system (MAS), it is imperative to test an experimental (Exp) autonomous agent in a high-fidelity simulator prior to its deployment to production, to avoid unexpected losses in the real-world. Such a simulator acts as the environmental background (BG) agent(s), called agent-based simulator (ABS), aiming to replicate the complex real MAS. However, developin…
▽ More
In many applications involving multi-agent system (MAS), it is imperative to test an experimental (Exp) autonomous agent in a high-fidelity simulator prior to its deployment to production, to avoid unexpected losses in the real-world. Such a simulator acts as the environmental background (BG) agent(s), called agent-based simulator (ABS), aiming to replicate the complex real MAS. However, developing realistic ABS remains challenging, mainly due to the sequential and dynamic nature of such systems. To fill this gap, we propose a metric to distinguish between real and synthetic multi-agent systems, which is evaluated through the live interaction between the Exp and BG agents to explicitly account for the systems' sequential nature. Specifically, we characterize the system/environment by studying the effect of a sequence of BG agents' responses to the environment state evolution and take such effects' differences as MAS distance metric; The effect estimation is cast as a causal inference problem since the environment evolution is confounded with the previous environment state. Importantly, we propose the Interactive Agent-Guided Simulation (INTAGS) framework to build a realistic ABS by optimizing over this novel metric. To adapt to any environment with interactive sequential decision making agents, INTAGS formulates the simulator as a stochastic policy in reinforcement learning. Moreover, INTAGS utilizes the policy gradient update to bypass differentiating the proposed metric such that it can support non-differentiable operations of multi-agent environments. Through extensive experiments, we demonstrate the effectiveness of INTAGS on an equity stock market simulation example. We show that using INTAGS to calibrate the simulator can generate more realistic market data compared to the state-of-the-art conditional Wasserstein Generative Adversarial Network approach.
△ Less
Submitted 17 November, 2023; v1 submitted 4 September, 2023;
originally announced September 2023.
-
A new tool to derive simultaneously exponent and extremes of power-law distributions
Authors:
S. Pezzuto,
A. Coletta,
R. S. Klessen,
E. Schisano,
M. Benedettini,
D. Elia,
S. Molinari,
J. D. Soler,
A. Traficante
Abstract:
Many experimental quantities show a power-law distribution $p(x)\propto x^{-α}$. In astrophysics, examples are: size distribution of dust grains or luminosity function of galaxies. Such distributions are characterized by the exponent $α$ and by the extremes $x_\text{min}$ $x_\text{max}$ where the distribution extends. There are no mathematical tools that derive the three unknowns at the same time.…
▽ More
Many experimental quantities show a power-law distribution $p(x)\propto x^{-α}$. In astrophysics, examples are: size distribution of dust grains or luminosity function of galaxies. Such distributions are characterized by the exponent $α$ and by the extremes $x_\text{min}$ $x_\text{max}$ where the distribution extends. There are no mathematical tools that derive the three unknowns at the same time. In general, one estimates a set of $α$ corresponding to different guesses of $x_\text{min}$ $x_\text{max}$. Then, the best set of values describing the observed data is selected a posteriori. In this paper, we present a tool that finds contextually the three parameters based on simple assumptions on how the observed values $x_i$ populate the unknown range between $x_\text{min}$ and $x_\text{max}$ for a given $α$. Our tool, freely downloadable, finds the best values through a non-linear least-squares fit. We compare our technique with the maximum likelihood estimators for power-law distributions, both truncated and not. Through simulated data, we show for each method the reliability of the computed parameters as a function of the number $N$ of data in the sample. We then apply our method to observed data to derive: i) the slope of the core mass function in the Perseus star-forming region, finding two power-law distributions: $α=2.576$ between $1.06\,M_{\sun}$ and $3.35\,M_{\sun}$, $α=3.39$ between $3.48\,M_{\sun}$ and $33.4\,M_{\sun}$; ii) the slope of the $γ$-ray spectrum of the blazar J0011.4+0057, extracted from the Fermi-LAT archive. For the latter case, we derive $α=2.89$ between 1,484~MeV and 28.7~GeV; then we derive the time-resolved slopes using subsets 200 photons each.
△ Less
Submitted 19 September, 2023; v1 submitted 28 August, 2023;
originally announced August 2023.
-
LOB-Based Deep Learning Models for Stock Price Trend Prediction: A Benchmark Study
Authors:
Matteo Prata,
Giuseppe Masi,
Leonardo Berti,
Viviana Arrigoni,
Andrea Coletta,
Irene Cannistraci,
Svitlana Vyetrenko,
Paola Velardi,
Novella Bartolini
Abstract:
The recent advancements in Deep Learning (DL) research have notably influenced the finance sector. We examine the robustness and generalizability of fifteen state-of-the-art DL models focusing on Stock Price Trend Prediction (SPTP) based on Limit Order Book (LOB) data. To carry out this study, we developed LOBCAST, an open-source framework that incorporates data preprocessing, DL model training, e…
▽ More
The recent advancements in Deep Learning (DL) research have notably influenced the finance sector. We examine the robustness and generalizability of fifteen state-of-the-art DL models focusing on Stock Price Trend Prediction (SPTP) based on Limit Order Book (LOB) data. To carry out this study, we developed LOBCAST, an open-source framework that incorporates data preprocessing, DL model training, evaluation and profit analysis. Our extensive experiments reveal that all models exhibit a significant performance drop when exposed to new data, thereby raising questions about their real-world market applicability. Our work serves as a benchmark, illuminating the potential and the limitations of current approaches and providing insight for innovative solutions.
△ Less
Submitted 19 September, 2023; v1 submitted 5 July, 2023;
originally announced August 2023.
-
On the Constrained Time-Series Generation Problem
Authors:
Andrea Coletta,
Sriram Gopalakrishan,
Daniel Borrajo,
Svitlana Vyetrenko
Abstract:
Synthetic time series are often used in practical applications to augment the historical time series dataset for better performance of machine learning algorithms, amplify the occurrence of rare events, and also create counterfactual scenarios described by the time series. Distributional-similarity (which we refer to as realism) as well as the satisfaction of certain numerical constraints are comm…
▽ More
Synthetic time series are often used in practical applications to augment the historical time series dataset for better performance of machine learning algorithms, amplify the occurrence of rare events, and also create counterfactual scenarios described by the time series. Distributional-similarity (which we refer to as realism) as well as the satisfaction of certain numerical constraints are common requirements in counterfactual time series scenario generation requests. For instance, the US Federal Reserve publishes synthetic market stress scenarios given by the constrained time series for financial institutions to assess their performance in hypothetical recessions. Existing approaches for generating constrained time series usually penalize training loss to enforce constraints, and reject non-conforming samples. However, these approaches would require re-training if we change constraints, and rejection sampling can be computationally expensive, or impractical for complex constraints. In this paper, we propose a novel set of methods to tackle the constrained time series generation problem and provide efficient sampling while ensuring the realism of generated time series. In particular, we frame the problem using a constrained optimization framework and then we propose a set of generative methods including "GuidedDiffTime", a guided diffusion model to generate realistic time series. Empirically, we evaluate our work on several datasets for financial and energy data, where incorporating constraints is critical. We show that our approaches outperform existing work both qualitatively and quantitatively. Most importantly, we show that our "GuidedDiffTime" model is the only solution where re-training is not necessary for new constraints, resulting in a significant carbon footprint reduction, up to 92% w.r.t. existing deep learning methods.
△ Less
Submitted 14 September, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Conditional Generators for Limit Order Book Environments: Explainability, Challenges, and Robustness
Authors:
Andrea Coletta,
Joseph Jerome,
Rahul Savani,
Svitlana Vyetrenko
Abstract:
Limit order books are a fundamental and widespread market mechanism. This paper investigates the use of conditional generative models for order book simulation. For developing a trading agent, this approach has drawn recent attention as an alternative to traditional backtesting due to its ability to react to the presence of the trading agent. Using a state-of-the-art CGAN (from Coletta et al. (202…
▽ More
Limit order books are a fundamental and widespread market mechanism. This paper investigates the use of conditional generative models for order book simulation. For developing a trading agent, this approach has drawn recent attention as an alternative to traditional backtesting due to its ability to react to the presence of the trading agent. Using a state-of-the-art CGAN (from Coletta et al. (2022)), we explore its dependence upon input features, which highlights both strengths and weaknesses. To do this, we use "adversarial attacks" on the model's features and its mechanism. We then show how these insights can be used to improve the CGAN, both in terms of its realism and robustness. We finish by laying out a roadmap for future work.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs
Authors:
Andrea Coletta,
Svitlana Vyetrenko,
Tucker Balch
Abstract:
Learning agent behaviors from observational data has shown to improve our understanding of their decision-making processes, advancing our ability to explain their interactions with the environment and other agents. While multiple learning techniques have been proposed in the literature, there is one particular setting that has not been explored yet: multi agent systems where agent identities remai…
▽ More
Learning agent behaviors from observational data has shown to improve our understanding of their decision-making processes, advancing our ability to explain their interactions with the environment and other agents. While multiple learning techniques have been proposed in the literature, there is one particular setting that has not been explored yet: multi agent systems where agent identities remain anonymous. For instance, in financial markets labeled data that identifies market participant strategies is typically proprietary, and only the anonymous state-action pairs that result from the interaction of multiple market participants are publicly available. As a result, sequences of agent actions are not observable, restricting the applicability of existing work. In this paper, we propose a Policy Clustering algorithm, called K-SHAP, that learns to group anonymous state-action pairs according to the agent policies. We frame the problem as an Imitation Learning (IL) task, and we learn a world-policy able to mimic all the agent behaviors upon different environmental states. We leverage the world-policy to explain each anonymous observation through an additive feature attribution method called SHAP (SHapley Additive exPlanations). Finally, by clustering the explanations we show that we are able to identify different agent policies and group observations accordingly. We evaluate our approach on simulated synthetic market data and a real-world financial dataset. We show that our proposal significantly and consistently outperforms the existing methods, identifying different agent strategies.
△ Less
Submitted 26 June, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
A$^2$-UAV: Application-Aware Content and Network Optimization of Edge-Assisted UAV Systems
Authors:
Andrea Coletta,
Flavio Giorgi,
Gaia Maselli,
Matteo Prata,
Domenicomichele Silvestri,
Jonathan Ashdown,
Francesco Restuccia
Abstract:
To perform advanced surveillance, Unmanned Aerial Vehicles (UAVs) require the execution of edge-assisted computer vision (CV) tasks. In multi-hop UAV networks, the successful transmission of these tasks to the edge is severely challenged due to severe bandwidth constraints. For this reason, we propose a novel A$^2$-UAV framework to optimize the number of correctly executed tasks at the edge. In st…
▽ More
To perform advanced surveillance, Unmanned Aerial Vehicles (UAVs) require the execution of edge-assisted computer vision (CV) tasks. In multi-hop UAV networks, the successful transmission of these tasks to the edge is severely challenged due to severe bandwidth constraints. For this reason, we propose a novel A$^2$-UAV framework to optimize the number of correctly executed tasks at the edge. In stark contrast with existing art, we take an application-aware approach and formulate a novel pplication-Aware Task Planning Problem (A$^2$-TPP) that takes into account (i) the relationship between deep neural network (DNN) accuracy and image compression for the classes of interest based on the available dataset, (ii) the target positions, (iii) the current energy/position of the UAVs to optimize routing, data pre-processing and target assignment for each UAV. We demonstrate A$^2$-TPP is NP-Hard and propose a polynomial-time algorithm to solve it efficiently. We extensively evaluate A$^2$-UAV through real-world experiments with a testbed composed by four DJI Mavic Air 2 UAVs. We consider state-of-the-art image classification tasks with four different DNN models (i.e., DenseNet, ResNet152, ResNet50 and MobileNet-V2) and object detection tasks using YoloV4 trained on the ImageNet dataset. Results show that A$^2$-UAV attains on average around 38% more accomplished tasks than the state-of-the-art, with 400% more accomplished tasks when the number of targets increases significantly. To allow full reproducibility, we pledge to share datasets and code with the research community.
△ Less
Submitted 24 July, 2023; v1 submitted 16 January, 2023;
originally announced January 2023.
-
The Star Formation Rate of the Milky Way as seen by Herschel
Authors:
D. Elia,
S. Molinari,
E. Schisano,
J. D. Soler,
M. Merello,
D. Russeil,
M. Veneziani,
A. Zavagno,
A. Noriega-Crespo,
L. Olmi,
M. Benedettini,
P. Hennebelle,
R. S. Klessen,
S. Leurini,
R. Paladini,
S. Pezzuto,
A. Traficante,
D. J. Eden,
P. G. Martin,
M. Sormani,
A. Coletta,
T. Colman,
R. Plume,
Y. Maruccia,
C. Mininni
, et al. (1 additional authors not shown)
Abstract:
We present a new derivation of the Milky Way's current star formation rate (SFR) based on the data of the Hi-GAL Galactic plane survey. We estimate the distribution of the SFR across the Galactic plane from the star-forming clumps identified in the Hi-GAL survey and calculate the total SFR from the sum of their contributions. The estimate of the global SFR amounts to $2.0 \pm 0.7$~M$_{\odot}$~yr…
▽ More
We present a new derivation of the Milky Way's current star formation rate (SFR) based on the data of the Hi-GAL Galactic plane survey. We estimate the distribution of the SFR across the Galactic plane from the star-forming clumps identified in the Hi-GAL survey and calculate the total SFR from the sum of their contributions. The estimate of the global SFR amounts to $2.0 \pm 0.7$~M$_{\odot}$~yr$^{-1}$, of which $1.7 \pm 0.6$~M$_{\odot}$~yr$^{-1}$ coming from clumps with reliable heliocentric distance assignment. This value is in general agreement with estimates found in the literature of last decades. The profile of SFR density averaged in Galactocentric rings is found to be qualitatively similar to others previously computed, with a peak corresponding to the Central Molecular Zone and another one around Galactocentric radius $R_\mathrm{gal} \sim 5$~kpc, followed by an exponential decrease as $\log(Σ_\mathrm{SFR}/[\mathrm{M}_\odot~\mathrm{yr}^{-1}~\mathrm{kpc}^{-2}])=a\,R_\mathrm{gal}/[\mathrm{kpc}]+b $, with $a=-0.28 \pm 0.01$. In this regard, the fraction of SFR produced within and outside the Solar circle is 84\% and 16\%, respectively; the fraction corresponding to the far outer Galaxy ($R_\mathrm{gal} > 13.5$~kpc) is only 1\%. We also find that, for $R_\mathrm{gal}>3$~kpc, our data follow a power law as a function of density, similarly to the Kennicutt-Schmidt relation. Finally, we compare the distribution of the SFR density across the face-on Galactic plane and those of median parameters, such as temperature, luminosity/mass ratio and bolometric temperature, describing the evolutionary stage of Hi-GAL clumps. We found no clear correlation between the SFR and the clump evolutionary stage.
△ Less
Submitted 10 November, 2022;
originally announced November 2022.
-
Learning to simulate realistic limit order book markets from data as a World Agent
Authors:
Andrea Coletta,
Aymeric Moulin,
Svitlana Vyetrenko,
Tucker Balch
Abstract:
Multi-agent market simulators usually require careful calibration to emulate real markets, which includes the number and the type of agents. Poorly calibrated simulators can lead to misleading conclusions, potentially causing severe loss when employed by investment banks, hedge funds, and traders to study and evaluate trading strategies. In this paper, we propose a world model simulator that accur…
▽ More
Multi-agent market simulators usually require careful calibration to emulate real markets, which includes the number and the type of agents. Poorly calibrated simulators can lead to misleading conclusions, potentially causing severe loss when employed by investment banks, hedge funds, and traders to study and evaluate trading strategies. In this paper, we propose a world model simulator that accurately emulates a limit order book market -- it requires no agent calibration but rather learns the simulated market behavior directly from historical data. Traditional approaches fail short to learn and calibrate trader population, as historical labeled data with details on each individual trader strategy is not publicly available. Our approach proposes to learn a unique "world" agent from historical data. It is intended to emulate the overall trader population, without the need of making assumptions about individual market agent strategies. We implement our world agent simulator models as a Conditional Generative Adversarial Network (CGAN), as well as a mixture of parametric distributions, and we compare our models against previous work. Qualitatively and quantitatively, we show that the proposed approaches consistently outperform previous work, providing more realism and responsiveness.
△ Less
Submitted 26 September, 2022;
originally announced October 2022.
-
Towards Realistic Market Simulations: a Generative Adversarial Networks Approach
Authors:
Andrea Coletta,
Matteo Prata,
Michele Conti,
Emanuele Mercanti,
Novella Bartolini,
Aymeric Moulin,
Svitlana Vyetrenko,
Tucker Balch
Abstract:
Simulated environments are increasingly used by trading firms and investment banks to evaluate trading strategies before approaching real markets. Backtesting, a widely used approach, consists of simulating experimental strategies while replaying historical market scenarios. Unfortunately, this approach does not capture the market response to the experimental agents' actions. In contrast, multi-ag…
▽ More
Simulated environments are increasingly used by trading firms and investment banks to evaluate trading strategies before approaching real markets. Backtesting, a widely used approach, consists of simulating experimental strategies while replaying historical market scenarios. Unfortunately, this approach does not capture the market response to the experimental agents' actions. In contrast, multi-agent simulation presents a natural bottom-up approach to emulating agent interaction in financial markets. It allows to set up pools of traders with diverse strategies to mimic the financial market trader population, and test the performance of new experimental strategies. Since individual agent-level historical data is typically proprietary and not available for public use, it is difficult to calibrate multiple market agents to obtain the realism required for testing trading strategies. To addresses this challenge we propose a synthetic market generator based on Conditional Generative Adversarial Networks (CGANs) trained on real aggregate-level historical data. A CGAN-based "world" agent can generate meaningful orders in response to an experimental agent. We integrate our synthetic market generator into ABIDES, an open source simulator of financial markets. By means of extensive simulations we show that our proposal outperforms previous work in terms of stylized facts reflecting market responsiveness and realism.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Evolutionary study of complex organic molecules in high-mass star-forming regions
Authors:
A. Coletta,
F. Fontani,
V. M. Rivilla,
C. Mininni,
L. Colzi,
Á. Sánchez-Monge,
M. T. Beltrán
Abstract:
We have studied four complex organic molecules (COMs), methyl formate ($CH_3OCHO$), dimethyl ether ($CH_3OCH_3$), formamide ($NH_2CHO$), and ethyl cyanide ($C_2H_5CN$), towards a large sample of 39 high-mass star-forming regions representing different evolutionary stages, from early to evolved phases. We aim to identify potential correlations between the molecules and to trace their evolutionary s…
▽ More
We have studied four complex organic molecules (COMs), methyl formate ($CH_3OCHO$), dimethyl ether ($CH_3OCH_3$), formamide ($NH_2CHO$), and ethyl cyanide ($C_2H_5CN$), towards a large sample of 39 high-mass star-forming regions representing different evolutionary stages, from early to evolved phases. We aim to identify potential correlations between the molecules and to trace their evolutionary sequence through the star formation process. We analysed spectra obtained at 3, 2, and 0.9 mm with the IRAM-30m telescope. We derived the main physical parameters for each species by fitting the molecular lines. We compared them and evaluated their evolution, also taking several other interstellar environments into account. We report detections in 20 sources, revealing a clear dust absorption effect on column densities. Derived abundances are ~$10^{-10}-10^{-7}$ for $CH_3OCHO$ and $CH_3OCH_3$, ~$10^{-12}-10^{-10}$ for $NH_2CHO$, and ~$10^{-11}-10^{-9}$ for $C_2H_5CN$. The abundances of $CH_3OCHO$, $CH_3OCH_3$, and $C_2H_5CN$ are very strongly correlated (r>0.92) across ~4 orders of magnitude. $CH_3OCHO$ and $CH_3OCH_3$ show the strongest correlations in most parameters, and a nearly constant ratio (~1) over a remarkable ~9 orders of magnitude in luminosity for a wide variety of sources: pre-stellar to evolved cores, low- to high-mass objects, shocks, Galactic clouds, and comets. This indicates that COMs chemistry is likely early developed and then preserved through evolved phases. Moreover, the molecular abundances clearly increase with evolution. We consider $CH_3OCHO$ and $CH_3OCH_3$ to be most likely chemically linked: they could e.g. share a common precursor, or be formed one from the other. We propose a general scenario for all COMs, involving a formation in the cold, earliest phases of star formation and a following increasing desorption with the progressive heating of the evolving core.
△ Less
Submitted 27 June, 2020;
originally announced June 2020.
-
My SIM is Leaking My Data: Exposing Self-Login Privacy Breaches in Smartphones
Authors:
Andrea Coletta,
Gaia Maselli,
Mauro Piva,
Domenicomichele Silvestri,
Francesco Restuccia
Abstract:
We expose a new security leak for smartphone users, which allows to stole user personal data by accessing the mobile operator user page when auto-login is employed. We show how any "apparently" genuine app can steal these data from some mobile operators, affecting more than 80% of Italian mobile smartphones.
We expose a new security leak for smartphone users, which allows to stole user personal data by accessing the mobile operator user page when auto-login is employed. We show how any "apparently" genuine app can steal these data from some mobile operators, affecting more than 80% of Italian mobile smartphones.
△ Less
Submitted 8 April, 2020; v1 submitted 18 March, 2020;
originally announced March 2020.
-
X-ray afterglow detection of the short gamma-ray burst 991014
Authors:
J. J. M. in 't Zand,
L. Kuiper,
L. Amati,
L. A. Antonelli,
K. Hurley,
A. Coletta,
E. Costa,
M. Feroci,
F. Frontera,
G. Gandolfi,
J. Heise,
E. Kuulkers,
J. M. Muller,
L. Nicastro,
L. Piro,
M. J. S. Smith,
M. Tavani
Abstract:
GRB 991014 is one of the shortest gamma-ray bursts detected so far with the Wide Field Cameras aboard BeppoSAX, both in gamma-rays and X-rays. The duration is 9.6 sec in 2-28 keV and 3.2 sec in 40 to 700 keV (as measured between the times when 5 and 95% of the burst photons have been accumulated). We refine the InterPlanetary Network annulus of the burst, present the detection of the X-ray after…
▽ More
GRB 991014 is one of the shortest gamma-ray bursts detected so far with the Wide Field Cameras aboard BeppoSAX, both in gamma-rays and X-rays. The duration is 9.6 sec in 2-28 keV and 3.2 sec in 40 to 700 keV (as measured between the times when 5 and 95% of the burst photons have been accumulated). We refine the InterPlanetary Network annulus of the burst, present the detection of the X-ray afterglow of GRB 991014 within this refined annulus, and discuss X-ray and gamma-ray observations of the prompt and afterglow emission. Except for the briefness of the prompt event, no other unusual aspects were found in the prompt and afterglow observations as compared to such measurements in previous gamma-ray bursts.
△ Less
Submitted 19 July, 2000;
originally announced July 2000.
-
BeppoSAX spectrum of GRB971214: evidence of a substantial energy output during afterglow
Authors:
D. Dal Fiume,
L. Amati,
L. A. Antonelli,
F. Fiore,
J. M. Muller,
A. Parmar,
N. Masetti,
E. Pian,
E. Costa,
F. Frontera,
L. Piro,
J. Heise,
R. C. Butler,
A. Coletta,
M. Feroci,
P. Giommi,
L. Nicastro,
M. Orlandini,
E. Palazzi,
G. Pizzichini,
M. Tavani
Abstract:
We report the X/gamma-ray spectrum of GRB971214 and of its afterglow. The afterglow was measured few hours after the main event and for an elapsed time of more than two days. The measure of this GRB and afterglow is relevant due to its extreme, cosmological distance (z=3.42). The prompt event shows a hard photon spectrum, consistent with a broken power law with photon indices Gamma_X~0.1 below ~…
▽ More
We report the X/gamma-ray spectrum of GRB971214 and of its afterglow. The afterglow was measured few hours after the main event and for an elapsed time of more than two days. The measure of this GRB and afterglow is relevant due to its extreme, cosmological distance (z=3.42). The prompt event shows a hard photon spectrum, consistent with a broken power law with photon indices Gamma_X~0.1 below ~20 keV and Gamma_g~1.3 above 60 keV. The afterglow spectrum, measured with the MECS and LECS BeppoSAX telescopes, is consistent with a power law with spectral photon index Gamma=1.6. Within the statistical accuracy of our measure no spectral evolution is detected during the observation of the afterglow. When integrated during the time span covered by BeppoSAX observations, the power in the afterglow emission, even with very conservative assumptions, is at least comparable with the power in the main event. The IR-to-X rays broad band spectrum is also presented, collecting data from the literature and adding them to the BeppoSAX measure. It shows that the predictions from synchrotron emission models is qualitatively confirmed. The BeppoSAX measurement of the X and gamma ray spectrum of this GRB/afterglow is discussed in the framework of current theoretical models.
△ Less
Submitted 10 February, 2000;
originally announced February 2000.
-
What can BeppoSAX tell us about short GRBs: An update from the Subsecond GRB Project
Authors:
G. Gandolfi,
M. Feroci,
E. Costa,
L. Piro,
M. J. S. Smith,
J. M. Muller,
A. Coletta,
G. Celidonio,
L. Di Ciolo,
A. Paolino,
G. Tarei,
G. Tassone,
F. Frontera
Abstract:
We present some statistical considerations on the BeppoSAX hunt for subsecond GRBs at the Scientific Operation Center. Archive analysis of a BATSE/SAX sub-sample of bursts indicates that the GRB Monitor is sensitive to short (< 2 sec) events, that are in fact about 22% of the total. The non-detection of corresponding prompt X-ray counterparts to short bursts in the Wide Field Cameras, in about 3…
▽ More
We present some statistical considerations on the BeppoSAX hunt for subsecond GRBs at the Scientific Operation Center. Archive analysis of a BATSE/SAX sub-sample of bursts indicates that the GRB Monitor is sensitive to short (< 2 sec) events, that are in fact about 22% of the total. The non-detection of corresponding prompt X-ray counterparts to short bursts in the Wide Field Cameras, in about 3 years of operations, is discussed: with present data no implications on the X-to-gamma-ray spectra of short vs long GRBs may be inferred. Finally, the status of searching procedures at SOC is reviewed.
△ Less
Submitted 3 January, 2000;
originally announced January 2000.
-
BeppoSAX Observations of GRB980425: Detection of the Prompt Event and Monitoring of the Error Box
Authors:
E. Pian,
L. Amati,
L. A. Antonelli,
R. C. Butler,
E. Costa,
G. Cusumano,
J. Danziger,
M. Feroci,
F. Fiore,
F. Frontera,
P. Giommi,
N. Masetti,
J. M. Muller,
L. Nicastro,
T. Oosterbroek,
M. Orlandini,
A. Owens,
E. Palazzi,
A. Parmar,
L. Piro,
J. J. M. in 't Zand,
A. Castro-Tirado,
A. Coletta,
D. Dal Fiume,
S. Del Sordo
, et al. (3 additional authors not shown)
Abstract:
We present BeppoSAX follow-up observations of GRB980425 obtained with the Narrow Field Instruments (NFI) in April, May, and November 1998. The first NFI observation has detected within the 8' radius error box of the GRB an X-ray source positionally consistent with the supernova 1998bw, which exploded within a day of GRB980425, and a fainter X-ray source, not consistent with the position of the s…
▽ More
We present BeppoSAX follow-up observations of GRB980425 obtained with the Narrow Field Instruments (NFI) in April, May, and November 1998. The first NFI observation has detected within the 8' radius error box of the GRB an X-ray source positionally consistent with the supernova 1998bw, which exploded within a day of GRB980425, and a fainter X-ray source, not consistent with the position of the supernova. The former source is detected in the following NFI pointings and exhibits a decline of a factor of two in six months. If it is associated with SN 1998bw, this is the first detection of X-ray emission from a Type I supernova above 2 keV. The latter source exhibits only marginally significant variability. The X-ray spectra and variability of the supernova are compared with thermal and non-thermal models of supernova high energy emission. Based on the BeppoSAX data, it is not possible to firmly establish which of the two detected sources is the GRB X-ray counterpart, although probability considerations favor the supernova.
△ Less
Submitted 13 October, 1999;
originally announced October 1999.
-
The 0.1-100 keV spectral shape and variability of Mkn421 in high state
Authors:
A. Malizia,
M. Capalbi,
F. Fiore,
P. Giommi,
G. Gandolfi,
A. Tesseri,
L. A. Antonelli,
R. C. Butler,
G. Celidonio,
A. Coletta,
L. Di Ciolo,
J. M. Muller,
L. Piro,
S. Rebecchi,
D. Ricci,
R. Ricci,
M. Smith,
V. Torroni
Abstract:
The results of a BeppoSAX TOO observation of the BL Lac object Mkn421 during a high intensity state are reported and compared with monitoring X-ray data collected with the BeppoSAX Wide Field Cameras (WFC) and the RXTE All Sky Monitor(ASM). The 0.1-100 keV spectrum of Mkn421 shows continuous convex curvature that can be interpreted as the high-energy end of the synchrotron emission. The source s…
▽ More
The results of a BeppoSAX TOO observation of the BL Lac object Mkn421 during a high intensity state are reported and compared with monitoring X-ray data collected with the BeppoSAX Wide Field Cameras (WFC) and the RXTE All Sky Monitor(ASM). The 0.1-100 keV spectrum of Mkn421 shows continuous convex curvature that can be interpreted as the high-energy end of the synchrotron emission. The source shows significant short-term temporal and spectral variability, which can be interpreted in terms of synchrotron cooling. The comparison of our results with those of previous observations when the source was a factor 3-5 fainter shows evidence for strong spectral variability, with the maximum of the synchrotron power shifting to higher energy during high states. This behaviour suggest an increase in the number of energetic electrons during high states.
△ Less
Submitted 13 September, 1999;
originally announced September 1999.
-
BeppoSAX Detection and Follow-up of GRB980425
Authors:
E. Pian,
L. Amati,
L. A. Antonelli,
R. C. Butler,
E. Costa,
G. Cusumano,
J. Danziger,
M. Feroci,
F. Fiore,
F. Frontera,
P. Giommi,
N. Masetti,
J. M. Muller,
T. Oosterbroek,
A. Owens,
E. Palazzi,
L. Piro,
A. Castro-Tirado,
A. Coletta,
D. Dal Fiume,
S. Del Sordo,
J. Heise,
L. Nicastro,
M. Orlandini,
A. Parmar
, et al. (3 additional authors not shown)
Abstract:
We present BeppoSAX GRBM and WFC light curves of GRB980425 and NFI follow-up data taken in 1998 April, May, and November. The first NFI observation has detected within the 8' radius error box of the GRB an X-ray source positionally consistent with the supernova SN 1998bw, exploded within a day of GRB980425, and a fainter X-ray source, not consistent with the position of the supernova. The former…
▽ More
We present BeppoSAX GRBM and WFC light curves of GRB980425 and NFI follow-up data taken in 1998 April, May, and November. The first NFI observation has detected within the 8' radius error box of the GRB an X-ray source positionally consistent with the supernova SN 1998bw, exploded within a day of GRB980425, and a fainter X-ray source, not consistent with the position of the supernova. The former source is detected in the following NFI pointings and exhibits a decline of a factor of two in six months. If it is associated with SN 1998bw, this is the first detection of hard X-ray emission from a Type I supernova. The latter source exhibits only marginally significant variability. Based on these data, it is not possible to select either source as a firm candidate for the GRB counterpart.
△ Less
Submitted 13 July, 1999; v1 submitted 8 March, 1999;
originally announced March 1999.
-
Gamma-Ray Burst 980329 and its X-Ray Afterglow
Authors:
J. J. M. in 't Zand,
L. Amati,
L. A. Antonelli,
R. C. Butler,
A. J. Castro-Tirado,
A. Coletta,
E. Costa,
M. Feroci,
F. Frontera,
J. Heise,
S. Molendi,
L. Nicastro,
A. Owens,
E. Palazzi,
E. Pian,
L. Piro,
G. Pizzichini,
M. J. S. Smith,
M. Tavani
Abstract:
GRB 980329 is the brightest gamma-ray burst detected so far with the Wide Field Cameras aboard BeppoSAX, both in gamma-rays and X-rays. With respect to its fluence (2.6 X 10**-5 erg/s/cm**2 in 50 to 300 keV) it would be in the top 4% of gamma-ray bursts in the 4B catalog (Meegan et al. 1998). The time-averaged burst spectrum from 2 to 20 and 70 to 650 keV can be well described by the empirical m…
▽ More
GRB 980329 is the brightest gamma-ray burst detected so far with the Wide Field Cameras aboard BeppoSAX, both in gamma-rays and X-rays. With respect to its fluence (2.6 X 10**-5 erg/s/cm**2 in 50 to 300 keV) it would be in the top 4% of gamma-ray bursts in the 4B catalog (Meegan et al. 1998). The time-averaged burst spectrum from 2 to 20 and 70 to 650 keV can be well described by the empirical model of Band et al. (1993). The resulting photon index above the break energy is exceptionally hard at -1.32 +/- 0.03. An X-ray afterglow was detected with the narrow-field instruments aboard BeppoSAX 7 h after the event within the error box as determined with the Wide Field Cameras. Its peak flux is (1.4 +/- 0.2) X 10**-12 erg/s/cm**2 (2 to 10 keV). The afterglow decayed according to a power law function with an index of -1.35 +/- 0.03. GRB 980329 is characterized by being bright and hard, and lacking strong spectral evolution.
△ Less
Submitted 30 July, 1998;
originally announced July 1998.
-
In-flight performances of the BeppoSAX Gamma-Ray Burst Monitor
Authors:
M. Feroci,
F. Frontera,
E. Costa,
D. Dal Fiume,
L. Amati,
L. Bruca,
M. N. Cinti,
A. Coletta,
P. Collina,
C. Guidorzi,
L. Nicastro,
M. Orlandini,
E. Palazzi,
M. Rapisarda,
G. Zavattini,
R. C. Butler
Abstract:
The Italian-Dutch satellite for X-ray Astronomy BeppoSAX is successfully operating on a 600 km equatorial orbit since May 1996. We present here the in-flight performances of the Gamma Ray Burst Monitor experiment during its first year of operation. The GRBM is the secondary function of the four CsI(Na) slabs primarily operating as an active anticoincidence of the PDS hard X-ray experiment.. It h…
▽ More
The Italian-Dutch satellite for X-ray Astronomy BeppoSAX is successfully operating on a 600 km equatorial orbit since May 1996. We present here the in-flight performances of the Gamma Ray Burst Monitor experiment during its first year of operation. The GRBM is the secondary function of the four CsI(Na) slabs primarily operating as an active anticoincidence of the PDS hard X-ray experiment.. It has a geometric area of about 4000 cm2 but, due to its location in the core of the satellite its effective area is dependent on the energy and direction of the impinging photons. A dedicated electronics allows to trigger on cosmic gamma-ray bursts. When the trigger condition is satisfied the light curve of the event is recorded from 8 s before to 98 s after the trigger time, with a maximum time resolution of 0.48 ms, in an energy band of 40-700 keV.
△ Less
Submitted 19 August, 1997;
originally announced August 1997.
-
Discovery of the X-Ray Afterglow of the Gamma-Ray Burst of February 28 1997
Authors:
E. Costa,
F. Frontera,
J. Heise,
M. Feroci,
J. in 't Zand,
F. Fiore,
M. N. Cinti,
D. Dal Fiume,
L. Nicastro,
M. Orlandini,
E. Palazzi,
M. Rapisarda,
G. Zavattini,
R. Jager,
A. Parmar,
A. Owens,
S. Molendi,
G. Cusumano,
M. C. Maccarone,
S. Giarrusso,
A. Coletta,
L. A. Antonelli,
P. Giommi,
J. M. Muller,
L. Piro
, et al. (1 additional authors not shown)
Abstract:
Here we report the discovery in the X-ray band of the first afterglow of a gamma-ray burst. It was detected and quickly positioned by the Beppo-SAX satellite on 1997 February 28 (GRB970228). The X-ray afterglow source was detected with the X-ray telescopes aboard the same satellite about eight hours after the burst and faded away in a few days with a power law decay function. The energetic conte…
▽ More
Here we report the discovery in the X-ray band of the first afterglow of a gamma-ray burst. It was detected and quickly positioned by the Beppo-SAX satellite on 1997 February 28 (GRB970228). The X-ray afterglow source was detected with the X-ray telescopes aboard the same satellite about eight hours after the burst and faded away in a few days with a power law decay function. The energetic content of the X-ray afterglow results to be a significant fraction of gamma-ray burst energetics. The Beppo-SAX detection and fast imaging of GRB970228 started a multiwavelength campaign that lead to the identification of a fading optical source in a position consistent with the X-ray source.
△ Less
Submitted 6 June, 1997;
originally announced June 1997.