subscribe to arXiv mailings

Seamless Monitoring of Stress Levels Leveraging a Universal Model for Time Sequences

Authors: Davide Gabrielli, Bardh Prenkaj, Paola Velardi

Abstract: Monitoring the stress level in patients with neurodegenerative diseases can help manage symptoms, improve patient's quality of life, and provide insight into disease progression. In the literature, ECG, actigraphy, speech, voice, and facial analysis have proven effective at detecting patients' emotions. On the other hand, these tools are invasive and do not integrate smoothly into the patient's da… ▽ More Monitoring the stress level in patients with neurodegenerative diseases can help manage symptoms, improve patient's quality of life, and provide insight into disease progression. In the literature, ECG, actigraphy, speech, voice, and facial analysis have proven effective at detecting patients' emotions. On the other hand, these tools are invasive and do not integrate smoothly into the patient's daily life. HRV has also been proven to effectively indicate stress conditions, especially in combination with other signals. However, when HRV is derived from less invasive devices than the ECG, like smartwatches and bracelets, the quality of measurements significantly degrades. This paper presents a methodology for stress detection from a smartwatch based on a universal model for time series, UniTS, which we fine-tuned for the task. We cast the problem as anomaly detection rather than classification to favor model adaptation to individual patients and allow the clinician to maintain greater control over the system's predictions. We demonstrate that our proposed model considerably surpasses 12 top-performing methods on 3 benchmark datasets. Furthermore, unlike other state-of-the-art systems, UniTS enables seamless monitoring, as it shows comparable performance when using signals from invasive or lightweight devices. △ Less

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2406.15259 [pdf, other]

V-RECS, a Low-Cost LLM4VIS Recommender with Explanations, Captioning and Suggestions

Authors: Luca Podo, Marco Angelini, Paola Velardi

Abstract: NL2VIS (natural language to visualization) is a promising and recent research area that involves interpreting natural language queries and translating them into visualizations that accurately represent the underlying data. As we navigate the era of big data, NL2VIS holds considerable application potential since it greatly facilitates data exploration by non-expert users. Following the increasingly… ▽ More NL2VIS (natural language to visualization) is a promising and recent research area that involves interpreting natural language queries and translating them into visualizations that accurately represent the underlying data. As we navigate the era of big data, NL2VIS holds considerable application potential since it greatly facilitates data exploration by non-expert users. Following the increasingly widespread usage of generative AI in NL2VIS applications, in this paper we present V-RECS, the first LLM-based Visual Recommender augmented with explanations(E), captioning(C), and suggestions(S) for further data exploration. V-RECS' visualization narratives facilitate both response verification and data exploration by non-expert users. Furthermore, our proposed solution mitigates computational, controllability, and cost issues associated with using powerful LLMs by leveraging a methodology to effectively fine-tune small models. To generate insightful visualization narratives, we use Chain-of-Thoughts (CoT), a prompt engineering technique to help LLM identify and generate the logical steps to produce a correct answer. Since CoT is reported to perform poorly with small LLMs, we adopted a strategy in which a large LLM (GPT-4), acting as a Teacher, generates CoT-based instructions to fine-tune a small model, Llama-2-7B, which plays the role of a Student. Extensive experiments-based on a framework for the quantitative evaluation of AI-based visualizations and on manual assessment by a group of participants-show that V-RECS achieves performance scores comparable to GPT-4, at a much lower cost. The efficacy of the V-RECS teacher-student paradigm is also demonstrated by the fact that the un-tuned Llama fails to perform the task in the vast majority of test cases. We release V-RECS for the visualization community to assist visualization designers throughout the entire visualization generation process. △ Less

Submitted 31 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

arXiv:2308.01915 [pdf, other]

LOB-Based Deep Learning Models for Stock Price Trend Prediction: A Benchmark Study

Authors: Matteo Prata, Giuseppe Masi, Leonardo Berti, Viviana Arrigoni, Andrea Coletta, Irene Cannistraci, Svitlana Vyetrenko, Paola Velardi, Novella Bartolini

Abstract: The recent advancements in Deep Learning (DL) research have notably influenced the finance sector. We examine the robustness and generalizability of fifteen state-of-the-art DL models focusing on Stock Price Trend Prediction (SPTP) based on Limit Order Book (LOB) data. To carry out this study, we developed LOBCAST, an open-source framework that incorporates data preprocessing, DL model training, e… ▽ More The recent advancements in Deep Learning (DL) research have notably influenced the finance sector. We examine the robustness and generalizability of fifteen state-of-the-art DL models focusing on Stock Price Trend Prediction (SPTP) based on Limit Order Book (LOB) data. To carry out this study, we developed LOBCAST, an open-source framework that incorporates data preprocessing, DL model training, evaluation and profit analysis. Our extensive experiments reveal that all models exhibit a significant performance drop when exposed to new data, thereby raising questions about their real-world market applicability. Our work serves as a benchmark, illuminating the potential and the limitations of current approaches and providing insight for innovative solutions. △ Less

Submitted 19 September, 2023; v1 submitted 5 July, 2023; originally announced August 2023.

arXiv:2302.06304 [pdf, other]

Programming Skills are Not Enough: a Greedy Strategy to Attract More Girls to Study Computer Science

Authors: Tiziana Catarci, Luca Podo, Daniel Raffini, Paola Velardi

Abstract: It has been observed in many studies that female students in general are unwilling to undertake a course of study in ICT. Recent literature has also pointed out that undermining the prejudices of girls with respect to these disciplines is very difficult in adolescence, suggesting that, to be effective, awareness programs on computer disciplines should be offered in pre-school or lower school age.… ▽ More It has been observed in many studies that female students in general are unwilling to undertake a course of study in ICT. Recent literature has also pointed out that undermining the prejudices of girls with respect to these disciplines is very difficult in adolescence, suggesting that, to be effective, awareness programs on computer disciplines should be offered in pre-school or lower school age. On the other hand, even assuming that large-scale computer literacy programs can be immediately activated in lower schools and kindergartens, we can't wait for >15-20 years before we can appreciate the effectiveness of these programs. The scarcity of women in ICT has a tangible negative impact on countries' technological innovation, which requires immediate action. In this paper, we describe a strategy, and the details of a number of programs coordinated by the Engineering and Computer Science Departments at Sapienza University, to make high school girl students aware of the importance of new technologies and ICT. In addition to describing the theoretical approach, the paper offers some project examples. △ Less

Submitted 11 September, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

Comments: 11 pages, 3 figures

ACM Class: K.4; K.4.2

arXiv:2302.06228 [pdf, ps, other]

doi 10.1109/TKDE.2023.3320184

Unsupervised Detection of Behavioural Drifts with Dynamic Clustering and Trajectory Analysis

Authors: Bardh Prenkaj, Paola Velardi

Abstract: Real-time monitoring of human behaviours, especially in e-Health applications, has been an active area of research in the past decades. On top of IoT-based sensing environments, anomaly detection algorithms have been proposed for the early detection of abnormalities. Gradual change procedures, commonly referred to as drift anomalies, have received much less attention in the literature because they… ▽ More Real-time monitoring of human behaviours, especially in e-Health applications, has been an active area of research in the past decades. On top of IoT-based sensing environments, anomaly detection algorithms have been proposed for the early detection of abnormalities. Gradual change procedures, commonly referred to as drift anomalies, have received much less attention in the literature because they represent a much more challenging scenario than sudden temporary changes (point anomalies). In this paper, we propose, for the first time, a fully unsupervised real-time drift detection algorithm named DynAmo, which can identify drift periods as they are happening. DynAmo comprises a dynamic clustering component to capture the overall trends of monitored behaviours and a trajectory generation component, which extracts features from the densest cluster centroids. Finally, we apply an ensemble of divergence tests on sliding reference and detection windows to detect drift periods in the behavioural sequence. △ Less

Submitted 14 December, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

Comments: Accepted to IEEE TKDE

Journal ref: IEEE Transactions on Knowledge and Data Engineering 2023

arXiv:2302.00569 [pdf, ps, other]

doi 10.1109/TVCG.2024.3374571

Agnostic Visual Recommendation Systems: Open Challenges and Future Directions

Authors: Luca Podo, Bardh Prenkaj, Paola Velardi

Abstract: Visualization Recommendation Systems (VRSs) are a novel and challenging field of study aiming to help generate insightful visualizations from data and support non-expert users in information discovery. Among the many contributions proposed in this area, some systems embrace the ambitious objective of imitating human analysts to identify relevant relationships in data and make appropriate design ch… ▽ More Visualization Recommendation Systems (VRSs) are a novel and challenging field of study aiming to help generate insightful visualizations from data and support non-expert users in information discovery. Among the many contributions proposed in this area, some systems embrace the ambitious objective of imitating human analysts to identify relevant relationships in data and make appropriate design choices to represent these relationships with insightful charts. We denote these systems as "agnostic" VRSs since they do not rely on human-provided constraints and rules but try to learn the task autonomously. Despite the high application potential of agnostic VRSs, their progress is hindered by several obstacles, including the absence of standardized datasets to train recommendation algorithms, the difficulty of learning design rules, and defining quantitative criteria for evaluating the perceptual effectiveness of generated plots. This paper summarizes the literature on agnostic VRSs and outlines promising future research directions. △ Less

Submitted 16 March, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

Comments: 16 pages, 4 figures

Journal ref: TVCG (2024)

arXiv:2206.06182 [pdf, other]

AI-based Data Preparation and Data Analytics in Healthcare: The Case of Diabetes

Authors: Marianna Maranghi, Aris Anagnostopoulos, Irene Cannistraci, Ioannis Chatzigiannakis, Federico Croce, Giulia Di Teodoro, Michele Gentile, Giorgio Grani, Maurizio Lenzerini, Stefano Leonardi, Andrea Mastropietro, Laura Palagi, Massimiliano Pappa, Riccardo Rosati, Riccardo Valentini, Paola Velardi

Abstract: The Associazione Medici Diabetologi (AMD) collects and manages one of the largest worldwide-available collections of diabetic patient records, also known as the AMD database. This paper presents the initial results of an ongoing project whose focus is the application of Artificial Intelligence and Machine Learning techniques for conceptualizing, cleaning, and analyzing such an important and valuab… ▽ More The Associazione Medici Diabetologi (AMD) collects and manages one of the largest worldwide-available collections of diabetic patient records, also known as the AMD database. This paper presents the initial results of an ongoing project whose focus is the application of Artificial Intelligence and Machine Learning techniques for conceptualizing, cleaning, and analyzing such an important and valuable dataset, with the goal of providing predictive insights to better support diabetologists in their diagnostic and therapeutic choices. △ Less

Submitted 20 July, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

Comments: The work has been presented at the conference Ital-IA 2022 (https://www.ital-ia2022.it/)

arXiv:2104.00386 [pdf, other]

A network-based analysis of disease modules from a taxonomic perspective

Authors: Giorgio Grani, Lorenzo Madeddu, Paola Velardi

Abstract: Objective: Human-curated disease ontologies are widely used for diagnostic evaluation, treatment and data comparisons over time, and clinical decision support. The classification principles underlying these ontologies are guided by the analysis of observable pathological similarities between disorders, often based on anatomical or histological principles. Although, thanks to recent advances in mol… ▽ More Objective: Human-curated disease ontologies are widely used for diagnostic evaluation, treatment and data comparisons over time, and clinical decision support. The classification principles underlying these ontologies are guided by the analysis of observable pathological similarities between disorders, often based on anatomical or histological principles. Although, thanks to recent advances in molecular biology, disease ontologies are slowly changing to integrate the etiological and genetic origins of diseases, nosology still reflects this "reductionist" perspective. Proximity relationships of disease modules (hereafter DMs) in the human interactome network are now increasingly used in diagnostics, to identify pathobiologically similar diseases and to support drug repurposing and discovery. On the other hand, similarity relations induced from structural proximity of DMs also have several limitations, such as incomplete knowledge of disease-gene relationships and reliability of clinical trials to assess their validity. The purpose of the study described in this paper is to shed more light on disease similarities by analyzing the relationship between categorical proximity of diseases in human-curated ontologies and structural proximity of the related DM in the interactome. Method: We propose a methodology (and related algorithms) to automatically induce a hierarchical structure from proximity relations between DMs, and to compare this structure with a human-curated disease taxonomy. Results: We demonstrate that the proposed methodology allows to systematically analyze commonalities and differences among structural and categorical similarity of human diseases, help refine and extend human disease classification systems, and identify promising network areas where new disease-gene interactions can be discovered. △ Less

Submitted 1 April, 2021; originally announced April 2021.

Comments: 8 pages, 5 figures

ACM Class: J.3

arXiv:1902.10117 [pdf, other]

Network-based methods for disease-gene prediction

Authors: Lorenzo Madeddu, Giovanni Stilo, Paola Velardi

Abstract: We predict disease-genes relations on the Human Interactome network using a methodology that jointly learns functional and connectivity patterns surrounding proteins. Contrary to other data structures, the Interactome is characterized by high incompleteness and absence of explicit negative knowledge, which makes predictive tasks particularly challenging. To exploit at best latent information in th… ▽ More We predict disease-genes relations on the Human Interactome network using a methodology that jointly learns functional and connectivity patterns surrounding proteins. Contrary to other data structures, the Interactome is characterized by high incompleteness and absence of explicit negative knowledge, which makes predictive tasks particularly challenging. To exploit at best latent information in the network, we propose an extended version of random walks, named Random Watcher-Walker ($RW^2$), which is able to learn rich representations of disease genes (or gene products) features. Our method successfully compares with the best known system for disease gene prediction, and other state-of-the-art graph-based methods. We perform sensitivity analysis and apply perturbations to ensure robustness. In contrast with previous studies, our results demonstrate that connectivity alone is not sufficient to classify disease-related genes. △ Less

Submitted 26 February, 2019; originally announced February 2019.

arXiv:1902.06548 [pdf, other]

Quality of Life Assessment of Diabetic patients from health-related blogs

Authors: Andrea Lenzi, Marianna Maranghi, Giovanni Stilo, Paola Velardi

Abstract: Motivations: People are generating an enormous amount of social data to describe their health care experiences, and continuously search information about diseases, symptoms, diagnoses, doctors, treatment options and medicines. The increasing availability of these social traces presents an interesting opportunity to enhance timeliness and efficiency of care. By collecting, analyzing and exploiting… ▽ More Motivations: People are generating an enormous amount of social data to describe their health care experiences, and continuously search information about diseases, symptoms, diagnoses, doctors, treatment options and medicines. The increasing availability of these social traces presents an interesting opportunity to enhance timeliness and efficiency of care. By collecting, analyzing and exploiting this information, it is possible to modify or in any case significantly improve our knowledge on the manifestation of a pathology and obtain a more detailed and nuanced vision of patients' experience, that we call the "social phenotype" of diseases. Materials and methods: In this paper we present a data analytic framework to represent, extract and analyze the social phenotype of diseases. To show the effectiveness of our methodology we presents a detailed case study on diabetes. First, we create a high quality data sample of diabetic patients' messages, extracted from popular medical forums during more than 10 years. Next, we use a topic extraction techniques based on latent analysis and word embeddings, to identify the main complications, the frequently reported symptoms and the common concerns of these patients. Results: We show that a freely manifested perception of a disease can be noticeably different from what is inferred from questionnaires, surveys and other common methodologies used to measure the impact of a disease on the patients' quality of life. In our case study on diabetes, we found that issues reported to have a daily impact on diabetic patients are diet, glycemic control, drugs and clinical tests. These problems are not commonly considered in Quality of Life assessments, since they are not perceived by doctors as representing severe limitations. △ Less

Submitted 18 February, 2019; originally announced February 2019.

arXiv:1507.04900 [pdf]

Analysis of women leadership in enterprise social networks

Authors: Giorgia Di Tommaso, Giovanni Stilo, Paola Velardi

Abstract: This paper describes a Social Network Analysis toolkit to monitor an Enterprise Social Network and help analyzing informal leadership as a function of social ties and topic discussions. The toolkit has been developed in the context of a regional project, Fiordaliso, funded by Regione Lazio (a region of central Italy) and leaded by Reply, an international network of specialized companies in the fie… ▽ More This paper describes a Social Network Analysis toolkit to monitor an Enterprise Social Network and help analyzing informal leadership as a function of social ties and topic discussions. The toolkit has been developed in the context of a regional project, Fiordaliso, funded by Regione Lazio (a region of central Italy) and leaded by Reply, an international network of specialized companies in the field of digital services. △ Less

Submitted 17 July, 2015; originally announced July 2015.

Showing 1–11 of 11 results for author: Velardi, P