Showing 1–8 of 8 results for author: Belkadi, S

  1. arXiv:2409.11897  [pdf, other]

    cs.RO cs.CR cs.LG

    Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks

    Authors: Samuel Belkadi

    Abstract: The problem of safety for robotic systems has been extensively studied. However, little attention has been given to security issues for three-dimensional systems, such as quadrotors. Malicious adversaries can compromise robot sensors and communication networks, causing incidents, achieving illegal objectives, or even injuring people. This study first designs an intelligent control system for auton…

    Submitted 18 September, 2024; originally announced September 2024.

    Comments: The paper is based on an undergraduate thesis and is not intended for publication in a journal

  2. arXiv:2409.09831  [pdf, other]

    cs.CL cs.LG

    Generating Synthetic Free-text Medical Records with Low Re-identification Risk using Masked Language Modeling

    Authors: Samuel Belkadi, Libo Ren, Nicolo Micheletti, Lifeng Han, Goran Nenadic

    Abstract: In this paper, we present a system that generates synthetic free-text medical records, such as discharge summaries, admission notes and doctor correspondences, using Masked Language Modeling (MLM). Our system is designed to preserve the critical information of the records while introducing significant diversity and minimizing re-identification risk. The system incorporates a de-identification comp…

    Submitted 17 September, 2024; v1 submitted 15 September, 2024; originally announced September 2024.

    Comments: Added references and rephrased some sentences

  3. arXiv:2409.09501  [pdf, other]

    cs.CL cs.AI

    Synthetic4Health: Generating Annotated Synthetic Clinical Letters

    Authors: Libo Ren, Samuel Belkadi, Lifeng Han, Warren Del-Pinto, Goran Nenadic

    Abstract: Since clinical letters contain sensitive information, clinical datasets cannot be widely used in model training, medical research, and teaching. This work aims to generate reliable, diverse, and de-identified synthetic clinical letters. To achieve this goal, we explored different pre-trained language models (PLMs) for masking and generating text. After that, we worked on Bio_ClinicalB…

    Submitted 14 September, 2024; originally announced September 2024.

    Comments: ongoing work, 48 pages

  4. arXiv:2408.03871  [pdf, other]

    cs.CL cs.AI

    Large Language Models for Biomedical Text Simplification: Promising But Not There Yet

    Authors: Zihao Li, Samuel Belkadi, Nicolo Micheletti, Lifeng Han, Matthew Shardlow, Goran Nenadic

    Abstract: In this system report, we describe the models and methods we used for our participation in the PLABA2023 task on biomedical abstract simplification, part of the TAC 2023 tracks. The system outputs we submitted come from the following three categories: 1) domain fine-tuned T5-like models including Biomedical-T5 and Lay-SciFive; 2) a fine-tuned BARTLarge model with controllable attributes (via tokens)…

    Submitted 24 September, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: Extended system report for PLABA-2023. arXiv admin note: substantial text overlap with arXiv:2309.13202

  5. arXiv:2405.12630  [pdf, other]

    cs.CL cs.AI

    Exploration of Masked and Causal Language Modelling for Text Generation

    Authors: Nicolo Micheletti, Samuel Belkadi, Lifeng Han, Goran Nenadic

    Abstract: Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation, Causal Language Modelling (CLM), which generates text sequentially from left to right, inherently limits the freedom of the model, which does not decide when a…

    Submitted 8 August, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: working paper - under review

  6. arXiv:2310.19727  [pdf, other]

    cs.CL cs.AI cs.LG

    Generating Medical Prescriptions with Conditional Transformer

    Authors: Samuel Belkadi, Nicolo Micheletti, Lifeng Han, Warren Del-Pinto, Goran Nenadic

    Abstract: Access to real-world medication prescriptions is essential for medical research and healthcare quality improvement. However, access to real medication prescriptions is often limited due to the sensitive nature of the information expressed. Additionally, manually labelling these instructions for training and fine-tuning Natural Language Processing (NLP) models can be tedious and expensive. We intro…

    Submitted 18 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to: Workshop on Synthetic Data Generation with Generative AI (SyntheticData4ML Workshop) at NeurIPS 2023

  7. arXiv:2309.13202  [pdf, other]

    cs.CL cs.AI

    Investigating Large Language Models and Control Mechanisms to Improve Text Readability of Biomedical Abstracts

    Authors: Zihao Li, Samuel Belkadi, Nicolo Micheletti, Lifeng Han, Matthew Shardlow, Goran Nenadic

    Abstract: Biomedical literature often uses complex language and inaccessible professional terminology, which is why simplification plays an important role in improving public health literacy. Applying Natural Language Processing (NLP) models to automate such tasks allows for quick and direct accessibility for lay readers. In this work, we investigate the ability of state-of-the-art large language models (L…

    Submitted 16 March, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE-ICHI 2024 https://ieeeichi2024.github.io/

  8. arXiv:2210.12770  [pdf, other]

    cs.CL cs.AI cs.LG

    Exploring the Value of Pre-trained Language Models for Clinical Named Entity Recognition

    Authors: Samuel Belkadi, Lifeng Han, Yuping Wu, Goran Nenadic

    Abstract: The practice of fine-tuning Pre-trained Language Models (PLMs) from general or domain-specific data to a specific task with limited resources has gained popularity within the field of natural language processing (NLP). In this work, we revisit this assumption and carry out an investigation in clinical NLP, specifically Named Entity Recognition on drugs and their related attributes. We compare Tr…

    Submitted 30 October, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: working paper - Large Language Models, Fine-tuning LLMs, Clinical NLP, Medication Mining, AI for Healthcare