-
UF-HOBI at "Discharge Me!": A Hybrid Solution for Discharge Summary Generation Through Prompt-based Tuning of GatorTronGPT Models
Authors:
Mengxian Lyu,
Cheng Peng,
Daniel Paredes,
Ziyi Chen,
Aokun Chen,
Jiang Bian,
Yonghui Wu
Abstract:
Automatic generation of discharge summaries presents significant challenges due to the length of clinical documentation, the dispersed nature of patient information, and the diverse terminology used in healthcare. This paper presents a hybrid solution for generating discharge summary sections as part of our participation in the "Discharge Me!" Challenge at the BioNLP 2024 Shared Task. We developed a two-stage generation method using both extractive and abstractive techniques, in which we first apply named entity recognition (NER) to extract key clinical concepts, which are then used as input for a prompt-tuning-based GatorTronGPT model to generate coherent text for two important sections: "Brief Hospital Course" and "Discharge Instructions". Our system was ranked 5th in this challenge, achieving an overall score of 0.284. The results demonstrate the effectiveness of our hybrid solution in improving the quality of automated discharge section generation.
Submitted 22 July, 2024;
originally announced July 2024.
-
Robust hybrid finite element methods for reaction-dominated diffusion problems
Authors:
Thomas Führer,
Diego Paredes
Abstract:
For a reaction-dominated diffusion problem we study a primal and a dual hybrid finite element method where weak continuity conditions are enforced by Lagrange multipliers. Uniform robustness of the discrete methods is achieved by enriching the local discretization spaces with modified face bubble functions which decay exponentially in the interior of an element depending on the ratio of the singular perturbation parameter and the local mesh-size. A posteriori error estimators are derived using Fortin operators. They are robust with respect to the singular perturbation parameter. Numerical experiments are presented that show that oscillations, if present, are significantly smaller than those observed in common finite element methods.
Submitted 19 April, 2024;
originally announced April 2024.
-
Testing precision and accuracy of weak value measurements in an IBM quantum system
Authors:
David R. A. Ruelas Paredes,
Mariano Uria,
Eduardo Massoni,
Francisco De Zela
Abstract:
Historically, weak values have been associated with weak measurements performed on quantum systems. Over the past two decades, a series of works have shown that weak values can be determined via measurements of arbitrary strength. One such proposal by Denkmayr et al. [Phys. Rev. Lett. 118, 010402 (2017)], carried out in neutron interferometry experiments, yielded better outcomes for strong than for weak measurements. We extend this scheme and explain how to implement it in an optical setting as well as in a quantum computational context. Our implementation in a quantum computing system provided by IBM confirms that weak values can be measured, with varying degrees of performance, over a range of measurement strengths. However, at least for this model, strong measurements do not always perform better than weak ones.
Submitted 1 September, 2023;
originally announced September 2023.
-
Extracting Thyroid Nodules Characteristics from Ultrasound Reports Using Transformer-based Natural Language Processing Methods
Authors:
Aman Pathak,
Zehao Yu,
Daniel Paredes,
Elio Paul Monsour,
Andrea Ortiz Rocha,
Juan P. Brito,
Naykky Singh Ospina,
Yonghui Wu
Abstract:
The ultrasound characteristics of thyroid nodules guide the evaluation of thyroid cancer in patients with thyroid nodules. However, the characteristics of thyroid nodules are often documented in clinical narratives such as ultrasound reports. Previous studies have examined natural language processing (NLP) methods in extracting a limited number of characteristics (<9) using rule-based NLP systems. In this study, a multidisciplinary team of NLP experts and thyroid specialists identified thyroid nodule characteristics that are important for clinical care, composed annotation guidelines, developed a corpus, and compared 5 state-of-the-art transformer-based NLP methods, including BERT, RoBERTa, LongFormer, DeBERTa, and GatorTron, for extraction of thyroid nodule characteristics from ultrasound reports. Our GatorTron model, a transformer-based large language model trained using over 90 billion words of text, achieved the best strict and lenient F1-scores of 0.8851 and 0.9495 for the extraction of a total of 16 thyroid nodule characteristics, and 0.9321 for linking characteristics to nodules, outperforming other clinical transformer models. To the best of our knowledge, this is the first study to systematically categorize and apply transformer-based NLP models to extract a large number of clinically relevant thyroid nodule characteristics from ultrasound reports. This study lays the groundwork for assessing the documentation quality of thyroid ultrasound reports and examining outcomes of patients with thyroid nodules using electronic health records.
Submitted 31 March, 2023;
originally announced April 2023.
-
Identifying Symptoms of Delirium from Clinical Narratives Using Natural Language Processing
Authors:
Aokun Chen,
Daniel Paredes,
Zehao Yu,
Xiwei Lou,
Roberta Brunson,
Jamie N. Thomas,
Kimberly A. Martinez,
Robert J. Lucero,
Tanja Magoc,
Laurence M. Solberg,
Urszula A. Snigurska,
Sarah E. Ser,
Mattia Prosperi,
Jiang Bian,
Ragnhildur I. Bjarnadottir,
Yonghui Wu
Abstract:
Delirium is an acute decline or fluctuation in attention, awareness, or other cognitive function that can lead to serious adverse outcomes. Despite the severe outcomes, delirium is frequently unrecognized and uncoded in patients' electronic health records (EHRs) due to its transient and diverse nature. Natural language processing (NLP), a key technology that extracts medical concepts from clinical narratives, has shown great potential in studies of delirium outcomes and symptoms. To assist in the diagnosis and phenotyping of delirium, we formed an expert panel to categorize diverse delirium symptoms, composed annotation guidelines, created a delirium corpus with diverse delirium symptoms, and developed NLP methods to extract delirium symptoms from clinical notes. We compared 5 state-of-the-art transformer models including 2 models (BERT and RoBERTa) from the general domain and 3 models (BERT_MIMIC, RoBERTa_MIMIC, and GatorTron) from the clinical domain. GatorTron achieved the best strict and lenient F1 scores of 0.8055 and 0.8759, respectively. We conducted an error analysis to identify challenges in annotating delirium symptoms and developing NLP systems. To the best of our knowledge, this is the first large language model-based delirium symptom extraction system. Our study lays the foundation for the future development of computable phenotypes and diagnosis methods for delirium.
Submitted 31 March, 2023;
originally announced April 2023.
-
Link between interlayer hybridization and ultrafast charge transfer in WS$_2$-graphene heterostructures
Authors:
Niklas Hofmann,
Leonard Weigl,
Johannes Gradl,
Neeraj Mishra,
Giorgio Orlandini,
Stiven Forti,
Camilla Coletti,
Simone Latini,
Lede Xian,
Angel Rubio,
Dilan Perez Paredes,
Raul Perea Causin,
Samuel Brem,
Ermin Malic,
Isabella Gierz
Abstract:
Ultrafast charge separation after photoexcitation is a common phenomenon in various van-der-Waals (vdW) heterostructures with great relevance for future applications in light harvesting and detection. Theoretical understanding of this phenomenon converges towards a coherent mechanism through charge transfer states accompanied by energy dissipation into strongly coupled phonons. The detailed microscopic pathways are material specific as they sensitively depend on the band structures of the individual layers, the relative band alignment in the heterostructure, the twist angle between the layers, and interlayer interactions resulting in hybridization. We used time- and angle-resolved photoemission spectroscopy combined with tight binding and density functional theory electronic structure calculations to investigate ultrafast charge separation and recombination in WS$_2$-graphene vdW heterostructures. We identify several avoided crossings in the band structure and discuss their relevance for ultrafast charge transfer. We relate our own observations to existing theoretical models and propose a unified picture for ultrafast charge transfer in vdW heterostructures where band alignment and twist angle emerge as the most important control parameters.
Submitted 21 March, 2023;
originally announced March 2023.
-
Opening Access to Visual Exploration of Audiovisual Digital Biomarkers: an OpenDBM Analytics Tool
Authors:
Carla Floricel,
Jacob Epifano,
Stephanie Caamano,
Sarah Kark,
Rich Christie,
Aaron Masino,
Andre D Paredes
Abstract:
Digital biomarkers (DBMs) are a growing field and are increasingly tested in the therapeutic areas of psychiatric and neurodegenerative disorders. Meanwhile, isolated silos of knowledge about audiovisual DBM use in industry, academia, and clinics hinder their widespread adoption in clinical research. How can we help non-technical domain experts explore audiovisual digital biomarkers? The use of open source software in biomedical research to extract patient behavior changes is growing and inspiring a shift toward accessibility to address this problem. OpenDBM integrates several popular audio and visual open source behavior extraction toolkits. We present a visual analysis tool as an extension of the growing open source software, OpenDBM, to promote the adoption of audiovisual DBMs in basic and applied research. Our tool illustrates patterns in behavioral data while supporting interactive visual analysis of any subset of derived or raw DBM variables extracted through OpenDBM.
Submitted 4 October, 2022;
originally announced October 2022.
-
A Novel Framework for Characterization of Tumor-Immune Spatial Relationships in Tumor Microenvironment
Authors:
Mahmudul Hasan,
Jakub R. Kaczmarzyk,
David Paredes,
Lyanne Oblein,
Jaymie Oentoro,
Shahira Abousamra,
Michael Horowitz,
Dimitris Samaras,
Chao Chen,
Tahsin Kurc,
Kenneth R. Shroyer,
Joel Saltz
Abstract:
Understanding the impact of tumor biology on the composition of nearby cells often requires characterizing the impact of biologically distinct tumor regions. Biomarkers have been developed to label biologically distinct tumor regions, but challenges arise because of differences in the spatial extent and distribution of differentially labeled regions. In this work, we present a framework for systematically investigating the impact of distinct tumor regions on cells near the tumor borders, accounting for their cross-spatial distributions. We apply the framework to multiplex immunohistochemistry (mIHC) studies of pancreatic cancer and show its efficacy in demonstrating how biologically different tumor regions impact the immune response in the tumor microenvironment. Furthermore, we show that the proposed framework can be extended to large-scale whole slide image analysis.
Submitted 1 May, 2022; v1 submitted 23 April, 2022;
originally announced April 2022.
-
On the Implementation of a Scalable Simulator for Multiscale Hybrid-Mixed Methods
Authors:
Antonio Tadeu A. Gomes,
Weslley S. Pereira,
Frederic Valentin,
Diego Paredes
Abstract:
The family of Multiscale Hybrid-Mixed (MHM) finite element methods has received considerable attention from the mathematics and engineering community in the last few years. The MHM methods allow solving highly heterogeneous problems on coarse meshes while providing solutions with high-order precision. They embed independent local problems which are responsible for upscaling unresolved scales into the numerical solution. These local contributions are brought together through a global problem defined on the skeleton of the coarse partition. Since the local problems are completely independent, they can be easily computed in parallel. In this paper, we present two simulator prototypes specifically crafted for the MHM methods, which adopt two different implementation strategies: (i) a multi-programming language approach, each language tackling different simulation issues; and (ii) a classical, single-programming language approach. Specifically, we use C++ for numerical computation of the global and local problems in a modular way; for process distribution in the simulator, we adopt the Erlang concurrent language in the first approach, and the MPI standard in the second approach. The aim of exploring these different approaches is twofold: (i) allow for the deployment of the simulator both in high-performance computing (with MPI) and in cloud computing environments (with Erlang); and (ii) pave the way for further exploration of quality attributes related to software productivity and fault-tolerance, which are key to Exascale systems. We present a performance evaluation of the two simulator prototypes taking into account their efficiency.
Submitted 30 March, 2017;
originally announced March 2017.
-
Matching for balance, pairing for heterogeneity in an observational study of the effectiveness of for-profit and not-for-profit high schools in Chile
Authors:
José R. Zubizarreta,
Ricardo D. Paredes,
Paul R. Rosenbaum
Abstract:
Conventionally, the construction of a pair-matched sample selects treated and control units and pairs them in a single step with a view to balancing observed covariates $\mathbf{x}$ and reducing the heterogeneity or dispersion of treated-minus-control response differences, $Y$. In contrast, the method of cardinality matching developed here first selects the maximum number of units subject to covariate balance constraints and, with a balanced sample for $\mathbf{x}$ in hand, then separately pairs the units to minimize heterogeneity in $Y$. Reduced heterogeneity of pair differences in responses $Y$ is known to reduce sensitivity to unmeasured biases, so one might hope that cardinality matching would succeed at both tasks, balancing $\mathbf{x}$, stabilizing $Y$. We use cardinality matching in an observational study of the effectiveness of for-profit and not-for-profit private high schools in Chile - a controversial subject in Chile - focusing on students who were in government run primary schools in 2004 but then switched to private high schools. By pairing to minimize heterogeneity in a cardinality match that has balanced covariates, a meaningful reduction in sensitivity to unmeasured biases is obtained.
Submitted 14 April, 2014;
originally announced April 2014.