-
Hadronic cross section measurements with the DAMPE space mission using 20GeV-10TeV cosmic-ray protons and $^4$He
Authors:
F. Alemanno,
Q. An,
P. Azzarello,
F. C. T. Barbato,
P. Bernardini,
X. J. Bi,
I. Cagnoli,
M. S. Cai,
E. Casilli,
E. Catanzani,
J. Chang,
D. Y. Chen,
J. L. Chen,
Z. F. Chen,
P. Coppin,
M. Y. Cui,
T. S. Cui,
Y. X. Cui,
H. T. Dai,
A. De Benedittis,
I. De Mitri,
F. de Palma,
A. Di Giovanni,
Q. Ding,
T. K. Dong
, et al. (126 additional authors not shown)
Abstract:
Precise direct cosmic-ray (CR) measurements provide an important probe to study the energetic particle sources in our Galaxy, and the interstellar environment through which these particles propagate. Uncertainties on hadronic models, ion-nucleon cross sections in particular, are currently the limiting factor towards obtaining more accurate CR ion flux measurements with calorimetric space-based exp…
▽ More
Precise direct cosmic-ray (CR) measurements provide an important probe to study the energetic particle sources in our Galaxy, and the interstellar environment through which these particles propagate. Uncertainties on hadronic models, ion-nucleon cross sections in particular, are currently the limiting factor towards obtaining more accurate CR ion flux measurements with calorimetric space-based experiments. We present an energy-dependent measurement of the inelastic cross section of protons and helium-4 nuclei (alpha particles) on a Bi$_4$Ge$_3$O$_{12}$ target, using 88 months of data collected by the DAMPE space mission. The kinetic energy range per nucleon of the measurement points ranges from 18 GeV to 9 TeV for protons, and from 5 GeV/n to 3 TeV/n for helium-4 nuclei. Our results lead to a significant improvement of the CR flux normalisation. In the case of helium-4, these results correspond to the first cross section measurements on a heavy target material at energies above 10 GeV/n.
△ Less
Submitted 30 August, 2024;
originally announced August 2024.
-
Refining Packing and Shuffling Strategies for Enhanced Performance in Generative Language Models
Authors:
Yanbing Chen,
Ruilin Wang,
Zihao Yang,
Lavender Yao Jiang,
Eric Karl Oermann
Abstract:
Packing and shuffling tokens is a common practice in training auto-regressive language models (LMs) to prevent overfitting and improve efficiency. Typically documents are concatenated to chunks of maximum sequence length (MSL) and then shuffled. However setting the atom size, the length for each data chunk accompanied by random shuffling, to MSL may lead to contextual incoherence due to tokens fro…
▽ More
Packing and shuffling tokens is a common practice in training auto-regressive language models (LMs) to prevent overfitting and improve efficiency. Typically documents are concatenated to chunks of maximum sequence length (MSL) and then shuffled. However setting the atom size, the length for each data chunk accompanied by random shuffling, to MSL may lead to contextual incoherence due to tokens from different documents being packed into the same chunk. An alternative approach is to utilize padding, another common data packing strategy, to avoid contextual incoherence by only including one document in each shuffled chunk. To optimize both packing strategies (concatenation vs padding), we investigated the optimal atom size for shuffling and compared their performance and efficiency. We found that matching atom size to MSL optimizes performance for both packing methods (concatenation and padding), and padding yields lower final perplexity (higher performance) than concatenation at the cost of more training steps and lower compute efficiency. This trade-off informs the choice of packing methods in training language models.
△ Less
Submitted 18 August, 2024;
originally announced August 2024.
-
Generalization in Healthcare AI: Evaluation of a Clinical Large Language Model
Authors:
Salman Rahman,
Lavender Yao Jiang,
Saadia Gabriel,
Yindalon Aphinyanaphongs,
Eric Karl Oermann,
Rumi Chunara
Abstract:
Advances in large language models (LLMs) provide new opportunities in healthcare for improved patient care, clinical decision-making, and enhancement of physician and administrator workflows. However, the potential of these models importantly depends on their ability to generalize effectively across clinical environments and populations, a challenge often underestimated in early development. To be…
▽ More
Advances in large language models (LLMs) provide new opportunities in healthcare for improved patient care, clinical decision-making, and enhancement of physician and administrator workflows. However, the potential of these models importantly depends on their ability to generalize effectively across clinical environments and populations, a challenge often underestimated in early development. To better understand reasons for these challenges and inform mitigation approaches, we evaluated ClinicLLM, an LLM trained on [HOSPITAL]'s clinical notes, analyzing its performance on 30-day all-cause readmission prediction focusing on variability across hospitals and patient characteristics. We found poorer generalization particularly in hospitals with fewer samples, among patients with government and unspecified insurance, the elderly, and those with high comorbidities. To understand reasons for lack of generalization, we investigated sample sizes for fine-tuning, note content (number of words per note), patient characteristics (comorbidity level, age, insurance type, borough), and health system aspects (hospital, all-cause 30-day readmission, and mortality rates). We used descriptive statistics and supervised classification to identify features. We found that, along with sample size, patient age, number of comorbidities, and the number of words in notes are all important factors related to generalization. Finally, we compared local fine-tuning (hospital specific), instance-based augmented fine-tuning and cluster-based fine-tuning for improving generalization. Among these, local fine-tuning proved most effective, increasing AUC by 0.25% to 11.74% (most helpful in settings with limited data). Overall, this study provides new insights for enhancing the deployment of large language models in the societally important domain of healthcare, and improving their performance for broader populations.
△ Less
Submitted 24 February, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
UV-Optical Emission of AB Aur b is Consistent with Scattered Stellar Light
Authors:
Yifan Zhou,
Brendan P. Bowler,
Haifeng Yang,
Aniket Sanghi,
Gregory J. Herczeg,
Adam L. Kraus,
Jaehan Bae,
Feng Long,
Katherine B. Follette,
Kimberley Ward-Duong,
Zhaohuan Zhu,
Lauren I. Biddle,
Laird M. Close,
Lillian Yushu Jiang,
Ya-Lin Wu
Abstract:
The proposed protoplanet AB Aur b is a spatially concentrated emission source imaged in the mm-wavelength disk gap of the Herbig Ae/Be star AB Aur. Its near-infrared spectrum and absence of strong polarized light have been interpreted as evidence supporting the protoplanet interpretation. However, the complex scattered light structures in the AB Aur disk pose challenges in resolving the emission s…
▽ More
The proposed protoplanet AB Aur b is a spatially concentrated emission source imaged in the mm-wavelength disk gap of the Herbig Ae/Be star AB Aur. Its near-infrared spectrum and absence of strong polarized light have been interpreted as evidence supporting the protoplanet interpretation. However, the complex scattered light structures in the AB Aur disk pose challenges in resolving the emission source and interpreting the true nature of AB Aur b. We present new images of the AB Aur system obtained using the Hubble Space Telescope Wide Field Camera 3 in the ultraviolet (UV) and optical bands. AB Aur b and the known disk spirals are recovered in the F336W, F410M, and F645N bands. The spectral energy distribution of AB Aur b shows absorption in the Balmer jump, mimicking those of early-type stars. By comparing the colors of AB Aur b to those of the host star, the disk spirals, and predictions from scattered light and self-luminous models, we find that the emission from AB Aur b is inconsistent with planetary photospheric or accretion shock models. Instead, it is consistent with those measured in the circumstellar disks that trace scattered light. We conclude that the UV and visible emission from AB Aur b does not necessitate the presence of a protoplanet. We synthesize observational constraints on AB Aur b and discuss inconsistent interpretations of AB Aur b among different datasets. Considering the significance of the AB Aur b discovery, we advocate for further observational evidence to verify its planetary nature.
△ Less
Submitted 30 August, 2023;
originally announced August 2023.
-
Making the Most Out of the Limited Context Length: Predictive Power Varies with Clinical Note Type and Note Section
Authors:
Hongyi Zheng,
Yixin Zhu,
Lavender Yao Jiang,
Kyunghyun Cho,
Eric Karl Oermann
Abstract:
Recent advances in large language models have led to renewed interest in natural language processing in healthcare using the free text of clinical notes. One distinguishing characteristic of clinical notes is their long time span over multiple long documents. The unique structure of clinical notes creates a new design choice: when the context length for a language model predictor is limited, which…
▽ More
Recent advances in large language models have led to renewed interest in natural language processing in healthcare using the free text of clinical notes. One distinguishing characteristic of clinical notes is their long time span over multiple long documents. The unique structure of clinical notes creates a new design choice: when the context length for a language model predictor is limited, which part of clinical notes should we choose as the input? Existing studies either choose the inputs with domain knowledge or simply truncate them. We propose a framework to analyze the sections with high predictive power. Using MIMIC-III, we show that: 1) predictive power distribution is different between nursing notes and discharge notes and 2) combining different types of notes could improve performance when the context length is large. Our findings suggest that a carefully selected sampling function could enable more efficient information extraction from clinical notes.
△ Less
Submitted 13 July, 2023;
originally announced July 2023.
-
Measurement of the cosmic p+He energy spectrum from 50 GeV to 0.5 PeV with the DAMPE space mission
Authors:
DAMPE Collaboration,
F. Alemanno,
C. Altomare,
Q. An,
P. Azzarello,
F. C. T. Barbato,
P. Bernardini,
X. J. Bi,
I. Cagnoli,
M. S. Cai,
E. Casilli,
E. Catanzani,
J. Chang,
D. Y. Chen,
J. L. Chen,
Z. F. Chen,
P. Coppin,
M. Y. Cui,
T. S. Cui,
Y. X. Cui,
H. T. Dai,
A. De Benedittis,
I. De Mitri,
F. de Palma,
M. Deliyergiyev
, et al. (130 additional authors not shown)
Abstract:
Recent observations of the light component of the cosmic-ray spectrum have revealed unexpected features that motivate further and more precise measurements up to the highest energies. The Dark Matter Particle Explorer is a satellite-based cosmic-ray experiment that has been operational since December 2015, continuously collecting data on high-energy cosmic particles with very good statistics, ener…
▽ More
Recent observations of the light component of the cosmic-ray spectrum have revealed unexpected features that motivate further and more precise measurements up to the highest energies. The Dark Matter Particle Explorer is a satellite-based cosmic-ray experiment that has been operational since December 2015, continuously collecting data on high-energy cosmic particles with very good statistics, energy resolution, and particle identification capabilities. In this work, the latest measurements of the energy spectrum of proton+helium in the energy range from 46 GeV to 464 TeV are presented. Among the most distinctive features of the spectrum, a spectral hardening at 600 GeV has been observed, along with a softening at 29 TeV measured with a 6.6σ significance. Moreover, the detector features and the analysis approach allowed for the extension of the spectral measurement up to the sub-PeV region. Even if with small statistical significance due to the low number of events, data suggest a new spectral hardening at about 150 TeV.
△ Less
Submitted 14 August, 2024; v1 submitted 31 March, 2023;
originally announced April 2023.
-
Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction
Authors:
Grace Yang,
Ming Cao,
Lavender Y. Jiang,
Xujin C. Liu,
Alexander T. M. Cheung,
Hannah Weiss,
David Kurland,
Kyunghyun Cho,
Eric K. Oermann
Abstract:
Traditional evaluation metrics for classification in natural language processing such as accuracy and area under the curve fail to differentiate between models with different predictive behaviors despite their similar performance metrics. We introduce sensitivity score, a metric that scrutinizes models' behaviors at the vocabulary level to provide insights into disparities in their decision-making…
▽ More
Traditional evaluation metrics for classification in natural language processing such as accuracy and area under the curve fail to differentiate between models with different predictive behaviors despite their similar performance metrics. We introduce sensitivity score, a metric that scrutinizes models' behaviors at the vocabulary level to provide insights into disparities in their decision-making logic. We assess the sensitivity score on a set of representative words in the test set using two classifiers trained for hospital readmission classification with similar performance statistics. Our experiments compare the decision-making logic of clinicians and classifiers based on rank correlations of sensitivity scores. The results indicate that the language model's sensitivity score aligns better with the professionals than the xgboost classifier on tf-idf embeddings, which suggests that xgboost uses some spurious features. Overall, this metric offers a novel perspective on assessing models' robustness by quantifying their discrepancy with professional opinions. Our code is available on GitHub (https://github.com/nyuolab/Model_Sensitivity).
△ Less
Submitted 15 November, 2022; v1 submitted 13 November, 2022;
originally announced November 2022.
-
Search for relativistic fractionally charged particles in space
Authors:
DAMPE Collaboration,
F. Alemanno,
C. Altomare,
Q. An,
P. Azzarello,
F. C. T. Barbato,
P. Bernardini,
X. J. Bi,
M. S. Cai,
E. Casilli,
E. Catanzani,
J. Chang,
D. Y. Chen,
J. L. Chen,
Z. F. Chen,
M. Y. Cui,
T. S. Cui,
Y. X. Cui,
H. T. Dai,
A. De-Benedittis,
I. De Mitri,
F. de Palma,
M. Deliyergiyev,
A. Di Giovanni,
M. Di Santo
, et al. (126 additional authors not shown)
Abstract:
More than a century after the performance of the oil drop experiment, the possible existence of fractionally charged particles FCP still remains unsettled. The search for FCPs is crucial for some extensions of the Standard Model in particle physics. Most of the previously conducted searches for FCPs in cosmic rays were based on experiments underground or at high altitudes. However, there have been…
▽ More
More than a century after the performance of the oil drop experiment, the possible existence of fractionally charged particles FCP still remains unsettled. The search for FCPs is crucial for some extensions of the Standard Model in particle physics. Most of the previously conducted searches for FCPs in cosmic rays were based on experiments underground or at high altitudes. However, there have been few searches for FCPs in cosmic rays carried out in orbit other than AMS-01 flown by a space shuttle and BESS by a balloon at the top of the atmosphere. In this study, we conduct an FCP search in space based on on-orbit data obtained using the DArk Matter Particle Explorer (DAMPE) satellite over a period of five years. Unlike underground experiments, which require an FCP energy of the order of hundreds of GeV, our FCP search starts at only a few GeV. An upper limit of $6.2\times 10^{-10}~~\mathrm{cm^{-2}sr^{-1} s^{-1}}$ is obtained for the flux. Our results demonstrate that DAMPE exhibits higher sensitivity than experiments of similar types by three orders of magnitude that more stringently restricts the conditions for the existence of FCP in primary cosmic rays.
△ Less
Submitted 9 September, 2022;
originally announced September 2022.
-
Edge Entropy as an Indicator of the Effectiveness of GNNs over CNNs for Node Classification
Authors:
Lavender Yao Jiang,
John Shi,
Mark Cheung,
Oren Wright,
José M. F. Moura
Abstract:
Graph neural networks (GNNs) extend convolutional neural networks (CNNs) to graph-based data. A question that arises is how much performance improvement does the underlying graph structure in the GNN provide over the CNN (that ignores this graph structure). To address this question, we introduce edge entropy and evaluate how good an indicator it is for possible performance improvement of GNNs over…
▽ More
Graph neural networks (GNNs) extend convolutional neural networks (CNNs) to graph-based data. A question that arises is how much performance improvement does the underlying graph structure in the GNN provide over the CNN (that ignores this graph structure). To address this question, we introduce edge entropy and evaluate how good an indicator it is for possible performance improvement of GNNs over CNNs. Our results on node classification with synthetic and real datasets show that lower values of edge entropy predict larger expected performance gains of GNNs over CNNs, and, conversely, higher edge entropy leads to expected smaller improvement gains.
△ Less
Submitted 15 December, 2020;
originally announced December 2020.
-
Graph Signal Processing and Deep Learning: Convolution, Pooling, and Topology
Authors:
Mark Cheung,
John Shi,
Oren Wright,
Lavender Y. Jiang,
Xujin Liu,
José M. F. Moura
Abstract:
Deep learning, particularly convolutional neural networks (CNNs), have yielded rapid, significant improvements in computer vision and related domains. But conventional deep learning architectures perform poorly when data have an underlying graph structure, as in social, biological, and many other domains. This paper explores 1)how graph signal processing (GSP) can be used to extend CNN components…
▽ More
Deep learning, particularly convolutional neural networks (CNNs), have yielded rapid, significant improvements in computer vision and related domains. But conventional deep learning architectures perform poorly when data have an underlying graph structure, as in social, biological, and many other domains. This paper explores 1)how graph signal processing (GSP) can be used to extend CNN components to graphs in order to improve model performance; and 2)how to design the graph CNN architecture based on the topology or structure of the data graph.
△ Less
Submitted 3 August, 2020;
originally announced August 2020.
-
Pooling in Graph Convolutional Neural Networks
Authors:
Mark Cheung,
John Shi,
Lavender Yao Jiang,
Oren Wright,
José M. F. Moura
Abstract:
Graph convolutional neural networks (GCNNs) are a powerful extension of deep learning techniques to graph-structured data problems. We empirically evaluate several pooling methods for GCNNs, and combinations of those graph pooling methods with three different architectures: GCN, TAGCN, and GraphSAGE. We confirm that graph pooling, especially DiffPool, improves classification accuracy on popular gr…
▽ More
Graph convolutional neural networks (GCNNs) are a powerful extension of deep learning techniques to graph-structured data problems. We empirically evaluate several pooling methods for GCNNs, and combinations of those graph pooling methods with three different architectures: GCN, TAGCN, and GraphSAGE. We confirm that graph pooling, especially DiffPool, improves classification accuracy on popular graph classification datasets and find that, on average, TAGCN achieves comparable or better accuracy than GCN and GraphSAGE, particularly for datasets with larger and sparser graph structures.
△ Less
Submitted 7 April, 2020;
originally announced April 2020.
-
Evaluation of the 1077keV gamma-ray emission probability from 68Ga decay
Authors:
X. L. Huang,
L. Y. Jiang,
X. J. Chen,
G. C. Chen
Abstract:
68Ga decays to the excited states of 68Zn through the electron capture decay mode. New recommended values for the emission probability of 1077keV gamma-ray given by the ENSDF and DDEP databases all use data from absolute measurements. In 2011 Jiang Liyang deduced a new value for 1077keV gamma-ray emission probability by measuring the 69Ga(n,2n)68Ga reaction cross section. The new value is about 20…
▽ More
68Ga decays to the excited states of 68Zn through the electron capture decay mode. New recommended values for the emission probability of 1077keV gamma-ray given by the ENSDF and DDEP databases all use data from absolute measurements. In 2011 Jiang Liyang deduced a new value for 1077keV gamma-ray emission probability by measuring the 69Ga(n,2n)68Ga reaction cross section. The new value is about 20% lower than values obtained from previous absolute measurements and evaluations. In this paper, the discrepancies among the measurements and evaluations are analyzed carefully and the new values are re-recommended. Our recommended value for the emission probability of 1077keV gamma-ray is 2.72+-0.16 %.
△ Less
Submitted 30 September, 2013;
originally announced September 2013.