-
CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts
Authors:
Zhenpeng Su,
Xing Wu,
Zijia Lin,
Yizhe Xiong,
Minxuan Lv,
Guangyuan Ma,
Hui Chen,
Songlin Hu,
Guiguang Ding
Abstract:
Large language models (LLMs) have recently attracted much attention from the community due to their remarkable performance on a wide range of downstream tasks. According to the well-known scaling law, scaling up a dense LLM enhances its capabilities but also significantly increases its computational complexity. Mixture-of-Experts (MoE) models address this by allowing the model size to grow without substantially raising training or inference costs. Yet MoE models face challenges regarding knowledge sharing among experts, making their performance somewhat sensitive to routing accuracy. To tackle this, previous works introduced shared experts and combined their outputs with those of the top $K$ routed experts in an ``addition'' manner. In this paper, inspired by collective matrix factorization to learn shared knowledge among data, we propose CartesianMoE, which implements more effective knowledge sharing among experts in a manner more like ``multiplication''. Extensive experimental results indicate that CartesianMoE outperforms previous MoE models for building LLMs, in terms of both perplexity and downstream task performance. We also find that CartesianMoE achieves better expert routing robustness.
Submitted 21 October, 2024;
originally announced October 2024.
-
Test of lepton flavour universality with $B_s^0 \rightarrow φ\ell^+\ell^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1124 additional authors not shown)
Abstract:
Lepton flavour universality in rare $b\rightarrow s$ transitions is tested for the first time using $B_s^0$ meson decays. The measurements are performed using $pp$ collision data collected by the LHCb experiment between 2011 and 2018, corresponding to a total integrated luminosity of 9$\,{\rm fb}^{-1}$. Branching fraction ratios between the $B_s^0 \rightarrow φe^+e^-$ and $B_s^0 \rightarrow φμ^+μ^-$ decays are measured in three regions of dilepton mass squared, $q^2$, with $0.1 < q^2 < 1.1$, $1.1 < q^2 < 6.0$, and $15 < q^2 < 19\,{\rm GeV}^2/c^4$. The results agree with the Standard Model expectation of lepton flavour universality.
Submitted 17 October, 2024;
originally announced October 2024.
-
Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning
Authors:
Guoxuan Ma,
Lili Zhao,
Jian Kang
Abstract:
The COVID-19 pandemic highlighted the urgent need for effective vaccine policies, but traditional clinical trials often lack sufficient data to capture the diverse population characteristics necessary for comprehensive public health strategies. Ethical concerns around randomized trials during a pandemic further complicate policy development for public health. Reinforcement Learning (RL) offers a promising alternative for vaccine policy development. However, direct online RL exploration in real-world scenarios can result in suboptimal and potentially harmful decisions. This study proposes a novel framework combining tabular Q-learning with microsimulation (i.e., a Recurrent Neural Network (RNN) environment simulator) to address these challenges in public health vaccine policymaking, which enables effective vaccine policy learning without real-world interaction, addressing both ethical and exploration challenges. The RNN environment simulator captures temporal associations between infection and patient characteristics, generating realistic simulation data. Our tabular Q-learning model produces an interpretable policy table that balances the risks of severe infection against vaccination side effects. Applied to COVID-19 booster policies, the learned Q-learning-based policy outperforms current practices, offering a path toward more effective vaccination strategies.
Submitted 16 October, 2024;
originally announced October 2024.
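The tabular Q-learning update named in the abstract above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the two-state risk model, the reward values, and the `toy_step` simulator are hypothetical stand-ins and are not the paper's RNN environment simulator or its policy table.

```python
import random


def q_learning(step, n_states, n_actions, episodes=500, horizon=10,
               alpha=0.1, gamma=0.95, epsilon=0.1, seed=0):
    """Generic tabular Q-learning against a simulator `step(state, action, rng)`."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        s = rng.randrange(n_states)
        for _ in range(horizon):
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                a = rng.randrange(n_actions)
            else:
                a = max(range(n_actions), key=lambda i: q[s][i])
            s_next, r = step(s, a, rng)
            # Standard Q-learning temporal-difference update.
            target = r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q


def toy_step(state, action, rng):
    """Hypothetical simulator (assumption, not from the paper).

    state: 0 = low risk, 1 = high risk; action: 0 = no booster, 1 = booster.
    A booster always costs 0.2 (side effects); skipping it costs 1.0 only
    for a high-risk patient (severe infection).
    """
    if action == 1:
        reward = -0.2
    else:
        reward = -1.0 if state == 1 else 0.0
    next_state = rng.randrange(2)  # next risk state is drawn at random
    return next_state, reward


q_table = q_learning(toy_step, n_states=2, n_actions=2, episodes=2000)
# Read the greedy policy off the learned table: one action per state.
policy = [max(range(2), key=lambda a: q_table[s][a]) for s in range(2)]
print(policy)
```

The learned table is small enough to read directly, which is the interpretability property the abstract attributes to the tabular approach: each row is a patient state and each column an action.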
-
One2set + Large Language Model: Best Partners for Keyphrase Generation
Authors:
Liangying Shao,
Liang Zhang,
Minlong Peng,
Guoqi Ma,
Hao Yue,
Mingming Sun,
Jinsong Su
Abstract:
Keyphrase generation (KPG) aims to automatically generate a collection of phrases representing the core concepts of a given document. The dominant paradigms in KPG include one2seq and one2set. Recently, there has been increasing interest in applying large language models (LLMs) to KPG. Our preliminary experiments reveal that it is challenging for a single model to excel in both recall and precision. Further analysis shows that: 1) the one2set paradigm has the advantage of high recall, but suffers from improper assignments of supervision signals during training; 2) LLMs are powerful in keyphrase selection, but existing selection methods often make redundant selections. Given these observations, we introduce a generate-then-select framework decomposing KPG into two steps, where we adopt a one2set-based model as the generator to produce candidates and then use an LLM as the selector to select keyphrases from these candidates. In particular, we make two important improvements to our generator and selector: 1) we design an Optimal Transport-based assignment strategy to address the above improper assignments; 2) we model keyphrase selection as a sequence labeling task to alleviate redundant selections. Experimental results on multiple benchmark datasets show that our framework significantly surpasses state-of-the-art models, especially in absent keyphrase prediction.
Submitted 20 October, 2024; v1 submitted 4 October, 2024;
originally announced October 2024.
-
Measurement of the effective leptonic weak mixing angle
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1117 additional authors not shown)
Abstract:
Using $pp$ collision data at $\sqrt{s}=13$ TeV, recorded by the LHCb experiment between 2016 and 2018 and corresponding to an integrated luminosity of $5.4$ fb$^{-1}$, the forward-backward asymmetry in the $pp \to Z/γ^{*} \to μ^+μ^-$ process is measured. The measurement is carried out in ten intervals of the difference between the muon pseudorapidities, within a fiducial region covering dimuon masses between $66$ and $116$ GeV, muon pseudorapidities between $2.0$ and $4.5$ and muon transverse momenta above $20$ GeV. These forward-backward asymmetries are compared with predictions, at next-to-leading order in the strong and electroweak couplings. The measured effective leptonic weak mixing angle is $\sin^2θ_{\rm eff}^\ell = 0.23147 \pm 0.00044 \pm 0.00005 \pm 0.00023$, where the first uncertainty is statistical, the second arises from systematic uncertainties associated with the asymmetry measurement, and the third arises from uncertainties in the fit model used to extract $\sin^2θ_{\rm eff}^\ell$ from the asymmetry measurement. This result is based on an arithmetic average of results using the CT18, MSHT20, and NNPDF31 parameterisations of the proton internal structure, and is consistent with previous measurements and with predictions from the global electroweak fit.
Submitted 3 October, 2024;
originally announced October 2024.
-
Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
Authors:
Tianchi Xie,
Jiangning Zhu,
Guozu Ma,
Minzhi Lin,
Wei Chen,
Weikai Yang,
Shixia Liu
Abstract:
Sample selection improves the efficiency and effectiveness of machine learning models by providing informative and representative samples. Typically, samples can be modeled as a sample graph, where nodes are samples and edges represent their similarities. Most existing methods are based on local information, such as the training difficulty of samples, thereby overlooking global information, such as connectivity patterns. This oversight can result in suboptimal selection because global information is crucial for ensuring that the selected samples well represent the structural properties of the graph. To address this issue, we employ structural entropy to quantify global information and losslessly decompose it from the whole graph to individual nodes using the Shapley value. Based on the decomposition, we present $\textbf{S}$tructural-$\textbf{E}$ntropy-based sample $\textbf{S}$election ($\textbf{SES}$), a method that integrates both global and local information to select informative and representative samples. SES begins by constructing a $k$NN-graph among samples based on their similarities. It then measures sample importance by combining structural entropy (global metric) with training difficulty (local metric). Finally, SES applies importance-biased blue noise sampling to select a set of diverse and representative samples. Comprehensive experiments on three learning scenarios -- supervised learning, active learning, and continual learning -- clearly demonstrate the effectiveness of our method.
Submitted 5 October, 2024; v1 submitted 3 October, 2024;
originally announced October 2024.
-
Quantum Fast Implementation of Private Information Retrieval and Functional Bootstrapping
Authors:
Guangsheng Ma,
Hongbo Li
Abstract:
Quantum computation has found greater efficiency and security across various fields. We show that, in a near-term hybrid cloud computing scenario with only one single quantum server and an entirely classical client, critical bottlenecks in privacy-preserving computation can be addressed.
First, we propose an efficient quantum functional bootstrapping algorithm with a runtime polynomial in the plaintext-size, providing an exponential quantum speedup over classical algorithms. Second, we present a secure and fast quantum private information retrieval protocol with logarithmic query time. The security relies on the learning with errors (LWE) problem with polynomial modulus, greatly improving the security of classical fast PIR protocol based on ring-LWE with super-polynomial modulus.
Technically, we extend an important classical homomorphic operation, known as blind rotation, to the quantum case by an encrypted conditional rotation technique. This technique holds promise for broader applications in quantum cryptography.
Submitted 30 September, 2024;
originally announced September 2024.
-
Observation of Transient Trion Induced by Ultrafast Charge Transfer in Graphene/MoS2 Heterostructure
Authors:
Chen Wang,
Yu Chen,
Qiushi Ma,
Peng Suo,
Kaiwen Sun,
Yifan Cheng,
Xian Lin,
Weimin Liu,
Guohong Ma
Abstract:
Van der Waals (vdW) heterostructures constructed from TMDCs provide an ideal platform for exploring various quasiparticle behaviors, with the trion, composed of a neutral exciton and a charged carrier, being a notable example. There are typically three methods to generate trions: electrical doping, chemical doping, and direct optical doping. The first two methods generate static trions, while the last gives rise to transient trions. Here, we present an indirect optical doping approach that generates transient trions via ultrafast charge transfer (CT) and achieves control over the trion-to-exciton ratio by adjusting CT in a Gr/MoS2 heterostructure. Furthermore, we demonstrate the dynamics of the transient trion generated with this method, which shows a slightly longer lifetime than that of the exciton, accounted for by the Coulomb interactions between the trion and charged defects. This study provides fresh perspectives on the construction of new quasiparticles, dynamical characterization, and the control of many-body interactions in two-dimensional structures.
Submitted 26 September, 2024;
originally announced September 2024.
-
Search for $B_{(s)}^{*0}\toμ^+μ^-$ in $B_c^+\toπ^+μ^+μ^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1113 additional authors not shown)
Abstract:
A search for the very rare $B^{*0}\toμ^+μ^-$ and $B_{s}^{*0}\toμ^+μ^-$ decays is conducted by analysing the $B_c^+\to π^+μ^+μ^-$ process. The analysis uses proton-proton collision data collected with the LHCb detector between 2011 and 2018, corresponding to an integrated luminosity of 9$\text{\,fb}^{-1}$. The signal signatures correspond to simultaneous peaks in the $μ^+μ^-$ and $π^+μ^+μ^-$ invariant masses. No evidence for an excess of events over background is observed for either signal decay mode. Upper limits at the $90\%$ confidence level are set on the branching fractions relative to that for $B_c^+\to J/ψπ^+$ decays, \begin{align*}
{\cal R}_{B^{*0}(μ^+μ^-)π^+/J/ψπ^+} &< 3.8\times 10^{-5}\ \text{ and }
{\cal R}_{B_{s}^{*0}(μ^+μ^-)π^+/J/ψπ^+} &< 5.0\times 10^{-5}\,. \end{align*}
Submitted 25 September, 2024;
originally announced September 2024.
-
Analysis of $Λ_b^0 \rightarrow pK^-μ^+μ^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1114 additional authors not shown)
Abstract:
The differential branching fraction and angular coefficients of $Λ_b^0 \rightarrow pK^-μ^+μ^-$ decays are measured in bins of the dimuon mass squared and dihadron mass. The analysis is performed using a data set corresponding to 9 fb$^{-1}$ of integrated luminosity collected with the LHCb detector between 2011 and 2018. The data are consistent with receiving contributions from a mixture of $Λ$ resonances with different spin-parity quantum numbers. The angular coefficients show a pattern of vector--axial vector interference that is characteristic of the type of flavour-changing neutral-current transition relevant for these decays.
Submitted 19 September, 2024;
originally announced September 2024.
-
First determination of the spin-parity of $Ξ_{c}(3055)^{+,0}$ baryons
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1109 additional authors not shown)
Abstract:
The ${Ξ_{b}^{0(-)}\toΞ_{c}(3055)^{+(0)}(\to D^{+(0)}Λ)π^{-}}$ decay chains are observed, and the spin-parity of $Ξ_{c}(3055)^{+(0)}$ baryons is determined for the first time. The measurement is performed using proton-proton collision data at a center-of-mass energy of $\sqrt{s}=13\,\text{TeV}$, corresponding to an integrated luminosity of $5.4\,\text{fb}^{-1}$, recorded by the LHCb experiment between 2016 and 2018. The spin-parity of the $Ξ_{c}(3055)^{+(0)}$ baryons is determined to be $3/2^{+}$ with a significance of more than $6.5σ$ ($3.5σ$) compared to all other tested hypotheses. The up-down asymmetries of the ${Ξ_{b}^{0(-)}\toΞ_{c}(3055)^{+(0)}π^{-}}$ transitions are measured to be $-0.92\pm0.10\pm0.05$ ($-0.92\pm0.16\pm0.22$), consistent with maximal parity violation, where the first uncertainty is statistical and the second is systematic. These results support the hypothesis that the $Ξ_{c}(3055)^{+(0)}$ baryons correspond to the first $D$-wave $λ$-mode excitation of the $Ξ_{c}$ flavor triplet.
Submitted 9 September, 2024;
originally announced September 2024.
-
Label-free evaluation of lung and heart transplant biopsies using virtual staining
Authors:
Yuzhu Li,
Nir Pillar,
Tairan Liu,
Guangdong Ma,
Yuxuan Qi,
Kevin de Haan,
Yijie Zhang,
Xilin Yang,
Adrian J. Correa,
Guangqian Xiao,
Kuang-Yu Jen,
Kenneth A. Iczkowski,
Yulun Wu,
William Dean Wallace,
Aydogan Ozcan
Abstract:
Organ transplantation serves as the primary therapeutic strategy for end-stage organ failures. However, allograft rejection is a common complication of organ transplantation. Histological assessment is essential for the timely detection and diagnosis of transplant rejection and remains the gold standard. Nevertheless, the traditional histochemical staining process is time-consuming, costly, and labor-intensive. Here, we present a panel of virtual staining neural networks for lung and heart transplant biopsies, which digitally convert autofluorescence microscopic images of label-free tissue sections into their brightfield histologically stained counterparts, bypassing the traditional histochemical staining process. Specifically, we virtually generated Hematoxylin and Eosin (H&E), Masson's Trichrome (MT), and Elastic Verhoeff-Van Gieson (EVG) stains for label-free transplant lung tissue, along with H&E and MT stains for label-free transplant heart tissue. Subsequent blind evaluations conducted by three board-certified pathologists have confirmed that the virtual staining networks consistently produce high-quality histology images with high color uniformity, closely resembling their well-stained histochemical counterparts across various tissue features. The use of virtually stained images for the evaluation of transplant biopsies achieved comparable diagnostic outcomes to those obtained via traditional histochemical staining, with a concordance rate of 82.4% for lung samples and 91.7% for heart samples. Moreover, virtual staining models create multiple stains from the same autofluorescence input, eliminating structural mismatches observed between adjacent sections stained in the traditional workflow, while also saving tissue, expert time, and staining costs.
Submitted 8 September, 2024;
originally announced September 2024.
-
Measurement of exclusive $J/ψ$ and $ψ(2S)$ production at $\sqrt{s}=13$ TeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1072 additional authors not shown)
Abstract:
Measurements are presented of the cross-section for the central exclusive production of $J/ψ\toμ^+μ^-$ and $ψ(2S)\toμ^+μ^-$ processes in proton-proton collisions at $\sqrt{s} = 13 $ TeV with 2016-2018 data. They are performed by requiring both muons to be in the LHCb acceptance (with pseudorapidity $2<η_{μ^\pm} < 4.5$) and mesons in the rapidity range $2.0 < y < 4.5$. The integrated cross-section results are \begin{equation*}
σ_{J/ψ\toμ^+μ^-}(2.0<y_{J/ψ}<4.5,2.0<η_{μ^\pm} < 4.5) = 400 \pm 2 \pm 5 \pm 12 \,{\rm pb}\,,
\end{equation*} \begin{equation*}
σ_{ψ(2S)\toμ^+μ^-}(2.0<y_{ψ(2S)}<4.5,2.0<η_{μ^\pm} < 4.5) = 9.40 \pm 0.15 \pm 0.13 \pm 0.27 \,{\rm pb}\,, \end{equation*} where the uncertainties are statistical, systematic and due to the luminosity determination. In addition, a measurement of the ratio of $ψ(2S)$ and $J/ψ$ cross-sections, at an average photon-proton centre-of-mass energy of 1 TeV, is performed, giving \begin{equation*}
\frac{σ_{ψ(2S)}}{σ_{J/ψ}} = 0.1763 \pm 0.0029 \pm 0.0008 \pm 0.0039 \,, \end{equation*} where the first uncertainty is statistical, the second systematic and the third due to the knowledge of the involved branching fractions. For the first time, the dependence of the $J/ψ$ and $ψ(2S)$ cross-sections on the total transverse momentum transfer is determined in $pp$ collisions and is found consistent with the behaviour observed in electron-proton collisions.
Submitted 11 September, 2024; v1 submitted 5 September, 2024;
originally announced September 2024.
-
Measurement of $CP$ violation in ${B^0}\rightarrow{D^{+}D^{-}}$ and ${B^{0}_{s}}\rightarrow{D^{+}_{s}D^{-}_{s}}$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1115 additional authors not shown)
Abstract:
A time-dependent, flavour-tagged measurement of $CP$ violation is performed with ${B^0}\rightarrow{D^{+}D^{-}}$ and ${B^{0}_{s}}\rightarrow{D^{+}_{s}D^{-}_{s}}$ decays, using data collected by the LHCb detector in proton-proton collisions at a centre-of-mass energy of 13 TeV corresponding to an integrated luminosity of 6 fb$^{-1}$. In ${B^0}\rightarrow{D^{+}D^{-}}$ decays the $CP$-violation parameters are measured to be \begin{align}
S_{D^{+}D^{-}} & = -0.552 \pm 0.100\,\text{(stat)} \pm 0.010\,\text{(syst)}, \nonumber \newline
C_{D^{+}D^{-}} & = \phantom{-}0.128 \pm0.103\,\text{(stat)} \pm 0.010\,\text{(syst)}. \nonumber \end{align} In $B^{0}_{s} \rightarrow D^{+}_{s}D^{-}_{s}$ decays the $CP$-violating parameter formulation in terms of $φ_{s}$ and $|λ|$ results in \begin{align}
φ_{s} & = -0.086 \pm 0.106 \,\text{(stat)} \pm 0.028\,\text{(syst)} \,\text{rad}, \nonumber \newline
|λ_{D^{+}_{s}D^{-}_{s}}| & = \phantom{-}1.145 \pm 0.126\,\text{(stat)} \pm 0.031\,\text{(syst)}. \nonumber \end{align} These results represent the most precise single measurement of the $CP$-violation parameters in their respective channels. For the first time in a single measurement, $CP$ symmetry is observed to be violated in ${B^0}\rightarrow{D^{+}D^{-}}$ decays with a significance exceeding six standard deviations.
Submitted 4 September, 2024;
originally announced September 2024.
-
Measurement of $Λ_b^0$, $Λ_c^+$ and $Λ$ decay parameters using $Λ_b^0 \to Λ_c^+ h^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1103 additional authors not shown)
Abstract:
A comprehensive study of the angular distributions in the bottom-baryon decays $Λ_b^0\toΛ_c^+ h^-(h=π, K)$, followed by $Λ_c^+\toΛ h^+$ with $Λ\to pπ^-$ or $Λ_c^+\to pK^0_\mathrm{S}$ decays, is performed using a data sample of proton-proton collisions corresponding to an integrated luminosity of $9~\mathrm{fb}^{-1}$ collected by the LHCb experiment at center-of-mass energies of 7, 8 and 13 TeV. The decay parameters and the associated charge-parity ($C\!P$) asymmetries are measured, with no significant $C\!P$ violation observed. For the first time, the $Λ_b^0 \to Λ_c^+ h^-$ decay parameters are measured. The most precise measurements of the decay parameters $α, β$ and $γ$ are obtained for $Λ_c^+$ decays and an independent measurement of the decay parameters for the strange-baryon $Λ$ decay is provided. The results deepen our understanding of weak decay dynamics in baryon decays.
Submitted 4 September, 2024;
originally announced September 2024.
-
Measurement of $C\!P$ violation observables in $D^+\rightarrow K^-K^+π^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1109 additional authors not shown)
Abstract:
A search for violation of the charge-parity $C\!P$ symmetry in the $D^+\rightarrow K^-K^+π^+$ decay is presented, with proton-proton collision data corresponding to an integrated luminosity of 5.4 fb$^{-1}$, collected at a center-of-mass energy of $13$ TeV with the LHCb detector. A novel model-independent technique is used to compare the $D^+$ and $D^-$ phase-space distributions, with instrumental asymmetries subtracted using the $D^+_{s}\rightarrow K^-K^+π^+$ decay as a control channel. The $p$-value for the hypothesis of $C\!P$ conservation is $8.1\%$. The $C\!P$ asymmetry observables $A_{C\!P|S}^{φπ^+} = (0.95 \pm 0.43_{stat} \pm 0.26_{syst})\times 10^{-3}$ and $A_{C\!P|S}^{\overline{K}^{*0}K^+} = (-0.26 \pm 0.56_{ stat} \pm 0.18_{syst})\times 10^{-3}$ are also measured. These results show no evidence of $C\!P$ violation and represent the most sensitive search performed through the phase space of a multibody decay.
Submitted 2 September, 2024;
originally announced September 2024.
-
Study of the rare decay $J/ψ\to μ^+μ^-μ^+μ^-$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1096 additional authors not shown)
Abstract:
The rare electromagnetic $J/ψ\to μ^+μ^-μ^+μ^-$ decay is observed with a significance greatly exceeding the discovery threshold, using proton-proton collision data collected by the LHCb experiment during 2016-2018 at a center-of-mass energy of 13 TeV, corresponding to an integrated luminosity of $5.4\,\text{fb}^{-1}$. The rate of this decay is measured relative to that of the $J/ψ\to μ^+μ^-$ mode. Using the QED model for the four-muon decay in the efficiency estimation, its branching fraction is determined to be \begin{equation*}
{\mathcal{B}}(J/ψ\to μ^+μ^-μ^+μ^-) = (1.13\pm0.10\pm0.05\pm0.01)\times 10^{-6}, \end{equation*} where the uncertainties are statistical, systematic and due to the uncertainty on the branching fraction of the $J/ψ\to μ^+μ^-$ decay.
Submitted 29 August, 2024;
originally announced August 2024.
-
ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model
Authors:
Lifan Jiang,
Zhihui Wang,
Siqi Yin,
Guangxiao Ma,
Peng Zhang,
Boxi Wu
Abstract:
Multi-object tracking (MOT) is a critical technology in computer vision, designed to detect multiple targets in video sequences and assign each target a unique ID per frame. Existing MOT methods excel at accurately tracking multiple objects in real-time across various scenarios. However, these methods still face challenges such as poor noise resistance and frequent ID switches. In this research, we propose ConsistencyTrack, a novel joint detection and tracking (JDT) framework that formulates detection and association as a denoising diffusion process on perturbed bounding boxes. This progressive denoising strategy significantly improves the model's noise resistance. During the training phase, paired object boxes within two adjacent frames are diffused from ground-truth boxes to a random distribution, and then the model learns to detect and track by reversing this process. In inference, the model refines randomly generated boxes into detection and tracking results through minimal denoising steps. ConsistencyTrack also introduces an innovative target association strategy to address target occlusion. Experiments on the MOT17 and DanceTrack datasets demonstrate that ConsistencyTrack outperforms the other compared methods, and in particular surpasses DiffusionTrack in inference speed and other performance metrics. Our code is available at https://github.com/Tankowa/ConsistencyTrack.
Submitted 28 August, 2024;
originally announced August 2024.
-
Critical Point Extraction from Multivariate Functional Approximation
Authors:
Guanqun Ma,
David Lenz,
Tom Peterka,
Hanqi Guo,
Bei Wang
Abstract:
Advances in high-performance computing require new ways to represent large-scale scientific data to support data storage, data transfers, and data analysis within scientific workflows. Multivariate functional approximation (MFA) has recently emerged as a new continuous meshless representation that approximates raw discrete data with a set of piecewise smooth functions. An MFA model of data thus offers a compact representation and supports high-order evaluation of values and derivatives anywhere in the domain. In this paper, we present CPE-MFA, the first critical point extraction framework designed for MFA models of large-scale, high-dimensional data. CPE-MFA extracts critical points directly from an MFA model without the need for discretization or resampling. This is the first step toward enabling continuous implicit models such as MFA to support topological data analysis at scale.
Submitted 23 August, 2024;
originally announced August 2024.
-
Multi-Agent Based Simulation for Decentralized Electric Vehicle Charging Strategies and their Impacts
Authors:
Kristoffer Christensen,
Bo Nørregaard Jørgensen,
Zheng Grace Ma
Abstract:
The growing shift towards a Smart Grid involves integrating numerous new digital energy solutions into the energy ecosystems to address problems arising from the transition to carbon neutrality, particularly in linking the electricity and transportation sectors. Yet, this shift brings challenges due to mass electric vehicle adoption and the lack of methods to adequately assess various EV charging algorithms and their ecosystem impacts. This paper introduces a multi-agent based simulation model, validated through a case study of a Danish radial distribution network serving 126 households. The study reveals that traditional charging leads to grid overload by 2031 at 67% EV penetration, while decentralized strategies like Real-Time Pricing could cause overloads as early as 2028. The developed multi-agent based simulation demonstrates its ability to offer detailed, hourly analysis of future load profiles in distribution grids, and therefore, can be applied to other prospective scenarios in similar energy systems.
Submitted 20 August, 2024;
originally announced August 2024.
-
Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network
Authors:
Kristoffer Christensen,
Bo Nørregaard Jørgensen,
Zheng Grace Ma
Abstract:
Power-to-Hydrogen is crucial for the renewable energy transition, yet existing literature lacks business models for the significant excess heat it generates. This study addresses this by evaluating three models for selling electrolyzer-generated heat to district heating grids: constant, flexible, and renewable-source hydrogen production, with and without heat sales. Using agent-based modeling and multi-criteria decision-making methods (VIKOR, TOPSIS, PROMETHEE), it finds that selling excess heat can cut hydrogen production costs by 5.6%. The optimal model operates flexibly with electricity spot prices, includes heat sales, and maintains a hydrogen price of 3.3 EUR/kg. Environmentally, hydrogen production from grid electricity could emit up to 13,783.8 tons of CO2 over four years from 2023. The best economic and environmental model uses renewable sources and sells heat at 3.5 EUR/kg.
Submitted 20 August, 2024;
originally announced August 2024.
-
Multi-Agent Based Simulation for Investigating Centralized Charging Strategies and their Impact on Electric Vehicle Home Charging Ecosystem
Authors:
Kristoffer Christensen,
Bo Nørregaard Jørgensen,
Zheng Grace Ma
Abstract:
This paper addresses the critical integration of electric vehicles (EVs) into the electricity grid, which is essential for achieving carbon neutrality by 2050. The rapid increase in EV adoption poses significant challenges to the existing grid infrastructure, particularly in managing the increasing electricity demand and mitigating the risk of grid overloads. Centralized EV charging strategies are investigated due to their potential to optimize grid stability and efficiency, compared to decentralized approaches that may exacerbate grid stress. Utilizing a multi-agent based simulation model, the study provides a realistic representation of the electric vehicle home charging ecosystem in a case study of Strib, Denmark. The findings show that the Earliest-deadline-first and Round Robin strategies perform best with 100% EV adoption in terms of EV user satisfaction. The simulation considers a realistic adoption curve, EV charging strategies, EV models, and driving patterns to capture the full ecosystem dynamics over a long-term period with high (hourly) resolution. Additionally, the study offers detailed load profiles for future distribution grids, demonstrating how centralized charging strategies can efficiently manage grid loads and prevent overloads.
Submitted 20 August, 2024;
originally announced August 2024.
-
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Authors:
Guangyuan Ma,
Yongliang Ma,
Xing Wu,
Zhenpeng Su,
Ming Zhou,
Songlin Hu
Abstract:
Large Language Model-based Dense Retrieval (LLM-DR) optimizes over numerous heterogeneous fine-tuning collections from different domains. However, the discussion about its training data distribution is still minimal. Previous studies rely on empirically assigned dataset choices or sampling ratios, which inevitably leads to sub-optimal retrieval performance. In this paper, we propose a new task-level Distributionally Robust Optimization (tDRO) algorithm for LLM-DR fine-tuning, targeted at improving the universal domain generalization ability by end-to-end reweighting the data distribution of each task. The tDRO parameterizes the domain weights and updates them with scaled domain gradients. The optimized weights are then transferred to the LLM-DR fine-tuning to train more robust retrievers. Experiments with a series of different-sized LLM-DR models show optimal improvements on large-scale retrieval benchmarks and a reduction of up to 30% in dataset usage after applying our optimization algorithm.
Submitted 20 August, 2024;
originally announced August 2024.
-
Unidirectional imaging with partially coherent light
Authors:
Guangdong Ma,
Che-Yung Shen,
Jingxi Li,
Luzhe Huang,
Cagatay Isil,
Fazil Onuralp Ardic,
Xilin Yang,
Yuhang Li,
Yuntian Wang,
Md Sadman Sakib Rahman,
Aydogan Ozcan
Abstract:
Unidirectional imagers form images of input objects only in one direction, e.g., from field-of-view (FOV) A to FOV B, while blocking the image formation in the reverse direction, from FOV B to FOV A. Here, we report unidirectional imaging under spatially partially coherent light and demonstrate high-quality imaging only in the forward direction (A->B) with high power efficiency while distorting the image formation in the backward direction (B->A) along with low power efficiency. Our reciprocal design features a set of spatially engineered linear diffractive layers that are statistically optimized for partially coherent illumination with a given phase correlation length. Our analyses reveal that when illuminated by a partially coherent beam with a correlation length of ~1.5 w or larger, where w is the wavelength of light, diffractive unidirectional imagers achieve robust performance, exhibiting asymmetric imaging performance between the forward and backward directions - as desired. A partially coherent unidirectional imager designed with a smaller correlation length of less than 1.5 w still supports unidirectional image transmission, but with a reduced figure of merit. These partially coherent diffractive unidirectional imagers are compact (axially spanning less than 75 w), polarization-independent, and compatible with various types of illumination sources, making them well-suited for applications in asymmetric visual information processing and communication.
Submitted 10 August, 2024;
originally announced August 2024.
-
Observation of muonic Dalitz decays of $χ_{b}$ mesons and precise spectroscopy of hidden-beauty states
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1114 additional authors not shown)
Abstract:
The decays of the $χ_{b1}(1P)$, $χ_{b2}(1P)$, $χ_{b1}(2P)$ and $χ_{b2}(2P)$ mesons into the $Υ(1S)μ^+μ^-$ final state are observed with a high significance using proton-proton collision data collected with the LHCb detector and corresponding to an integrated luminosity of 9 fb$^{-1}$. The newly observed decays together with the $Υ(2S)\rightarrow Υ(1S)π^+π^-$ and $Υ(3S)\rightarrow Υ(2S)π^+π^-$ decay modes are used for precision measurements of the mass and mass splittings for the hidden-beauty states.
Submitted 9 August, 2024;
originally announced August 2024.
-
Measurement of $D^0-\overline{D}^0$ mixing and search for $CP$ violation with $D^0\rightarrow K^+π^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1065 additional authors not shown)
Abstract:
A measurement of the time-dependent ratio of the $D^0\rightarrow K^+π^-$ to $\overline{D}^0\rightarrow K^+π^-$ decay rates is reported. The analysis uses a sample of proton-proton collisions corresponding to an integrated luminosity of 6 fb$^{-1}$ recorded by the LHCb experiment from 2015 through 2018 at a center-of-mass energy of 13 TeV. The $D^0$ meson is required to originate from a $D^{*+}\rightarrow D^0π^+$ decay, such that its flavor at production is inferred from the charge of the accompanying pion. The measurement is performed simultaneously for the $K^+π^-$ and $K^-π^+$ final states, allowing both mixing and $CP$-violation parameters to be determined. The value of the ratio of the decay rates at production is determined to be $R_{Kπ} = (343.1 \pm 2.0) \times 10^{-5}$. The mixing parameters are measured to be $c_{Kπ} = (51.4 \pm 3.5) \times 10^{-4}$ and $c_{Kπ}^{\prime} = (13 \pm 4) \times 10^{-6}$, where $\sqrt{R_{Kπ}}c_{Kπ}$ is the linear coefficient of the expansion of the ratio as a function of decay time in units of the $D^0$ lifetime, and $c_{Kπ}^{\prime}$ is the quadratic coefficient, both averaged between the $K^+π^-$ and $K^-π^+$ final states. The precision is improved relative to the previous best measurement by approximately 60%. No evidence for $CP$ violation is found.
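The expansion described in this abstract can be illustrated numerically. The sketch below assumes the ratio takes the quadratic form $R(t) \approx R_{Kπ} + \sqrt{R_{Kπ}}\,c_{Kπ}\,t + c_{Kπ}^{\prime}\,t^2$, with $t$ in units of the $D^0$ lifetime. That form is inferred from the abstract's wording, not quoted from the paper. The function and constant names are hypothetical, only the central values are used, and the uncertainties are ignored.

```python
import math

# Central values quoted in the abstract (uncertainties ignored in this sketch).
R_KPI = 343.1e-5       # ratio of the decay rates at production
C_KPI = 51.4e-4        # mixing parameter c_Kpi
C_KPI_PRIME = 13e-6    # mixing parameter c'_Kpi


def decay_rate_ratio(t, r=R_KPI, c=C_KPI, c_prime=C_KPI_PRIME):
    """Assumed quadratic expansion of the ratio; t is in D0 lifetimes."""
    linear_coefficient = math.sqrt(r) * c   # "sqrt(R) * c is the linear coefficient"
    return r + linear_coefficient * t + c_prime * t ** 2


if __name__ == "__main__":
    for t in (0.0, 1.0, 2.0, 4.0):
        print(f"t = {t:.1f} lifetimes -> ratio = {decay_rate_ratio(t):.6e}")
```

Under these assumptions the ratio at $t = 0$ is simply $R_{Kπ}$, and the linear mixing term dominates the quadratic term over the first few lifetimes.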
Submitted 25 July, 2024;
originally announced July 2024.
-
A Two-Stage Imaging Framework Combining CNN and Physics-Informed Neural Networks for Full-Inverse Tomography: A Case Study in Electrical Impedance Tomography (EIT)
Authors:
Xuanxuan Yang,
Yangming Zhang,
Haofeng Chen,
Gang Ma,
Xiaojie Wang
Abstract:
Physics-Informed Neural Networks (PINNs) are a machine learning technique for solving partial differential equations (PDEs) by incorporating PDEs as loss terms in neural networks and minimizing the loss function during training. Tomographic imaging, a method to reconstruct internal properties from external measurement data, is highly complex and ill-posed, making it an inverse problem. Recently, PINNs have shown significant potential in computational fluid dynamics (CFD) and have advantages in solving inverse problems. However, existing research has primarily focused on semi-inverse Electrical Impedance Tomography (EIT), where internal electric potentials are accessible. The practical full inverse EIT problem, where only boundary voltage measurements are available, remains challenging. To address this, we propose a two-stage hybrid learning framework combining Convolutional Neural Networks (CNNs) and PINNs to solve the full inverse EIT problem. This framework integrates data-driven and model-driven approaches, combines supervised and unsupervised learning, and decouples the forward and inverse problems within the PINN framework in EIT. Stage I: a U-Net constructs an end-to-end mapping from boundary voltage measurements to the internal potential distribution using supervised learning. Stage II: a Multilayer Perceptron (MLP)-based PINN takes the predicted internal potentials as input to solve for the conductivity distribution through unsupervised learning.
Submitted 24 July, 2024;
originally announced July 2024.
-
Observation of exotic $J/ψφ$ resonances in diffractive processes in proton-proton collisions
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1068 additional authors not shown)
Abstract:
The first study of $J/ψφ$ production in diffractive processes in proton-proton collisions is presented. The study is based on an LHCb dataset recorded at centre-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 5 fb$^{-1}$. The data disfavour a nonresonant $J/ψφ$ production but are consistent with a resonant model including several resonant states observed previously only in $B^+ \to J/ψφK^+$ decays. The $χ_{c0}(4500)$ state is observed with a significance over $5σ$ and the $χ_{c1}(4274)$ is confirmed with a significance of more than $4σ$.
Submitted 19 July, 2024;
originally announced July 2024.
-
Study of charmonium production via the decay to $p\bar{p}$ at $\sqrt{s} = 13$ TeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1060 additional authors not shown)
Abstract:
The charmonium production cross-section in proton-proton collisions is measured at a centre-of-mass energy of $\sqrt{s}=13$ TeV using decays to the $p\bar{p}$ final state. The study is performed using a data sample corresponding to an integrated luminosity of $2.2$ fb$^{-1}$ collected in 2018 with the LHCb detector. The production cross-section of the $η_c$ meson is measured in a rapidity range of $2.0 < y < 4.0$ and in a transverse momentum range of $5.0 < p_{T} < 20.0$ GeV/$c$, which is extended compared with previous LHCb analyses. The differential cross-section is measured in bins of $p_{T}$ and, for the first time, of $y$. Upper limits, at 90% and 95% confidence levels, on the $η_c(2S)$ and $h_c(1P)$ prompt production cross-sections are determined for the first time.
Submitted 19 July, 2024;
originally announced July 2024.
-
CC-DCNet: Dynamic Convolutional Neural Network with Contrastive Constraints for Identifying Lung Cancer Subtypes on Multi-modality Images
Authors:
Yuan Jin,
Gege Ma,
Geng Chen,
Tianling Lyu,
Jan Egger,
Junhui Lyu,
Shaoting Zhang,
Wentao Zhu
Abstract:
The accurate diagnosis of pathological subtypes of lung cancer is of paramount importance for follow-up treatment and prognosis management. Assessment methods utilizing deep learning technologies have introduced novel approaches for clinical diagnosis. However, the majority of existing models rely solely on single-modality image input, leading to limited diagnostic accuracy. To this end, we propose a novel deep learning network designed to accurately classify lung cancer subtypes with multi-dimensional and multi-modality images, i.e., CT and pathological images. The strength of the proposed model lies in its ability to dynamically process both paired CT-pathological image sets and independent CT image sets, and consequently optimize the pathology-related feature extraction from CT images. This adaptive learning approach enhances the flexibility in processing multi-dimensional and multi-modality datasets and improves performance in the model testing phase. We also develop a contrastive constraint module, which quantitatively maps the cross-modality associations through network training, and thereby helps to explore the "gold standard" pathological information from the corresponding CT scans. To evaluate the effectiveness, adaptability, and generalization ability of our model, we conducted extensive experiments on a large-scale multi-center dataset and compared our model with a series of state-of-the-art classification models. The experimental results demonstrated the superiority of our model for lung cancer subtype classification, showcasing significant improvements in accuracy metrics such as ACC, AUC, and F1-score.
Submitted 17 July, 2024;
originally announced July 2024.
-
Amplitude analysis of $B^+ \to ψ(2S) K^+ π^+ π^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1092 additional authors not shown)
Abstract:
The first full amplitude analysis of $B^+ \to ψ(2S) K^+ π^+ π^-$ decays is performed using proton-proton collision data corresponding to an integrated luminosity of $9\,\text{fb}^{-1}$ recorded with the LHCb detector. The rich $K^+ π^+ π^-$ spectrum is studied and the branching fractions of the resonant substructure associated with the prominent $K_1(1270)^+$ contribution are measured. The data cannot be described by conventional strange and charmonium resonances only. An amplitude model with 53 components is developed comprising 11 hidden-charm exotic hadrons. New production mechanisms for charged charmonium-like states are observed. Significant resonant activity with spin-parity $J^P = 1^+$ in the $ψ(2S) π^+$ system is confirmed and a multi-pole structure is demonstrated. The spectral decomposition of the $ψ(2S) π^+ π^-$ invariant-mass structure, dominated by $X^0 \to ψ(2S) ρ(770)^0$ decays, broadly resembles the $J/ψφ$ spectrum observed in $B^+ \to J/ψφK^+$ decays. Exotic $ψ(2S) K^+ π^-$ resonances are observed for the first time.
Submitted 17 July, 2024;
originally announced July 2024.
-
Dissipationless topological quantum computation for Majorana objects in sparse-dense mixed encoding process
Authors:
Ye-Min Zhan,
Guan-Dong Mao,
Yu-Ge Chen,
Yue Yu,
Xi Luo
Abstract:
Topological quantum computation based on Majorana objects is subject to a significant challenge because at least some of the two-qubit quantum gates rely on the fermion (either charge or spin) parity of the qubits. This dependency renders the quantum operations involving these gates probabilistic when attempting to advance quantum processes within the quantum circuit model. Such an approach leads to significant information loss whenever measurements yield the undesired fermion parity. To resolve the problem of wasting information, we devise topological operations that allow for the non-dissipative correction of information from undesired fermion parity to the desired one. We use the sparse-dense mixed encoding process for the controlled-NOT gate as an example to explain how corrections can be implemented without affecting the quantum information carried by the computational qubits. This correction process can be applied to either the undesired input qubits or the fermion parity-dependent quantum gates, and it works for both Majorana-zero-mode-based and Majorana-edge-mode-based topological quantum computation.
Submitted 1 August, 2024; v1 submitted 16 July, 2024;
originally announced July 2024.
-
Search for the rare decay of charmed baryon $Λ_c^+$ into $p μ^+ μ^-$ final state
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1063 additional authors not shown)
Abstract:
A search for the nonresonant $Λ_c^+ \to p μ^+ μ^-$ decay is performed using proton-proton collision data recorded at a centre-of-mass energy of 13 TeV by the LHCb experiment, corresponding to an integrated luminosity of 5.4 fb$^{-1}$. No evidence for the decay is found in the dimuon invariant-mass regions where the expected contribution of resonances is subdominant. The upper limit on the branching fraction of the $Λ_c^+ \to p μ^+ μ^-$ decay is determined to be $2.9~(3.2) \times 10^{-8}$ at 90% (95%) confidence level. The branching fractions in the dimuon invariant-mass regions dominated by the $η$, $ρ$ and $ω$ resonances are also determined.
Submitted 27 September, 2024; v1 submitted 16 July, 2024;
originally announced July 2024.
-
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts
Authors:
Zhenpeng Su,
Zijia Lin,
Xue Bai,
Xing Wu,
Yizhe Xiong,
Haoran Lian,
Guangyuan Ma,
Hui Chen,
Guiguang Ding,
Wei Zhou,
Songlin Hu
Abstract:
Scaling the size of a model enhances its capabilities but significantly increases computational complexity. Mixture-of-Experts (MoE) models address the issue by allowing model size to scale up without substantially increasing training or inference costs. In MoE, there is an important module called the router, which is used to distribute each token to the experts. Currently, the mainstream routing methods include dynamic routing and fixed routing. Despite their promising results, MoE models encounter several challenges. Primarily, for dynamic routing methods, the dispersion of training tokens across multiple experts can lead to underfitting, particularly for infrequent tokens. Additionally, though fixed routing methods can mitigate that issue, they compromise on the diversity of representations. In this paper, we propose MaskMoE, a method designed to enhance token-level learning by employing a routing masking technique within the Mixture-of-Experts model. MaskMoE is capable of maintaining representation diversity while achieving more comprehensive training. Experimental results demonstrate that our method outperforms previous dominant Mixture-of-Experts models in terms of both perplexity (PPL) and downstream task performance.
Submitted 29 August, 2024; v1 submitted 13 July, 2024;
originally announced July 2024.
-
On $\{1,2\}$-distance-balancedness of generalized Petersen graphs
Authors:
Gang Ma,
Jianfeng Wang,
Sandi Klavžar
Abstract:
A connected graph $G$ of diameter ${\rm diam}(G) \ge \ell$ is $\ell$-distance-balanced if $|W_{xy}|=|W_{yx}|$ for every $x,y\in V(G)$ with $d_{G}(x,y)=\ell$, where $W_{xy}$ is the set of vertices of $G$ that are closer to $x$ than to $y$. It is proved that if $k\ge 3$ and $n>k(k+2)$, then the generalized Petersen graph $GP(n,k)$ is not distance-balanced and that $GP(k(k+2),k)$ is distance-balanced. This significantly improves the main result of Yang et al.\ [Electron.\ J.\ Combin.\ 16 (2009) \#N33]. It is also proved that if $k\ge 6$, where $k$ is even, and $n>\frac{5}{4}k^2+2k$, or if $k\ge 5$, where $k$ is odd, and $n>\frac{7}{4}k^2+\frac{3}{4}k$, then $GP(n,k)$ is not $2$-distance-balanced. These results partially resolve a conjecture of Miklavič and Šparl [Discrete Appl.\ Math.\ 244 (2018) 143--154].
Submitted 2 July, 2024;
originally announced July 2024.
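The definition in the abstract above can be checked by brute force on small graphs. The following is a minimal sketch, not the authors' method: the function names and the vertex numbering ($u_i \mapsto i$, $v_i \mapsto n+i$) are assumptions made for illustration, and the check simply compares $|W_{xy}|$ and $|W_{yx}|$ for every pair at distance $\ell$ using breadth-first search.

```python
from collections import deque


def generalized_petersen(n, k):
    """Adjacency sets of GP(n, k): outer cycle u_i, inner vertices v_i, spokes u_i v_i."""
    adj = {v: set() for v in range(2 * n)}  # numbering assumption: u_i -> i, v_i -> n + i

    def link(a, b):
        adj[a].add(b)
        adj[b].add(a)

    for i in range(n):
        link(i, (i + 1) % n)            # outer cycle edge u_i u_{i+1}
        link(i, n + i)                  # spoke u_i v_i
        link(n + i, n + (i + k) % n)    # inner edge v_i v_{i+k}
    return adj


def bfs_distances(adj, source):
    """Shortest-path distances from source to every reachable vertex."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        x = queue.popleft()
        for y in adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return dist


def is_distance_balanced(adj, ell=1):
    """True if |W_xy| == |W_yx| for every pair x, y with d(x, y) == ell.

    Assumes a connected graph; returns True vacuously if no pair is at distance ell.
    """
    dist = {v: bfs_distances(adj, v) for v in adj}
    for x in adj:
        for y in adj:
            if dist[x][y] != ell:
                continue
            w_xy = sum(1 for z in adj if dist[x][z] < dist[y][z])  # closer to x
            w_yx = sum(1 for z in adj if dist[y][z] < dist[x][z])  # closer to y
            if w_xy != w_yx:
                return False
    return True
```

Calling `is_distance_balanced(generalized_petersen(n, k), ell)` for small `n` and `k` gives a way to compare individual cases against the thresholds stated in the abstract; the all-pairs search costs roughly cubic time in the number of vertices, so it is only practical for small graphs.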
-
Probing the nature of the $χ_{c1}(3872)$ state using radiative decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis, et al. (1094 additional authors not shown)
Abstract:
The radiative decays $χ_{c1}(3872)\rightarrow ψ(2S)γ$ and $χ_{c1}(3872)\rightarrow J/ψγ$ are used to probe the nature of the $χ_{c1}(3872)$ state using proton-proton collision data collected with the LHCb detector, corresponding to an integrated luminosity of 9 fb$^{-1}$. Using the $B^+\rightarrow χ_{c1}(3872)K^+$ decay, the $χ_{c1}(3872)\rightarrow ψ(2S)γ$ process is observed for the first time and the ratio of its partial width to that of the $χ_{c1}(3872)\rightarrow J/ψγ$ decay is measured to be $$ \frac{Γ_{χ_{c1}(3872)\rightarrow ψ(2S)γ}}{Γ_{χ_{c1}(3872)\rightarrow J/ψγ}} = 1.67 \pm 0.21 \pm 0.12 \pm 0.04 , $$ where the first uncertainty is statistical, the second systematic and the third is due to the uncertainties on the branching fractions of the $ψ(2S)$ and $J/ψ$ mesons. The measured ratio makes the interpretation of the $χ_{c1}(3872)$ state as a pure $D^0\bar{D}^{*0}+\bar{D}^0D^{*0}$ molecule questionable and strongly indicates a sizeable compact charmonium or tetraquark component within the $χ_{c1}(3872)$ state.
Submitted 24 June, 2024;
originally announced June 2024.
-
Search for fractionally charged particles with CUORE
Authors:
CUORE Collaboration,
D. Q. Adams,
C. Alduino,
K. Alfonso,
F. T. Avignone III,
O. Azzolini,
G. Bari,
F. Bellini,
G. Benato,
M. Beretta,
M. Biassoni,
A. Branca,
C. Brofferio,
C. Bucci,
J. Camilleri,
A. Caminata,
A. Campani,
J. Cao,
S. Capelli,
C. Capelli,
L. Cappelli,
L. Cardani,
P. Carniti,
N. Casali,
E. Celi, et al. (95 additional authors not shown)
Abstract:
The Cryogenic Underground Observatory for Rare Events (CUORE) is a detector array composed of 988 5$\;$cm$\times$5$\;$cm$\times$5$\;$cm TeO$_2$ crystals held below 20 mK, primarily searching for neutrinoless double-beta decay in $^{130}$Te. Unprecedented in size amongst cryogenic calorimetric experiments, CUORE provides a promising setting for the study of exotic through-going particles. Using the first tonne-year of CUORE's exposure, we perform a search for hypothesized fractionally charged particles (FCPs), which are well-motivated by various Standard Model extensions and would have suppressed interactions with matter. No excess of FCP candidate tracks is observed over background, setting leading limits on the underground FCP flux with charges between $e/24$ and $e/5$ at 90\% confidence level. Using the low background environment and segmented geometry of CUORE, we establish the sensitivity of tonne-scale sub-Kelvin detectors to diverse signatures of new physics.
Submitted 18 June, 2024;
originally announced June 2024.
-
Precision measurement of the $Ξ^-_b$ baryon lifetime
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis, et al. (1064 additional authors not shown)
Abstract:
A sample of $pp$ collision data, corresponding to an integrated luminosity of 5.5 fb$^{-1}$ and collected by the LHCb experiment during LHC Run 2, is used to measure the ratio of the lifetime of the $Ξ^-_b$ baryon to that of the $Λ^0_b$ baryon, $r_τ\equivτ_{Ξ^-_b}/τ_{Λ^0_b}$. The value ${r_τ=1.076\pm0.013\pm0.006}$ is obtained, where the first uncertainty is statistical and the second systematic. This value is averaged with the corresponding value from Run 1 to obtain ${r_τ^{\rm Run\,1,2} = 1.078\pm0.012\pm0.007}$. Multiplying by the world-average value of the $Λ^0_b$ lifetime yields $τ_{Ξ^-_b}^{\rm Run~1,2} = 1.578\pm0.018\pm0.010\pm0.011$ ps, where the uncertainties are statistical, systematic, and due to the limited knowledge of the $Λ^0_b$ lifetime. This measurement improves the precision of the current world average of the $Ξ^-_b$ lifetime by about a factor of two, and is in good agreement with the most recent theoretical predictions.
Submitted 4 October, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
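The step from the lifetime ratio to the absolute lifetime in the abstract above is a single multiplication, and the quoted statistical and systematic uncertainties scale with it. The sketch below reproduces that arithmetic. One input is an assumption: the abstract does not state the world-average $Λ^0_b$ lifetime, so it is back-calculated here from the two quoted central values rather than taken from any external source.

```python
# Values quoted in the abstract (Run 1 + Run 2 average of the lifetime ratio).
r_tau, r_stat, r_syst = 1.078, 0.012, 0.007

# Quoted Xi_b^- lifetime in ps and its third uncertainty (from the Lambda_b^0 lifetime).
tau_xib, tau_xib_ext = 1.578, 0.011

# ASSUMPTION: the Lambda_b^0 world average is not given in the abstract;
# it is inferred here from tau(Xi_b) = r_tau * tau(Lambda_b).
tau_lb = tau_xib / r_tau
tau_lb_err = tau_xib_ext / r_tau


def scale(ratio_uncertainty):
    """Propagate an uncertainty on r_tau to tau(Xi_b) = r_tau * tau(Lambda_b)."""
    return ratio_uncertainty * tau_lb


stat = scale(r_stat)   # statistical uncertainty on tau(Xi_b), in ps
syst = scale(r_syst)   # systematic uncertainty on tau(Xi_b), in ps

print(f"implied tau(Lambda_b) = {tau_lb:.3f} +- {tau_lb_err:.3f} ps")
print(f"tau(Xi_b) = {tau_xib:.3f} +- {stat:.3f} +- {syst:.3f} +- {tau_xib_ext:.3f} ps")
```

Rounded to three decimals, the scaled statistical and systematic terms agree with the $\pm0.018\pm0.010$ quoted in the abstract, which is a useful consistency check on the reading of the three uncertainties.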
-
MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents
Authors:
Luyuan Wang,
Yongyu Deng,
Yiwei Zha,
Guodong Mao,
Qinmin Wang,
Tianchen Min,
Wei Chen,
Shoufa Chen
Abstract:
Large language model (LLM)-based mobile agents are increasingly popular due to their capability to interact directly with mobile phone Graphical User Interfaces (GUIs) and their potential to autonomously manage daily tasks. Despite their promising prospects in both academic and industrial sectors, little research has focused on benchmarking the performance of existing mobile agents, due to the inexhaustible states of apps and the vague definition of feasible action sequences. To address this challenge, we propose an efficient and user-friendly benchmark, MobileAgentBench, designed to alleviate the burden of extensive manual testing. We initially define 100 tasks across 10 open-source apps, categorized by multiple levels of difficulty. Subsequently, we evaluate several existing mobile agents, including AppAgent and MobileAgent, to thoroughly and systematically compare their performance. All materials are accessible on our project webpage: https://MobileAgentBench.github.io, contributing to the advancement of both academic and industrial fields.
Submitted 12 June, 2024;
originally announced June 2024.
-
Disrupting Bipartite Trading Networks: Matching for Revenue Maximization
Authors:
Luca D'Amico-Wong,
Yannai A. Gonczarowski,
Gary Qiurui Ma,
David C. Parkes
Abstract:
We model the role of an online platform disrupting a market with unit-demand buyers and unit-supply sellers. Each seller can transact with a subset of the buyers whom she already knows, as well as with any additional buyers to whom she is introduced by the platform. Given these constraints on trade, prices and transactions are induced by a competitive equilibrium. The platform's revenue is proportional to the total price of all trades between platform-introduced buyers and sellers.
In general, we show that the platform's revenue-maximization problem is computationally intractable. We provide structural results for revenue-optimal matchings and isolate special cases in which the platform can efficiently compute them. Furthermore, in a market where the maximum increase in social welfare that the platform can create is $ΔW$, we prove that the platform can attain revenue $Ω(ΔW/\log(\min\{n,m\}))$, where $n$ and $m$ are the numbers of buyers and sellers, respectively. When $ΔW$ is large compared to welfare without the platform, this gives a polynomial-time algorithm that guarantees a logarithmic approximation of the optimal welfare as revenue. We also show that even when the platform optimizes for revenue, the social welfare is at least an $O(\log(\min\{n,m\}))$-approximation to the optimal welfare. Finally, we prove significantly stronger bounds for revenue and social welfare in homogeneous-goods markets.
Submitted 11 June, 2024;
originally announced June 2024.
-
Bayesian inference of nuclear incompressibility from proton elliptic flow in central Au+Au collisions at 400 MeV/nucleon
Authors:
J. M. Wang,
X. G. Deng,
W. J. Xie,
B. A. Li,
Y. G. Ma
Abstract:
The incompressibility $K$ of symmetric nuclear matter (SNM) is inferred in a Bayesian analysis of proton elliptic flow in mid-central Au + Au collisions at $E = 400$ MeV/nucleon using a Gaussian process (GP) emulator of the isospin-dependent quantum molecular dynamics (IQMD) model for heavy-ion collisions, with or without considering the momentum dependence of single-nucleon potentials. Considering the momentum dependence of nucleon potentials, $K=191.3^{+3.7}_{-6.3}$ MeV at 68\% confidence level, indicating a very soft SNM equation of state, is inferred from the combined data on the rapidity and transverse-momentum dependence of the proton elliptic flow in the Au+Au collisions considered. This value is consistent with, but has smaller quantified uncertainties than, previous results from forward modeling of the collective flow in heavy-ion collisions using IQMD. Ignoring the momentum dependence of single-nucleon potentials, the extracted value for $K$ is $234.7^{+14.6}_{-11.4}$ MeV, in agreement with its fiducial value derived from giant resonance studies.
Submitted 11 June, 2024;
originally announced June 2024.
-
Baking Symmetry into GFlowNets
Authors:
George Ma,
Emmanuel Bengio,
Yoshua Bengio,
Dinghuai Zhang
Abstract:
GFlowNets have exhibited promising performance in generating diverse candidates with high rewards. These networks generate objects incrementally and aim to learn a policy that assigns probability of sampling objects in proportion to rewards. However, the current training pipelines of GFlowNets do not consider the presence of isomorphic actions, which are actions resulting in symmetric or isomorphic states. This lack of symmetry increases the number of samples required for training GFlowNets and can result in inefficient and potentially incorrect flow functions. As a consequence, the reward and diversity of the generated objects decrease. In this study, our objective is to integrate symmetries into GFlowNets by identifying equivalent actions during the generation process. Experimental results using synthetic data demonstrate the promising performance of our proposed approaches.
Submitted 8 June, 2024;
originally announced June 2024.
-
Measurement of the branching fraction ratios $R(D^{+})$ and $R(D^{*+})$ using muonic $τ$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis, et al. (1063 additional authors not shown)
Abstract:
The branching fraction ratios of $\overline{B}^0\to D^+τ^-\overlineν_τ$ and $\overline{B}^0\to D^{*+}τ^-\overlineν_τ$ decays are measured with respect to their muonic counterparts, using a data sample corresponding to an integrated luminosity of 2.0 fb$^{-1}$ collected by the LHCb experiment in proton-proton collisions at $\sqrt{s} = 13$ TeV. The reconstructed final states are formed by combining $D^+$ mesons with $τ^-\toμ^-\overlineν_μν_τ$ candidates, where the $D^+$ is reconstructed via the $D^+\to K^-π^+π^+$ decay. The results are
\begin{align*}
R(D^{+}) &= 0.249 \pm 0.043 \pm 0.047, \\
R(D^{*+}) &= 0.402 \pm 0.081 \pm 0.085,
\end{align*}
where the first uncertainties are statistical and the second systematic. The two measurements have a correlation coefficient of $-0.39$ and are compatible with the Standard Model.
Submitted 5 June, 2024;
originally announced June 2024.
-
Observation of new charmonium(-like) states in $B^+ \to D^{*\pm} D^{\mp} K^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis, et al. (1062 additional authors not shown)
Abstract:
A study of resonant structures in $B^{+}\rightarrow{D^{\ast+}D^{-}K^{+}}$ and $B^{+}\rightarrow{D^{\ast-}D^{+}K^{+}}$ decays is performed, using proton-proton collision data at centre-of-mass energies of $\sqrt{s}=7, 8$, and $13$ TeV recorded by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. A simultaneous amplitude fit is performed to the two channels with contributions from resonances decaying to $D^{\ast-}D^{+}$ and $D^{\ast+}D^{-}$ states linked by $C$ parity. This procedure allows the $C$-parities of resonances in the $D^{\ast\pm}D^{\mp}$ mass spectra to be determined. Four charmonium(-like) states are observed decaying into $D^{\ast\pm}D^{\mp}$: $η_c(3945)$, $h_c(4000)$, $χ_{c1}(4010)$ and $h_c(4300)$, with quantum numbers $J^{PC}$ equal to $0^{-+}$, $1^{+-}$, $1^{++}$ and $1^{+-}$, respectively. At least three of these states have not been observed previously. In addition, the existence of the $T_{\bar{c}\bar{s}0}^{*}(2870)^{0}$ and $T_{\bar{c}\bar{s}1}^{*}(2900)^{0}$ resonances in the $D^-K^+$ mass spectrum, already observed in the $B^+ \to D^+ D^- K^+$ decay, is confirmed in a different production channel.
Submitted 12 October, 2024; v1 submitted 5 June, 2024;
originally announced June 2024.
-
FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models
Authors:
Tao Fan,
Guoqiang Ma,
Yan Kang,
Hanlin Gu,
Yuanfeng Song,
Lixin Fan,
Kai Chen,
Qiang Yang
Abstract:
Recent research in federated large language models (LLMs) has primarily focused on enabling clients to fine-tune their locally deployed homogeneous LLMs collaboratively or on transferring knowledge from server-based LLMs to small language models (SLMs) at downstream clients. However, a significant gap remains in the simultaneous mutual enhancement of both the server's LLM and clients' SLMs. To bridge this gap, we propose FedMKT, a parameter-efficient federated mutual knowledge transfer framework for large and small language models. This framework is designed to adaptively transfer knowledge from the server's LLM to clients' SLMs while concurrently enriching the LLM with clients' unique domain insights. We facilitate token alignment using minimum edit distance (MinED) and then selective mutual knowledge transfer between client-side SLMs and a server-side LLM, aiming to collectively enhance their performance. Through extensive experiments across three distinct scenarios, we evaluate the effectiveness of FedMKT using various public LLMs and SLMs on a range of NLP text generation tasks. Empirical results demonstrate that FedMKT simultaneously boosts the performance of both LLMs and SLMs.
Submitted 18 June, 2024; v1 submitted 4 June, 2024;
originally announced June 2024.
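The abstract above mentions token alignment via minimum edit distance (MinED) between the vocabularies of different models. A minimal sketch of that idea follows, under loudly stated assumptions: the function names, the toy vocabularies, and the first-match tie-breaking are all hypothetical and are not taken from the FedMKT implementation. Only the Levenshtein distance itself is standard.

```python
def edit_distance(a, b):
    """Levenshtein distance between strings a and b (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))          # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]                          # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def align_tokens(source_vocab, target_vocab):
    """Map each source token to the target token with minimum edit distance.

    Ties are broken by order in target_vocab (an arbitrary choice for this sketch).
    """
    return {tok: min(target_vocab, key=lambda cand: edit_distance(tok, cand))
            for tok in source_vocab}


# Hypothetical toy vocabularies standing in for two tokenizers.
slm_vocab = ["_get", "utili"]
llm_vocab = ["get", "utilize", "ting"]
print(align_tokens(slm_vocab, llm_vocab))
```

This exhaustive search costs one distance computation per pair of tokens, so a real vocabulary would likely need exact-match shortcuts or caching; that optimisation is left out here to keep the mapping rule visible.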
-
Amplitude analysis of the radiative decay $B^0_s\to K^+K^-γ$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis, et al. (1061 additional authors not shown)
Abstract:
A search for radiative decay of $B^0_s$ mesons to orbitally excited $K^+K^-$ states is performed using proton-proton collisions recorded by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. The dikaon spectrum in the mass range $m_{KK}<2400$ MeV/$c^2$ is dominated by the $φ(1020)$ resonance that accounts for almost 70\% of the decay rate. Considering the possible contributions of $f_2(1270)$, $f'_2(1525)$ and $f_2(2010)$ meson states, the overall tensor contribution to the amplitude is measured to be
\begin{equation}
{\cal F}_{\{f_2\}}=16.8\pm 0.5\mathrm{~(stat.)}\pm0.7\mathrm{~(syst.)}\%,\nonumber
\end{equation}
mostly dominated by the $f'_2(1525)$ state. Several statistically equivalent solutions are obtained for the detailed resonant structure depending on whether the smaller amplitudes interfere destructively or constructively with the dominant amplitude. The preferred solution that corresponds to the lowest values of the fit fractions along with constructive interference leads to the relative branching ratio measurement
\begin{equation}
\frac{{\cal B}(B^0_s\to f'_2γ)}{{\cal B}(B^0_s\toφγ)}= 19.4^{+0.9}_{-0.8}\mathrm{~(stat.)}{}^{+1.4}_{-0.5}\mathrm{~(syst.)}\pm0.5\mathrm{~(\cal{B})}\%\nonumber,
\end{equation}
where the last uncertainty is due to the ratio of measured branching fractions to the $K^+K^-$ final state. This result represents the first observation of the radiative $B^0_s\to f'_2(1525)γ$ decay, which is the second radiative transition observed in the $B^0_s$ sector.
Submitted 21 August, 2024; v1 submitted 31 May, 2024;
originally announced June 2024.
-
A Canonization Perspective on Invariant and Equivariant Learning
Authors:
George Ma,
Yifei Wang,
Derek Lim,
Stefanie Jegelka,
Yisen Wang
Abstract:
In many applications, we desire neural networks to exhibit invariance or equivariance to certain groups due to symmetries inherent in the data. Recently, frame-averaging methods have emerged as a unified framework for attaining symmetries efficiently by averaging over input-dependent subsets of the group, i.e., frames. What we currently lack is a principled understanding of the design of frames. In this work, we introduce a canonization perspective that provides an essential and complete view of the design of frames. Canonization is a classic approach for attaining invariance by mapping inputs to their canonical forms. We show that there exists an inherent connection between frames and canonical forms. Leveraging this connection, we can efficiently compare the complexity of frames as well as determine the optimality of certain frames. Guided by this principle, we design novel frames for eigenvectors that are strictly superior to existing methods -- some are even optimal -- both theoretically and empirically. The reduction to the canonization perspective further uncovers equivalences between previous methods. These observations suggest that canonization provides a fundamental understanding of existing frame-averaging methods and unifies existing equivariant and invariant learning methods.
Submitted 29 May, 2024; v1 submitted 28 May, 2024;
originally announced May 2024.
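The contrast drawn in the abstract above, between averaging over group elements and mapping an input to a canonical form, can be illustrated on the simplest case of permutation invariance. This is a toy sketch only: the function `f`, the choice of sorting as the canonical form, and all names are assumptions for illustration and do not reproduce the paper's frames for eigenvectors.

```python
from itertools import permutations


def f(x):
    """An arbitrary function that is NOT permutation-invariant (position-weighted sum)."""
    return sum((i + 1) * v for i, v in enumerate(x))


def canonize(x):
    """Canonical form under permutations: the sorted tuple."""
    return tuple(sorted(x))


def invariant_by_canonization(x):
    """Invariance from a single evaluation of f on the canonical representative."""
    return f(canonize(x))


def invariant_by_group_averaging(x):
    """Invariance from averaging f over all n! orderings of x (the full group)."""
    perms = list(permutations(x))
    return sum(f(p) for p in perms) / len(perms)


a, b = (1, 2, 3), (3, 2, 1)          # two orderings of the same multiset
print(f(a), f(b))                    # f itself depends on the ordering
print(invariant_by_canonization(a), invariant_by_canonization(b))
print(invariant_by_group_averaging(a), invariant_by_group_averaging(b))
```

Both wrappers return the same value for every ordering of the input, but canonization evaluates `f` once while the full average evaluates it $n!$ times, which is the kind of complexity gap the canonization viewpoint is meant to expose.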
-
Data-driven background model for the CUORE experiment
Authors:
CUORE Collaboration,
D. Q. Adams,
C. Alduino,
K. Alfonso,
F. T. Avignone III,
O. Azzolini,
G. Bari,
F. Bellini,
G. Benato,
M. Beretta,
M. Biassoni,
A. Branca,
C. Brofferio,
C. Bucci,
J. Camilleri,
A. Caminata,
A. Campani,
J. Cao,
S. Capelli,
C. Capelli,
L. Cappelli,
L. Cardani,
P. Carniti,
N. Casali,
E. Celi, et al. (93 additional authors not shown)
Abstract:
We present the model we developed to reconstruct the CUORE radioactive background based on the analysis of an experimental exposure of 1038.4 kg yr. The data reconstruction relies on a simultaneous Bayesian fit applied to energy spectra over a broad energy range. The high granularity of the CUORE detector, together with the large exposure and extended stable operations, allows for an in-depth exploration of both spatial and time dependence of backgrounds. We achieve high sensitivity to both bulk and surface activities of the materials of the setup, detecting levels as low as 10 nBq kg$^{-1}$ and 0.1 nBq cm$^{-2}$, respectively. We compare the contamination levels we extract from the background model with prior radio-assay data, which informs future background risk mitigation strategies. The results of this background model play a crucial role in constructing the background budget for the CUPID experiment as it will exploit the same CUORE infrastructure.
Submitted 28 May, 2024;
originally announced May 2024.
-
Comprehensive analysis of local and nonlocal amplitudes in the $B^0\rightarrow K^{*0}μ^+μ^-$ decay
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis, et al. (1070 additional authors not shown)
Abstract:
A comprehensive study of the local and nonlocal amplitudes contributing to the decay $B^0\rightarrow K^{*0}(\to K^+π^-) μ^+μ^-$ is performed by analysing the phase-space distribution of the decay products. The analysis is based on $pp$ collision data corresponding to an integrated luminosity of 8.4 fb$^{-1}$ collected by the LHCb experiment. This measurement employs for the first time a model of both one-particle and two-particle nonlocal amplitudes, and utilises the complete dimuon mass spectrum without any veto regions around the narrow charmonium resonances. In this way it is possible to explicitly isolate the local and nonlocal contributions and capture the interference between them. The results show that interference with nonlocal contributions, although larger than predicted, only has a minor impact on the Wilson Coefficients determined from the fit to the data. For the local contributions, the Wilson Coefficient $C_9$, responsible for vector dimuon currents, exhibits a $2.1σ$ deviation from the Standard Model expectation. The Wilson Coefficients $C_{10}$, $C_{9}'$ and $C_{10}'$ are all in better agreement than $C_{9}$ with the Standard Model and the global significance is at the level of $1.5σ$. The model used also accounts for nonlocal contributions from $B^{0}\to K^{*0}\left[τ^+τ^-\to μ^+μ^-\right]$ rescattering, resulting in the first direct measurement of the $bsττ$ vector effective-coupling $C_{9τ}$.
Submitted 10 September, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
A structure-aware framework for learning device placements on computation graphs
Authors:
Shukai Duan,
Heng Ping,
Nikos Kanakaris,
Xiongye Xiao,
Peiyu Zhang,
Panagiotis Kyriakis,
Nesreen K. Ahmed,
Guixiang Ma,
Mihai Capota,
Shahin Nazarian,
Theodore L. Willke,
Paul Bogdan
Abstract:
Existing approaches for device placement ignore the topological features of computation graphs and rely mostly on heuristic methods for graph partitioning. At the same time, they either follow a grouper-placer or an encoder-placer architecture, which requires understanding the interaction structure between code operations. To bridge the gap between encoder-placer and grouper-placer techniques, we propose a novel framework for the task of device placement, relying on smaller computation graphs extracted from the OpenVINO toolkit using reinforcement learning. The framework consists of five steps, including graph coarsening, node representation learning and policy optimization. It facilitates end-to-end training and takes into consideration the directed and acyclic nature of the computation graphs. We also propose a model variant, inspired by graph parsing networks and complex network analysis, enabling graph representation learning and personalized graph partitioning jointly, using an unspecified number of groups. To train the entire framework, we utilize reinforcement learning techniques by employing the execution time of the suggested device placements to formulate the reward. We demonstrate the flexibility and effectiveness of our approach through multiple experiments with three benchmark models, namely Inception-V3, ResNet, and BERT. The robustness of the proposed framework is also highlighted through an ablation study. The suggested placements improve the inference speed for the benchmark models by up to $58.2\%$ over CPU execution and by up to $60.24\%$ compared to other commonly used baselines.
Submitted 23 May, 2024;
originally announced May 2024.