subscribe to arXiv mailings

Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs

Authors: Kang Zhao, Tao Yuan, Han Bao, Zhenfeng Su, Chang Gao, Zhaofeng Sun, Zichen Liang, Liping Jing, Jianfei Chen

Abstract: To date, 2:4 sparsity has stood as the only sparse pattern that can be accelerated using sparse tensor cores on GPUs. In practice, 2:4 sparsity often possesses low actual speedups ($\leq 1.3$) and requires fixed sparse ratios, meaning that other ratios, such as 4:8, 8:16, or those exceeding 50% sparsity, do not incur any speedups on GPUs. Recent studies suggest that V:N:M sparsity is promising in… ▽ More To date, 2:4 sparsity has stood as the only sparse pattern that can be accelerated using sparse tensor cores on GPUs. In practice, 2:4 sparsity often possesses low actual speedups ($\leq 1.3$) and requires fixed sparse ratios, meaning that other ratios, such as 4:8, 8:16, or those exceeding 50% sparsity, do not incur any speedups on GPUs. Recent studies suggest that V:N:M sparsity is promising in addressing these limitations of 2:4 sparsity. However, regarding accuracy, the effects of V:N:M sparsity on broader Transformer models, such as vision Transformers and large language models (LLMs), are largely unexamined. Moreover, Some specific issues related to V:N:M sparsity, such as how to select appropriate V and M values, remain unresolved. In this study, we thoroughly investigate the application of V:N:M sparsity in vision models and LLMs across multiple tasks, from pertaining to downstream tasks. We propose three key approaches to enhance the applicability and accuracy of V:N:M-sparse Transformers, including heuristic V and M selection, V:N:M-specific channel permutation, and three-staged LoRA training techniques. Experimental results show that, with our methods, the DeiT-small achieves lossless accuracy at 64:2:5 sparsity, while the DeiT-base maintains accuracy even at 64:2:8 sparsity. In addition, the fine-tuned LLama2-7B at 64:2:5 sparsity performs comparably or better than training-free 2:4 sparse alternatives on downstream tasks. More importantly, V:N:M-sparse Transformers offer a wider range of speedup-accuracy trade-offs compared to 2:4 sparsity. Overall, our exploration largely facilitates the V:N:M sparsity to act as a truly effective acceleration solution for Transformers in cost-sensitive inference scenarios. △ Less

Submitted 21 October, 2024; originally announced October 2024.

arXiv:2409.01559 [pdf, other]

PR2: A Physics- and Photo-realistic Testbed for Embodied AI and Humanoid Robots

Authors: Hangxin Liu, Qi Xie, Zeyu Zhang, Tao Yuan, Xiaokun Leng, Lining Sun, Song-Chun Zhu, Jingwen Zhang, Zhicheng He, Yao Su

Abstract: This paper presents the development of a Physics-realistic and Photo-\underline{r}ealistic humanoid robot testbed, PR2, to facilitate collaborative research between Embodied Artificial Intelligence (Embodied AI) and robotics. PR2 offers high-quality scene rendering and robot dynamic simulation, enabling (i) the creation of diverse scenes using various digital assets, (ii) the integration of advanc… ▽ More This paper presents the development of a Physics-realistic and Photo-\underline{r}ealistic humanoid robot testbed, PR2, to facilitate collaborative research between Embodied Artificial Intelligence (Embodied AI) and robotics. PR2 offers high-quality scene rendering and robot dynamic simulation, enabling (i) the creation of diverse scenes using various digital assets, (ii) the integration of advanced perception or foundation models, and (iii) the implementation of planning and control algorithms for dynamic humanoid robot behaviors based on environmental feedback. The beta version of PR2 has been deployed for the simulation track of a nationwide full-size humanoid robot competition for college students, attracting 137 teams and over 400 participants within four months. This competition covered traditional tasks in bipedal walking, as well as novel challenges in loco-manipulation and language-instruction-based object search, marking a first for public college robotics competitions. A retrospective analysis of the competition suggests that future events should emphasize the integration of locomotion with manipulation and perception. By making the PR2 testbed publicly available at https://github.com/pr2-humanoid/PR2-Platform, we aim to further advance education and training in humanoid robotics. △ Less

Submitted 2 September, 2024; originally announced September 2024.

arXiv:2408.12429 [pdf, other]

FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing

Authors: Jue Wang, Yuxiang Lin, Tianshuo Yuan, Zhi-Qi Cheng, Xiaolong Wang, Jiao GH, Wei Chen, Xiaojiang Peng

Abstract: Combining Vision Large Language Models (VLLMs) with diffusion models offers a powerful method for executing image editing tasks based on human language instructions. However, language instructions alone often fall short in accurately conveying user requirements, particularly when users want to add, replace elements in specific areas of an image. Luckily, masks can effectively indicate the exact lo… ▽ More Combining Vision Large Language Models (VLLMs) with diffusion models offers a powerful method for executing image editing tasks based on human language instructions. However, language instructions alone often fall short in accurately conveying user requirements, particularly when users want to add, replace elements in specific areas of an image. Luckily, masks can effectively indicate the exact locations or elements to be edited, while they require users to precisely draw the shapes at the desired locations, which is highly user-unfriendly. To address this, we propose FlexEdit, an end-to-end image editing method that leverages both free-shape masks and language instructions for Flexible Editing. Our approach employs a VLLM in comprehending the image content, mask, and user instructions. Additionally, we introduce the Mask Enhance Adapter (MEA) that fuses the embeddings of the VLLM with the image data, ensuring a seamless integration of mask information and model output embeddings. Furthermore, we construct FSMI-Edit, a benchmark specifically tailored for free-shape mask, including 8 types of free-shape mask. Extensive experiments show that our method achieves state-of-the-art (SOTA) performance in LLM-based image editing, and our simple prompting technique stands out in its effectiveness. The code and data can be found at https://github.com/A-new-b/flex_edit. △ Less

Submitted 22 August, 2024; originally announced August 2024.

Comments: 15 pages, 14 figures

arXiv:2408.11626 [pdf, other]

Exploring Abelian-Non-Abelian Kinetic Mixing in SMEFT and Beyond

Authors: Van Que Tran, Tzu-Chiang Yuan

Abstract: We explore a novel scenario involving Abelian-non-Abelian kinetic mixing within the framework of the Standard Model Effective Field Theory (SMEFT) and its extension with a real triplet scalar field. In SMEFT, this mixing arises exclusively from a dimension-6 operator involving the Standard Model Higgs doublet, while the real triplet scalar field introduces an additional dimension-5 operator. We de… ▽ More We explore a novel scenario involving Abelian-non-Abelian kinetic mixing within the framework of the Standard Model Effective Field Theory (SMEFT) and its extension with a real triplet scalar field. In SMEFT, this mixing arises exclusively from a dimension-6 operator involving the Standard Model Higgs doublet, while the real triplet scalar field introduces an additional dimension-5 operator. We derive the modifications to electroweak gauge boson properties and impose constraints using electroweak precision data. In SMEFT, we find that $Z$ pole data at LEP-I imposes a stringent constraint on the kinetic mixing parameter, requiring it to be less than $O$($10^{-4}$), which corresponds to a new physics scale of about 10 TeV. In the SMEFT+triplet scenario, the constraint can be significantly relaxed with a sizeable triplet vacuum expectation value while preserving custodial symmetry in a finely-tuned parameter space. Future measurements from the Circular Electron Positron Collider could probe the kinetic mixing parameter down to an order of magnitude smaller. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: 15 pages, 2 figures

arXiv:2408.10828 [pdf, other]

Scalable DAQ system operating the CHIPS-5 neutrino detector

Authors: Belén Alonso Rancurel, Son Cao, Thomas J. Carroll, Rhys Castellan, Erika Catano-Mur, John P. Cesar, João A. B. Coelho, Patrick Dills, Thomas Dodwell, Jack Edmondson, Daan van Eijk, Quinn Fetterly, Zoé Garbal, Stefano Germani, Thomas Gilpin, Anthony Giraudo, Alec Habig, Daniel Hanuska, Harry Hausner, Wilson Y. Hernandez, Anna Holin, Junting Huang, Sebastian B. Jones, Albrecht Karle, George Kileff , et al. (35 additional authors not shown)

Abstract: The CHIPS R&D project focuses on development of low-cost water Cherenkov neutrino detectors through novel design strategies and resourceful engineering. This work presents an end-to-end DAQ solution intended for a recent 5 kt CHIPS prototype, which is largely based on affordable mass-produced components. Much like the detector itself, the presented instrumentation is composed of modular arrays tha… ▽ More The CHIPS R&D project focuses on development of low-cost water Cherenkov neutrino detectors through novel design strategies and resourceful engineering. This work presents an end-to-end DAQ solution intended for a recent 5 kt CHIPS prototype, which is largely based on affordable mass-produced components. Much like the detector itself, the presented instrumentation is composed of modular arrays that can be scaled up and easily serviced. A single such array can carry up to 30 photomultiplier tubes (PMTs) accompanied by electronics that generate high voltage in-situ and deliver time resolution of up to 0.69 ns. In addition, the technology is compatible with the White Rabbit timing system, which can synchronize its elements to within 100 ps. While deployment issues did not permit the presented DAQ system to operate beyond initial evaluation, the presented hardware and software successfully passed numerous commissioning tests that demonstrated their viability for use in a large-scale neutrino detector, instrumented with thousands of PMTs. △ Less

Submitted 20 August, 2024; originally announced August 2024.

Comments: 30 pages, 28 figures, submitted to MDPI Applied Sciences, Special Issue: Advanced Neutrino Detector Development and Application

arXiv:2408.05167 [pdf, other]

Gravitational Waves and Dark Matter in the Gauged Two-Higgs Doublet Model

Authors: Michael J. Ramsey-Musolf, Van Que Tran, Tzu-Chiang Yuan

Abstract: We investigate the possibility of a strong first-order electroweak phase transition during the early universe within the framework of the gauged two-Higgs doublet model (G2HDM) and explore its detectability through stochastic gravitational wave signals. The G2HDM introduces a dark replica of the Standard Model electroweak gauge group, inducing an accidental $Z_2$ symmetry which not only leads to a… ▽ More We investigate the possibility of a strong first-order electroweak phase transition during the early universe within the framework of the gauged two-Higgs doublet model (G2HDM) and explore its detectability through stochastic gravitational wave signals. The G2HDM introduces a dark replica of the Standard Model electroweak gauge group, inducing an accidental $Z_2$ symmetry which not only leads to a simple scalar potential at tree-level but also offers a compelling vectorial dark matter candidate. Using the high temperature expansion in the effective potential that manifests gauge invariance, we find a possible two-step phase transition pattern in the model with a strong first-order transition occurring in the second step at the electroweak scale temperature. Collider data from the LHC plays a crucial role in constraining the parameter space conducive to this two-step transition. Furthermore, satisfying the nucleation condition necessitates the masses of scalar bosons in the hidden sector to align with the electroweak scale, potentially probed by future collider detectors. The stochastic gravitational wave energy spectrum associated with the phase transition is computed. The results indicate that forthcoming detectors such as BBO, LISA, DECIGO, TianQin and Taiji could potentially detect the gravitational wave signals generated by the first-order phase transition. Additionally, we find that the parameter space probed by gravitational waves can also be searched for in future dark matter direct detection experiments, in particular those designed for dark matter masses in the sub-GeV range using the superfluid Helium target detectors. △ Less

Submitted 9 August, 2024; originally announced August 2024.

Comments: 71 pages, 9 figures

arXiv:2408.02544 [pdf, other]

Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions

Authors: Xinbei Ma, Yiting Wang, Yao Yao, Tongxin Yuan, Aston Zhang, Zhuosheng Zhang, Hai Zhao

Abstract: This paper investigates the faithfulness of multimodal large language model (MLLM) agents in the graphical user interface (GUI) environment, aiming to address the research question of whether multimodal GUI agents can be distracted by environmental context. A general setting is proposed where both the user and the agent are benign, and the environment, while not malicious, contains unrelated conte… ▽ More This paper investigates the faithfulness of multimodal large language model (MLLM) agents in the graphical user interface (GUI) environment, aiming to address the research question of whether multimodal GUI agents can be distracted by environmental context. A general setting is proposed where both the user and the agent are benign, and the environment, while not malicious, contains unrelated content. A wide range of MLLMs are evaluated as GUI agents using our simulated dataset, following three working patterns with different levels of perception. Experimental results reveal that even the most powerful models, whether generalist agents or specialist GUI agents, are susceptible to distractions. While recent studies predominantly focus on the helpfulness (i.e., action accuracy) of multimodal agents, our findings indicate that these agents are prone to environmental distractions, resulting in unfaithful behaviors. Furthermore, we switch to the adversarial perspective and implement environment injection, demonstrating that such unfaithfulness can be exploited, leading to unexpected risks. △ Less

Submitted 5 August, 2024; originally announced August 2024.

arXiv:2407.11522 [pdf, other]

FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models

Authors: Pengxiang Li, Zhi Gao, Bofei Zhang, Tao Yuan, Yuwei Wu, Mehrtash Harandi, Yunde Jia, Song-Chun Zhu, Qing Li

Abstract: Vision language models (VLMs) have achieved impressive progress in diverse applications, becoming a prevalent research direction. In this paper, we build FIRE, a feedback-refinement dataset, consisting of 1.1M multi-turn conversations that are derived from 27 source datasets, empowering VLMs to spontaneously refine their responses based on user feedback across diverse tasks. To scale up the data c… ▽ More Vision language models (VLMs) have achieved impressive progress in diverse applications, becoming a prevalent research direction. In this paper, we build FIRE, a feedback-refinement dataset, consisting of 1.1M multi-turn conversations that are derived from 27 source datasets, empowering VLMs to spontaneously refine their responses based on user feedback across diverse tasks. To scale up the data collection, FIRE is collected in two components: FIRE-100K and FIRE-1M, where FIRE-100K is generated by GPT-4V, and FIRE-1M is freely generated via models trained on FIRE-100K. Then, we build FIRE-Bench, a benchmark to comprehensively evaluate the feedback-refining capability of VLMs, which contains 11K feedback-refinement conversations as the test data, two evaluation settings, and a model to provide feedback for VLMs. We develop the FIRE-LLaVA model by fine-tuning LLaVA on FIRE-100K and FIRE-1M, which shows remarkable feedback-refining capability on FIRE-Bench and outperforms untrained VLMs by 50%, making more efficient user-agent interactions and underscoring the significance of the FIRE dataset. △ Less

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.01351 [pdf, other]

Probing the connection between IceCube neutrinos and MOJAVE AGN

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

Abstract: Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well establi… ▽ More Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well established which can be resolved via correlation studies with photon observations. For neutrinos produced due to photohadronic interactions in AGN, in addition to a correlation of neutrinos with high-energy photons, there would also be a correlation of neutrinos with photons emitted at radio wavelengths. In this work, we perform an in-depth stacking study of the correlation between 15 GHz radio observations of AGN reported in the MOJAVE XV catalog, and ten years of neutrino data from IceCube. We also use a time-dependent approach which improves the statistical power of the stacking analysis. No significant correlation was found for both analyses and upper limits are reported. When compared to the IceCube diffuse flux, at 100 TeV and for a spectral index of 2.5, the upper limits derived are $\sim3\%$ and $\sim9\%$ for the time-averaged and time-dependent case, respectively. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: 14 Pages 7 Figures

arXiv:2407.01314 [pdf, other]

doi 10.1103/PhysRevD.110.072007

Search for a light sterile neutrino with 7.5 years of IceCube DeepCore data

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

Abstract: We present a search for an eV-scale sterile neutrino using 7.5 years of data from the IceCube DeepCore detector. The analysis uses a sample of 21,914 events with energies between 5 and 150 GeV to search for sterile neutrinos through atmospheric muon neutrino disappearance. Improvements in event selection and treatment of systematic uncertainties provide greater statistical power compared to previo… ▽ More We present a search for an eV-scale sterile neutrino using 7.5 years of data from the IceCube DeepCore detector. The analysis uses a sample of 21,914 events with energies between 5 and 150 GeV to search for sterile neutrinos through atmospheric muon neutrino disappearance. Improvements in event selection and treatment of systematic uncertainties provide greater statistical power compared to previous DeepCore sterile neutrino searches. Our results are compatible with the absence of mixing between active and sterile neutrino states, and we place constraints on the mixing matrix elements $|U_{μ4}|^2 < 0.0534$ and $|U_{τ4}|^2 < 0.0574$ at 90% CL under the assumption that $Δm^2_{41}\geq 1\;\mathrm{eV^2}$. These null results add to the growing tension between anomalous appearance results and constraints from disappearance searches in the 3+1 sterile neutrino landscape. △ Less

Submitted 9 September, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

Comments: 11 pages, 5 figures. Version accepted by Physical Review D for publication

arXiv:2406.10263 [pdf, other]

A Lightweight Framework for Adaptive Retrieval In Code Completion With Critique Model

Authors: Wenrui Zhang, Tiehang Fu, Ting Yuan, Ge Zhang, Dong Chen, Jie Wang

Abstract: Recent advancements in Retrieval-Augmented Generation have significantly enhanced code completion at the repository level. Various RAG-based code completion systems are proposed based on different design choices. For instance, gaining more effectiveness at the cost of repeating the retrieval-generation process multiple times. However, the indiscriminate use of retrieval in current methods reveals… ▽ More Recent advancements in Retrieval-Augmented Generation have significantly enhanced code completion at the repository level. Various RAG-based code completion systems are proposed based on different design choices. For instance, gaining more effectiveness at the cost of repeating the retrieval-generation process multiple times. However, the indiscriminate use of retrieval in current methods reveals issues in both efficiency and effectiveness, as a considerable portion of retrievals are unnecessary and may introduce unhelpful or even harmful suggestions to code language models. To address these challenges, we introduce CARD, a lightweight critique method designed to provide insights into the necessity of retrievals and select the optimal answer from multiple predictions. CARD can seamlessly integrate into any RAG-based code completion system. Our evaluation shows that CARD saves 21% to 46% times of retrieval for Line completion, 14% to 40% times of retrieval for API completion, and 6% to 46.5% times of retrieval for function completion respectively, while improving the accuracy. CARD reduces latency ranging from 16% to 83%. CARD is generalizable to different LMs, retrievers, and programming languages. It is lightweight with training in few seconds and inference in few milliseconds. △ Less

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2406.07601 [pdf, other]

IceCube Search for Neutrino Emission from X-ray Bright Seyfert Galaxies

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

Abstract: The recent IceCube detection of TeV neutrino emission from the nearby active galaxy NGC 1068 suggests that active galactic nuclei (AGN) could make a sizable contribution to the diffuse flux of astrophysical neutrinos. The absence of TeV $γ$-rays from NGC 1068 indicates neutrino production in the vicinity of the supermassive black hole, where the high radiation density leads to $γ$-ray attenuation.… ▽ More The recent IceCube detection of TeV neutrino emission from the nearby active galaxy NGC 1068 suggests that active galactic nuclei (AGN) could make a sizable contribution to the diffuse flux of astrophysical neutrinos. The absence of TeV $γ$-rays from NGC 1068 indicates neutrino production in the vicinity of the supermassive black hole, where the high radiation density leads to $γ$-ray attenuation. Therefore, any potential neutrino emission from similar sources is not expected to correlate with high-energy $γ$-rays. Disk-corona models predict neutrino emission from Seyfert galaxies to correlate with keV X-rays, as they are tracers of coronal activity. Using through-going track events from the Northern Sky recorded by IceCube between 2011 and 2021, we report results from a search for individual and aggregated neutrino signals from 27 additional Seyfert galaxies that are contained in the BAT AGN Spectroscopic Survey (BASS). Besides the generic single power-law, we evaluate the spectra predicted by the disk-corona model. Assuming all sources to be intrinsically similar to NGC 1068, our findings constrain the collective neutrino emission from X-ray bright Seyfert galaxies in the Northern Hemisphere, but, at the same time, show excesses of neutrinos that could be associated with the objects NGC 4151 and CGCG 420-015. These excesses result in a 2.7$σ$ significance with respect to background expectations. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 17 pages, 9 figures

arXiv:2406.06684 [pdf, other]

Search for neutrino emission from hard X-ray AGN with IceCube

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (401 additional authors not shown)

Abstract: Active Galactic Nuclei (AGN) are promising candidate sources of high-energy astrophysical neutrinos since they provide environments rich in matter and photon targets where cosmic ray interactions may lead to the production of gamma rays and neutrinos. We searched for high-energy neutrino emission from AGN using the $\textit{Swift}$-BAT Spectroscopic Survey (BASS) catalog of hard X-ray sources and… ▽ More Active Galactic Nuclei (AGN) are promising candidate sources of high-energy astrophysical neutrinos since they provide environments rich in matter and photon targets where cosmic ray interactions may lead to the production of gamma rays and neutrinos. We searched for high-energy neutrino emission from AGN using the $\textit{Swift}$-BAT Spectroscopic Survey (BASS) catalog of hard X-ray sources and 12 years of IceCube muon track data. First, upon performing a stacked search, no significant emission was found. Second, we searched for neutrinos from a list of 43 candidate sources and found an excess from the direction of two sources, Seyfert galaxies NGC 1068 and NGC 4151. We observed NGC 1068 at flux $φ_{ν_μ+\barν_μ}$ = $4.02_{-1.52}^{+1.58} \times 10^{-11}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ normalized at 1 TeV, with power-law spectral index, $γ$ = 3.10$^{+0.26}_{-0.22}$, consistent with previous IceCube results. The observation of a neutrino excess from the direction of NGC 4151 is at a post-trial significance of 2.9$σ$. If interpreted as an astrophysical signal, the excess observed from NGC 4151 corresponds to a flux $φ_{ν_μ+\barν_μ}$ = $1.51_{-0.81}^{+0.99} \times 10^{-11}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ normalized at 1 TeV and $γ$ = 2.83$^{+0.35}_{-0.28}$. △ Less

Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

arXiv:2406.05984 [pdf, ps, other]

Explainable AI for Mental Disorder Detection via Social Media: A survey and outlook

Authors: Yusif Ibrahimov, Tarique Anwar, Tommy Yuan

Abstract: Mental health constitutes a complex and pervasive global challenge, affecting millions of lives and often leading to severe consequences. In this paper, we conduct a thorough survey to explore the intersection of data science, artificial intelligence, and mental healthcare, focusing on the recent developments of mental disorder detection through online social media (OSM). A significant portion of… ▽ More Mental health constitutes a complex and pervasive global challenge, affecting millions of lives and often leading to severe consequences. In this paper, we conduct a thorough survey to explore the intersection of data science, artificial intelligence, and mental healthcare, focusing on the recent developments of mental disorder detection through online social media (OSM). A significant portion of the population actively engages in OSM platforms, creating a vast repository of personal data that holds immense potential for mental health analytics. The paper navigates through traditional diagnostic methods, state-of-the-art data- and AI-driven research studies, and the emergence of explainable AI (XAI) models for mental healthcare. We review state-of-the-art machine learning methods, particularly those based on modern deep learning, while emphasising the need for explainability in healthcare AI models. The experimental design section provides insights into prevalent practices, including available datasets and evaluation approaches. We also identify key issues and challenges in the field and propose promising future research directions. As mental health decisions demand transparency, interpretability, and ethical considerations, this paper contributes to the ongoing discourse on advancing XAI in mental healthcare through social media. The comprehensive overview presented here aims to guide researchers, practitioners, and policymakers in developing the area of mental disorder detection. △ Less

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2406.02137 [pdf, other]

Enhanced Nonlinear Frequency Conversion Bandwidth through Birefringence induced Mode Hybridization

Authors: Tingge Yuan, Jiangwei Wu, Xueyi Wang, Hao Li, Yuping Chen, Xianfeng Chen

Abstract: On-chip quantum information network requires qubit transfer between different wavelengths while preserving quantum coherence and entanglement, which needs broadband up-conversion available. Herein, we demonstrate a mode-hybridization based broadband nonlinear frequency conversion on X-cut thin film lithium niobate. With the spontaneous quasi-phase matching and quasi groupvelocity matching being si… ▽ More On-chip quantum information network requires qubit transfer between different wavelengths while preserving quantum coherence and entanglement, which needs broadband up-conversion available. Herein, we demonstrate a mode-hybridization based broadband nonlinear frequency conversion on X-cut thin film lithium niobate. With the spontaneous quasi-phase matching and quasi groupvelocity matching being simultaneously satisfied, broadband second harmonic generation with a 3-dB bandwidth up to 13 nm has been achieved in a micro-racetrack resonator. The same mechanism can work on the frequency conversion of the ultra-short pulse in the bent waveguide structure. This work will be beneficial to on-chip tunable frequency conversion and quantum light source generation on integrated photonic platforms, and further enable on-chip large-capacity multiplexing, multichannel optical information processing, and large quantum information networks. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.01882 [pdf, other]

HoneyGPT: Breaking the Trilemma in Terminal Honeypots with Large Language Model

Authors: Ziyang Wang, Jianzhou You, Haining Wang, Tianwei Yuan, Shichao Lv, Yang Wang, Limin Sun

Abstract: Honeypots, as a strategic cyber-deception mechanism designed to emulate authentic interactions and bait unauthorized entities, continue to struggle with balancing flexibility, interaction depth, and deceptive capability despite their evolution over decades. Often they also lack the capability of proactively adapting to an attacker's evolving tactics, which restricts the depth of engagement and sub… ▽ More Honeypots, as a strategic cyber-deception mechanism designed to emulate authentic interactions and bait unauthorized entities, continue to struggle with balancing flexibility, interaction depth, and deceptive capability despite their evolution over decades. Often they also lack the capability of proactively adapting to an attacker's evolving tactics, which restricts the depth of engagement and subsequent information gathering. Under this context, the emergent capabilities of large language models, in tandem with pioneering prompt-based engineering techniques, offer a transformative shift in the design and deployment of honeypot technologies. In this paper, we introduce HoneyGPT, a pioneering honeypot architecture based on ChatGPT, heralding a new era of intelligent honeypot solutions characterized by their cost-effectiveness, high adaptability, and enhanced interactivity, coupled with a predisposition for proactive attacker engagement. Furthermore, we present a structured prompt engineering framework that augments long-term interaction memory and robust security analytics. This framework, integrating thought of chain tactics attuned to honeypot contexts, enhances interactivity and deception, deepens security analytics, and ensures sustained engagement. The evaluation of HoneyGPT includes two parts: a baseline comparison based on a collected dataset and a field evaluation in real scenarios for four weeks. The baseline comparison demonstrates HoneyGPT's remarkable ability to strike a balance among flexibility, interaction depth, and deceptive capability. The field evaluation further validates HoneyGPT's efficacy, showing its marked superiority in enticing attackers into more profound interactive engagements and capturing a wider array of novel attack vectors in comparison to existing honeypot technologies. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.00905 [pdf, other]

Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

Abstract: We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overlineν_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth… ▽ More We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overlineν_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth mass state ($|U_{\mu4}|^2$), and the element connecting tau flavor to the fourth mass state ($|U_{\tau4}|^2$). Predicted propagation effects in matter enhance the signature through a resonance as atmospheric neutrinos from the Northern Hemisphere traverse the Earth to the IceCube detector at the South Pole. The result is consistent with the no-sterile neutrino hypothesis with a probability of 4.3 %. Profiling the likelihood of each parameter yields the 90 % confidence levels: $ 2.4\,\mathrm{eV}^{2} < Δm_{41}^2 <9.6\,\mathrm{eV}^{2} $ , $0.0081 < |U_{\mu4}|^2 < 0.10$ , and $|U_{\tau4}|^2< 0.035$, which narrows the allowed parameter-space for $|U_{\tau4}|^2$. However, the primary result of this analysis is the first map of the 3+1 parameter space exploring the interdependence of $Δm_{41}^2$, $|U_{\mu4}|^2$, and $|U_{\tau4}|^2$. △ Less

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.18044 [pdf, other]

Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation

Authors: Jiaqi Shao, Tianjun Yuan, Tao Lin, Xuanyu Cao, Bing Luo

Abstract: Cognitive abilities, such as Theory of Mind (ToM), play a vital role in facilitating cooperation in human social interactions. However, our study reveals that agents with higher ToM abilities may not necessarily exhibit better cooperative behavior compared to those with lower ToM abilities. To address this challenge, we propose a novel matching coalition mechanism that leverages the strengths of a… ▽ More Cognitive abilities, such as Theory of Mind (ToM), play a vital role in facilitating cooperation in human social interactions. However, our study reveals that agents with higher ToM abilities may not necessarily exhibit better cooperative behavior compared to those with lower ToM abilities. To address this challenge, we propose a novel matching coalition mechanism that leverages the strengths of agents with different ToM levels by explicitly considering belief alignment and specialized abilities when forming coalitions. Our proposed matching algorithm seeks to find stable coalitions that maximize the potential for cooperative behavior and ensure long-term viability. By incorporating cognitive insights into the design of multi-agent systems, our work demonstrates the potential of leveraging ToM to create more sophisticated and human-like coordination strategies that foster cooperation and improve overall system performance. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.08077 [pdf, other]

Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube

Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

Abstract: We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100\, TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 1… ▽ More We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100\, TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 12\% and consistency with the null hypothesis of no oscillations to sterile neutrinos with a p-value of 3.1\%. Several improvements were made over past analyses, which are reviewed in this article, including upgrades to the reconstruction and the study of sources of systematic uncertainty. We provide details of the fit quality and discuss stability tests that split the data for separate samples, comparing results. We find that the fits are consistent between split data sets. △ Less

Submitted 13 May, 2024; originally announced May 2024.

Comments: 18 pages, 17 figures, 2 tables. This long-form paper is a companion to the letter "A search for an eV-scale sterile neutrino using improved high-energy νμ event reconstruction in IceCube."

arXiv:2405.08070 [pdf, other]

A search for an eV-scale sterile neutrino using improved high-energy $ν_μ$ event reconstruction in IceCube

Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

Abstract: This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going… ▽ More This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going events, distinguishing neutrino interactions with vertices inside or outside the instrumented volume, to improve energy resolution. The best-fit point for a 3+1 model is found to be at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$, which agrees with previous iterations of this study. The result is consistent with the null hypothesis of no sterile neutrinos with a p-value of 3.1\%. △ Less

Submitted 13 May, 2024; originally announced May 2024.

Comments: 9 pages, 3 figures. This letter is supported by the long-form paper "Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube," also appearing on arXiv

arXiv:2405.04434 [pdf, other]

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference through significantly compressing the Key-Value (KV) cache into a latent vector, while DeepSeekMoE enables training strong models at an economical cost through sparse computation. Compared with DeepSeek 67B, DeepSeek-V2 achieves significantly stronger performance, and meanwhile saves 42.5% of training costs, reduces the KV cache by 93.3%, and boosts the maximum generation throughput to 5.76 times. We pretrain DeepSeek-V2 on a high-quality and multi-source corpus consisting of 8.1T tokens, and further perform Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) to fully unlock its potential. Evaluation results show that, even with only 21B activated parameters, DeepSeek-V2 and its chat versions still achieve top-tier performance among open-source models. △ Less

Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

arXiv:2405.03817 [pdf, other]

Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube

Authors: R. Alfaro, C. Alvarez, J. C. Arteaga-Velázquez, D. Avila Rojas, H. A. Ayala Solares, R. Babu, E. Belmont-Moreno, K. S. Caballero-Mora, T. Capistrán, A. Carramiñana, S. Casanova, U. Cotti, J. Cotzomi, S. Coutiño de León, E. De la Fuente, D. Depaoli, N. Di Lalla, R. Diaz Hernandez, J. C. Díaz-Vélez, K. Engel, T. Ergin, K. L. Fan, K. Fang, N. Fraija, S. Fraija , et al. (469 additional authors not shown)

Abstract: Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis… ▽ More Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis that combines gamma rays and neutrinos is required. In this study, we use the Multi-Mission Maximum Likelihood framework (3ML) with IceCube Maximum Likelihood Analysis software (i3mla) and HAWC Accelerated Likelihood (HAL) to search for a correlation between 22 known gamma-ray sources from the third HAWC gamma-ray catalog and 14 years of IceCube track-like data. No significant neutrino emission from the direction of the HAWC sources was found. We report the best-fit gamma-ray model and 90% CL neutrino flux limit from the 22 sources. From the neutrino flux limit, we conclude that the gamma-ray emission from five of the sources can not be produced purely from hadronic interactions. We report the limit for the fraction of gamma rays produced by hadronic interactions for these five sources. △ Less

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.03127 [pdf, other]

When Standard Model Higgs Meets Its Lighter 95 GeV Higgs

Authors: Abdesslam Arhrib, Khiem Hong Phan, Van Que Tran, Tzu-Chiang Yuan

Abstract: Two excesses reported recently at the LHC in the lighter Higgs mass region around 95 GeV and in the rare $Z γ$ final state of the Standard Model (SM) 125 GeV Higgs decay are simultaneously scrutinized within the framework of minimal gauged two-Higgs-doublet model (G2HDM). Viable parameter space in G2HDM is obtained to account for both excesses. We find a strong correlation between the signal stren… ▽ More Two excesses reported recently at the LHC in the lighter Higgs mass region around 95 GeV and in the rare $Z γ$ final state of the Standard Model (SM) 125 GeV Higgs decay are simultaneously scrutinized within the framework of minimal gauged two-Higgs-doublet model (G2HDM). Viable parameter space in G2HDM is obtained to account for both excesses. We find a strong correlation between the signal strengths of SM 125 GeV Higgs decays into $γγ$ and $Z γ$ modes, whereas this correlation does not extend to its lighter 95 GeV cousin. △ Less

Submitted 5 May, 2024; originally announced May 2024.

Comments: 32 pages, 5 figures

arXiv:2404.19589 [pdf, other]

Acceptance Tests of more than 10 000 Photomultiplier Tubes for the multi-PMT Digital Optical Modules of the IceCube Upgrade

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

Abstract: More than 10,000 photomultiplier tubes (PMTs) with a diameter of 80 mm will be installed in multi-PMT Digital Optical Modules (mDOMs) of the IceCube Upgrade. These have been tested and pre-calibrated at two sites. A throughput of more than 1000 PMTs per week with both sites was achieved with a modular design of the testing facilities and highly automated testing procedures. The testing facilities… ▽ More More than 10,000 photomultiplier tubes (PMTs) with a diameter of 80 mm will be installed in multi-PMT Digital Optical Modules (mDOMs) of the IceCube Upgrade. These have been tested and pre-calibrated at two sites. A throughput of more than 1000 PMTs per week with both sites was achieved with a modular design of the testing facilities and highly automated testing procedures. The testing facilities can easily be adapted to other PMTs, such that they can, e.g., be re-used for testing the PMTs for IceCube-Gen2. Single photoelectron response, high voltage dependence, time resolution, prepulse, late pulse, afterpulse probabilities, and dark rates were measured for each PMT. We describe the design of the testing facilities, the testing procedures, and the results of the acceptance tests. △ Less

Submitted 20 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

Comments: 24 pages, 19 figures, 2 tables, submitted to JINST

arXiv:2404.06397 [pdf, other]

New Contributions to $b \to s γ$ in Minimal G2HDM

Authors: Che Hao Liu, Van Que Tran, Qiaoyi Wen, Fanrong Xu, Tzu-Chiang Yuan

Abstract: We study the flavor-changing bottom quark radiative decay $b \to s γ$ induced at one-loop level within the minimal gauged two-Higgs-doublet model (G2HDM). Among the three new contributions to this rare process in G2HDM, we find that only the charged Higgs $\mathcal{H^\pm}$ contribution can be constrained by the current global fit data in $B$-physics. Other two contributions from the complex vector… ▽ More We study the flavor-changing bottom quark radiative decay $b \to s γ$ induced at one-loop level within the minimal gauged two-Higgs-doublet model (G2HDM). Among the three new contributions to this rare process in G2HDM, we find that only the charged Higgs $\mathcal{H^\pm}$ contribution can be constrained by the current global fit data in $B$-physics. Other two contributions from the complex vectorial dark matter $\mathcal{W}$ and dark Higgs $\mathcal{D}$ are not sensitive to the current data. Combining with theoretical constraints imposed on the scalar potential and electroweak precision data for the oblique parameters, we exclude mass regions $m_{\mathcal{H}^\pm} \lesssim 250$ GeV and $m_{\mathcal{D}} \lesssim 100$ GeV at the 95\% confidence level. △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: 35 pages, 8 figures

arXiv:2404.06382 [pdf, other]

Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks

Authors: Tianchen Yuan, Petros A. Ioannou

Abstract: Arterial traffic interacts with freeway traffic, yet the two are controlled independently. Arterial traffic signals do not take into account freeway traffic and how ramps control ingress traffic and have no control over egress traffic from the freeway. This often results in long queues in either direction that block ramps and spill over to arterial streets or freeway lanes. In this paper, we propo… ▽ More Arterial traffic interacts with freeway traffic, yet the two are controlled independently. Arterial traffic signals do not take into account freeway traffic and how ramps control ingress traffic and have no control over egress traffic from the freeway. This often results in long queues in either direction that block ramps and spill over to arterial streets or freeway lanes. In this paper, we propose an adaptive arterial traffic control strategy that combines traffic signal control (TSC) and dynamic speed offset (DSO) coordination using a Q-learning algorithm for a traffic network that involves a freeway segment and adjacent arterial streets. The TSC agent computes the signal cycle length and split based on observed intersection demands and adjacent freeway off-ramp queues. The DSO agent computes the relative offset and the recommended speeds of both ways between consecutive intersections based on their physical distance, intersection queues, and signal cycles. We evaluate the performance of the proposed arterial traffic control strategy using microscopic traffic simulations of an arterial corridor with seven intersections near the I-710 freeway. The proposed QL-based control significantly outperforms a fixed-time control and MAXBAND in terms of the travel time and the number of stops under low or moderate demands. In high-demand scenarios, the travel-time benefit provided by the QL-based control is reduced as it mitigates off-ramp and intersection queues, which is a necessary trade-off in our perspective. In addition, mutual benefit is obtained by implementing freeway and arterial traffic control simultaneously. △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: submitted to TR-C

arXiv:2404.01102 [pdf, other]

Diffusion based Zero-shot Medical Image-to-Image Translation for Cross Modality Segmentation

Authors: Zihao Wang, Yingyu Yang, Yuzhou Chen, Tingting Yuan, Maxime Sermesant, Herve Delingette, Ona Wu

Abstract: Cross-modality image segmentation aims to segment the target modalities using a method designed in the source modality. Deep generative models can translate the target modality images into the source modality, thus enabling cross-modality segmentation. However, a vast body of existing cross-modality image translation methods relies on supervised learning. In this work, we aim to address the challe… ▽ More Cross-modality image segmentation aims to segment the target modalities using a method designed in the source modality. Deep generative models can translate the target modality images into the source modality, thus enabling cross-modality segmentation. However, a vast body of existing cross-modality image translation methods relies on supervised learning. In this work, we aim to address the challenge of zero-shot learning-based image translation tasks (extreme scenarios in the target modality is unseen in the training phase). To leverage generative learning for zero-shot cross-modality image segmentation, we propose a novel unsupervised image translation method. The framework learns to translate the unseen source image to the target modality for image segmentation by leveraging the inherent statistical consistency between different modalities for diffusion guidance. Our framework captures identical cross-modality features in the statistical domain, offering diffusion guidance without relying on direct mappings between the source and target domains. This advantage allows our method to adapt to changing source domains without the need for retraining, making it highly practical when sufficient labeled source domain data is not available. The proposed framework is validated in zero-shot cross-modality image segmentation tasks through empirical comparisons with influential generative models, including adversarial-based and diffusion-based models. △ Less

Submitted 9 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: Neurips 2023 Diffusion Workshop

arXiv:2403.10521 [pdf, other]

P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors

Authors: Zhou Jiang, Zhenxin Zhu, Pengfei Li, Huan-ang Gao, Tianyuan Yuan, Yongliang Shi, Hang Zhao, Hao Zhao

Abstract: Autonomous vehicles are gradually entering city roads today, with the help of high-definition maps (HDMaps). However, the reliance on HDMaps prevents autonomous vehicles from stepping into regions without this expensive digital infrastructure. This fact drives many researchers to study online HDMap generation algorithms, but the performance of these algorithms at far regions is still unsatisfying.… ▽ More Autonomous vehicles are gradually entering city roads today, with the help of high-definition maps (HDMaps). However, the reliance on HDMaps prevents autonomous vehicles from stepping into regions without this expensive digital infrastructure. This fact drives many researchers to study online HDMap generation algorithms, but the performance of these algorithms at far regions is still unsatisfying. We present P-MapNet, in which the letter P highlights the fact that we focus on incorporating map priors to improve model performance. Specifically, we exploit priors in both SDMap and HDMap. On one hand, we extract weakly aligned SDMap from OpenStreetMap, and encode it as an additional conditioning branch. Despite the misalignment challenge, our attention-based architecture adaptively attends to relevant SDMap skeletons and significantly improves performance. On the other hand, we exploit a masked autoencoder to capture the prior distribution of HDMap, which can serve as a refinement module to mitigate occlusions and artifacts. We benchmark on the nuScenes and Argoverse2 datasets. Through comprehensive experiments, we show that: (1) our SDMap prior can improve online map generation performance, using both rasterized (by up to $+18.73$ $\rm mIoU$) and vectorized (by up to $+8.50$ $\rm mAP$) output representations. (2) our HDMap prior can improve map perceptual metrics by up to $6.34\%$. (3) P-MapNet can be switched into different inference modes that covers different regions of the accuracy-efficiency trade-off landscape. (4) P-MapNet is a far-seeing solution that brings larger improvements on longer ranges. Codes and models are publicly available at https://jike5.github.io/P-MapNet. △ Less

Submitted 29 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: Code: https://jike5.github.io/P-MapNet

arXiv:2403.09079 [pdf, other]

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors

Authors: Tianyuan Yuan, Yucheng Mao, Jiawei Yang, Yicheng Liu, Yue Wang, Hang Zhao

Abstract: Autonomous vehicles rely extensively on perception systems to navigate and interpret their surroundings. Despite significant advancements in these systems recently, challenges persist under conditions like occlusion, extreme lighting, or in unfamiliar urban areas. Unlike these systems, humans do not solely depend on immediate observations to perceive the environment. In navigating new cities, huma… ▽ More Autonomous vehicles rely extensively on perception systems to navigate and interpret their surroundings. Despite significant advancements in these systems recently, challenges persist under conditions like occlusion, extreme lighting, or in unfamiliar urban areas. Unlike these systems, humans do not solely depend on immediate observations to perceive the environment. In navigating new cities, humans gradually develop a preliminary mental map to supplement real-time perception during subsequent visits. Inspired by this human approach, we introduce a novel framework, PreSight, that leverages past traversals to construct static prior memories, enhancing online perception in later navigations. Our method involves optimizing a city-scale neural radiance field with data from previous journeys to generate neural priors. These priors, rich in semantic and geometric details, are derived without manual annotations and can seamlessly augment various state-of-the-art perception models, improving their efficacy with minimal additional computational cost. Experimental results on the nuScenes dataset demonstrate the framework's high compatibility with diverse online perception models. Specifically, it shows remarkable improvements in HD-map construction and occupancy prediction tasks, highlighting its potential as a new perception framework for autonomous driving systems. Our code will be released at https://github.com/yuantianyuan01/PreSight. △ Less

Submitted 14 July, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

arXiv:2403.02470 [pdf, other]

doi 10.1088/1748-0221/19/06/P06026

Improved modeling of in-ice particle showers for IceCube event reconstruction

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (394 additional authors not shown)

Abstract: The IceCube Neutrino Observatory relies on an array of photomultiplier tubes to detect Cherenkov light produced by charged particles in the South Pole ice. IceCube data analyses depend on an in-depth characterization of the glacial ice, and on novel approaches in event reconstruction that utilize fast approximations of photoelectron yields. Here, a more accurate model is derived for event reconstr… ▽ More The IceCube Neutrino Observatory relies on an array of photomultiplier tubes to detect Cherenkov light produced by charged particles in the South Pole ice. IceCube data analyses depend on an in-depth characterization of the glacial ice, and on novel approaches in event reconstruction that utilize fast approximations of photoelectron yields. Here, a more accurate model is derived for event reconstruction that better captures our current knowledge of ice optical properties. When evaluated on a Monte Carlo simulation set, the median angular resolution for in-ice particle showers improves by over a factor of three compared to a reconstruction based on a simplified model of the ice. The most substantial improvement is obtained when including effects of birefringence due to the polycrystalline structure of the ice. When evaluated on data classified as particle showers in the high-energy starting events sample, a significantly improved description of the events is observed. △ Less

Submitted 22 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Comments: 28 pages, 18 figures, 1 table, submitted to JINST, updated to account for comments received

Journal ref: 2024 JINST 19 P06026

arXiv:2402.18026 [pdf, other]

doi 10.1103/PhysRevD.110.022001

Characterization of the Astrophysical Diffuse Neutrino Flux using Starting Track Events in IceCube

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (394 additional authors not shown)

Abstract: A measurement of the diffuse astrophysical neutrino spectrum is presented using IceCube data collected from 2011-2022 (10.3 years). We developed novel detection techniques to search for events with a contained vertex and exiting track induced by muon neutrinos undergoing a charged-current interaction. Searching for these starting track events allows us to not only more effectively reject atmospher… ▽ More A measurement of the diffuse astrophysical neutrino spectrum is presented using IceCube data collected from 2011-2022 (10.3 years). We developed novel detection techniques to search for events with a contained vertex and exiting track induced by muon neutrinos undergoing a charged-current interaction. Searching for these starting track events allows us to not only more effectively reject atmospheric muons but also atmospheric neutrino backgrounds in the southern sky, opening a new window to the sub-100 TeV astrophysical neutrino sky. The event selection is constructed using a dynamic starting track veto and machine learning algorithms. We use this data to measure the astrophysical diffuse flux as a single power law flux (SPL) with a best-fit spectral index of $γ= 2.58 ^{+0.10}_{-0.09}$ and per-flavor normalization of $φ^{\mathrm{Astro}}_{\mathrm{per-flavor}} = 1.68 ^{+0.19}_{-0.22} \times 10^{-18} \times \mathrm{GeV}^{-1} \mathrm{cm}^{-2} \mathrm{s}^{-1} \mathrm{sr}^{-1}$ (at 100 TeV). The sensitive energy range for this dataset is 3 - 550 TeV under the SPL assumption. This data was also used to measure the flux under a broken power law, however we did not find any evidence of a low energy cutoff. △ Less

Submitted 27 February, 2024; originally announced February 2024.

Comments: 27 pages, 28 figures

Journal ref: Phys. Rev. D 110, 022001 (2024)

arXiv:2402.17652 [pdf, other]

Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows

Authors: Yuting Yang, Andrea Merlina, Weijia Song, Tiancheng Yuan, Ken Birman, Roman Vitenberg

Abstract: We consider ML query processing in distributed systems where GPU-enabled workers coordinate to execute complex queries: a computing style often seen in applications that interact with users in support of image processing and natural language processing. In such systems, coscheduling of GPU memory management and task placement represents a promising opportunity. We propose Compass, a novel framewor… ▽ More We consider ML query processing in distributed systems where GPU-enabled workers coordinate to execute complex queries: a computing style often seen in applications that interact with users in support of image processing and natural language processing. In such systems, coscheduling of GPU memory management and task placement represents a promising opportunity. We propose Compass, a novel framework that unifies these functions to reduce job latency while using resources efficiently, placing tasks where data dependencies will be satisfied, collocating tasks from the same job (when this will not overload the host or its GPU), and efficiently managing GPU memory. Comparison with other state of the art schedulers shows a significant reduction in completion times while requiring the same amount or even fewer resources. In one case, just half the servers were needed for processing the same workload. △ Less

Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.16489 [pdf, ps, other]

Multiple Boundary Peak Solution for Critical Elliptic System with Neumann Boundary

Authors: Yuxia Guo, Shengyu Wu, TingFeng Yuan

Abstract: We consider the following elliptic system with Neumann boundary: \begin{equation} \begin{cases} -Δu + μu=v^p, &\hbox{in } Ω, \\-Δv + μv=u^q, &\hbox{in } Ω, \\\frac{\partial u}{\partial n} = \frac{\partial v}{\partial n} = 0, &\hbox{on } \partialΩ, \\u>0,v>0, &\hbox{in } Ω, \end{cases} \end{equation} where $Ω\subset \mathbb{R}^N$ is a smooth bounded domain, $μ$ is a positive constant and $(p,q)$ li… ▽ More We consider the following elliptic system with Neumann boundary: \begin{equation} \begin{cases} -Δu + μu=v^p, &\hbox{in } Ω, \\-Δv + μv=u^q, &\hbox{in } Ω, \\\frac{\partial u}{\partial n} = \frac{\partial v}{\partial n} = 0, &\hbox{on } \partialΩ, \\u>0,v>0, &\hbox{in } Ω, \end{cases} \end{equation} where $Ω\subset \mathbb{R}^N$ is a smooth bounded domain, $μ$ is a positive constant and $(p,q)$ lies in the critical hyperbola: $$ \dfrac{1}{p+1} + \dfrac{1}{q+1} =\dfrac{N-2}{N}. $$ By using the Lyapunov-Schmidt reduction technique, we establish the existence of infinitely many solutions to above system. These solutions have multiple peaks that are located on the boundary $\partial Ω$. Our results show that the geometry of the boundary $\partialΩ,$ especially its mean curvature, plays a crucial role on the existence and the behaviour of the solutions to the problem. △ Less

Submitted 26 February, 2024; originally announced February 2024.

arXiv:2402.05136 [pdf, other]

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

Authors: Tao Yuan, Xuefei Ning, Dong Zhou, Zhijie Yang, Shiyao Li, Minghui Zhuang, Zheyue Tan, Zhuyu Yao, Dahua Lin, Boxun Li, Guohao Dai, Shengen Yan, Yu Wang

Abstract: State-of-the-art large language models (LLMs) are now claiming remarkable supported context lengths of 256k or even more. In contrast, the average context lengths of mainstream benchmarks are insufficient (5k-21k), and they suffer from potential knowledge leakage and inaccurate metrics, resulting in biased evaluation. This paper introduces LV-Eval, a challenging long-context benchmark with five le… ▽ More State-of-the-art large language models (LLMs) are now claiming remarkable supported context lengths of 256k or even more. In contrast, the average context lengths of mainstream benchmarks are insufficient (5k-21k), and they suffer from potential knowledge leakage and inaccurate metrics, resulting in biased evaluation. This paper introduces LV-Eval, a challenging long-context benchmark with five length levels (16k, 32k, 64k, 128k, and 256k) reaching up to 256k words. LV-Eval features two main tasks, single-hop QA and multi-hop QA, comprising 11 bilingual datasets. The design of LV-Eval has incorporated three key techniques, namely confusing facts insertion, keyword and phrase replacement, and keyword-recall-based metric design. The advantages of LV-Eval include controllable evaluation across different context lengths, challenging test instances with confusing facts, mitigated knowledge leakage, and more objective evaluations. We evaluate 15 LLMs on LV-Eval and conduct ablation studies on the benchmarking techniques. The results reveal that: (i) Moonshot-v1 and recent large-scale open-source models, such as Qwen-2.5-72B and Llama-3.1-70B, achieve the highest performance on LV-Eval, particularly at lengths below 64k. (ii) Models exhibit distinct score trends. For example, GLM-4-9B-128k, Yi-6B-200k, and Llama3-8B-1M exhibit a relatively gentle degradation of performance, but their absolute performances may not necessarily be higher than those of LLMs with shorter context lengths. (iii) LLMs' performances can significantly degrade in the presence of confusing information, especially in the pressure test of "needle in a haystack". (iv) Issues related to knowledge leakage and inaccurate metrics introduce bias in evaluation, and these concerns are alleviated in LV-Eval. All datasets and evaluation codes are released at: https://github.com/infinigence/LVEval. △ Less

Submitted 3 October, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.04247 [pdf, other]

Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science

Authors: Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein

Abstract: Intelligent agents powered by large language models (LLMs) have demonstrated substantial promise in autonomously conducting experiments and facilitating scientific discoveries across various disciplines. While their capabilities are promising, these agents, called scientific LLM agents, also introduce novel vulnerabilities that demand careful consideration for safety. However, there exists a notab… ▽ More Intelligent agents powered by large language models (LLMs) have demonstrated substantial promise in autonomously conducting experiments and facilitating scientific discoveries across various disciplines. While their capabilities are promising, these agents, called scientific LLM agents, also introduce novel vulnerabilities that demand careful consideration for safety. However, there exists a notable gap in the literature, as there has been no comprehensive exploration of these vulnerabilities. This perspective paper fills this gap by conducting a thorough examination of vulnerabilities in LLM-based agents within scientific domains, shedding light on potential risks associated with their misuse and emphasizing the need for safety measures. We begin by providing a comprehensive overview of the potential risks inherent to scientific LLM agents, taking into account user intent, the specific scientific domain, and their potential impact on the external environment. Then, we delve into the origins of these vulnerabilities and provide a scoping review of the limited existing works. Based on our analysis, we propose a triadic framework involving human regulation, agent alignment, and an understanding of environmental feedback (agent regulation) to mitigate these identified risks. Furthermore, we highlight the limitations and challenges associated with safeguarding scientific agents and advocate for the development of improved models, robust benchmarks, and comprehensive regulations to address these issues effectively. △ Less

Submitted 5 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.03610 [pdf, other]

RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

Authors: Tomoyuki Kagaya, Thong Jing Yuan, Yuxuan Lou, Jayashree Karlekar, Sugiri Pranata, Akira Kinose, Koki Oguri, Felix Wick, Yang You

Abstract: Owing to recent advancements, Large Language Models (LLMs) can now be deployed as agents for increasingly complex decision-making applications in areas including robotics, gaming, and API integration. However, reflecting past experiences in current decision-making processes, an innate human behavior, continues to pose significant challenges. Addressing this, we propose Retrieval-Augmented Planning… ▽ More Owing to recent advancements, Large Language Models (LLMs) can now be deployed as agents for increasingly complex decision-making applications in areas including robotics, gaming, and API integration. However, reflecting past experiences in current decision-making processes, an innate human behavior, continues to pose significant challenges. Addressing this, we propose Retrieval-Augmented Planning (RAP) framework, designed to dynamically leverage past experiences corresponding to the current situation and context, thereby enhancing agents' planning capabilities. RAP distinguishes itself by being versatile: it excels in both text-only and multimodal environments, making it suitable for a wide range of tasks. Empirical evaluations demonstrate RAP's effectiveness, where it achieves SOTA performance in textual scenarios and notably enhances multimodal LLM agents' performance for embodied tasks. These results highlight RAP's potential in advancing the functionality and applicability of LLM agents in complex, real-world applications. △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2401.11994 [pdf, other]

Citizen Science for IceCube: Name that Neutrino

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (391 additional authors not shown)

Abstract: Name that Neutrino is a citizen science project where volunteers aid in classification of events for the IceCube Neutrino Observatory, an immense particle detector at the geographic South Pole. From March 2023 to September 2023, volunteers did classifications of videos produced from simulated data of both neutrino signal and background interactions. Name that Neutrino obtained more than 128,000 cl… ▽ More Name that Neutrino is a citizen science project where volunteers aid in classification of events for the IceCube Neutrino Observatory, an immense particle detector at the geographic South Pole. From March 2023 to September 2023, volunteers did classifications of videos produced from simulated data of both neutrino signal and background interactions. Name that Neutrino obtained more than 128,000 classifications by over 1,800 registered volunteers that were compared to results obtained by a deep neural network machine-learning algorithm. Possible improvements for both Name that Neutrino and the deep neural network are discussed. △ Less

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.11728 [pdf, other]

The Design and Construction of the Chips Water Cherenkov Neutrino Detector

Authors: B. Alonso Rancurel, N. Angelides, G. Augustoni, S. Bash, B. Bergmann, N. Bertschinger, P. Bizouard, M. Campbell, S. Cao, T. J. Carroll, R. Castellan, E. Catano-Mur, J. P. Cesar, J. A. B. Coelho, P. Dills, T. Dodwell, J. Edmondson, D. van Eijk, Q. Fetterly, Z. Garbal, S. Germani, T. Gilpin, A. Giraudo, A. Habig, D. Hanuska , et al. (42 additional authors not shown)

Abstract: CHIPS (CHerenkov detectors In mine PitS) was a prototype large-scale water Cherenkov detector located in northern Minnesota. The main aim of the R&D project was to demonstrate that construction costs of neutrino oscillation detectors could be reduced by at least an order of magnitude compared to other equivalent experiments. This article presents design features of the CHIPS detector along with de… ▽ More CHIPS (CHerenkov detectors In mine PitS) was a prototype large-scale water Cherenkov detector located in northern Minnesota. The main aim of the R&D project was to demonstrate that construction costs of neutrino oscillation detectors could be reduced by at least an order of magnitude compared to other equivalent experiments. This article presents design features of the CHIPS detector along with details of the implementation and deployment of the prototype. While issues during and after the deployment of the detector prevented data taking, a number of key concepts and designs were successfully demonstrated. △ Less

Submitted 25 September, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.10019 [pdf, other]

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

Authors: Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang, Rui Wang, Gongshen Liu

Abstract: Large language models (LLMs) have exhibited great potential in autonomously completing tasks across real-world applications. Despite this, these LLM agents introduce unexpected safety risks when operating in interactive environments. Instead of centering on the harmlessness of LLM-generated content in most prior studies, this work addresses the imperative need for benchmarking the behavioral safet… ▽ More Large language models (LLMs) have exhibited great potential in autonomously completing tasks across real-world applications. Despite this, these LLM agents introduce unexpected safety risks when operating in interactive environments. Instead of centering on the harmlessness of LLM-generated content in most prior studies, this work addresses the imperative need for benchmarking the behavioral safety of LLM agents within diverse environments. We introduce R-Judge, a benchmark crafted to evaluate the proficiency of LLMs in judging and identifying safety risks given agent interaction records. R-Judge comprises 569 records of multi-turn agent interaction, encompassing 27 key risk scenarios among 5 application categories and 10 risk types. It is of high-quality curation with annotated safety labels and risk descriptions. Evaluation of 11 LLMs on R-Judge shows considerable room for enhancing the risk awareness of LLMs: The best-performing model, GPT-4o, achieves 74.42% while no other models significantly exceed the random. Moreover, we reveal that risk awareness in open agent scenarios is a multi-dimensional capability involving knowledge and reasoning, thus challenging for LLMs. With further experiments, we find that fine-tuning on safety judgment significantly improve model performance while straightforward prompting mechanisms fail. R-Judge is publicly available at https://github.com/Lordog/R-Judge. △ Less

Submitted 5 October, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

Comments: EMNLP Findings 2024

arXiv:2401.02063 [pdf, other]

Windows on the Universe: Establishing the Infrastructure for a Collaborative Multi-messenger Ecosystem

Authors: The 2023 Windows on the Universe Workshop White Paper Working Group, T. Ahumada, J. E. Andrews, S. Antier, E. Blaufuss, P. R. Brady, A. M. Brazier, E. Burns, S. B. Cenko, P. Chandra, D. Chatterjee, A. Corsi, M. W. Coughlin, D. A. Coulter, S. Fu, A. Goldstein, L. P. Guy, E. J. Hooper, S. B. Howell, T. B. Humensky, J. A. Kennea, S. M. Jarrett, R. M. Lau, T. R. Lewis, L. Lu , et al. (21 additional authors not shown)

Abstract: In this White Paper, we present recommendations for the scientific community and funding agencies to foster the infrastructure for a collaborative multi-messenger and time-domain astronomy (MMA/TDA) ecosystem. MMA/TDA is poised for breakthrough discoveries in the coming decade. In much the same way that expanding beyond the optical bandpass revealed entirely new and unexpected discoveries, cosmic… ▽ More In this White Paper, we present recommendations for the scientific community and funding agencies to foster the infrastructure for a collaborative multi-messenger and time-domain astronomy (MMA/TDA) ecosystem. MMA/TDA is poised for breakthrough discoveries in the coming decade. In much the same way that expanding beyond the optical bandpass revealed entirely new and unexpected discoveries, cosmic messengers beyond light (i.e., gravitational waves, neutrinos, and cosmic rays) open entirely new windows to answer some of the most fundamental questions in (astro)physics: heavy element synthesis, equation of state of dense matter, particle acceleration, etc. This field was prioritized as a frontier scientific pursuit in the 2020 Decadal Survey on Astronomy and Astrophysics via its "New Windows on the Dynamic Universe" theme. MMA/TDA science presents technical challenges distinct from those experienced in other disciplines. Successful observations require coordination across myriad boundaries -- different cosmic messengers, ground vs. space, international borders, etc. -- all for sources that may not be well localized, and whose brightness may be changing rapidly with time. Add that all of this work is undertaken by real human beings, with distinct backgrounds, experiences, cultures, and expectations, that often conflict. To address these challenges and help MMA/TDA realize its full scientific potential in the coming decade (and beyond), the second in a series of community workshops sponsored by the U.S. National Science Foundation (NSF) and NASA titled "Windows on the Universe: Establishing the Infrastructure for a Collaborative Multi-Messenger Ecosystem" was held on October 16-18, 2023 in Tucson, AZ. Here we present the primary recommendations from this workshop focused on three key topics -- hardware, software, and people and policy. [abridged] △ Less

Submitted 3 April, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: Workshop white paper

arXiv:2401.00865 [pdf, other]

Xorbits: Automating Operator Tiling for Distributed Data Science

Authors: Weizheng Lu, Kaisheng He, Xuye Qin, Chengjie Li, Zhong Wang, Tao Yuan, Xia Liao, Feng Zhang, Yueguo Chen, Xiaoyong Du

Abstract: Data science pipelines commonly utilize dataframe and array operations for tasks such as data preprocessing, analysis, and machine learning. The most popular tools for these tasks are pandas and NumPy. However, these tools are limited to executing on a single node, making them unsuitable for processing large-scale data. Several systems have attempted to distribute data science applications to clus… ▽ More Data science pipelines commonly utilize dataframe and array operations for tasks such as data preprocessing, analysis, and machine learning. The most popular tools for these tasks are pandas and NumPy. However, these tools are limited to executing on a single node, making them unsuitable for processing large-scale data. Several systems have attempted to distribute data science applications to clusters while maintaining interfaces similar to single-node libraries, enabling data scientists to scale their workloads without significant effort. However, existing systems often struggle with processing large datasets due to Out-of-Memory (OOM) problems caused by poor data partitioning. To overcome these challenges, we develop Xorbits, a high-performance, scalable data science framework specifically designed to distribute data science workloads across clusters while retaining familiar APIs. The key differentiator of Xorbits is its ability to dynamically switch between graph construction and graph execution. Xorbits has been successfully deployed in production environments with up to 5k CPU cores. Its applications span various domains, including user behavior analysis and recommendation systems in the e-commerce sector, as well as credit assessment and risk management in the finance industry. Users can easily scale their data science workloads by simply changing the import line of their pandas and NumPy code. Our experiments demonstrate that Xorbits can effectively process very large datasets without encountering OOM or data-skewing problems. Over the fastest state-of-the-art solutions, Xorbits achieves an impressive 2.66* speedup on average. In terms of API coverage, Xorbits attains a compatibility rate of 96.7%, surpassing the fastest framework by an impressive margin of 60 percentage points. Xorbits is available at https://github.com/xorbitsai/xorbits. △ Less

Submitted 19 March, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

Comments: ICDE 2024 Industrial and Application Track

arXiv:2312.15740 [pdf, other]

BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge

Authors: Lin Sun, Weijun Wang, Tingting Yuan, Liang Mi, Haipeng Dai, Yunxin Liu, Xiaoming Fu

Abstract: High-definition (HD) cameras for surveillance and road traffic have experienced tremendous growth, demanding intensive computation resources for real-time analytics. Recently, offloading frames from the front-end device to the back-end edge server has shown great promise. In multi-stream competitive environments, efficient bandwidth management and proper scheduling are crucial to ensure both high… ▽ More High-definition (HD) cameras for surveillance and road traffic have experienced tremendous growth, demanding intensive computation resources for real-time analytics. Recently, offloading frames from the front-end device to the back-end edge server has shown great promise. In multi-stream competitive environments, efficient bandwidth management and proper scheduling are crucial to ensure both high inference accuracy and high throughput. To achieve this goal, we propose BiSwift, a bi-level framework that scales the concurrent real-time video analytics by a novel adaptive hybrid codec integrated with multi-level pipelines, and a global bandwidth controller for multiple video streams. The lower-level front-back-end collaborative mechanism (called adaptive hybrid codec) locally optimizes the accuracy and accelerates end-to-end video analytics for a single stream. The upper-level scheduler aims to accuracy fairness among multiple streams via the global bandwidth controller. The evaluation of BiSwift shows that BiSwift is able to real-time object detection on 9 streams with an edge device only equipped with an NVIDIA RTX3070 (8G) GPU. BiSwift improves 10%$\sim$21% accuracy and presents 1.2$\sim$9$\times$ throughput compared with the state-of-the-art video analytics pipelines. △ Less

Submitted 4 February, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

Comments: Accepted by 2024 IEEE INFOCOM

arXiv:2312.11515 [pdf, other]

doi 10.3847/1538-4357/ad220b

doi 10.3847/1538-4357/ad683e

Search for 10--1000 GeV neutrinos from Gamma Ray Bursts with IceCube

Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (384 additional authors not shown)

Abstract: We present the results of a search for 10--1,000 GeV neutrinos from 2,268 gamma-ray bursts over 8 years of IceCube-DeepCore data. This work probes burst physics below the photosphere where electromagnetic radiation cannot escape. Neutrinos of tens of GeVs are predicted in sub-photospheric collision of free streaming neutrons with bulk-jet protons. In a first analysis, we searched for the most sign… ▽ More We present the results of a search for 10--1,000 GeV neutrinos from 2,268 gamma-ray bursts over 8 years of IceCube-DeepCore data. This work probes burst physics below the photosphere where electromagnetic radiation cannot escape. Neutrinos of tens of GeVs are predicted in sub-photospheric collision of free streaming neutrons with bulk-jet protons. In a first analysis, we searched for the most significant neutrino-GRB coincidence using six overlapping time windows centered on the prompt phase of each GRB. In a second analysis, we conducted a search for a group of GRBs, each individually too weak to be detectable, but potentially significant when combined. No evidence of neutrino emission is found for either analysis. The most significant neutrino coincidence is for Fermi-GBM GRB bn 140807500, with a p-value of 0.097 corrected for all trials. The binomial test used to search for a group of GRBs had a p-value of 0.65 after all trial corrections. The binomial test found a group consisting only of GRB bn 140807500 and no additional GRBs. The neutrino limits of this work complement those obtained by IceCube at TeV to PeV energies. We compare our findings for the large set of GRBs as well as GRB 221009A to the sub-photospheric neutron-proton collision model and find that GRB 221009A provides the most constraining limit on baryon loading. For a jet Lorentz factor of 300 (800), the baryon loading on GRB 221009A is lower than 3.85 (2.13) at a 90% confidence level. △ Less

Submitted 29 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

Journal ref: ApJ 964 126 (2024)

arXiv:2312.10785 [pdf, other]

Self-interacting Vectorial Dark Matter in a SM-like Dark Sector

Authors: Van Que Tran, Thong T. Q. Nguyen, Tzu-Chiang Yuan

Abstract: A $SU(2)_D \times U(1)_D$ gauge-Higgs sector, an exact dark copy of the Standard Model (SM) one, is proposed. It is demonstrated that the dark gauge bosons ${\cal W}^{(p,m)}$, in analogous to the SM $W^\pm$, can fulfill the role as a self-interacting vector dark matter candidate, solving the core versus cusp and missing satellites problems faced by the conventional paradigm of collisionless weakly… ▽ More A $SU(2)_D \times U(1)_D$ gauge-Higgs sector, an exact dark copy of the Standard Model (SM) one, is proposed. It is demonstrated that the dark gauge bosons ${\cal W}^{(p,m)}$, in analogous to the SM $W^\pm$, can fulfill the role as a self-interacting vector dark matter candidate, solving the core versus cusp and missing satellites problems faced by the conventional paradigm of collisionless weakly interacting massive particle. Constraints from collider, astroparticle and cosmology on such a self-interacting vector dark matter candidate are scrutinized. Implications for the future searches of ${\cal W}^{(p,m)}$ in direct detection experiments are discussed. △ Less

Submitted 17 December, 2023; originally announced December 2023.

Comments: 42 pages, 9 figures

arXiv:2312.05362 [pdf, other]

doi 10.3847/1538-4357/ad3730

All-Sky Search for Transient Astrophysical Neutrino Emission with 10 Years of IceCube Cascade Events

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (382 additional authors not shown)

Abstract: We present the results of a time-dependent search for neutrino flares in data collected by IceCube between May 2011 and 2021. This data set contains cascade-like events originating from charged-current electron neutrino and tau neutrino interactions and all-flavor neutral-current interactions. IceCube's previous all-sky searches for neutrino flares used data sets consisting of track-like events or… ▽ More We present the results of a time-dependent search for neutrino flares in data collected by IceCube between May 2011 and 2021. This data set contains cascade-like events originating from charged-current electron neutrino and tau neutrino interactions and all-flavor neutral-current interactions. IceCube's previous all-sky searches for neutrino flares used data sets consisting of track-like events originating from charged-current muon neutrino interactions. The cascade data sets are statistically independent of the track data sets and provide a new opportunity to observe the transient all-sky landscape. This search uses the spatial, temporal, and energy information of the cascade-like events to conduct searches for the most statistically significant neutrino flares in the northern and southern skies. No statistically significant time-dependent neutrino emission was observed. For the most statistically significant location in the northern sky, $p_\mathrm{global} =$ 0.71, and in the southern sky, $p_\mathrm{global} =$ 0.51. These results are compatible with the background hypothesis. Assuming an E$^{-2.53}$ spectrum from the diffuse astrophysical neutrino flux as measured with cascades, these results are used to calculate upper limits at the 90\% confidence level on neutrino flares of varying duration and constrain the contribution of these flares to the diffuse astrophysical neutrino flux. These constraints are independent of a specified class of astrophysical objects and show that multiple unresolved transient sources may contribute to the diffuse astrophysical neutrino flux. △ Less

Submitted 11 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Comments: Submitted to The Astrophysical Journal

arXiv:2312.01687 [pdf]

Optimizing Bus Travel: A Novel Approach to Feature Mining with P-KMEANS and P-LDA Algorithms

Authors: Hongjie Liu, Haotian Shi, Sicheng Fu, Tengfei Yuan, Xinhuan Zhang, Hongzhe Xu, Bin Ran

Abstract: Customizing services for bus travel can bolster its attractiveness, optimize usage, alleviate traffic congestion, and diminish carbon emissions. This potential is realized by harnessing recent advancements in positioning communication facilities, the Internet of Things, and artificial intelligence for feature mining in public transportation. However, the inherent complexities of disorganized and u… ▽ More Customizing services for bus travel can bolster its attractiveness, optimize usage, alleviate traffic congestion, and diminish carbon emissions. This potential is realized by harnessing recent advancements in positioning communication facilities, the Internet of Things, and artificial intelligence for feature mining in public transportation. However, the inherent complexities of disorganized and unstructured public transportation data introduce substantial challenges to travel feature extraction. This study presents a bus travel feature extraction method rooted in Point of Interest (POI) data, employing enhanced P-KMENAS and P-LDA algorithms to overcome these limitations. While the KMEANS algorithm adeptly segments passenger travel paths into distinct clusters, its outcomes can be influenced by the initial K value. On the other hand, Latent Dirichlet Allocation (LDA) excels at feature identification and probabilistic interpretations yet encounters difficulties with feature intermingling and nuanced sub-feature interactions. Incorporating the POI dimension enhances our understanding of travel behavior, aligning it more closely with passenger attributes and facilitating easier data analysis. By incorporating POI data, our refined P-KMENAS and P-LDA algorithms grant a holistic insight into travel behaviors and attributes, effectively mitigating the limitations above. Consequently, this POI-centric algorithm effectively amalgamates diverse POI attributes, delineates varied travel contexts, and imparts probabilistic metrics to feature properties. Our method successfully mines the diverse aspects of bus travel, such as age, occupation, gender, sports, cost, safety, and personality traits. It effectively calculates relationships between individual travel behaviors and assigns explanatory and evaluative probabilities to POI labels, thereby enhancing bus travel optimization. △ Less

Submitted 4 December, 2023; originally announced December 2023.

arXiv:2311.17031 [pdf, other]

Heegaard Floer Symplectic homology and Viterbo's isomorphism theorem in the context of multiple particles

Authors: Roman Krutowski, Tianyu Yuan

Abstract: Given a Liouville manifold $M$, we introduce an invariant of $M$ that we call the Heegaard Floer symplectic cohomology $SH^*_κ(M)$ for any $κ\ge 1$ that coincides with the symplectic cohomology for $κ=1$. Writing $\hat{M}$ for the completion of $M$, the differential counts pseudoholomorphic curves of arbitrary genus in $\mathbb{R} \times S^1 \times \hat{M}$ that are required to be branched $κ$-she… ▽ More Given a Liouville manifold $M$, we introduce an invariant of $M$ that we call the Heegaard Floer symplectic cohomology $SH^*_κ(M)$ for any $κ\ge 1$ that coincides with the symplectic cohomology for $κ=1$. Writing $\hat{M}$ for the completion of $M$, the differential counts pseudoholomorphic curves of arbitrary genus in $\mathbb{R} \times S^1 \times \hat{M}$ that are required to be branched $κ$-sheeted covers when projected to the $\mathbb{R} \times S^1$-direction; this resembles the cylindrical reformulation of Heegaard Floer homology by Lipshitz. These cohomology groups provide a closed-string analogue of higher-dimensional Heegaard Floer homology introduced by Colin, Honda, and Tian. When $\hat{M}=T^*Q$ with $Q$ an orientable manifold, we introduce a Morse-theoretic analogue of Heegaard Floer symplectic cohomology, which we call the free multiloop complex of $Q$. When $Q$ has vanishing relative second Stiefel-Whitney class, we prove a generalized version of Viterbo's isomorphism theorem by showing that the cohomology groups $SH^*_κ(T^*Q)$ are isomorphic to the cohomology groups of the free multiloop complex of $Q$. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: 78 pages, 14 figures

MSC Class: 53D40 (Primary) 57R17 (Secondary)

arXiv:2311.14962 [pdf, other]

doi 10.1021/acsphotonics.4c00391

Self-Amplification-Assisted Highly Efficient Integrated Laser

Authors: Jiangwei Wu, Xiongshuo Yan, Xueyi Wang, Tingge Yuan, Chengyu Chen, Hao Li, Yuping Chen, Xianfeng Chen

Abstract: Light source is indispensable component in on-chip system. Compared with hybrid or heterogeneous integrated laser, monolithically integrated laser is more suitable for high density photonic integrated circuit (PIC) since the capability of large-scale manufacturing, lower active-passive coupling loss and less test complexity. Recent years have seen the spark of researches on rare-earth ion doped th… ▽ More Light source is indispensable component in on-chip system. Compared with hybrid or heterogeneous integrated laser, monolithically integrated laser is more suitable for high density photonic integrated circuit (PIC) since the capability of large-scale manufacturing, lower active-passive coupling loss and less test complexity. Recent years have seen the spark of researches on rare-earth ion doped thin film lithium niobate (REI:TFLN), demonstrations have been made both in classical and quantum chips. However, low output power and limited quantum emitting efficiency hinder the application of the chip-scale laser source based on REI:TFLN. Here a highly efficient integrated laser assisted by cascaded amplifiers is proposed and experimentally prepared on Erbium-doped TFLN. A slope efficiency of 0.43% and a linewidth of 47.86 kHz are obtained. The maximum integrated laser power is 7.989 μW. Our results show a viable solution to improve efficiency by self-amplification without changing the intrinsic quantum emitting efficiency of the material, and our design has potential application in incorporating with functional devices such as optical communications, integrated quantum memory and quantum emission. △ Less

Submitted 1 December, 2023; v1 submitted 25 November, 2023; originally announced November 2023.

arXiv:2310.16748 [pdf, other]

Integrated Freeway Traffic Control Using Q-Learning with Adjacent Arterial Traffic Considerations

Authors: Tianchen Yuan, Petros A. Ioannou

Abstract: Numerous studies have shown the effectiveness of intelligent transportation system techniques such as variable speed limit (VSL), lane change (LC) control, and ramp metering (RM) in freeway traffic flow control. The integration of these techniques has the potential to further enhance the traffic operation efficiency of both freeway and adjacent arterial networks. In this regard, we propose a freew… ▽ More Numerous studies have shown the effectiveness of intelligent transportation system techniques such as variable speed limit (VSL), lane change (LC) control, and ramp metering (RM) in freeway traffic flow control. The integration of these techniques has the potential to further enhance the traffic operation efficiency of both freeway and adjacent arterial networks. In this regard, we propose a freeway traffic control (FTC) strategy that coordinates VSL, LC, RM actions using a Q-learning (QL) framework which takes into account arterial traffic characteristics. The signal timing and demands of adjacent arterial intersections are incorporated as state variables of the FTC agent. The FTC agent is initially trained offline using a single-section road network, and subsequently deployed online in a connected freeway and arterial simulation network for continuous learning. The arterial network is assumed to be regulated by a traffic-responsive signal control strategy based on a cycle length model. Microscopic simulations demonstrate that the fully-trained FTC agent provides significant reductions in freeway travel time and the number of stops in scenarios with traffic congestion. It clearly outperforms an uncoordinated FTC and a decentralized feedback control strategy. Even though the FTC agent does not control the arterial traffic signals, it leads to shorter average queue lengths at arterial intersections by taking into account the arterial traffic conditions in controlling freeway traffic. These results motivate a future research where the QL framework will also include the control of arterial traffic signals. △ Less

Submitted 19 May, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

Comments: 12 pages, 9 figures, 5 tables

arXiv:2310.06713 [pdf, other]

Interpretable Traffic Event Analysis with Bayesian Networks

Authors: Tong Yuan, Jian Yang, Zeyi Wen

Abstract: Although existing machine learning-based methods for traffic accident analysis can provide good quality results to downstream tasks, they lack interpretability which is crucial for this critical problem. This paper proposes an interpretable framework based on Bayesian Networks for traffic accident prediction. To enable the ease of interpretability, we design a dataset construction pipeline to feed… ▽ More Although existing machine learning-based methods for traffic accident analysis can provide good quality results to downstream tasks, they lack interpretability which is crucial for this critical problem. This paper proposes an interpretable framework based on Bayesian Networks for traffic accident prediction. To enable the ease of interpretability, we design a dataset construction pipeline to feed the traffic data into the framework while retaining the essential traffic data information. With a concrete case study, our framework can derive a Bayesian Network from a dataset based on the causal relationships between weather and traffic events across the United States. Consequently, our framework enables the prediction of traffic accidents with competitive accuracy while examining how the probability of these events changes under different conditions, thus illustrating transparent relationships between traffic and weather events. Additionally, the visualization of the network simplifies the analysis of relationships between different variables, revealing the primary causes of traffic accidents and ultimately providing a valuable reference for reducing traffic accidents. △ Less

Submitted 10 October, 2023; originally announced October 2023.

Comments: 11 pages, 7 figures

MSC Class: 62F15 ACM Class: G.3

Showing 1–50 of 443 results for author: Yuan, T