subscribe to arXiv mailings

Performance assessment of the HERD calorimeter with a photo-diode read-out system for high-energy electron beams

Authors: O. Adriani, G. Ambrosi, M. Antonelli, Y. Bai, X. Bai, T. Bao, M. Barbanera, E. Berti, P. Betti, G. Bigongiari, M. Bongi, V. Bonvicini, S. Bottai, I. Cagnoli, W. Cao, J. Casaus, D. Cerasole, Z. Chen, X. Cui, R. D'Alessandro, L. Di Venere, C. Diaz, Y. Dong, S. Detti, M. Duranti , et al. (41 additional authors not shown)

Abstract: The measurement of cosmic rays at energies exceeding 100 TeV per nucleon is crucial for enhancing the understanding of high-energy particle propagation and acceleration models in the Galaxy. HERD is a space-borne calorimetric experiment that aims to extend the current direct measurements of cosmic rays to unexplored energies. The payload is scheduled to be installed on the Chinese Space Station in… ▽ More The measurement of cosmic rays at energies exceeding 100 TeV per nucleon is crucial for enhancing the understanding of high-energy particle propagation and acceleration models in the Galaxy. HERD is a space-borne calorimetric experiment that aims to extend the current direct measurements of cosmic rays to unexplored energies. The payload is scheduled to be installed on the Chinese Space Station in 2027. The primary peculiarity of the instrument is its capability to measure particles coming from all directions, with the main detector being a deep, homogeneous, 3D calorimeter. The active elements are read out using two independent systems: one based on wavelength shifter fibers coupled to CMOS cameras, and the other based on photo-diodes read-out with custom front-end electronics. A large calorimeter prototype was tested in 2023 during an extensive beam test campaign at CERN. In this paper, the performance of the calorimeter for high-energy electron beams, as obtained from the photo-diode system data, is presented. The prototype demonstrated excellent performance, e.g., an energy resolution better than 1% for electrons at 250 GeV. A comparison between beam test data and Monte Carlo simulation data is also presented. △ Less

Submitted 4 October, 2024; originally announced October 2024.

arXiv:2409.13699 [pdf, other]

Vietnamese Legal Information Retrieval in Question-Answering System

Authors: Thiem Nguyen Ba, Vinh Doan The, Tung Pham Quang, Toan Tran Van

Abstract: In the modern era of rapidly increasing data volumes, accurately retrieving and recommending relevant documents has become crucial in enhancing the reliability of Question Answering (QA) systems. Recently, Retrieval Augmented Generation (RAG) has gained significant recognition for enhancing the capabilities of large language models (LLMs) by mitigating hallucination issues in QA systems, which is… ▽ More In the modern era of rapidly increasing data volumes, accurately retrieving and recommending relevant documents has become crucial in enhancing the reliability of Question Answering (QA) systems. Recently, Retrieval Augmented Generation (RAG) has gained significant recognition for enhancing the capabilities of large language models (LLMs) by mitigating hallucination issues in QA systems, which is particularly beneficial in the legal domain. Various methods, such as semantic search using dense vector embeddings or a combination of multiple techniques to improve results before feeding them to LLMs, have been proposed. However, these methods often fall short when applied to the Vietnamese language due to several challenges, namely inefficient Vietnamese data processing leading to excessive token length or overly simplistic ensemble techniques that lead to instability and limited improvement. Moreover, a critical issue often overlooked is the ordering of final relevant documents which are used as reference to ensure the accuracy of the answers provided by LLMs. In this report, we introduce our three main modifications taken to address these challenges. First, we explore various practical approaches to data processing to overcome the limitations of the embedding model. Additionally, we enhance Reciprocal Rank Fusion by normalizing order to combine results from keyword and vector searches effectively. We also meticulously re-rank the source pieces of information used by LLMs with Active Retrieval to improve user experience when refining the information generated. In our opinion, this technique can also be considered as a new re-ranking method that might be used in place of the traditional cross encoder. Finally, we integrate these techniques into a comprehensive QA system, significantly improving its performance and reliability △ Less

Submitted 4 September, 2024; originally announced September 2024.

Comments: 7 pages

arXiv:2409.13006 [pdf]

AutoPET III Challenge: PET/CT Semantic Segmentation

Authors: Reza Safdari, Mohammad Koohi-Moghaddam, Kyongtae Tyler Bae

Abstract: In this study, we implemented a two-stage deep learning-based approach to segment lesions in PET/CT images for the AutoPET III challenge. The first stage utilized a DynUNet model for coarse segmentation, identifying broad regions of interest. The second stage refined this segmentation using an ensemble of SwinUNETR, SegResNet, and UNet models. Preprocessing involved resampling images to a common r… ▽ More In this study, we implemented a two-stage deep learning-based approach to segment lesions in PET/CT images for the AutoPET III challenge. The first stage utilized a DynUNet model for coarse segmentation, identifying broad regions of interest. The second stage refined this segmentation using an ensemble of SwinUNETR, SegResNet, and UNet models. Preprocessing involved resampling images to a common resolution and normalization, while data augmentation techniques such as affine transformations and intensity adjustments were applied to enhance model generalization. The dataset was split into 80% training and 20% validation, excluding healthy cases. This method leverages multi-stage segmentation and model ensembling to achieve precise lesion segmentation, aiming to improve robustness and overall performance. △ Less

Submitted 19 September, 2024; originally announced September 2024.

arXiv:2408.16053 [pdf, other]

Cataclysmic Variables and AM CVn Binaries in SRG/eROSITA + Gaia: Volume Limited Samples, X-ray Luminosity Functions, and Space Densities

Authors: Antonio C. Rodriguez, Kareem El-Badry, Valery Suleimanov, Anna F. Pala, Shrinivas R. Kulkarni, Boris Gaensicke, Kaya Mori, R. Michael Rich, Arnab Sarkar, Tong Bao, Raimundo Lopes de Oliveira, Gavin Ramsay, Paula Szkody, Matthew Graham, Thomas A. Prince, Ilaria Caiazzo, Zachary P. Vanderbosch, Jan van Roestel, Kaustav K. Das, Yu-Jing Qin, Mansi M. Kasliwal, Avery Wold, Steven L. Groom, Daniel Reiley, Reed Riddle

Abstract: We present volume-limited samples of cataclysmic variables (CVs) and AM CVn binaries jointly selected from SRG/eROSITA eRASS1 and \textit{Gaia} DR3 using an X-ray + optical color-color diagram (the ``X-ray Main Sequence"). This tool identifies all CV subtypes, including magnetic and low-accretion rate systems, in contrast to most previous surveys. We find 23 CVs, 3 of which are AM CVns, out to 150… ▽ More We present volume-limited samples of cataclysmic variables (CVs) and AM CVn binaries jointly selected from SRG/eROSITA eRASS1 and \textit{Gaia} DR3 using an X-ray + optical color-color diagram (the ``X-ray Main Sequence"). This tool identifies all CV subtypes, including magnetic and low-accretion rate systems, in contrast to most previous surveys. We find 23 CVs, 3 of which are AM CVns, out to 150 pc in the Western Galactic Hemisphere. Our 150 pc sample is spectroscopically verified and complete down to $L_X = 1.3\times 10^{29} \;\textrm{erg s}^{-1}$ in the 0.2--2.3 keV band, and we also present CV candidates out to 300 pc and 1000 pc. We discovered two previously unknown systems in our 150 pc sample: the third nearest AM CVn and a magnetic period bouncer. We find the mean $L_X$ of CVs to be $\langle L_X \rangle \approx 4.6\times 10^{30} \;\textrm{erg s}^{-1}$, in contrast to previous surveys which yielded $\langle L_X \rangle \sim 10^{31}-10^{32} \;\textrm{erg s}^{-1}$. We construct X-ray luminosity functions that, for the first time, flatten out at $L_X\sim 10^{30} \; \textrm{erg s}^{-1}$. We find average number, mass, and luminosity densities of $ρ_\textrm{N, CV} = (3.7 \pm 0.7) \times 10^{-6} \textrm{pc}^{-3}$, $ρ_M = (5.0 \pm 1.0) \times 10^{-5} M_\odot^{-1}$, and $ρ_{L_X} = (2.3 \pm 0.4) \times 10^{26} \textrm{erg s}^{-1}M_\odot^{-1}$, respectively, in the solar neighborhood. Our uniform selection method also allows us to place meaningful estimates on the space density of AM CVns, $ρ_\textrm{N, AM CVn} = (5.5 \pm 3.7) \times 10^{-7} \textrm{pc}^{-3}$. Magnetic CVs and period bouncers make up $35\%$ and $25\%$ of our sample, respectively. This work, through a novel discovery technique, shows that the observed number densities of CVs and AM CVns, as well as the fraction of period bouncers, are still in tension with population synthesis estimates. △ Less

Submitted 28 August, 2024; originally announced August 2024.

Comments: Submitted to PASP, comments welcome

arXiv:2408.02153 [pdf, other]

ARVO: Atlas of Reproducible Vulnerabilities for Open Source Software

Authors: Xiang Mei, Pulkit Singh Singaria, Jordi Del Castillo, Haoran Xi, Abdelouahab, Benchikh, Tiffany Bao, Ruoyu Wang, Yan Shoshitaishvili, Adam Doupé, Hammond Pearce, Brendan Dolan-Gavitt

Abstract: High-quality datasets of real-world vulnerabilities are enormously valuable for downstream research in software security, but existing datasets are typically small, require extensive manual effort to update, and are missing crucial features that such research needs. In this paper, we introduce ARVO: an Atlas of Reproducible Vulnerabilities in Open-source software. By sourcing vulnerabilities from… ▽ More High-quality datasets of real-world vulnerabilities are enormously valuable for downstream research in software security, but existing datasets are typically small, require extensive manual effort to update, and are missing crucial features that such research needs. In this paper, we introduce ARVO: an Atlas of Reproducible Vulnerabilities in Open-source software. By sourcing vulnerabilities from C/C++ projects that Google's OSS-Fuzz discovered and implementing a reliable re-compilation system, we successfully reproduce more than 5,000 memory vulnerabilities across over 250 projects, each with a triggering input, the canonical developer-written patch for fixing the vulnerability, and the ability to automatically rebuild the project from source and run it at its vulnerable and patched revisions. Moreover, our dataset can be automatically updated as OSS-Fuzz finds new vulnerabilities, allowing it to grow over time. We provide a thorough characterization of the ARVO dataset, show that it can locate fixes more accurately than Google's own OSV reproduction effort, and demonstrate its value for future research through two case studies: firstly evaluating real-world LLM-based vulnerability repair, and secondly identifying over 300 falsely patched (still-active) zero-day vulnerabilities from projects improperly labeled by OSS-Fuzz. △ Less

Submitted 4 August, 2024; originally announced August 2024.

Comments: 14 pages, 9 figures

arXiv:2407.19657 [pdf, other]

Sustainable Task Offloading in Secure UAV-assisted Smart Farm Networks: A Multi-Agent DRL with Action Mask Approach

Authors: Tingnan Bao, Aisha Syed, William Sean Kennedy, Melike Erol-Kantarci

Abstract: The integration of unmanned aerial vehicles (UAVs) with mobile edge computing (MEC) and Internet of Things (IoT) technology in smart farms is pivotal for efficient resource management and enhanced agricultural productivity sustainably. This paper addresses the critical need for optimizing task offloading in secure UAV-assisted smart farm networks, aiming to reduce total delay and energy consumptio… ▽ More The integration of unmanned aerial vehicles (UAVs) with mobile edge computing (MEC) and Internet of Things (IoT) technology in smart farms is pivotal for efficient resource management and enhanced agricultural productivity sustainably. This paper addresses the critical need for optimizing task offloading in secure UAV-assisted smart farm networks, aiming to reduce total delay and energy consumption while maintaining robust security in data communications. We propose a multi-agent deep reinforcement learning (DRL)-based approach using a deep double Q-network (DDQN) with an action mask (AM), designed to manage task offloading dynamically and efficiently. The simulation results demonstrate the superior performance of our method in managing task offloading, highlighting significant improvements in operational efficiency by reducing delay and energy consumption. This aligns with the goal of developing sustainable and energy-efficient solutions for next-generation network infrastructures, making our approach an advanced solution for achieving both performance and sustainability in smart farming applications. △ Less

Submitted 28 July, 2024; originally announced July 2024.

arXiv:2407.10614 [pdf, other]

Investigating shocking events in the Ethereum stablecoin ecosystem through temporal multilayer graph structure

Authors: Cheick Tidiane Ba, Richard G. Clegg, Ben A. Steer, Matteo Zignani

Abstract: In the dynamic landscape of the Web, we are witnessing the emergence of the Web3 paradigm, which dictates that platforms should rely on blockchain technology and cryptocurrencies to sustain themselves and their profitability. Cryptocurrencies are characterised by high market volatility and susceptibility to substantial crashes, issues that require temporal analysis methodologies able to tackle the… ▽ More In the dynamic landscape of the Web, we are witnessing the emergence of the Web3 paradigm, which dictates that platforms should rely on blockchain technology and cryptocurrencies to sustain themselves and their profitability. Cryptocurrencies are characterised by high market volatility and susceptibility to substantial crashes, issues that require temporal analysis methodologies able to tackle the high temporal resolution, heterogeneity and scale of blockchain data. While existing research attempts to analyse crash events, fundamental questions persist regarding the optimal time scale for analysis, differentiation between long-term and short-term trends, and the identification and characterisation of shock events within these decentralised systems. This paper addresses these issues by examining cryptocurrencies traded on the Ethereum blockchain, with a spotlight on the crash of the stablecoin TerraUSD and the currency LUNA designed to stabilise it. Utilising complex network analysis and a multi-layer temporal graph allows the study of the correlations between the layers representing the currencies and system evolution across diverse time scales. The investigation sheds light on the strong interconnections among stablecoins pre-crash and the significant post-crash transformations. We identify anomalous signals before, during, and after the collapse, emphasising their impact on graph structure metrics and user movement across layers. This paper pioneers temporal, cross-chain graph analysis to explore a cryptocurrency collapse. It emphasises the importance of temporal analysis for studies on web-derived data and how graph-based analysis can enhance traditional econometric results. Overall, this research carries implications beyond its field, for example for regulatory agencies aiming to safeguard users from shocks and monitor investment risks for citizens and clients. △ Less

Submitted 15 July, 2024; originally announced July 2024.

arXiv:2407.09938

Theoretical Study of the Photo-stimulated Radio-electric Effect in Asymmetric Semi-parabolic Quantum Wells in the Presence of a Laser Radiation Field

Authors: Cao Thi Vi Ba, Nguyen Quang Bau, Nguyen Thu Huong, Bui Thi Dung, Anh-Tuan Tran

Abstract: In this study, based on the quantum kinetic equation approach, we systematically present the radio-electric effect in asymmetric semi-parabolic quantum wells under the influence of a laser radiation field taking into account the electron-longitudinal optical phonon scattering mechanism. The numerical results show that the blue-shift of the maximum peaks in the photon energy range is less than 60 m… ▽ More In this study, based on the quantum kinetic equation approach, we systematically present the radio-electric effect in asymmetric semi-parabolic quantum wells under the influence of a laser radiation field taking into account the electron-longitudinal optical phonon scattering mechanism. The numerical results show that the blue-shift of the maximum peaks in the photon energy range is less than 60 meV. The height of maximum peaks increases according to an exponential rule, depending nonlinearly on the structural parameters of the asymmetric semi-parabolic quantum wells. In the photon energy range greater than 100 meV, the saturated radio-electric field increases with temperature and geometric parameters of the quantum well. Temperature also strongly affects full-width at half-maximum with rules consistent with previous theoretical calculations and experimental observations. The results show the differences between symmetric and asymmetric semi-parabolic quantum wells, highlighting the influence of asymmetric structures on radio-electric effects in two-dimensional quantum well systems. △ Less

Submitted 17 August, 2024; v1 submitted 13 July, 2024; originally announced July 2024.

Comments: The manuscript was rejected by the journal

Journal ref: Physica B: Condensed Matter 2024

arXiv:2407.03025 [pdf, other]

doi 10.1051/0004-6361/202449228

XMM-Newton and NuSTAR discovery of a likely IP candidate XMMU J173029.8-330920 in the Galactic Disk

Authors: Samaresh Mondal, Gabriele Ponti, Luke Filor, Tong Bao, Frank Haberl, Ciro Salcedo, Sergio Campana, Charles J. Hailey, Kaya Mori, Nanda Rea

Abstract: We aim at characterizing the population of low-luminosity X-ray sources in the Galactic plane by studying their X-ray spectra and periodic signals in the light curves. We are performing an X-ray survey of the Galactic disk using XMM-Newton, and the source XMMU J173029.8-330920 was serendipitously discovered in our campaign. We performed a follow-up observation of the source using our pre-approved… ▽ More We aim at characterizing the population of low-luminosity X-ray sources in the Galactic plane by studying their X-ray spectra and periodic signals in the light curves. We are performing an X-ray survey of the Galactic disk using XMM-Newton, and the source XMMU J173029.8-330920 was serendipitously discovered in our campaign. We performed a follow-up observation of the source using our pre-approved NuSTAR target of opportunity time. We used various phenomenological models in xspec for the X-ray spectral modeling. We also computed the Lomb-Scargle periodogram to search for X-ray periodicity. A Monte Carlo method was used to simulate 1000 artificial light curves to estimate the significance of the detected period. We also searched for X-ray, optical, and infrared counterparts of the source in various catalogs. The spectral modeling indicates the presence of an intervening cloud with $N_{\rm H}\sim(1.5-2.3)\times10^{23}\ \rm cm^{-2}$ that partially absorbs the incoming X-ray photons. The X-ray spectra are best fit by a model representing emission from a collisionally ionized diffuse gas with plasma temperature $kT=26^{+11}_{-5}$ keV. Furthermore, an Fe $K_α$ line at $6.47^{+0.13}_{-0.06}$ keV was detected with an equivalent width of the line of $312\pm104$ eV. We discovered a coherent pulsation with a period of $521.7\pm0.8$ s. The 3-10 keV pulsed fraction of the source is around $\sim$50-60\%. The hard X-ray emission with plasma temperature $kT=26^{+11}_{-5}$ keV, iron $K_α$ emission at 6.4 keV and a periodic behavior of $521.7\pm0.8$ s suggest XMMU J173029.8-33092 to be an intermediate polar. We estimated the mass of the central white dwarf to be $0.94-1.4\ M_{\odot}$ by assuming a distance to the source of $\sim1.4-5$ kpc. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 7 pages, 4 figures, accepted for publication in A&A

Journal ref: A&A 689, A172 (2024)

arXiv:2406.19310 [pdf]

Imaging semiconductor-to-metal transition and topological flat bands of twisted bilayer MoTe2

Authors: Yufeng Liu, Yu Gu, Ting Bao, Ning Mao, Can Li, Shudan Jiang, Liang Liu, Dandan Guan, Yaoyi Li, Hao Zheng, Canhua Liu, Kenji Watanabe, Takashi Taniguchi, Wenhui Duan, Jinfeng Jia, Xiaoxue Liu, Yang Zhang, Tingxin Li, Shiyong Wang

Abstract: Two-dimensional (2D) moiré materials have emerged as a highly tunable platform for investigating novel quantum states of matter arising from strong electronic correlations and nontrivial band topology. Recently, topological flat bands formed in 2D semiconducting moiré superlattices have attracted great interests. In particular, a series of topological quantum phases, including the long-sought frac… ▽ More Two-dimensional (2D) moiré materials have emerged as a highly tunable platform for investigating novel quantum states of matter arising from strong electronic correlations and nontrivial band topology. Recently, topological flat bands formed in 2D semiconducting moiré superlattices have attracted great interests. In particular, a series of topological quantum phases, including the long-sought fractional quantum anomalous Hall (FQAH) effect, have recently been experimentally observed in twisted bilayer MoTe2 (tMoTe2). However, the microscopic information of tMoTe2 moiré superlattice and its electronic structure is still lacking. Here, we present scanning tunneling microscopy and spectroscopy (STM/STS) studies of the tMoTe2 moiré superlattice, with twist angles ranging from about 2.3° to 2.8°. We developed a contact-STM mode to apply pressure on tMoTe2 and observed a phase transition from band insulator to metal of tMoTe2 under pressure at the charge neutrality point. STM imaging reveals a pronounced in-plane lattice reconstruction with periodic strain redistribution in the tMoTe2, which serves as gauge fields for generating topological moiré bands. Importantly, the electronic states of the low-energy moiré flat bands primarily concentrate at the XM and MX regions as revealed by STS imaging. Such spatial distributions are nicely reproduced by our first principal calculations with a large-scale basis, suggesting the low-energy moiré flat bands are formed through the hybridization of K valley bands of the top layer and K' valley bands of the bottom layer. Overall, our findings provide compelling real-space evidence of electronic structure under pressure and topological flat bands of tMoTe2, paving the way for further STM/STS investigations of correlated topological states within the topological flat band in gate-tunable tMoTe2 devices. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.10536 [pdf, other]

doi 10.1016/j.scib.2024.06.011

Universal materials model of deep-learning density functional theory Hamiltonian

Authors: Yuxiang Wang, Yang Li, Zechen Tang, He Li, Zilong Yuan, Honggeng Tao, Nianlong Zou, Ting Bao, Xinghao Liang, Zezhou Chen, Shanghua Xu, Ce Bian, Zhiming Xu, Chong Wang, Chen Si, Wenhui Duan, Yong Xu

Abstract: Realizing large materials models has emerged as a critical endeavor for materials research in the new era of artificial intelligence, but how to achieve this fantastic and challenging objective remains elusive. Here, we propose a feasible pathway to address this paramount pursuit by developing universal materials models of deep-learning density functional theory Hamiltonian (DeepH), enabling compu… ▽ More Realizing large materials models has emerged as a critical endeavor for materials research in the new era of artificial intelligence, but how to achieve this fantastic and challenging objective remains elusive. Here, we propose a feasible pathway to address this paramount pursuit by developing universal materials models of deep-learning density functional theory Hamiltonian (DeepH), enabling computational modeling of the complicated structure-property relationship of materials in general. By constructing a large materials database and substantially improving the DeepH method, we obtain a universal materials model of DeepH capable of handling diverse elemental compositions and material structures, achieving remarkable accuracy in predicting material properties. We further showcase a promising application of fine-tuning universal materials models for enhancing specific materials models. This work not only demonstrates the concept of DeepH's universal materials model but also lays the groundwork for developing large materials models, opening up significant opportunities for advancing artificial intelligence-driven materials discovery. △ Less

Submitted 15 June, 2024; originally announced June 2024.

arXiv:2406.02624 [pdf, other]

Take a Step Further: Understanding Page Spray in Linux Kernel Exploitation

Authors: Ziyi Guo, Dang K Le, Zhenpeng Lin, Kyle Zeng, Ruoyu Wang, Tiffany Bao, Yan Shoshitaishvili, Adam Doupé, Xinyu Xing

Abstract: Recently, a novel method known as Page Spray emerges, focusing on page-level exploitation for kernel vulnerabilities. Despite the advantages it offers in terms of exploitability, stability, and compatibility, comprehensive research on Page Spray remains scarce. Questions regarding its root causes, exploitation model, comparative benefits over other exploitation techniques, and possible mitigation… ▽ More Recently, a novel method known as Page Spray emerges, focusing on page-level exploitation for kernel vulnerabilities. Despite the advantages it offers in terms of exploitability, stability, and compatibility, comprehensive research on Page Spray remains scarce. Questions regarding its root causes, exploitation model, comparative benefits over other exploitation techniques, and possible mitigation strategies have largely remained unanswered. In this paper, we conduct a systematic investigation into Page Spray, providing an in-depth understanding of this exploitation technique. We introduce a comprehensive exploit model termed the \sys model, elucidating its fundamental principles. Additionally, we conduct a thorough analysis of the root causes underlying Page Spray occurrences within the Linux Kernel. We design an analyzer based on the Page Spray analysis model to identify Page Spray callsites. Subsequently, we evaluate the stability, exploitability, and compatibility of Page Spray through meticulously designed experiments. Finally, we propose mitigation principles for addressing Page Spray and introduce our own lightweight mitigation approach. This research aims to assist security researchers and developers in gaining insights into Page Spray, ultimately enhancing our collective understanding of this emerging exploitation technique and making improvements to the community. △ Less

Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

arXiv:2404.13053 [pdf]

Harnessing Large Language Model to collect and analyze Metal-organic framework property dataset

Authors: Wonseok Lee, Yeonghun Kang, Taeun Bae, Jihan Kim

Abstract: This research was focused on the efficient collection of experimental Metal-Organic Framework (MOF) data from scientific literature to address the challenges of accessing hard-to-find data and improving the quality of information available for machine learning studies in materials science. Utilizing a chain of advanced Large Language Models (LLMs), we developed a systematic approach to extract and… ▽ More This research was focused on the efficient collection of experimental Metal-Organic Framework (MOF) data from scientific literature to address the challenges of accessing hard-to-find data and improving the quality of information available for machine learning studies in materials science. Utilizing a chain of advanced Large Language Models (LLMs), we developed a systematic approach to extract and organize MOF data into a structured format. Our methodology successfully compiled information from more than 40,000 research articles, creating a comprehensive and ready-to-use dataset. The findings highlight the significant advantage of incorporating experimental data over relying solely on simulated data for enhancing the accuracy of machine learning predictions in the field of MOF research. △ Less

Submitted 31 March, 2024; originally announced April 2024.

arXiv:2404.11900 [pdf, other]

A New Hybrid Automaton Framework with Partial Differential Equation Dynamics

Authors: Tianshu Bao, Hengrong Du, Weiming Xiang, Taylor T. Johnson

Abstract: This paper presents the syntax and semantics of a novel type of hybrid automaton (HA) with partial differential equation (PDE) dynamic, partial differential hybrid automata (PDHA). In PDHA, we add a spatial domain $X$ and harness a mathematic conception, partition, to help us formally define the spatial relations. While classically the dynamics of HA are described by ordinary differential equation… ▽ More This paper presents the syntax and semantics of a novel type of hybrid automaton (HA) with partial differential equation (PDE) dynamic, partial differential hybrid automata (PDHA). In PDHA, we add a spatial domain $X$ and harness a mathematic conception, partition, to help us formally define the spatial relations. While classically the dynamics of HA are described by ordinary differential equations (ODEs) and differential inclusions, PDHA is capable of describing the behavior of cyber-physical systems (CPS) with continuous dynamics that cannot be modelled using the canonical hybrid systems' framework. For the purposes of analyzing PDHA, we propose another model called the discrete space partial differential hybrid automata (DSPDHA) which handles discrete spatial domains using finite difference methods (FDM) and this simple and intuitive approach reduces the PDHA into HA with ODE systems. We conclude with two illustrative examples in order to exhibit the nature of PDHA and DSPDHA. △ Less

Submitted 18 April, 2024; originally announced April 2024.

Comments: 17 pages

arXiv:2404.07432 [pdf, other]

A Chandra Search for Periodic X-ray Sources in the Bulge of M31

Authors: Jiachang Zhang, Tong Bao, Zhiyuan Li

Abstract: We present a systematic search for periodic X-ray sources in the bulge of M31, using ~ 2 Ms of archival Chandra observations spanning a temporal baseline of 16 years. Utilizing the Gregory-Loredo algorithm that is designed for photon-counting, phase-folded light curves, we detect seven periodic X-ray sources, among which four are newly discovered. Three of these sources are novae, the identified p… ▽ More We present a systematic search for periodic X-ray sources in the bulge of M31, using ~ 2 Ms of archival Chandra observations spanning a temporal baseline of 16 years. Utilizing the Gregory-Loredo algorithm that is designed for photon-counting, phase-folded light curves, we detect seven periodic X-ray sources, among which four are newly discovered. Three of these sources are novae, the identified periods of which range between 1.3-2.0 hour and is most likely the orbital period. The other four sources are low-mass X-ray binaries, the identified periods of which range between 0.13-19.3 hour and are also likely orbital due to a clear eclipsing/dipping behavior in the light curve. We address implications on the X-ray binary population of the M31 bulge. Our study demonstrates the potential of using archival X-ray observations to systematically identify periodic X-ray sources in external galaxies, which would provide valuable information about the underlying exotic stellar populations. △ Less

Submitted 25 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

Comments: 19 pages, 7 figures, accepted for publication in Monthly Notices of the Royal Astronomical Society

arXiv:2404.06449 [pdf, other]

Deep-Learning Database of Density Functional Theory Hamiltonians for Twisted Materials

Authors: Ting Bao, Runzhang Xu, He Li, Xiaoxun Gong, Zechen Tang, Jingheng Fu, Wenhui Duan, Yong Xu

Abstract: Moiré-twisted materials have garnered significant research interest due to their distinctive properties and intriguing physics. However, conducting first-principles studies on such materials faces challenges, notably the formidable computational cost associated with simulating ultra-large twisted structures. This obstacle impedes the construction of a twisted materials database crucial for datadri… ▽ More Moiré-twisted materials have garnered significant research interest due to their distinctive properties and intriguing physics. However, conducting first-principles studies on such materials faces challenges, notably the formidable computational cost associated with simulating ultra-large twisted structures. This obstacle impedes the construction of a twisted materials database crucial for datadriven materials discovery. Here, by using high-throughput calculations and state-of-the-art neural network methods, we construct a Deep-learning Database of density functional theory (DFT) Hamiltonians for Twisted materials named DDHT. The DDHT database comprises trained neural-network models of over a hundred homo-bilayer and hetero-bilayer moiré-twisted materials. These models enable accurate prediction of the DFT Hamiltonian for these materials across arbitrary twist angles, with an averaged mean absolute error of approximately 1.0 meV or lower. The database facilitates the exploration of flat bands and correlated materials platforms within ultra-large twisted structures. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2403.14480 [pdf, other]

doi 10.1051/0004-6361/202449527

Periodicity from X-ray sources within the inner Galactic disk

Authors: Samaresh Mondal, Gabriele Ponti, Tong Bao, Frank Haberl, Sergio Campana, Charles J. Hailey, Shifra Mandel, Sandro Mereghetti, Kaya Mori, Mark R. Morris, Nanda Rea, Lara Sidoli

Abstract: For many years, it has been claimed that the Galactic ridge X-ray emission at the Galactic Center (GC) is truly diffuse in nature. However, with the advancement of modern X-ray satellites, it has been found that most of the diffuse emission is actually comprised of thousands of previously unresolved X-ray point sources. Further, many studies suggest that a vast majority of these X-ray point source… ▽ More For many years, it has been claimed that the Galactic ridge X-ray emission at the Galactic Center (GC) is truly diffuse in nature. However, with the advancement of modern X-ray satellites, it has been found that most of the diffuse emission is actually comprised of thousands of previously unresolved X-ray point sources. Further, many studies suggest that a vast majority of these X-ray point sources are magnetic cataclysmic variables (mCVs) and active binaries. One unambiguous way to identify these mCVs and other sources is by detecting their X-ray periodicity. Therefore, we systematically searched for periodic X-ray sources in the inner Galactic disk, including the GC region. We have used data from our ongoing XMM-Newton Heritage survey of the inner Galactic disk ($350^{\circ}\lesssim l\lesssim+7^{\circ}$ and $-1^{\circ}\lesssim b\lesssim +1^{\circ}$) plus the XMM-Newton archival observations of the GC. We computed the Lomb-Scargle periodogram of the light curves for the periodicity search. We fitted the energy spectra of the sources using a simple power-law model plus three Gaussians at 6.4, 6.7, and 6.9 keV for the iron $K$ emission complex. We detected periodicity in 26 sources. For 14 of them, this is the first discovery of periodicity. For the other 12 sources, we found periods similar to those already known, indicating no significant period evolution. We also searched for the Gaia counterparts of the periodic sources to estimate their distances using the Gaia parallax. We found a likely Gaia counterpart for seven sources. We have classified the sources into four categories based on the periodicity, hardness ratio, and the equivalent width of Fe $K$ line emission. Of the 14 sources where we detect the periodicity for the first time, four are likely to be intermediate polars, five are likely to be polars, two are neutron star X-ray binaries, and three are of unknown nature. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: 19 pages, 9 figures, accepted for publication in A&A

Journal ref: A&A 686, A125 (2024)

arXiv:2402.11494 [pdf, other]

Graph Out-of-Distribution Generalization via Causal Intervention

Authors: Qitian Wu, Fan Nie, Chenxiao Yang, Tianyi Bao, Junchi Yan

Abstract: Out-of-distribution (OOD) generalization has gained increasing attentions for learning on graphs, as graph neural networks (GNNs) often exhibit performance degradation with distribution shifts. The challenge is that distribution shifts on graphs involve intricate interconnections between nodes, and the environment labels are often absent in data. In this paper, we adopt a bottom-up data-generative… ▽ More Out-of-distribution (OOD) generalization has gained increasing attentions for learning on graphs, as graph neural networks (GNNs) often exhibit performance degradation with distribution shifts. The challenge is that distribution shifts on graphs involve intricate interconnections between nodes, and the environment labels are often absent in data. In this paper, we adopt a bottom-up data-generative perspective and reveal a key observation through causal analysis: the crux of GNNs' failure in OOD generalization lies in the latent confounding bias from the environment. The latter misguides the model to leverage environment-sensitive correlations between ego-graph features and target nodes' labels, resulting in undesirable generalization on new unseen nodes. Built upon this analysis, we introduce a conceptually simple yet principled approach for training robust GNNs under node-level distribution shifts, without prior knowledge of environment labels. Our method resorts to a new learning objective derived from causal inference that coordinates an environment estimator and a mixture-of-expert GNN predictor. The new approach can counteract the confounding bias in training data and facilitate learning generalizable predictive relations. Extensive experiment demonstrates that our model can effectively enhance generalization with various types of distribution shifts and yield up to 27.4\% accuracy improvement over state-of-the-arts on graph OOD generalization benchmarks. Source codes are available at https://github.com/fannie1208/CaNet. △ Less

Submitted 16 August, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

Comments: Accepted by the research paper track of The Web Conference (WWW) 2024. The codes are available at https://github.com/fannie1208/CaNet

arXiv:2402.09272 [pdf, other]

Insights and caveats from mining local and global temporal motifs in cryptocurrency transaction networks

Authors: Naomi A. Arnold, Peijie Zhong, Cheick Tidiane Ba, Ben Steer, Raul Mondragon, Felix Cuadrado, Renaud Lambiotte, Richard G. Clegg

Abstract: Distributed ledger technologies have opened up a wealth of fine-grained transaction data from cryptocurrencies like Bitcoin and Ethereum. This allows research into problems like anomaly detection, anti-money laundering, pattern mining and activity clustering (where data from traditional currencies is rarely available). The formalism of temporal networks offers a natural way of representing this da… ▽ More Distributed ledger technologies have opened up a wealth of fine-grained transaction data from cryptocurrencies like Bitcoin and Ethereum. This allows research into problems like anomaly detection, anti-money laundering, pattern mining and activity clustering (where data from traditional currencies is rarely available). The formalism of temporal networks offers a natural way of representing this data and offers access to a wealth of metrics and models. However, the large scale of the data presents a challenge using standard graph analysis techniques. We use temporal motifs to analyse two Bitcoin datasets and one NFT dataset, using sequences of three transactions and up to three users. We show that the commonly used technique of simply counting temporal motifs over all users and all time can give misleading conclusions. Here we also study the motifs contributed by each user and discover that the motif distribution is heavy-tailed and that the key players have diverse motif signatures. We study the motifs that occur in different time periods and find events and anomalous activity that cannot be seen just by a count on the whole dataset. Studying motif completion time reveals dynamics driven by human behaviour as well as algorithmic behaviour. △ Less

Submitted 4 October, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

arXiv:2312.05275 [pdf, other]

Exploring the Limits of ChatGPT in Software Security Applications

Authors: Fangzhou Wu, Qingzhao Zhang, Ati Priya Bajaj, Tiffany Bao, Ning Zhang, Ruoyu "Fish" Wang, Chaowei Xiao

Abstract: Large language models (LLMs) have undergone rapid evolution and achieved remarkable results in recent times. OpenAI's ChatGPT, backed by GPT-3.5 or GPT-4, has gained instant popularity due to its strong capability across a wide range of tasks, including natural language tasks, coding, mathematics, and engaging conversations. However, the impacts and limits of such LLMs in system security domain ar… ▽ More Large language models (LLMs) have undergone rapid evolution and achieved remarkable results in recent times. OpenAI's ChatGPT, backed by GPT-3.5 or GPT-4, has gained instant popularity due to its strong capability across a wide range of tasks, including natural language tasks, coding, mathematics, and engaging conversations. However, the impacts and limits of such LLMs in system security domain are less explored. In this paper, we delve into the limits of LLMs (i.e., ChatGPT) in seven software security applications including vulnerability detection/repair, debugging, debloating, decompilation, patching, root cause analysis, symbolic execution, and fuzzing. Our exploration reveals that ChatGPT not only excels at generating code, which is the conventional application of language models, but also demonstrates strong capability in understanding user-provided commands in natural languages, reasoning about control and data flows within programs, generating complex data structures, and even decompiling assembly code. Notably, GPT-4 showcases significant improvements over GPT-3.5 in most security tasks. Also, certain limitations of ChatGPT in security-related tasks are identified, such as its constrained ability to process long code contexts. △ Less

Submitted 7 December, 2023; originally announced December 2023.

arXiv:2311.18149 [pdf, other]

STF: Spatial Temporal Fusion for Trajectory Prediction

Authors: Pengqian Han, Partha Roop, Jiamou Liu, Tianzhe Bao, Yifei Wang

Abstract: Trajectory prediction is a challenging task that aims to predict the future trajectory of vehicles or pedestrians over a short time horizon based on their historical positions. The main reason is that the trajectory is a kind of complex data, including spatial and temporal information, which is crucial for accurate prediction. Intuitively, the more information the model can capture, the more preci… ▽ More Trajectory prediction is a challenging task that aims to predict the future trajectory of vehicles or pedestrians over a short time horizon based on their historical positions. The main reason is that the trajectory is a kind of complex data, including spatial and temporal information, which is crucial for accurate prediction. Intuitively, the more information the model can capture, the more precise the future trajectory can be predicted. However, previous works based on deep learning methods processed spatial and temporal information separately, leading to inadequate spatial information capture, which means they failed to capture the complete spatial information. Therefore, it is of significance to capture information more fully and effectively on vehicle interactions. In this study, we introduced an integrated 3D graph that incorporates both spatial and temporal edges. Based on this, we proposed the integrated 3D graph, which considers the cross-time interaction information. In specific, we design a Spatial-Temporal Fusion (STF) model including Multi-layer perceptions (MLP) and Graph Attention (GAT) to capture the spatial and temporal information historical trajectories simultaneously on the 3D graph. Our experiment on the ApolloScape Trajectory Datasets shows that the proposed STF outperforms several baseline methods, especially on the long-time-horizon trajectory prediction. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: 6 pages, 6 figures

arXiv:2311.14511 [pdf, other]

A Chandra Survey of Milky Way Globular Clusters -- IV. Periodic X-ray sources

Authors: Tong Bao, Zhiyuan Li, Zhongqun Cheng, Diogo Belloni

Abstract: We present a systematic search for periodic X-ray sources in 10 Galactic globular clusters (GCs) utilizing deep archival Chandra observations. By applying the Gregory-Loredo algorithm, we detect 28 periodic signals among 27 independent X-ray sources in 6 GCs, which include 21 newly discovered ones in the X-ray band. The remaining 4 GCs exhibit no periodic X-ray sources, mainly due to a relatively… ▽ More We present a systematic search for periodic X-ray sources in 10 Galactic globular clusters (GCs) utilizing deep archival Chandra observations. By applying the Gregory-Loredo algorithm, we detect 28 periodic signals among 27 independent X-ray sources in 6 GCs, which include 21 newly discovered ones in the X-ray band. The remaining 4 GCs exhibit no periodic X-ray sources, mainly due to a relatively lower sensitivity of the data. Through analysis of their X-ray timing and spectral properties, complemented with available optical and ultraviolet information, we identify 21 of these periodic sources as cataclysmic variables (CVs). Combining with 11 periodic CVs in 47 Tuc similarly identified in the X-ray band, we compile the most comprehensive sample to date of GC CVs with a probable orbital period. The scarcity of old, short-period CVs in GCs compared to the Galactic inner bulge and solar neighborhood, can be attributed to both a selection effect favouring younger, dynamically-formed systems and the hindrance of CV formation through primordial binary evolution by stellar dynamical interactions common to the GC environment. Additionally, we identify a significant fraction of the GC CVs, most with an orbital period below or within the CV period gap, as probable magnetic CVs, but in the meantime there is a deficiency of luminous intermediate polars in the GC sample compared to the solar neighborhood. △ Less

Submitted 24 November, 2023; originally announced November 2023.

Comments: 20 pages, 6 figures; accepted for publication in MNRAS

arXiv:2311.11315 [pdf, other]

TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems

Authors: Yilun Kong, Jingqing Ruan, Yihong Chen, Bin Zhang, Tianpeng Bao, Shiwei Shi, Guoqing Du, Xiaoru Hu, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao

Abstract: Large Language Models (LLMs) have demonstrated proficiency in addressing tasks that necessitate a combination of task planning and the usage of external tools that require a blend of task planning and the utilization of external tools, such as APIs. However, real-world complex systems present three prevalent challenges concerning task planning and tool usage: (1) The real system usually has a vast… ▽ More Large Language Models (LLMs) have demonstrated proficiency in addressing tasks that necessitate a combination of task planning and the usage of external tools that require a blend of task planning and the utilization of external tools, such as APIs. However, real-world complex systems present three prevalent challenges concerning task planning and tool usage: (1) The real system usually has a vast array of APIs, so it is impossible to feed the descriptions of all APIs to the prompt of LLMs as the token length is limited; (2) the real system is designed for handling complex tasks, and the base LLMs can hardly plan a correct sub-task order and API-calling order for such tasks; (3) Similar semantics and functionalities among APIs in real systems create challenges for both LLMs and even humans in distinguishing between them. In response, this paper introduces a comprehensive framework aimed at enhancing the Task Planning and Tool Usage (TPTU) abilities of LLM-based agents operating within real-world systems. Our framework comprises three key components designed to address these challenges: (1) the API Retriever selects the most pertinent APIs for the user task among the extensive array available; (2) LLM Finetuner tunes a base LLM so that the finetuned LLM can be more capable for task planning and API calling; (3) the Demo Selector adaptively retrieves different demonstrations related to hard-to-distinguish APIs, which is further used for in-context learning to boost the final performance. We validate our methods using a real-world commercial system as well as an open-sourced academic dataset, and the outcomes clearly showcase the efficacy of each individual component as well as the integrated framework. △ Less

Submitted 19 November, 2023; originally announced November 2023.

arXiv:2311.07533 [pdf]

doi 10.1038/s42005-024-01754-y

Transfer learning relaxation, electronic structure and continuum model for twisted bilayer MoTe$_2$

Authors: Ning Mao, Cheng Xu, Jiangxu Li, Ting Bao, Peitao Liu, Yong Xu, Claudia Felser, Liang Fu, Yang Zhang

Abstract: Large-scale moiré systems are extraordinarily sensitive, with even minute atomic shifts leading to significant changes in electronic structures. Here, we investigate the lattice relaxation effect on moiré band structures in twisted bilayer MoTe$_2$ with two approaches: (a) large-scale plane-wave basis first principle calculation down to $2.88^{\circ}$, (b) transfer learning structure relaxation +… ▽ More Large-scale moiré systems are extraordinarily sensitive, with even minute atomic shifts leading to significant changes in electronic structures. Here, we investigate the lattice relaxation effect on moiré band structures in twisted bilayer MoTe$_2$ with two approaches: (a) large-scale plane-wave basis first principle calculation down to $2.88^{\circ}$, (b) transfer learning structure relaxation + local-basis first principles calculation down to $1.1^{\circ}$. We use two types of van der Waals corrections: the D2 method of Grimme and the density-dependent energy correction, and find that the density-dependent energy correction yields a continuous evolution of bandwidth with twist angles. Based on the above results. we develop a more complete continuum model with a single set of parameters for a wide range of twist angles, and perform many-body simulations at $ν=-1,-2/3, -1/3$. △ Less

Submitted 13 August, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

Comments: 6+15 pages, 5+16 figs, updated continuum model fitting, dDsC correction in maintext, D2 correction in the SM

Journal ref: Commun. Phys. 7, 262 (2024)

arXiv:2311.06541 [pdf, other]

Room-Temperature entangled quantum processor on integrated semiconductor photonics platform

Authors: Haibo Hu, Yu Zhou, Ailun Yi, Tongyuan Bao, Chengying Liu, Qi Luo, Yao Zhang, Zi Wang, Zhengtong Liu, Shuming Xiao, Xin Ou, Qinghai Song

Abstract: The rise of the 4H-silicon-carbide-on-insulator (SiCOI) platform marks a promising pathway towards the realization of monolithic quantum photonic networks. However, the challenge of establishing room-temperature entangled registers on these integrated photonics platforms remains unresolved. Herein, we demonstrate the first entangled processor on the SiCOI platform. We show that both deterministic… ▽ More The rise of the 4H-silicon-carbide-on-insulator (SiCOI) platform marks a promising pathway towards the realization of monolithic quantum photonic networks. However, the challenge of establishing room-temperature entangled registers on these integrated photonics platforms remains unresolved. Herein, we demonstrate the first entangled processor on the SiCOI platform. We show that both deterministic generation of single divacancy electron spins and near-unity spin initialization of a single $^{13}$C nuclear spin can be achieved on SiCOI at room temperature. Besides coherently manipulating the single nuclear spin, a maximally entangled state with a fidelity of 0.89 has been prepared on this CMOS-compatible semiconductor-integrated photonics system. This work establishes the foundation for compact and on-chip solutions within existing defect-based computing and sensing protocols, positioning the SiCOI platform as the most promising candidate for integrated monolithic quantum photonic networks. △ Less

Submitted 11 November, 2023; originally announced November 2023.

Comments: 16 pages, 4 figures

arXiv:2311.04854 [pdf, other]

The High Energy X-ray Probe (HEX-P): resolving the nature of Sgr A* flares, compact object binaries and diffuse X-ray emission in the Galactic Center and beyond

Authors: Kaya Mori, Gabriele Ponti, Matteo Bachetti, Arash Bodaghee, Jonathan Grindlay, Jaesub Hong, Roman Krivonos, Ekaterina Kuznetsova, Shifra Mandel, Antonio Rodriguez, Giovanni Stel, Shuo Zhang, Tong Bao, Franz Bauer, Maica Clavel, Benjamin Coughenour, Javier A. Garcia, Julian Gerber, Brian Grefenstette, Amruta Jaodand, Bret Lehmer, Kristin Madsen, Melania Nynka, Peter Predehl, Ciro Salcedo , et al. (2 additional authors not shown)

Abstract: HEX-P is a probe-class mission concept that will combine high spatial resolution X-ray imaging ($<10"$ FWHM) and broad spectral coverage (0.2-80 keV) with an effective area far superior to current facilities' (including XMM-Newton and NuSTAR). These capabilities will enable revolutionary new insights into a variety of important astrophysical problems. We present scientific objectives and simulatio… ▽ More HEX-P is a probe-class mission concept that will combine high spatial resolution X-ray imaging ($<10"$ FWHM) and broad spectral coverage (0.2-80 keV) with an effective area far superior to current facilities' (including XMM-Newton and NuSTAR). These capabilities will enable revolutionary new insights into a variety of important astrophysical problems. We present scientific objectives and simulations of HEX-P observations of the Galactic Center (GC) and Bulge. We demonstrate the unique and powerful capabilities of the HEX-P observatory for studying both X-ray point sources and diffuse X-ray emission. HEX-P will be uniquely equipped to explore a variety of major topics in Galactic astrophysics, allowing us to (1) investigate broad-band properties of X-ray flares emitted from the supermassive black hole (BH) at Sgr A* and probe the associated particle acceleration and emission mechanisms; (2) identify hard X-ray sources detected by NuSTAR and determine X-ray point source populations in different regions and luminosity ranges; (3) determine the distribution of compact object binaries in the nuclear star cluster and the composition of the Galactic Ridge X-ray emission; (4) identify X-ray transients and measure fundamental parameters such as BH spin; (5) find hidden pulsars in the GC; (6) search for BH-OB binaries and hard X-ray flares from young stellar objects in young massive clusters; (7) measure white dwarf (WD) masses of magnetic CVs to deepen our understanding of CV evolution and the origin of WD magnetic fields; (8) explore primary particle accelerators in the GC in synergy with future TeV and neutrino observatories; (9) map out cosmic-ray distributions by observing non-thermal X-ray filaments; (10) explore past X-ray outbursts from Sgr A* through X-ray reflection components from giant molecular clouds. △ Less

Submitted 8 November, 2023; originally announced November 2023.

arXiv:2309.08901 [pdf, other]

Combinatorial curvature flows for generalized hyperbolic circle packings

Authors: Te Ba, Chao Zheng

Abstract: Generalized circle packings were introduced in \cite{Ba-Hu-Sun} as a generalization of tangential circle packings in hyperbolic background geometry. In this paper, we introduce the combinatorial Calabi flow, fractional combinatorial Calabi flow and combinatorial $p$-th Calabi flow for generalized hyperbolic circle packings. We establish several equivalent conditions regarding the longtime behavior… ▽ More Generalized circle packings were introduced in \cite{Ba-Hu-Sun} as a generalization of tangential circle packings in hyperbolic background geometry. In this paper, we introduce the combinatorial Calabi flow, fractional combinatorial Calabi flow and combinatorial $p$-th Calabi flow for generalized hyperbolic circle packings. We establish several equivalent conditions regarding the longtime behaviors of these flows. This provides effective algorithms for finding the generalized circle packings with prescribed total geodesic curvatures. △ Less

Submitted 16 September, 2023; originally announced September 2023.

MSC Class: 52C26; 53E99; 57Q15

arXiv:2308.15208 [pdf, other]

doi 10.1088/1748-0221/18/09/P09002

Optimization of WLS fiber readout for the HERD calorimeter

Authors: X. Liu, Z. Quan, Y. W. Dong, M. Xu, J. J. Wang, R. J. Wang, Z. G. Wang, X. Z. Cui, T. W. Bao, C. L. Liao, J. F. Han, Y. Chen

Abstract: A novel 3-D calorimeter, composed of about 7500 LYSO cubes, is the key and crucial detector of the High Energy cosmic-Radiation Detection (HERD) facility to be installed onboard the China Space Station. Energy deposition from cosmic ray in each LYSO cube is translated by multiple wavelength shifting (WLS) fibers for multi-range data acquisition and real-time triggering. In this study, various me… ▽ More A novel 3-D calorimeter, composed of about 7500 LYSO cubes, is the key and crucial detector of the High Energy cosmic-Radiation Detection (HERD) facility to be installed onboard the China Space Station. Energy deposition from cosmic ray in each LYSO cube is translated by multiple wavelength shifting (WLS) fibers for multi-range data acquisition and real-time triggering. In this study, various methods of surface finish and encapsulation of the LYSO cube were investigated to optimize the amplitude from the WLS fiber end with the aim of improving the signal-to-noise ratio of Intensified scientific CMOS (IsCMOS) collection. The LYSO cube with five rough surfaces and a specular reflector achieves the maximum amplitude at the low-range fiber end, which is increased by roughly 44% compared to the polished cube with PTFE wrapping. The non-uniformity of amplitude at different positions on the LYSO cube surface was measured by X-ray and the positional correlation factor was derived for the entire cube. A simulation based on HERD CALO was conducted, which revealed that both the LYSO cube with five rough surfaces and the cube with rough bottom face exhibit superior energy resolution for electrons compared to the other two configurations. △ Less

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.03427 [pdf, other]

TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage

Authors: Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao

Abstract: With recent advancements in natural language processing, Large Language Models (LLMs) have emerged as powerful tools for various real-world applications. Despite their prowess, the intrinsic generative abilities of LLMs may prove insufficient for handling complex tasks which necessitate a combination of task planning and the usage of external tools. In this paper, we first propose a structured fra… ▽ More With recent advancements in natural language processing, Large Language Models (LLMs) have emerged as powerful tools for various real-world applications. Despite their prowess, the intrinsic generative abilities of LLMs may prove insufficient for handling complex tasks which necessitate a combination of task planning and the usage of external tools. In this paper, we first propose a structured framework tailored for LLM-based AI Agents and discuss the crucial capabilities necessary for tackling intricate problems. Within this framework, we design two distinct types of agents (i.e., one-step agent and sequential agent) to execute the inference process. Subsequently, we instantiate the framework using various LLMs and evaluate their Task Planning and Tool Usage (TPTU) abilities on typical tasks. By highlighting key findings and challenges, our goal is to provide a helpful resource for researchers and practitioners to leverage the power of LLMs in their AI applications. Our study emphasizes the substantial potential of these models, while also identifying areas that need more investigation and improvement. △ Less

Submitted 7 November, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

Comments: Accepted in NeurIPS-2023 Workshop on Foundation Models for Decision Making

arXiv:2307.13572 [pdf, other]

Circle packings and total geodesic curvatures in hyperbolic background geometry

Authors: Te Ba, Guangming Hu, Yu Sun

Abstract: In this paper, we study a new type of circle packings in hyperbolic background geometry. Horocycles and hypercycles are also considered in this packing. We give the existence and rigidity of this type of circle packing with conical singularities in terms of the total geodesic curvature. Moreover, we introduce the combinatorial curvature flow on surfaces to find the desired circle packing with the… ▽ More In this paper, we study a new type of circle packings in hyperbolic background geometry. Horocycles and hypercycles are also considered in this packing. We give the existence and rigidity of this type of circle packing with conical singularities in terms of the total geodesic curvature. Moreover, we introduce the combinatorial curvature flow on surfaces to find the desired circle packing with the prescribed total geodesic curvature. △ Less

Submitted 16 August, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

MSC Class: 52C26; 53A70; 53E20; 57Q15

arXiv:2306.16309 [pdf, other]

Raphtory: The temporal graph engine for Rust and Python

Authors: Ben Steer, Naomi Arnold, Cheick Tidiane Ba, Renaud Lambiotte, Haaroon Yousaf, Lucas Jeub, Fabian Murariu, Shivam Kapoor, Pedro Rico, Rachel Chan, Louis Chan, James Alford, Richard G. Clegg, Felix Cuadrado, Matthew Russell Barnes, Peijie Zhong, John N. Pougué Biyong, Alhamza Alnaimi

Abstract: Raphtory is a platform for building and analysing temporal networks. The library includes methods for creating networks from a variety of data sources; algorithms to explore their structure and evolution; and an extensible GraphQL server for deployment of applications built on top. Raphtory's core engine is built in Rust, for efficiency, with Python interfaces, for ease of use. Raphtory is develop… ▽ More Raphtory is a platform for building and analysing temporal networks. The library includes methods for creating networks from a variety of data sources; algorithms to explore their structure and evolution; and an extensible GraphQL server for deployment of applications built on top. Raphtory's core engine is built in Rust, for efficiency, with Python interfaces, for ease of use. Raphtory is developed by network scientists, with a background in Physics, Applied Mathematics, Engineering and Computer Science, for use across academia and industry. △ Less

Submitted 3 January, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

arXiv:2306.02061 [pdf, other]

Balancing Logit Variation for Long-tailed Semantic Segmentation

Authors: Yuchao Wang, Jingjing Fei, Haochen Wang, Wei Li, Tianpeng Bao, Liwei Wu, Rui Zhao, Yujun Shen

Abstract: Semantic segmentation usually suffers from a long-tail data distribution. Due to the imbalanced number of samples across categories, the features of those tail classes may get squeezed into a narrow area in the feature space. Towards a balanced feature distribution, we introduce category-wise variation into the network predictions in the training phase such that an instance is no longer projected… ▽ More Semantic segmentation usually suffers from a long-tail data distribution. Due to the imbalanced number of samples across categories, the features of those tail classes may get squeezed into a narrow area in the feature space. Towards a balanced feature distribution, we introduce category-wise variation into the network predictions in the training phase such that an instance is no longer projected to a feature point, but a small region instead. Such a perturbation is highly dependent on the category scale, which appears as assigning smaller variation to head classes and larger variation to tail classes. In this way, we manage to close the gap between the feature areas of different categories, resulting in a more balanced representation. It is noteworthy that the introduced variation is discarded at the inference stage to facilitate a confident prediction. Although with an embarrassingly simple implementation, our method manifests itself in strong generalizability to various datasets and task settings. Extensive experiments suggest that our plug-in design lends itself well to a range of state-of-the-art approaches and boosts the performance on top of them. △ Less

Submitted 3 June, 2023; originally announced June 2023.

arXiv:2305.17934 [pdf, other]

ZeroPose: CAD-Prompted Zero-shot Object 6D Pose Estimation in Cluttered Scenes

Authors: Jianqiu Chen, Zikun Zhou, Mingshan Sun, Tianpeng Bao, Rui Zhao, Liwei Wu, Zhenyu He

Abstract: Many robotics and industry applications have a high demand for the capability to estimate the 6D pose of novel objects from the cluttered scene. However, existing classic pose estimation methods are object-specific, which can only handle the specific objects seen during training. When applied to a novel object, these methods necessitate a cumbersome onboarding process, which involves extensive dat… ▽ More Many robotics and industry applications have a high demand for the capability to estimate the 6D pose of novel objects from the cluttered scene. However, existing classic pose estimation methods are object-specific, which can only handle the specific objects seen during training. When applied to a novel object, these methods necessitate a cumbersome onboarding process, which involves extensive dataset preparation and model retraining. The extensive duration and resource consumption of onboarding limit their practicality in real-world applications. In this paper, we introduce ZeroPose, a novel zero-shot framework that performs pose estimation following a Discovery-Orientation-Registration (DOR) inference pipeline. This framework generalizes to novel objects without requiring model retraining. Given the CAD model of a novel object, ZeroPose enables in seconds onboarding time to extract visual and geometric embeddings from the CAD model as a prompt. With the prompting of the above embeddings, DOR can discover all related instances and estimate their 6D poses without additional human interaction or presupposing scene conditions. Compared with existing zero-shot methods solved by the render-and-compare paradigm, the DOR pipeline formulates the object pose estimation into a feature-matching problem, which avoids time-consuming online rendering and improves efficiency. Experimental results on the seven datasets show that ZeroPose as a zero-shot method achieves comparable performance with object-specific training methods and outperforms the state-of-the-art zero-shot method with 50x inference speed improvement. △ Less

Submitted 29 September, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

arXiv:2305.09953 [pdf, other]

Low Complexity Detection of Spatial Modulation Aided OTFS in Doubly-Selective Channels

Authors: Zeping Sui, Hongming Zhang, Yu Xin, Tong Bao, Lie-Liang Yang, Lajos Hanzo

Abstract: A spatial modulation-aided orthogonal time frequency space (SM-OTFS) scheme is proposed for high-Doppler scenarios, which relies on a low-complexity distance-based detection algorithm. We first derive the delay-Doppler (DD) domain input-output relationship of our SM-OTFS system by exploiting an SM mapper, followed by characterizing the doubly-selective channels considered. Then we propose a distan… ▽ More A spatial modulation-aided orthogonal time frequency space (SM-OTFS) scheme is proposed for high-Doppler scenarios, which relies on a low-complexity distance-based detection algorithm. We first derive the delay-Doppler (DD) domain input-output relationship of our SM-OTFS system by exploiting an SM mapper, followed by characterizing the doubly-selective channels considered. Then we propose a distance-based ordering subspace check detector (DOSCD) exploiting the \emph{a priori} information of the transmit symbol vector. Moreover, we derive the discrete-input continuous-output memoryless channel (DCMC) capacity of the system. Finally, our simulation results demonstrate that the proposed SM-OTFS system outperforms the conventional single-input-multiple-output (SIMO)-OTFS system, and that the DOSCD conceived is capable of striking an attractive bit error ratio (BER) vs. complexity trade-off. △ Less

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2304.12230 [pdf, other]

doi 10.1103/PhysRevMaterials.7.L041801

Revealing the two-dimensional electronic structure and anisotropic superconductivity in a natural van der Waals superlattice (PbSe)$_{1.14}$NbSe$_2$

Authors: Haoyuan Zhong, Hongyun Zhang, Haoxiong Zhang, Ting Bao, Kenan Zhang, Shengnan Xu, Laipeng Luo, Awabaikeli Rousuli, Wei Yao, Jonathan D. Denlinger, Yaobo Huang, Yang Wu, Yong Xu, Wenhui Duan, Shuyun Zhou

Abstract: Van der Waals superlattices are important for tailoring the electronic structures and properties of layered materials. Here we report the superconducting properties and electronic structure of a natural van der Waals superlattice (PbSe)$_{1.14}$NbSe$_2$. Anisotropic superconductivity with a transition temperature $T_c$ = 5.6 $\pm$ 0.1 K, which is higher than monolayer NbSe$_2$, is revealed by tran… ▽ More Van der Waals superlattices are important for tailoring the electronic structures and properties of layered materials. Here we report the superconducting properties and electronic structure of a natural van der Waals superlattice (PbSe)$_{1.14}$NbSe$_2$. Anisotropic superconductivity with a transition temperature $T_c$ = 5.6 $\pm$ 0.1 K, which is higher than monolayer NbSe$_2$, is revealed by transport measurements on high-quality samples. Angle-resolved photoemission spectroscopy (ARPES) measurements reveal the two-dimensional electronic structure and a charge transfer of 0.43 electrons per NbSe$_2$ unit cell from the blocking PbSe layer. In addition, polarization-dependent ARPES measurements reveal a significant circular dichroism with opposite contrast at K and K' valleys, suggesting a significant spin-orbital coupling and distinct orbital angular momentum. Our work suggests natural van der Waals superlattice as an effective pathway for achieving intriguing properties distinct from both the bulk and monolayer samples. △ Less

Submitted 24 April, 2023; originally announced April 2023.

Comments: 8 pages, 4 figures

Journal ref: Phys. Rev. Mater. 7, L041801 (2023)

arXiv:2304.12130 [pdf, other]

Reconstructing Turbulent Flows Using Physics-Aware Spatio-Temporal Dynamics and Test-Time Refinement

Authors: Shengyu Chen, Tianshu Bao, Peyman Givi, Can Zheng, Xiaowei Jia

Abstract: Simulating turbulence is critical for many societally important applications in aerospace engineering, environmental science, the energy industry, and biomedicine. Large eddy simulation (LES) has been widely used as an alternative to direct numerical simulation (DNS) for simulating turbulent flows due to its reduced computational cost. However, LES is unable to capture all of the scales of turbule… ▽ More Simulating turbulence is critical for many societally important applications in aerospace engineering, environmental science, the energy industry, and biomedicine. Large eddy simulation (LES) has been widely used as an alternative to direct numerical simulation (DNS) for simulating turbulent flows due to its reduced computational cost. However, LES is unable to capture all of the scales of turbulent transport accurately. Reconstructing DNS from low-resolution LES is critical for many scientific and engineering disciplines, but it poses many challenges to existing super-resolution methods due to the spatio-temporal complexity of turbulent flows. In this work, we propose a new physics-guided neural network for reconstructing the sequential DNS from low-resolution LES data. The proposed method leverages the partial differential equation that underlies the flow dynamics in the design of spatio-temporal model architecture. A degradation-based refinement method is also developed to enforce physical constraints and further reduce the accumulated reconstruction errors over long periods. The results on two different types of turbulent flow data confirm the superiority of the proposed method in reconstructing the high-resolution DNS data and preserving the physical characteristics of flow transport. △ Less

Submitted 12 December, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: 19 pages

arXiv:2303.09714 [pdf, other]

doi 10.1093/mnras/stad836

Periodic X-ray sources in the Massive Globular Cluster 47 Tucanae: Evidence for Dynamically Formed Cataclysmic Variables

Authors: Tong Bao, Zhiyuan Li, Zhongqun Cheng

Abstract: We present a systematic study of periodic X-ray sources in the massive globular cluster 47 Tuc, utilizing deep archival Chandra observations that resolve the cluster core and recently available eROSITA observations that cover the cluster outskirt. By applying the Gregory-Loredo algorithm, we detect 20 periodic signals among 18 X-ray sources, ranging between 205-95731 second. Fourteen periods are n… ▽ More We present a systematic study of periodic X-ray sources in the massive globular cluster 47 Tuc, utilizing deep archival Chandra observations that resolve the cluster core and recently available eROSITA observations that cover the cluster outskirt. By applying the Gregory-Loredo algorithm, we detect 20 periodic signals among 18 X-ray sources, ranging between 205-95731 second. Fourteen periods are newly discovered in the X-ray band. We classify these periodic sources into four quiescent low-mass X-ray binaries, one milli-second pulsar, two coronally-active binaries and eleven cataclysmic variables (CVs), based on their X-ray temporal and spectral properties, as well as multi-band information. Despite a small sample subject to potential selection bias against faint and non-magnetic CVs, the 11 CVs together define an orbital period distribution significantly different from that of the CVs previously found in the solar neighborhood and the Galactic bulge. In particular, there exists in 47 Tuc an apparent paucity of short-period CVs below the period gap, which might be attributed to a high occupation fraction of non-magnetic CVs. Also characteristic of the 47 Tuc CVs are an overabundance of long-period CVs with a subgiant donor, a substantial fraction of CVs within the period gap, and a steep radial surface density profile. These are best understood as a group of CVs having recently formed via dynamical interactions in the dense cluster core. Despite sufficient sensitivity of the X-ray data, only one periodic source is found between one-third of the half-light radius and the tidal radius, the nature of which is unclear. △ Less

Submitted 16 March, 2023; originally announced March 2023.

Comments: 20 pages, 10 figures, Accepted for publication in MNRAS

arXiv:2303.08481 [pdf, other]

SeqCo-DETR: Sequence Consistency Training for Self-Supervised Object Detection with Transformers

Authors: Guoqiang Jin, Fan Yang, Mingshan Sun, Ruyi Zhao, Yakun Liu, Wei Li, Tianpeng Bao, Liwei Wu, Xingyu Zeng, Rui Zhao

Abstract: Self-supervised pre-training and transformer-based networks have significantly improved the performance of object detection. However, most of the current self-supervised object detection methods are built on convolutional-based architectures. We believe that the transformers' sequence characteristics should be considered when designing a transformer-based self-supervised method for the object dete… ▽ More Self-supervised pre-training and transformer-based networks have significantly improved the performance of object detection. However, most of the current self-supervised object detection methods are built on convolutional-based architectures. We believe that the transformers' sequence characteristics should be considered when designing a transformer-based self-supervised method for the object detection task. To this end, we propose SeqCo-DETR, a novel Sequence Consistency-based self-supervised method for object DEtection with TRansformers. SeqCo-DETR defines a simple but effective pretext by minimizes the discrepancy of the output sequences of transformers with different image views as input and leverages bipartite matching to find the most relevant sequence pairs to improve the sequence-level self-supervised representation learning performance. Furthermore, we provide a mask-based augmentation strategy incorporated with the sequence consistency strategy to extract more representative contextual information about the object for the object detection task. Our method achieves state-of-the-art results on MS COCO (45.8 AP) and PASCAL VOC (64.1 AP), demonstrating the effectiveness of our approach. △ Less

Submitted 15 March, 2023; originally announced March 2023.

arXiv:2211.13968 [pdf, other]

MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection

Authors: Tianpeng Bao, Jiadong Chen, Wei Li, Xiang Wang, Jingjing Fei, Liwei Wu, Rui Zhao, Ye Zheng

Abstract: Visual anomaly detection plays a crucial role in not only manufacturing inspection to find defects of products during manufacturing processes, but also maintenance inspection to keep equipment in optimum working condition particularly outdoors. Due to the scarcity of the defective samples, unsupervised anomaly detection has attracted great attention in recent years. However, existing datasets for… ▽ More Visual anomaly detection plays a crucial role in not only manufacturing inspection to find defects of products during manufacturing processes, but also maintenance inspection to keep equipment in optimum working condition particularly outdoors. Due to the scarcity of the defective samples, unsupervised anomaly detection has attracted great attention in recent years. However, existing datasets for unsupervised anomaly detection are biased towards manufacturing inspection, not considering maintenance inspection which is usually conducted under outdoor uncontrolled environment such as varying camera viewpoints, messy background and degradation of object surface after long-term working. We focus on outdoor maintenance inspection and contribute a comprehensive Maintenance Inspection Anomaly Detection (MIAD) dataset which contains more than 100K high-resolution color images in various outdoor industrial scenarios. This dataset is generated by a 3D graphics software and covers both surface and logical anomalies with pixel-precise ground truth. Extensive evaluations of representative algorithms for unsupervised anomaly detection are conducted, and we expect MIAD and corresponding experimental results can inspire research community in outdoor unsupervised anomaly detection tasks. Worthwhile and related future work can be spawned from our new dataset. △ Less

Submitted 28 November, 2022; v1 submitted 25 November, 2022; originally announced November 2022.

arXiv:2211.12315 [pdf, other]

Boosting Personalised Musculoskeletal Modelling with Physics-informed Knowledge Transfer

Authors: Jie Zhang, Yihui Zhao, Tianzhe Bao, Zhenhong Li, Kun Qian, Alejandro F. Frangi, Sheng Quan Xie, Zhi-Qiang Zhang

Abstract: Data-driven methods have become increasingly more prominent for musculoskeletal modelling due to their conceptually intuitive simple and fast implementation. However, the performance of a pre-trained data-driven model using the data from specific subject(s) may be seriously degraded when validated using the data from a new subject, hindering the utility of the personalised musculoskeletal model in… ▽ More Data-driven methods have become increasingly more prominent for musculoskeletal modelling due to their conceptually intuitive simple and fast implementation. However, the performance of a pre-trained data-driven model using the data from specific subject(s) may be seriously degraded when validated using the data from a new subject, hindering the utility of the personalised musculoskeletal model in clinical applications. This paper develops an active physics-informed deep transfer learning framework to enhance the dynamic tracking capability of the musculoskeletal model on the unseen data. The salient advantages of the proposed framework are twofold: 1) For the generic model, physics-based domain knowledge is embedded into the loss function of the data-driven model as soft constraints to penalise/regularise the data-driven model. 2) For the personalised model, the parameters relating to the feature extraction will be directly inherited from the generic model, and only the parameters relating to the subject-specific inference will be finetuned by jointly minimising the conventional data prediction loss and the modified physics-based loss. In this paper, we use the synchronous muscle forces and joint kinematics prediction from surface electromyogram (sEMG) as the exemplar to illustrate the proposed framework. Moreover, convolutional neural network (CNN) is employed as the deep neural network to implement the proposed framework, and the physics law between muscle forces and joint kinematics is utilised as the soft constraints. Results of comprehensive experiments on a self-collected dataset from eight healthy subjects indicate the effectiveness and great generalization of the proposed framework. △ Less

Submitted 22 November, 2022; originally announced November 2022.

Comments: arXiv admin note: text overlap with arXiv:2207.01435

arXiv:2210.10959 [pdf, other]

Geo6D: Geometric Constraints Learning for 6D Pose Estimation

Authors: Jianqiu Chen, Mingshan Sun, Ye Zheng, Tianpeng Bao, Zhenyu He, Donghai Li, Guoqiang Jin, Rui Zhao, Liwei Wu, Xiaoke Jiang

Abstract: Numerous 6D pose estimation methods have been proposed that employ end-to-end regression to directly estimate the target pose parameters. Since the visible features of objects are implicitly influenced by their poses, the network allows inferring the pose by analyzing the differences in features in the visible region. However, due to the unpredictable and unrestricted range of pose variations, the… ▽ More Numerous 6D pose estimation methods have been proposed that employ end-to-end regression to directly estimate the target pose parameters. Since the visible features of objects are implicitly influenced by their poses, the network allows inferring the pose by analyzing the differences in features in the visible region. However, due to the unpredictable and unrestricted range of pose variations, the implicitly learned visible feature-pose constraints are insufficiently covered by the training samples, making the network vulnerable to unseen object poses. To tackle these challenges, we proposed a novel geometric constraints learning approach called Geo6D for direct regression 6D pose estimation methods. It introduces a pose transformation formula expressed in relative offset representation, which is leveraged as geometric constraints to reconstruct the input and output targets of the network. These reconstructed data enable the network to estimate the pose based on explicit geometric constraints and relative offset representation mitigates the issue of the pose distribution gap. Extensive experimental results show that when equipped with Geo6D, the direct 6D methods achieve state-of-the-art performance on multiple datasets and demonstrate significant effectiveness, even with only 10% amount of data. △ Less

Submitted 21 August, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

arXiv:2208.06416 [pdf, other]

Uni6Dv2: Noise Elimination for 6D Pose Estimation

Authors: Mingshan Sun, Ye Zheng, Tianpeng Bao, Jianqiu Chen, Guoqiang Jin, Liwei Wu, Rui Zhao, Xiaoke Jiang

Abstract: Uni6D is the first 6D pose estimation approach to employ a unified backbone network to extract features from both RGB and depth images. We discover that the principal reasons of Uni6D performance limitations are Instance-Outside and Instance-Inside noise. Uni6D's simple pipeline design inherently introduces Instance-Outside noise from background pixels in the receptive field, while ignoring Instan… ▽ More Uni6D is the first 6D pose estimation approach to employ a unified backbone network to extract features from both RGB and depth images. We discover that the principal reasons of Uni6D performance limitations are Instance-Outside and Instance-Inside noise. Uni6D's simple pipeline design inherently introduces Instance-Outside noise from background pixels in the receptive field, while ignoring Instance-Inside noise in the input depth data. In this paper, we propose a two-step denoising approach for dealing with the aforementioned noise in Uni6D. To reduce noise from non-instance regions, an instance segmentation network is utilized in the first step to crop and mask the instance. A lightweight depth denoising module is proposed in the second step to calibrate the depth feature before feeding it into the pose regression network. Extensive experiments show that our Uni6Dv2 reliably and robustly eliminates noise, outperforming Uni6D without sacrificing too much inference efficiency. It also reduces the need for annotated real data that requires costly labeling. △ Less

Submitted 16 March, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

arXiv:2205.03606 [pdf, other]

doi 10.1007/s00526-022-02422-1

Rigidity of bordered polyhedral surfaces

Authors: Te Ba, Shengyu Li, Yaping Xu

Abstract: This paper investigates the rigidity of bordered polyhedral surfaces. Using the variational principle, we show that bordered polyhedral surfaces are determined by boundary value and discrete curvatures on the interior edges. As a corollary, we reprove the classical result that two Euclidean cyclic polygons (or hyperbolic cyclic polygons) are congruent if the lengths of their sides are equal. This paper investigates the rigidity of bordered polyhedral surfaces. Using the variational principle, we show that bordered polyhedral surfaces are determined by boundary value and discrete curvatures on the interior edges. As a corollary, we reprove the classical result that two Euclidean cyclic polygons (or hyperbolic cyclic polygons) are congruent if the lengths of their sides are equal. △ Less

Submitted 28 December, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

MSC Class: 52C25; 52C26; 51M04; 51M09

Journal ref: Calc. Var. Partial Differential Equations62(2023), no.3, Paper No. 78, 20 pp

arXiv:2204.08592 [pdf]

Context-Auditor: Context-sensitive Content Injection Mitigation

Authors: Faezeh Kalantari, Mehrnoosh Zaeifi, Tiffany Bao, Ruoyu Wang, Yan Shoshitaishvili, Adam Doupé

Abstract: Cross-site scripting (XSS) is the most common vulnerability class in web applications over the last decade. Much research attention has focused on building exploit mitigation defenses for this problem, but no technique provides adequate protection in the face of advanced attacks. One technique that bypasses XSS mitigations is the scriptless attack: a content injection technique that uses (among ot… ▽ More Cross-site scripting (XSS) is the most common vulnerability class in web applications over the last decade. Much research attention has focused on building exploit mitigation defenses for this problem, but no technique provides adequate protection in the face of advanced attacks. One technique that bypasses XSS mitigations is the scriptless attack: a content injection technique that uses (among other options) CSS and HTML injection to infiltrate data. In studying this technique and others, we realized that the common property among the exploitation of all content injection vulnerabilities, including not just XSS and scriptless attacks, but also command injections and several others, is an unintended context switch in the victim program's parsing engine that is caused by untrusted user input. In this paper, we propose Context-Auditor, a novel technique that leverages this insight to identify content injection vulnerabilities ranging from XSS to scriptless attacks and command injections. We implemented Context-Auditor as a general solution to content injection exploit detection problem in the form of a flexible, stand-alone detection module. We deployed instances of Context-Auditor as (1) a browser plugin, (2) a web proxy (3) a web server plugin, and (4) as a wrapper around potentially-injectable system endpoints. Because Context-Auditor targets the root cause of content injection exploitation (and, more specifically for the purpose of our prototype, XSS exploitation, scriptless exploitation, and command injection), our evaluation results demonstrate that Context-Auditor can identify and block content injection exploits that modern defenses cannot while maintaining low throughput overhead and avoiding false positives. △ Less

Submitted 28 April, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

arXiv:2201.13180 [pdf, other]

Learning on Arbitrary Graph Topologies via Predictive Coding

Authors: Tommaso Salvatori, Luca Pinchetti, Beren Millidge, Yuhang Song, Tianyi Bao, Rafal Bogacz, Thomas Lukasiewicz

Abstract: Training with backpropagation (BP) in standard deep learning consists of two main steps: a forward pass that maps a data point to its prediction, and a backward pass that propagates the error of this prediction back through the network. This process is highly effective when the goal is to minimize a specific objective function. However, it does not allow training on networks with cyclic or backwar… ▽ More Training with backpropagation (BP) in standard deep learning consists of two main steps: a forward pass that maps a data point to its prediction, and a backward pass that propagates the error of this prediction back through the network. This process is highly effective when the goal is to minimize a specific objective function. However, it does not allow training on networks with cyclic or backward connections. This is an obstacle to reaching brain-like capabilities, as the highly complex heterarchical structure of the neural connections in the neocortex are potentially fundamental for its effectiveness. In this paper, we show how predictive coding (PC), a theory of information processing in the cortex, can be used to perform inference and learning on arbitrary graph topologies. We experimentally show how this formulation, called PC graphs, can be used to flexibly perform different tasks with the same network by simply stimulating specific neurons, and investigate how the topology of the graph influences the final performance. We conclude by comparing against simple baselines trained~with~BP. △ Less

Submitted 12 October, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

Comments: 15 pages, 11 figures

arXiv:2111.11800 [pdf, other]

doi 10.3847/1538-4357/ac80c2

On planet formation around supermassive black holes and the grain disruption barriers by radiative torques

Authors: Nguyen Chau Giang, Thiem Hoang, Le Ngoc Tram, Nguyen Duc Dieu, Pham Ngoc Diep, Nguyen Thi Phuong, Bui Van Tuan, Truong Le Gia Bao

Abstract: It has recently been suggested that planets can form by dust coagulation in the torus of active galactic nuclei (AGN) with low luminosity of $L_{\rm bol}\lesssim 10^{42} erg s^{-1}$, constituting a new class of exoplanets orbiting the supermassive black hole called \textit{blanets}. However, large dust grains in the AGN torus may be rotationally disrupted by the Radiative Torque Disruption (RATD)… ▽ More It has recently been suggested that planets can form by dust coagulation in the torus of active galactic nuclei (AGN) with low luminosity of $L_{\rm bol}\lesssim 10^{42} erg s^{-1}$, constituting a new class of exoplanets orbiting the supermassive black hole called \textit{blanets}. However, large dust grains in the AGN torus may be rotationally disrupted by the Radiative Torque Disruption (RATD) mechanism due to AGN radiation feedback, which would prevent the blanet formation. To test this scenario, we adopt the simple smooth and clumpy dust/gas distribution inside the torus region to study the effect of RATD on the evolution of composite dust grains in the midplane of the torus. We found that grain growth and then blanet formation are possible in the smooth torus model. However, in the clumpy torus model, grain growth will be strongly constrained by RATD, assuming the gas density distribution as adopted in Wada et al. We also found that icy grain mantles inside clumps are quickly detached from the grain core by rotational desorption, reducing the sticking coefficient between icy grains and coagulation efficiency. The grain rotational disruption and ice desorption occur on timescales much shorter than the growth time up to a factor of $\sim 10^{4}$, which are the new barriers that grain growth must overcome to form blanets. Further studies with more realistic AGN models are required to better constrain the effect of RATD on grain growth and blanet formation hypothesis around low luminosity AGN. △ Less

Submitted 27 July, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

Comments: 27 pages, 13 figures Accepted to ApJ

arXiv:2111.10836 [pdf, ps, other]

doi 10.1007/s41365-021-00876-0

Charge resolution in the isochronous mass spectrometry and the mass of $^{51}$Co

Authors: Xu Zhou, Meng Wang, Yu-Hu Zhang, Hu-Shan Xu, You-Jin Yuan, Jian-Cheng Yang, Yu. A. Litvinov, S. A. Litvinov, Bo Mei, Xin-Liang Yan, Xing Xu, Peng Shuai, Yuan-Ming Xing, Rui-Jiu Chen, Xiang-Cheng Chen, Chao-Yi Fu, Qi Zeng, Ming-Ze Sun, Hong-Fu Li, Qian Wang, Tong Bao, Min Zhang, Min Si, Han-Yu Deng, Ming-Zheng Liu , et al. (3 additional authors not shown)

Abstract: Isochronous mass spectrometry (IMS) of heavyion storage rings is a powerful tool for the mass measurements of short-lived nuclei. In IMS experiments, masses are determined through precision measurements of the revolution times of the ions stored in the ring. However, the revolution times cannot be resolved for particles with nearly the same mass-to-charge (m/q) ratios. To overcome this limitation… ▽ More Isochronous mass spectrometry (IMS) of heavyion storage rings is a powerful tool for the mass measurements of short-lived nuclei. In IMS experiments, masses are determined through precision measurements of the revolution times of the ions stored in the ring. However, the revolution times cannot be resolved for particles with nearly the same mass-to-charge (m/q) ratios. To overcome this limitation and to extract the accurate revolution times for such pairs of ion species with very close m/q ratios, in our early work on particle identification, we analyzed the amplitudes of the timing signals from the detector based on the emission of secondary electrons. Here, the previous data analysis method is further improved by considering the signal amplitudes, detection efficiencies, and number of stored ions in the ring. A sensitive Z-dependent parameter is introduced in the data analysis, leading to a better resolution of $^{34}$Ar$^{18+}$ and $^{51}$Co$^{27+}$ with A/Z=17/9. The mean revolution times of $^{34}$Ar$^{18+}$ and $^{51}$Co$^{27+}$ are deduced, although their time difference is merely 1.8 ps. The uncorrected, overlapped peak of these ions has a full width at half maximum of 7.7 ps. The mass excess of $^{51}$Co was determined to be -27332(41) keV, which is in agreement with the previous value of -27342(48) keV. △ Less

Submitted 20 December, 2021; v1 submitted 21 November, 2021; originally announced November 2021.

Comments: 8 pages, 6 figures

Report number: NUCL SCI TECH (2021) 32:37

arXiv:2111.06024 [pdf, other]

doi 10.1093/mnras/stab3259

Searching for Quasi-periodic Oscillations in Active Galactic Nuclei of the Chandra Deep Field South

Authors: Tong Bao, Zhiyuan Li

Abstract: Recent X-ray observations have revealed growing evidence of quasi-periodic oscillation (QPO) in the light curve of active galactic nuclei (AGNs), which may serve as a useful probe of black hole physics. In this work, we present a systematic search for X-ray QPOs among ~ 1000 AGNs of the Chandra Deep Field South (CDF-S) in a homogeneous fashion. Dividing the 7-Ms Chandra observations into four epoc… ▽ More Recent X-ray observations have revealed growing evidence of quasi-periodic oscillation (QPO) in the light curve of active galactic nuclei (AGNs), which may serve as a useful probe of black hole physics. In this work, we present a systematic search for X-ray QPOs among ~ 1000 AGNs of the Chandra Deep Field South (CDF-S) in a homogeneous fashion. Dividing the 7-Ms Chandra observations into four epochs, we search for periodic signals that are persistent throughout any of these epochs, using two independent methods: Lomb-Scargle periodogram and Gregory-Loredo Algorithm. No statistically significant periodic signal is found with either method on any of the four epochs. Our extensive simulations of source light curves suggest that this non-detection is primarily due to a moderate sensitivity of the CDF-S data in QPO detection. Using the simulation-predicted detection efficiency, we are able to provide a meaningful constraint on the intrinsic occurrence rate of persistent QPOs, < (15-20) %, provided that they share a similar power spectral density with a handful of currently known AGN QPOs. The true intrinsic occurrence rate might be significantly below this upper limit, however, given the non-detection among the CDF-S sources. Our additional search for short-lived QPOs that are only detected over a small subset of all observations results in two candidates, one in source XID 643 at a period of ~ 13273 s and the other in source XID 876 at a period of ~ 7065 s. △ Less

Submitted 10 November, 2021; originally announced November 2021.

Comments: 11 pages, 8 figures, Accepted for publication in MNRAS

arXiv:2110.14878 [pdf, other]

A novel measurement of initial-state gluon radiation in hadron collisions using Drell-Yan events

Authors: CDF Collaboration, T. Aaltonen, S. Amerio, D. Amidei, A. Anastassov, A. Annovi, J. Antos, G. Apollinari, J. A. Appel, T. Arisawa, A. Artikov, J. Asaadi, W. Ashmanskas, B. Auerbach, A. Aurisano, F. Azfar, W. Badgett, T. Bae, A. Barbaro-Galtieri, V. E. Barnes, B. A. Barnett, P. Barria, P. Bartos, M. Bauce, F. Bedeschi , et al. (375 additional authors not shown)

Abstract: A study of initial-state gluon radiation (ISR) in hadron collisions is presented using Drell-Yan (DY) events produced in proton-antiproton collisions by the Tevatron collider at a center-of-mass energy of 1.96 TeV. This paper adopts a novel approach which uses the mean value of the Z/$γ^*$ transverse momentum $<p_T^{DY}>$ in DY events as a powerful observable to characterize the effect of ISR. In… ▽ More A study of initial-state gluon radiation (ISR) in hadron collisions is presented using Drell-Yan (DY) events produced in proton-antiproton collisions by the Tevatron collider at a center-of-mass energy of 1.96 TeV. This paper adopts a novel approach which uses the mean value of the Z/$γ^*$ transverse momentum $<p_T^{DY}>$ in DY events as a powerful observable to characterize the effect of ISR. In a data sample corresponding to an integrated luminosity of 9.4 fb$^{-1}$ collected with the CDF Run II detector, $<p_T^{DY}>$ is measured as a function of the Z/$γ^*$ invariant mass. It is found that these two observables have a dependence, $<p_T^{DY}> = -8 + 2.2 \ln m_{DY}^2$ [GeV/c], where $m_{DY}$ is the value of the Z/$γ^*$ mass measured in units of GeV/$c^2$. This linear dependence is observed for the first time in this analysis. It may be exploited to model the effect of ISR and constrain its impact in other processes. △ Less

Submitted 28 October, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

Comments: 14 pages, 14 figures

arXiv:2110.07876 [pdf]

A Bayesian Approach for In-Situ Stress Prediction and Uncertainty Quantification for Subsurface Engineering

Authors: Ting Bao, Jeff Burghardt

Abstract: Many subsurface engineering applications require accurate knowledge of the in-situ state of stress for their safe design and operation. Existing methods to meet this need primarily include field measurements for estimating one or more of the principal stresses from a borehole, or optimization methods for constructing a 3D geomechanical model in terms of geophysical measurements. These methods, how… ▽ More Many subsurface engineering applications require accurate knowledge of the in-situ state of stress for their safe design and operation. Existing methods to meet this need primarily include field measurements for estimating one or more of the principal stresses from a borehole, or optimization methods for constructing a 3D geomechanical model in terms of geophysical measurements. These methods, however, often contain considerable uncertainty in estimating the state of stress. In this paper, we build on a Bayesian approach to quantify uncertainty in stress estimations for subsurface engineering applications. This approach can provide an estimate of the 3D distribution of stress throughout the volume of interest and provide an estimate of the uncertainty arising from the stress measurement, the rheology parameters, and a paucity of measurements. The value of this approach is demonstrated using stress measurements from the In Salah carbon storage site, which was one of the first industrial carbon capture and storage projects in the world. This demonstration shows the application of this Bayesian approach for estimating the initial state of stress for In Salah and quantifying the uncertainty in the estimated stress. Also, an assessment of a maximum injection pressure to prevent geomechanical risks from CO2 injection pressures is provided in terms of the probability distribution of the minimum principal stress quantified by the approach. With the In Salah case study, this paper demonstrates that using the Bayesian approach can provide additional insights for site explorations and/or project operations to make informed-site decisions for subsurface engineering applications. △ Less

Submitted 15 October, 2021; originally announced October 2021.

Comments: 34 pages, 15 figures

Report number: PNNL-SA-167337

Showing 1–50 of 185 results for author: Ba, T