-
EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Authors:
Talor Abramovich,
Meet Udeshi,
Minghao Shao,
Kilian Lieret,
Haoran Xi,
Kimberly Milner,
Sofija Jancheska,
John Yang,
Carlos E. Jimenez,
Farshad Khorrami,
Prashanth Krishnamurthy,
Brendan Dolan-Gavitt,
Muhammad Shafique,
Karthik Narasimhan,
Ramesh Karri,
Ofir Press
Abstract:
Although language model (LM) agents are demonstrating growing potential in many domains, their success in cybersecurity has been limited due to simplistic design and the lack of fundamental features for this domain. We present EnIGMA, an LM agent for autonomously solving Capture The Flag (CTF) challenges. EnIGMA introduces new Agent-Computer Interfaces (ACIs) to improve the success rate on CTF challenges. We establish the novel Interactive Agent Tool concept, which enables LM agents to run interactive command-line utilities essential for these challenges. Empirical analysis of EnIGMA on over 350 CTF challenges from three different benchmarks indicates that providing a robust set of new tools with demonstration of their usage helps the LM solve complex problems and achieves state-of-the-art results on the NYU CTF and Intercode-CTF benchmarks. Finally, we discuss insights on ACI design and agent behavior on cybersecurity tasks that highlight the need to adapt real-world tools for LM agents.
Submitted 24 September, 2024;
originally announced September 2024.
-
FAST Ultra-Deep Survey (FUDS): Data Release for FUDS0
Authors:
Hongwei Xi,
Bo Peng,
Lister Staveley-Smith,
Bi-Qing For,
Bin Liu,
Dejian Ding
Abstract:
We have used the Five-hundred-meter Aperture Spherical radio Telescope (FAST) to make a blind ultra-deep survey for neutral hydrogen (HI). We present the complete results from the first of six fields (FUDS0). This observation of 95 hours allowed us to achieve a high sensitivity ($\sim 50~μ$Jy beam$^{-1}$) and a high frequency resolution (22.9 kHz) over an area of 0.72 deg$^2$. We detected 128 galaxies in HI distributed over the redshift range of $0<z<0.4$ with HI masses in the range of $6.67 \leq \log(M_{\rm HI}/h_{70}^{-2} \rm M_\odot) \leq 10.92$, and three faint high-velocity clouds (HVCs) with peak column density of $N_{\rm HI} \leq 3.1 \times 10^{17}$ cm$^{-2}$. Of the galaxies, 95 are new detections and six have $z > 0.38$, where no unlensed HI emission has previously been directly detected. Estimates of completeness and reliability are presented for the catalog. Consistency of continuum and HI flux estimates with NVSS and AUDS, respectively, confirms the accuracy of the calibration method and data reduction pipeline developed for the full FUDS survey.
Submitted 9 September, 2024;
originally announced September 2024.
-
STLLM-DF: A Spatial-Temporal Large Language Model with Diffusion for Enhanced Multi-Mode Traffic System Forecasting
Authors:
Zhiqi Shao,
Haoning Xi,
Haohui Lu,
Ze Wang,
Michael G. H. Bell,
Junbin Gao
Abstract:
The rapid advancement of Intelligent Transportation Systems (ITS) presents challenges, particularly with missing data in multi-modal transportation and the complexity of handling diverse sequential tasks within a centralized framework. To address these issues, we propose the Spatial-Temporal Large Language Model Diffusion (STLLM-DF), an innovative model that leverages Denoising Diffusion Probabilistic Models (DDPMs) and Large Language Models (LLMs) to improve multi-task transportation prediction. The DDPM's robust denoising capabilities enable it to recover underlying data patterns from noisy inputs, making it particularly effective in complex transportation systems. Meanwhile, the non-pretrained LLM dynamically adapts to spatial-temporal relationships within multi-modal networks, allowing the system to efficiently manage diverse transportation tasks in both long-term and short-term predictions. Extensive experiments demonstrate that STLLM-DF consistently outperforms existing models, achieving an average reduction of 2.40\% in MAE, 4.50\% in RMSE, and 1.51\% in MAPE. This model significantly advances centralized ITS by enhancing predictive accuracy, robustness, and overall system performance across multiple tasks, thus paving the way for more effective spatio-temporal traffic forecasting through the integration of frozen transformer language models and diffusion techniques.
Submitted 8 September, 2024;
originally announced September 2024.
-
Fast and Communication-Efficient Multi-UAV Exploration Via Voronoi Partition on Dynamic Topological Graph
Authors:
Qianli Dong,
Haobo Xi,
Shiyong Zhang,
Qingchen Bi,
Tianyi Li,
Ziyu Wang,
Xuebo Zhang
Abstract:
Efficient data transmission and reasonable task allocation are important to improve multi-robot exploration efficiency. However, most communication data types typically contain redundant information and thus require massive communication volume. Moreover, exploration-oriented task allocation is far from trivial and becomes even more challenging for resource-limited unmanned aerial vehicles (UAVs). In this paper, we propose a fast and communication-efficient multi-UAV exploration method for exploring large environments. We first design a multi-robot dynamic topological graph (MR-DTG) consisting of nodes representing the explored and exploring regions and edges connecting nodes. Supported by MR-DTG, our method achieves efficient communication by only transferring the necessary information required by exploration planning. To further improve the exploration efficiency, a hierarchical multi-UAV exploration method is devised using MR-DTG. Specifically, the \emph{graph Voronoi partition} is used to allocate MR-DTG's nodes to the closest UAVs, considering the actual motion cost, thus achieving reasonable task allocation. To our knowledge, this is the first work to address multi-UAV exploration using \emph{graph Voronoi partition}. The proposed method is compared with a state-of-the-art method in simulations. The results show that the proposed method is able to reduce the exploration time and communication volume by up to 38.3\% and 95.5\%, respectively. Finally, the effectiveness of our method is validated in the real-world experiment with 6 UAVs. We will release the source code to benefit the community.
Submitted 11 August, 2024;
originally announced August 2024.
-
Charmonium-like states with the exotic quantum number $J^{PC} = 3^{-+}$
Authors:
Hong-Zhou Xi,
Hua-Xing Chen,
Wei Chen,
T. G. Steele,
Yong Zhang,
Dan Zhou
Abstract:
We apply the method of QCD sum rules to study the $q c \bar q \bar c$ tetraquark states with the exotic quantum number $J^{PC} = 3^{-+}$, and extract the mass of the lowest-lying state to be ${4.49^{+0.45}_{-0.41}}$ GeV. To construct the relevant tetraquark currents we need to explicitly add the covariant derivative operator. Our systematic analysis of these interpolating currents indicates that: a) this state readily decays into the $P$-wave $[ρJ/ψ] / [ωJ/ψ]$ channel but not into the $ [ρχ_{c2}]/[ωχ_{c2}]/[J/ψf_2(1270)]$ channels, and b) it readily decays into the $[D^* \bar D_2^*]$ channel but not into the $P$-wave $[D^* \bar D^*]$ channel.
Submitted 19 October, 2024; v1 submitted 8 August, 2024;
originally announced August 2024.
-
ARVO: Atlas of Reproducible Vulnerabilities for Open Source Software
Authors:
Xiang Mei,
Pulkit Singh Singaria,
Jordi Del Castillo,
Haoran Xi,
Abdelouahab Benchikh,
Tiffany Bao,
Ruoyu Wang,
Yan Shoshitaishvili,
Adam Doupé,
Hammond Pearce,
Brendan Dolan-Gavitt
Abstract:
High-quality datasets of real-world vulnerabilities are enormously valuable for downstream research in software security, but existing datasets are typically small, require extensive manual effort to update, and are missing crucial features that such research needs. In this paper, we introduce ARVO: an Atlas of Reproducible Vulnerabilities in Open-source software. By sourcing vulnerabilities from C/C++ projects that Google's OSS-Fuzz discovered and implementing a reliable re-compilation system, we successfully reproduce more than 5,000 memory vulnerabilities across over 250 projects, each with a triggering input, the canonical developer-written patch for fixing the vulnerability, and the ability to automatically rebuild the project from source and run it at its vulnerable and patched revisions. Moreover, our dataset can be automatically updated as OSS-Fuzz finds new vulnerabilities, allowing it to grow over time. We provide a thorough characterization of the ARVO dataset, show that it can locate fixes more accurately than Google's own OSV reproduction effort, and demonstrate its value for future research through two case studies: firstly evaluating real-world LLM-based vulnerability repair, and secondly identifying over 300 falsely patched (still-active) zero-day vulnerabilities from projects improperly labeled by OSS-Fuzz.
Submitted 4 August, 2024;
originally announced August 2024.
-
The most distant HI galaxies discovered by the 500 m dish FAST
Authors:
Hongwei Xi,
Bo Peng,
Lister Staveley-Smith,
Bi-Qing For,
Bin Liu,
Ru-Rong Chen,
Lei Yu,
Dejian Ding,
Wei-Jian Guo,
Hu Zou,
Suijian Xue,
Jing Wang,
Thomas G. Brink,
WeiKang Zheng,
Alexei V. Filippenko,
Yi Yang,
Jianyan Wei,
Y. Sophia Dai,
Zi-Jian Li,
Zizhao He,
Chengzi Jiang,
Alexei Moiseev,
Sergey Kotov
Abstract:
Neutral hydrogen (HI) is the primary component of the cool interstellar medium (ISM) and is the reservoir of fuel for star formation. Owing to the sensitivity of existing radio telescopes, our understanding of the evolution of the ISM in galaxies remains limited, as it is based on only a few hundred galaxies detected in HI beyond the local Universe. With the high sensitivity of the Five-hundred-meter Aperture Spherical radio Telescope (FAST), we carried out a blind HI search, the FAST Ultra-Deep Survey (FUDS), which extends to redshifts up to 0.42 and a sensitivity of 50 $\rm μJy \cdot beam^{-1}$. Here, we report the first discovery of six galaxies in HI at $z>0.38$. For these galaxies, the FAST angular resolution of $\sim\,4'$ corresponds to a mean linear size of $\sim1.3\,h_{70}^{-1}\,$Mpc. These galaxies are among the most distant HI emission detections known, with one having the most massive HI content ($10^{10.93 \pm 0.04}~h_{70}^{-2}\, \rm M_\odot$). Using recent data from the DESI survey, and new observations with the Hale, BTA, and Keck telescopes, optical counterparts are detected for all galaxies within the 3-$σ$ positional uncertainty ($0.5\,h_{70}^{-1}\,$Mpc) and $\rm 200\,km \cdot s^{-1}$ in recession velocity. Assuming that the dominant source of HI is the identified optical counterpart, we find evidence of evolution in the HI content of galaxies over the last 4.2 Gyr. Our new high-redshift HI galaxy sample provides the opportunity to better investigate the evolution of cool gas in galaxies. A larger sample size in the future will allow us to refine our knowledge of the formation and evolution of galaxies.
Submitted 1 August, 2024;
originally announced August 2024.
-
Heralded High-Dimensional Photon-Photon Quantum Gate
Authors:
Zhi-Feng Liu,
Zhi-Cheng Ren,
Pei Wan,
Wen-Zheng Zhu,
Zi-Mo Cheng,
Jing Wang,
Yu-Peng Shi,
Han-Bing Xi,
Marcus Huber,
Nicolai Friis,
Xiaoqin Gao,
Xi-Lin Wang,
Hui-Tian Wang
Abstract:
High-dimensional encoding of quantum information holds the potential to greatly increase the computational power of existing devices by enlarging the accessible state space for fixed register size and by reducing the number of required entangling gates. However, qudit-based quantum computation remains far less developed than conventional qubit-based approaches, in particular for photons, which represent natural multi-level information carriers that play a crucial role in the development of quantum networks. A major obstacle for realizing quantum gates between two individual photons is the restriction of direct interaction between photons in linear media. In particular, essential logic components for quantum operations such as native qudit-qudit entangling gates are still missing for optical quantum information processing. Here we address this challenge by presenting a protocol for realizing an entangling gate -- the controlled phase-flip (CPF) gate -- for two photonic qudits in arbitrary dimension. We experimentally demonstrate this protocol by realizing a four-dimensional qudit-qudit CPF gate, whose decomposition would require at least 13 two-qubit entangling gates. Our photonic qudits are encoded in orbital angular momentum (OAM) and we have developed a new active high-precision phase-locking technology to construct a high-dimensional OAM beam splitter that increases the stability of the CPF gate, resulting in a process fidelity within a range of $ [0.64 \pm 0.01, 0.82 \pm 0.01]$. Our experiment represents a significant advance for high-dimensional optical quantum information processing and has the potential for wider applications beyond optical systems.
Submitted 23 July, 2024;
originally announced July 2024.
-
Particle transport and deposition in wall-sheared thermal turbulence
Authors:
Ao Xu,
Ben-Rui Xu,
Heng-Dong Xi
Abstract:
We studied the transport and deposition behavior of point particles in Rayleigh-Bénard convection cells subjected to Couette-type wall shear. Direct numerical simulations (DNS) are performed for Rayleigh number (Ra) in the range \(10^7 \leq Ra \leq 10^9\) with a fixed Prandtl number \(Pr = 0.71\), while the wall-shear Reynolds number (\(Re_w\)) is in the range \(0 \leq Re_w \leq 12000\). With the increase of \(Re_w\), the large-scale rolls expanded horizontally, evolving into zonal flow in two-dimensional simulations or streamwise-oriented rolls in three-dimensional simulations. We observed that for particles with a small Stokes number St, they either circulated within the large-scale rolls when buoyancy dominated or drifted near the walls when shear dominated. For medium St particles, pronounced spatial inhomogeneity and preferential concentration were observed regardless of the prevailing flow state. For large St particles, the turbulent flow structure had a minor influence on particles' motion; although clustering still occurred, wall shear had a negligible influence compared to that for medium St particles. We then presented the settling curves to quantify the particle deposition ratio on the walls. Our DNS results aligned well with previous theoretical predictions, which state that small St particles settle with an exponential deposition ratio and large St particles settle with a linear deposition ratio. For medium St particles, where complex particles-turbulence interaction emerges, we developed a new model describing the settling process with an initial linear stage followed by a non-linear stage. Unknown parameters in our model can be determined either by fitting the settling curves or using empirical relations. Compared with DNS results, our model also accurately predicts the average residence time across a wide range of St for various \(Re_w\).
Submitted 17 July, 2024;
originally announced July 2024.
-
NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
Authors:
Minghao Shao,
Sofija Jancheska,
Meet Udeshi,
Brendan Dolan-Gavitt,
Haoran Xi,
Kimberly Milner,
Boyuan Chen,
Max Yin,
Siddharth Garg,
Prashanth Krishnamurthy,
Farshad Khorrami,
Ramesh Karri,
Muhammad Shafique
Abstract:
Large Language Models (LLMs) are being deployed across various domains today. However, their capacity to solve Capture the Flag (CTF) challenges in cybersecurity has not been thoroughly evaluated. To address this, we develop a novel method to assess LLMs in solving CTF challenges by creating a scalable, open-source benchmark database specifically designed for these applications. This database includes metadata for LLM testing and adaptive learning, compiling a diverse range of CTF challenges from popular competitions. Utilizing the advanced function calling capabilities of LLMs, we build a fully automated system with an enhanced workflow and support for external tool calls. Our benchmark dataset and automated framework allow us to evaluate the performance of five LLMs, encompassing both black-box and open-source models. This work lays the foundation for future research into improving the efficiency of LLMs in interactive cybersecurity tasks and automated task planning. By providing a specialized dataset, our project offers an ideal platform for developing, testing, and refining LLM-based approaches to vulnerability detection and resolution. Evaluating LLMs on these challenges and comparing with human performance yields insights into their potential for AI-driven cybersecurity solutions to perform real-world threat management. We make our dataset publicly available as open source at https://github.com/NYU-LLM-CTF/LLM_CTF_Database along with our playground automated framework at https://github.com/NYU-LLM-CTF/llm_ctf_automation.
Submitted 21 August, 2024; v1 submitted 8 June, 2024;
originally announced June 2024.
-
ST-Mamba: Spatial-Temporal Selective State Space Model for Traffic Flow Prediction
Authors:
Zhiqi Shao,
Michael G. H. Bell,
Ze Wang,
D. Glenn Geers,
Haoning Xi,
Junbin Gao
Abstract:
Traffic flow prediction, a critical aspect of intelligent transportation systems, has been increasingly popular in the field of artificial intelligence, driven by the availability of extensive traffic data. The current challenges of traffic flow prediction lie in integrating diverse factors while balancing the trade-off between computational complexity and the precision necessary for effective long-range and large-scale predictions. To address these challenges, we introduce a Spatial-Temporal Selective State Space (ST-Mamba) model, which is the first to leverage the power of spatial-temporal learning in traffic flow prediction without using graph modeling. The ST-Mamba model can effectively capture the long-range dependency for traffic flow data, thereby avoiding the issue of over-smoothing. The proposed ST-Mamba model incorporates an effective Spatial-Temporal Mixer (ST-Mixer) to seamlessly integrate spatial and temporal data processing into a unified framework and employs a Spatial-Temporal Selective State Space (ST-SSM) block to improve computational efficiency. The proposed ST-Mamba model, specifically designed for spatial-temporal data, simplifies processing procedure and enhances generalization capabilities, thereby significantly improving the accuracy of long-range traffic flow prediction. Compared to the previous state-of-the-art (SOTA) model, the proposed ST-Mamba model achieves a 61.11\% improvement in computational speed and increases prediction accuracy by 0.67\%. Extensive experiments with real-world traffic datasets demonstrate that the \textsf{ST-Mamba} model sets a new benchmark in traffic flow prediction, achieving SOTA performance in computational efficiency for both long- and short-range predictions and significantly improving the overall efficiency and effectiveness of traffic management.
Submitted 18 May, 2024; v1 submitted 19 April, 2024;
originally announced April 2024.
-
Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization
Authors:
Haocheng Xi,
Yuxiang Chen,
Kang Zhao,
Kai Jun Teh,
Jianfei Chen,
Jun Zhu
Abstract:
Pretraining transformers is generally time-consuming. Fully quantized training (FQT) is a promising approach to speed up pretraining. However, most FQT methods adopt a quantize-compute-dequantize procedure, which often leads to suboptimal speedup and significant performance degradation when used in transformers due to the high memory access overheads and low-precision computations. In this work, we propose Jetfire, an efficient and accurate INT8 training method specific to transformers. Our method features an INT8 data flow to optimize memory access and a per-block quantization method to maintain the accuracy of pretrained transformers. Extensive experiments demonstrate that our INT8 FQT method achieves comparable accuracy to the FP16 training baseline and outperforms the existing INT8 training works for transformers. Moreover, for a standard transformer block, our method offers an end-to-end training speedup of 1.42x and a 1.49x memory reduction compared to the FP16 baseline. Our code is open sourced at https://github.com/thu-ml/Jetfire-INT8Training.
Submitted 20 July, 2024; v1 submitted 19 March, 2024;
originally announced March 2024.
-
Delving into temperature scaling for adaptive conformal prediction
Authors:
Huajun Xi,
Jianguo Huang,
Kangdao Liu,
Lei Feng,
Hongxin Wei
Abstract:
Conformal prediction, as an emerging uncertainty quantification technique, constructs prediction sets that are guaranteed to contain the true label with pre-defined probability. Previous works often employ temperature scaling to calibrate the classifier, assuming that confidence calibration can benefit conformal prediction. In this work, we empirically show that current confidence calibration methods (e.g., temperature scaling) normally lead to larger prediction sets in adaptive conformal prediction. Theoretically, we prove that a prediction with higher confidence could result in a smaller prediction set in expectation. Inspired by the analysis, we propose $Conformal$ $Temperature$ $Scaling$ (ConfTS), a variant of temperature scaling that aims to improve the efficiency of adaptive conformal prediction. Specifically, ConfTS optimizes the temperature value by minimizing the gap between the threshold and the non-conformity score of the ground truth for a held-out validation dataset. In this way, the temperature value obtained would lead to an optimal set of high efficiency without violating the marginal coverage property. Extensive experiments demonstrate that our method can effectively enhance adaptive conformal prediction methods in both efficiency and conditional coverage, reducing the average size of APS and RAPS by nearly 50$\%$ on ImageNet at error rate $α=0.1$.
Submitted 8 October, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
T-Rex: Text-assisted Retrosynthesis Prediction
Authors:
Yifeng Liu,
Hanwen Xu,
Tangqi Fang,
Haocheng Xi,
Zixuan Liu,
Sheng Zhang,
Hoifung Poon,
Sheng Wang
Abstract:
As a fundamental task in computational chemistry, retrosynthesis prediction aims to identify a set of reactants to synthesize a target molecule. Existing template-free approaches only consider the graph structures of the target molecule, which often cannot generalize well to rare reaction types and large molecules. Here, we propose T-Rex, a text-assisted retrosynthesis prediction approach that exploits pre-trained text language models, such as ChatGPT, to assist the generation of reactants. T-Rex first exploits ChatGPT to generate a description for the target molecule and rank candidate reaction centers based on both the description and the molecular graph. It then re-ranks these candidates by querying the descriptions for each reactant and examines which group of reactants can best synthesize the target molecule. We observed that T-Rex substantially outperformed graph-based state-of-the-art approaches on two datasets, indicating the effectiveness of considering text information. We further found that T-Rex outperformed the variant that only uses the ChatGPT-based description without the re-ranking step, demonstrating how our framework outperformed a straightforward integration of ChatGPT and graph information. Collectively, we show that text generated by pre-trained language models can substantially improve retrosynthesis prediction, opening up new avenues for exploiting ChatGPT to advance computational chemistry. The code can be found at https://github.com/lauyikfung/T-Rex.
Submitted 25 January, 2024;
originally announced January 2024.
-
Optimization-Free Test-Time Adaptation for Cross-Person Activity Recognition
Authors:
Shuoyuan Wang,
Jindong Wang,
HuaJun Xi,
Bob Zhang,
Lei Zhang,
Hongxin Wei
Abstract:
Human Activity Recognition (HAR) models often suffer from performance degradation in real-world applications due to distribution shifts in activity patterns across individuals. Test-Time Adaptation (TTA) is an emerging learning paradigm that aims to utilize the test stream to adjust predictions in real-time inference, which has not been explored in HAR before. However, the high computational cost of optimization-based TTA algorithms makes it intractable to run on resource-constrained edge devices. In this paper, we propose an Optimization-Free Test-Time Adaptation (OFTTA) framework for sensor-based HAR. OFTTA adjusts the feature extractor and linear classifier simultaneously in an optimization-free manner. For the feature extractor, we propose Exponential Decay Test-time Normalization (EDTN) to replace the conventional batch normalization (CBN) layers. EDTN combines CBN and Test-time batch Normalization (TBN) to extract reliable features against domain shifts with TBN's influence decreasing exponentially in deeper layers. For the classifier, we adjust the prediction by computing the distance between the feature and the prototype, which is calculated by a maintained support set. In addition, the update of the support set is based on the pseudo label, which can benefit from reliable features extracted by EDTN. Extensive experiments on three public cross-person HAR datasets and two different TTA settings demonstrate that OFTTA outperforms the state-of-the-art TTA approaches in both classification performance and computational efficiency. Finally, we verify the superiority of our proposed OFTTA on edge devices, indicating possible deployment in real applications. Our code is available at https://github.com/Claydon-Wang/OFTTA.
Submitted 7 February, 2024; v1 submitted 27 October, 2023;
originally announced October 2023.
-
Conformal Prediction for Deep Classifier via Label Ranking
Authors:
Jianguo Huang,
Huajun Xi,
Linjun Zhang,
Huaxiu Yao,
Yue Qiu,
Hongxin Wei
Abstract:
Conformal prediction is a statistical framework that generates prediction sets containing ground-truth labels with a desired coverage guarantee. The predicted probabilities produced by machine learning models are generally miscalibrated, leading to large prediction sets in conformal prediction. To address this issue, we propose a novel algorithm named $\textit{Sorted Adaptive Prediction Sets}$ (SAPS), which discards all the probability values except for the maximum softmax probability. The key idea behind SAPS is to minimize the dependence of the non-conformity score on the probability values while retaining the uncertainty information. In this manner, SAPS can produce compact prediction sets and communicate instance-wise uncertainty. Extensive experiments validate that SAPS not only lessens the prediction sets but also broadly enhances the conditional coverage rate of prediction sets.
Submitted 6 June, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
A Two-Level Linear Dependent Type Theory
Authors:
Qiancheng Fu,
Hongwei Xi
Abstract:
We present a type theory combining both linearity and dependency by stratifying typing rules into a level for logics and a level for programs. The distinction between logics and programs decouples their semantics, allowing the type system to assume tight resource bounds. A natural notion of irrelevancy is established where all proofs and types occurring inside programs are fully erasable without compromising their operational behavior. Through a heap-based operational semantics, we show that extracted programs always make computational progress and run memory clean. Additionally, programs can be freely reflected into the logical level for conducting deep proofs in the style of standard dependent type theories. This enables one to write resource safe programs and verify their correctness using a unified language.
Submitted 15 September, 2023;
originally announced September 2023.
-
Multirole Logic and Multiparty Channels
Authors:
Hongwei Xi,
Hanwen Wu
Abstract:
We identify multirole logic as a new form of logic in which conjunction/disjunction is interpreted as an ultrafilter on some underlying set of roles and the notion of negation is generalized to endomorphisms on this set. We formulate both multirole logic (MRL) and linear multirole logic (LMRL) as natural generalizations of classical logic (CL) and classical linear logic (CLL), respectively. Among various meta-properties established for MRL and LMRL, we obtain one named multiparty cut-elimination stating that every cut involving one or more sequents (as a generalization of a binary cut involving exactly two sequents) can be eliminated, thus extending the celebrated result of cut-elimination by Gentzen. As a side note, we also give an ultrafilter-based interpretation for intuitionism, formulating MRLJ as a natural generalization of intuitionistic logic (IL). An immediate application of LMRL can be found in a formulation of session types for channels that support multiparty communication in distributed programming. We present a multi-threaded lambda-calculus (MTLC) where threads communicate on linearly typed multiparty channels that are directly rooted in LMRL, establishing for MTLC both type preservation and global progress. The primary contribution of the paper consists of both identifying multirole logic as a new form of logic and establishing a theoretical foundation for it, and the secondary contribution lies in applying multirole logic to the practical domain of distributed programming.
Submitted 3 September, 2023;
originally announced September 2023.
-
Fully-strange tetraquark states with the exotic quantum numbers $J^{PC} = 0^{+-}$ and $2^{+-}$
Authors:
Hong-Zhou Xi,
Yi-Wei Jiang,
Hua-Xing Chen,
Atsushi Hosaka,
Niu Su
Abstract:
We study the fully-strange tetraquark states with the exotic quantum numbers $J^{PC} = 0^{+-}$ and $2^{+-}$. We construct their corresponding diquark-antidiquark interpolating currents, and apply the QCD sum rule method to calculate both their diagonal and off-diagonal correlation functions. The obtained results are used to construct some mixing currents that are nearly non-correlated, from which we extract the masses of the lowest-lying states to be $M_{0^{+-}} = 2.45^{+0.33}_{-0.44}$ GeV and $M_{2^{+-}} = 3.07^{+0.25}_{-0.33}$ GeV. We apply the Fierz rearrangement to transform the diquark-antidiquark currents to be the combinations of meson-meson currents, and the obtained Fierz identities indicate that these two states may be searched for in the $P$-wave $φ(1020) f_0(1710)/φ(1020) f_2^\prime(1525) (\to φK \bar K / φππ)$ channels.
Submitted 18 October, 2023; v1 submitted 15 July, 2023;
originally announced July 2023.
-
Pore-scale statistics of temperature and thermal energy dissipation rate in turbulent porous convection
Authors:
Ao Xu,
Ben-Rui Xu,
Heng-Dong Xi
Abstract:
We report pore-scale statistical properties of temperature and thermal energy dissipation rate in a two-dimensional porous Rayleigh-Bénard (RB) cell. High-resolution direct numerical simulations were carried out for the fixed Rayleigh number ($Ra$) of $10^{9}$ and the Prandtl numbers ($Pr$) of 5.3 and 0.7. We consider sparse porous media where the solid porous matrix is impermeable to both fluid and heat flux. The porosity ($φ$) lies in the range $0.86 \le φ \le 0.98$, the corresponding Darcy number ($Da$) in the range $10^{-4} < Da < 10^{-2}$, and the porous Rayleigh number ($Ra^{*}=Ra\cdot Da$) in the range $10^{5} < Ra^{*} < 10^{7}$. Our results indicate that the plume dynamics in porous RB convection are less coherent when the solid porous matrix is impermeable to heat flux, as compared to the case where it is permeable. The averaged vertical temperature profiles remain almost constant in the bulk, while the mean-square fluctuations of temperature increase with decreasing porosity. Furthermore, the absolute values of skewness and flatness of the temperature are much smaller in the porous RB cell than in the canonical RB cell. We found that intense thermal energy dissipation occurs near the top and bottom walls, as well as in the bulk region of the porous RB cell. In comparison with the canonical RB cell, the small-scale thermal energy dissipation field is more intermittent in the porous cell, although both cells exhibit a non-log-normal distribution of thermal energy dissipation rate. This work highlights the impact of impermeable solid porous matrices on the statistical properties of temperature and thermal energy dissipation rate, and the findings may have practical applications in geophysics, energy and environmental engineering, as well as other fields that involve the transport of heat through porous media.
Submitted 19 September, 2023; v1 submitted 28 June, 2023;
originally announced June 2023.
-
Training Transformers with 4-bit Integers
Authors:
Haocheng Xi,
Changhao Li,
Jianfei Chen,
Jun Zhu
Abstract:
Quantizing the activation, weight, and gradient to 4-bit is promising to accelerate neural network training. However, existing 4-bit training methods require custom numerical formats which are not supported by contemporary hardware. In this work, we propose a training method for transformers with all matrix multiplications implemented with the INT4 arithmetic. Training with an ultra-low INT4 precision is challenging. To achieve this, we carefully analyze the specific structures of activation and gradients in transformers to propose dedicated quantizers for them. For forward propagation, we identify the challenge of outliers and propose a Hadamard quantizer to suppress the outliers. For backpropagation, we leverage the structural sparsity of gradients by proposing bit splitting and leverage score sampling techniques to quantize gradients accurately. Our algorithm achieves competitive accuracy on a wide range of tasks including natural language understanding, machine translation, and image classification. Unlike previous 4-bit training methods, our algorithm can be implemented on the current generation of GPUs. Our prototypical linear operator implementation is up to 2.2 times faster than the FP16 counterparts and speeds up the training by up to 35.1%.
Submitted 22 June, 2023; v1 submitted 20 June, 2023;
originally announced June 2023.
-
Experimental and numerical investigation on folding stable state of bistable deployable composite boom
Authors:
Tian-Wei Liu,
Jiang-Bo Bai,
Hao-Tian Xi,
Nicholas Fantuzzi,
Guang-Yu Bu,
Yan Shi
Abstract:
The bistable deployable composite boom (Bi-DCB) can realize the bistable function by storing and releasing strain energy, which has a good application prospect in the aerospace field. In this paper, the folding stable state of the Bi-DCB was investigated using experimental and numerical approaches. Using the vacuum bag method, six Bi-DCB specimens were prepared. Bistable experiments of Bi-DCB specimens were conducted and linear fitting with Archimedes' helix was performed to determine the folding stable configuration. In addition, two Finite Element Models (FEMs) were established for predicting the folding stable state of the Bi-DCB. Two classical failure criteria were utilized to analyze the stress level of the folding stable state of the Bi-DCB. Numerical results of two FEMs agreed well with experimental results, including the bistable deformation process and the folding stable state.
Submitted 9 May, 2023;
originally announced May 2023.
-
SKA Science Data Challenge 2: analysis and results
Authors:
P. Hartley,
A. Bonaldi,
R. Braun,
J. N. H. S. Aditya,
S. Aicardi,
L. Alegre,
A. Chakraborty,
X. Chen,
S. Choudhuri,
A. O. Clarke,
J. Coles,
J. S. Collinson,
D. Cornu,
L. Darriba,
M. Delli Veneri,
J. Forbrich,
B. Fraga,
A. Galan,
J. Garrido,
F. Gubanov,
H. Håkansson,
M. J. Hardcastle,
C. Heneka,
D. Herranz,
K. M. Hess, et al. (83 additional authors not shown)
Abstract:
The Square Kilometre Array Observatory (SKAO) will explore the radio sky to new depths in order to conduct transformational science. SKAO data products made available to astronomers will be correspondingly large and complex, requiring the application of advanced analysis techniques to extract key science findings. To this end, SKAO is conducting a series of Science Data Challenges, each designed to familiarise the scientific community with SKAO data and to drive the development of new analysis techniques. We present the results from Science Data Challenge 2 (SDC2), which invited participants to find and characterise 233245 neutral hydrogen (HI) sources in a simulated data product representing a 2000 h SKA MID spectral line observation from redshifts 0.25 to 0.5. Through the generous support of eight international supercomputing facilities, participants were able to undertake the Challenge using dedicated computational resources. Alongside the main challenge, 'reproducibility awards' were made in recognition of those pipelines which demonstrated Open Science best practice. The Challenge saw over 100 participants develop a range of new and existing techniques, with results that highlight the strengths of multidisciplinary and collaborative effort. The winning strategy, which combined predictions from two independent machine learning techniques to yield a 20 percent improvement in overall performance, underscores one of the main Challenge outcomes: that of method complementarity. It is likely that the combination of methods in a so-called ensemble approach will be key to exploiting very large astronomical datasets.
Submitted 14 March, 2023;
originally announced March 2023.
-
Long-distance migration with minimal energy consumption in a thermal turbulent environment
Authors:
Ao Xu,
Hua-Lin Wu,
Heng-Dong Xi
Abstract:
We adopt the reinforcement learning algorithm to train the self-propelling agent migrating long-distance in a thermal turbulent environment. We choose the Rayleigh-Bénard turbulent convection cell with an aspect ratio ($Γ$, which is defined as the ratio between cell length and cell height) of 2 as the training environment. Our results showed that, compared to a naive agent that moves straight from the origin to the destination, the smart agent can learn to utilize the carrier flow currents to save propelling energy. We then apply the optimal policy obtained from the $Γ=2$ cell and test the smart agent migrating in convection cells with $Γ$ up to 32. In a larger $Γ$ cell, the dominant flow modes of horizontally stacked rolls are less stable, and the energy contained in higher-order flow modes increases. We found that the optimized policy can be successfully extended to convection cells with a larger $Γ$. In addition, the ratio of propelling energy consumed by the smart agent to that of the naive agent decreases with the increase of $Γ$, indicating more propelling energy can be saved by the smart agent in a larger $Γ$ cell. We also evaluate the optimized policy when the agents are being released from the randomly chosen origin, which aims to test the robustness of the learning framework, and possible solutions to improve the success rate are suggested. This work has implications for long-distance migration problems, such as unmanned aerial vehicles patrolling in a turbulent convective environment, where planning energy-efficient trajectories can be beneficial to increase their endurance.
Submitted 8 February, 2023; v1 submitted 11 January, 2023;
originally announced January 2023.
-
Wall-sheared thermal convection: heat transfer enhancement and turbulence relaminarization
Authors:
Ao Xu,
Ben-Rui Xu,
Heng-Dong Xi
Abstract:
We studied the flow organization and heat transfer properties in two-dimensional and three-dimensional Rayleigh-Bénard cells that are imposed with different types of wall shear. The external wall shear is added with the motivation of manipulating flow mode to control heat transfer efficiency. We imposed three types of wall shear that may facilitate the single-roll, the horizontally stacked double-roll, and the vertically stacked double-roll flow modes, respectively. Direct numerical simulations are performed for fixed Rayleigh number $Ra = 10^{8}$ and fixed Prandtl number $Pr = 5.3$, while the wall-shear Reynolds number ($Re_{w}$) is in the range $60 \le Re_{w} \le 6000$. Generally, we found enhanced heat transfer efficiency and global flow strength with the increase of $Re_{w}$. However, even with the same magnitude of global flow strength, the heat transfer efficiency varies significantly when the cells are under different types of wall shear. An interesting finding is that by increasing the wall-shear strength, the thermal turbulence is relaminarized, and more surprisingly, the heat transfer efficiency in the laminar state is higher than that in the turbulent state. We found that the enhanced heat transfer efficiency at the laminar regime is due to the formation of more stable and stronger convection channels. We propose that the origin of thermal turbulence laminarization is the reduced amount of thermal plumes. Because plumes are mainly responsible for turbulent kinetic energy production, when the detached plumes are swept away by the wall shear, the reduced number of plumes leads to weaker turbulent kinetic energy production. We also quantify the efficiency of facilitating heat transport via external shearing, and find that for larger $Re_{w}$, the enhanced heat transfer efficiency comes at a price of a larger expenditure of mechanical energy.
Submitted 30 March, 2023; v1 submitted 1 January, 2023;
originally announced January 2023.
-
Linear change and minutes variability of solar wind velocity revealed by FAST
Authors:
Li-Jia Liu,
Bo Peng,
Lei Yu,
Bin Liu,
Ji-Guang Lu,
Ye-Zhao Yu,
Hong-Wei Xi,
Ming Xiong,
O. Chang
Abstract:
Observation of Interplanetary Scintillation (IPS) provides an important and effective way to study the solar wind and space weather. A series of IPS observations were conducted by the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The extraordinary sensitivity and the wide frequency coverage make FAST an ideal platform for IPS studies. In this paper we present some first scientific results from FAST observations of IPS with the L-band receiver. Based on the solar wind velocity fitting values of FAST observations on September 26-28, 2020, we found that the velocity decreases linearly with increasing frequency, which has not yet been reported in the literature. We have also detected a variation of solar wind velocity on a timescale of 3-5 minutes, which implies a slow change of the background solar wind, a co-existence of high- and low-speed streams, or a reflection of the quasi-periodic electron-density fluctuations.
Submitted 19 July, 2022;
originally announced July 2022.
-
The FAST Ultra-Deep Survey (FUDS): observational strategy, calibration and data reduction
Authors:
Hongwei Xi,
Bo Peng,
Lister Staveley-Smith,
Bi-Qing For,
Bin Liu
Abstract:
The FAST Ultra-Deep Survey (FUDS) is a blind survey that aims for the direct detection of HI in galaxies at redshifts $z<0.42$. The survey uses the multibeam receiver on the Five Hundred Meter Aperture Spherical Telescope (FAST) to map six regions, each of size 0.72 deg$^2$ at high sensitivity ($\sim 50 μ$Jy) and high frequency resolution (23 kHz). The survey will enable studies of the evolution of galaxies and their HI content with an eventual sample size of $\sim 1000$. We present the science goals, observing strategy, the effects of radio frequency interference (RFI) at the FAST site, our mitigation strategies and the methods for calibration, data reduction and imaging as applied to initial data. The observations and reductions for the first field, FUDS0, are completed, with around 128 HI galaxies detected in a preliminary analysis. Example spectra are given in this paper, including a comparison with data from the overlapping GAL2577 field of Arecibo Ultra-Deep Survey (AUDS).
Submitted 6 April, 2022;
originally announced April 2022.
-
Migration of self-propelling agent in a turbulent environment with minimal energy consumption
Authors:
Ao Xu,
Hua-Lin Wu,
Heng-Dong Xi
Abstract:
We present a numerical study of training a self-propelling agent to migrate in an unsteady flow environment. We control the agent to utilize the background flow structure by adopting the reinforcement learning algorithm to minimize energy consumption. We considered the agent migrating in two types of flows: one is a simple periodic double-gyre flow as a proof-of-concept example, while the other is complex turbulent Rayleigh-Bénard convection as a paradigm for migrating in the convective atmosphere or the ocean. The results show that the smart agent in both flows can learn to migrate from one position to another while utilizing background flow currents as much as possible to minimize the energy consumption, which is evident from comparing the smart agent with a naive agent that moves straight from the origin to the destination. In addition, we found that compared to the double-gyre flow, the flow field in the turbulent Rayleigh-Bénard convection exhibits more substantial fluctuations, and the training agent is more likely to explore different migration strategies; thus, the training process is more difficult to converge. Nevertheless, we can still identify an energy-efficient trajectory that corresponds to the strategy with the highest reward received by the agent. These results have important implications for many migration problems such as unmanned aerial vehicles flying in a turbulent convective environment, where planning energy-efficient trajectories is often involved.
Submitted 15 March, 2022; v1 submitted 24 January, 2022;
originally announced January 2022.
-
Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network
Authors:
Jun Wan,
Hui Xi,
Jie Zhou,
Zhihui Lai,
Witold Pedrycz,
Xu Wang,
Hang Sun
Abstract:
Current fully-supervised facial landmark detection methods have progressed rapidly and achieved remarkable performance. However, they still struggle with faces under large poses and heavy occlusions because of inaccurate facial shape constraints and insufficient labeled training samples. In this paper, we propose a semi-supervised framework, i.e., a Self-Calibrated Pose Attention Network (SCPAN), to achieve more robust and precise facial landmark detection in challenging scenarios. To be specific, a Boundary-Aware Landmark Intensity (BALI) field is proposed to model more effective facial shape constraints by fusing boundary and landmark intensity field information. Moreover, a Self-Calibrated Pose Attention (SCPA) model is designed to provide a self-learned objective function that enforces intermediate supervision without label information by introducing a self-calibrated mechanism and a pose attention mask. We show that by integrating the BALI fields and SCPA model into a novel self-calibrated pose attention network, more facial prior knowledge can be learned and the detection accuracy and robustness of our method for faces with large poses and heavy occlusions are improved. The experimental results obtained for challenging benchmark datasets demonstrate that our approach outperforms state-of-the-art methods in the literature.
Submitted 22 December, 2021;
originally announced December 2021.
-
Rationalizing the influence of tunable energy levels on quantum efficiency to design optimal non-fullerene acceptor-based ternary organic solar cells
Authors:
Safakath Karuthedath,
Sri H. K. Paleti,
Anirudh Sharma,
Hang Yin,
Catherine S. P. De Castro,
Si Chen,
Han Xi,
Nisreen Alshehri,
Nicolas Ramos,
Jafar I. Khan,
Jaime Martin,
Gang Li,
Frédéric Laquai,
Derya Baran,
Julien Gorenflot
Abstract:
Non-fullerene acceptor (NFA)-based ternary bulk heterojunction solar cells (TSC) are the most efficient organic solar cells (OSCs) today due to their broader absorption and quantum efficiencies (QE) often surpassing those of corresponding binary blends. We study how the energetics driving charge transfer at the electron donor:electron acceptor (D/A) interfaces impact the QE in blends of PBDB-T-2F donor with several pairs of lower bandgap NFAs. As in binary blends, the ionization energy offset between donor and acceptor (ΔIE) controls the QE, which is maximized for ΔIE > 0.5 eV. However, ΔIE is not controlled by the individual NFAs' IEs but by their average, weighted by their blending ratio. Using this property, we improved the QE of a PBDB-T-2F:IEICO binary blend that had an insufficient ΔIE for charge generation by adding a deep IE third component: IT-4F. Combining two NFAs makes it possible to optimize the D/A energy alignment and the cells' QE without molecular engineering.
Submitted 12 February, 2023; v1 submitted 12 December, 2021;
originally announced December 2021.
-
Production and transport of vorticity in two-dimensional Rayleigh-Bénard convection cell
Authors:
Ao Xu,
Ben-Rui Xu,
Li-Sheng Jiang,
Heng-Dong Xi
Abstract:
We present a numerical study of vorticity production and transport in two-dimensional Rayleigh-Bénard (RB) convection. Direct numerical simulations are carried out in the Rayleigh number ($Ra$) range $10^{5} \le Ra \le 10^{6}$, at a Prandtl number ($Pr$) of 0.71, and in the aspect ratio ($Γ$) range $0.75 \le Γ \le 6$ for the convection cell. We found that the flow structure and temperature distribution vary greatly with $Γ$ due to the interaction of multiple vortices. Further investigation of the vorticity production and transport reveals that, in RB convection, in addition to the vorticity production due to wall shear stress, buoyancy produces significant vorticity in the bulk region. The produced vorticity is transported via advection and diffusion. An interesting finding is that the main vortices and the corner vortices can be visualized via the contour of buoyancy-produced vorticity. Although a rigorous definition of the vortex is still lacking in the community, our efficient vortex visualization approach in RB convection may shed light on further research toward vortex identification. We also found that the spatial distribution of vorticity flux along the wall is positively correlated with that of the Nusselt number ($Nu$), suggesting the amount of vorticity that enters the flow is directly related to the amount of thermal energy that enters the flow.
Submitted 19 January, 2022; v1 submitted 2 November, 2021;
originally announced November 2021.
-
Quantum State Transfer Between Distant Optomechanical Interfaces via Shortcut to Adiabaticity
Authors:
Hanzhe Xi,
Pei Pei
Abstract:
We propose a protocol to realize fast high-fidelity quantum state transfer between distant optomechanical interfaces connected by a continuum waveguide. The scheme consists of three steps: two accelerating adiabatic processes joined by a population conversion process. In comparison to the traditional adiabatic technique, our method reaches a higher transfer fidelity with a shorter time. Numerical results show that the fidelity of this transfer scheme in the dissipative system mainly depends on the protocol speed and the coupling strength of the waveguide and cavities. Assisted by inverting the pulse sequence, a bidirectional transfer can be implemented, indicating the potential to build a quantum network.
Submitted 31 October, 2021;
originally announced November 2021.
-
Increasing a microscope's effective field of view via overlapped imaging and machine learning
Authors:
Xing Yao,
Vinayak Pathak,
Haoran Xi,
Amey Chaware,
Colin Cooke,
Kanghyun Kim,
Shiqi Xu,
Yuting Li,
Timothy Dunn,
Pavan Chandra Konda,
Kevin C. Zhou,
Roarke Horstmeyer
Abstract:
This work demonstrates a multi-lens microscopic imaging system that overlaps multiple independent fields of view on a single sensor for high-efficiency automated specimen analysis. Automatic detection, classification and counting of various morphological features of interest is now a crucial component of both biomedical research and disease diagnosis. While convolutional neural networks (CNNs) have dramatically improved the accuracy of counting cells and sub-cellular features from acquired digital image data, the overall throughput is still typically hindered by the limited space-bandwidth product (SBP) of conventional microscopes. Here, we show both in simulation and experiment that overlapped imaging and co-designed analysis software can achieve accurate detection of diagnostically-relevant features for several applications, including counting of white blood cells and the malaria parasite, leading to multi-fold increase in detection and processing throughput with minimal reduction in accuracy.
Submitted 10 October, 2021;
originally announced October 2021.
-
Modeling User Empathy Elicited by a Robot Storyteller
Authors:
Leena Mathur,
Micol Spitale,
Hao Xi,
Jieyun Li,
Maja J Matarić
Abstract:
Virtual and robotic agents capable of perceiving human empathy have the potential to participate in engaging and meaningful human-machine interactions that support human well-being. Prior research in computational empathy has focused on designing empathic agents that use verbal and nonverbal behaviors to simulate empathy and attempt to elicit empathic responses from humans. The challenge of developing agents with the ability to automatically perceive elicited empathy in humans remains largely unexplored. Our paper presents the first approach to modeling user empathy elicited during interactions with a robotic agent. We collected a new dataset from the novel interaction context of participants listening to a robot storyteller (46 participants, 6.9 hours of video). After each storytelling interaction, participants answered a questionnaire that assessed their level of elicited empathy during the interaction with the robot. We conducted experiments with 8 classical machine learning models and 2 deep learning models (long short-term memory networks and temporal convolutional networks) to detect empathy by leveraging patterns in participants' visual behaviors while they were listening to the robot storyteller. Our highest-performing approach, based on XGBoost, achieved an accuracy of 69% and AUC of 72% when detecting empathy in videos. We contribute insights regarding modeling approaches and visual features for automated empathy detection. Our research informs and motivates future development of empathy perception models that can be leveraged by virtual and robotic agents during human-machine interactions.
Submitted 29 July, 2021;
originally announced July 2021.
-
Single-leader multi-follower games for the regulation of two-sided Mobility-as-a-Service markets
Authors:
Haoning Xi,
Didier Aussel,
Wei Liu,
S Travis Waller,
David Rey
Abstract:
Mobility-as-a-Service (MaaS) is an emerging business model driven by the concept of "Everything-as-a-Service" and enabled through mobile internet technologies. In the context of economic deregulation, a MaaS system consists of a typical two-sided market, where travelers and transportation service providers (TSPs) are two groups of agents interacting with each other through a MaaS platform. In this study, we propose a modeling and optimization framework for the regulation of two-sided MaaS markets. We consider a name-your-own-price (NYOP)-auction mechanism where travelers submit purchase-bids to accommodate their travel demand via MaaS platform, and TSPs submit sell-bids to supply mobility resources for the MaaS platform in exchange for payments. We cast this problem as a single-leader multi-follower game (SLMFG) where the leader is the MaaS regulator and two groups of follower problems represent the travelers and the TSPs. The MaaS regulator aims to maximize its profits by optimizing operations. In response to the MaaS regulator's decisions, travelers (resp. TSPs) adjust their participation level in the MaaS platform to minimize their travel costs (resp. maximize their profits). We analyze cross-group network effects in the MaaS market, and formulate SLMFGs without and with network effects leading to mixed-integer linear bilevel programming and mixed-integer quadratic bilevel programming problems, respectively. We propose customized branch-and-bound algorithms based on strong duality reformulations to solve these SLMFGs. Extensive numerical experiments conducted on large scale simulation instances generated from realistic mobility data highlight that the performance of the proposed algorithms is significantly superior to a benchmarking approach, and provide meaningful managerial insights for the regulation of two-sided MaaS markets in practice.
Submitted 17 September, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Bitcoin Address Clustering Method Based on Multiple Heuristic Conditions
Authors:
He Xi,
He Ketai,
Lin Shenwen,
Yang Jinglin,
Mao Hongliang
Abstract:
We analyzed the associations between Bitcoin transactions and addresses in order to cluster addresses and find groups of addresses controlled by the same entity. This analysis reveals vulnerabilities in Bitcoin's anonymity mechanism, which law enforcement agencies could use to track and crack down on illegal transactions. However, a single heuristic method with incomplete heuristic conditions makes it difficult to cluster a large number of addresses comprehensively and accurately. Therefore, this paper reviews a variety of heuristics and combines multiple heuristics to cluster addresses, improving the degree of address aggregation and the address recall rate, which lays a foundation for further inference of entity identity.
Submitted 11 May, 2022; v1 submitted 18 April, 2021;
originally announced April 2021.
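The abstract above does not name the heuristics it combines. As an illustration only, the sketch below shows one widely known heuristic, common-input ownership (all input addresses of a transaction are assumed to belong to one entity), implemented with a union-find structure. The transaction format, `UnionFind`, and `cluster_addresses` are hypothetical names chosen for this example and are not taken from the paper.

```python
from collections import defaultdict


class UnionFind:
    """Minimal disjoint-set structure keyed by address string."""

    def __init__(self):
        self.parent = {}

    def find(self, x):
        # Register unseen addresses as their own cluster root.
        self.parent.setdefault(x, x)
        while self.parent[x] != x:
            # Path halving keeps the trees shallow.
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, a, b):
        root_a, root_b = self.find(a), self.find(b)
        if root_a != root_b:
            self.parent[root_b] = root_a


def cluster_addresses(transactions):
    """Group addresses that appear together as inputs of a transaction.

    `transactions` is an assumed format: a list of dicts, each with an
    "inputs" list of address strings.
    """
    uf = UnionFind()
    for tx in transactions:
        inputs = tx["inputs"]
        for addr in inputs:
            uf.find(addr)  # make sure single-input addresses are registered
        for addr in inputs[1:]:
            uf.union(inputs[0], addr)
    groups = defaultdict(set)
    for addr in list(uf.parent):
        groups[uf.find(addr)].add(addr)
    return sorted(sorted(group) for group in groups.values())


if __name__ == "__main__":
    example = [
        {"inputs": ["A", "B"]},
        {"inputs": ["B", "C"]},
        {"inputs": ["D"]},
    ]
    print(cluster_addresses(example))
```

Further heuristics (for example, change-address detection) could be added as extra `union` calls on the same structure, which is one way several conditions might be combined into a single clustering.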
-
Experimental observation of the elastic range scaling in turbulent flow with polymer additives
Authors:
Yi-Bao Zhang,
Eberhard Bodenschatz,
Haitao Xu,
Heng-Dong Xi
Abstract:
A minute amount of long-chain flexible polymer dissolved in a turbulent flow can drastically change flow properties, such as reducing the drag and enhancing mixing. One fundamental riddle is how these polymer additives interact with the eddies of different spatial scales in the turbulent flow and in turn alter the turbulence energy transfer. Here we show how turbulent kinetic energy is transferred through different scales in the presence of the polymer additives. In particular, we observed experimentally the emergence of a new scaling range, referred to as the elastic range, where an increasing amount of energy is transferred by the elasticity of the polymers. In addition, the existence of the elastic range prescribes the scaling of high-order velocity statistics. Our findings have important implications for many turbulent systems, such as turbulence in plasmas or superfluids, where interactions between turbulent eddies and other nonlinear physical mechanisms are often involved.
Submitted 31 January, 2021;
originally announced February 2021.
-
The Arecibo Ultra-Deep Survey
Authors:
Hongwei Xi,
Lister Staveley-Smith,
Bi-Qing For,
Wolfram Freudling,
Martin Zwaan,
Laura Hoppmann,
Fu-Heng Liang,
Bo Peng
Abstract:
The Arecibo Ultra Deep Survey (AUDS) is a blind HI survey aimed at detecting galaxies beyond the local Universe in the 21-cm emission line of neutral hydrogen (HI). The Arecibo $L$-band Feed Array (ALFA) was used to image an area of 1.35~deg$^2$ to a redshift depth of 0.16, using a total on-source integration time of over 700 hours. The long integration time and small observation area make it one of the most sensitive HI surveys, with a noise level of $\sim 75$~$μ$Jy per 21.4~kHz (equivalent to 4.5~km~s$^{-1}$ at redshift $z=0$). We detect 247 galaxies in the survey, more than doubling the number already detected in AUDS60. The mass range of detected galaxies is $\log(M_{\rm HI}~[h_{70}^{-2}{\rm M}_\odot]) = 6.32 - 10.76$. A modified maximum likelihood method is employed to construct an HI mass function (HIMF). The best fitting Schechter parameters are: low-mass slope $α= -1.37 \pm 0.05$, characteristic mass $\log(M^*~[h_{70}^{-2}{\rm M}_\odot]) = 10.15 \pm 0.09$, and density $Φ_* = (2.41 \pm 0.57) \times 10^{-3} h_{70}^3$~Mpc$^{-3}$~dex$^{-1}$. The sample was divided into low and high redshift bins to investigate the evolution of the HIMF. No change in low-mass slope $α$ was measured, but an increased characteristic mass $M^*$ was noted in the higher-redshift sample. Using Sloan Digital Sky Survey (SDSS) data to define relative galaxy number density, the dependence of the HIMF on environment was also investigated in the two AUDS regions. We find no significant variation in $α$ or $M^*$. In the surveyed region, we measured a cosmic HI density $Ω_{\rm HI} = (3.55 \pm 0.30) \times 10^{-4} h_{70}^{-1}$. There appears to be no evolutionary trend in $Ω_{\rm HI}$ above $2σ$ significance between redshifts of 0 and 0.16.
Submitted 17 December, 2020;
originally announced December 2020.
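For readers unfamiliar with the Schechter parameters quoted in the abstract above, the form commonly used for HI mass functions is sketched below. The abstract does not spell out its convention, so the per-dex normalisation shown here is an assumption based on the quoted units of $Φ_*$ (Mpc$^{-3}$ dex$^{-1}$).

```latex
% Schechter form of the HI mass function per unit log10 mass (assumed convention)
\phi(M_{\rm HI}) = \ln(10)\,\Phi_*
  \left(\frac{M_{\rm HI}}{M^*}\right)^{\alpha+1}
  \exp\!\left(-\frac{M_{\rm HI}}{M^*}\right)

% Integrating M_HI * phi over log10 mass gives the HI mass density,
% and dividing by the critical density rho_c gives the cosmic HI density
\rho_{\rm HI} = \Phi_*\, M^*\, \Gamma(\alpha+2),
\qquad
\Omega_{\rm HI} = \frac{\rho_{\rm HI}}{\rho_{\rm c}}
```

Here $α$ is the low-mass slope, $M^*$ the characteristic mass, and $Γ$ the gamma function. The measured $Ω_{\rm HI}$ in the abstract need not coincide exactly with this analytic integral, since a survey may sum over detected galaxies rather than integrate the fit.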
-
Tristable flow states and reversal of the large-scale circulation in two-dimensional circular convection cells
Authors:
Ao Xu,
Xin Chen,
Heng-Dong Xi
Abstract:
We present a numerical study of the flow states and reversals of the large-scale circulation (LSC) in a two-dimensional circular Rayleigh-Bénard cell. Long-time direct numerical simulations are carried out in the Rayleigh number ($Ra$) range $10^{7} \le Ra \le 10^{8}$ and Prandtl number ($Pr$) range $2.0 \le Pr \le 20.0$. We found that a new, long-lived, chaotic flow state exists, in addition to the commonly observed circulation states (the LSC in the clockwise and counterclockwise directions). The circulation states consist of one primary roll in the middle and two secondary rolls near the top and bottom circular walls. The primary roll becomes stronger and larger, while the two secondary rolls diminish, with increasing $Ra$. Our results suggest that the reversal of the LSC is accompanied by the secondary rolls growing, breaking the primary roll and then connecting to form a new primary roll with reversed direction. We mapped out the phase diagram of the existence of the LSC and the reversal in the $Ra$-$Pr$ space, which reveals that the flow is in the circulation states when $Ra$ is large and $Pr$ is small. The reversal of the LSC can only occur in a limited $Pr$ range. The phase diagram can be understood in terms of competition between the thermal and viscous diffusions. We also found that the internal flow states manifested themselves into global properties such as Nusselt and Reynolds numbers.
Submitted 18 January, 2021; v1 submitted 30 September, 2020;
originally announced October 2020.
-
Dynamic Scheduling and Workforce Assignment in Open Source Software Development
Authors:
Hui Xi,
Dong Xu,
Young-Jun Son
Abstract:
A novel modeling framework is proposed for dynamic scheduling of projects and workforce assignment in open source software development (OSSD). The goal is to help project managers in OSSD distribute workforce to multiple projects to achieve high efficiency in software development (e.g. high workforce utilization and short development time) while ensuring the quality of deliverables (e.g. code modularity and software security). The proposed framework consists of two models: 1) a system dynamic model coupled with a meta-heuristic to obtain an optimal schedule of software development projects considering their attributes (e.g. priority, effort, duration) and 2) an agent based model to represent the development community as a social network, where development managers form an optimal team for each project and balance the workload among multiple scheduled projects based on the optimal schedule obtained from the system dynamic model. To illustrate the proposed framework, a software enhancement request process in Kuali foundation is used as a case study. Survey data collected from the Kuali development managers, project managers and actual historical enhancement requests have been used to construct the proposed models. Extensive experiments are conducted to demonstrate the impact of varying parameters on the considered efficiency and quality.
Submitted 19 September, 2020;
originally announced September 2020.
-
Correlation of internal flow structure with heat transfer efficiency in turbulent Rayleigh-Bénard convection
Authors:
Ao Xu,
Xin Chen,
Feng Wang,
Heng-Dong Xi
Abstract:
To understand how internal flow structures manifest themselves in the global heat transfer, we study the correlation between different flow modes and the instantaneous Nusselt number ($Nu$) in a two-dimensional square Rayleigh-Bénard convection cell. High-resolution and long-time direct numerical simulations are carried out for Rayleigh numbers between $10^{7}$ and $10^{9}$ and a Prandtl number of 5.3. The investigated Nusselt numbers include the volume-averaged $Nu_{\text{vol}}$, the wall-averaged $Nu_{\text{wall}}$, the kinetic energy dissipation based $Nu_{\text{kinetic}}$, and the thermal energy dissipation based $Nu_{\text{thermal}}$. The Fourier mode decomposition and proper orthogonal decomposition are adopted to extract the coherent flow structure. Our results show that the single-roll mode, the horizontally stacked double-roll mode, and the quadrupolar flow mode are more efficient for heat transfer on average. In contrast, the vertically stacked double-roll mode is inefficient for heat transfer on average. The volume-averaged $Nu_{\text{vol}}$ and the kinetic energy dissipation based $Nu_{\text{kinetic}}$ reproduce the correlation of internal flow structures with heat transfer efficiency better than the wall-averaged $Nu_{\text{wall}}$ and the thermal energy dissipation based $Nu_{\text{thermal}}$, even though these four Nusselt numbers give consistent time-averaged mean values. The ensemble-averaged time trace of $Nu$ during flow reversal shows that only the volume-averaged $Nu_{\text{vol}}$ can reproduce the overshoot phenomenon that was observed in the previous experimental study. Our results reveal that the proper choice of $Nu$ is critical to obtain a meaningful interpretation.
Submitted 7 October, 2020; v1 submitted 16 September, 2020;
originally announced September 2020.
-
Incentive-compatible mechanisms for online resource allocation in mobility-as-a-service systems
Authors:
Haoning Xi,
Wei Liu,
David Rey,
S. Travis Waller,
Philip Kilby
Abstract:
In the context of `Everything-as-a-Service', the transportation sector has been evolving towards user-centric business models in which customized services and mode-agnostic mobility resources are priced in a unified framework. Yet, in the vast majority of studies on Mobility as a Service (MaaS) systems, mobility resource pricing is based on segmented travel modes, e.g. private vehicle, public transit and shared mobility services. This study attempts to address this research gap by introducing innovative auction-based online MaaS mechanisms where users can bid for any amount of mode-agnostic mobility resources based on their willingness to pay and preferences. We take the perspective of a MaaS regulator which aims to maximize social welfare by allocating mobility resources to users. We propose two mechanisms which allow users to either pay for the immediate use of mobility service (pay-as-you-go), or to subscribe to mobility service packages (pay-as-a-package). We cast the proposed auction-based mechanisms as online resource allocation problems where users compete for MaaS resources and bid for travel time per trip. We propose (integer-) linear programming formulations to accommodate user bids based on available mobility resources in an online optimization approach. We show that the proposed MaaS mechanisms are incentive-compatible, develop customized online algorithms and derive performance bounds based on competitive analysis. Extensive numerical simulations are conducted on large scale instances generated from realistic mobility data, which highlight the benefits of the proposed MaaS mechanisms and the effectiveness of the proposed online optimization approaches.
Submitted 28 March, 2021; v1 submitted 14 September, 2020;
originally announced September 2020.
-
Transport and deposition of dilute microparticles in turbulent thermal convection
Authors:
Ao Xu,
Shi Tao,
Le Shi,
Heng-Dong Xi
Abstract:
We analyze the transport and deposition behavior of dilute microparticles in turbulent Rayleigh-Bénard convection. Two-dimensional direct numerical simulations were carried out for the Rayleigh number ($Ra$) of $10^{8}$ and the Prandtl number ($Pr$) of 0.71 (corresponding to the working fluids of air). The Lagrangian point particle model was used to describe the motion of microparticles in the turbulence. Our results show that the suspended particles are homogeneously distributed in the turbulence for the Stokes number ($St$) less than $10^{-3}$, and they tend to cluster into bands for $10^{-3} \lesssim St \lesssim 10^{-2}$. At even larger $St$, the microparticles will quickly sediment in the convection. We also calculate the mean-square displacement (MSD) of the particle's trajectories. At short time intervals, the MSD exhibits a ballistic regime, and it is isotropic in vertical and lateral directions; at longer time intervals, the MSD reflects a confined motion for the particles, and it is anisotropic in different directions. We further obtained a phase diagram of the particle deposition positions on the wall, and we identified three deposition states depending on the particle's density and diameter. An interesting finding is that the dispersed particles preferred to deposit on the vertical wall where the hot plumes arise, which is verified by tilting the cell and altering the rotation direction of the large-scale circulation.
Submitted 3 August, 2020; v1 submitted 11 July, 2020;
originally announced July 2020.
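The mean-square displacement (MSD) mentioned in the abstract above can be illustrated with a short sketch. This is not the authors' code: it assumes a single particle trajectory sampled at uniform time steps and stored as two lists of positions, and the function name `mean_square_displacement` is hypothetical. The lateral and vertical components are returned separately so that isotropy can be checked.

```python
def mean_square_displacement(xs, ys, lag):
    """Return (MSD_x, MSD_y) of one trajectory at a lag given in samples.

    xs, ys: equally long sequences of lateral and vertical positions,
            sampled at a uniform time step (an assumption of this sketch).
    lag:    positive integer number of samples separating the two positions.
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    n = len(xs) - lag
    if lag <= 0 or n <= 0:
        raise ValueError("lag must be positive and shorter than the trajectory")
    # Average the squared displacement over all start times i.
    msd_x = sum((xs[i + lag] - xs[i]) ** 2 for i in range(n)) / n
    msd_y = sum((ys[i + lag] - ys[i]) ** 2 for i in range(n)) / n
    return msd_x, msd_y


if __name__ == "__main__":
    # Straight-line motion at constant speed is purely ballistic:
    # doubling the lag should quadruple the MSD.
    xs = [0.5 * i for i in range(10)]
    ys = [0.0] * 10
    for lag in (2, 4):
        print(lag, mean_square_displacement(xs, ys, lag))
```

In a ballistic regime the MSD grows as the square of the lag, while confined motion shows up as a plateau at long lags; comparing the two returned components is one simple way to look for the anisotropy described in the abstract.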
-
Statistics of temperature and thermal energy dissipation rate in low-Prandtl number turbulent thermal convection
Authors:
Ao Xu,
Le Shi,
Heng-Dong Xi
Abstract:
We report the statistical properties of temperature and thermal energy dissipation rate in low-Prandtl number turbulent Rayleigh-Bénard convection. High resolution two-dimensional direct numerical simulations were carried out for Rayleigh numbers ($Ra$) in the range $10^{6} \le Ra \le 10^{7}$ and a Prandtl number ($Pr$) of 0.025. Our results show that the global heat transport and momentum scaling in terms of Nusselt number ($Nu$) and Reynolds number ($Re$) are $Nu=0.21Ra^{0.25}$ and $Re=6.11Ra^{0.50}$, respectively, indicating that the scaling exponents are smaller than those for moderate-Prandtl number fluids (such as water or air) in the same convection cell. In the central region of the cell, probability density functions (PDFs) of temperature profiles show a stretched exponential peak and a Gaussian tail; in the sidewall region, PDFs of temperature profiles show a multimodal distribution at relatively lower $Ra$, while they approach the Gaussian profile at relatively higher $Ra$. We split the energy dissipation rate into contributions from bulk and boundary layers and found that the locally averaged thermal energy dissipation rate from the boundary layer region is an order of magnitude larger than that from the bulk region. Even if the much smaller volume occupied by the boundary layer region is considered, the globally averaged thermal energy dissipation rate from the boundary layer region is still larger than that from the bulk region. We further numerically determined the scaling exponents of globally averaged thermal energy dissipation rates as functions of $Ra$ and $Re$.
Submitted 11 December, 2019; v1 submitted 10 November, 2019;
originally announced November 2019.
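As a quick worked example of the scaling laws reported in the abstract above, the sketch below evaluates $Nu=0.21Ra^{0.25}$ and $Re=6.11Ra^{0.50}$ at the two ends of the simulated range. The prefactors and exponents are taken from the abstract; the function names are hypothetical, and the fits should not be assumed to hold outside $10^{6} \le Ra \le 10^{7}$ at $Pr = 0.025$.

```python
def nusselt(ra: float) -> float:
    """Nusselt number from the fit Nu = 0.21 * Ra**0.25 quoted in the abstract."""
    return 0.21 * ra ** 0.25


def reynolds(ra: float) -> float:
    """Reynolds number from the fit Re = 6.11 * Ra**0.50 quoted in the abstract."""
    return 6.11 * ra ** 0.50


if __name__ == "__main__":
    # Evaluate both fits at the ends of the simulated Rayleigh-number range.
    for ra in (1e6, 1e7):
        print(f"Ra = {ra:.0e}: Nu ~ {nusselt(ra):.2f}, Re ~ {reynolds(ra):.0f}")
```

Because both relations are power laws, a tenfold increase in $Ra$ multiplies $Nu$ by $10^{0.25}$ (about 1.78) and $Re$ by $10^{0.5}$ (about 3.16), independent of the prefactors.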
-
Lattice Boltzmann simulations of three-dimensional thermal convective flows at high Rayleigh number
Authors:
Ao Xu,
Le Shi,
Heng-Dong Xi
Abstract:
We present numerical simulations of three-dimensional thermal convective flows in a cubic cell at high Rayleigh number using thermal lattice Boltzmann (LB) method. The thermal LB model is based on double distribution function approach, which consists of a D3Q19 model for the Navier-Stokes equations to simulate fluid flows and a D3Q7 model for the convection-diffusion equation to simulate heat transfer. Relaxation parameters are adjusted to achieve the isotropy of the fourth-order error term in the thermal LB model. Two types of thermal convective flows are considered: one is laminar thermal convection in side-heated convection cell, which is heated from one vertical side and cooled from the other vertical side; while the other is turbulent thermal convection in Rayleigh-Bénard convection cell, which is heated from the bottom and cooled from the top. In side-heated convection cell, steady results of hydrodynamic quantities and Nusselt numbers are presented at Rayleigh numbers of $10^6$ and $10^7$, and Prandtl number of 0.71, where the mesh sizes are up to $257^3$; in Rayleigh-Bénard convection cell, statistical averaged results of Reynolds and Nusselt numbers, as well as kinetic and thermal energy dissipation rates are presented at Rayleigh numbers of $10^6$, $3\times 10^6$, and $10^7$, and Prandtl numbers of 0.7 and 7, where the nodes within thermal boundary layer are around 8. Compared with existing benchmark data obtained by other methods, the present LB model can give consistent results.
Submitted 6 June, 2019; v1 submitted 21 March, 2019;
originally announced March 2019.
-
To Memory Safety through Proofs
Authors:
Hongwei Xi,
Dengping Zhu
Abstract:
We present a type system capable of guaranteeing the memory safety of programs that may involve (sophisticated) pointer manipulation such as pointer arithmetic. With its root in a recently developed framework Applied Type System (ATS), the type system imposes a level of abstraction on program states through a novel notion of recursive stateful views and then relies on a form of linear logic to reason about such stateful views. We consider the design and then the formalization of the type system to constitute the primary contribution of the paper. In addition, we also mention a running implementation of the type system and then give some examples in support of the practicality of programming with recursive stateful views.
Submitted 29 October, 2018;
originally announced October 2018.
-
Implementing Linking in Multiparty Sessions (Extended Abstract)
Authors:
Hanwen Wu,
Hongwei Xi
Abstract:
The fast growth of service-oriented programming (SOP) is evident in this day and age of the Internet, and handling communication is of paramount importance in SOP. Session types are a formalism that is proposed to specify interactions between communicating processes. In essence, a session type system is a kind of type system designed to enforce (through type-checking) that the involved processes communicate according to a chosen protocol specified as a session type. It is well-known that linear logic plays a pivotal role in the study of session types. For instance, various inference rules in linear logic can be interpreted as ways for constructing channels (used by communicating processes to send/receive messages.) A particularly interesting case is the cut-rule in linear logic, which can be interpreted as a way for connecting the ends of two matching channels to form a single new channel. This form of channel construction is often referred to as linking or (bi-directional) forwarding. We have generalized classical linear logic into classical linear multirole logic (LMRL), where the former can be seen as a special case of the latter involving only two roles. In LMRL, there is a cut-rule involving multiple sequents (instead of exactly two), which we call multiparty cut (mp-cut). We have also formulated a novel multiparty session type system directly based on LMRL. When implementing it, we need to find a way of connecting multiple channels that corresponds to mp-cut. In this paper, we describe an implementation of linking for multiparty sessions in the setting of shared memory. We also describe two novel concepts, two-way linking with residual and three-way linking, which can only be formulated in the setting of multiparty sessions. Notably, linking for binary sessions can be thought of as a specially optimized version of what is implemented for multiparty sessions.
Submitted 29 October, 2018;
originally announced October 2018.
-
Multiparty Dependent Session Types (Extended Abstract)
Authors:
Hanwen Wu,
Hongwei Xi
Abstract:
Programs are more distributed and concurrent today than ever before, and structural communications are at the core. Constructing and debugging such programs are hard due to the lack of formal specification/verification of concurrency. This work formalizes the first multiparty dependent session types as an expressive and practical type discipline for enforcing communication protocols. The type system is formulated in the setting of multi-threaded $λ$-calculus with inspirations from multirole logic, a generalization of classical logic we discovered earlier. We prove its soundness by a novel technique called deadlock-freeness reducibility. The soundness of the type system implies communication fidelity and absence of deadlock.
Submitted 31 July, 2018;
originally announced August 2018.
-
Effect of stochastic grain heating on cold dense clouds chemistry
Authors:
Long-Fei Chen,
Qiang Chang,
Hong-Wei Xi
Abstract:
The temperatures of dust grains play important roles in the chemical evolution of molecular clouds. Unlike large grains, the temperature fluctuations of small grains induced by photons may be significant. Therefore, if the grain size distribution is included in astrochemical models, the temperatures of small dust grains may not be assumed to be constant. We simulate a full gas-grain reaction network with a set of dust grain radii using the classical MRN grain size distribution and include the temperature fluctuations of small dust grains. A Monte Carlo method is used to simulate the real-time temperature fluctuations of dust grains, which are caused by external low-energy photons and internal cosmic-ray-induced secondary photons. The increase of dust grain radii as ice mantles accumulate on grain surfaces is also included in our models. We found that surface CO$_2$ abundances in models with grain size distribution and temperature fluctuations are more than one order of magnitude larger than those with a single grain size. Small amounts of terrestrial complex organic molecules (COMs) can also be formed on small grains due to the temperature spikes induced by external low-energy photons. However, cosmic-ray-induced secondary photons overheat small grains so that surface CO sublimes and fewer radicals are formed on grain surfaces; thus the production of surface CO$_2$ and COMs decreases by about one order of magnitude. The overheating of small grains can be offset by grain growth so that the formation of surface CO$_2$ and COMs becomes more efficient.
Submitted 8 June, 2018;
originally announced June 2018.
-
Convergence of eigenvector empirical spectral distribution of sample covariance matrices
Authors:
Haokai Xi,
Fan Yang,
Jun Yin
Abstract:
The eigenvector empirical spectral distribution (VESD) is a useful tool in studying the limiting behavior of eigenvalues and eigenvectors of covariance matrices. In this paper, we study the convergence rate of the VESD of sample covariance matrices to the deformed Marčenko-Pastur (MP) distribution. Consider sample covariance matrices of the form $Σ^{1/2} X X^* Σ^{1/2}$, where $X=(x_{ij})$ is an $M\times N$ random matrix whose entries are independent random variables with mean zero and variance $N^{-1}$, and $Σ$ is a deterministic positive-definite matrix. We prove that the Kolmogorov distance between the expected VESD and the deformed MP distribution is bounded by $N^{-1+ε}$ for any fixed $ε>0$, provided that the entries $\sqrt{N}x_{ij}$ have uniformly bounded 6th moments and $|N/M-1|\ge τ$ for some constant $τ>0$. This result improves the previous one obtained in \cite{XYZ2013}, which gave the convergence rate $O(N^{-1/2})$ assuming $i.i.d.$ $X$ entries, bounded 10th moment, $Σ=I$ and $M<N$. Moreover, we also prove that under the finite $8$th moment assumption, the convergence rate of the VESD is $O(N^{-1/2+ε})$ almost surely for any fixed $ε>0$, which improves the previous bound $N^{-1/4+ε}$ in \cite{XYZ2013}.
Submitted 21 January, 2020; v1 submitted 10 May, 2017;
originally announced May 2017.