Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion

Authors: Xiang Zhang, Kai Wan, Hua Sun, Shiqiang Wang, Mingyue Ji, Giuseppe Caire

Abstract: Secure aggregation is concerned with the task of securely uploading the inputs of multiple users to an aggregation server without letting the server know the inputs beyond their summation. It finds broad applications in distributed machine learning paradigms such as federated learning (FL) where multiple clients, each having access to a proprietary dataset, periodically upload their locally trained models (abstracted as inputs) to a parameter server which then generates an aggregate (e.g., averaged) model that is sent back to the clients as an initializing point for a new round of local training. To enhance the data privacy of the clients, secure aggregation protocols are developed using techniques from cryptography to ensure that the server infers no more information about the users' inputs beyond the desired aggregated input, even if the server can collude with some users. Although laying the ground for understanding the fundamental utility-security trade-off in secure aggregation, the simple star client-server architecture cannot capture more complex network architectures used in practical systems. Motivated by hierarchical federated learning, we investigate the secure aggregation problem in a $3$-layer hierarchical network consisting of clustered users connecting to an aggregation server through an intermediate layer of relays. Besides the conventional server security which requires that the server learns nothing beyond the desired sum of inputs, relay security is also imposed so that the relays infer nothing about the users' inputs and remain oblivious. For such a hierarchical secure aggregation (HSA) problem, we characterize the optimal multifaceted trade-off between communication (in terms of user-to-relay and relay-to-server communication rates) and secret key generation efficiency (in terms of individual key and source key rates).

Submitted 21 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

arXiv:2409.11851 [pdf, other]

Mutual neutralization of C$_{60}^+$ and C$_{60}^-$ ions: Excitation energies and state-selective rate coefficients

Authors: Michael Gatchell, Raka Paul, MingChao Ji, Stefan Rosén, Richard D. Thomas, Henrik Cederquist, Henning T. Schmidt, Åsa Larson, Henning Zettergren

Abstract: Context: Mutual neutralization between cations and anions plays an important role in determining the charge-balance in certain astrophysical environments. However, empirical data for such reactions involving complex molecular species has been lacking due to challenges in performing experimental studies, leaving the astronomical community to rely on decades-old models with large uncertainties for describing these processes in the interstellar medium. Aims: To investigate the mutual neutralization (MN) reaction, C$_{60}^+$ + C$_{60}^-$ $\rightarrow$ C$_{60}^*$ + C$_{60}$, for collisions at interstellar-like conditions. Methods: The mutual neutralization reaction between C$_{60}^+$ and C$_{60}^-$ at collision energies of 100 meV was studied using the Double ElectroStatic Ion Ring ExpEriment, DESIREE, and its merged-beam capabilities. To aid in the interpretation of the experimental results, semi-classical modeling based on the Landau-Zener approach was performed for the studied reaction. Results: We experimentally identify a narrow range of kinetic energies for the neutral reaction products. Modeling was used to calculate the quantum state-selective reaction probabilities, absolute cross sections, and rate coefficients of these MN reactions, using the experimental results as a benchmark. The MN cross sections are compared with model results for electron attachment to C$_{60}$ and electron recombination with C$_{60}^+$. Conclusions: The present results show that it is crucial to take mutual polarization effects, the finite sizes, and the final quantum states of both molecular ions into account for reliable predictions of MN rates expected to strongly influence the charge-balance and chemistry in, e.g., dense molecular clouds.

Submitted 18 September, 2024; originally announced September 2024.

Comments: 9 pages, 4 figures

arXiv:2409.02084 [pdf, other]

GraspSplats: Efficient Manipulation with 3D Feature Splatting

Authors: Mazeyu Ji, Ri-Zhao Qiu, Xueyan Zou, Xiaolong Wang

Abstract: The ability of robots to perform efficient and zero-shot grasping of object parts is crucial for practical applications and is becoming prevalent with recent advances in Vision-Language Models (VLMs). To bridge the 2D-to-3D gap for representations to support such a capability, existing methods rely on neural fields (NeRFs) via differentiable rendering or point-based projection methods. However, we demonstrate that NeRFs are inappropriate for scene changes due to their implicitness and point-based methods are inaccurate for part localization without rendering-based optimization. To amend these issues, we propose GraspSplats. Using depth supervision and a novel reference feature computation method, GraspSplats generates high-quality scene representations in under 60 seconds. We further validate the advantages of Gaussian-based representation by showing that the explicit and optimized geometry in GraspSplats is sufficient to natively support (1) real-time grasp sampling and (2) dynamic and articulated object manipulation with point trackers. With extensive experiments on a Franka robot, we demonstrate that GraspSplats significantly outperforms existing methods under diverse task settings. In particular, GraspSplats outperforms NeRF-based methods like F3RM and LERF-TOGO, and 2D detection methods.

Submitted 3 September, 2024; originally announced September 2024.

Comments: Project webpage: https://graspsplats.github.io/

arXiv:2408.14460 [pdf, other]

Cloud-Based Federation Framework and Prototype for Open, Scalable, and Shared Access to NextG and IoT Testbeds

Authors: Maxwell McManus, Tenzin Rinchen, Annoy Dey, Sumanth Thota, Zhaoxi Zhang, Jiangqi Hu, Xi Wang, Mingyue Ji, Nicholas Mastronarde, Elizabeth Serena Bentley, Michael Medley, Zhangyu Guan

Abstract: In this work, we present a new federation framework for UnionLabs, an innovative cloud-based resource-sharing infrastructure designed for next-generation (NextG) and Internet of Things (IoT) over-the-air (OTA) experiments. The framework aims to reduce the federation complexity for testbed developers by automating tedious backend operations, thereby providing scalable federation and remote access to various wireless testbeds. We first describe the key components of the new federation framework, including the Systems Manager Integration Engine (SMIE), the Automated Script Generator (ASG), and the Database Context Manager (DCM). We then prototype and deploy the new Federation Plane on the Amazon Web Services (AWS) public cloud, demonstrating its effectiveness by federating two wireless testbeds: i) UB NeXT, a 5G-and-beyond (5G+) testbed at the University at Buffalo, and ii) UT IoT, an IoT testbed at the University of Utah. Through this work we aim to initiate a grassroots campaign to democratize access to wireless research testbeds with heterogeneous hardware resources and network environments, and accelerate the establishment of a mature, open experimental ecosystem for the wireless community. The API of the new Federation Plane will be released to the community after internal testing is completed.

Submitted 28 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

arXiv:2408.08484 [pdf, other]

doi 10.1145/3637528.3671704

An Unsupervised Learning Framework Combined with Heuristics for the Maximum Minimal Cut Problem

Authors: Huaiyuan Liu, Xianzhang Liu, Donghua Yang, Hongzhi Wang, Yingchi Long, Mengtong Ji, Dongjing Miao, Zhiyu Liang

Abstract: The Maximum Minimal Cut Problem (MMCP), an NP-hard combinatorial optimization (CO) problem, has not received much attention due to the demanding and challenging bi-connectivity constraint. Moreover, as a CO problem, it is also a daunting task for machine learning, especially without labeled instances. To deal with these problems, this work proposes an unsupervised learning framework combined with heuristics for MMCP that can provide valid and high-quality solutions. As far as we know, this is the first work that explores machine learning and heuristics to solve MMCP. The unsupervised solver is inspired by a relaxation-plus-rounding approach: the relaxed solution is parameterized by graph neural networks, and the cost and penalty of MMCP are explicitly written out, which allows the model to be trained end-to-end. A crucial observation is that each solution corresponds to at least one spanning tree. Based on this finding, a heuristic solver that implements tree transformations by adding vertices is utilized to repair and improve the solution quality of the unsupervised solver. Alternatively, the graph is simplified while guaranteeing solution consistency, which reduces the running time. We conduct extensive experiments to evaluate our framework and give a specific application. The results demonstrate the superiority of our method against two techniques designed.

Submitted 15 August, 2024; originally announced August 2024.

arXiv:2407.15567 [pdf, other]

A New Theoretical Perspective on Data Heterogeneity in Federated Optimization

Authors: Jiayi Wang, Shiqiang Wang, Rong-Rong Chen, Mingyue Ji

Abstract: In federated learning (FL), data heterogeneity is the main reason that existing theoretical analyses are pessimistic about the convergence rate. In particular, for many FL algorithms, the convergence rate grows dramatically when the number of local updates becomes large, especially when the product of the gradient divergence and local Lipschitz constant is large. However, empirical studies can show that more local updates can improve the convergence rate even when these two parameters are large, which is inconsistent with the theoretical findings. This paper aims to bridge this gap between theoretical understanding and practical performance by providing a theoretical analysis from a new perspective on data heterogeneity. In particular, we propose a new and weaker assumption compared to the local Lipschitz gradient assumption, named the heterogeneity-driven pseudo-Lipschitz assumption. We show that this and the gradient divergence assumptions can jointly characterize the effect of data heterogeneity. By deriving a convergence upper bound for FedAvg and its extensions, we show that, compared to the existing works, the local Lipschitz constant is replaced by the much smaller heterogeneity-driven pseudo-Lipschitz constant and the corresponding convergence upper bound can be significantly reduced for the same number of local updates, although its order stays the same. In addition, when the local objective function is quadratic, more insights on the impact of data heterogeneity can be obtained using the heterogeneity-driven pseudo-Lipschitz constant. For example, we can identify a region where FedAvg can outperform mini-batch SGD even when the gradient divergence can be arbitrarily large. Our findings are validated using experiments.

Submitted 22 July, 2024; originally announced July 2024.

Comments: ICML 2024

arXiv:2407.06518 [pdf, other]

Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications

Authors: Maoxin Ji, Qiong Wu, Pingyi Fan, Nan Cheng, Wen Chen, Jiangzhou Wang, Khaled B. Letaief

Abstract: In the rapidly evolving landscape of Internet of Vehicles (IoV) technology, Cellular Vehicle-to-Everything (C-V2X) communication has attracted much attention due to its superior performance in coverage, latency, and throughput. Resource allocation within C-V2X is crucial for ensuring the transmission of safety information and meeting the stringent requirements for ultra-low latency and high reliability in Vehicle-to-Vehicle (V2V) communication. This paper proposes a method that integrates Graph Neural Networks (GNN) with Deep Reinforcement Learning (DRL) to address this challenge. By constructing a dynamic graph with communication links as nodes and employing the Graph Sample and Aggregation (GraphSAGE) model to adapt to changes in graph structure, the model aims to ensure a high success rate for V2V communication while minimizing interference on Vehicle-to-Infrastructure (V2I) links, thereby ensuring the successful transmission of V2V link information and maintaining high transmission rates for V2I links. The proposed method retains the global feature learning capabilities of GNN and supports distributed network deployment, allowing vehicles to extract low-dimensional features that include structural information from the graph network based on local observations and to make independent resource allocation decisions. Simulation results indicate that the introduction of GNN, with a modest increase in computational load, effectively enhances the decision-making quality of agents, demonstrating superiority to other methods. This study not only provides a theoretically efficient resource allocation strategy for V2V and V2I communications but also paves a new technical path for resource management in practical IoV environments.

Submitted 8 July, 2024; originally announced July 2024.

Comments: 14 pages, 11 figures. This paper has been submitted to IEEE Journal. The source code has been released at: https://github.com/qiongwu86/GNN-and-DRL-Based-Resource-Allocation-for-V2X-Communications

arXiv:2406.12837 [pdf, other]

LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging

Authors: Jinuk Kim, Marwa El Halabi, Mingi Ji, Hyun Oh Song

Abstract: Recent works show that reducing the number of layers in a convolutional neural network can enhance efficiency while maintaining the performance of the network. Existing depth compression methods remove redundant non-linear activation functions and merge the consecutive convolution layers into a single layer. However, these methods suffer from a critical drawback: the kernel size of the merged layers becomes larger, significantly undermining the latency reduction gained from reducing the depth of the network. We show that this problem can be addressed by jointly pruning convolution layers and activation functions. To this end, we propose LayerMerge, a novel depth compression method that selects which activation layers and convolution layers to remove, to achieve a desired inference speed-up while minimizing performance loss. Since the corresponding selection problem involves an exponential search space, we formulate a novel surrogate optimization problem and efficiently solve it via dynamic programming. Empirical results demonstrate that our method consistently outperforms existing depth compression and layer pruning methods on various network architectures, both on image classification and generation tasks. We release the code at https://github.com/snu-mllab/LayerMerge.

Submitted 8 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

Comments: ICML 2024

arXiv:2405.01311 [pdf, other]

Imagine the Unseen: Occluded Pedestrian Detection via Adversarial Feature Completion

Authors: Shanshan Zhang, Mingqian Ji, Yang Li, Jian Yang

Abstract: Pedestrian detection has significantly progressed in recent years, thanks to the development of DNNs. However, detection performance in occluded scenes is still far from satisfactory, as occlusion increases the intra-class variance of pedestrians, hindering the model from finding an accurate classification boundary between pedestrians and background clutter. From the perspective of reducing intra-class variance, we propose to complete features for occluded regions so as to align the features of pedestrians across different occlusion patterns. An important premise for feature completion is to locate occluded regions. From our analysis, channel features of different pedestrian proposals only show high correlation values at visible parts and thus feature correlations can be used to model occlusion patterns. In order to narrow down the gap between completed features and real fully visible ones, we propose an adversarial learning method, which completes occluded features with a generator such that they can hardly be distinguished by the discriminator from real fully visible features. We report experimental results on the CityPersons, Caltech and CrowdHuman datasets. On CityPersons, we show significant improvements over five different baseline detectors, especially on the heavy occlusion subset. Furthermore, we show that our proposed method FeatComp++ achieves state-of-the-art results on all the above three datasets without relying on extra cues.

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2404.06848 [pdf, other]

Lifetimes of excited states in P-, As- and Sb-

Authors: J. Karls, M. Björkhage, M. Blom, N. D. Gibson, O. Hemdal Lundgren, M. Ji, M. K. Kristiansson, D. Leimbach, J. E. Navarro Navarrete, P. Reinhed, A. Ringvall-Moberg, S. Rosen, H. T. Schmidt, A. Simonsson, D. Hanstorp

Abstract: Radiative lifetimes of three elements of the nitrogen group have been experimentally investigated at the Double ElectroStatic Ion Ring Experiment (DESIREE) facility at Stockholm University. The experiments were performed through selective laser photodetachment of excited states of P$^-$, As$^-$ and Sb$^-$ ions stored in a cryogenic storage ring. The experimental results were compared with theoretically predicted lifetimes, yielding a mixture of very good agreement in some cases and large discrepancies in others. These results are part of our efforts to map out the lifetimes of all excited states in negative ions. This data can be used to benchmark atomic theories, in particular with respect to the degree of electron correlation that is incorporated in various theoretical models.

Submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.06226 [pdf, other]

Precision measurements on Si-

Authors: J. Karls, H. Cederquist, N. D. Gibson, J. Grumer, M. Ji, I. Kardasch, D. Leimbach, P. Martini, J. E. Navarro Navarrete, R. Poulose, S. Rosen, H. T. Schmidt, A. Simonsson, H. Zettergren, D. Hanstorp

Abstract: High-precision measurements of the electron affinities (EA) of the three stable isotopes of silicon, $^{28}$Si, $^{29}$Si and $^{30}$Si, have been performed at the cryogenic electrostatic ion-beam storage ring DESIREE. The quantum states of the ions were manipulated using laser depletion, and the ions were photodetached by laser photodetachment threshold spectroscopy. These EA values are the first reported for $^{29}$Si$^-$ and $^{30}$Si$^-$ and provide a reduced uncertainty for $^{28}$Si$^-$. The resulting EAs are $EA(^{28}$Si$) = 1.38952201(17)$ eV, $EA(^{29}$Si$) = 1.38952172(12)$ eV and $EA(^{30}$Si$) = 1.38952078(12)$ eV, with the corresponding isotope shifts $IS(^{29-28}$Si$) = 0.29(16)$ micro eV and $IS(^{30-28}$Si$) = 1.23(16)$ micro eV. In addition to these measurements, the resolution and signal-to-background level were sufficient to reveal the hyperfine structure splitting in the $^{29}$Si$^-$ isotope, which we report to be $1.8(4)$ micro eV.

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.06222 [pdf, other]

Lifetimes of excited states in Rh-

Authors: J. Karls, J. Grumer, S. Schiffmann, N. D. Gibson, M. Ji, M. K. Kristiansson, D. Leimbach, J. E. Navarro Navarrete, Y. Pena Rodríguez, R. Ponce, A. Ringvall-Moberg, H. T. Schmidt, S. E. Spielman, C. W. Walter, T. Brage, D. Hanstorp

Abstract: The radiative decay of excited states of the negative ion of rhodium, Rh$^-$, has been investigated experimentally and theoretically. The experiments were conducted at the Double ElectroStatic Ion Ring Experiment (DESIREE) facility at Stockholm University using selective photodetachment from a stored ion beam to monitor the time evolution of the excited state populations. The lifetimes of the Rh$^-$ $^3F_{3}$ and $^3F_{2}$ fine structure levels were measured to be 3.2(6) s and 21(4) s, respectively. An additional, previously unreported, higher-lying bound state of mixed $^1D_2+^3P_2+(4d^95s)^1D_2+^3F_2$ composition was observed and found to have a lifetime of 10.9(8) s. The binding energy of this state was determined to be in the interval $0.1584(2)$ eV $< E_b < 0.2669(2)$ eV, using laser photodetachment threshold (LPT) spectroscopy. An autodetaching state with a lifetime of 480(10) microseconds was also observed. Theoretical calculations of the excited-state compositions, energies, and magnetic-dipole transition lifetimes were performed using the multiconfiguration Dirac-Hartree-Fock and relativistic configuration interaction methods. The calculated lifetimes of the $^3F_{3}$ and $^3F_{2}$ fine structure levels are in excellent agreement with the measured values. The present study should provide valuable insights into electron correlation effects in negative ions and forbidden radiative transitions.

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.01954 [pdf, other]

HyperCLOVA X Technical Report

Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.

Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 44 pages; updated authors list and fixed author names

arXiv:2403.01845 [pdf, other]

NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models

Authors: Mengfei Ji, Yuchun Chang, Baolin Zhang, Zaid Al-Ars

Abstract: As machine learning (ML) algorithms get deployed in an ever-increasing number of applications, these algorithms need to achieve better trade-offs between high accuracy, high throughput and low latency. This paper introduces NASH, a novel approach that applies neural architecture search to machine learning hardware. Using NASH, hardware designs can achieve not only high throughput and low latency but also superior accuracy performance. We present four versions of the NASH strategy in this paper, all of which show higher accuracy than the original models. The strategy can be applied to various convolutional neural networks, selecting specific model operations among many to guide the training process toward higher accuracy. Experimental results show that applying NASH on ResNet18 or ResNet34 achieves a top-1 accuracy increase of up to 3.1% and a top-5 accuracy increase of up to 2.2% compared to the non-NASH version when tested on the ImageNet data set. We also integrated this approach into the FINN hardware model synthesis tool to automate the application of our approach and the generation of the hardware model. Results show that using FINN can achieve a maximum throughput of 324.5 fps. In addition, NASH models can also result in a better trade-off between accuracy and hardware resource utilization. The accuracy-hardware (HW) Pareto curve shows that the models with the four NASH versions represent the best trade-offs, achieving the highest accuracy for a given HW utilization. The code for our implementation is open-source and publicly available on GitHub at https://github.com/MFJI/NASH.

Submitted 10 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

arXiv:2403.00585 [pdf, other]

Decentralized Uncoded Storage Elastic Computing with Heterogeneous Computation Speeds

Authors: Wenbo Huang, Xudong You, Kai Wan, Robert Caiming Qiu, Mingyue Ji

Abstract: Elasticity plays an important role in modern cloud computing systems. Elastic computing allows virtual machines (i.e., computing nodes) to be preempted when high-priority jobs arise, and also allows new virtual machines to participate in the computation. In 2018, Yang et al. introduced Coded Storage Elastic Computing (CSEC) to address the elasticity using coding technology, with lower storage and computation load requirements. However, CSEC is limited to certain types of computations (e.g., linear) due to the coded data storage based on linear coding. Then Centralized Uncoded Storage Elastic Computing (CUSEC) with heterogeneous computation speeds was proposed, which directly copies parts of data into the virtual machines. In all existing works in elastic computing, the storage assignment is centralized, meaning that the number and identity of all virtual machines possibly used in the whole computation process are known during the storage assignment. In this paper, we consider Decentralized Uncoded Storage Elastic Computing (DUSEC) with heterogeneous computation speeds, where any available virtual machine can join the computation, which is not predicted, and thus coordination among different virtual machines' storage assignments is not allowed. Under a decentralized storage assignment originally proposed in coded caching by Maddah-Ali and Niesen, we propose a computing scheme with closed-form optimal computation time. We also run experiments on the MNIST dataset with a Softmax regression model through the Tencent cloud platform, and the experimental results demonstrate that the proposed DUSEC system approaches the state-of-the-art best storage assignment in the CUSEC system in computation time.

Submitted 1 March, 2024; originally announced March 2024.

Comments: 10 pages, 8 figures, submitted to ISIT2024

arXiv:2401.16769 [pdf, other]

doi 10.1088/1367-2630/ad5619

Tracing quantum correlations back to collective interferences

Authors: Ming Ji, Jonte R. Hance, Holger F. Hofmann

Abstract: In this paper, we investigate the possibility of explaining nonclassical correlations between two quantum systems in terms of quantum interferences between collective states of the two systems. We achieve this by mapping the relations between different measurement contexts in the product Hilbert space of a pair of two-level systems onto an analogous sequence of interferences between paths in a single-particle interferometer. The relations between different measurement outcomes are then traced to the distribution of probability currents in the interferometer, where paradoxical relations between the outcomes are identified with currents connecting two states that are orthogonal and should therefore exclude each other. We show that the relation between probability currents and correlations can be represented by continuous conditional (quasi)probability currents through the interferometer, given by weak values; the violation of the noncontextual assumption is expressed by negative conditional currents in some of the paths. Since negative conditional currents correspond to the assignment of negative conditional probabilities to measurement results in different measurement contexts, the necessity of such negative probability currents represents a failure of noncontextual local realism. Our results help to explain the meaning of nonlocal correlations in quantum mechanics, and support Feynman's claim that interference is the origin of all quantum phenomena.

Submitted 3 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: 10 pages, 4 figures

Journal ref: New J. Phys. 26 063021 (2024)

arXiv:2401.13569 [pdf, other]

SPARC-LoRa: A Scalable, Power-efficient, Affordable, Reliable, and Cloud Service-enabled LoRa Networking System for Agriculture Applications

Authors: Xi Wang, Bryan Hatasaka, Zhengyan Liu, Sayali Tope, Mohit Karkhanis, Seungbeom Noh, Farhan Sium, Ravi V. Mural, Hanseup Kim, Carlos Mastrangelo, Ling Zang, James Schnable, Mingyue Ji

Abstract: With the rapid development of cloud and edge computing, Internet of Things (IoT) applications have been deployed in various aspects of human life. In this paper, we design and implement a holistic LoRa-based IoT system with LoRa communication capabilities, named SPARC-LoRa, which consists of field sensor nodes and a gateway connected to the Internet. SPARC-LoRa has the following important features. First, the proposed wireless network of SPARC-LoRa is event-driven and uses off-the-shelf microcontroller and LoRa communication modules with a customized PCB design to integrate all the hardware. This enables SPARC-LoRa to achieve low power consumption, long range communication, and low cost. With a new connection-based upper layer protocol design, the scalability and communication reliability of SPARC-LoRa can be achieved. Second, an open source software including sensor nodes and servers is designed based on Docker container with cloud storage, computing, and LTE functionalities. In order to achieve reliable wireless communication under extreme conditions, a relay module is designed and applied to SPARC-LoRa to forward the data from sensor nodes to the gateway node. The system design and implementation are completely open source and hosted on the DigitalOcean Droplet Cloud. Hence, the proposed system enables further research and applications in both academia and industry. The proposed system has been tested in real fields under different and extreme environmental conditions in Salt Lake City, Utah and the University of Nebraska-Lincoln. The experimental results validate the features of SPARC-LoRa, including low power, reliability, and cloud services.

Submitted 24 January, 2024; originally announced January 2024.

Comments: 6 pages, 8 figures, submitted for publication

arXiv:2401.12151 [pdf, other]

Uncoded Storage Coded Transmission Elastic Computing with Straggler Tolerance in Heterogeneous Systems

Authors: Xi Zhong, Joerg Kliewer, Mingyue Ji

Abstract: In 2018, Yang et al. introduced a novel and effective approach, using maximum distance separable (MDS) codes, to mitigate the impact of elasticity in cloud computing systems. This approach is referred to as coded elastic computing. Some limitations of this approach include that it assumes all virtual machines have the same computing speeds and storage capacities, and it cannot tolerate stragglers for matrix-matrix multiplications. In order to resolve these limitations, in this paper, we introduce a new combinatorial optimization framework, named uncoded storage coded transmission elastic computing (USCTEC), for heterogeneous speeds and storage constraints, aiming to minimize the expected computation time for matrix-matrix multiplications, under the consideration of straggler tolerance. Within this framework, we propose optimal solutions with straggler tolerance under relaxed storage constraints. Moreover, we propose a heuristic algorithm that considers the heterogeneous storage constraints. Our results demonstrate that the proposed algorithm outperforms baseline solutions utilizing cyclic storage placements, in terms of both expected computation time and storage size.

Submitted 22 January, 2024; originally announced January 2024.

Comments: 6 pages, 1 figure, accepted in ICC 2024

arXiv:2401.10063 [pdf, other]

doi 10.3847/1538-4357/ad3930

Stability of C$_{59}$ Knockout Fragments from Femtoseconds to Infinity

Authors: Michael Gatchell, Naemi Florin, Suvasthika Indrajith, José Eduardo Navarro Navarrete, Paul Martini, MingChao Ji, Peter Reinhed, Stefan Rosén, Ansgar Simonsson, Henrik Cederquist, Henning T. Schmidt, Henning Zettergren

Abstract: We have studied the stability of C$_{59}$ anions as a function of time, from their formation on femtosecond timescales to their stabilization on second timescales and beyond, using a combination of theory and experiments. The C$_{59}^-$ fragments were produced in collisions between C$_{60}$ fullerene anions and neutral helium gas at a velocity of 90 km/s (corresponding to a collision energy of 166 eV in the center-of-mass frame). The fragments were then stored in a cryogenic ion-beam storage ring at the DESIREE facility where they were followed for up to one minute. Classical molecular dynamics simulations were used to determine the reaction cross section and the excitation energy distributions of the products formed in these collisions. We found that about 15 percent of the C$_{59}^-$ ions initially stored in the ring are intact after about 100 ms, and that this population then remains intact indefinitely. This means that C$_{60}$ fullerenes exposed to energetic atoms and ions, such as stellar winds and shock waves, will produce stable, highly reactive products, like C$_{59}$, that are fed into interstellar chemical reaction networks.

Submitted 2 April, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

Comments: 11 pages, 8 figures

Journal ref: The Astrophysical Journal, 966:146 (2024)

arXiv:2401.01822 [pdf, other]

HawkRover: An Autonomous mmWave Vehicular Communication Testbed with Multi-sensor Fusion and Deep Learning

Authors: Ethan Zhu, Haijian Sun, Mingyue Ji

Abstract: Connected and automated vehicles (CAVs) have become a transformative technology that can change our daily life. Currently, millimeter-wave (mmWave) bands are identified as the promising CAV connectivity solution. While they can provide high data rates, their realization faces many challenges such as high attenuation during mmWave signal propagation and mobility management. Existing solutions have to initiate pilot signals to measure channel information, then apply signal processing to calculate the best narrow beam towards the receiver end to guarantee sufficient signal power. This process takes significant overhead and time, hence it is not suitable for vehicles. In this study, we propose an autonomous and low-cost testbed to collect extensive co-located mmWave signal and other sensor data such as LiDAR (Light Detection and Ranging), cameras, ultrasonic, etc., traditionally for ``automated'', to facilitate mmWave vehicular communications. Intuitively, these sensors can build a 3D map around the vehicle and the signal propagation path can be estimated, eliminating the iterative process via pilot signals. This multimodal data fusion, together with AI, is expected to bring significant advances in ``connected'' research.

Submitted 4 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: submitted to IEEE conferences for future publications

arXiv:2401.01288 [pdf, other]

Physics-informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning: Fundamentals, Methodologies, and Challenges

Authors: Ethan Zhu, Haijian Sun, Mingyue Ji

Abstract: Channel modeling is fundamental in advancing wireless systems and has thus attracted considerable research focus. Recent trends have seen a growing reliance on data-driven techniques to facilitate the modeling process and yield accurate channel predictions. In this work, we first provide a concise overview of data-driven channel modeling methods, highlighting their limitations. Subsequently, we introduce the concept and advantages of physics-informed neural network (PINN)-based modeling and a summary of recent contributions in this area. Our findings demonstrate that PINN-based approaches in channel modeling exhibit promising attributes such as generalizability, interpretability, and robustness. We offer a comprehensive architecture for PINN methodology, designed to inform and inspire future model development. A case-study of our recent work on precise indoor channel prediction with semantic segmentation and deep learning is presented. The study concludes by addressing the challenges faced and suggesting potential research directions in this field.

Submitted 2 January, 2024; originally announced January 2024.

Comments: Submitted to IEEE Magazine for potential future publications

arXiv:2312.10002 [pdf, ps, other]

On the Injectivity of Euler Integral Transforms with Hyperplanes and Quadric Hypersurfaces

Authors: Mattie Ji

Abstract: The Euler characteristic transform (ECT) is an integral transform used widely in topological data analysis. Previous efforts by Curry et al. and Ghrist et al. have independently shown that the ECT is injective on all compact definable sets. In this work, we first study the injectivity of the ECT on definable sets that are not necessarily compact and prove a complete classification of constructible functions that the Euler characteristic transform is not injective on. We then introduce the quadric Euler characteristic transform (QECT) as a natural generalization of the ECT by detecting definable shapes with quadric hypersurfaces rather than hyperplanes. We also discuss some criteria for the injectivity of QECT.

Submitted 21 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

Comments: 12 pages

arXiv:2312.08405 [pdf, ps, other]

Connectivity keeping spiders in k-connected bipartite graphs

Authors: Meng Ji

Abstract: Luo, Tian and Wu [Discrete Math. 345 (4) (2022) 112788] conjectured that for any tree $T$ with bipartition $(X,Y)$, every $k$-connected bipartite graph $G$ with minimum degree at least $k+w$, where $w=\max\{|X|,|Y|\}$, contains a tree $T'\cong T$ such that $κ(G-V(T'))\geq k$. In the paper, we confirm the conjecture for the spider by a new method, where a spider is a tree with at most one vertex of degree at least three.

Submitted 22 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: 6 pages. arXiv admin note: substantial text overlap with arXiv:2212.04637

arXiv:2311.05357 [pdf, other]

Long radial coherence of electron temperature fluctuations in non-local transport in HL-2A plasmas

Authors: Zhongbing Shi, Kairui Fang, Jingchun Li, Xiaolan Zou, Zhaoyang Lu, Jie Wen, Zhanhui Wang, Xuantong Ding, Wei Chen, Zengchen Yang, Min Jiang, Xiaoquan Ji, Ruihai Tong, Yonggao Li, Peiwang Shi, Wulyv Zhong, Min Xu

Abstract: The dynamics of long-wavelength ($k_θ<1.4 \mathrm{\ cm^{-1}}$), broadband (20-200 kHz) electron temperature fluctuations ($\tilde T_e/T_e$) of plasmas in gas-puff experiments were observed for the first time in the HL-2A tokamak. In a relatively low density ($n_e(0) \simeq 0.91 \sim 1.20 \times10^{19}/m^3$) scenario, after gas-puffing the core temperature increases and the edge temperature drops. On the contrary, temperature fluctuation drops at the core and increases at the edge. Analyses show the non-local emergence is accompanied by a long radial coherent length of turbulent fluctuations. In a higher density ($n_e(0) \simeq 1.83 \sim 2.02 \times10^{19}/m^3$) scenario, by contrast, the phenomena were not observed. Furthermore, compelling evidence indicates that $\textbf{E} \times \textbf{B}$ shear serves as a substantial contributor to this extensive radial interaction. This finding offers a direct explanatory link to the intriguing core-heating phenomenon witnessed within the realm of non-local transport.

Submitted 9 November, 2023; originally announced November 2023.

arXiv:2310.09889 [pdf, other]

The Capacity Region of Information Theoretic Secure Aggregation with Uncoded Groupwise Keys

Authors: Kai Wan, Hua Sun, Mingyue Ji, Tiebin Mi, Giuseppe Caire

Abstract: This paper considers the secure aggregation problem for federated learning under an information theoretic cryptographic formulation, where distributed training nodes (referred to as users) train models based on their own local data and a curious-but-honest server aggregates the trained models without retrieving other information about users' local data. Secure aggregation generally contains two phases, namely key sharing phase and model aggregation phase. Due to the common effect of user dropouts in federated learning, the model aggregation phase should contain two rounds, where in the first round the users transmit masked models and, in the second round, according to the identity of surviving users after the first round, these surviving users transmit some further messages to help the server decrypt the sum of users' trained models. The objective of the considered information theoretic formulation is to characterize the capacity region of the communication rates in the two rounds from the users to the server in the model aggregation phase, assuming that key sharing has already been performed offline beforehand. In this context, Zhao and Sun completely characterized the capacity region under the assumption that the keys can be arbitrary random variables. More recently, an additional constraint, known as "uncoded groupwise keys," has been introduced. This constraint entails the presence of multiple independent keys within the system, with each key being shared by precisely S users. The capacity region for the information-theoretic secure aggregation problem with uncoded groupwise keys was established in our recent work subject to the condition S > K - U, where K is the number of total users and U is the designed minimum number of surviving users. In this paper we fully characterize the capacity region for this problem by proposing a new converse bound and an achievable scheme.

Submitted 12 November, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

Comments: 37 pages, 3 figures

arXiv:2309.14641 [pdf, other]

Adaptive Denoising-Enhanced LiDAR Odometry for Degeneration Resilience in Diverse Terrains

Authors: Mazeyu Ji, Wenbo Shi, Yujie Cui, Chengju Liu, Qijun Chen

Abstract: The flexibility of Simultaneous Localization and Mapping (SLAM) algorithms in various environments has consistently been a significant challenge. To address the issue of LiDAR odometry drift in high-noise settings, integrating clustering methods to filter out unstable features has become an effective module of SLAM frameworks. However, reducing the amount of point cloud data can lead to potential loss of information and possible degeneration. As a result, this research proposes a LiDAR odometry that can dynamically assess the point cloud's reliability. The algorithm aims to improve adaptability in diverse settings by selecting important feature points with sensitivity to the level of environmental degeneration. Firstly, a fast adaptive Euclidean clustering algorithm based on range image is proposed, which, combined with depth clustering, extracts the primary structural points of the environment defined as ambient skeleton points. Then, the environmental degeneration level is computed through the dense normal features of the skeleton points, and the point cloud cleaning is dynamically adjusted accordingly. The algorithm is validated on the KITTI benchmark and real environments, demonstrating higher accuracy and robustness in different environments.

Submitted 6 February, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

arXiv:2309.11722 [pdf, other]

Efficient Core-selecting Incentive Mechanism for Data Sharing in Federated Learning

Authors: Mengda Ji, Genjiu Xu, Jianjun Ge, Mingqiang Li

Abstract: Federated learning is a distributed machine learning system that uses participants' data to train an improved global model. In federated learning, participants cooperatively train a global model, and they will receive the global model and payments. Rational participants try to maximize their individual utility, and they will not input their high-quality data truthfully unless they are provided with satisfactory payments based on their data quality. Furthermore, federated learning benefits from the cooperative contributions of participants. Accordingly, how to establish an incentive mechanism that both incentivizes inputting data truthfully and promotes stable cooperation has become an important issue to consider. In this paper, we introduce a data sharing game model for federated learning and employ game-theoretic approaches to design a core-selecting incentive mechanism by utilizing a popular concept in cooperative games, the core. In federated learning, the core can be empty, resulting in the core-selecting mechanism becoming infeasible. To address this, our core-selecting mechanism employs a relaxation method and simultaneously minimizes the benefits of inputting false data for all participants. However, this mechanism is computationally expensive because it requires aggregating exponential models for all possible coalitions, which is infeasible in federated learning. To address this, we propose an efficient core-selecting mechanism based on sampling approximation that only aggregates models on sampled coalitions to approximate the exact result. Extensive experiments verify that the efficient core-selecting mechanism can incentivize inputting high-quality data and stable cooperation, while it reduces computational overhead compared to the core-selecting mechanism.

Submitted 26 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

arXiv:2309.03142 [pdf, ps, other]

Euler Characteristics and Homotopy Types of Definable Sublevel Sets, with Applications to Topological Data Analysis

Authors: Mattie Ji, Kun Meng, Kexin Ding

Abstract: Given a definable function $f: S \to \mathbb{R}$ on a definable set $S$, we study sublevel sets of the form $S^f_t = \{x \in S: f(x) \leq t\}$ for all $t \in \mathbb{R}$. Using o-minimal structures, we prove that the Euler characteristic of $S^f_t$ is right continuous with respect to $t$. Furthermore, when $S$ is compact, we show that $S^f_{t+δ}$ deformation retracts to $S^f_t$ for all sufficiently small $δ> 0$. Applying these results, we also characterize the connections between the following concepts in topological data analysis: the Euler characteristic transform (ECT), smooth ECT, Euler-Radon transform (ERT), and smooth ERT.

Submitted 4 November, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

Comments: 20 pages

MSC Class: Primary: 03C64; 46M20. Secondary: 55N31

arXiv:2308.14249 [pdf, other]

Statistical Inference on Grayscale Images via the Euler-Radon Transform

Authors: Kun Meng, Mattie Ji, Jinyu Wang, Kexin Ding, Henry Kirveslahti, Ani Eloyan, Lorin Crawford

Abstract: Tools from topological data analysis have been widely used to represent binary images in many scientific applications. Methods that aim to represent grayscale images (i.e., where pixel intensities instead take on continuous values) have been relatively underdeveloped. In this paper, we introduce the Euler-Radon transform, which generalizes the Euler characteristic transform to grayscale images by using o-minimal structures and Euler integration over definable functions. Coupling the Karhunen-Loeve expansion with our proposed topological representation, we offer hypothesis-testing algorithms based on the chi-squared distribution for detecting significant differences between two groups of grayscale images. We illustrate our framework via extensive numerical experiments and simulations.

Submitted 27 August, 2023; originally announced August 2023.

Comments: 85 pages, 9 figures

arXiv:2308.14237 [pdf, ps, other]

Explicit equations of the fake projective plane $(a=7,p=2,\emptyset,D_3 X_7)$

Authors: Lev Borisov, Mattie Ji, Yanxin Li

Abstract: We find explicit equations of the fake projective plane $(a=7,p=2,\emptyset,D_3 X_7)$, which lies in the same class as the fake projective plane $(a=7,p=2,\emptyset,D_3 2_7)$ with $21$ automorphisms whose equations were previously found by Borisov and Keum. The method involves finding a birational model of a common Galois cover of these two surfaces.

Submitted 27 August, 2023; originally announced August 2023.

Comments: 12 pages. The relevant Mathematica, Magma, Macaulay2 codes and equations produced can be found in the ancillary folder and links in the bibliography

MSC Class: 14J29; 14Q10

arXiv:2308.10429 [pdf, ps, other]

On the Geometry of a Fake Projective Plane with $21$ Automorphisms

Authors: Lev Borisov, Mattie Ji, Yanxin Li, Sargam Mondal

Abstract: A fake projective plane is a complex surface with the same Betti numbers as $\mathbb{C} P^2$ but not biholomorphic to it. We study the fake projective plane $\mathbb{P}_{\operatorname{fake}}^2 = (a = 7, p = 2, \emptyset, D_3 2_7)$ in the Cartwright-Steger classification. In this paper, we exploit the large symmetries given by $\operatorname{Aut}(\mathbb{P}_{\operatorname{fake}}^2) = C_7 \rtimes C_3$ to construct an embedding of this surface into $\mathbb{C} P^5$ as a system of $56$ sextics with coefficients in $\mathbb{Q}(\sqrt{-7})$. For each torsion line bundle $T \in \operatorname{Pic}(\mathbb{P}_{\operatorname{fake}}^2)$, we also compute and study the linear systems $|nH + T|$ with small $n$, where $H$ is an ample generator of the Néron-Severi group.

Submitted 16 September, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

Comments: 9 pages. The relevant Mathematica, Magma, Macaulay2 codes and equations produced can be found in the ancillary folder and links in the bibliography. Accepted by Involve, a Journal of Mathematics

MSC Class: 14E25; 14Q10 (Primary) 14C20; 14J29 (Secondary)

arXiv:2308.01241 [pdf, other]

Digital Twin Brain: a simulation and assimilation platform for whole human brain

Authors: Wenlian Lu, Longbin Zeng, Xin Du, Wenyong Zhang, Shitong Xiang, Huarui Wang, Jiexiang Wang, Mingda Ji, Yubo Hou, Minglong Wang, Yuhao Liu, Zhongyu Chen, Qibao Zheng, Ningsheng Xu, Jianfeng Feng

Abstract: In this work, we present a computing platform named digital twin brain (DTB) that can simulate spiking neuronal networks of the whole human brain scale and, more importantly, a personalized biological brain structure. In comparison to most brain simulations with a homogeneous global structure, we highlight that the sparseness, couplingness and heterogeneity in the sMRI, DTI and PET data of the brain have an essential impact on the efficiency of brain simulation, as shown by scaling experiments indicating that the DTB of human brain simulation is a communication-intensive and memory-access-intensive computing system rather than a computation-intensive one. We utilize a number of optimization techniques to balance and integrate the computation loads and communication traffic from the heterogeneous biological structure to the general GPU-based HPC and achieve leading simulation performance for the whole human brain-scaled spiking neuronal networks. On the other hand, the biological structure, equipped with a mesoscopic data assimilation, enables the DTB to investigate brain cognitive function by a reverse-engineering method, which is demonstrated by a digital experiment of visual evaluation on the DTB. Furthermore, we believe that the developing DTB will be a promising and powerful platform for a large number of research directions including brain-inspired intelligence, brain disease medicine and brain-machine interface.

Submitted 2 August, 2023; originally announced August 2023.

Comments: 12 pages, 11 figures

arXiv:2307.06583 [pdf, other]

doi 10.1088/1367-2630/ad0bd4

Contextuality, Coherences, and Quantum Cheshire Cats

Authors: Jonte R. Hance, Ming Ji, Holger F. Hofmann

Abstract: We analyse the quantum Cheshire cat using contextuality theory, to see if this can tell us anything about how best to interpret this paradox. We show that this scenario can be analysed using the relation between three different measurements, which seem to result in a logical contradiction. We discuss how this contextual behaviour links to weak values, and coherences between prohibited states. Rather than showing a property of the particle is disembodied, the quantum Cheshire cat instead demonstrates the effects of these coherences, which are typically found in pre- and postselected systems.

Submitted 10 November, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

Comments: 11 pages, 3 figures. Accepted in New J. Phys., matches accepted version

Journal ref: New J. Phys. 25 113028 (2023)

arXiv:2306.17607 [pdf, ps, other]

Complete bipartite graphs without small rainbow stars

Authors: Weizhen Chen, Meng Ji, Yaping Mao, Meiqin Wei

Abstract: The $k$-edge-colored bipartite Gallai-Ramsey number $\operatorname{bgr}_k(G:H)$ is defined as the minimum integer $n$ such that $n^2\geq k$ and for every $N\geq n$, every edge-coloring (using all $k$ colors) of the complete bipartite graph $K_{N,N}$ contains a rainbow copy of $G$ or a monochromatic copy of $H$. In this paper, we first study the structural theorem on the complete bipartite graph $K_{n,n}$ with no rainbow copy of $K_{1,3}$. Next, we utilize the results to prove the exact values of $\operatorname{bgr}_{k}(P_4: H)$, $\operatorname{bgr}_{k}(P_5: H)$, and $\operatorname{bgr}_{k}(K_{1,3}: H)$, where $H$ is any of various unions of cycles, paths and stars.

Submitted 13 December, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

Comments: 13 pages

arXiv:2306.06831 [pdf, other]

doi 10.1103/PhysRevA.108.062213

Quantum contextuality of complementary photon polarizations explored by adaptive input state control

Authors: Kengo Matsuyama, Ming Ji, Holger F. Hofmann, Masataka Iinuma

Abstract: We experimentally investigate non-local contextual relations between complementary photon polarizations by adapting the entanglement and the local polarizations of a two-photon state to satisfy three deterministic conditions demonstrating both quantum contextuality and non-locality. The key component of this adaptive input state control is the variable degree of entanglement of the photon source. Local polarization rotations can optimize two of the three correlations, and the variation of the entanglement optimizes the third correlation. Our results demonstrate that quantum contextuality is based on a non-trivial trade-off between local complementarity and quantum correlations.

Submitted 11 June, 2023; originally announced June 2023.

Comments: 17 pages, 5 figures

Journal ref: Phys. Rev. A 108, 062213 (2023)

arXiv:2306.03401 [pdf, other]

A Lightweight Method for Tackling Unknown Participation Statistics in Federated Averaging

Authors: Shiqiang Wang, Mingyue Ji

Abstract: In federated learning (FL), clients usually have diverse participation statistics that are unknown a priori, which can significantly harm the performance of FL if not handled properly. Existing works aiming at addressing this problem are usually based on global variance reduction, which requires a substantial amount of additional memory in a multiplicative factor equal to the total number of clients. An important open problem is to find a lightweight method for FL in the presence of clients with unknown participation rates. In this paper, we address this problem by adapting the aggregation weights in federated averaging (FedAvg) based on the participation history of each client. We first show that, with heterogeneous participation statistics, FedAvg with non-optimal aggregation weights can diverge from the optimal solution of the original FL objective, indicating the need of finding optimal aggregation weights. However, it is difficult to compute the optimal weights when the participation statistics are unknown. To address this problem, we present a new algorithm called FedAU, which improves FedAvg by adaptively weighting the client updates based on online estimates of the optimal weights without knowing the statistics of client participation. We provide a theoretical convergence analysis of FedAU using a novel methodology to connect the estimation error and convergence.
Our theoretical results reveal important and interesting insights, while showing that FedAU converges to an optimal solution of the original objective and has desirable properties such as linear speedup. Our experimental results also verify the advantage of FedAU over baseline methods with various participation patterns.

Submitted 15 April, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

Comments: Accepted to ICLR 2024

arXiv:2305.14873 [pdf, other]

doi 10.22331/q-2024-02-14-1255

Quantitative relations between different measurement contexts

Authors: Ming Ji, Holger F. Hofmann

Abstract: In quantum theory, a measurement context is defined by an orthogonal basis in a Hilbert space, where each basis vector represents a specific measurement outcome. The precise quantitative relation between two different measurement contexts can thus be characterized by the inner products of nonorthogonal states in that Hilbert space. Here, we use measurement outcomes that are shared by different contexts to derive specific quantitative relations between the inner products of the Hilbert space vectors that represent the different contexts. It is shown that the probabilities that describe the paradoxes of quantum contextuality can be derived from a very small number of inner products, revealing details of the fundamental relations between measurement contexts that go beyond a basic violation of noncontextual limits. The application of our analysis to a product space of two systems reveals that the nonlocality of quantum entanglement can be traced back to a local inner product representing the relation between measurement contexts in only one system. Our results thus indicate that the essential nonclassical features of quantum mechanics can be traced back to the fundamental difference between quantum superpositions and classical alternatives.

Submitted 8 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: 12 pages, 4 figures, accepted for publication in Quantum

Journal ref: Quantum 8, 1255 (2024)

arXiv:2305.05332 [pdf, ps, other]

Fundamental Limits of Multi-Message Private Computation

Authors: Ali Gholami, Kai Wan, Tayyebeh Jahani-Nezhad, Hua Sun, Mingyue Ji, Giuseppe Caire

Abstract: In a typical formulation of the private information retrieval (PIR) problem, a single user wishes to retrieve one out of $K$ files from $N$ servers without revealing the demanded file index to any server. This paper formulates an extended model of PIR, referred to as multi-message private computation (MM-PC), where instead of retrieving a single file, the user wishes to retrieve $P>1$ linear combinations of files while preserving the privacy of the demand information. The MM-PC problem is a generalization of the private computation (PC) problem (where the user requests one linear combination of the files), and the multi-message private information retrieval (MM-PIR) problem (where the user requests $P>1$ files). A baseline achievable scheme repeats the optimal PC scheme by Sun and Jafar $P$ times, or treats each possible demanded linear combination as an independent file and then uses the near optimal MM-PIR scheme by Banawan and Ulukus. In this paper, we propose a new MM-PC scheme that significantly improves upon the baseline schemes. In doing so, we design the queries inspired by the structure in the cache-aided scalar linear function retrieval scheme by Wan et al., which leverages the dependency between linear functions to reduce the amount of communications. To ensure the decodability of our scheme, we propose a new method to benefit from the existing dependency, referred to as the sign assignment step.
In the end, we use Maximum Distance Separable matrices to code the queries, which allows the reduction of download from the servers, while preserving privacy. By the proposed schemes, we characterize the capacity within a multiplicative factor of $2$.

Submitted 23 August, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

Comments: A version of this paper is submitted to IEEE Transactions on Communications. A short version was accepted and presented at ISIT 2024 in Athens

arXiv:2305.05143 [pdf, other]

Fundamental Limits of Distributed Linearly Separable Computation under Cyclic Assignment

Authors: Wenbo Huang, Kai Wan, Hua Sun, Mingyue Ji, Robert Caiming Qiu, Giuseppe Caire

Abstract: This paper studies the master-worker distributed linearly separable computation problem, where the considered computation task, referred to as linearly separable function, is a typical linear transform model widely used in cooperative distributed gradient coding, real-time rendering, linear transformers, etc. A master asks $\mathsf{N}$ distributed workers to compute a linearly separable function from $\mathsf{K}$ datasets. The computation task on $\mathsf{K}$ datasets can be expressed as $\mathsf{K}_{\rm c}$ linear combinations of $\mathsf{K}$ messages, where each message is the output of an individual function on one dataset. Straggler effect is also considered, such that from the answers of any $\mathsf{N}_{\rm r}$ of the $\mathsf{N}$ distributed workers, the master should accomplish the task. The computation cost is defined as the number of datasets assigned to each worker, while the communication cost is defined as the number of (coded) messages that should be received. The objective is to characterize the optimal tradeoff between the computation and communication costs. The problem has remained so far open, even under the cyclic data assignment. Since in fact various distributed computing schemes were proposed in the literature under the cyclic data assignment, with this paper we close the problem for the cyclic assignment. This paper proposes a new computing scheme with the cyclic assignment based on the concept of interference alignment, by treating each message which cannot be computed by a worker as an interference from this worker.
Under the cyclic assignment, the proposed computing scheme is then proved to be optimal when $\mathsf{N}=\mathsf{K}$ and order optimal within a factor of $2$ otherwise.

Submitted 19 February, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

Comments: A short version has been accepted in ISIT 2023 (9 pages, 2 figures, conference). The journal version has been submitted to TIT (44 pages, 8 figures)

arXiv:2212.08496 [pdf, other]

Federated Learning with Flexible Control

Authors: Shiqiang Wang, Jake Perazzone, Mingyue Ji, Kevin S. Chan

Abstract: Federated learning (FL) enables distributed model training from local data collected by users. In distributed systems with constrained resources and potentially high dynamics, e.g., mobile edge networks, the efficiency of FL is an important problem. Existing works have separately considered different configurations to make FL more efficient, such as infrequent transmission of model updates, client subsampling, and compression of update vectors. However, an important open problem is how to jointly apply and tune these control knobs in a single FL algorithm, to achieve the best performance by allowing a high degree of freedom in control decisions. In this paper, we address this problem and propose FlexFL - an FL algorithm with multiple options that can be adjusted flexibly. Our FlexFL algorithm allows both arbitrary rates of local computation at clients and arbitrary amounts of communication between clients and the server, making both the computation and communication resource consumption adjustable. We prove a convergence upper bound of this algorithm. Based on this result, we further propose a stochastic optimization formulation and algorithm to determine the control decisions that (approximately) minimize the convergence bound, while conforming to constraints related to resource consumption. The advantage of our approach is also verified using experiments.

Submitted 16 December, 2022; originally announced December 2022.

Comments: Accepted to IEEE INFOCOM 2023

arXiv:2212.04637 [pdf, ps, other]

Connectivity keeping spiders in k-connected graphs

Authors: Zhong Huang, Meng Ji

Abstract: Fujita and Kawarabayashi [J. Combin. Theory, Ser. B 98 (2008), 805--811] conjectured that for all positive integers $k$, $m$, there is a (least) non-negative integer $f_{k}(m)$ such that every $k$-connected graph $G$ with $δ(G)\geq \lfloor\frac{3k}{2}\rfloor+ f_{k}(m)-1$ contains a connected subgraph $W$ of order $m$ such that $G-V(W)$ is still $k$-connected. Mader confirmed Fujita-Kawarabayashi's conjecture by proving $f_{k}(m)=m$ and $W$ is a path. In this paper, the authors will confirm Fujita-Kawarabayashi's conjecture again by proving $f_{k}(m)=m$ and $W$ is a spider by a new method, where a spider is a tree with at most one vertex of degree at least three. Meanwhile, this result will verify a conjecture proposed by Mader [J. Graph Theory 65 (2010), 61--69] for the case of the spider.

Submitted 13 December, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

Comments: 7 pages

arXiv:2211.02199 [pdf, other]

doi 10.1103/PhysRevA.107.022208

Characterization of the non-classical relation between measurement outcomes represented by non-orthogonal quantum states

Authors: Ming Ji, Holger F. Hofmann

Abstract: Quantum mechanics describes seemingly paradoxical relations between the outcomes of measurements that cannot be performed jointly. In Hilbert space, the outcomes of such incompatible measurements are represented by non-orthogonal states. In this paper, we investigate how the relation between outcomes represented by non-orthogonal quantum states differs from the relations suggested by a joint assignment of measurement outcomes that do not depend on the actual measurement context. The analysis is based on a well-known scenario where three statements about the impossibilities of certain outcomes would seem to make a specific fourth outcome impossible as well, yet quantum theory allows the observation of that outcome with a non-vanishing probability. We show that the Hilbert space formalism modifies the relation between the four measurement outcomes by defining a lower bound of the fourth probability that increases as the total probability of the first three outcomes drops to zero. Quantum theory thus makes the violation of non-contextual consistency between the measurement outcomes not only possible, but actually requires it as a necessary consequence of the Hilbert space inner products that describe the contextual relation between the outcomes of different measurements.

Submitted 21 December, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

Comments: 10 pages, 1 figure, improved introduction and notation

Journal ref: Phys. Rev. A 107, 022208 (2023)

arXiv:2210.13006 [pdf, other]

doi 10.1103/PhysRevD.108.024030

Testing general relativity with TianQin: the prospect of using the inspiral signals of black hole binaries

Authors: Changfu Shi, Mujie Ji, Jian-dong Zhang, Jianwei Mei

Abstract: In this paper, we carry out a systematic study of the prospect of testing general relativity with the inspiral signals of black hole binaries that could be detected with TianQin. The study is based on the parameterized post-Einsteinian (ppE) waveform, so that many modified gravity theories can be covered simultaneously. We consider black hole binaries with total masses ranging from $10\rm M_\odot\sim10^7 M_\odot$ and ppE corrections at post-Newtonian (PN) orders ranging from $-4$PN to $2$PN. Compared to the current ground-based detectors, TianQin can improve the constraints on the ppE phase parameter $β$ by orders of magnitude. For example, the improvement at the $-4$PN and $2$PN orders can be about $13$ and $3$ orders of magnitude (compared to the results from GW150914), respectively. Compared to future ground-based detectors, such as ET, TianQin is expected to be superior below the $-1$PN order, and for corrections above the $-0.5$PN order, TianQin is still competitive near the large mass end of the low mass range $[10 \rm M_\odot, \,10^3 \rm M_\odot]\,$. Compared to the future space-based detector LISA, TianQin can be competitive in the lower mass end as the PN order is increased. For example, at the $-4$PN order, LISA is always superior for sources more massive than about $30\rm M_\odot\,$, while at the $2$PN order, TianQin becomes competitive for sources less massive than about $10^4\rm M_\odot$.
We also study the scientific potentials of detector networks involving TianQin, LISA and ET, and discuss the constraints on specific theories such as the dynamic Chern-Simons theory and the Einstein-dilaton Gauss-Bonnet theory.

Submitted 24 October, 2022; originally announced October 2022.

arXiv:2210.04024 [pdf, other]

Demand Layering for Real-Time DNN Inference with Minimized Memory Usage

Authors: Mingoo Ji, Saehanseul Yi, Changjin Koo, Sol Ahn, Dongjoo Seo, Nikil Dutt, Jong-Chan Kim

Abstract: When executing a deep neural network (DNN), its model parameters are loaded into GPU memory before execution, incurring a significant GPU memory burden. There are studies that reduce GPU memory usage by exploiting CPU memory as a swap device. However, this approach is not applicable in most embedded systems with integrated GPUs where CPU and GPU share a common memory. In this regard, we present Demand Layering, which employs a fast solid-state drive (SSD) as a co-running partner of a GPU and exploits the layer-by-layer execution of DNNs. In our approach, a DNN is loaded and executed in a layer-by-layer manner, minimizing the memory usage to the order of a single layer. Also, we developed a pipeline architecture that hides most additional delays caused by the interleaved parameter loadings alongside layer executions. Our implementation shows a 96.5% memory reduction with just 14.8% delay overhead on average for representative DNNs. Furthermore, by exploiting the memory-delay tradeoff, near-zero delay overhead (under 1 ms) can be achieved with a slightly increased memory usage (still an 88.4% reduction), showing the great potential of Demand Layering.

Submitted 8 October, 2022; originally announced October 2022.

Comments: 14 pages, 16 figures. Accepted to the 43rd IEEE Real-Time Systems Symposium (RTSS), 2022

arXiv:2209.05229 [pdf, other]

Resilience of small PAHs in interstellar clouds: Efficient stabilization of cyanonaphthalene by fast radiative cooling

Authors: Mark H. Stockett, James N. Bull, Henrik Cederquist, Suvasthika Indrajith, MingChao Ji, José E. Navarro Navarrete, Henning T. Schmidt, Henning Zettergren, Boxing Zhu

Abstract: After decades of speculation and searching, astronomers have recently identified specific Polycyclic Aromatic Hydrocarbons (PAHs) in space. Remarkably, the observed abundance of cyanonaphthalene (CNN, C10H7CN) in the Taurus Molecular Cloud (TMC-1) is six orders of magnitude higher than expected from astrophysical modeling. Here, we report absolute unimolecular dissociation and radiative cooling rate coefficients of the 1-CNN isomer in its cationic form. These results are based on measurements of the time-dependent neutral product emission rate and Kinetic Energy Release distributions produced from an ensemble of internally excited 1-CNN+ studied in an environment similar to that in interstellar clouds. We find that Recurrent Fluorescence - radiative relaxation via thermally populated electronic excited states - efficiently stabilizes 1-CNN+, owing to a large enhancement of the electronic transition probability by vibronic coupling. Our results help explain the anomalous abundance of CNN in TMC-1 and challenge the widely accepted picture of rapid destruction of small PAHs in space.

Submitted 12 September, 2022; originally announced September 2022.

arXiv:2209.01211 [pdf, other]

Cross-Camera Deep Colorization

Authors: Yaping Zhao, Haitian Zheng, Mengqi Ji, Ruqi Huang

Abstract: In this paper, we consider the color-plus-mono dual-camera system and propose an end-to-end convolutional neural network to align and fuse images from it in an efficient and cost-effective way. Our method takes cross-domain and cross-scale images as input, and consequently synthesizes HR colorization results to facilitate the trade-off between spatial-temporal resolution and color depth in the single-camera imaging system. In contrast to the previous colorization methods, ours can adapt to color and monochrome cameras with distinctive spatial-temporal resolutions, rendering the flexibility and robustness in practical applications. The key ingredient of our method is a cross-camera alignment module that generates multi-scale correspondences for cross-domain image alignment. Through extensive experiments on various datasets and multiple settings, we validate the flexibility and effectiveness of our approach. Remarkably, our method consistently achieves substantial improvements, i.e., around 10dB PSNR gain, upon the state-of-the-art methods. Code is at: https://github.com/IndigoPurple/CCDC

Submitted 7 September, 2022; v1 submitted 26 August, 2022; originally announced September 2022.

Comments: 12 pages, 6 figures

arXiv:2208.08916 [pdf, other]

doi 10.1093/imrn/rnad003

Rationality of real conic bundles with quartic discriminant curve

Authors: Lena Ji, Mattie Ji

Abstract: We study real double covers of $\mathbb P^1\times\mathbb P^2$ branched over a $(2,2)$-divisor, which have the structure of a conic bundle threefold with smooth quartic discriminant curve via the second projection. In each isotopy class of smooth plane quartics, we construct examples where the total space of the conic bundle is rational. For five of the six isotopy classes we construct $\mathbb C$-rational examples that have obstructions to rationality over $\mathbb R$, and for the sixth class, we show that the models we consider are all rational. Moreover, for three of the five classes with irrational members, we give characterizations of rationality using the topology of the real locus and the intermediate Jacobian torsor obstruction of Hassett--Tschinkel and Benoist--Wittenberg. The double cover models we consider were introduced and previously studied by S. Frei, S. Sankar, B. Viray, I. Vogt, and the first author.

Submitted 20 March, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

Comments: 23 pages, 4 figures. In IMRN. v2: Minor edits to Section 3.3 and Remark 11 of the published version to reflect that Pic^1_Gamma always has real points

MSC Class: 14E08 (Primary); 14P25; 14M20; 14D06 (Secondary)

arXiv:2206.07551 [pdf, other]

Unknown-Aware Domain Adversarial Learning for Open-Set Domain Adaptation

Authors: JoonHo Jang, Byeonghu Na, DongHyeok Shin, Mingi Ji, Kyungwoo Song, Il-Chul Moon

Abstract: Open-Set Domain Adaptation (OSDA) assumes that a target domain contains unknown classes, which are not discovered in a source domain. Existing domain adversarial learning methods are not suitable for OSDA because distribution matching with $\textit{unknown}$ classes leads to negative transfer. Previous OSDA methods have focused on matching the source and the target distribution by only utilizing $\textit{known}$ classes. However, this $\textit{known}$-only matching may fail to learn the target-$\textit{unknown}$ feature space. Therefore, we propose Unknown-Aware Domain Adversarial Learning (UADAL), which $\textit{aligns}$ the source and the target-$\textit{known}$ distribution while simultaneously $\textit{segregating}$ the target-$\textit{unknown}$ distribution in the feature alignment procedure. We provide theoretical analyses on the optimized state of the proposed $\textit{unknown-aware}$ feature alignment, so we can guarantee both $\textit{alignment}$ and $\textit{segregation}$ theoretically. Empirically, we evaluate UADAL on the benchmark datasets, which shows that UADAL outperforms other methods with better feature alignments by reporting state-of-the-art performances.

Submitted 24 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

Comments: Accepted at NeurIPS 2022

arXiv:2205.13648 [pdf, other]

A Unified Analysis of Federated Learning with Arbitrary Client Participation

Authors: Shiqiang Wang, Mingyue Ji

Abstract: Federated learning (FL) faces challenges of intermittent client availability and computation/communication efficiency. As a result, only a small subset of clients can participate in FL at a given time. It is important to understand how partial client participation affects convergence, but most existing works have either considered idealized participation patterns or obtained results with non-zero optimality error for generic patterns. In this paper, we provide a unified convergence analysis for FL with arbitrary client participation. We first introduce a generalized version of federated averaging (FedAvg) that amplifies parameter updates at an interval of multiple FL rounds. Then, we present a novel analysis that captures the effect of client participation in a single term. By analyzing this term, we obtain convergence upper bounds for a wide range of participation patterns, including both non-stochastic and stochastic cases, which match either the lower bound of stochastic gradient descent (SGD) or the state-of-the-art results in specific settings. We also discuss various insights, recommendations, and experimental results.

Submitted 26 October, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

Comments: Accepted to NeurIPS 2022

arXiv:2205.07541 [pdf]

Ferrimagnetism in stable non-metal covalent organic framework

Authors: Dongge Ma, Yuhang Qian, Mingyang Ji, Jiani Li, Jundan Li, Anan Liu, Yaohui Zhu

Abstract: We synthesized a purely organic, non-metal crystalline covalent organic framework, TAPA-BTD-COF, by a bottom-up Schiff base reaction. This imine-based COF is stable under aerobic conditions at room temperature. We discovered that TAPA-BTD-COF exhibits strong magnetism at 300 K, producing a magnetic hysteresis loop in M-H characterization and a giant $\chi_{mol}$ of up to 0.028. We further conducted zero-field-cooling and field-cooling measurements of the M-T curves. The as-synthesized material showed a large $\chi_{mol}$ of up to 0.028 at 300 K, increasing to 0.037 at 4.0 K, with a 200 Oe measurement field. The $1/\chi_{mol}$ vs. T curve of TAPA-BTD-COF supports its ferrimagnetism, with an intrinsic delta temperature of -33.03 K obtained by extrapolating the curve. From the continuously increasing slope of the $1/\chi_{mol}$ vs. T curve, we consider TAPA-BTD-COF to be a ferrimagnetic rather than an antiferromagnetic material. The large $\chi_{mol}$ values of 0.028 at 300 K and 0.037 at 4.0 K also support this, since common antiferromagnetic materials have $\chi_{mol}$ in the range of $10^{-5}$ to $10^{-3}$ and are weakly magnetic, unlike strongly magnetic materials such as ferrimagnets and ferromagnets. Since this material is a purely non-metal organic polymer, magnetism induced by unpaired electrons of d-block and f-block metals can be excluded. Moreover, since the synthesis of the COF involves no free-radical monomers, free-radical-induced magnetism can also be excluded.
In light of recently reported exotic electronic properties of strongly correlated flat-band systems, this unconventional phenomenon may relate to n-type doping of the flat band located at the CBM, which generates highly localized electrons with infinite effective mass and strong correlation; this would account for the non-trivial, strong, and stable ferrimagnetism at room temperature under aerobic atmospheric conditions.

Submitted 16 May, 2022; originally announced May 2022.

Comments: 24 pages, 7 figures, 1 table and 33 references
