-
Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion
Authors:
Xiang Zhang,
Kai Wan,
Hua Sun,
Shiqiang Wang,
Mingyue Ji,
Giuseppe Caire
Abstract:
Secure aggregation is concerned with the task of securely uploading the inputs of multiple users to an aggregation server without letting the server know the inputs beyond their summation. It finds broad applications in distributed machine learning paradigms such as federated learning (FL) where multiple clients, each having access to a proprietary dataset, periodically upload their locally traine…
▽ More
Secure aggregation is concerned with the task of securely uploading the inputs of multiple users to an aggregation server without letting the server know the inputs beyond their summation. It finds broad applications in distributed machine learning paradigms such as federated learning (FL) where multiple clients, each having access to a proprietary dataset, periodically upload their locally trained models (abstracted as inputs) to a parameter server which then generates an aggregate (e.g., averaged) model that is sent back to the clients as an initializing point for a new round of local training. To enhance the data privacy of the clients, secure aggregation protocols are developed using techniques from cryptography to ensure that the server infers no more information of the users' inputs beyond the desired aggregated input, even if the server can collude with some users. Although laying the ground for understanding the fundamental utility-security trade-off in secure aggregation, the simple star client-server architecture cannot capture more complex network architectures used in practical systems. Motivated by hierarchical federated learning, we investigate the secure aggregation problem in a $3$-layer hierarchical network consisting of clustered users connecting to an aggregation server through an intermediate layer of relays. Besides the conventional server security which requires that the server learns nothing beyond the desired sum of inputs, relay security is also imposed so that the relays infer nothing about the users' inputs and remain oblivious. For such a hierarchical secure aggregation (HSA) problem, we characterize the optimal multifaceted trade-off between communication (in terms of user-to-relay and relay-to-server communication rates) and secret key generation efficiency (in terms of individual key and source key rates).
△ Less
Submitted 21 October, 2024; v1 submitted 17 October, 2024;
originally announced October 2024.
-
Mutual neutralization of C$_{60}^+$ and C$_{60}^-$ ions: Excitation energies and state-selective rate coefficients
Authors:
Michael Gatchell,
Raka Paul,
MingChao Ji,
Stefan Rosén,
Richard D. Thomas,
Henrik Cederquist,
Henning T. Schmidt,
Åsa Larson,
Henning Zettergren
Abstract:
Context: Mutual neutralization between cations and anions play an important role in determining the charge-balance in certain astrophysical environments. However, empirical data for such reactions involving complex molecular species has been lacking due to challenges in performing experimental studies, leaving the astronomical community to rely on decades old models with large uncertainties for de…
▽ More
Context: Mutual neutralization between cations and anions play an important role in determining the charge-balance in certain astrophysical environments. However, empirical data for such reactions involving complex molecular species has been lacking due to challenges in performing experimental studies, leaving the astronomical community to rely on decades old models with large uncertainties for describing these processes in the interstellar medium. Aims: To investigate the mutual neutralization (MN) reaction, C$_{60}^+$ + C$_{60}^-$ $\rightarrow$ C$_{60}^*$ + C$_{60}$, for collisions at interstellar-like conditions. Methods: The mutual neutralization reaction between C$_{60}^+$ and C$_{60}^-$ at collision energies of 100\,meV was studied using the Double ElectroStatic Ion Ring ExpEriment, DESIREE, and its merged-beam capabilities. To aid in the interpretation of the experimental results, semi-classical modeling based on the Landau-Zener approach was performed for the studied reaction. Results: We experimentally identify a narrow range of kinetic energies for the neutral reaction products. Modeling was used to calculate the quantum state-selective reaction probabilities, absolute cross sections, and rate coefficients of these MN reactions, using the experimental results as a benchmark. The MN cross sections are compared with model results for electron attachment to C$_{60}$ and electron recombination with C$_{60}^+$. Conclusions: The present results show that it is crucial to take mutual polarization effects, the finite sizes, and the final quantum states of both molecular ions into account for reliable predictions of MN rates expected to strongly influence the charge-balance and chemistry in, e.g., dense molecular clouds.
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
GraspSplats: Efficient Manipulation with 3D Feature Splatting
Authors:
Mazeyu Ji,
Ri-Zhao Qiu,
Xueyan Zou,
Xiaolong Wang
Abstract:
The ability for robots to perform efficient and zero-shot grasping of object parts is crucial for practical applications and is becoming prevalent with recent advances in Vision-Language Models (VLMs). To bridge the 2D-to-3D gap for representations to support such a capability, existing methods rely on neural fields (NeRFs) via differentiable rendering or point-based projection methods. However, w…
▽ More
The ability for robots to perform efficient and zero-shot grasping of object parts is crucial for practical applications and is becoming prevalent with recent advances in Vision-Language Models (VLMs). To bridge the 2D-to-3D gap for representations to support such a capability, existing methods rely on neural fields (NeRFs) via differentiable rendering or point-based projection methods. However, we demonstrate that NeRFs are inappropriate for scene changes due to their implicitness and point-based methods are inaccurate for part localization without rendering-based optimization. To amend these issues, we propose GraspSplats. Using depth supervision and a novel reference feature computation method, GraspSplats generates high-quality scene representations in under 60 seconds. We further validate the advantages of Gaussian-based representation by showing that the explicit and optimized geometry in GraspSplats is sufficient to natively support (1) real-time grasp sampling and (2) dynamic and articulated object manipulation with point trackers. With extensive experiments on a Franka robot, we demonstrate that GraspSplats significantly outperforms existing methods under diverse task settings. In particular, GraspSplats outperforms NeRF-based methods like F3RM and LERF-TOGO, and 2D detection methods.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
Cloud-Based Federation Framework and Prototype for Open, Scalable, and Shared Access to NextG and IoT Testbeds
Authors:
Maxwell McManus,
Tenzin Rinchen,
Annoy Dey,
Sumanth Thota,
Zhaoxi Zhang,
Jiangqi Hu,
Xi Wang,
Mingyue Ji,
Nicholas Mastronarde,
Elizabeth Serena Bentley,
Michael Medley,
Zhangyu Guan
Abstract:
In this work, we present a new federation framework for UnionLabs, an innovative cloud-based resource-sharing infrastructure designed for next-generation (NextG) and Internet of Things (IoT) over-the-air (OTA) experiments. The framework aims to reduce the federation complexity for testbeds developers by automating tedious backend operations, thereby providing scalable federation and remote access…
▽ More
In this work, we present a new federation framework for UnionLabs, an innovative cloud-based resource-sharing infrastructure designed for next-generation (NextG) and Internet of Things (IoT) over-the-air (OTA) experiments. The framework aims to reduce the federation complexity for testbeds developers by automating tedious backend operations, thereby providing scalable federation and remote access to various wireless testbeds. We first describe the key components of the new federation framework, including the Systems Manager Integration Engine (SMIE), the Automated Script Generator (ASG), and the Database Context Manager (DCM). We then prototype and deploy the new Federation Plane on the Amazon Web Services (AWS) public cloud, demonstrating its effectiveness by federating two wireless testbeds: i) UB NeXT, a 5G-and-beyond (5G+) testbed at the University at Buffalo, and ii) UT IoT, an IoT testbed at the University of Utah. Through this work we aim to initiate a grassroots campaign to democratize access to wireless research testbeds with heterogeneous hardware resources and network environment, and accelerate the establishment of a mature, open experimental ecosystem for the wireless community. The API of the new Federation Plane will be released to the community after internal testing is completed.
△ Less
Submitted 28 August, 2024; v1 submitted 26 August, 2024;
originally announced August 2024.
-
An Unsupervised Learning Framework Combined with Heuristics for the Maximum Minimal Cut Problem
Authors:
Huaiyuan Liu,
Xianzhang Liu,
Donghua Yang,
Hongzhi Wang,
Yingchi Long,
Mengtong Ji,
Dongjing Miao,
Zhiyu Liang
Abstract:
The Maximum Minimal Cut Problem (MMCP), a NP-hard combinatorial optimization (CO) problem, has not received much attention due to the demanding and challenging bi-connectivity constraint. Moreover, as a CO problem, it is also a daunting task for machine learning, especially without labeled instances. To deal with these problems, this work proposes an unsupervised learning framework combined with h…
▽ More
The Maximum Minimal Cut Problem (MMCP), a NP-hard combinatorial optimization (CO) problem, has not received much attention due to the demanding and challenging bi-connectivity constraint. Moreover, as a CO problem, it is also a daunting task for machine learning, especially without labeled instances. To deal with these problems, this work proposes an unsupervised learning framework combined with heuristics for MMCP that can provide valid and high-quality solutions. As far as we know, this is the first work that explores machine learning and heuristics to solve MMCP. The unsupervised solver is inspired by a relaxation-plus-rounding approach, the relaxed solution is parameterized by graph neural networks, and the cost and penalty of MMCP are explicitly written out, which can train the model end-to-end. A crucial observation is that each solution corresponds to at least one spanning tree. Based on this finding, a heuristic solver that implements tree transformations by adding vertices is utilized to repair and improve the solution quality of the unsupervised solver. Alternatively, the graph is simplified while guaranteeing solution consistency, which reduces the running time. We conduct extensive experiments to evaluate our framework and give a specific application. The results demonstrate the superiority of our method against two techniques designed.
△ Less
Submitted 15 August, 2024;
originally announced August 2024.
-
A New Theoretical Perspective on Data Heterogeneity in Federated Optimization
Authors:
Jiayi Wang,
Shiqiang Wang,
Rong-Rong Chen,
Mingyue Ji
Abstract:
In federated learning (FL), data heterogeneity is the main reason that existing theoretical analyses are pessimistic about the convergence rate. In particular, for many FL algorithms, the convergence rate grows dramatically when the number of local updates becomes large, especially when the product of the gradient divergence and local Lipschitz constant is large. However, empirical studies can sho…
▽ More
In federated learning (FL), data heterogeneity is the main reason that existing theoretical analyses are pessimistic about the convergence rate. In particular, for many FL algorithms, the convergence rate grows dramatically when the number of local updates becomes large, especially when the product of the gradient divergence and local Lipschitz constant is large. However, empirical studies can show that more local updates can improve the convergence rate even when these two parameters are large, which is inconsistent with the theoretical findings. This paper aims to bridge this gap between theoretical understanding and practical performance by providing a theoretical analysis from a new perspective on data heterogeneity. In particular, we propose a new and weaker assumption compared to the local Lipschitz gradient assumption, named the heterogeneity-driven pseudo-Lipschitz assumption. We show that this and the gradient divergence assumptions can jointly characterize the effect of data heterogeneity. By deriving a convergence upper bound for FedAvg and its extensions, we show that, compared to the existing works, local Lipschitz constant is replaced by the much smaller heterogeneity-driven pseudo-Lipschitz constant and the corresponding convergence upper bound can be significantly reduced for the same number of local updates, although its order stays the same. In addition, when the local objective function is quadratic, more insights on the impact of data heterogeneity can be obtained using the heterogeneity-driven pseudo-Lipschitz constant. For example, we can identify a region where FedAvg can outperform mini-batch SGD even when the gradient divergence can be arbitrarily large. Our findings are validated using experiments.
△ Less
Submitted 22 July, 2024;
originally announced July 2024.
-
Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications
Authors:
Maoxin Ji,
Qiong Wu,
Pingyi Fan,
Nan Cheng,
Wen Chen,
Jiangzhou Wang,
Khaled B. Letaief
Abstract:
In the rapidly evolving landscape of Internet of Vehicles (IoV) technology, Cellular Vehicle-to-Everything (C-V2X) communication has attracted much attention due to its superior performance in coverage, latency, and throughput. Resource allocation within C-V2X is crucial for ensuring the transmission of safety information and meeting the stringent requirements for ultra-low latency and high reliab…
▽ More
In the rapidly evolving landscape of Internet of Vehicles (IoV) technology, Cellular Vehicle-to-Everything (C-V2X) communication has attracted much attention due to its superior performance in coverage, latency, and throughput. Resource allocation within C-V2X is crucial for ensuring the transmission of safety information and meeting the stringent requirements for ultra-low latency and high reliability in Vehicle-to-Vehicle (V2V) communication. This paper proposes a method that integrates Graph Neural Networks (GNN) with Deep Reinforcement Learning (DRL) to address this challenge. By constructing a dynamic graph with communication links as nodes and employing the Graph Sample and Aggregation (GraphSAGE) model to adapt to changes in graph structure, the model aims to ensure a high success rate for V2V communication while minimizing interference on Vehicle-to-Infrastructure (V2I) links, thereby ensuring the successful transmission of V2V link information and maintaining high transmission rates for V2I links. The proposed method retains the global feature learning capabilities of GNN and supports distributed network deployment, allowing vehicles to extract low-dimensional features that include structural information from the graph network based on local observations and to make independent resource allocation decisions. Simulation results indicate that the introduction of GNN, with a modest increase in computational load, effectively enhances the decision-making quality of agents, demonstrating superiority to other methods. This study not only provides a theoretically efficient resource allocation strategy for V2V and V2I communications but also paves a new technical path for resource management in practical IoV environments.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
Authors:
Jinuk Kim,
Marwa El Halabi,
Mingi Ji,
Hyun Oh Song
Abstract:
Recent works show that reducing the number of layers in a convolutional neural network can enhance efficiency while maintaining the performance of the network. Existing depth compression methods remove redundant non-linear activation functions and merge the consecutive convolution layers into a single layer. However, these methods suffer from a critical drawback; the kernel size of the merged laye…
▽ More
Recent works show that reducing the number of layers in a convolutional neural network can enhance efficiency while maintaining the performance of the network. Existing depth compression methods remove redundant non-linear activation functions and merge the consecutive convolution layers into a single layer. However, these methods suffer from a critical drawback; the kernel size of the merged layers becomes larger, significantly undermining the latency reduction gained from reducing the depth of the network. We show that this problem can be addressed by jointly pruning convolution layers and activation functions. To this end, we propose LayerMerge, a novel depth compression method that selects which activation layers and convolution layers to remove, to achieve a desired inference speed-up while minimizing performance loss. Since the corresponding selection problem involves an exponential search space, we formulate a novel surrogate optimization problem and efficiently solve it via dynamic programming. Empirical results demonstrate that our method consistently outperforms existing depth compression and layer pruning methods on various network architectures, both on image classification and generation tasks. We release the code at https://github.com/snu-mllab/LayerMerge.
△ Less
Submitted 8 July, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
Imagine the Unseen: Occluded Pedestrian Detection via Adversarial Feature Completion
Authors:
Shanshan Zhang,
Mingqian Ji,
Yang Li,
Jian Yang
Abstract:
Pedestrian detection has significantly progressed in recent years, thanks to the development of DNNs. However, detection performance at occluded scenes is still far from satisfactory, as occlusion increases the intra-class variance of pedestrians, hindering the model from finding an accurate classification boundary between pedestrians and background clutters. From the perspective of reducing intra…
▽ More
Pedestrian detection has significantly progressed in recent years, thanks to the development of DNNs. However, detection performance at occluded scenes is still far from satisfactory, as occlusion increases the intra-class variance of pedestrians, hindering the model from finding an accurate classification boundary between pedestrians and background clutters. From the perspective of reducing intra-class variance, we propose to complete features for occluded regions so as to align the features of pedestrians across different occlusion patterns. An important premise for feature completion is to locate occluded regions. From our analysis, channel features of different pedestrian proposals only show high correlation values at visible parts and thus feature correlations can be used to model occlusion patterns. In order to narrow down the gap between completed features and real fully visible ones, we propose an adversarial learning method, which completes occluded features with a generator such that they can hardly be distinguished by the discriminator from real fully visible features. We report experimental results on the CityPersons, Caltech and CrowdHuman datasets. On CityPersons, we show significant improvements over five different baseline detectors, especially on the heavy occlusion subset. Furthermore, we show that our proposed method FeatComp++ achieves state-of-the-art results on all the above three datasets without relying on extra cues.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Lifetimes of excited states in P-, As- and Sb-
Authors:
J. Karls,
M. Björkhage,
M. Blom,
N. D. Gibson,
O. Hemdal Lundgren,
M. Ji,
M. K. Kristiansson,
D. Leimbach,
J. E. Navarro Navarrete,
P. Reinhed,
A. Ringvall-Moberg,
S. Rosen,
H. T. Schmidt,
A. Simonsson,
D. Hanstorp
Abstract:
Radiative lifetimes of three elements of the nitrogen group have been experimentally investigated at the Double ElectroStatic Ion Ring Experiment (DESIREE) facility at Stockholm University. The experiments were performed through selective laser photodetachment of excited states of P$^-$, As$^-$ and Sb$^-$ ions stored in a cryogenic storage ring. The experimental results were compared with theoreti…
▽ More
Radiative lifetimes of three elements of the nitrogen group have been experimentally investigated at the Double ElectroStatic Ion Ring Experiment (DESIREE) facility at Stockholm University. The experiments were performed through selective laser photodetachment of excited states of P$^-$, As$^-$ and Sb$^-$ ions stored in a cryogenic storage ring. The experimental results were compared with theoretically predicted lifetimes, yielding a mixture of very good agreements in some cases and large discrepancies in others. These results are part of our efforts to map out the lifetimes of all excited states in negative ions. This data can be used to benchmark atomic theories, in particularly with respect to the degree of electron correlation that is incorporated in various theoretical models.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Precision measurements on Si-
Authors:
J. Karls,
H. Cederquist,
N. D. Gibson,
J. Grumer,
M. Ji,
I. Kardasch,
D. Leimbach,
P. Martini,
J. E. Navarro Navarrete,
R. Poulose,
S. Rosen,
H. T. Schmidt,
A. Simonsson,
H. Zettergren,
D. Hanstorp
Abstract:
High-precision measurements of the electron affinities (EA) of the three stable isotopes of silicon, $^{28}$Si, $^{29}$Si and $^{30}$Si, have been performed at the cryogenic electrostatic ion-beam storage ring DESIREE. The quantum states of the ions were manipulated using laser depletion, and the ions were photodetached by laser photodetachment threshold spectroscopy. These EA values are the first…
▽ More
High-precision measurements of the electron affinities (EA) of the three stable isotopes of silicon, $^{28}$Si, $^{29}$Si and $^{30}$Si, have been performed at the cryogenic electrostatic ion-beam storage ring DESIREE. The quantum states of the ions were manipulated using laser depletion, and the ions were photodetached by laser photodetachment threshold spectroscopy. These EA values are the first reported for $^{29}$Si$^-$ and $^{30}$Si$^-$ and provide a reduced uncertainty for $^{28}$Si$^-$. The resulting EAs are $EA(^{28}$Si$) = 1.38952201(17)$ eV, $EA(^{29}$Si$) = 1.38952172(12)$ eV and $EA(^{29}$Si$) = 1.38952078(12)$ eV, with the corresponding isotope shifts $IS(^{29-28}$Si$) = 0.29(16)$ micro eV and $IS(^{30-28}$Si$) = 1.23(16) $ micro eV. In addition to these measurements, the resolution and signal-to-background level was sufficient to reveal the hyperfine structure splitting in the $^{29}$Si$^-$ isotope, which we report to be $1.8(4) micro eV.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Lifetimes of excited states in Rh-
Authors:
J. Karls,
J. Grumer,
S. Schiffmann,
N. D. Gibson,
M. Ji,
M. K. Kristiansson,
D. Leimbach,
J. E. Navarro Navarrete,
Y. Pena Rodrıguez,
R. Ponce,
A. Ringvall-Moberg,
H. T. Schmidt,
S. E. Spielman,
C. W. Walter,
T. Brage,
D. Hanstorp
Abstract:
The radiative decay of excited states of the negative ion of rhodium, Rh$^-$, has been investigated experimentally and theoretically. The experiments were conducted at the Double ElectroStatic Ion Ring Experiment (DESIREE) facility at Stockholm University using selective photodetachment from a stored ion beam to monitor the time evolution of the excited state populations. The lifetimes of the Rh…
▽ More
The radiative decay of excited states of the negative ion of rhodium, Rh$^-$, has been investigated experimentally and theoretically. The experiments were conducted at the Double ElectroStatic Ion Ring Experiment (DESIREE) facility at Stockholm University using selective photodetachment from a stored ion beam to monitor the time evolution of the excited state populations. The lifetimes of the Rh$^-$ $^3F_{3}$ and $^3F_{2}$ fine structure levels were measured to be 3.2(6)~s and 21(4)~s, respectively. An additional, previously unreported, higher-lying bound state of mixed $^1D_2+^3P_2+(4d^95s)^1D_2+^3F_2$ composition was observed and found to have a lifetime of 10.9(8)s. The binding energy of this state was determined to be in the interval $0.1584(2) $ eV $ < E_b < 0.2669(2)$ eV, using laser photodetachment threshold (LPT) spectroscopy. An autodetaching state with a lifetime of 480(10) microseconds was also observed. Theoretical calculations of the excited-state compositions, energies, and magnetic-dipole transition lifetimes were performed using the multiconfiguration Dirac-Hartree-Fock and relativistic configuration interaction methods. The calculated lifetimes of the $^3F_{3}$ and $^3F_{2}$ fine structure levels are in excellent agreement with the measured values. The present study should provide valuable insights into electron correlation effects in negative ions and forbidden radiative transitions.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models
Authors:
Mengfei Ji,
Yuchun Chang,
Baolin Zhang,
Zaid Al-Ars
Abstract:
As machine learning (ML) algorithms get deployed in an ever-increasing number of applications, these algorithms need to achieve better trade-offs between high accuracy, high throughput and low latency. This paper introduces NASH, a novel approach that applies neural architecture search to machine learning hardware. Using NASH, hardware designs can achieve not only high throughput and low latency b…
▽ More
As machine learning (ML) algorithms get deployed in an ever-increasing number of applications, these algorithms need to achieve better trade-offs between high accuracy, high throughput and low latency. This paper introduces NASH, a novel approach that applies neural architecture search to machine learning hardware. Using NASH, hardware designs can achieve not only high throughput and low latency but also superior accuracy performance. We present four versions of the NASH strategy in this paper, all of which show higher accuracy than the original models. The strategy can be applied to various convolutional neural networks, selecting specific model operations among many to guide the training process toward higher accuracy. Experimental results show that applying NASH on ResNet18 or ResNet34 achieves a top 1 accuracy increase of up to 3.1% and a top 5 accuracy increase of up to 2.2% compared to the non-NASH version when tested on the ImageNet data set. We also integrated this approach into the FINN hardware model synthesis tool to automate the application of our approach and the generation of the hardware model. Results show that using FINN can achieve a maximum throughput of 324.5 fps. In addition, NASH models can also result in a better trade-off between accuracy and hardware resource utilization. The accuracy-hardware (HW) Pareto curve shows that the models with the four NASH versions represent the best trade-offs achieving the highest accuracy for a given HW utilization. The code for our implementation is open-source and publicly available on GitHub at https://github.com/MFJI/NASH.
△ Less
Submitted 10 March, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Decentralized Uncoded Storage Elastic Computing with Heterogeneous Computation Speeds
Authors:
Wenbo Huang,
Xudong You,
Kai Wan,
Robert Caiming Qiu,
Mingyue Ji
Abstract:
Elasticity plays an important role in modern cloud computing systems. Elastic computing allows virtual machines (i.e., computing nodes) to be preempted when high-priority jobs arise, and also allows new virtual machines to participate in the computation. In 2018, Yang et al. introduced Coded Storage Elastic Computing (CSEC) to address the elasticity using coding technology, with lower storage and…
▽ More
Elasticity plays an important role in modern cloud computing systems. Elastic computing allows virtual machines (i.e., computing nodes) to be preempted when high-priority jobs arise, and also allows new virtual machines to participate in the computation. In 2018, Yang et al. introduced Coded Storage Elastic Computing (CSEC) to address the elasticity using coding technology, with lower storage and computation load requirements. However, CSEC is limited to certain types of computations (e.g., linear) due to the coded data storage based on linear coding. Then Centralized Uncoded Storage Elastic Computing (CUSEC) with heterogeneous computation speeds was proposed, which directly copies parts of data into the virtual machines. In all existing works in elastic computing, the storage assignment is centralized, meaning that the number and identity of all virtual machines possible used in the whole computation process are known during the storage assignment. In this paper, we consider Decentralized Uncoded Storage Elastic Computing (DUSEC) with heterogeneous computation speeds, where any available virtual machine can join the computation which is not predicted and thus coordination among different virtual machines' storage assignments is not allowed. Under a decentralized storage assignment originally proposed in coded caching by Maddah-Ali and Niesen, we propose a computing scheme with closed-form optimal computation time. We also run experiments over MNIST dataset with Softmax regression model through the Tencent cloud platform, and the experiment results demonstrate that the proposed DUSEC system approaches the state-of-art best storage assignment in the CUSEC system in computation time.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Tracing quantum correlations back to collective interferences
Authors:
Ming Ji,
Jonte R. Hance,
Holger F. Hofmann
Abstract:
In this paper, we investigate the possibility of explaining nonclassical correlations between two quantum systems in terms of quantum interferences between collective states of the two systems. We achieve this by mapping the relations between different measurement contexts in the product Hilbert space of a pair of two-level systems onto an analogous sequence of interferences between paths in a sin…
▽ More
In this paper, we investigate the possibility of explaining nonclassical correlations between two quantum systems in terms of quantum interferences between collective states of the two systems. We achieve this by mapping the relations between different measurement contexts in the product Hilbert space of a pair of two-level systems onto an analogous sequence of interferences between paths in a single-particle interferometer. The relations between different measurement outcomes are then traced to the distribution of probability currents in the interferometer, where paradoxical relations between the outcomes are identified with currents connecting two states that are orthogonal and should therefore exclude each other. We show that the relation between probability currents and correlations can be represented by continuous conditional (quasi)probability currents through the interferometer, given by weak values; the violation of the noncontextual assumption is expressed by negative conditional currents in some of the paths. Since negative conditional currents correspond to the assignment of negative conditional probabilities to measurements results in different measurement contexts, the necessity of such negative probability currents represents a failure of noncontextual local realism. Our results help to explain the meaning of nonlocal correlations in quantum mechanics, and support Feynman's claim that interference is the origin of all quantum phenomena.
△ Less
Submitted 3 June, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
SPARC-LoRa: A Scalable, Power-efficient, Affordable, Reliable, and Cloud Service-enabled LoRa Networking System for Agriculture Applications
Authors:
Xi Wang,
Bryan Hatasaka,
Zhengyan Liu,
Sayali Tope,
Mohit Karkhanis,
Seungbeom Noh,
Farhan Sium,
Ravi V. Mural,
Hanseup Kim,
Carlos Mastrangelo,
Ling Zang,
James Schnable,
Mingyue Ji
Abstract:
With the rapid development of cloud and edge computing, Internet of Things (IoT) applications have been deployed in various aspects of human life. In this paper, we design and implement a holistic LoRa-based IoT system with LoRa communication capabilities, named SPARC-LoRa, which consists of field sensor nodes and a gateway connected to the Internet. SPARC-LoRa has the following important features…
▽ More
With the rapid development of cloud and edge computing, Internet of Things (IoT) applications have been deployed in various aspects of human life. In this paper, we design and implement a holistic LoRa-based IoT system with LoRa communication capabilities, named SPARC-LoRa, which consists of field sensor nodes and a gateway connected to the Internet. SPARC-LoRa has the following important features. First, the proposed wireless network of SPARC-LoRa is even-driven and using off-the-shelf microcontroller and LoRa communication modules with a customized PCB design to integrate all the hardware. This enables SPARC-LoRa to achieve low power consumption, long range communication, and low cost. With a new connection-based upper layer protocol design, the scalability and communication reliability of SPARC-loRa can be achieved. Second, an open source software including sensor nodes and servers is designed based on Docker container with cloud storage, computing, and LTE functionalities. In order to achieve reliable wireless communication under extreme conditions, a relay module is designed and applied to SPARC-LoRa to forward the data from sensor nodes to the gateway node. The system design and implementation is completely open source and hosted on the DigitalOcean Droplet Cloud. Hence, the proposed system enables further research and applications in both academia and industry. The proposed system has been tested in real fields under different and extreme environmental conditions in Salt Lake City, Utah and the University of Nebraska-Lincoln. The experimental results validate the features of SPARC-LoRa including low power, reliability, and cloud services provided by SPARC-LoRa.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Uncoded Storage Coded Transmission Elastic Computing with Straggler Tolerance in Heterogeneous Systems
Authors:
Xi Zhong,
Joerg Kliewer,
Mingyue Ji
Abstract:
In 2018, Yang et al. introduced a novel and effective approach, using maximum distance separable (MDS) codes, to mitigate the impact of elasticity in cloud computing systems. This approach is referred to as coded elastic computing. Some limitations of this approach include that it assumes all virtual machines have the same computing speeds and storage capacities, and it cannot tolerate stragglers…
▽ More
In 2018, Yang et al. introduced a novel and effective approach, using maximum distance separable (MDS) codes, to mitigate the impact of elasticity in cloud computing systems. This approach is referred to as coded elastic computing. Some limitations of this approach include that it assumes all virtual machines have the same computing speeds and storage capacities, and it cannot tolerate stragglers for matrix-matrix multiplications. In order to resolve these limitations, in this paper, we introduce a new combinatorial optimization framework, named uncoded storage coded transmission elastic computing (USCTEC), for heterogeneous speeds and storage constraints, aiming to minimize the expected computation time for matrix-matrix multiplications, under the consideration of straggler tolerance. Within this framework, we propose optimal solutions with straggler tolerance under relaxed storage constraints. Moreover, we propose a heuristic algorithm that considers the heterogeneous storage constraints. Our results demonstrate that the proposed algorithm outperforms baseline solutions utilizing cyclic storage placements, in terms of both expected computation time and storage size.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Stability of C$_{59}$ Knockout Fragments from Femtoseconds to Infinity
Authors:
Michael Gatchell,
Naemi Florin,
Suvasthika Indrajith,
José Eduardo Navarro Navarrete,
Paul Martini,
MingChao Ji,
Peter Reinhed,
Stefan Rosén,
Ansgar Simonsson,
Henrik Cederquist,
Henning T. Schmidt,
Henning Zettergren
Abstract:
We have studied the stability of C$_{59}$ anions as a function of time, from their formation on femtosecond timescales to their stabilization on second timescales and beyond, using a combination of theory and experiments. The C$_{59}^-$ fragments were produced in collisions between C$_{60}$ fullerene anions and neutral helium gas at a velocity of 90 km/s (corresponding to a collision energy of 166…
▽ More
We have studied the stability of C$_{59}$ anions as a function of time, from their formation on femtosecond timescales to their stabilization on second timescales and beyond, using a combination of theory and experiments. The C$_{59}^-$ fragments were produced in collisions between C$_{60}$ fullerene anions and neutral helium gas at a velocity of 90 km/s (corresponding to a collision energy of 166 eV in the center-of-mass frame). The fragments were then stored in a cryogenic ion-beam storage ring at the DESIREE facility where they were followed for up to one minute. Classical molecular dynamics simulations were used to determine the reaction cross section and the excitation energy distributions of the products formed in these collisions. We found that about 15 percent of the C$_{59}^-$ ions initially stored in the ring are intact after about 100 ms, and that this population then remains intact indefinitely. This means that C$_{60}$ fullerenes exposed to energetic atoms and ions, such as stellar winds and shock waves, will produce stable, highly reactive products, like C$_{59}$, that are fed into interstellar chemical reaction networks.
△ Less
Submitted 2 April, 2024; v1 submitted 18 January, 2024;
originally announced January 2024.
-
HawkRover: An Autonomous mmWave Vehicular Communication Testbed with Multi-sensor Fusion and Deep Learning
Authors:
Ethan Zhu,
Haijian Sun,
Mingyue Ji
Abstract:
Connected and automated vehicles (CAVs) have become a transformative technology that can change our daily life. Currently, millimeter-wave (mmWave) bands are identified as the promising CAV connectivity solution. While it can provide high data rate, their realization faces many challenges such as high attenuation during mmWave signal propagation and mobility management. Existing solution has to in…
▽ More
Connected and automated vehicles (CAVs) have become a transformative technology that can change our daily life. Currently, millimeter-wave (mmWave) bands are identified as the promising CAV connectivity solution. While it can provide high data rate, their realization faces many challenges such as high attenuation during mmWave signal propagation and mobility management. Existing solution has to initiate pilot signal to measure channel information, then apply signal processing to calculate the best narrow beam towards the receiver end to guarantee sufficient signal power. This process takes significant overhead and time, hence not suitable for vehicles. In this study, we propose an autonomous and low-cost testbed to collect extensive co-located mmWave signal and other sensors data such as LiDAR (Light Detection and Ranging), cameras, ultrasonic, etc, traditionally for ``automated'', to facilitate mmWave vehicular communications. Intuitively, these sensors can build a 3D map around the vehicle and signal propagation path can be estimated, eliminating iterative the process via pilot signals. This multimodal data fusion, together with AI, is expected to bring significant advances in ``connected'' research.
△ Less
Submitted 4 January, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
Physics-informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning: Fundamentals, Methodologies, and Challenges
Authors:
Ethan Zhu,
Haijian Sun,
Mingyue Ji
Abstract:
Channel modeling is fundamental in advancing wireless systems and has thus attracted considerable research focus. Recent trends have seen a growing reliance on data-driven techniques to facilitate the modeling process and yield accurate channel predictions. In this work, we first provide a concise overview of data-driven channel modeling methods, highlighting their limitations. Subsequently, we in…
▽ More
Channel modeling is fundamental in advancing wireless systems and has thus attracted considerable research focus. Recent trends have seen a growing reliance on data-driven techniques to facilitate the modeling process and yield accurate channel predictions. In this work, we first provide a concise overview of data-driven channel modeling methods, highlighting their limitations. Subsequently, we introduce the concept and advantages of physics-informed neural network (PINN)-based modeling and a summary of recent contributions in this area. Our findings demonstrate that PINN-based approaches in channel modeling exhibit promising attributes such as generalizability, interpretability, and robustness. We offer a comprehensive architecture for PINN methodology, designed to inform and inspire future model development. A case-study of our recent work on precise indoor channel prediction with semantic segmentation and deep learning is presented. The study concludes by addressing the challenges faced and suggesting potential research directions in this field.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
On the Injectivity of Euler Integral Transforms with Hyperplanes and Quadric Hypersurfaces
Authors:
Mattie Ji
Abstract:
The Euler characteristic transform (ECT) is an integral transform used widely in topological data analysis. Previous efforts by Curry et al. and Ghrist et al. have independently shown that the ECT is injective on all compact definable sets. In this work, we first study the injectivity of the ECT on definable sets that are not necessarily compact and prove a complete classification of constructible…
▽ More
The Euler characteristic transform (ECT) is an integral transform used widely in topological data analysis. Previous efforts by Curry et al. and Ghrist et al. have independently shown that the ECT is injective on all compact definable sets. In this work, we first study the injectivity of the ECT on definable sets that are not necessarily compact and prove a complete classification of constructible functions that the Euler characteristic transform is not injective on. We then introduce the quadric Euler characteristic transform (QECT) as a natural generalization of the ECT by detecting definable shapes with quadric hypersurfaces rather than hyperplanes. We also discuss some criteria for the injectivity of QECT.
△ Less
Submitted 21 May, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
Connectivity keeping spiders in k-connected bipartite graphs
Authors:
Meng Ji
Abstract:
Luo, Tian and Wu [Discrete Math. 345 (4) (2022) 112788] conjectured that for any tree $T$ with bipartition $(X,Y)$, every $k$-connected bipartite graph $G$ with minimum degree at least $k+w$, where $w=\max\{|X|,|Y|\}$, contains a tree $T'\cong T$ such that $κ(G-V(T'))\geq k$. In the paper, we confirm the conjecture for the spider by a new method, where a spider is a tree with at most one vertex of…
▽ More
Luo, Tian and Wu [Discrete Math. 345 (4) (2022) 112788] conjectured that for any tree $T$ with bipartition $(X,Y)$, every $k$-connected bipartite graph $G$ with minimum degree at least $k+w$, where $w=\max\{|X|,|Y|\}$, contains a tree $T'\cong T$ such that $κ(G-V(T'))\geq k$. In the paper, we confirm the conjecture for the spider by a new method, where a spider is a tree with at most one vertex of degree at least three.
△ Less
Submitted 22 December, 2023; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Long radial coherence of electron temperature fluctuations in non-local transport in HL-2A plasmas
Authors:
Zhongbing Shi,
Kairui Fang,
Jingchun Li,
Xiaolan Zou,
Zhaoyang Lu,
Jie Wen,
Zhanhui Wang,
Xuantong Ding,
Wei Chen,
Zengchen Yang,
Min Jiang Xiaoquan Ji,
Ruihai Tong,
Yonggao Li,
Peiwang Shi,
Wulyv Zhong,
Min Xu
Abstract:
The dynamics of long-wavelength ($k_θ<1.4 \mathrm{\ cm^{-1}}$), broadband (20-200 kHz) electron temperature fluctuations ($\tilde T_e/T_e$) of plasmas in gas-puff experiments were observed for the first time in HL-2A tokamak. In a relative low density ($n_e(0) \simeq 0.91 \sim 1.20 \times10^{19}/m^3$) scenario, after gas-puffing the core temperature increases and the edge temperature drops. On the…
▽ More
The dynamics of long-wavelength ($k_θ<1.4 \mathrm{\ cm^{-1}}$), broadband (20-200 kHz) electron temperature fluctuations ($\tilde T_e/T_e$) of plasmas in gas-puff experiments were observed for the first time in HL-2A tokamak. In a relative low density ($n_e(0) \simeq 0.91 \sim 1.20 \times10^{19}/m^3$) scenario, after gas-puffing the core temperature increases and the edge temperature drops. On the contrary, temperature fluctuation drops at the core and increases at the edge. Analyses show the non-local emergence is accompanied with a long radial coherent length of turbulent fluctuations. While in a higher density ($n_e(0) \simeq 1.83 \sim 2.02 \times10^{19}/m^3$) scenario, the phenomena were not observed. Furthermore, compelling evidence indicates that $\textbf{E} \times \textbf{B}$ shear serves as a substantial contributor to this extensive radial interaction. This finding offers a direct explanatory link to the intriguing core-heating phenomenon witnessed within the realm of non-local transport.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
The Capacity Region of Information Theoretic Secure Aggregation with Uncoded Groupwise Keys
Authors:
Kai Wan,
Hua Sun,
Mingyue Ji,
Tiebin Mi,
Giuseppe Caire
Abstract:
This paper considers the secure aggregation problem for federated learning under an information theoretic cryptographic formulation, where distributed training nodes (referred to as users) train models based on their own local data and a curious-but-honest server aggregates the trained models without retrieving other information about users' local data. Secure aggregation generally contains two ph…
▽ More
This paper considers the secure aggregation problem for federated learning under an information theoretic cryptographic formulation, where distributed training nodes (referred to as users) train models based on their own local data and a curious-but-honest server aggregates the trained models without retrieving other information about users' local data. Secure aggregation generally contains two phases, namely key sharing phase and model aggregation phase. Due to the common effect of user dropouts in federated learning, the model aggregation phase should contain two rounds, where in the first round the users transmit masked models and, in the second round, according to the identity of surviving users after the first round, these surviving users transmit some further messages to help the server decrypt the sum of users' trained models. The objective of the considered information theoretic formulation is to characterize the capacity region of the communication rates in the two rounds from the users to the server in the model aggregation phase, assuming that key sharing has already been performed offline in prior. In this context, Zhao and Sun completely characterized the capacity region under the assumption that the keys can be arbitrary random variables. More recently, an additional constraint, known as "uncoded groupwise keys," has been introduced. This constraint entails the presence of multiple independent keys within the system, with each key being shared by precisely S users. The capacity region for the information-theoretic secure aggregation problem with uncoded groupwise keys was established in our recent work subject to the condition S > K - U, where K is the number of total users and U is the designed minimum number of surviving users. In this paper we fully characterize of the the capacity region for this problem by proposing a new converse bound and an achievable scheme.
△ Less
Submitted 12 November, 2023; v1 submitted 15 October, 2023;
originally announced October 2023.
-
Adaptive Denoising-Enhanced LiDAR Odometry for Degeneration Resilience in Diverse Terrains
Authors:
Mazeyu Ji,
Wenbo Shi,
Yujie Cui,
Chengju Liu,
Qijun Chen
Abstract:
The flexibility of Simultaneous Localization and Mapping (SLAM) algorithms in various environments has consistently been a significant challenge. To address the issue of LiDAR odometry drift in high-noise settings, integrating clustering methods to filter out unstable features has become an effective module of SLAM frameworks. However, reducing the amount of point cloud data can lead to potential…
▽ More
The flexibility of Simultaneous Localization and Mapping (SLAM) algorithms in various environments has consistently been a significant challenge. To address the issue of LiDAR odometry drift in high-noise settings, integrating clustering methods to filter out unstable features has become an effective module of SLAM frameworks. However, reducing the amount of point cloud data can lead to potential loss of information and possible degeneration. As a result, this research proposes a LiDAR odometry that can dynamically assess the point cloud's reliability. The algorithm aims to improve adaptability in diverse settings by selecting important feature points with sensitivity to the level of environmental degeneration. Firstly, a fast adaptive Euclidean clustering algorithm based on range image is proposed, which, combined with depth clustering, extracts the primary structural points of the environment defined as ambient skeleton points. Then, the environmental degeneration level is computed through the dense normal features of the skeleton points, and the point cloud cleaning is dynamically adjusted accordingly. The algorithm is validated on the KITTI benchmark and real environments, demonstrating higher accuracy and robustness in different environments.
△ Less
Submitted 6 February, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Efficient Core-selecting Incentive Mechanism for Data Sharing in Federated Learning
Authors:
Mengda Ji,
Genjiu Xu,
Jianjun Ge,
Mingqiang Li
Abstract:
Federated learning is a distributed machine learning system that uses participants' data to train an improved global model. In federated learning, participants cooperatively train a global model, and they will receive the global model and payments. Rational participants try to maximize their individual utility, and they will not input their high-quality data truthfully unless they are provided wit…
▽ More
Federated learning is a distributed machine learning system that uses participants' data to train an improved global model. In federated learning, participants cooperatively train a global model, and they will receive the global model and payments. Rational participants try to maximize their individual utility, and they will not input their high-quality data truthfully unless they are provided with satisfactory payments based on their data quality. Furthermore, federated learning benefits from the cooperative contributions of participants. Accordingly, how to establish an incentive mechanism that both incentivizes inputting data truthfully and promotes stable cooperation has become an important issue to consider. In this paper, we introduce a data sharing game model for federated learning and employ game-theoretic approaches to design a core-selecting incentive mechanism by utilizing a popular concept in cooperative games, the core. In federated learning, the core can be empty, resulting in the core-selecting mechanism becoming infeasible. To address this, our core-selecting mechanism employs a relaxation method and simultaneously minimizes the benefits of inputting false data for all participants. However, this mechanism is computationally expensive because it requires aggregating exponential models for all possible coalitions, which is infeasible in federated learning. To address this, we propose an efficient core-selecting mechanism based on sampling approximation that only aggregates models on sampled coalitions to approximate the exact result. Extensive experiments verify that the efficient core-selecting mechanism can incentivize inputting high-quality data and stable cooperation, while it reduces computational overhead compared to the core-selecting mechanism.
△ Less
Submitted 26 September, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Euler Characteristics and Homotopy Types of Definable Sublevel Sets, with Applications to Topological Data Analysis
Authors:
Mattie Ji,
Kun Meng,
Kexin Ding
Abstract:
Given a definable function $f: S \to \mathbb{R}$ on a definable set $S$, we study sublevel sets of the form $S^f_t = \{x \in S: f(x) \leq t\}$ for all $t \in \mathbb{R}$. Using o-minimal structures, we prove that the Euler characteristic of $S^f_t$ is right continuous with respect to $t$. Furthermore, when $S$ is compact, we show that $S^f_{t+δ}$ deformation retracts to $S^f_t$ for all sufficientl…
▽ More
Given a definable function $f: S \to \mathbb{R}$ on a definable set $S$, we study sublevel sets of the form $S^f_t = \{x \in S: f(x) \leq t\}$ for all $t \in \mathbb{R}$. Using o-minimal structures, we prove that the Euler characteristic of $S^f_t$ is right continuous with respect to $t$. Furthermore, when $S$ is compact, we show that $S^f_{t+δ}$ deformation retracts to $S^f_t$ for all sufficiently small $δ> 0$. Applying these results, we also characterize the connections between the following concepts in topological data analysis: the Euler characteristic transform (ECT), smooth ECT, Euler-Radon transform (ERT), and smooth ERT.
△ Less
Submitted 4 November, 2023; v1 submitted 6 September, 2023;
originally announced September 2023.
-
Statistical Inference on Grayscale Images via the Euler-Radon Transform
Authors:
Kun Meng,
Mattie Ji,
Jinyu Wang,
Kexin Ding,
Henry Kirveslahti,
Ani Eloyan,
Lorin Crawford
Abstract:
Tools from topological data analysis have been widely used to represent binary images in many scientific applications. Methods that aim to represent grayscale images (i.e., where pixel intensities instead take on continuous values) have been relatively underdeveloped. In this paper, we introduce the Euler-Radon transform, which generalizes the Euler characteristic transform to grayscale images by…
▽ More
Tools from topological data analysis have been widely used to represent binary images in many scientific applications. Methods that aim to represent grayscale images (i.e., where pixel intensities instead take on continuous values) have been relatively underdeveloped. In this paper, we introduce the Euler-Radon transform, which generalizes the Euler characteristic transform to grayscale images by using o-minimal structures and Euler integration over definable functions. Coupling the Karhunen-Loeve expansion with our proposed topological representation, we offer hypothesis-testing algorithms based on the chi-squared distribution for detecting significant differences between two groups of grayscale images. We illustrate our framework via extensive numerical experiments and simulations.
△ Less
Submitted 27 August, 2023;
originally announced August 2023.
-
Explicit equations of the fake projective plane $(a=7,p=2,\emptyset,D_3 X_7)$
Authors:
Lev Borisov,
Mattie Ji,
Yanxin Li
Abstract:
We find explicit equations of the fake projective plane $(a=7,p=2,\emptyset,D_3 X_7)$, which lies in the same class as the fake projective plane $(a=7,p=2,\emptyset,D_3 2_7)$ with $21$ automorphisms whose equations were previously found by Borisov and Keum. The method involves finding a birational model of a common Galois cover of these two surfaces.
We find explicit equations of the fake projective plane $(a=7,p=2,\emptyset,D_3 X_7)$, which lies in the same class as the fake projective plane $(a=7,p=2,\emptyset,D_3 2_7)$ with $21$ automorphisms whose equations were previously found by Borisov and Keum. The method involves finding a birational model of a common Galois cover of these two surfaces.
△ Less
Submitted 27 August, 2023;
originally announced August 2023.
-
On the Geometry of a Fake Projective Plane with $21$ Automorphisms
Authors:
Lev Borisov,
Mattie Ji,
Yanxin Li,
Sargam Mondal
Abstract:
A fake projective plane is a complex surface with the same Betti numbers as $\mathbb{C} P^2$ but not biholomorphic to it. We study the fake projective plane $\mathbb{P}_{\operatorname{fake}}^2 = (a = 7, p = 2, \emptyset, D_3 2_7)$ in the Cartwright-Steger classification. In this paper, we exploit the large symmetries given by…
▽ More
A fake projective plane is a complex surface with the same Betti numbers as $\mathbb{C} P^2$ but not biholomorphic to it. We study the fake projective plane $\mathbb{P}_{\operatorname{fake}}^2 = (a = 7, p = 2, \emptyset, D_3 2_7)$ in the Cartwright-Steger classification. In this paper, we exploit the large symmetries given by $\operatorname{Aut}(\mathbb{P}_{\operatorname{fake}}^2) = C_7 \rtimes C_3$ to construct an embedding of this surface into $\mathbb{C} P^5$ as a system of $56$ sextics with coefficients in $\mathbb{Q}(\sqrt{-7})$. For each torsion line bundle $T \in \operatorname{Pic}(\mathbb{P}_{\operatorname{fake}}^2)$, we also compute and study the linear systems $|nH + T|$ with small $n$, where $H$ is an ample generator of the Néron-Severi group.
△ Less
Submitted 16 September, 2024; v1 submitted 20 August, 2023;
originally announced August 2023.
-
Digital Twin Brain: a simulation and assimilation platform for whole human brain
Authors:
Wenlian Lu,
Longbin Zeng,
Xin Du,
Wenyong Zhang,
Shitong Xiang,
Huarui Wang,
Jiexiang Wang,
Mingda Ji,
Yubo Hou,
Minglong Wang,
Yuhao Liu,
Zhongyu Chen,
Qibao Zheng,
Ningsheng Xu,
Jianfeng Feng
Abstract:
In this work, we present a computing platform named digital twin brain (DTB) that can simulate spiking neuronal networks of the whole human brain scale and more importantly, a personalized biological brain structure. In comparison to most brain simulations with a homogeneous global structure, we highlight that the sparseness, couplingness and heterogeneity in the sMRI, DTI and PET data of the brai…
▽ More
In this work, we present a computing platform named digital twin brain (DTB) that can simulate spiking neuronal networks of the whole human brain scale and more importantly, a personalized biological brain structure. In comparison to most brain simulations with a homogeneous global structure, we highlight that the sparseness, couplingness and heterogeneity in the sMRI, DTI and PET data of the brain has an essential impact on the efficiency of brain simulation, which is proved from the scaling experiments that the DTB of human brain simulation is communication-intensive and memory-access intensive computing systems rather than computation-intensive. We utilize a number of optimization techniques to balance and integrate the computation loads and communication traffics from the heterogeneous biological structure to the general GPU-based HPC and achieve leading simulation performance for the whole human brain-scaled spiking neuronal networks. On the other hand, the biological structure, equipped with a mesoscopic data assimilation, enables the DTB to investigate brain cognitive function by a reverse-engineering method, which is demonstrated by a digital experiment of visual evaluation on the DTB. Furthermore, we believe that the developing DTB will be a promising powerful platform for a large of research orients including brain-inspiredintelligence, rain disease medicine and brain-machine interface.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Contextuality, Coherences, and Quantum Cheshire Cats
Authors:
Jonte R. Hance,
Ming Ji,
Holger F. Hofmann
Abstract:
We analyse the quantum Cheshire cat using contextuality theory, to see if this can tell us anything about how best to interpret this paradox. We show that this scenario can be analysed using the relation between three different measurements, which seem to result in a logical contradiction. We discuss how this contextual behaviour links to weak values, and coherences between prohibited states. Rath…
▽ More
We analyse the quantum Cheshire cat using contextuality theory, to see if this can tell us anything about how best to interpret this paradox. We show that this scenario can be analysed using the relation between three different measurements, which seem to result in a logical contradiction. We discuss how this contextual behaviour links to weak values, and coherences between prohibited states. Rather than showing a property of the particle is disembodied, the quantum Cheshire cat instead demonstrates the effects of these coherences, which are typically found in pre- and postselected systems.
△ Less
Submitted 10 November, 2023; v1 submitted 13 July, 2023;
originally announced July 2023.
-
Complete bipartite graphs without small rainbow stars
Authors:
Weizhen Chen,
Meng Ji,
Yaping Mao,
Meiqin Wei
Abstract:
The $k$-edge-colored bipartite Gallai-Ramsey number $\operatorname{bgr}_k(G:H)$ is defined as the minimum integer $n$ such that $n^2\geq k$ and for every $N\geq n$, every edge-coloring (using all $k$ colors) of complete bipartite graph $K_{N,N}$ contains a rainbow copy of $G$ or a monochromatic copy of $H$. In this paper, we first study the structural theorem on the complete bipartite graph…
▽ More
The $k$-edge-colored bipartite Gallai-Ramsey number $\operatorname{bgr}_k(G:H)$ is defined as the minimum integer $n$ such that $n^2\geq k$ and for every $N\geq n$, every edge-coloring (using all $k$ colors) of complete bipartite graph $K_{N,N}$ contains a rainbow copy of $G$ or a monochromatic copy of $H$. In this paper, we first study the structural theorem on the complete bipartite graph $K_{n,n}$ with no rainbow copy of $K_{1,3}$. Next, we utilize the results to prove the exact values of $\operatorname{bgr}_{k}(P_4: H)$, $\operatorname{bgr}_{k}(P_5: H)$, $\operatorname{bgr}_{k}(K_{1,3}: H)$, where $H$ is a various union of cycles and paths and stars.
△ Less
Submitted 13 December, 2023; v1 submitted 30 June, 2023;
originally announced June 2023.
-
Quantum contextuality of complementary photon polarizations explored by adaptive input state control
Authors:
Kengo Matsuyama,
Ming Ji,
Holger F. Hofmann,
Masataka Iinuma
Abstract:
We experimentally investigate non-local contextual relations between complementary photon polarizations by adapting the entanglement and the local polarizations of a two-photon state to satisfy three deterministic conditions demonstrating both quantum contextuality and non-locality. The key component of this adaptive input state control is the variable degree of entanglement of the photon source.…
▽ More
We experimentally investigate non-local contextual relations between complementary photon polarizations by adapting the entanglement and the local polarizations of a two-photon state to satisfy three deterministic conditions demonstrating both quantum contextuality and non-locality. The key component of this adaptive input state control is the variable degree of entanglement of the photon source. Local polarization rotations can optimize two of the three correlations, and the variation of the entanglement optimizes the third correlation. Our results demonstrate that quantum contextuality is based on a non-trivial trade-off between local complementarity and quantum correlations.
△ Less
Submitted 11 June, 2023;
originally announced June 2023.
-
A Lightweight Method for Tackling Unknown Participation Statistics in Federated Averaging
Authors:
Shiqiang Wang,
Mingyue Ji
Abstract:
In federated learning (FL), clients usually have diverse participation statistics that are unknown a priori, which can significantly harm the performance of FL if not handled properly. Existing works aiming at addressing this problem are usually based on global variance reduction, which requires a substantial amount of additional memory in a multiplicative factor equal to the total number of clien…
▽ More
In federated learning (FL), clients usually have diverse participation statistics that are unknown a priori, which can significantly harm the performance of FL if not handled properly. Existing works aiming at addressing this problem are usually based on global variance reduction, which requires a substantial amount of additional memory in a multiplicative factor equal to the total number of clients. An important open problem is to find a lightweight method for FL in the presence of clients with unknown participation rates. In this paper, we address this problem by adapting the aggregation weights in federated averaging (FedAvg) based on the participation history of each client. We first show that, with heterogeneous participation statistics, FedAvg with non-optimal aggregation weights can diverge from the optimal solution of the original FL objective, indicating the need of finding optimal aggregation weights. However, it is difficult to compute the optimal weights when the participation statistics are unknown. To address this problem, we present a new algorithm called FedAU, which improves FedAvg by adaptively weighting the client updates based on online estimates of the optimal weights without knowing the statistics of client participation. We provide a theoretical convergence analysis of FedAU using a novel methodology to connect the estimation error and convergence. Our theoretical results reveal important and interesting insights, while showing that FedAU converges to an optimal solution of the original objective and has desirable properties such as linear speedup. Our experimental results also verify the advantage of FedAU over baseline methods with various participation patterns.
△ Less
Submitted 15 April, 2024; v1 submitted 6 June, 2023;
originally announced June 2023.
-
Quantitative relations between different measurement contexts
Authors:
Ming Ji,
Holger F. Hofmann
Abstract:
In quantum theory, a measurement context is defined by an orthogonal basis in a Hilbert space, where each basis vector represents a specific measurement outcome. The precise quantitative relation between two different measurement contexts can thus be characterized by the inner products of nonorthogonal states in that Hilbert space. Here, we use measurement outcomes that are shared by different con…
▽ More
In quantum theory, a measurement context is defined by an orthogonal basis in a Hilbert space, where each basis vector represents a specific measurement outcome. The precise quantitative relation between two different measurement contexts can thus be characterized by the inner products of nonorthogonal states in that Hilbert space. Here, we use measurement outcomes that are shared by different contexts to derive specific quantitative relations between the inner products of the Hilbert space vectors that represent the different contexts. It is shown that the probabilities that describe the paradoxes of quantum contextuality can be derived from a very small number of inner products, revealing details of the fundamental relations between measurement contexts that go beyond a basic violation of noncontextual limits. The application of our analysis to a product space of two systems reveals that the nonlocality of quantum entanglement can be traced back to a local inner product representing the relation between measurement contexts in only one system. Our results thus indicate that the essential nonclassical features of quantum mechanics can be traced back to the fundamental difference between quantum superpositions and classical alternatives.
△ Less
Submitted 8 February, 2024; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Fundamental Limits of Multi-Message Private Computation
Authors:
Ali Gholami,
Kai Wan,
Tayyebeh Jahani-Nezhad,
Hua Sun,
Mingyue Ji,
Giuseppe Caire
Abstract:
In a typical formulation of the private information retrieval (PIR) problem, a single user wishes to retrieve one out of $ K$ files from $N$ servers without revealing the demanded file index to any server. This paper formulates an extended model of PIR, referred to as multi-message private computation (MM-PC), where instead of retrieving a single file, the user wishes to retrieve $P>1$ linear comb…
▽ More
In a typical formulation of the private information retrieval (PIR) problem, a single user wishes to retrieve one out of $ K$ files from $N$ servers without revealing the demanded file index to any server. This paper formulates an extended model of PIR, referred to as multi-message private computation (MM-PC), where instead of retrieving a single file, the user wishes to retrieve $P>1$ linear combinations of files while preserving the privacy of the demand information. The MM-PC problem is a generalization of the private computation (PC) problem (where the user requests one linear combination of the files), and the multi-message private information retrieval (MM-PIR) problem (where the user requests $P>1$ files). A baseline achievable scheme repeats the optimal PC scheme by Sun and Jafar $P$ times, or treats each possible demanded linear combination as an independent file and then uses the near optimal MM-PIR scheme by Banawan and Ulukus. In this paper, we propose a new MM-PC scheme that significantly improves upon the baseline schemes. In doing so, we design the queries inspired by the structure in the cache-aided scalar linear function retrieval scheme by Wan {\it et al.}, which leverages the dependency between linear functions to reduce the amount of communications. To ensure the decodability of our scheme, we propose a new method to benefit from the existing dependency, referred to as the sign assignment step. In the end, we use Maximum Distance Separable matrices to code the queries, which allows the reduction of download from the servers, while preserving privacy. By the proposed schemes, we characterize the capacity within a multiplicative factor of $2$.
△ Less
Submitted 23 August, 2024; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Fundamental Limits of Distributed Linearly Separable Computation under Cyclic Assignment
Authors:
Wenbo Huang,
Kai Wan,
Hua Sun,
Mingyue Ji,
Robert Caiming Qiu,
Giuseppe Caire
Abstract:
This paper studies the master-worker distributed linearly separable computation problem, where the considered computation task, referred to as linearly separable function, is a typical linear transform model widely used in cooperative distributed gradient coding, real-time rendering, linear transformers, etc. %A master asks $\Nsf$ distributed workers to compute a linearly separable function from…
▽ More
This paper studies the master-worker distributed linearly separable computation problem, where the considered computation task, referred to as linearly separable function, is a typical linear transform model widely used in cooperative distributed gradient coding, real-time rendering, linear transformers, etc. %A master asks $\Nsf$ distributed workers to compute a linearly separable function from $\Ksf$ datasets. The computation task on $\Ksf$ datasets can be expressed as $\Ksf_{\rm c}$ linear combinations of $\Ksf$ messages, where each message is the output of an individual function on one dataset. Straggler effect is also considered, such that from the answers of any $\Nsf_{\rm r}$ of the $\Nsf$ distributed workers, the master should accomplish the task. The computation cost is defined as the number of datasets assigned to each worker, while the communication cost is defined as the number of (coded) messages that should be received. The objective is to characterize the optimal tradeoff between the computation and communication costs. The problem has remained so far open, even under the cyclic data assignment.Since in fact various distributed computing schemes were proposed in the literature under the cyclic data assignment, with this paper we close the problem for the cyclic assignment. This paper proposes a new computing scheme with the cyclic assignment based on the concept of interference alignment, by treating each message which cannot be computed by a worker as an interference from this worker. Under the cyclic assignment, the proposed computing scheme is then proved to be optimal when $\Nsf=\Ksf$ and be order optimal within a factor of $2$ otherwise.
△ Less
Submitted 19 February, 2024; v1 submitted 8 May, 2023;
originally announced May 2023.
-
Federated Learning with Flexible Control
Authors:
Shiqiang Wang,
Jake Perazzone,
Mingyue Ji,
Kevin S. Chan
Abstract:
Federated learning (FL) enables distributed model training from local data collected by users. In distributed systems with constrained resources and potentially high dynamics, e.g., mobile edge networks, the efficiency of FL is an important problem. Existing works have separately considered different configurations to make FL more efficient, such as infrequent transmission of model updates, client…
▽ More
Federated learning (FL) enables distributed model training from local data collected by users. In distributed systems with constrained resources and potentially high dynamics, e.g., mobile edge networks, the efficiency of FL is an important problem. Existing works have separately considered different configurations to make FL more efficient, such as infrequent transmission of model updates, client subsampling, and compression of update vectors. However, an important open problem is how to jointly apply and tune these control knobs in a single FL algorithm, to achieve the best performance by allowing a high degree of freedom in control decisions. In this paper, we address this problem and propose FlexFL - an FL algorithm with multiple options that can be adjusted flexibly. Our FlexFL algorithm allows both arbitrary rates of local computation at clients and arbitrary amounts of communication between clients and the server, making both the computation and communication resource consumption adjustable. We prove a convergence upper bound of this algorithm. Based on this result, we further propose a stochastic optimization formulation and algorithm to determine the control decisions that (approximately) minimize the convergence bound, while conforming to constraints related to resource consumption. The advantage of our approach is also verified using experiments.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Connectivity keeping spiders in k-connected graphs
Authors:
Zhong Huang,
Meng Ji
Abstract:
Fujita and Kawarabayashi [J. Combin. Theory, Ser. B 98 (2008), 805--811] conjectured that for all positive integers $k$, $m$, there is a (least) non-negative integer $f_{k}(m)$ such that every $k$-connected graph $G$ with $δ(G)\geq \lfloor\frac{3k}{2}\rfloor+ f_{k}(m)-1$ contains a connected subgraph $W$ of order $m$ such that $G-V(W)$ is still $k$-connected. Mader confirmed Fujita-Kawarabayashi's…
▽ More
Fujita and Kawarabayashi [J. Combin. Theory, Ser. B 98 (2008), 805--811] conjectured that for all positive integers $k$, $m$, there is a (least) non-negative integer $f_{k}(m)$ such that every $k$-connected graph $G$ with $δ(G)\geq \lfloor\frac{3k}{2}\rfloor+ f_{k}(m)-1$ contains a connected subgraph $W$ of order $m$ such that $G-V(W)$ is still $k$-connected. Mader confirmed Fujita-Kawarabayashi's conjecture by proving $f_{k}(m)=m$ and $W$ is a path. In this paper, the authors will confirm Fujita-Kawarabayashi's conjecture again by proving $f_{k}(m)=m$ and $W$ is a spider by a new method, where a spider is a tree with at most one vertex of degree at least three. Meanwhile, this result will verify a conjecture proposed by Mader [J. Graph Theory 65 (2010), 61--69] for the case of the spider.
△ Less
Submitted 13 December, 2023; v1 submitted 8 December, 2022;
originally announced December 2022.
-
Characterization of the non-classical relation between measurement outcomes represented by non-orthogonal quantum states
Authors:
Ming Ji,
Holger F. Hofmann
Abstract:
Quantum mechanics describes seemingly paradoxical relations between the outcomes of measurements that cannot be performed jointly. In Hilbert space, the outcomes of such incompatible measurements are represented by non-orthogonal states. In this paper, we investigate how the relation between outcomes represented by non-orthogonal quantum states differs from the relations suggested by a joint assig…
▽ More
Quantum mechanics describes seemingly paradoxical relations between the outcomes of measurements that cannot be performed jointly. In Hilbert space, the outcomes of such incompatible measurements are represented by non-orthogonal states. In this paper, we investigate how the relation between outcomes represented by non-orthogonal quantum states differs from the relations suggested by a joint assignment of measurement outcomes that do not depend on the actual measurement context. The analysis is based on a well-known scenario where three statements about the impossibilities of certain outcomes would seem to make a specific fourth outcome impossible as well, yet quantum theory allows the observation of that outcome with a non-vanishing probability. We show that the Hilbert space formalism modifies the relation between the four measurement outcomes by defining a lower bound of the fourth probability that increases as the total probability of the first three outcomes drops to zero. Quantum theory thus makes the violation of non-contextual consistency between the measurement outcomes not only possible, but actually requires it as a necessary consequence of the Hilbert space inner products that describe the contextual relation between the outcomes of different measurements.
△ Less
Submitted 21 December, 2022; v1 submitted 3 November, 2022;
originally announced November 2022.
-
Testing general relativity with TianQin: the prospect of using the inspiral signals of black hole binaries
Authors:
Changfu Shi,
Mujie Ji,
Jian-dong Zhang,
Jianwei Mei
Abstract:
In this paper, we carry out a systematic study of the prospect of testing general relativity with the inspiral signals of black hole binaries that could be detected with TianQin. The study is based on the parameterized post-Einsteinian (ppE) waveform, so that many modified gravity theories can be covered simultaneously. We consider black hole binaries with total masses ranging from…
▽ More
In this paper, we carry out a systematic study of the prospect of testing general relativity with the inspiral signals of black hole binaries that could be detected with TianQin. The study is based on the parameterized post-Einsteinian (ppE) waveform, so that many modified gravity theories can be covered simultaneously. We consider black hole binaries with total masses ranging from $10\rm M_\odot\sim10^7 M_\odot$ and ppE corrections at post-Newtonian (PN) orders ranging from $-4$PN to $2$PN. Compared to the current ground-based detectors, TianQin can improve the constraints on the ppE phase parameter $β$ by orders of magnitude. For example, the improvement at the $-4$PN and $2$PN orders can be about $13$ and $3$ orders of magnitude (compared to the results from GW150914), respectively. Compared to future ground-based detectors, such as ET, TianQin is expected to be superior below the $-1$PN order, and for corrections above the $-0.5$PN order, TianQin is still competitive near the large mass end of the low mass range $[10 \rm M_\odot, \,10^3 \rm M_\odot]\,$. Compared to the future space-based detector LISA, TianQin can be competitive in the lower mass end as the PN order is increased. For example, at the $-4$PN order, LISA is always superior for sources more massive than about $30\rm M_\odot\,$, while at the $2$PN order, TianQin becomes competitive for sources less massive than about $10^4\rm M_\odot$. We also study the scientific potentials of detector networks involving TianQin, LISA and ET, and discuss the constraints on specific theories such as the dynamic Chern-Simons theory and the Einstein-dilaton Gauss-Bonnet theory.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Demand Layering for Real-Time DNN Inference with Minimized Memory Usage
Authors:
Mingoo Ji,
Saehanseul Yi,
Changjin Koo,
Sol Ahn,
Dongjoo Seo,
Nikil Dutt,
Jong-Chan Kim
Abstract:
When executing a deep neural network (DNN), its model parameters are loaded into GPU memory before execution, incurring a significant GPU memory burden. There are studies that reduce GPU memory usage by exploiting CPU memory as a swap device. However, this approach is not applicable in most embedded systems with integrated GPUs where CPU and GPU share a common memory. In this regard, we present De…
▽ More
When executing a deep neural network (DNN), its model parameters are loaded into GPU memory before execution, incurring a significant GPU memory burden. There are studies that reduce GPU memory usage by exploiting CPU memory as a swap device. However, this approach is not applicable in most embedded systems with integrated GPUs where CPU and GPU share a common memory. In this regard, we present Demand Layering, which employs a fast solid-state drive (SSD) as a co-running partner of a GPU and exploits the layer-by-layer execution of DNNs. In our approach, a DNN is loaded and executed in a layer-by-layer manner, minimizing the memory usage to the order of a single layer. Also, we developed a pipeline architecture that hides most additional delays caused by the interleaved parameter loadings alongside layer executions. Our implementation shows a 96.5% memory reduction with just 14.8% delay overhead on average for representative DNNs. Furthermore, by exploiting the memory-delay tradeoff, near-zero delay overhead (under 1 ms) can be achieved with a slightly increased memory usage (still an 88.4% reduction), showing the great potential of Demand Layering.
△ Less
Submitted 8 October, 2022;
originally announced October 2022.
-
Resilience of small PAHs in interstellar clouds: Efficient stabilization of cyanonaphthalene by fast radiative cooling
Authors:
Mark H. Stockett,
James N. Bull,
Henrik Cederquist,
Suvasthika Indrajith,
MingChao Ji,
José E. Navarro Navarrete,
Henning T. Schmidt,
Henning Zettergren,
Boxing Zhu
Abstract:
After decades of speculation and searching, astronomers have recently identified specific Polycyclic Aromatic Hydrocarbons (PAHs) in space. Remarkably, the observed abundance of cyanonaphthalene (CNN, C10H7CN) in the Taurus Molecular Cloud (TMC-1) is six orders of magnitude higher than expected from astrophysical modeling. Here, we report absolute unimolecular dissociation and radiative cooling ra…
▽ More
After decades of speculation and searching, astronomers have recently identified specific Polycyclic Aromatic Hydrocarbons (PAHs) in space. Remarkably, the observed abundance of cyanonaphthalene (CNN, C10H7CN) in the Taurus Molecular Cloud (TMC-1) is six orders of magnitude higher than expected from astrophysical modeling. Here, we report absolute unimolecular dissociation and radiative cooling rate coefficients of the 1-CNN isomer in its cationic form. These results are based on measurements of the time-dependent neutral product emission rate and Kinetic Energy Release distributions produced from an ensemble of internally excited 1-CNN + studied in an environment similar to that in interstellar clouds. We find that Recurrent Fluorescence - radiative relaxation via thermally populated electronic excited states - efficiently stabilizes 1-CNN+ , owing to a large enhancement of the electronic transition probability by vibronic coupling. Our results help explain the anomalous abundance of CNN in TMC-1 and challenge the widely accepted picture of rapid destruction of small PAHs in space.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
Cross-Camera Deep Colorization
Authors:
Yaping Zhao,
Haitian Zheng,
Mengqi Ji,
Ruqi Huang
Abstract:
In this paper, we consider the color-plus-mono dual-camera system and propose an end-to-end convolutional neural network to align and fuse images from it in an efficient and cost-effective way. Our method takes cross-domain and cross-scale images as input, and consequently synthesizes HR colorization results to facilitate the trade-off between spatial-temporal resolution and color depth in the sin…
▽ More
In this paper, we consider the color-plus-mono dual-camera system and propose an end-to-end convolutional neural network to align and fuse images from it in an efficient and cost-effective way. Our method takes cross-domain and cross-scale images as input, and consequently synthesizes HR colorization results to facilitate the trade-off between spatial-temporal resolution and color depth in the single-camera imaging system. In contrast to the previous colorization methods, ours can adapt to color and monochrome cameras with distinctive spatial-temporal resolutions, rendering the flexibility and robustness in practical applications. The key ingredient of our method is a cross-camera alignment module that generates multi-scale correspondences for cross-domain image alignment. Through extensive experiments on various datasets and multiple settings, we validate the flexibility and effectiveness of our approach. Remarkably, our method consistently achieves substantial improvements, i.e., around 10dB PSNR gain, upon the state-of-the-art methods. Code is at: https://github.com/IndigoPurple/CCDC
△ Less
Submitted 7 September, 2022; v1 submitted 26 August, 2022;
originally announced September 2022.
-
Rationality of real conic bundles with quartic discriminant curve
Authors:
Lena Ji,
Mattie Ji
Abstract:
We study real double covers of $\mathbb P^1\times\mathbb P^2$ branched over a $(2,2)$-divisor, which have the structure of a conic bundle threefold with smooth quartic discriminant curve via the second projection. In each isotopy class of smooth plane quartics, we construct examples where the total space of the conic bundle is rational. For five of the six isotopy classes we construct $\mathbb C$-…
▽ More
We study real double covers of $\mathbb P^1\times\mathbb P^2$ branched over a $(2,2)$-divisor, which have the structure of a conic bundle threefold with smooth quartic discriminant curve via the second projection. In each isotopy class of smooth plane quartics, we construct examples where the total space of the conic bundle is rational. For five of the six isotopy classes we construct $\mathbb C$-rational examples that have obstructions to rationality over $\mathbb R$, and for the sixth class, we show that the models we consider are all rational. Moreover, for three of the five classes with irrational members, we give characterizations of rationality using the topology of the real locus and the intermediate Jacobian torsor obstruction of Hassett--Tschinkel and Benoist--Wittenberg. The double cover models we consider were introduced and previously studied by S. Frei, S. Sankar, B. Viray, I. Vogt, and the first author.
△ Less
Submitted 20 March, 2023; v1 submitted 18 August, 2022;
originally announced August 2022.
-
Unknown-Aware Domain Adversarial Learning for Open-Set Domain Adaptation
Authors:
JoonHo Jang,
Byeonghu Na,
DongHyeok Shin,
Mingi Ji,
Kyungwoo Song,
Il-Chul Moon
Abstract:
Open-Set Domain Adaptation (OSDA) assumes that a target domain contains unknown classes, which are not discovered in a source domain. Existing domain adversarial learning methods are not suitable for OSDA because distribution matching with $\textit{unknown}$ classes leads to negative transfer. Previous OSDA methods have focused on matching the source and the target distribution by only utilizing…
▽ More
Open-Set Domain Adaptation (OSDA) assumes that a target domain contains unknown classes, which are not discovered in a source domain. Existing domain adversarial learning methods are not suitable for OSDA because distribution matching with $\textit{unknown}$ classes leads to negative transfer. Previous OSDA methods have focused on matching the source and the target distribution by only utilizing $\textit{known}$ classes. However, this $\textit{known}$-only matching may fail to learn the target-$\textit{unknown}$ feature space. Therefore, we propose Unknown-Aware Domain Adversarial Learning (UADAL), which $\textit{aligns}$ the source and the target-$\textit{known}$ distribution while simultaneously $\textit{segregating}$ the target-$\textit{unknown}$ distribution in the feature alignment procedure. We provide theoretical analyses on the optimized state of the proposed $\textit{unknown-aware}$ feature alignment, so we can guarantee both $\textit{alignment}$ and $\textit{segregation}$ theoretically. Empirically, we evaluate UADAL on the benchmark datasets, which shows that UADAL outperforms other methods with better feature alignments by reporting state-of-the-art performances.
△ Less
Submitted 24 October, 2022; v1 submitted 15 June, 2022;
originally announced June 2022.
-
A Unified Analysis of Federated Learning with Arbitrary Client Participation
Authors:
Shiqiang Wang,
Mingyue Ji
Abstract:
Federated learning (FL) faces challenges of intermittent client availability and computation/communication efficiency. As a result, only a small subset of clients can participate in FL at a given time. It is important to understand how partial client participation affects convergence, but most existing works have either considered idealized participation patterns or obtained results with non-zero…
▽ More
Federated learning (FL) faces challenges of intermittent client availability and computation/communication efficiency. As a result, only a small subset of clients can participate in FL at a given time. It is important to understand how partial client participation affects convergence, but most existing works have either considered idealized participation patterns or obtained results with non-zero optimality error for generic patterns. In this paper, we provide a unified convergence analysis for FL with arbitrary client participation. We first introduce a generalized version of federated averaging (FedAvg) that amplifies parameter updates at an interval of multiple FL rounds. Then, we present a novel analysis that captures the effect of client participation in a single term. By analyzing this term, we obtain convergence upper bounds for a wide range of participation patterns, including both non-stochastic and stochastic cases, which match either the lower bound of stochastic gradient descent (SGD) or the state-of-the-art results in specific settings. We also discuss various insights, recommendations, and experimental results.
△ Less
Submitted 26 October, 2022; v1 submitted 26 May, 2022;
originally announced May 2022.
-
Ferrimagnetism in stable non-metal covalent organic framework
Authors:
Dongge Ma,
Yuhang Qian,
Mingyang Ji,
Jiani Li,
Jundan Li,
Anan Liu,
Yaohui Zhu
Abstract:
We synthesized a pure organic non-metal crystalline covalent organic framework TAPA-BTD-COF by bottom-up Schiff base chemical reaction. And this imine-based COF is stable in aerobic condition and room-temperature. We discovered that this TAPA-BTD-COF exhibited strong magneticity in 300 K generating magnetic hysteresis loop in M-H characterization and giant chimol up to 0.028. And we further conduc…
▽ More
We synthesized a pure organic non-metal crystalline covalent organic framework TAPA-BTD-COF by bottom-up Schiff base chemical reaction. And this imine-based COF is stable in aerobic condition and room-temperature. We discovered that this TAPA-BTD-COF exhibited strong magneticity in 300 K generating magnetic hysteresis loop in M-H characterization and giant chimol up to 0.028. And we further conducted zero-field cooling and field-cooling measurement of M-T curves. The as-synthesized materials showed a large chi/mol up to 0.028 in 300 K and increasing to 0.037 in 4.0 K with 200 Oe measurement field. The TAPA-BTD-COF 1/chimol~T curve supported its ferrimagnetism, with an intrinsic delta temperature as -33.03 K by extrapolating the 1/chimol~T curve. From the continuously increasing slope of 1/chimol~T, we consider that this TAPA-BTD-COF belongs to ferrimagnetic other than antiferromagnetic materials. And the large chimol value 0.028 at 300 K and 0.037 at 4.0 K also supported this, since common antiferromagnetic materials possess chimol in the range of 10-5 to 10-3 as weak magnetics other than strong magnetic materials such as ferrimagnetics and ferromagnetics. Since this material is purely non-metal organic polymer, the possibility of d-block and f-block metal with unpaired-electron induced magnetism can be excluded. Besides, since the COF does not involve free-radical monomer in the processes of synthesis, we can also exclude the origin of free-radical induced magnetism. According to recent emerging flat-band strong correlated exotic electron property, this unconventional phenomenon may relate to n-type doping on the flat-band locating in the CBM, thus generating highly-localized electron with infinite effective mass and exhibiting strong correlation, which accounts for this non-trivial strong and stable ferrimagneticity at room-temperature and aerobic atmospheric conditions.
△ Less
Submitted 16 May, 2022;
originally announced May 2022.